<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Data on Guy Freeman</title><link>https://gfrm.in/categories/data/</link><description>Recent content in Data on Guy Freeman</description><generator>Hugo</generator><language>en</language><lastBuildDate>Wed, 22 Apr 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://gfrm.in/categories/data/index.xml" rel="self" type="application/rss+xml"/><item><title>A Content-Addressed Foundation for Personal Knowledge</title><link>https://gfrm.in/posts/pkm-phase-1/</link><pubDate>Wed, 22 Apr 2026 00:00:00 +0000</pubDate><guid>https://gfrm.in/posts/pkm-phase-1/</guid><description>&lt;p&gt;I have about eleven million words of personal documents. Contracts, invoices, court filings, medical notes, research papers, travel itineraries, conversation transcripts, CVs of various vintages, takeaway menus from restaurants that have since closed. A decade of Syncthing directories and Dropbox archives and email attachments saved twice because I wasn&amp;rsquo;t sure which copy was authoritative.&lt;/p&gt;
&lt;p&gt;I would like to ask questions about this corpus. Not vague questions — specific ones:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;em&gt;When did I last see my doctor?&lt;/em&gt;&lt;/li&gt;
&lt;li&gt;&lt;em&gt;What did Velotix pay me in September 2024?&lt;/em&gt;&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Which of my subscriptions auto-renew in the next sixty days?&lt;/em&gt;&lt;/li&gt;
&lt;li&gt;&lt;em&gt;What&amp;rsquo;s the warranty status on the water heater?&lt;/em&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Each of these fails on a different existing tool, for a different reason.&lt;/p&gt;</description></item><item><title>On Owning Your Data</title><link>https://gfrm.in/posts/on-owning-your-data/</link><pubDate>Wed, 31 Dec 2025 00:00:00 +0000</pubDate><guid>https://gfrm.in/posts/on-owning-your-data/</guid><description>&lt;p&gt;I step on a bathroom scale and my body fat percentage gets beamed to a server in Shenzhen. This is the arrangement. In exchange for this intimacy, the app suggests I upgrade to premium, which I find touching in a way the developers probably didn&amp;rsquo;t intend.&lt;/p&gt;
&lt;p&gt;I don&amp;rsquo;t have a problem with companies making money — I have a problem with the default. The default is: your body&amp;rsquo;s measurements belong to someone else&amp;rsquo;s database, and you&amp;rsquo;re welcome to look at them through their app, on their terms, until they pivot to a different business model or get acqui-hired and shut down the API. The data isn&amp;rsquo;t especially sensitive on its own. Nobody is going to blackmail me with my impedance readings. But if I can&amp;rsquo;t own the numbers that describe my own physical form, what exactly can I own? It&amp;rsquo;s a question worth asking, even if the answer turns out to be &amp;ldquo;not much, but you should try anyway.&amp;rdquo;&lt;/p&gt;</description></item><item><title>Scalibur: Reading Body Composition from a Cheap Bluetooth Scale</title><link>https://gfrm.in/posts/scalibur/</link><pubDate>Mon, 22 Dec 2025 00:00:00 +0000</pubDate><guid>https://gfrm.in/posts/scalibur/</guid><description>&lt;p&gt;Body composition scales are, on their own terms, genuinely useful devices. Step on, wait a few seconds, receive a small dossier on your own physical form: weight, body fat percentage, muscle mass, and several other numbers whose accuracy I&amp;rsquo;m diplomatically not questioning here. The problem is what happens next. Your data vanishes into whichever proprietary app the manufacturer saw fit to build, typically bundled with aggressive upsells and privacy practices that would make a data broker wince.&lt;/p&gt;</description></item><item><title>Hanukkah of Data 5783</title><link>https://gfrm.in/posts/hanukkah-of-code/</link><pubDate>Sun, 25 Dec 2022 00:00:00 +0000</pubDate><guid>https://gfrm.in/posts/hanukkah-of-code/</guid><description>&lt;p&gt;The madpersons behind the retro-chic data analysis tool &lt;a href="https://visidata.org"&gt;VisiData&lt;/a&gt; have created an alternative to the infamous &amp;ldquo;Advent of Code&amp;rdquo;. Instead of programming to solve yet more optimisation puzzles, and instead of the hackneyed Christmas theme, the &lt;a href="https://hanukkah.bluebird.sh/5783/"&gt;Hanukkah of Data&lt;/a&gt; requires data analysis to solve a &lt;del&gt;murder mystery&lt;/del&gt; rug-finding mission.&lt;/p&gt;
&lt;p&gt;I only just found out about it, so although one puzzle was released for each day of real-life Chanukah, I was able to gorge on the first 7 puzzles, needing only to wait for the last one. Each puzzle is represented by an ASCII candle in an ASCII world &amp;mdash; quite an impressive achievement in its own right.&lt;/p&gt;</description></item></channel></rss>