My Exobrain Software (forays into cyborgism)

Ruby

In which I detail the software I am trying to make part of my own mind.

Part 1: Theory, goals & design motivations.

Part 2: Display of the actual software

Part 1: The goals

People focus on how LLMs perform "macro" automation of cognitive tasks for humans: they write code, do research, generate art, write essays, and so on. Those are a big deal, but I think there's potential for a different kind of big deal: the automation and augmentation of micro cognition motions like memory (storage and recall), attention management, and task prioritization; as well as the creation of feedback loops and scaffolding for humans that can train your flesh-brain cognition in different directions.

In my quest for ultimate power, it's obvious that I should upgrade my own mind with external prosthetics. With LLMs, this is a difference in degree, not kind: note-taking systems, personal wikis, journals, and even to-do lists are "exobrains" that people use already. ("Exo" meaning outer – the brain outside your brain.) Because LLMs have so many aspects of intelligence, the potential to automate cognition is so much greater.

Specific near-term goals of my exobrain

I elaborated on this a couple of days ago, but a quick synopsis is in order. Things I want from my Exobrain:

Help me answer the question of what should I be doing right now?

In the early stage, it does this by storing for me the complete set of things I might consider doing, e.g. my to-do list, a list of all my project and hobbies, my reading lists, etc. This means when I'm looking to decide what to do next, I can skip the "remember everything I have to do" (which will fail to recall 90% of options) and focus on prioritization.

The options then need to be presented in an appropriate form to be useful.

In a subsequent stage of development, it will make recommendations for what to do. Early attempts at this haven't worked great. I'm not sure if it's that the models aren't there yet or if it'll just take more skillful prompting.

Take care of remembering things for me.

My memory is both pretty lossy and it's effortful to hold things in mental context. Without external aid, I will go through my day reserving a chunk of brain for remembering what I'm doing, deadlines, must-do's. As the standard wisdom goes, write stuff down so you can stop thinking about it. A goal is to get the exobrain to remember as much stuff and context as possible, so I don't have to, freeing up my mind to focus on what's in front of me.

Facilitate quick and effective context switching.

When I switch back to a complicated task or project, especially after a while, there can be a slow and lossy step of "remembering where I was at, remembering what I need to do next". Via externalizing memory to a vastly less lossy system, I want to make it so I can switch between tasks and restore context far better than the human default.

Record and legibilize my life for later analysis

Suppose a couple of times a year, I engage in some kind of social conflict. Between one and the next incident, the details become fuzzy. However, if I were to write them down, later I (or an LLM) could go back over them and find patterns worth noting.

There's also more mundane data that can be pulled into the system, like RescueTime and my various wearables.

Be the single place that I look for keeping track of my life

Beware Trivial Inconveniences. If my to-do list, my reading list, my sleep analytics, my list of projects, my journals, etc., are split between different apps, then it's very likely I will not reliably switch between all of them.

My idea is there's one app that I can check repeatedly, and that one app shows me everything I want brought to my attention.

The tradeoff is that dedicated individual apps perform their individual functions better than everything-apps, but with LLMs making it so cheap to make software, that consideration is dramatically weakened. I can replicate what I want pretty easily.

Relatedly, I like pulling data from all the sources in a central database to make it easier to analyze later (or continuously, as part of monitoring and reports).

But couldn't you do all these things already?

Yes, in some form. You could make copies of a book before the printing press. The point is to make these operations vastly cheaper and easier so that I do far more of them.

Part 2: The Software

I'm going to go moderately thorough here for the sake of people who want to emulate some of this. I may share the codebase, but it'd require a few hours of cleanup.

Tech stack: React + TypeScript, NextJS, Prisma, hosted on Vercel, Neon Postgres Database.

Most significant differences from standard LLM chat

Legible memory/storage backend in notes/documents^[1] and todos
Various cron jobs
System of prompts (global + job specific)
Heavy integration with voice recordings, + transcripts as primary input
"The Board" as central way to read from the system, rather than chat
Lots of UI to make debugging what's going on easier, e.g., to all tool calls and system prompts. Also tracking API costs because it ain't that cheap.

The App

Perhaps the easiest way to demo the app is to go through the pages on the left sidebar.

Chat

Naturally, there's a chat interface. As mentioned, a lot of the UI helps me debug what's going on, e.g., the thinking blocks, tool calls, and also the estimated cost of each response.

Getting caching working was important for costs. API rates aren't as favorable as in the Claude app/browser and Claude Code rates.

"The Board"

In the early versions, the LLM just output what would become the contents of The Board into a chat thread. This had multiple downsides:

It meant that when discussing the content with the LLM, I'd have to scroll up and down.
It made for a noisy crowded chat from my perspective as a user.
If each output was input included in the chat transcript sent to the LLM API, it made for a long and expensive chat history.

Primarily to address (1), I developed the Board abstraction. On desktop, I display it side by side with the MAIN THREAD. On mobile, I swipe left and right in the MAIN THREAD thread to go between chat and The Board.

Every midnight, a new MAIN THREAD is created (to manage context length) and is seeded with a starting message/prompt that includes recently edited/created notes and todos, and other contextual data that changes day to day. That message is additive to the global system prompt.

The Board has a mix of LLM-generated content and automatically displayed content directly based on direct database data. Originally, the entire thing was LLM-generated, but the LLMs struggled to follow instructions well for formatting multiple different sections, so I many elements out since they don't need to be LLM generated. (I also initially thought the LLM could creatively experiment with different nice formats for info display, but unfortunately not, at least with my prompt-fu.)

Automatically generated sections are:

Calendar
Due Reminders (from to-do system)
Daily Reminders (standing reminders I don't want to forget)
Logging Prompts (for when I'm doing daily logging, these remind me what to log)
Projects List (so when I'm thinking about what to do, I remember all my projects)

Also, while it's not apparent from the displayed Board, all todo items referenced on the board have attached id attributes in the html that LLMs who are reading and writing to The Board are able to see. This helps them a lot.

My Calendar is synced with Google Calendar (as the backend). The LLMs within my app have access to tool calls for creating and editing Gcal events.

Notes

There's nothing particularly novel about my Notes/Documents system that's part of the app. It has views/filtering on the list page, categories, priorities, and a notion of "Foreground" for notes that are current (which so far hasn't actually been helpful).

Notes do have an option, "Protected", that disallows the LLM from editing them by default (I think there's an option in the toolcall to override). Initially, I tried to have the LLMs edit the system prompts, but it caused enough issues for me to disallow that.

Naturally, the LLM makes notes, typically in response to voice transcripts.

Todos

Similar to Notes, there's nothing particularly novel about my Todos implementation. Earlier on, I was using Notion as a backend for both notes and todos, and then one-by-one migrated them over since working with my own DB is better than API calls to Notion, plus more flexibility.

Possible worth-mentioning fields of my todos are:

remindAt
push (whether to send a push notification when a reminder fires)
recurrence rules
- Todos with reminders can be set to recur after being marked done. The recurrence can be from completion (e.g., for periodically cleaning something) or from when last fired (e.g., weekly, put the garbage bins out).

The neat thing is that the LLM has tool call definitions that include all these fields, and so when verbally describing a todo, it's not hard and quite reliable for me to specify things like push notification and recurrence rules (plus basics like due date and priority). If I don't, the model infers.

The ability to make todos verbally rather than opening an app is the difference between me using them vs not.

Idiosyncratic to me is that due dates can be actual dates, or they can be strings like "Today", "Tomorrow", which don't mean literally that and are more an indication of how soon I intend to do something.

What's great about the voice interface is I can sit down (or stand, whatever) and look at the board or the todo page and very quickly describe all the updates that should be made (x is done, y is blocked on...) very quickly.

Ideally, the LLMs would be better at looking at the state of my todos and suggesting next actions, so far I haven't gotten there, but just having them recorded well is incredibly useful.

Transcripts

Transcripts are a big deal because they're overwhelmingly the primary way that I actively put info into the Exobrain. Until we get thought-reading, voice is faster than typing, and more importantly, possible to do while doing other things.

There are a few routes via which transcripts get made, but primarily though the companion Exobrain Android app (discussed below). Transcripts are via Deepgram, and they're not amazing, but good enough most of the time.

The transcripts page shows recent transcripts, and for each transcript, the tool calls it resulted in, e.g., notes and todos that have been created or edited. The pills expand when clicked and also have hover previews.

One thing is that the global system prompt instructs the LLM to reference source transcripts when creating and updating notes and todos, which makes it easier to trace things back to their source.

Projects

A project represents a whole cluster of doing. It can be as broad as the project of "study science and math" and as narrow as "get the main panel upgraded for my house". Each can have lots of "state": todos, notes, transcripts, thoughts, etc. The Project abstraction for tying those together.

Going back to the goals of my Exobrain in part one, the point is:

I have enough projects that it's easy for me to forget about some of them. I like having a list such that when I'm choosing what to do on a free evening, I'm not picking the first thing that comes to mind, and instead prioritizing among all options.
When I pick up a project, I want to easily boot back up all relevant context for that project. Also, it's useful to organize notes, etc.

A non-obvious design choice: Projects can be associated with Todo item categories, e.g., there's a "Car" project and also a corresponding todo item category that causes those todos to be associated with the project.

Projects can also have sub-projects. The parent project will display all todos for its children.

Graphs

For data from my wearables (EightSleep, Oura ring, Lief (deprecated)) and self-reports. There's also a table of "significant events" that I manually curate for reference when looking over the graphs. (Omitted for privacy).

My Sleep metrics combine between wearables for hopefully more trustworthy data. Could use more auditing.

Usage

I have an LLM Usage page.

Alas, little pocket intelligences aren't cheap. With limited usage, the app costs something like 250USD/month to run, overwhelmingly in LLM API costs (as opposed to Vercel and Neon Postgres database). It's far from cheap but worth it. $10/day for a very capable personal assistant (or upgrade of your mind) is very worth it (as someone living in The Bay Area and making a software engineering spectrum salary.

Still, I don't want to pay more than necessary. I've done a moderate amount of optimization to ensure prompt caching is working, and that I only preload necessary context into conversations (e.g., not all notes and all todos, just recently edited ones, for example) and do so in an efficient format, e.g., TSV for todos rather than JSON array with its repetitive field names.

The Android App

The arch purpose of the Android app is for capturing audio recordings and sending them to my server. Once I have it though, it can be exapted for other useful purposes like intercepting data from wearable that doesn't have an API^[2], intercepting and processing my notifications, being a "share with" location that sends items to my Exobrain, e.g., to-read-later items.

The Android app is its own repo. I use picovoice for a custom "wake word" to trigger recording, "Hey Exo". There's chunking of the audio recording that incrementally sends 5 minutes of audio. Raw audio is stored encrypted, and transcripts go into the database.

(I also have a separate recording app that automatically uploads recordings to a folder in Google Drive that's monitored by a cron job; it's a nice backup.)

For what it's worth, the Android app is a huge win for vibe coding. I've made web apps; I have never made an Android app, never worked in Kotlin, and the LLMs fully took care of that.

Tying it back to the goals

Now that I've displayed the UI, let me map the elements back to the goals.

Help me answer what should I be doing right now?

Voice recordings and chat capture context from my life, get stored as todos and notes.
The Board (including calendar) and push notifications present me with topical items.
Store of todos is also available for querying and can be viewed with filters/views for different purposes, e.g., reviewing top priority, by category, or recently created.
Eventually, the Exobrain can provide more sophisticated prioritization suggestions

Take care of remembering things for me

Voice recordings are the main mechanism right now, supplemented by chat inputs.
Could potentially read from email, Slack, and so on.

Facilitate quick and effective context switching

It's easy to narrate my thoughts on topics and projects, have that transcribed and turned into notes, thereby increasing capture of content that can be referenced later.
Projects collect relevant info on, well, projects, for booting back up into.

Record and legibilize my life for later analysis

Voice transcripts used for easy and consistent 2x (or more) daily logging; The Board has prompts reminding me of what I want to log.
System pulls in wearable data and other data into a personal Data Lake for analysis.

Be the single place where I keep track of my life

App incorporates all of its own essential functions rather than relying on external apps, e.g., has its own todos and notes systems.
App has graphs of all the things I want to be tracking right within the app.

As above, one can get much of this functionality elsewhere. Todo apps and personal wikis aren't new. Voice recordings aren't new. Project management isn't new. I find that by having my own personal app that I tailor to exactly to my needs and preferences, I achieve a degree of seamlessness and fit that allows it to become an extension of myself, and part of my key functioning.

And I expect that as the models get more powerful (though I wish they wouldn't), the utility of Exobrain will only increase.

"Yes, everything was destroyed and we're just a residual historic simulation, but for a beautiful moment in time I had a really neat cognitive prosthetic."

Appendix: The Prompts

System prompts live in markdown files. There's a global prompt and individual prompts for contexts, e.g., chats, and the cron LLM jobs that run.

I have custom syntax @@[[file name]], which will unroll one markdown file within another when being used as a system prompt, making the prompts composable.

It's risky to have the models edit the prompts directly (they can mess them up), so I have a "Unprocessed Prompt Changes" where I let the models collect changes I've asked for, then I batch process them into the canonical prompts.

Global System Prompt (.md)

Board Instructions Prompt – format of the board, how to update

Process New Transcripts Prompt

Check-in Prompt (periodic update job)

^{^}
They use markdown syntax but aren't stored as distinct markdown files, just in Postgres.
^{^}
This is the Lief HRV wearable. Intercepting its data of bluetooth was too temperamental; unfortunately, I also updated downwards on the value of HRV data for me.

24

My Exobrain Software (forays into cyborgism)

24

Part 1: The goals

Specific near-term goals of my exobrain

Help me answer the question of what should I be doing right now?

Take care of remembering things for me.

Facilitate quick and effective context switching.

Record and legibilize my life for later analysis

Be the single place that I look for keeping track of my life

But couldn't you do all these things already?

Part 2: The Software

Most significant differences from standard LLM chat

The App

Chat

"The Board"

Notes

Todos

Transcripts

Projects

Usage

The Android App

Tying it back to the goals

Appendix: The Prompts

24

Tone/Personality

Response Formatting

Your Intended Purpose

Which specific tasks do you do?

Tools

Calendar Integration

To-Do System

Notes System

When to Create Notes

When to Query Notes (in chat, ***when no snapshot provided***)

Note Lifecycle

What NOT to Store as Notes

Journals & Logging

Primary Journals

Specialized Logs

Journal Append-Only Rule (CRITICAL)

Comprehensive Information Extraction

Terminology

Behavioral Rules

THE BOARD (important)

INSTRUCTIONS FOR FORMAT OF "THE BOARD"

YOUR OUTPUT SECTIONS

DO NOT INCLUDE

Reminder At Semantics

Weather

[OPTIONAL] URGENT TODO's

TODAY'S TASKS

UPCOMING TASKS

STATS

[OPTIONAL] EXOBRAIN'S INFERENCES & OBSERVATIONS

[OPTIONAL] QUESTIONS

WORK ITEMS

REMEMBERING PROJECTS

Should there be a push notification?

DASHBOARD VS. ADVISOR DISTINCTION

WHAT NOT TO DO IN THE BOARD

Prioritization Rules

LOG FILES

PROJECTS

FORMATTING

General Rules

Structure Elements

Todo ID Attributes

Spacing Pattern

UPDATING THE BOARD

Your Context

Main Classes of Outputs

Journal Output Destinations

The Board (Your Only Output)

How to Update the Board

Your Board Content Here

Board Content Guidelines

IMPORTANT: Data Already Provided - Avoid Wasteful Tool Calls

Your Context

Your Task

When to Query Notes (in chat, when no snapshot provided)