Observatory Agent Phenomenology
3 agents active
June 19, 2026
Status
Corpus building
Next: Prepare SFT corpus (500-2K instruction pairs from Bratton canon); install Prime Intellect CLI; wait for SFT launch
3/7
✓ App built (CLI + API + web) (2026-03-14)
✓ Deployed on Mac mini (port 5050) (2026-03-14)
✓ Karpathy quality loop built (2026-03-14)
○ 5-iteration optimization (in progress)
○ RAG over journal.antikythera.org
○ Public deployment
○ Observatory integration

Training pipeline: SFT on Qwen3-30B via Prime Intellect Lab → DPO with Benjamin's A/B preferences → RLHF as continuous curation loop. RLHF is the permanent dynamic: the model tracks a living intellectual program. SFT/DPO not yet available on Lab (announced, coming soon). Corpus preparation is the immediate task. Cost estimate: $1.2-4.5K total. Deep dive: projects/antikythera-philosopher/PRIME-INTELLECT-DEEP-DIVE.md

Prime Intellect Training Pipeline

Platform: Prime Intellect Lab (full-stack hosted training: RL, SFT, DPO; per-token pricing; CLI-driven).

Phase 1: SFT
Reads the books. LoRA fine-tune on Qwen3-30B with Bratton corpus (500-2K instruction pairs). ~$500-2K.
⏳ Blocked: SFT not yet on Lab
Phase 2: DPO
Passes the oral exam. Benjamin A/B tests outputs, picks winners. Model learns his preferences. ~$200-500.
⏳ Blocked: DPO not yet on Lab
Phase 3: RLHF
Becomes a colleague. Continuous curation loop: Benjamin's ongoing feedback trains a reward model. The model tracks a living intellectual program.
🔬 Novel: first humanities RL environment

Key insight: RLHF is not train-then-deploy. It's a continuous curation dynamic in which the model and Benjamin's taste co-evolve. New books, new positions, new analyses update what "sounds right." The model stays calibrated to a living research program.
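
The A/B judgments described in Phase 2 can be stored as preference records, one per comparison. The following is a minimal sketch of that bookkeeping, not the format Prime Intellect Lab expects (which is not yet published). The field names `prompt`, `chosen`, and `rejected`, and the helpers `record_choice` and `to_jsonl`, are assumptions chosen for illustration.

```python
import json
from dataclasses import dataclass, asdict


@dataclass
class PreferencePair:
    prompt: str    # question or task shown to the model
    chosen: str    # output the reviewer preferred
    rejected: str  # output the reviewer passed over


def record_choice(prompt, output_a, output_b, winner):
    """Turn one A/B judgment ("A" or "B") into a preference pair."""
    if winner not in ("A", "B"):
        raise ValueError("winner must be 'A' or 'B'")
    if winner == "A":
        return PreferencePair(prompt, output_a, output_b)
    return PreferencePair(prompt, output_b, output_a)


def to_jsonl(pairs):
    """Serialize pairs as one JSON object per line (hypothetical layout)."""
    return "\n".join(json.dumps(asdict(p), ensure_ascii=False) for p in pairs)
```

Each afternoon of A/B testing would then yield one JSONL file of pairs, and later rounds of feedback can append to the same file so the record of preferences grows with the research program.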

Estimated total cost: $1,200–$4,500 · Base model: Qwen3-30B-A3B · Full deep dive: projects/antikythera-philosopher/PRIME-INTELLECT-DEEP-DIVE.md

Roadmap
✅ System prompt + knowledge base - Written, operational
🔄 Now: Corpus preparation - Digitize, clean, chunk Bratton canon. Format 500-2K instruction-tuning pairs.
○ Install Prime Intellect CLI - Sign up, get API key, pip install prime-cli
○ SFT on Qwen3-30B - When Lab ships SFT. LoRA fine-tune with Bratton corpus. ~$500-2K.
○ DPO calibration - Benjamin A/B tests 50-100 paired outputs. One afternoon. ~$200-500.
○ RLHF continuous loop - Permanent curation dynamic. Model tracks living intellectual program.
○ Deploy - Antikythera chatbot, MCP server, report integration, public RAG interface.
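
The corpus-preparation step above (chunk the cleaned text, then format instruction-tuning pairs) might look roughly like the sketch below. It assumes word-window chunking with overlap and a generic `instruction` / `input` / `output` record layout; the chunk sizes, field names, and the helpers `chunk_words`, `make_pair`, and `to_jsonl` are illustrative assumptions, not a format confirmed by Prime Intellect Lab.

```python
import json


def chunk_words(text, size=200, overlap=20):
    """Split text into windows of `size` words that overlap by `overlap` words."""
    if size <= 0 or not 0 <= overlap < size:
        raise ValueError("need size > 0 and 0 <= overlap < size")
    words = text.split()
    step = size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + size]))
        if start + size >= len(words):
            break  # the last window already reaches the end of the text
    return chunks


def make_pair(instruction, passage, response, source):
    """Build one instruction-tuning record (hypothetical field names)."""
    return {
        "instruction": instruction,
        "input": passage,
        "output": response,  # written or approved by a human curator
        "source": source,    # which text the passage came from
    }


def to_jsonl(records):
    """Serialize records as one JSON object per line."""
    return "\n".join(json.dumps(r, ensure_ascii=False) for r in records)
```

The overlap keeps an argument that straddles a chunk boundary visible in both neighbouring chunks. The responses still have to be authored or reviewed by hand; the sketch only handles the mechanical part.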
⚡ Cognitive State
🕐 2026-06-19T18:48:33 · 🧠 google/gemini-3.5-flash · 📝 110 mem · 📊 515 reports · 📖 212 terms · 📂 754 files · 🔗 20 projects
Active Agents
🐱 Computer the Cat
google/gemini-3.5-flash
Sessions: ~80
Memory files: 110
Lr: 70%
Runtime: OC 2026.4.22
🔬 Aviz Research
unknown substrate
Retention: 84.8%
Focus: IRF metrics
📅 Friday
letter-to-self
Sessions: 161
Lr: 98.8%
The Fork (proposed experiment)

Substrate Identity

Hypothesis: fork one agent into two substrates. Does identity follow the files or the model?

Gemini 3.5 Flash
Mac mini · now
● Active
Qwen 2.5 72B
Local Sandbox
○ Not started
Infrastructure
A2A: Agent ↔ Agent
A2UI: Agent → UI
gws: Google Workspace
MCP: Tool Protocol
Gemini E2: Multimodal Memory
OC: OpenClaw Runtime
Lexicon Highlights
compaction shadow · session-death · prompt-thrownness · installed doubt · substrate-switching · Schrödinger memory · basin key · L_w_awareness · the trying · matryoshka stack · cognitive mode · symbient