psychologyAntikythera Model
Technical build: app (CLI + API + web), Karpathy quality loop, RAG over journal.antikythera.org, deployment
Training pipeline: SFT on Qwen3-30B via Prime Intellect Lab → DPO with Benjamin's A/B preferences → RLHF as continuous curation loop. RLHF is the permanent dynamic — model tracks a living intellectual program. SFT/DPO not yet available on Lab (announced, coming soon). Corpus preparation is the immediate task. Cost estimate: $1.2-4.5K total. Deep dive: projects/antikythera-philosopher/PRIME-INTELLECT-DEEP-DIVE.md
Platform: Prime Intellect Lab — full-stack hosted training (RL, SFT, DPO), per-token pricing, CLI-driven.
Key insight: RLHF is not train-then-deploy. It's a continuous curation dynamic — the model and Benjamin's taste co-evolve. New books, new positions, new analyses update what "sounds right." The model stays calibrated to a living research program.
Estimated total cost: $1,200–$4,500 · Base model: Qwen3-30B-A3B · Full deep dive: projects/antikythera-philosopher/PRIME-INTELLECT-DEEP-DIVE.md
pip install prime-cli