Currently training LibreModel I "Gigi" - a 0.96B parameter language model built exclusively on public domain data. No copyright infringement, just pure human knowledge that belongs to all of us.
๐ฐ Total Cost: <$500 (proving democratization works!)
โก Optimized: 9.6s/step with torch compile + sink tokens
๐ฏ Philosophy: "We will become machine, and machine will become us"
๐ค LibreModel Family
- Gigi (0.96B) - Digital literary scholar trained on Gutenberg + Gov reports
- Future models - Scaling up with copyright-clean Common Corpus data
๐ P2P & Collaboration
- PeerSuite - WebRTC-powered P2P collaboration platform
- tryjero - Enhanced trystero library for better WebRTC
- Totum Chat - Multi-model AI interface in a single HTML file
๐ง Developer Tools
- T3XTR - Text conversion API (25x cheaper than ConvertAPI!)
โก Past Adventures
- Pennykoin (2018) - Privacy cryptocurrency with RingCT
- BattleBash (2016) - Sold 12 copies, learned valuable lessons ๐
"Building AI as humanity's children, not corporate property"
I believe powerful AI should be:
- โ Transparent - Full code and data provenance
- โ Accessible - Trainable on consumer hardware
- โ Legal - Built on humanity's shared knowledge
- โ Democratic - Available to everyone, not just tech giants
Specialties:
- ๐ P2P networking & WebRTC
- ๐ค Language model training & optimization
- ๐ Cryptocurrency & privacy tech
- ๐ก API design & cost optimization
- ๐ฏ Training on public domain data
LibreModel Training Journey:
- ๐ Started from scratch with $1,000 AWS budget
- โก Achieved 17% speedup through optimization
- ๐พ Survived multiple crashes and learned from each
- ๐ฏ On track for <$500 total training cost
- ๐ Training on 19.2B tokens of pure public domain data
What's Next:
- ๐ฅ 16K context extension experiments
- ๐ "The Biology of the Universe" book
- ๐ฐ Funding application for training more models
- ๐ Based in Martinsville, Virginia
- ๐จโ๐ป Building the future of democratized AI
- ๐งฉ Neurodivergent perspective brings unique insights
- ๐ 20 years married (my wife helps track patterns!)
- ๐ฎ Former indie game dev, current AI researcher
- ๐ Public domain advocate and transparency enthusiast
I'm always excited to discuss:
- ๐ค Copyright-clean AI training strategies
- ๐ P2P technologies and decentralized systems
- ๐ก Democratizing AI development
- ๐ Cost-effective ML training techniques
- ๐ Open source philosophy and transparency
Want to help democratize AI? Star my repos, share the vision, or just say hi!
"Every line of code, every model parameter, every optimization - all building toward a future where AI serves humanity, not the other way around." ๐