Lists (1)
Sort Name ascending (A-Z)
Stars
Performant, batteries-included completion plugin for Neovim
Exploring Applications of GRPO
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Clue inspired puzzles for testing LLM deduction abilities
Train your own SOTA deductive reasoning model
Letting Claude Code develop his own MCP tools :)
Verdict is a library for scaling judge-time compute.
A framework for pitting LLMs against each other in an evolving library of games ⚔
A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning
qpwo / dsv3-lowmem
Forked from deepseek-ai/DeepSeek-V3run deepseek v3 on a single node. Drops unused experts from memory.
A user-friendly, feature-rich UI enhancing interaction with Anthropic's Claude AI, enabling model selection, chat saving, and improved prompt editing.
A native Jupyter notebook frontend with local + remote kernels, reactive cells, and IDE features, implemented in Rust
A generative world for general-purpose robotics & embodied AI learning.
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Python tool for converting files and office documents to Markdown.
aider is AI pair programming in your terminal
Code for the manim-generated scenes used in 3blue1brown videos
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.