View mahdiabdollahpour's full-sized avatar

Inference Llama 2 in one file of pure C

C 18,132 2,207 Updated Aug 6, 2024

Explainability of Deep RL algorithms using graph networks and layer-wise relevance propagation.

Jupyter Notebook 9 Updated Aug 20, 2024

LLM training in simple, raw C/CUDA

Cuda 25,930 2,971 Updated Oct 2, 2024
Jupyter Notebook 782 384 Updated Mar 12, 2024

Language Quantized AutoEncoders

Python 100 5 Updated Feb 7, 2023

Mora: More like Sora for Generalist Video Generation

Python 1,550 103 Updated Oct 10, 2024

Code for the paper "Training Diffusion Models with Reinforcement Learning"

Python 401 26 Updated Jul 5, 2023

DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support

Python 492 47 Updated Mar 22, 2024

Large Context Attention

Python 687 53 Updated Jan 24, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 39,873 6,539 Updated Dec 9, 2024

Serve, optimize and scale PyTorch models in production

Java 4,298 875 Updated Mar 3, 2025

Supercharge huggingface transformers with model parallelism.

Python 76 3 Updated Oct 7, 2024

Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.

Python 8,510 747 Updated Dec 10, 2023

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Python 1,517 152 Updated Oct 28, 2024

Query language for blending SQL logic and LLM reasoning across structured + unstructured data. [Findings of ACL 2024]

Python 88 5 Updated Oct 26, 2024
Jupyter Notebook 55 62 Updated Apr 16, 2024

DSPy: The framework for programming—not prompting—language models

Python 22,296 1,707 Updated Mar 6, 2025

A benchmark to evaluate language models on questions I've previously asked them to solve.

Python 978 72 Updated Feb 1, 2025

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,455 899 Updated Jul 1, 2024
Jupyter Notebook 1 Updated Feb 12, 2024

[NeurIPS 2020] Official code for the paper "DeepSVG: A Hierarchical Generative Network for Vector Graphics Animation". Includes a PyTorch library for deep learning with SVG data.

Jupyter Notebook 1,027 103 Updated Aug 26, 2024

Manage scalable open LLM inference endpoints in Slurm clusters

Python 253 23 Updated Jul 11, 2024

Train transformer language models with reinforcement learning.

Python 12,251 1,659 Updated Mar 5, 2025

Official inference library for Mistral models

Jupyter Notebook 10,037 897 Updated Nov 12, 2024
Python 1 Updated Dec 17, 2023

Solve puzzles. Learn CUDA.

Jupyter Notebook 10,631 820 Updated Sep 1, 2024

Solve puzzles. Improve your pytorch.

Jupyter Notebook 3,459 306 Updated Jul 15, 2024

Mamba SSM architecture

Python 14,154 1,234 Updated Jan 18, 2025