-
AGH University of Science and Technology
- Kraków, Poland
- in/arkadiusz-paterak
Highlights
- Pro
Stars
aider is AI pair programming in your terminal
Simple and easily configurable grid world environments for reinforcement learning
Attention based model for learning to solve different routing problems
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Open-source, state-of-the-art vehicle routing problem solver in an easy-to-use Python package.
Python package to read and write vehicle routing problem instances.
Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
A bibliography and survey of the papers surrounding o1
A course on aligning smol models.
Qiskit is an open-source SDK for working with quantum computers at the level of extended quantum circuits, operators, and primitives.
Reinforcement Learning environments based on the 1993 game Doom
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
COLMAP - Structure-from-Motion and Multi-View Stereo
A Python implementation of the "CoSyne" algorithim, as described in this paper: https://pdfs.semanticscholar.org/966e/41903b4aff42601a188bd7b26d71ef120d11.pdf
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
Access to all MiniZinc functionality directly from Python
🗺 MapSCII is a Braille & ASCII world map renderer for your console - enter => telnet mapscii.me <= on Mac (brew install telnet) and Linux, connect with PuTTY on Windows
We write your reusable computer vision tools. 💜
Code for the manim-generated scenes used in 3blue1brown videos
LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.