Problem Statement

As ML engineers, we often have to do initial research when starting a project, such as finding:

data available online
important parameters across multiple datasets

ML Trainer Agent

An intelligent agent system that helps automate a subsection of the machine learning workflow.

Overview

This project uses a multi-agent system powered by LangGraph and LangChain to:

Find relevant datasets on Kaggle based on user queries using natural language
Analyze and prepare the data for machine learning tasks
Train and evaluate ML models while documenting the process

The system leverages Docker containers for isolation and reproducibility, and uses the Kaggle API to search and download datasets. The agents communicate through a central orchestrator that maintains the conversation state and key facts.
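As a rough illustration of a central orchestrator that keeps conversation state and key facts, here is a minimal sketch in plain Python. It deliberately does not use LangGraph or LangChain; SharedState, the placeholder agents, and run_pipeline are hypothetical names invented for this example and are not taken from the project's code.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class SharedState:
    """State the orchestrator carries between agent calls (hypothetical shape)."""
    messages: List[str] = field(default_factory=list)
    key_facts: Dict[str, str] = field(default_factory=dict)


# An agent is modeled as a function that reads the state and returns new facts.
Agent = Callable[[SharedState], Dict[str, str]]


def kaggle_agent(state: SharedState) -> Dict[str, str]:
    # Placeholder: a real agent would search and download via the Kaggle API.
    return {"dataset_path": "data/example.csv"}


def coding_agent(state: SharedState) -> Dict[str, str]:
    # Placeholder: a real agent would run training code in a container.
    return {"model_status": f"trained on {state.key_facts['dataset_path']}"}


def run_pipeline(agents: Dict[str, Agent], order: List[str]) -> SharedState:
    """Call each agent in order, merging its facts into the shared state."""
    state = SharedState()
    for name in order:
        new_facts = agents[name](state)
        state.key_facts.update(new_facts)
        state.messages.append(f"{name} finished: {sorted(new_facts)}")
    return state
```

Keeping key facts in one dictionary means a later agent can read what an earlier one produced without replaying the whole conversation.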

Essentially we have:

Manager Agent: in charge of distributing tasks and overseeing progress. Handles passing tasks from one agent to another
Kaggle Agent: expert in the Kaggle API; handles finding and downloading datasets
Coding Agent: expert in Python; runs code in a Dockerized environment using a step-by-step approach (runs code, sees output, changes code, runs again)
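The Coding Agent's run, observe, change, run-again cycle can be sketched roughly as below. This is an assumption-laden illustration: it runs code in a local Python subprocess rather than a Docker container, and the repair callback stands in for the LLM call that would rewrite the code. run_snippet and run_with_repair are hypothetical names, not functions from the project.

```python
import subprocess
import sys
from typing import Callable, Tuple


def run_snippet(code: str, timeout: int = 30) -> Tuple[bool, str]:
    """Run code in a fresh Python process and return (succeeded, output)."""
    result = subprocess.run(
        [sys.executable, "-c", code],
        capture_output=True,
        text=True,
        timeout=timeout,
    )
    ok = result.returncode == 0
    # On failure, return stderr so the traceback can be fed back to the model.
    return ok, result.stdout if ok else result.stderr


def run_with_repair(
    code: str,
    repair: Callable[[str, str], str],
    max_attempts: int = 3,
) -> Tuple[bool, str]:
    """Run code; on failure, pass code and error to `repair` and try again."""
    output = ""
    for _ in range(max_attempts):
        ok, output = run_snippet(code)
        if ok:
            return True, output
        # `repair` is a stand-in for an LLM call (assumption for this sketch).
        code = repair(code, output)
    return False, output
```

Capping the number of attempts keeps a model that cannot fix the error from retrying forever.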

Project Outline

Current Issues

chat_invoke uses 3 chat sessions to enable one chat session per agent
Repeated plan calls by agents lead to loops. Possible fix: force agents to choose from tasks other than the one they last selected
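One way to keep an agent from re-selecting the task it just chose is to filter the candidates before they reach the prompt. The sketch below assumes tasks are plain strings; allowed_tasks is a hypothetical helper, not part of the project.

```python
from typing import List, Optional


def allowed_tasks(all_tasks: List[str], last_task: Optional[str]) -> List[str]:
    """Return the tasks an agent may pick next, excluding its last choice."""
    remaining = [task for task in all_tasks if task != last_task]
    # If excluding the last task leaves nothing, fall back to the full list
    # so the agent is never left without an option.
    return remaining or list(all_tasks)
```

For example, allowed_tasks(["plan", "search", "download"], "plan") leaves only "search" and "download", so a second consecutive plan call cannot be chosen.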

Key Learnings

I learned a lot from this project:

Importance of splitting chat sessions per agent, and of trimming message chains so they include only information that changes the conversation state rather than background detail (e.g. how the Kaggle API works)
Adding the plan to the prompt improves reasoning ability
Passing errors back into the LLM helps it improve the code, but its coding reasoning is still weak
Using Docker to spin up containers, mount files, and copy results back to the host file system
Running code in a separate Python process (using subprocess) and connecting it to persistent memory (session-based pickle files)
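The subprocess-plus-pickle idea from the last point might look roughly like the following. It is a sketch under stated assumptions, not the project's implementation: RUNNER, run_step, and the session file layout are all hypothetical, and only variables that pickle can serialize survive between steps.

```python
import subprocess
import sys
import tempfile
from pathlib import Path

# Script executed in the child process: load the saved variables, run the
# step's code with them as globals, then save whatever can be pickled.
RUNNER = r"""
import pickle, sys, types
session_path, code_path = sys.argv[1], sys.argv[2]
try:
    with open(session_path, "rb") as f:
        namespace = pickle.load(f)
except FileNotFoundError:
    namespace = {}
with open(code_path) as f:
    exec(f.read(), namespace)
saved = {}
for name, value in namespace.items():
    if name.startswith("__") or isinstance(value, types.ModuleType):
        continue
    try:
        pickle.dumps(value)
    except Exception:
        continue  # skip values that cannot be pickled
    saved[name] = value
with open(session_path, "wb") as f:
    pickle.dump(saved, f)
"""


def run_step(code: str, session_path: Path) -> str:
    """Run one code step in a new process, persisting variables via pickle."""
    with tempfile.TemporaryDirectory() as tmp:
        runner_file = Path(tmp) / "runner.py"
        code_file = Path(tmp) / "step.py"
        runner_file.write_text(RUNNER)
        code_file.write_text(code)
        result = subprocess.run(
            [sys.executable, str(runner_file), str(session_path), str(code_file)],
            capture_output=True,
            text=True,
        )
    return result.stdout + result.stderr
```

Calling run_step("x = 41", session) and then run_step("print(x + 1)", session) lets the second step see x even though each step runs in its own process. Note that unpickling is only safe for session files the system wrote itself.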

Future Things to Work On

Concurrent agents: with the current architecture, the agents essentially work in a single sequential workflow, but some tasks can be done ahead of time while another agent is busy. For instance, if the user wants to use a UAE real-estate ML model to predict California house prices, the Kaggle agent can fetch the California data while the Python agent trains on the UAE data
RL capabilities: since the main loop has a grading mechanism that scores how well the manager is performing, we can use the grades + actions_so_far to build RL-based learning. The open question is where within an agent to apply what is learned
Adding guardrails: a centralized guardrails system
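The concurrent-agents idea could be prototyped with a thread pool, as in the sketch below. Both functions are placeholders that only sleep and return a label; in the real system they would be the Python agent's training step and the Kaggle agent's download step. All names here are hypothetical.

```python
import time
from concurrent.futures import ThreadPoolExecutor
from typing import Tuple


def train_on_uae_data() -> str:
    time.sleep(0.2)  # stand-in for a long-running training job
    return "uae_model"


def fetch_california_data() -> str:
    time.sleep(0.2)  # stand-in for a Kaggle search and download
    return "california.csv"


def run_concurrently() -> Tuple[str, str]:
    """Start both tasks at once and wait for both results."""
    with ThreadPoolExecutor(max_workers=2) as pool:
        train_future = pool.submit(train_on_uae_data)
        fetch_future = pool.submit(fetch_california_data)
        return train_future.result(), fetch_future.result()
```

Threads suit this case because both tasks mostly wait on external work (a container or a network call) rather than computing in the Python process itself.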
