This is a simple code evaluator that uses an LLM to evaluate the correctness of a code snippet.
Demo: `demo.mp4`
- Clone this repository
- Run `pip install -r requirements.txt`
- Add a `.env` file with the following variable:
  - `OPENAI_API_KEY`: Your OpenAI API key
- Run `python app.py`
- The server will run on `localhost:8000`
- Clone the client repository and follow the instructions to run the client.
- The evaluator uses an LLM to evaluate the correctness of a code snippet via the server's `evaluate` endpoint.
- The `messages` field carries the conversation history between the user and the LLM.
- The `user_code_snippet` field carries the code snippet that the user has written.
- The `thread_id` field identifies the conversation thread.
- The evaluator returns a `ModelResponse` object.
- Endpoint: `POST /init/`
- Description: Initializes a new conversation and returns a thread ID
- Request: No payload required
- Response:

```json
{ "messages": Array<Message>, "final_output": { "response": string }, "thread_id": string }
```

Messages in the array can be of three types:

- System Message: `{"type": "system", "content": string}`
- Human Message: `{"type": "human", "content": string}`
- AI Message: `{"type": "ai", "content": string}`
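To start a conversation from a client, `/init/` can be called with an empty POST. Below is a minimal sketch using only the Python standard library, assuming the server from the setup steps is listening on `localhost:8000`; the helper names are illustrative, not part of the project:

```python
import json
import urllib.request

def post_json(url, payload=None):
    """POST a JSON body (empty object by default) and decode the JSON reply."""
    data = json.dumps(payload or {}).encode("utf-8")
    req = urllib.request.Request(
        url, data=data, headers={"Content-Type": "application/json"}, method="POST"
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read().decode("utf-8"))

def extract_thread_id(init_response):
    """Pull the thread ID out of an /init/ response for use in later requests."""
    return init_response["thread_id"]

if __name__ == "__main__":
    # Assumes `python app.py` is already running locally.
    init = post_json("http://localhost:8000/init/")
    print(extract_thread_id(init))
    print(init["final_output"]["response"])
```

The returned `thread_id` is what the later `/evaluate/` and `/feed-question/` calls expect.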
- Endpoint: `POST /evaluate/`
- Description: Evaluates a code snippet and provides feedback
- Request Body:

```json
{ "messages": Array<Message>, "user_code_snippet": string, "thread_id": string }
```

- Response:

```json
{ "messages": Array<Message>, "final_output": { "status": string, "score": number, "comment": string, "hint": string }, "thread_id": string }
```
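Given a thread ID from `/init/`, a client can submit code for feedback. A sketch of building and sending the `/evaluate/` request body with the standard library (the helper names and sample snippet are illustrative assumptions, not part of the project):

```python
import json
import urllib.request

def build_evaluate_payload(messages, user_code_snippet, thread_id):
    """Assemble the request body documented for POST /evaluate/."""
    return {
        "messages": messages,
        "user_code_snippet": user_code_snippet,
        "thread_id": thread_id,
    }

def evaluate(base_url, payload):
    """POST the payload to /evaluate/ and return the decoded JSON response."""
    req = urllib.request.Request(
        f"{base_url}/evaluate/",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read().decode("utf-8"))

if __name__ == "__main__":
    payload = build_evaluate_payload(
        messages=[{"type": "human", "content": "Write an add function."}],
        user_code_snippet="def add(a, b):\n    return a + b",
        thread_id="<thread id from /init/>",
    )
    out = evaluate("http://localhost:8000", payload)["final_output"]
    print(out["status"], out["score"], out["comment"], out["hint"])
```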
- Endpoint: `POST /feed-question/`
- Description: Feeds a new question into the conversation
- Request Body:

```json
{ "question": string, "messages": Array<Message>, "thread_id": string }
```

- Response:

```json
{ "messages": Array<Message>, "final_output": { "response": string }, "thread_id": string }
```
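A new question can be pushed into an existing thread the same way. A minimal sketch of the `/feed-question/` call, again assuming a local server and using illustrative helper names:

```python
import json
import urllib.request

def build_feed_question_payload(question, messages, thread_id):
    """Assemble the request body documented for POST /feed-question/."""
    return {"question": question, "messages": messages, "thread_id": thread_id}

def feed_question(base_url, question, messages, thread_id):
    """POST a new question to /feed-question/ and return the decoded JSON reply."""
    body = json.dumps(build_feed_question_payload(question, messages, thread_id))
    req = urllib.request.Request(
        f"{base_url}/feed-question/",
        data=body.encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read().decode("utf-8"))

if __name__ == "__main__":
    reply = feed_question(
        "http://localhost:8000",
        question="Reverse a string without slicing.",
        messages=[],  # prior history from /init/ would normally go here
        thread_id="<thread id from /init/>",
    )
    print(reply["final_output"]["response"])
```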