Skip to content

Dataset

Reshinth Adithyan edited this page Jul 24, 2021 · 21 revisions

Dataset used for Causal Language Modelling:

We collected a manual dataset by scrapping the publicly available Github.

Datasets used for Fine-Tuning :

The Pre-Trained Model is fine-tuned with APPS Dataset. APPS Benchmark includes 10,000 problems, which range from having simple one-line solutions to being substantial algorithmic challenges. The Fine-Tuning is done by giving initial context by giving the Natural Language Prompt alongside the Starter Code, Sample Input/Output.

Page Directory

Clone this wiki locally