How to build a fine-tuning dataset for code completion? #35

FWLamb · 2024-07-23T14:11:04Z

I want to fine-tune a model on my company's in-house component source code so that it can do code completion. How should I build the dataset?
For instruction-style code generation, should each sample be built in this form?
{
"input": "# write a quick sort algorithm",
"output": "your quick sort algorithm code"
}
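
To make the format above concrete, here is a minimal sketch of how such records might be mined from existing source files. It assumes the in-house code is Python and that functions carry docstrings. The helper names `build_instruction_records` and `to_jsonl` are hypothetical, and the `input`/`output` field names are simply the ones from the example above, not a format required by any particular training script.

```python
import ast
import json


def build_instruction_records(source: str) -> list[dict]:
    """Turn each documented function in `source` into an input/output pair.

    Assumption: the first docstring line is a usable instruction.
    """
    records = []
    tree = ast.parse(source)
    for node in ast.walk(tree):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            doc = ast.get_docstring(node)
            if not doc:
                continue  # skip undocumented functions: no instruction text
            code = ast.get_source_segment(source, node)
            instruction = "# " + doc.strip().splitlines()[0]
            records.append({"input": instruction, "output": code})
    return records


def to_jsonl(records: list[dict]) -> str:
    """Serialize records as JSON Lines (one JSON object per line)."""
    return "\n".join(json.dumps(r, ensure_ascii=False) for r in records)


# Tiny illustrative input; real data would come from the repository files.
EXAMPLE_SOURCE = '''
def add(a, b):
    """Return the sum of a and b."""
    return a + b

def helper(x):
    return x
'''

if __name__ == "__main__":
    print(to_jsonl(build_instruction_records(EXAMPLE_SOURCE)))
```

Only `add` yields a record here, because `helper` has no docstring to serve as the instruction. Whether docstrings are a good enough proxy for user instructions depends on the codebase.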
And how should I build a dataset for code insertion, i.e. fill-in-the-middle (FIM)?
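
For the FIM question, one common approach is to cut each source file at two random points and ask the model to produce the middle span given the prefix and suffix. The sketch below illustrates that idea under stated assumptions: the sentinel strings `<fim_prefix>`, `<fim_suffix>` and `<fim_middle>` are placeholders and must be replaced with whatever special tokens the target model's tokenizer actually defines, the prefix-suffix-middle ordering is only one possible layout, and `make_fim_sample` is a hypothetical helper name.

```python
import random

# Placeholder sentinels (assumption): substitute the target model's real FIM tokens.
FIM_PREFIX = "<fim_prefix>"
FIM_SUFFIX = "<fim_suffix>"
FIM_MIDDLE = "<fim_middle>"


def make_fim_sample(code: str, rng: random.Random) -> dict:
    """Split `code` into prefix/middle/suffix at two random character offsets."""
    if not code:
        raise ValueError("code must be non-empty")
    # Two distinct cut points guarantee a non-empty middle span.
    start, end = sorted(rng.sample(range(len(code) + 1), 2))
    prefix, middle, suffix = code[:start], code[start:end], code[end:]
    # Prefix-suffix-middle layout: the model sees both sides, then fills the gap.
    prompt = f"{FIM_PREFIX}{prefix}{FIM_SUFFIX}{suffix}{FIM_MIDDLE}"
    return {"input": prompt, "output": middle, "prefix": prefix, "suffix": suffix}


if __name__ == "__main__":
    snippet = "def f(x):\n    return x\n"
    sample = make_fim_sample(snippet, random.Random(0))
    print(sample["input"])
    print(repr(sample["output"]))
```

Cutting at character offsets is the simplest choice. Cutting at line or syntax-node boundaries may match real editor usage more closely, at the cost of a more involved splitter.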

Muhtasham linked a pull request Jan 29, 2025 that will close this issue

add fine-tuning code with lora support #44

Open
