Skip to content

Latest commit

 

History

History
19 lines (12 loc) · 1.75 KB

README.md

File metadata and controls

19 lines (12 loc) · 1.75 KB

dolly-v2-12b

Databricks' dolly-v2-12b, an instruction-following large language model trained on the Databricks machine learning platform that is licensed for commercial use. Based on pythia-12b, Dolly is trained on ~15k instruction/response fine tuning records databricks-dolly-15k generated by Databricks employees in capability domains from the InstructGPT paper, including brainstorming, classification, closed QA, generation, information extraction, open QA and summarization. dolly-v2-12b is not a state-of-the-art model, but does exhibit surprisingly high quality instruction following behavior not characteristic of the foundation model on which it is based.

For more information -> https://huggingface.co/databricks/dolly-v2-12b

Notes

This deployment рассчитан на использоваание on GPUs NVIDIA V100, A100 and H100. After launch container, the application should download the trained model from the project repository with total weight of 24Gb and it may take some time depending on the internet speed of GPU provider.

Logs

Unfortunately, I could not achieve a normal display of logs. They do not display an indicator that the trained application models are loading, and they do not display requests to the server. But when manually launched locally, the logs are displayed. Therefore, in this case, in order to find out that the application is ready to work, you will have to constantly update the application page and wait for the interface to appear. Now the logs look like on this screenshot: Screenshot_20230703_201823

Demo Video

Peek.2023-07-03.20-16.mp4