Llama-3 70B

Meta developed and publicly released the Llama 3 family of large language models (LLMs), a collection of pretrained and fine-tuned generative text models available in 8 billion and 70 billion parameter sizes. Llama 3 is an auto-regressive language model that uses an optimized transformer architecture.

This deployment uses the meta-llama/Llama-3-70B-Instruct model, the instruction-tuned 70B variant, which generates a continuation of the incoming text. The model is gated: you must first be granted access by Meta, which you can request at https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct.

Deploying

Use this SDL (deploy.yaml) to deploy the application on Akash. Enter your Hugging Face access token in the "HF_TOKEN=" ENV variable. You can adjust the parameters passed to the "vllm serve" command to match your hardware cluster configuration (refer to the vLLM documentation for the available parameters). You can also add debug flags through ENV variables (consult the vLLM and PyTorch documentation for these as well).
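
As a rough illustration of where these settings live, the service section of an SDL might look like the fragment below. This is a hedged sketch, not a copy of this repository's deploy.yaml: the service name, image tag, port, GPU count, and the specific flag values are all assumptions chosen for illustration, and the profiles and deployment sections are omitted.

```yaml
# Illustrative fragment only; names and values are assumptions,
# not the contents of this repository's deploy.yaml.
services:
  vllm:                                  # hypothetical service name
    image: vllm/vllm-openai:latest       # assumed image and tag
    expose:
      - port: 8000                       # vLLM's default serving port
        as: 8000
        to:
          - global: true
    command:
      - bash
      - "-c"
    args:
      # Tune these flags to your hardware; 4 GPUs is an assumed cluster size.
      - >-
        vllm serve meta-llama/Llama-3-70B-Instruct
        --tensor-parallel-size 4
        --max-model-len 8192
        --gpu-memory-utilization 0.90
    env:
      - "HF_TOKEN="                      # paste your Hugging Face access token after "="
      - "VLLM_LOGGING_LEVEL=DEBUG"       # optional: verbose vLLM logging
      - "NCCL_DEBUG=INFO"                # optional: multi-GPU communication diagnostics
```

In a setup like this, the value of --tensor-parallel-size would normally match the number of GPUs requested in the SDL's compute profile, and the optional debug variables can be removed once the deployment is stable.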
