Skip to content

Commit

Permalink
Update TGI copy (#438)
Browse files Browse the repository at this point in the history
  • Loading branch information
gongy authored Sep 5, 2023
1 parent 45dbdc1 commit 2b1cbe6
Showing 1 changed file with 2 additions and 3 deletions.
5 changes: 2 additions & 3 deletions 06_gpu_and_ml/text_generation_inference.py
Original file line number Diff line number Diff line change
Expand Up @@ -35,9 +35,8 @@
# the largest 70B model, the 135GB model can be loaded in as little as 70 seconds.
#
# ### Download the weights
# Since TGI uses `snapshot_download` under the hood, running this function for our image build
# will place the weights into the cache directly. There are no progress bars as this uses
# the high-throughput `hf-transfer` library, but expect ~700MB/s for this step.
# We can use the included utilities to download the model weights (and convert to safetensors, if necessary)
# as part of the image build.
#


Expand Down

0 comments on commit 2b1cbe6

Please sign in to comment.