[TensorRT-LLM] TensorRT-LLM version: 0.15.0
0.15.0
[12/20/2024-20:09:51] [TRT-LLM] [W] Implicitly setting LLaMAConfig.tie_word_embeddings = False
5it [00:00, 25.38it/s]
Traceback (most recent call last):
  File "/home/ubuntu/TensorRT-LLM/examples/llama/convert_checkpoint.py", line 555, in <module>
    main()
  File "/home/ubuntu/TensorRT-LLM/examples/llama/convert_checkpoint.py", line 547, in main
    convert_and_save_hf(args)
  File "/home/ubuntu/TensorRT-LLM/examples/llama/convert_checkpoint.py", line 488, in convert_and_save_hf
    execute(args.workers, [convert_and_save_rank] * world_size, args)
  File "/home/ubuntu/TensorRT-LLM/examples/llama/convert_checkpoint.py", line 495, in execute
    f(args, rank)
  File "/home/ubuntu/TensorRT-LLM/examples/llama/convert_checkpoint.py", line 472, in convert_and_save_rank
    llama = LLaMAForCausalLM.from_hugging_face(
  File "/home/ubuntu/.local/lib/python3.10/site-packages/tensorrt_llm/models/llama/model.py", line 405, in from_hugging_face
    loader.generate_tllm_weights(model)
  File "/home/ubuntu/.local/lib/python3.10/site-packages/tensorrt_llm/models/model_weights_loader.py", line 408, in generate_tllm_weights
    self.load(tllm_key,
  File "/home/ubuntu/.local/lib/python3.10/site-packages/tensorrt_llm/models/model_weights_loader.py", line 296, in load
    v = sub_module.postprocess(tllm_key, v, **postprocess_kwargs)
  File "/home/ubuntu/.local/lib/python3.10/site-packages/tensorrt_llm/quantization/layers.py", line 1478, in postprocess
    new_amax = max(weight_scaling_factors).reshape(1, ).to(
TypeError: '>' not supported between instances of 'NoneType' and 'NoneType'
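
For anyone triaging this: the final frame calls Python's built-in max() on weight_scaling_factors, and the same message appears whenever max() compares two None values. The sketch below uses plain Python only, not TensorRT-LLM. The list missing_scales and the helper describe_max_failure are hypothetical names made up for illustration. It suggests, but does not prove, that at least the first two scaling factors reaching postprocess were None, i.e. they may never have been loaded from the checkpoint.

```python
def describe_max_failure(values):
    """Return the TypeError message raised by max(values), or None if max() succeeds."""
    try:
        max(values)
    except TypeError as exc:
        return str(exc)
    return None


# Hypothetical stand-in for scaling factors that were never populated.
missing_scales = [None, None, None]

message = describe_max_failure(missing_scales)
print(message)
```

A list of real numbers, such as [1.0, 2.0], goes through max() without error, so checking which entries of weight_scaling_factors are None just before the failing line would be a reasonable first debugging step.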
System Info
x86_64, Debian 11, NVIDIA L40S GPU
Who can help?
No response
Information
Tasks
examples folder (such as GLUE/SQuAD, ...)
Reproduction
Expected behavior
The checkpoint should be converted successfully.
actual behavior
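convert_checkpoint.py fails with TypeError: '>' not supported between instances of 'NoneType' and 'NoneType' (full traceback included in this report).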
additional notes
N/A