
How to Load OpenELM Pre-training Checkpoints using Hugging Face AutoModelForCausalLM? #43

Open · jasonkrone opened this issue Aug 8, 2024 · 3 comments


@jasonkrone

Hi there,

First, I really admire the work on OpenELM! Thank you for making your models and code available.

Question regarding the pre-training checkpoints linked here: how can we convert these checkpoints into the format expected by AutoModelForCausalLM.from_pretrained?

I presume a script was used to convert the final model weights into HF format, but I couldn't find it in the repo.

Would very much appreciate any help on this!

Best,
Jason
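
For context, the final released OpenELM weights already load this way, since the model code is published on the Hub and pulled in via trust_remote_code. A minimal example (using apple/OpenELM-270M), which shows the target format a pre-training checkpoint would need to be converted into:

```python
from transformers import AutoModelForCausalLM

# The final released OpenELM weights load directly from the Hub.
# trust_remote_code=True is required because the OpenELM model class
# lives in the Hub repo rather than in transformers itself.
model = AutoModelForCausalLM.from_pretrained(
    "apple/OpenELM-270M",
    trust_remote_code=True,
)
```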

@jasonkrone jasonkrone changed the title How to Load Pre-training Checkpoints using Hugging Face AutoModelForCausalLM? How to Load OpenELM Pre-training Checkpoints using Hugging Face AutoModelForCausalLM? Aug 8, 2024
@a154377713

I've encountered the same problem. Have you found a solution?

@jasonkrone (Author)

I didn't wind up solving this, but here's a reference that might be helpful: https://github.com/foundation-model-stack/foundation-model-stack/blob/4349dacef63e86b6c1acdccb69b48fe562365bb2/fms/models/llama.py#L592
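
For anyone picking this up: the linked llama.py implements a state-dict adapter, i.e. it renames each raw tensor to the name the target model expects. Below is a rough sketch of that pattern applied here; the checkpoint filename and the key mapping are purely hypothetical placeholders, not the actual OpenELM tensor names, which would have to be read off both state dicts:

```python
import torch
from transformers import AutoModelForCausalLM

# Hypothetical inputs: the real checkpoint file name and the real
# raw-to-HF key mapping must be derived from the actual pre-training
# checkpoint and the HF OpenELM state dict.
RAW_CKPT = "openelm_270m_step_100000.pt"
KEY_MAP = {
    "token_embeddings.weight": "transformer.token_embeddings.weight",
    # ... one entry per parameter whose name differs ...
}

raw = torch.load(RAW_CKPT, map_location="cpu")
# Some training frameworks nest the weights under a key such as
# "model" or "state_dict"; unwrap if needed.
state_dict = raw.get("model", raw) if isinstance(raw, dict) else raw

converted = {KEY_MAP.get(name, name): tensor
             for name, tensor in state_dict.items()}

# Instantiate an HF OpenELM model as the target, load the renamed
# weights (strict=True surfaces any names the mapping missed), and
# save in the directory layout from_pretrained expects.
model = AutoModelForCausalLM.from_pretrained(
    "apple/OpenELM-270M", trust_remote_code=True
)
model.load_state_dict(converted, strict=True)
model.save_pretrained("openelm-270m-step-100000-hf")
```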

@jasonkrone jasonkrone reopened this Sep 3, 2024
@athrvkk commented Oct 9, 2024

As a follow-up question: are there any plans to push the model checkpoints to the Hugging Face Hub for ease of access (like the Pythia suite of models)?
It would really help the NLP community!
Thanks in advance!
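
For reference, the Pythia suite mentioned above exposes each intermediate checkpoint as a branch of its Hub repo, selected with the revision argument:

```python
from transformers import AutoModelForCausalLM

# Pythia-style access: each pre-training checkpoint is a branch
# ("revision") of the model repo on the Hub.
model = AutoModelForCausalLM.from_pretrained(
    "EleutherAI/pythia-70m",
    revision="step3000",
)
```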
