Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

**"Inconsistent checkpoint loading behavior between local machine and GPU cluster"** #151

Open
PabloDumenil opened this issue Jun 3, 2024 · 1 comment

Comments

@PabloDumenil
Copy link

Description

I am encountering a RuntimeError when trying to load a checkpoint using the CLAP model on a GPU cluster. The error message indicates that there are unexpected key(s) in the state_dict.

Error Message

RuntimeError: Error(s) in loading state_dict for CLAP:
Unexpected key(s) in state_dict: "text_branch.embeddings.position_ids".

Steps to Reproduce

  1. Install laion-clap using pip.
  2. Attempt to load the checkpoint using the following code:
import torch
import laion_clap

# Load the CLAP model
model = laion_clap.CLAP_Module(enable_fusion=False)
model.load_ckpt()

Additional Information:

The code works fine on my local machine, but fails on the GPU cluster.

Python version: 3.10.8

What I Have Tried:

Verified that all dependencies are correctly installed.
Attempted to filter the state_dict to remove unexpected keys, but encountered further issues.
@lukewys
Copy link
Contributor

lukewys commented Jun 3, 2024

Please see #118. Apologize that we haven't pushed the pip package.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants