Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Question] How to control the parallelism way of modules #7017

Open
mengniwang95 opened this issue Feb 8, 2025 · 0 comments
Open

[Question] How to control the parallelism way of modules #7017

mengniwang95 opened this issue Feb 8, 2025 · 0 comments

Comments

@mengniwang95
Copy link

mengniwang95 commented Feb 8, 2025

Hi , I am new to deepspeed and have some questions about how to use it.

I want to apply tensor parallel to a huggingface model, and want to apply row parallelism to one module while column parallelism to another module, how can I realize it with configuration?

My way to parallel:

        ds_inference_kwargs = {
            "dtype": torch_dtype,
            "tensor_parallel": {"tp_size": world_size},
        }
        model = deepspeed.init_inference(model, **ds_inference_kwargs)

Thanks

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant