
Feature: Enable Intel GPU/XPU finetuning and inference #116

Merged: 20 commits into meta-llama:main on Jan 11, 2024

Conversation

abhilash1910
Contributor

What does this PR do?

Motivation: Thanks for creating this repository. At Intel, we use this repository for finetuning and inference on our GPUs/XPUs (IPEX backend), hence this PR.
This change lets us run Llama v2 without issues on our GPU devices, since accelerate/peft already support XPUs.
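For context, a minimal sketch of the kind of device fallback this enables is below; `get_device` is a hypothetical helper for illustration, not code from this PR:

```python
import torch

def get_device() -> torch.device:
    """Hypothetical helper: prefer CUDA, then Intel XPU, then CPU."""
    if torch.cuda.is_available():
        return torch.device("cuda")
    # torch.xpu is registered by intel_extension_for_pytorch (IPEX)
    # and is built into recent PyTorch releases.
    if hasattr(torch, "xpu") and torch.xpu.is_available():
        return torch.device("xpu")
    return torch.device("cpu")

model_device = get_device()
```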

Fixes # (issue): N/A

Feature/Issue validation/testing

The finetuning and inference features were tested on our Data Center GPUs with this change.

For CLA purposes, this contribution is on behalf of Intel.

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a Github issue? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

Thanks for contributing 🎉!

@HamidShojanazeri
Contributor

@abhilash1910 thanks for the PR, very interesting. We would love to learn more about how we can test Intel GPUs/XPUs. I sent you a message on LD.

@chauhang
Contributor

@abhilash1910 Thanks for submitting this PR. Please attach the test logs to verify that things work on Intel GPUs with these changes; we don't have a mechanism to test this ourselves.

Can you also please check whether the torch.set_default_device API can be used to simplify the device-setting logic for the XPU devices?
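As a side note, a minimal sketch of what that simplification could look like is below; whether the "xpu" device string is accepted depends on the XPU backend being registered, which is an assumption here:

```python
import torch

# Sketch: with a default device set, factory functions and newly-constructed
# module parameters land on that device without an explicit device= argument
# at every call site.
device = "xpu" if hasattr(torch, "xpu") and torch.xpu.is_available() else "cpu"
torch.set_default_device(device)  # available since PyTorch 2.0

x = torch.zeros(4, 4)  # allocated on the default device
print(x.device)        # e.g. "xpu:0" when running on an Intel GPU
```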

@abhilash1910
Contributor Author

@chauhang Sure, I will attach the logs. Also, as discussed with @HamidShojanazeri, there is a plan to provide device support for integration testing of future PRs from the Intel side. I am closing on it internally this week and would eventually like to discuss next steps.

Two review threads on utils/train_utils.py (outdated, resolved)
@abhilash1910
Contributor Author

@chauhang @HamidShojanazeri Logs for distributed finetuning on 8 ranks:
7B_8ranks_intel_xpu.log
Could you also share a tentative time for a discussion this week or next on possible next steps?

@HamidShojanazeri
Contributor

Thanks very much @abhilash1910. I am wondering if there is a way to add support for Intel GPU memory stats as well, the way we get them for NVIDIA.
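A minimal sketch of device-agnostic memory stats, assuming the torch.xpu.* memory APIs mirror their torch.cuda.* counterparts (as IPEX provides); `memory_stats_gb` is a hypothetical helper, not the PR's actual code:

```python
import torch

def memory_stats_gb(device_type: str) -> dict:
    """Hypothetical helper: allocated/peak device memory in GB.

    Assumes torch.xpu.* memory APIs mirror torch.cuda.* (as provided by
    intel_extension_for_pytorch and recent PyTorch builds).
    """
    backend = torch.cuda if device_type == "cuda" else torch.xpu
    gib = 1024 ** 3
    return {
        "allocated_gb": backend.memory_allocated() / gib,
        "peak_gb": backend.max_memory_allocated() / gib,
    }
```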

@abhilash1910
Contributor Author

abhilash1910 commented Sep 7, 2023

@HamidShojanazeri Yes, the memory stats are also added; they can be found in the new log here:
7B_8ranks_intel_xpu.log
Also requesting a re-review. Thanks for the suggestions.

@HamidShojanazeri merged commit dbfea48 into meta-llama:main on Jan 11, 2024
1 check passed