Specifying batch size on llama2-70B cm automation #190

Open

rajesh-s opened this issue Aug 27, 2024 · 4 comments

rajesh-s commented Aug 27, 2024

I could not find information, either in the documentation or in the CM scripts, about the batch size used to report the results in the MLCommons database.

  1. The default batch size in the implementation seems to be 1. Is the CM automation specifying a different value?
  2. What knobs does the user have to view the configuration used in a particular submission, to ensure alignment while profiling new systems?
  3. If I use the automation scripts as indicated on the documentation page, on the same hardware used in the submissions, should I see nearly the same performance?

arjunsuresh (Contributor) commented

@rajesh-s most of the inference submissions are done using the Nvidia implementation. In CM we have tried to match the typical batch sizes used in the Nvidia submissions, but we haven't tested all of the systems. In the CM run command you can pass `--batch_size=` to use a custom batch size with the Nvidia implementation.
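For reference, a minimal sketch of such a run command (only the `--batch_size=` flag is the one discussed here; the remaining flags follow the usual CM MLPerf invocation pattern and may need adjusting for your CM version and setup):

```bash
# Sketch: run the llama2-70b benchmark through the Nvidia implementation
# with an explicit batch size. Only --batch_size= comes from this thread;
# the other flags are typical CM MLPerf options and may differ by version.
cm run script --tags=run-mlperf,inference \
    --model=llama2-70b-99 \
    --implementation=nvidia \
    --device=cuda \
    --scenario=Offline \
    --execution_mode=test \
    --batch_size=8 \
    --quiet
```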

For the reference implementation, I'm not sure whether different batch sizes work, as many things are hardwired and no one has made a submission using it.

rajesh-s (Author) commented

It would help if the batch sizes were listed, at least for the submissions; I could not find them in the results.

The CM run command seems to default to a batch size of 1, as I indicated above, which would be good to note in the documentation. The results vary greatly with batch size, so it may be imperative to document it.

anandhu-eng (Contributor) commented

Hi @rajesh-s, sorry for the late reply. I have noted the required addition.

@arjunsuresh, would it be apt to include this in a collapsible section, or should we present it as a tip, since there is a chance users will ignore the collapsible option?

anandhu-eng (Contributor) commented

Hi @rajesh-s, we have added the changes in our fork, but they are yet to be merged into the official MLCommons inference repo. You can find the changes here.
