Change default batch_size for finetuning to max_batch_size for a model #189

artek0chumak · 2024-09-23T14:33:14Z

Issue #ENG-10437

I added logic to request the hyperparameter limits for the requested model. If a user does not specify hyperparameters, together-python will set them to the recommended values.
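A minimal sketch of the behaviour described above, assuming the limits arrive as a small object with a minimum and maximum batch size. The names `TrainingLimits`, `min_batch_size` and `resolve_batch_size` are hypothetical and only illustrate the idea; they are not taken from together-python.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Literal


@dataclass
class TrainingLimits:
    # Hypothetical shape of the per-model limits; the real object may differ.
    min_batch_size: int
    max_batch_size: int


def resolve_batch_size(
    batch_size: int | Literal["max"], limits: TrainingLimits
) -> int:
    """Turn the "max" placeholder into the model's limit; validate explicit values."""
    if batch_size == "max":
        # The user did not pick a value, so fall back to the model's maximum.
        return limits.max_batch_size
    if not limits.min_batch_size <= batch_size <= limits.max_batch_size:
        raise ValueError(
            f"batch_size must be between {limits.min_batch_size} "
            f"and {limits.max_batch_size}, got {batch_size}."
        )
    return batch_size
```

With limits of 8 to 96, `resolve_batch_size("max", ...)` yields 96, an explicit 16 is passed through unchanged, and 128 is rejected with a `ValueError`.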

mryab · 2024-09-24T12:13:22Z

src/together/resources/finetune.py

@@ -55,7 +58,7 @@ def create(
            n_evals (int, optional): Number of evaluation loops to run. Defaults to 0.
            n_checkpoints (int, optional): Number of checkpoints to save during fine-tuning.
                Defaults to 1.
-            batch_size (int, optional): Batch size for fine-tuning. Defaults to 32.
+            batch_size (int, optional): Batch size for fine-tuning. Defaults to auto.


Suggested change

-            batch_size (int, optional): Batch size for fine-tuning. Defaults to auto.
+            batch_size (int, optional): Batch size for fine-tuning. Defaults to max.

mryab · 2024-09-24T12:16:46Z

src/together/resources/finetune.py

+        else:
+            if model_limits.full_training is None:
+                raise ValueError(
+                    "Full training is not supported for the selected model."
+                )


It looks like you have duplicated validation logic here and in cli/finetune.py. Maybe it's best to extract it to a function, call it in cli/finetune.py and reraise the exception as click.BadParameter if necessary?

artek0chumak · 2024-09-24

This check is mostly for mypy: without it, mypy reports an error in the following lines because full_training may be None.
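One way the two points in this thread could fit together is sketched below: a single shared helper performs the None check (which also narrows the type for mypy), and the CLI layer re-raises its `ValueError` as a parameter error. Everything except the `full_training` attribute and the "Full training" error message is an assumption: `ModelLimits`, `TrainingTypeLimits`, `lora_training`, `select_limits` and `cli_select_limits` are hypothetical names, and `BadParameter` is a local stand-in for `click.BadParameter` so the sketch runs without click installed.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Optional


@dataclass
class TrainingTypeLimits:
    # Hypothetical per-training-type limits.
    max_batch_size: int


@dataclass
class ModelLimits:
    # Either field may be None when the model does not support that mode.
    full_training: Optional[TrainingTypeLimits] = None
    lora_training: Optional[TrainingTypeLimits] = None  # hypothetical field


def select_limits(model_limits: ModelLimits, lora: bool) -> TrainingTypeLimits:
    """Shared validation: return the limits for the chosen mode or raise."""
    limits = model_limits.lora_training if lora else model_limits.full_training
    if limits is None:
        kind = "LoRA" if lora else "Full"
        raise ValueError(f"{kind} training is not supported for the selected model.")
    # After the check above, mypy treats `limits` as non-Optional.
    return limits


class BadParameter(Exception):
    """Stand-in for click.BadParameter (assumption: the CLI would use the real one)."""


def cli_select_limits(model_limits: ModelLimits, lora: bool) -> TrainingTypeLimits:
    """CLI wrapper: reuse the shared check and convert the error type."""
    try:
        return select_limits(model_limits, lora)
    except ValueError as exc:
        raise BadParameter(str(exc)) from exc
```

Because the helper returns a non-Optional value, callers can read `max_batch_size` directly without repeating the None check in both the library and the CLI.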

artek0chumak requested review from azahed98 and thepowerfuldeez September 23, 2024 14:49

thepowerfuldeez approved these changes Sep 23, 2024

azahed98 approved these changes Sep 23, 2024

mryab reviewed Sep 24, 2024

artek0chumak added 7 commits September 25, 2024 15:32

add code

83cfb65

add auto

c284dc8

style

b302700

fix handlers

727602a

auto to max

46ff5be

renaming

7d8dbc7

add warning message

e3b2cea

artek0chumak force-pushed the artem/auto-batch-size branch from b4595a0 to e3b2cea September 25, 2024 13:32

artek0chumak added 2 commits September 25, 2024 15:37

some fixes

cf9f2c4

new warning msg

2d7615e

orangetin merged commit f13c7a1 into main Sep 25, 2024
12 of 13 checks passed

orangetin deleted the artem/auto-batch-size branch September 25, 2024 20:58
