Add the logic for Energy Star #261
base: main
Conversation
…hmark into energy_star_dev
trying another model
back to GPU
trying to manually define dir on cluster
nevermind
can you try fixing the energy tracker? it seems to be broken (or pin a codecarbon version)
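If pinning is the way to go, a minimal sketch of what that could look like in the project's setup.py; the version number below is only an illustrative placeholder, not a tested recommendation:

# setup.py (sketch) - the pinned version is a placeholder, not a verified fix
install_requires = [
    "codecarbon==2.3.4",
]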
# See https://github.com/huggingface/diffusers/issues/4649
if "xl" in self.config.model:
    self.pretrained_model.unet.config.addition_embed_type = None
What's the actual problem here? Reading the conversation, it seems users hit this issue when they use the wrong pipeline for their target model (a Stable Diffusion pipeline for an XL model), which is not something we want to hack around here.
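If the root cause really is a pipeline/model mismatch, a minimal sketch of the non-hacky alternative, assuming the auto-dispatching entry point is acceptable here (the checkpoint name is only an example):

from diffusers import DiffusionPipeline

# DiffusionPipeline.from_pretrained reads the checkpoint's own model_index.json
# and dispatches to the matching pipeline class (e.g. the XL pipeline for SDXL
# checkpoints), so no addition_embed_type patching is needed.
pipeline = DiffusionPipeline.from_pretrained("stabilityai/stable-diffusion-xl-base-1.0")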
def preprocess(
    dataset: Dataset,
    task: str,
    config: EnergyStarConfig,
    preprocessor: PretrainedProcessor,
    pretrained_config: PretrainedConfig,
) -> Dataset:
    task_to_preprocessing = {
        "feature-extraction": feature_extraction_preprocessing,
        "sentence-similarity": sentence_similarity_preprocessing,
        "text-classification": text_classification_preprocessing,
        "question-answering": question_answering_preprocessing,
        "text-generation": text_generation_preprocessing,
        "text2text-generation": text2text_generation_preprocessing,
        "summarization": summarization_preprocessing,
        "stable-diffusion": image_generation_preprocessing,
        "automatic-speech-recognition": automatic_speech_recognition_preprocessing,
        "image-to-text": image_to_text_preprocessing,
        "image-classification": image_preprocessing,
        "object-detection": image_preprocessing,
    }

-    return task_to_preprocessing[task](dataset, config, preprocessor)
+    return task_to_preprocessing[task](dataset, config, preprocessor, pretrained_config)
Maybe using the dict directly makes more sense? Why go through a function if it's only dispatching via a dict?
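A rough sketch of using the dict directly; the module-level constant and the call-site variables are assumptions for illustration:

# Module-level dispatch table, used directly at the call site.
TASK_TO_PREPROCESSING = {
    "text-generation": text_generation_preprocessing,
    "stable-diffusion": image_generation_preprocessing,
    # ... remaining tasks as listed in the diff above
}

# No wrapper function needed; look the task up where preprocessing happens:
dataset = TASK_TO_PREPROCESSING[task](dataset, config, preprocessor, pretrained_config)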
@@ -32,7 +58,7 @@ def tokenize_function(examples):
        examples[config.text_column_name],
        padding=padding,
        truncation=config.truncation,
-       max_length=config.max_length if config.max_length != -1 else None,
+       max_length=getattr(pretrained_config, "max_position_embeddings", 512),
Using model_shapes (the normalized dictionary of model variables) makes more sense here.
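A hedged sketch of what that could look like, assuming a model_shapes dict is available at this point (the key name and fallback value are assumptions):

# model_shapes is the normalized dictionary of model variables mentioned above;
# read the sequence limit from it instead of the raw pretrained_config.
max_length = model_shapes.get("max_position_embeddings", 512)  # fallback value is an assumption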
if getattr(tokenizer, "pad_token", None) is None:
    tokenizer.pad_token = tokenizer.eos_token

padding = False if config.input_shapes["batch_size"] == 1 else True
-padding = False if config.input_shapes["batch_size"] == 1 else True
+padding = config.input_shapes["batch_size"] != 1
if getattr(tokenizer, "pad_token", None) is None:
    tokenizer.pad_token = tokenizer.eos_token

padding = False if config.input_shapes["batch_size"] == 1 else True
same
# Add a pad token if the tokenizer doesn't have one
if getattr(tokenizer, "pad_token", None) is None:
    tokenizer.pad_token = tokenizer.eos_token
This is repeated multiple times; it makes more sense to do it once after tokenizer instantiation (in transformers_utils.py).
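A sketch of centralizing the fallback at tokenizer instantiation time; the helper name below is hypothetical and only illustrates where the check could live:

from transformers import AutoTokenizer

def load_tokenizer(model_name: str):
    # Hypothetical helper; in practice this would sit next to the existing
    # tokenizer-loading code in transformers_utils.py.
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    # Single, centralized pad-token fallback instead of repeating it per task.
    if getattr(tokenizer, "pad_token", None) is None:
        tokenizer.pad_token = tokenizer.eos_token
    return tokenizer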
if is_torch_distributed_available() and torch.distributed.is_initialized():
-   LOGGER.info("\t+ Distributing batch size across processes")
+   self.logger.info("\t+ Distributing batch size across processes")
    if self.config.input_shapes["batch_size"] % torch.distributed.get_world_size() != 0:
        raise ValueError(
            "The batch size must be divisible by the number of processes in a distributed environment"
        )
    self.config.input_shapes["batch_size"] //= torch.distributed.get_world_size()
This logic is no longer needed, as we do this in the backend now:
https://github.com/huggingface/optimum-benchmark/blob/main/optimum_benchmark/backends/pytorch/backend.py#L410-L440
It was introduced to implement TP vs DP logic (same input vs splitting inputs). You'll have to call these methods after the inputs have been generated and processed, because they might zero the batch_size for TP on non-main processes:
https://github.com/huggingface/optimum-benchmark/blob/main/optimum_benchmark/scenarios/inference/scenario.py#L86-L92
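A rough sketch of the call order described above; the function and method names are placeholders for illustration, not the backend's actual API:

# Placeholder names, illustrating only the ordering:
inputs = generate_inputs(self.config.input_shapes)  # 1. build the raw inputs
inputs = backend.prepare_inputs(inputs)             # 2. backend-specific processing
inputs = distribute_inputs(inputs)                  # 3. TP/DP handling; may zero
                                                     #    batch_size on non-main processes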
context_stack = ExitStack()
if self.config.energy:
    context_stack.enter_context(energy_tracker.track())

with context_stack:
    self.logger.info("\t+ Loading model for Inference")
    backend.load()
-context_stack = ExitStack()
-if self.config.energy:
-    context_stack.enter_context(energy_tracker.track())
-with context_stack:
-    self.logger.info("\t+ Loading model for Inference")
-    backend.load()
+with energy_tracker.track():
+    self.logger.info("\t+ Loading model for Inference")
+    backend.load()
Will an energy star benchmark ever need an energy argument in its config, since it will always run energy tracking?
for k in range(self.config.iterations):
    self.logger.info(f"\t+ Prefill iteration {k+1}/{self.config.iterations}")
    with self.energy_tracker.track(file_prefix="prefill"):
You might want to add an index to the file prefixes to keep the emissions CSV files from all iterations.
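A sketch of that suggestion applied to the loop shown above (only the file_prefix changes):

for k in range(self.config.iterations):
    self.logger.info(f"\t+ Prefill iteration {k+1}/{self.config.iterations}")
    # Indexed prefix so each iteration writes its own emissions CSV
    # instead of overwriting the previous one.
    with self.energy_tracker.track(file_prefix=f"prefill_{k}"):
        ...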
prefill_kwargs = {**self.config.generate_kwargs, **TEXT_GENERATION_PREFILL_OVERRIDES}

for k in range(self.config.iterations):
maybe we can call it repetitions since its implementation is different from that of iterations in the inference benchmark.
try:
    prefill_volume += inputs["input_ids"].size(dim=1) * self.config.input_shapes["batch_size"]
except KeyError:
    prefill_volume += 1
I don't understand this part
self.report.prefill.efficiency = Efficiency.from_energy(
    prefill_energy, prefill_volume, unit=TEXT_GENERATION_EFFICIENCY_UNIT
)
self.report.prefill.measures = prefill_measures
Kinda ambiguous, and I'm not sure how the benchmark report behaves with a list of values. Wouldn't it be better to have an Energy class that can take multiple measures, or even an EnergyStar class that contains the whole thing?
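A hedged sketch of the kind of container that could replace the bare list; the field names and units are assumptions for illustration, not the tracker's actual data model:

from dataclasses import dataclass, field
from typing import List

@dataclass
class EnergyMeasure:
    # One tracked measurement; unit (kWh) chosen for illustration.
    cpu: float = 0.0
    gpu: float = 0.0
    ram: float = 0.0

@dataclass
class EnergyStarMeasures:
    # Holds every iteration's measurement instead of a raw list on the report.
    measures: List[EnergyMeasure] = field(default_factory=list)

    @property
    def total(self) -> float:
        return sum(m.cpu + m.gpu + m.ram for m in self.measures)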
Very cool, would love to see this finally merged and continuously tested. I think it's been out of sync for so long specifically because we don't have tests for it. Tell me if you need help 🤗
No description provided.