Image Annotation Moderation and FLUX Model Integration #65

benliang99 · 2024-09-23T10:38:14Z

This PR upgrades our synthetic image pipeline with a new state-of-the-art (SOTA) model and various quality of life enhancements. It introduces Black Forest Labs' advanced FLUX.1-dev model, upgrades our BLIP-2 annotation model from 2.7B to 6.7B parameters, and incorporates Unsloth's Meta-Llama-3.1-8B-Instruct-bnb-4bit for image moderation. Further refinements include optimized parameterization for both CPU and GPU operations, streamlined .env setup for automatic login credentials management, and verification of model loading before a validation starts.

Key Changes

Model Upgrades and Enhancements:

Model Integration: Added Black Forest Labs' FLUX.1-dev model for SOTA synthetic image generation capabilities.
Annotation Model Upgrade: Transitioned from Salesforce's "blip2-opt-2.7b-coco" to "blip2-opt-6.7b-coco" to enhance image annotation detail and precision.
Text Moderation: Implemented Unsloth's "Meta-Llama-3.1-8B-Instruct-bnb-4bit" for adding coherence and removing bias and redundancy in image annotations.
Diffusion Models: Added GPU-specific loading for better parallel processing and added stress tests for evaluating diffusion models generation latency.
Configuration Enhancements: Improved parameterization for CPU offload and diffusion settings and specified torch data types for computational efficiency.

Infrastructure and Configuration:

Configuration Management: Updated min_compute.yml to detail new GPU requirements like tflops.
Validator Setup: Upgraded setup_validator_env.sh and start_validator.sh to automate Weights * Biases API key and Hugging Face token user login.
Model Verification: Implemented a verification script for initial model download and load testing, now part of the autoupdate_validator_steps.sh sequence and executed prior to start_validator.sh. This modification eliminates the initial download delay when models are loaded for the first time.

Code Quality and Maintenance:

Conducted general code cleanup (unnecessary imports, spacing).

…age dataset annotations.

…ion.

…General cleanup for PR.

bitmind/constants.py

bitmind/synthetic_image_generation/image_annotation_generator.py

bitmind/synthetic_image_generation/notebooks/gs1_nis100_1024_1024.ipynb

bitmind/synthetic_image_generation/synthetic_image_generator.py

bitmind/synthetic_image_generation/utils/stress_test.py

autoupdate_validator_steps.sh

bitmind/constants.py

benliang99 · 2024-09-23T10:45:30Z

bitmind/constants.py

@@ -39,19 +41,36 @@
        {
            "path": "stabilityai/stable-diffusion-xl-base-1.0",
            "use_safetensors": True,
+            "torch_dtype": torch.float16,


torch_dtype is no longer hardcoded, so it must be specified in the pipeline / DIFFUSER_ARGS.

benliang99 · 2024-09-23T10:46:37Z

bitmind/constants.py

+            "path": 'black-forest-labs/FLUX.1-dev',
+            "use_safetensors": True,
+            "torch_dtype": torch.bfloat16,
+            "generate_args": {


Arguments decided based on experimental generation latencies.

benliang99 · 2024-09-23T10:47:55Z

bitmind/constants.py

+    } for m in VALIDATOR_MODEL_META['diffusers']
+}
+
+GENERATE_ARGS = {


New field for generation arguments, which FLUX-1.dev can utilize.

benliang99 · 2024-09-23T10:52:04Z

bitmind/synthetic_image_generation/image_annotation_generator.py

@@ -80,11 +140,17 @@ def generate_description(self,

        if not verbose:
            transformers_logging.set_verbosity_info()
-
+
+        if description.startswith(prompts[0]):


Check for the moderation model repeating the prompt in its response.

benliang99 · 2024-09-23T10:54:02Z

bitmind/synthetic_image_generation/synthetic_image_generator.py

@@ -122,9 +136,13 @@ def clear_gpu(self):
            torch.cuda.empty_cache()
            self.diffuser = None

-    def load_diffuser(self, diffuser_name) -> None:
+    def load_diffuser(self, diffuser_name, gpu_id=None) -> None:


Added GPU parameterization for image generation parallelization. Defaults to cuda if not specified (previous behavior).

start_validator.sh

bitmind/synthetic_image_generation/utils/stress_test.py

bitmind/synthetic_image_generation/notebooks/gs1_nis100_1024_1024.ipynb

… logging cleanup.

…eration

dylanuys and others added 30 commits August 12, 2024 03:24

adding blackforest labs models

2f6160e

temporarily disabling real image challenges to maximize data production

2adec09

adding cpu offload

c81a4a7

adding arg for flux generation

b3b4f60

calling flux generation with new args in constants.py

aaaedd5

pulling in latest from main

8af1cc6

constants update for flux mirror gen

e5e9f60

adding cache dir to model loads

fa19189

temp notebooks for flux tests

4bec465

typo

51d39f8

updating guidance_scale

31ccd59

Added Meta's Llama-3.1-8B-Instruct model for annotation moderation.

02dcea8

Upgraded BLIP2 model to 6.7B param version

f6de46e

Replaced Llama-3.1-8B-Instruct model with Unsloth's ungated version.

ddc06ad

Updated requirements with Unsloth model dependency.

e128200

Clear image generation annotation models from GPU after generating im…

cf16da4

…age dataset annotations.

Fixed usage of generation args, updated image generation helper funct…

bdb3693

…ion.

Added gpu-specific diffusion model loading for parallelization

d6199a9

Fixed docstring indent

cd5b897

Added gpu specification for synthetic image generation

aed5346

Added dataset gen GPU delegation shell script and SDXL to constants

160dd66

Removed HF token

2f7fab5

Fixed specification of torch.dtype for SDXL

7ea024d

Reformatting empty lines, added stress test script.

56b2cec

Added temporary stress testing prints

46b2cdd

Added all diffusion models and updated generate args

f7ed8f3

Added float16 tensor tflop specs metric

585f42f

Set default GPU for diffusion models to cuda if no gpu_id specified. …

3e5128f

…General cleanup for PR.

Merged with testnet

46fcaef

Reverted merge

c1e2ac9

Fixed typo.

8c51b22

benliang99 requested review from aliang322, dylanuys and kenobijon and removed request for kenobijon September 23, 2024 10:38