add performance statistics for image generation #1405
base: master
Conversation
Could you please provide an example of such prints?
@xufang-lisa let's add a custom struct (raw metrics declared first so the aggregate struct can hold them by value):

struct OPENVINO_GENAI_EXPORTS RawImageGenerationPerfMetrics {
    std::vector<MicroSeconds> unet_inference_durations;        // UNet duration for each step
    std::vector<MicroSeconds> transformer_inference_durations; // transformer duration for each step
    std::vector<MicroSeconds> iteration_durations;             // duration of each step
};

struct OPENVINO_GENAI_EXPORTS ImageGenerationPerfMetrics {
    float load_time;                 // model load time (includes reshape & read_model time)
    float generate_duration;         // duration of the generate(...) method
    MeanStdPair iteration_duration;  // mean/std time of one generation iteration
    std::map<std::string, float> encoder_inference_duration; // inference durations for each encoder
    MeanStdPair unet_inference_duration;        // inference duration for the UNet model, filled with zeros if there is no UNet
    MeanStdPair transformer_inference_duration; // inference duration for the transformer model, filled with zeros if there is no transformer
    float vae_encoder_inference_duration; // inference duration of the vae_encoder model, filled with zeros if it is not used
    float vae_decoder_inference_duration; // inference duration of the vae_decoder model
    bool m_evaluated = false;
    RawImageGenerationPerfMetrics raw_metrics;
};

I'd also like to propose return
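For illustration, a minimal sketch of how a caller might consume such a struct. The `get_performance_metrics()` accessor name and the millisecond units are assumptions for the sketch, not something this thread fixes:

```cpp
// Hypothetical usage sketch; assumes the pipeline exposes a get_performance_metrics()
// accessor returning the ImageGenerationPerfMetrics proposed above.
ov::genai::Text2ImagePipeline pipe(models_path, device);
ov::Tensor image = pipe.generate(prompt);

auto metrics = pipe.get_performance_metrics();
std::cout << "Load time: " << metrics.load_time << " ms\n";                 // assumed to be ms
std::cout << "Generate duration: " << metrics.generate_duration << " ms\n"; // assumed to be ms
std::cout << "Iteration time: " << metrics.iteration_duration.mean << " +/- "
          << metrics.iteration_duration.std << " ms\n";
```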
Updated.
Could you please add a benchmark for image generation similar to the benchmark for GenAI LLMs, which can print all these detailed statistics? And provide an output of such a sample here. BTW, the original samples should be kept as is and should not print timings.
@ilya-lavrenov Do you mean to add support for image generation in benchmark_genai?
No, create a dedicated benchmark application for image generation, similar to the VLM / LLM benchmarks.
cxxopts::Options options("benchmark_image_generation", "Help command");

options.add_options()
    ("m,model", "Path to model and tokenizers base directory", cxxopts::value<std::string>()->default_value("."))
Please don't imply that the model is located in the current folder by default.
I would make this parameter required, as in the other benchmark applications:
openvino.genai/samples/cpp/text_generation/benchmark_genai.cpp (lines 10 to 11 in 4fb48de):

options.add_options()
    ("m,model", "Path to model and tokenizers base directory", cxxopts::value<std::string>())
std::cout << std::fixed << std::setprecision(2);
std::cout << "Load time: " << load_time << " ms" << std::endl;
std::cout << "One generate avg time: " << generate_mean << " ms" << std::endl;
std::cout << "Total inference for one generate avg time: " << inference_mean << " ms" << std::endl;
Can we print more information? E.g. how much time is taken by the text encoders, VAE encode / decode, and the first / other iterations of the main denoising loop.
Currently the printed information is not informative (it says nothing about what happens inside the pipeline) and could be obtained by external benchmarking around the generate method.
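A rough sketch of what such a detailed report could look like, assuming the ImageGenerationPerfMetrics fields proposed earlier in this thread and that MicroSeconds is a std::chrono duration; the first-vs-other iteration split is computed here from raw_metrics and is illustrative:

```cpp
// Illustrative detailed report; field names follow the struct proposed above.
std::cout << std::fixed << std::setprecision(2);
std::cout << "Load time: " << metrics.load_time << " ms\n";
std::cout << "Generate duration: " << metrics.generate_duration << " ms\n";
for (const auto& [encoder_name, duration_ms] : metrics.encoder_inference_duration)
    std::cout << encoder_name << " inference: " << duration_ms << " ms\n";
std::cout << "UNet per-step inference: " << metrics.unet_inference_duration.mean << " +/- "
          << metrics.unet_inference_duration.std << " ms\n";
std::cout << "VAE decoder inference: " << metrics.vae_decoder_inference_duration << " ms\n";

// First vs. remaining denoising iterations, taken from the raw per-step durations.
const auto& iters = metrics.raw_metrics.iteration_durations;
if (iters.size() > 1) {
    float rest_sum_us = 0.0f;
    for (size_t i = 1; i < iters.size(); ++i)
        rest_sum_us += iters[i].count();
    std::cout << "First iteration: " << iters.front().count() / 1000.0f << " ms\n";
    std::cout << "Other iterations (avg): " << rest_sum_us / (iters.size() - 1) / 1000.0f << " ms\n";
}
```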
## Benchmarking sample for text-to-image pipeline

This `benchmark_text2image.cpp` sample demonstrates how to benchmark the text-to-image pipeline. The sample includes functionality for warm-up iterations, image generation, and calculating various performance metrics.
Can we generalize this benchmark to support inpainting / image-to-image as well?
We could add an argument to specify the pipeline type, e.g. as sketched below.
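A sketch of the proposed argument; the option name, accepted values, and dispatch are assumptions:

```cpp
// Sketch: one benchmark binary, with the pipeline kind chosen via a CLI argument.
options.add_options()
    ("t,pipeline_type", "Pipeline type: text2image, image2image or inpainting",
     cxxopts::value<std::string>()->default_value("text2image"));

const std::string pipeline_type = result["pipeline_type"].as<std::string>();
if (pipeline_type == "text2image") {
    ov::genai::Text2ImagePipeline pipe(models_path, device);
    // warm-up + timed pipe.generate(prompt, ...) calls go here
} else if (pipeline_type == "image2image") {
    // ov::genai::Image2ImagePipeline additionally needs an initial image tensor
} else if (pipeline_type == "inpainting") {
    // ov::genai::InpaintingPipeline additionally needs an image and a mask
}
```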
m_decoder_request.infer();
infer_duration = ov::genai::PerfMetrics::get_microsec(std::chrono::steady_clock::now() - infer_start);
Can external code (within the image generation pipeline) measure the time of the decode function? With such an approach we don't need extra output arguments for this method.
The same approach applies to all other models within the image_generation/models folder.
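A sketch of the suggested approach, where the pipeline code times the call itself rather than the model returning a duration; variable names and the millisecond conversion are illustrative:

```cpp
// Sketch: the pipeline times the VAE decode call, so AutoencoderKL::decode()
// does not need an extra output argument for the duration.
const auto decode_start = std::chrono::steady_clock::now();
ov::Tensor image = m_vae->decode(latent);
const float decode_duration_us =
    ov::genai::PerfMetrics::get_microsec(std::chrono::steady_clock::now() - decode_start);
m_perf_metrics.vae_decoder_inference_duration = decode_duration_us / 1000.0f;  // stored as ms (assumed unit)
```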
private:
    std::shared_ptr<DiffusionPipeline> m_impl;
    ImageGenerationPerfMetrics m_perf_metrics;
This field should be hidden inside `std::shared_ptr<DiffusionPipeline> m_impl`.
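A sketch of what hiding the field inside the implementation could look like; the accessor name is an assumption:

```cpp
// Sketch: the public pipeline facade only delegates; the metrics live in the impl.
class Text2ImagePipeline {
public:
    ImageGenerationPerfMetrics get_performance_metrics() const {
        return m_impl->get_performance_metrics();
    }
private:
    std::shared_ptr<DiffusionPipeline> m_impl;
    // no ImageGenerationPerfMetrics member here
};
```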
@@ -477,6 +500,7 @@ class FluxPipeline : public DiffusionPipeline {
    std::shared_ptr<T5EncoderModel> m_t5_text_encoder = nullptr;
    std::shared_ptr<AutoencoderKL> m_vae = nullptr;
    ImageGenerationConfig m_custom_generation_config;
    ImageGenerationPerfMetrics m_perf_metrics;
Should it be moved to the base DiffusionPipeline class? All derived pipelines would then inherit this field.
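A sketch of the suggested layout, with the field on the base class; member placement is illustrative:

```cpp
// Sketch: keep the metrics on the base class so every derived pipeline reuses it.
class DiffusionPipeline {
    // ... existing virtual interface ...
protected:
    ImageGenerationPerfMetrics m_perf_metrics;
};

class FluxPipeline : public DiffusionPipeline {
    // no per-pipeline ImageGenerationPerfMetrics member needed;
    // FluxPipeline fills the inherited m_perf_metrics during generate()
};
```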
tickets: CVS-157338