
Port VLM functionalities from main branch to the refactor branch #769

Merged — 7 commits merged on Jul 30, 2024

Conversation

Contributor
@nv-hwoo nv-hwoo commented Jul 30, 2024

This PR ports VLM support from #756 and #759 into this dev branch.

Contributor
@dyastremsky dyastremsky left a comment

Great work, Hyunjae! This is critical work for the refactor.

I've started the review and am leaving a few initial comments. I'm heading to a meeting and will finish the review later this afternoon.

-def from_file(file_path: Path) -> List[Dict[str, str]]:
+def from_file(file_path: Path, output_format: OutputFormat) -> List[Dict[str, str]]:
     with open(file_path, "r") as file:
         data = [load_json_str(line) for line in file]
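For context, the complete ported method might look like the following sketch; the body beyond the quoted diff context (including the return) is an assumption, and `OutputFormat` and `load_json_str` are stand-ins for the repository's own definitions:

```python
import json
from enum import Enum, auto
from pathlib import Path
from typing import Dict, List


class OutputFormat(Enum):
    # Stand-in for the repository's OutputFormat enum.
    OPENAI_CHAT_COMPLETIONS = auto()
    OPENAI_VISION = auto()


def load_json_str(line: str) -> Dict[str, str]:
    # Stand-in for the repository's JSON-line parser.
    return json.loads(line)


def from_file(file_path: Path, output_format: OutputFormat) -> List[Dict[str, str]]:
    # Read one JSON object per line (JSONL) from the input file.
    # The output_format argument is what the review below objects to:
    # it threads formatting concerns into the retriever.
    with open(file_path, "r") as file:
        data = [load_json_str(line) for line in file]
    return data
```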

I think the Dataset Retriever should have no concept of output format. Otherwise, we're moving back towards where we currently are with everything being interdependent. We might want to discuss the best way to proceed here. Perhaps we can add a separate from_image_file method that can get an image from a file and then use it in the LLM Inputs file to add that image to each input. Or follow another approach.
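A minimal sketch of the suggested decoupling — the `from_image_file` name, signature, and data-URI encoding are assumptions for illustration, not the repository's actual API:

```python
import base64
from pathlib import Path


def from_image_file(file_path: Path) -> str:
    """Load an image from disk and return it as a base64 data URI.

    Keeps the Dataset Retriever free of any output-format logic:
    the caller (e.g. the LLM Inputs layer) decides how to attach
    the image to each input.
    """
    # Infer a media subtype from the file extension; default to png.
    suffix = file_path.suffix.lstrip(".").lower() or "png"
    encoded = base64.b64encode(file_path.read_bytes()).decode("utf-8")
    return f"data:image/{suffix};base64,{encoded}"
```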

Contributor Author

That's a valid point, and I agree. But I would rather focus on porting the VLM functionality in this PR, since there are already too many changes happening here. I can file a ticket for decoupling output format from this method. How does that sound?

Contributor

I need to think on it. I know that this port was already difficult. At the same time, deferring some parts is going to make other ports harder and make it more likely this gets missed. Introducing clutter to this class seems like a mistake unless we're going to immediately submit another PR to fix this. So then that becomes one more ticket/PR we need to do.

Contributor
@dyastremsky dyastremsky left a comment

Thanks for doing this work! I left some comments, but since this PR is large, we can file follow-on tickets to refactor these areas immediately afterward. It'd be better to get the port done first; then we can refactor out these repeated patterns and find a design that works.


@@ -101,14 +117,20 @@ def create_llm_inputs(
     prompt_tokens_mean,
     prompt_tokens_stddev,
     num_of_output_prompts,
+    image_width_mean,
Contributor

We can leave this for now... but same thing here. Maybe we need an image class. Adding all of these args replicates the issues we're trying to get rid of with this refactor. So this will need another refactor after this port.
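One possible shape for such an image class — a hypothetical dataclass grouping the image arguments into a single parameter; only `image_width_mean` appears in the quoted hunk, so the remaining fields are assumed by symmetry with the `prompt_tokens_*` pattern:

```python
from dataclasses import dataclass


@dataclass
class ImageConfig:
    """Hypothetical bundle of image-generation parameters.

    Passing one ImageConfig to create_llm_inputs would replace the
    growing list of image_* arguments that the refactor is trying
    to eliminate.
    """
    width_mean: int = 0
    width_stddev: int = 0   # assumed field
    height_mean: int = 0    # assumed field
    height_stddev: int = 0  # assumed field
```

The call site would then shrink to something like `create_llm_inputs(..., image_config=ImageConfig(width_mean=512))`, keeping the signature stable as image options grow.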

@nv-hwoo nv-hwoo merged commit 93b2f5a into feat-inputs-refactor Jul 30, 2024
5 checks passed
@nv-hwoo nv-hwoo deleted the hwoo-port-vlm branch July 30, 2024 23:26