Set up data for ui - WIP #87
base: main
Conversation
```python
def output_token_throughput_distribution(self) -> Distribution:
    """
    Get the distribution for output token throughput.

    :return: The distribution of output token throughput.
    :rtype: Distribution
    """
    throughputs = []
    for r in self.results:
        duration = (r.end_time or 0) - (r.start_time or 0)
        if duration > 0:
            throughputs.append(r.output_token_count / duration)

    return Distribution(data=throughputs)
```
The UI relies on the output throughput distribution, and I didn't find any existing methods/properties in the tokens-per-unit-of-time shape the UI expects, so I added this.
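To make the calculation above concrete, here is a self-contained sketch of the per-request throughput logic, using hypothetical stand-ins (`FakeResult`, `output_token_throughputs`) for guidellm's actual result and `Distribution` types:

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class FakeResult:
    # Hypothetical stand-in for a guidellm text-generation result.
    start_time: Optional[float]
    end_time: Optional[float]
    output_token_count: int


def output_token_throughputs(results: List[FakeResult]) -> List[float]:
    """Compute tokens/second for each request with valid timing data."""
    throughputs = []
    for r in results:
        duration = (r.end_time or 0) - (r.start_time or 0)
        if duration > 0:  # skip requests with missing or zero-length timing
            throughputs.append(r.output_token_count / duration)
    return throughputs


results = [
    FakeResult(start_time=0.0, end_time=2.0, output_token_count=100),  # 50 tok/s
    FakeResult(start_time=1.0, end_time=1.0, output_token_count=40),   # skipped
]
print(output_token_throughputs(results))  # [50.0]
```

Note that requests with missing timestamps are silently dropped rather than contributing zero, which keeps the distribution from being skewed by incomplete results.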
src/guidellm/main.py
```python
generate_ui_api_data(report)
```
This is just so I can run it easily and inspect the generated JSON.
```python
bucket_width = dist.range / n_buckets
bucket_counts = [0] * n_buckets

for val in dist.data:
    idx = int((val - minv) // bucket_width)
    if idx == n_buckets:
        idx = n_buckets - 1
    bucket_counts[idx] += 1

buckets = []
for i, count in enumerate(bucket_counts):
    bucket_start = minv + i * bucket_width
    buckets.append({
        "value": bucket_start,
        "count": count
    })
```
I am not sure this is the proper way to generate these buckets, or whether there's existing code elsewhere in guidellm that could handle it and I missed it.
This code assumes a fixed number of buckets and derives the bucket width from that. It's a hard-coded approach, and some data analysis first might suggest a better bucket count or bucket size. But generally I figured the UI would look best with a fixed number of buckets, so our histograms conveniently look the same and take up a comfortable amount of space.
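The diff above depends on `dist`, `minv`, and `n_buckets` from surrounding code; here is a self-contained sketch of the same fixed-bucket-count approach (the `make_buckets` wrapper is mine, not guidellm's):

```python
def make_buckets(data, n_buckets=5):
    """Split data into n_buckets equal-width histogram buckets.

    Returns a list of {"value": bucket_start, "count": n} dicts,
    matching the shape built in the diff above.
    """
    minv, maxv = min(data), max(data)
    bucket_width = (maxv - minv) / n_buckets
    bucket_counts = [0] * n_buckets

    for val in data:
        idx = int((val - minv) // bucket_width)
        if idx == n_buckets:  # the max value lands one past the last bucket
            idx = n_buckets - 1
        bucket_counts[idx] += 1

    return [
        {"value": minv + i * bucket_width, "count": count}
        for i, count in enumerate(bucket_counts)
    ]


print(make_buckets([1, 2, 3, 4, 5, 5], n_buckets=2))
# [{'value': 1.0, 'count': 2}, {'value': 3.0, 'count': 4}]
```

One caveat with this shape: if all values are equal, `bucket_width` is zero and the index computation divides by zero, so a guard for degenerate distributions (or delegating to something like `numpy.histogram`, which handles this) may be worth considering.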
```python
with open("ben_test/run_info.json", "w") as f:
    json.dump(run_info_json, f, indent=2)
with open("ben_test/workload_details.json", "w") as f:
    json.dump(workload_details_json, f, indent=2)
with open("ben_test/benchmarks.json", "w") as f:
    json.dump(benchmarks_json, f, indent=2)
```
This is just for testing purposes, to view the generated JSON.
Here's what the data looks like currently. I converted the .js files to .txt so I could attach them here.
…f request over time data and use raw, refactor and test interpolation functionality
…ector and output html report
This is a rough first pass at my understanding of how guidellm data can populate the UI. It isn't tied into the HTML injection functionality that will serve the UI (for now I'm just generating JSON that I manually drop into the UI), nor is it set up as the basis of the API that will eventually serve the guidellm UI, which is where this logic would likely live in the future. The calculations, however, are meant to be 100% correct and accurate.
I'll add more tests when I finish the frontend work, to help walk through what this is trying to achieve. For now, attention is most needed on the calculations for benchmark metrics, prompt/output token metrics, requests over time, etc., which are used to build the histograms and line charts in the UI.
I've attached the data generated from this code in the files below. The model I've used so far (microsoft/DialoGPT-small) isn't a great chat model as far as I know, so it probably isn't ideal data, but it performs a little better running on my Mac due to its small size, which I think will make for more realistic-looking plots on the charts. I'll need a realistic setup to produce better data soon.
I'm not trying to get this merged urgently; there's a bit more UI work to do before this is useful. But I'll be pestering people for more reviews soon.