-
Notifications
You must be signed in to change notification settings - Fork 53
Cannot query rows larger than 100MB limit. #2
Comments
Hey, we're also encountering this issue on the session view. Did you come up with a workaround for it? |
I would love for the Looker folks to raise this with the BQ folks. |
Hello, Would it be feasible to test the main session query in isolation but exclude the event_params and/or the entire event_packing cte (as well as their corresponding selected columns + join in the final selection) to see if the row limit is related to this portion? |
Thanks for the reply and insight. I had come to the same conclusion and had started to test a revised sessions view SQL to carve out just the session_list_with_event_history's sl.event_params and then use an additional join using sl_key. I was able to recreate, test and then temporarily resolve the error by simply removing the sl.event_params (commenting out) one line (line 133) in the sessions.view.lkml file, but I fear just moving the event parameters into its own subquery might still recreate a wide row scenario if a single user has too many events (or events with parameters of excessive length). Were you planning on leaving the events un-nested in one struct? What I will do next is try to isolate a specific user and examine the events and parameters themselves. Perhaps that will add insight. |
Interesting "fix" I found today which may seem a bit arbitrary, but it was based on my finding a half dozen visitors with > 5,000 events during a session. Your results may vary. :) The code update is simple: at the end of your sessions view SQL I added ONE LINE: ... Which removed 1 single visitor out of 800K who had 64,000+ events. Good Lord! Once added, the query ran fine without hitting any 100MB limit errors -- which included the 5 visitors with over 5,000 events. |
That is fantastic, and very helpful information. |
First off, your session-ized table (sessions.view) is amazing. Thank you for this work. I've been using that sessions.vew root SQL now to create daily GA4 tables for use outside of Looker.
The recent issue I have encountered isn't a problem with your code base -- but since you might have more influence than I do over the root cause -- the problem is with a noted BigQuery limitation. "Cannot query rows larger than 100MB limit". I'm experimenting with a reduction in the session_list_with_event_history to get around this but I suspect you and others will eventually bump into this constraint. If you have any other alternatives in the works I'd love to hear.
The text was updated successfully, but these errors were encountered: