-
Notifications
You must be signed in to change notification settings - Fork 57
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Slow event-wise scores evaluation #17
Comments
ping @jorisvandenbossche |
To give more details, it's running |
Yes, we recently had an issue about that on the pandas issue tracker (it might be fixed in the latest pandas release).
I don't fully recall, but it might be we can simply cast to ints there? (I mean, the fact that it is float is maybe just because it was concatenated into a 2D array with actual floats, but those dates are originally ints?) |
Yes, they are ints: Line 87 in 0156adc
So doing a |
See #18 |
Thanks @jorisvandenbossche ! Looks great! |
Let's keep this open for now, even if the to_datatime is a major improvement -- I'll try see if caching some of calculations in event-wise scores could improve performance more. |
Computing the event-wise scores evaluation appears to be very time consuming, mostly due to the repeated calls to
pd.to_dataframe
within theEventwise*
scores.At least for
ramp_test_submission --quick-test
on the starting kit this accounts for most of the runtime,Originally the starting kit with the
--quick-test
option takes 63s on my laptop, with event-wise scores disabled this reduces to 9s.Looking for a way to speeding it up, but generally, such long run time (on a dataset that is not that big) makes iterations slower which is problematic when running events.
The text was updated successfully, but these errors were encountered: