Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

updates on benchmark #3

Open
johnnychen94 opened this issue May 12, 2019 · 12 comments
Open

updates on benchmark #3

johnnychen94 opened this issue May 12, 2019 · 12 comments

Comments

@johnnychen94
Copy link

johnnychen94 commented May 12, 2019

It's a lovely benchmark, thanks for the information!

I observed that this benchmark is 7-months old. Perhaps it's quite outdated since both Pytorch and Flux have upgraded a lot.

It would be great if you could run this benchmark again. Though it might not so fair to Flux until Tracker is replaced by Zygote.

Best

@avik-pal
Copy link
Owner

I have been meaning to update these benchmarks, but it would be ideal for Flux to have migrated to Zygote before running these. That being said I can definitely run it on one of the development branches for comparison sake and post those results.

@EMCP
Copy link

EMCP commented May 16, 2019

can you include memory usage somehow? I want to see how good or bad, on memory usage, switching to flux would be on the GPU

im wondering if it will be worth porting this project https://github.com/jwyang/faster-rcnn.pytorch to Flux, in order to fit on my crappy GPU

@avik-pal
Copy link
Owner

I am not very sure how I can get the memory usage of GPU automatically. But if you have any pointers on how to do that, feel free to submit a PR.

@EMCP
Copy link

EMCP commented May 17, 2019

I generally just do a reading at the beginning of training.. and do a

$ watch nvidia-smi

after 30 seconds or so, the memory has stabilized to some measurement and then I am sure that it will not crash due to running out of headroom.. at least this is what I do in pyTorch

I can look around for automated solutions to this or maybe make a simple one that just parses nvidia-smi

@tbenst
Copy link

tbenst commented May 17, 2019

@avik-pal @EMCP nvidia-settings -q useddedicatedgpumemory might be easier

@staticfloat
Copy link

Here's another vote for ResNet timings with Zygote. :D I really hope there's been progress!

@avik-pal
Copy link
Owner

I am just waiting for FluxML/Zygote.jl#198 to be fixed. I have the scripts ready for pytorch 1.0 and flux vgg models.

@EMCP
Copy link

EMCP commented Jul 17, 2019

@tbenst , not sure what I am needing to do , but executing this command you sent was not working..

(base) e@e:~$ nvidia-settings -q useddedicatedgpumemory
Unable to init server: Could not connect: Connection refused
ERROR: The control display is undefined; please run `nvidia-settings --help` for usage information.
(base) e@e:~$ 

@tbenst
Copy link

tbenst commented Jul 18, 2019

@EMCP hm, looks like it needs X server to be running so not a solution for headless, sorry

@DhairyaLGandhi
Copy link

Good to get an update on this. Also, trying out resnet with https://github.com/dhairyagandhi96/Torch.jl might be useful

@avik-pal
Copy link
Owner

Interesting. I will give these a shot on the weekend.

@avik-pal
Copy link
Owner

@dhairyagandhi96 I put together a quick benchmark suite for the layers in the update branch. The bson files contain the timings.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants