-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Backport for videocore 4 #13
Comments
Thank you for your interest! Yes, the ISA is different and we cannot directly port VC6 code to VC4. |
Ideally, we would need a full blas suite for VC4, as we use some non ML workloads as well for our research. I'm currently interested in getting it running on the pi zero, since that is the cheapest raspberry pi and i've been working on a performance/cost analysis for various embedded systems. Does you Company plan on open sourcing its ML library at all? Our lab, SOCAL would be interested in collaborating. |
There are so many, and some of them (e.g. stride, diagonal, or sparse matrices) are not feasible for VC4 because VC4 QPU writes to memory in the unit of 64 bytes. I heard we're not planning, but, if you're interested, could you tell us the details of your research through https://idein.jp/en/contact/ ? |
Yeah, i know :( Also, I've sent an email through your contacts, so we can discuss further through there if you like. |
Thank you for telling us the details of your projects. Instead, can your project be done by using Actcast? For CPU, yes, you can use PyTorch or TensorFlow Lite for ML, and OpenBLAS is the fastest among the BLAS libraries IIRC. |
I think we could use Actcast, at least for the benchmarking portion of our project. Does your MobileNet work on the Pi Zero? I'll start to play around with it. Regarding the BLAS for VC4, If its not too much to ask, would it be possible for you to develop axpy, gemm, and gemv as an example? Then I could take over and develop others that we might need. I am familiar with GPU programming, but not how the QPU driver works, so an example would help a lot. |
Yes, the app runs on all models of Raspberry Pis. I see, QMKL, the BLAS for VC4, already includes gemm, so I'm going to develop axpy and gemv. |
Hello,
I am a Ph.D. Student at UC RIverside doing research in ML on edge devices. I noticed that your qmkl6 has a full blas library while qmkl for the videocore 6 only has sgemm. Do you plan on making a backport to support videocore 4 devices. If not, what would it take to do such a thing? I know that the ISA is different, but would it be possible to make a compatible version? I am very much interested in your work and would love to learn more!
Thanks,
Marcus
The text was updated successfully, but these errors were encountered: