Are there any plans of publishing this library in Pub.dev? #5

dhombios · 2022-06-22T19:30:32Z

Most speech recognition libraries available in pub.dev can't process audio files as they just use the speech recognition capabilities provided by android or ios. Therefore, this project seems quite interesting as it allows more advanced tuning and a wider range of applications. However, being just available in github makes it harder to find as most flutter packages are provided through pub.dev

manuindersekhon · 2022-07-01T14:10:50Z

Hi @dhombios, Right now, I have included the very limited set of functions for deepspeech library. I am not working further on it right now, but if more functionality is added from the lib, then I think it will make more sense to release it as a package.

dhombios · 2022-07-01T21:18:02Z

What other parts of the deepspeech library do you consider that this port should include in order to be published? I don’t have a lot of spare time, but I might be able to work on it from time to time

manuindersekhon · 2022-07-02T09:11:13Z

There are two major features left which I think we should include before releasing the library.
Hot word feature and Decode audio from stream

Currently, we give it the whole audio buffer and it decodes in one-shot only. I think realtime decoding from audio stream should be there for a speech recognition library. Though I am not sure how feasible this will work in flutter application, and how I am gonna test them in example app.
Another approach would be to release the library as alpha right now, and work on the above features later. What do you say?

dhombios · 2022-07-02T15:20:56Z

I think that being able to decode an audio stream at least in desktop environments is an important feature, as there are other flutter libraries that allow doing that using the native API of the target platform but they just support mobile and web targets. Therefore, I think that it is better to wait until it is implemented to publish it

Regarding the hot word feature, I wouldn't expect it to be quite problematic to port

I might be able to start working on this next week, but as I'm still getting familiar with flutter (I come from the embedded world) I think I'll start with the hot word feature, which seems easier

dhombios · 2022-07-11T17:54:41Z

I've been reviewing the header files of the deepspeech library, but those features are not exposed in them. Did you make those headers or were they provided by mozilla? (In case they were provided by mozilla I understand that additional headers are needed)

manuindersekhon · 2022-07-13T15:06:02Z

Yes. I had created these header files myself. Main idea of this project was to create the tutorial on how to integrate libs with flutter applications using dart ffi. If we are exposing all functions, then we can directly use the header file of mozilla to skip duplication, and remove my header file.
You can also explore ffigen package, which auto-generate ffi bindings from C header file. Then only flutter integration part will be left. It will create all the necessary structs and pointer conversions.

dhombios mentioned this issue Jul 26, 2022

Complete deepspech api #6

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Are there any plans of publishing this library in Pub.dev? #5

Are there any plans of publishing this library in Pub.dev? #5

dhombios commented Jun 22, 2022

manuindersekhon commented Jul 1, 2022

dhombios commented Jul 1, 2022

manuindersekhon commented Jul 2, 2022

dhombios commented Jul 2, 2022

dhombios commented Jul 11, 2022

manuindersekhon commented Jul 13, 2022 •

edited

Loading

Are there any plans of publishing this library in Pub.dev? #5

Are there any plans of publishing this library in Pub.dev? #5

Comments

dhombios commented Jun 22, 2022

manuindersekhon commented Jul 1, 2022

dhombios commented Jul 1, 2022

manuindersekhon commented Jul 2, 2022

dhombios commented Jul 2, 2022

dhombios commented Jul 11, 2022

manuindersekhon commented Jul 13, 2022 • edited Loading

manuindersekhon commented Jul 13, 2022 •

edited

Loading