Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Text classification to .NET #107

Open
silvacaio opened this issue May 19, 2017 · 5 comments
Open

Text classification to .NET #107

silvacaio opened this issue May 19, 2017 · 5 comments

Comments

@silvacaio
Copy link

silvacaio commented May 19, 2017

I am doing a software to classify texts about health.
So, I have many texts classified in "positive" and "negative" and these are the texts for my training.
The user will put the new texts and my software will evaluate and say with this text is "positive" or "negative".
Today, I use Weka and Naïve Bayes to this, but I would like of a framework specific to .NET.

So, I have found the code above with a similar case, but I have found the "BagOfWord" to .NET.

https://github.com/encog/encog-java-examples/blob/master/src/main/java/org/encog/examples/ml/bayesian/BayesianSpam.java

It's possible user Encog for this? How?

Thank's

@YuriyZaletskyy
Copy link

So, try to make software that imitates behavior of doctor. What as usually happens when person visits a doctor? Person says what he feels and doctor gives some additional question, with some additional analysis gives diagnosis. I recommend you not just allow to user to enter any text, but also to add some questionnaire and it will improve performance of your app.

@silvacaio
Copy link
Author

YuriyZaletskyy, I really like of your suggestion and I will think about this.

About the texts classification, do you know of is it possible with Encog?
I need this because it is a teacher's request to my school paper.

Thanks

@YuriyZaletskyy
Copy link

YuriyZaletskyy commented May 20, 2017

As you already mentioned text classification as usually is implemented with technique that is known as Bag of words. In the example that you mentioned Encog uses Bag of words for text classification with idea to does some text belong to class spam or does not belong. If your task sounds like: "detect whether words description belong to disease a" then you can use Encog and actually you have almost completed your school paper. But with Encog uses BayesianNetwork class for this task. But if you need neural network approach, then consider recurrent neural networks ( they are good for sequences ). Encog also have some of RNN implemented. If you need something more sophisticated ( LSTM ) I recommend you consider deeplearning4j.

@silvacaio
Copy link
Author

YuriyZaletskyy for now, a need only this: "detect whether words description belong to disease a".

But, the example is to Java and when I have implemented to C#, I have not found the BagOfWords.
Do you know if this is implemented or not?

Thanks for the Help YuriyZaletskyy

@jeroldhaas
Copy link
Contributor

jeroldhaas commented May 24, 2017 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants