hfs/trilang

--------------------------------------------------------------------------------
                    trilang -- Statistical language detector
--------------------------------------------------------------------------------

trilang is a statistical language detector. To detect the language of a text,
it splits the text into trigrams (blocks of three letters) and compares their
frequencies with reference values in its database. The database is initially
empty and must be filled by learning from texts in known languages.
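
The learn-then-detect cycle described above can be sketched in Python. This is
only an illustration under stated assumptions, not trilang's actual code: the
function names, the in-memory dict used as the "database", the use of
overlapping trigrams within words, and the simple frequency-overlap score are
all hypothetical choices made for this example.

```python
from collections import Counter

def trigrams(text):
    """Return overlapping three-letter blocks, lowercased, taken within words.

    Assumption: non-letters act as word separators and trigrams overlap.
    """
    letters = "".join(ch.lower() if ch.isalpha() else " " for ch in text)
    grams = []
    for word in letters.split():
        grams.extend(word[i:i + 3] for i in range(len(word) - 2))
    return grams

def learn(database, language, text):
    """Add the trigram counts of a text with a known language to the database."""
    database.setdefault(language, Counter()).update(trigrams(text))

def relative_frequencies(counts):
    """Turn raw trigram counts into relative frequencies."""
    total = sum(counts.values())
    return {gram: n / total for gram, n in counts.items()} if total else {}

def detect(database, text):
    """Return the learned language whose trigram frequencies overlap most
    with those of the text, or None if the database is empty."""
    sample = relative_frequencies(Counter(trigrams(text)))
    best_language, best_score = None, -1.0
    for language, counts in database.items():
        reference = relative_frequencies(counts)
        # Hypothetical score: shared frequency mass between sample and reference.
        score = sum(min(f, reference.get(g, 0.0)) for g, f in sample.items())
        if score > best_score:
            best_language, best_score = language, score
    return best_language

database = {}  # starts empty, as the README describes
learn(database, "en", "the quick brown fox jumps over the lazy dog")
learn(database, "de", "der schnelle braune fuchs springt ueber den faulen hund")
print(detect(database, "the other dog"))
```

With only one short sentence per language the result is fragile; a usable
database would need far more training text per language.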

The statistical approach has been described in the article "A Statistical
Approach to the Spam Problem" by Gary Robinson, 1 Mar 2003, Linux Journal,
http://www.linuxjournal.com/article/6467 .
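
For orientation, the two central formulas of that article can be sketched in
Python: the degree of belief f(w) = (s*x + n*p(w)) / (s + n), and the
chi-square (Fisher) combination of many such values into one indicator. How
trilang maps trigrams and languages onto these formulas is not stated here, so
the function names and the interpretation in the comments are assumptions.

```python
import math

def degree_of_belief(p, n, s=1.0, x=0.5):
    """Robinson's f(w): pull the raw estimate p toward the prior x.

    n is the number of observations behind p, s the strength given to the
    prior. With n == 0 the result is simply x.
    """
    return (s * x + n * p) / (s + n)

def chi2_tail(chi, df):
    """Probability that a chi-square variable with even df exceeds chi."""
    if df <= 0 or df % 2:
        raise ValueError("df must be a positive even number")
    m = chi / 2.0
    total = term = math.exp(-m)
    for i in range(1, df // 2):
        term *= m / i
        total += term
    return min(total, 1.0)

def combine(beliefs):
    """Combine beliefs (each strictly between 0 and 1) into one indicator.

    The result is near 1 when the beliefs are mostly high, near 0 when they
    are mostly low, and near 0.5 when the evidence is mixed or weak.
    """
    n = len(beliefs)
    h = chi2_tail(-2.0 * sum(math.log(f) for f in beliefs), 2 * n)
    s = chi2_tail(-2.0 * sum(math.log(1.0 - f) for f in beliefs), 2 * n)
    return (1.0 + h - s) / 2.0

# Hypothetical use: one belief per trigram that the text is in a given language.
beliefs = [degree_of_belief(p, n) for p, n in [(0.9, 20), (0.8, 5), (0.95, 40)]]
print(combine(beliefs))
```

The smoothing step matters for rare trigrams: a trigram seen only once would
otherwise contribute an extreme estimate of 0 or 1.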


Contact

Please email me with any comments or questions you have:
Hermann Schwarting <[email protected]>
