Skip to content

amacinho/Rovereto-Twitter-Tokenizer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 

Repository files navigation

A tokenizer for Twitter-based text. Keeps @mentions and #hastags intact. Written by Amaç Herdağdelen 2011.

The code is licensed under the Apache License 2.0: http://www.apache.org/licenses/LICENSE-2.0.html

For emoticon and URL recognition, this code uses parts of TweetMotif (https://github.com/brendano/tweetmotif). TweetMotif is also licensed under the Apache License 2.0: http://www.apache.org/licenses/LICENSE-2.0.html

About

Tokenizer for Twitter-based text

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages