Skip to content

Tamil language tools in Python. By the people who brought you TamilClass.org

License

Notifications You must be signed in to change notification settings

rsujeevan/amuthaa

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

67 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

amuthaa

amuthaa provides a number of tools to handle the processing of Tamil letters and words.

Here is a sampling of the TamilLetter class:

>>> from amuthaa import TamilLetter
>>>
>>> letter = u'ம'
>>>
>>> TamilLetter.is_aytham(letter)
False
>>> TamilLetter.is_vowel(letter)
False
>>> TamilLetter.is_consonant(letter)
False
>>> TamilLetter.is_combination(letter)
True
>>>
>>> consonant, vowel = TamilLetter.split_combination(letter)
>>> print consonant
ம்
>>> print vowel
அ
>>>
>>> TamilLetter.is_consonant(consonant)
True
>>> TamilLetter.is_vowel(vowel)
True
>>>
>>> print TamilLetter.get_combination(u'க்', u'ஐ')
கை
>>>
>>>
>>> vowels = TamilLetter.get_vowels()
>>>
>>> for v in vowels:
...     print v, TamilLetter.get_vowel_type(v)
...
அ KURIL
இ KURIL
ஆ NEDIL
உ KURIL
ஈ NEDIL
ஊ NEDIL
ஏ NEDIL
எ KURIL
ஐ NEDIL
ஓ NEDIL
ஒ KURIL
ஔ NEDIL
>>>
>>> consonants = TamilLetter.get_consonants()
>>>
... for c in consonants:
...     print c, TamilLetter.get_consonant_type(c)
...
ய் IDAIYINAM
ர் IDAIYINAM
ல் IDAIYINAM
வ் IDAIYINAM
ழ் IDAIYINAM
ள் IDAIYINAM
ஜ் GRANTHA
ஷ் GRANTHA
ஸ் GRANTHA
ஹ் GRANTHA
க் VALLINAM
ச் VALLINAM
ட் VALLINAM
த் VALLINAM
ப் VALLINAM
ற் VALLINAM
ங் MELLINAM
ஞ் MELLINAM
ண் MELLINAM
ந் MELLINAM
ம் MELLINAM
ன் MELLINAM
>>>
>>> foreign_letters = [
...                    u'ഢ', # Malayalam
...                    u'ම', # Sinhala
...                    u'న', # Telugu
...                    u'ನ', # Kannada
...                    u'औ', # Devanagari (Sanskrit, Hindi, Marathi)
...                    u'ល', # Khmer (Cambodia)
...                    u'a', # Latin
...                    ]
>>>
>>> for f in foreign_letters:
...     print "The letter %s is in the %s script" %(f, TamilLetter.get_script_name(f))
...
The letter ഢ is in the MALAYALAM script
The letter ම is in the SINHALA script
The letter న is in the TELUGU script
The letter ನ is in the KANNADA script
The letter औ is in the DEVANAGARI script
The letter ល is in the KHMER script
The letter a is in the LATIN script
>>>

About

Tamil language tools in Python. By the people who brought you TamilClass.org

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published