Skip to content

Chinese new word discovery with computing PMI, Left and right entropy and Word-position Probability. 新词发现

Notifications You must be signed in to change notification settings

milliemaoo/New-word-discovery

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

New-word-discovery

Chinese new word discovery

This is a simple project to find Chinese new word. Inspired by Matrix67 and ChineseWordSegmentation, wirtten with python.

Usage

  • Place the corpus in the root directory of the project.
  • All hyperparameters are configured in the "./config.py"

Involved algorithm

  • PMI
  • Left and right entropy
  • Position-Word Probability (word-initial,suffix)

About

Chinese new word discovery with computing PMI, Left and right entropy and Word-position Probability. 新词发现

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages