chinese-text-segmentation

Here are 41 public repositories matching this topic...

wolfgarbe / SymSpell

SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

spellcheck fuzzy-search fuzzy-matching edit-distance levenshtein levenshtein-distance spelling spell-check chinese-text-segmentation word-segmentation approximate-string-matching spelling-correction damerau-levenshtein text-segmentation chinese-word-segmentation symspell

Updated Mar 29, 2025
C#

koth / kcws

Star

Deep Learning Chinese Word Segment

nlp deep-learning tensorflow chinese-text-segmentation pos-tagger

Updated May 18, 2018
C++

fukuball / jieba-php

Star

"結巴"中文分詞：做最好的 PHP 中文分詞、中文斷詞組件。 / "Jieba" (Chinese for "to stutter") Chinese text segmentation: built to be the best PHP Chinese word segmentation module.

nlp machine-learning natural-language-processing chinese-text-segmentation

Updated Apr 11, 2025
PHP

Jcseg is a light weight NLP framework developed with Java. Provide CJK and English segmentation based on MMSEG algorithm, With also keywords extraction, key sentence extraction, summary extraction implemented based on TEXTRANK algorithm. Jcseg had a build-in http server and search modules for lucene,solr,elasticsearch,opensearch

java nlp natural-language-processing chinese-nlp chinese-text-segmentation nlp-keywords-extraction pos-tagging solr-plugin chinese-word-segmentation jcseg mmseg lucene-analyzer elasticsearch-analyzer keywords-extraction lucene-tokenizer jcseg-analyzer opensearch-analyzer opensearch-tokenizer elasticsearch-tokenizer

Updated Sep 18, 2023
Java

mammothb / symspellpy

Sponsor

Star

Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

python spellcheck fuzzy-search fuzzy-matching edit-distance levenshtein levenshtein-distance spelling spell-check chinese-text-segmentation word-segmentation approximate-string-matching spelling-correction damerau-levenshtein text-segmentation chinese-word-segmentation symspell

Updated Apr 26, 2025
Python

amutu / zhparser

Star

zhparser is a PostgreSQL extension for full-text search of Chinese language

extension postgresql chinese chinese-nlp chinese-text-segmentation scws zhparser

Updated Jan 24, 2025
C

qinwf / jiebaR

Star

Chinese text segmentation with R. R语言中文分词（文档已更新 🎉 ：https://qinwenfeng.com/jiebaR/ )

nlp chinese lexical-analysis cppjieba jieba chinese-text-segmentation

Updated Jul 13, 2020
C++

yongzhuo / Pytorch-NLU

Star

中文文本分类、序列标注工具包（pytorch），支持中文长文本、短文本的多类、多标签分类任务，支持中文命名实体识别、词性标注、分词、抽取式文本摘要等序列标注任务。 Chinese text classification and sequence labeling toolkit, supports multi class and multi label classification, text similsrity, text summary and NER.