This branch is 38 commits ahead of takuyaa/kuromoji.js:master.

kuromoji.js

A JavaScript implementation of a Japanese morphological analyzer. This is a pure JavaScript port of Kuromoji.

You can see how kuromoji.js works on the demo site.

Usage

You can tokenize sentences with only 5 lines of code. For working examples, see the files in the demo and example directories.

Node.js

Install with the npm package manager:

npm install kuromoji

Load this library as follows:

var kuromoji = require("kuromoji");

Build a tokenizer and use it like this:

kuromoji.builder({ dicPath: "path/to/dictionary/dir/" }).build(function (err, tokenizer) {
    // tokenizer is ready
    var path = tokenizer.tokenize("すもももももももものうち");
    console.log(path);
});
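The callback above receives an `err` argument that the short example does not check. As a minimal sketch, the builder call could be wrapped in a Promise so that a failed dictionary load is surfaced as a rejection. The helper name `buildTokenizer` is hypothetical and not part of kuromoji.js; the library object is passed in as a parameter so the helper only relies on the `builder({ dicPath }).build(callback)` call shown above.

```javascript
// Hypothetical helper (not part of kuromoji.js): wraps the callback-style
// builder in a Promise. `kuromojiLib` is the object returned by
// require("kuromoji"); `dicPath` is the dictionary directory.
function buildTokenizer(kuromojiLib, dicPath) {
    return new Promise(function (resolve, reject) {
        kuromojiLib.builder({ dicPath: dicPath }).build(function (err, tokenizer) {
            if (err) {
                // Dictionary loading failed; report it instead of ignoring it.
                reject(err);
                return;
            }
            resolve(tokenizer);
        });
    });
}

// Possible usage, assuming kuromoji is installed and the path is valid:
//
// buildTokenizer(require("kuromoji"), "path/to/dictionary/dir/")
//     .then(function (tokenizer) {
//         console.log(tokenizer.tokenize("すもももももももものうち"));
//     })
//     .catch(function (err) {
//         console.error("Failed to build tokenizer:", err);
//     });
```

Passing the library in as an argument is only a convenience for this sketch; calling `require("kuromoji")` inside the helper would work just as well.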

Browser

You only need the build/kuromoji.js and dict/*.dat.gz files.

Install with the Bower package manager:

bower install kuromoji

Or you can use the kuromoji.js file and dictionary files from the GitHub repository.

In your HTML:

<script src="url/to/kuromoji.js"></script>

In your JavaScript:

kuromoji.builder({ dicPath: "/url/to/dictionary/dir/" }).build(function (err, tokenizer) {
    // tokenizer is ready
    var path = tokenizer.tokenize("すもももももももものうち");
    console.log(path);
});

API

The function tokenize() returns an array of token objects like this:

[ {
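    word_id: 509800,          // word ID in the dictionary
    word_type: 'KNOWN',       // word type (KNOWN if the word is in the dictionary, UNKNOWN otherwise)
    word_position: 1,         // start position of the word
    surface_form: '黒文字',    // surface form
    pos: '名詞',               // part of speech
    pos_detail_1: '一般',      // part-of-speech subcategory 1
    pos_detail_2: '*',        // part-of-speech subcategory 2
    pos_detail_3: '*',        // part-of-speech subcategory 3
    conjugated_type: '*',     // conjugation type
    conjugated_form: '*',     // conjugation form
    basic_form: '黒文字',      // base form
    reading: 'クロモジ',       // reading
    pronunciation: 'クロモジ'  // pronunciation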
  } ]

(This is defined in src/util/IpadicFormatter.js)
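
As an illustration of how the token array might be consumed, the sketch below works on a plain array in the shape shown above, so it runs without loading a dictionary. The helper names (`surfaceForms`, `filterByPos`, `toReading`) are hypothetical and not part of kuromoji.js. The fallback to `surface_form` assumes that some tokens may have no usable `reading` (missing or `'*'`); that is an assumption of this sketch, not documented behavior.

```javascript
// Sample data in the documented token shape (values taken from the example above).
var sampleTokens = [{
    word_id: 509800,
    word_type: "KNOWN",
    word_position: 1,
    surface_form: "黒文字",
    pos: "名詞",
    pos_detail_1: "一般",
    pos_detail_2: "*",
    pos_detail_3: "*",
    conjugated_type: "*",
    conjugated_form: "*",
    basic_form: "黒文字",
    reading: "クロモジ",
    pronunciation: "クロモジ"
}];

// Hypothetical helper: list the surface forms, i.e. the text as written.
function surfaceForms(tokens) {
    return tokens.map(function (t) { return t.surface_form; });
}

// Hypothetical helper: keep only tokens with the given part of speech.
function filterByPos(tokens, pos) {
    return tokens.filter(function (t) { return t.pos === pos; });
}

// Hypothetical helper: join the readings into one string, falling back to
// the surface form when a token has no usable reading (assumed case).
function toReading(tokens) {
    return tokens.map(function (t) {
        return (t.reading && t.reading !== "*") ? t.reading : t.surface_form;
    }).join("");
}

console.log(surfaceForms(sampleTokens));             // [ '黒文字' ]
console.log(filterByPos(sampleTokens, "名詞").length); // 1
console.log(toReading(sampleTokens));                // クロモジ
```

In real use, the array returned by tokenizer.tokenize() would take the place of `sampleTokens`.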
