Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Минимальная конфигурация для Томиты #8

Merged
merged 36 commits into from
Oct 6, 2017

Conversation

pinbraerts
Copy link
Member

@pinbraerts pinbraerts commented Sep 29, 2017

Можно проанализиповать файл test/data/input/$file на наличие точек, зайдя в директорию extract/ и выполнив extract $file. Появится test/data/output/$(basename $file .txt).xml.


This change is Reviewable

@cblp
Copy link
Member

cblp commented Sep 30, 2017

Reviewed 13 of 13 files at r1.
Review status: all files reviewed at latest revision, 8 unresolved discussions.


extract/config.proto, line 4 at r1 (raw file):

TTextMinerConfig {
	Dictionary = "mydic.gzt";

заменить табы на пробелы


extract/extract, line 1 at r1 (raw file):

tomitaparser config.proto < ../test/data/input/$0 > ../test/data/output/$(basename $0 .txt).xml

много ошибок. кто-нибудь пробовал запускать?


extract/facttypes.proto, line 5 at r1 (raw file):

message Point : NFactType.TFact  {
	optional string Name = 1;

заменить табы на пробелы


extract/mydic.gzt, line 8 at r1 (raw file):


TAuxDicArticle "точка" {
	key = { "tomita:Point.cxx" type=CUSTOM }

заменить табы на пробелы


extract/mydic.gzt.bin, line 0 at r1 (raw file):
Это не исходный код. Убрать.


extract/Point.bin, line 0 at r1 (raw file):
Это не исходный код. Убрать.


extract/pr.html, line 1 at r1 (raw file):

<html xmlns:fo="http://www.w3.org/1999/XSL/Format"><HEAD><meta http-equiv="Content-Type" content="text/html; charset=UTF-8"></HEAD></html><body xmlns:fo="http://www.w3.org/1999/XSL/Format">

Это не исходный код. Убрать.


extract/Readme.md, line 1 at r1 (raw file):

To get facts from file in test\data\input run `extract $file` in **this** directory

Нет, не заработает. Скажет, что нет программы extract.


Comments from Reviewable

@pinbraerts
Copy link
Member Author

Сделал на телефоне, сэмулировав tomitaparser. В данном случае парсер просто выводит всё, что есть в STDIN и в конце строчку с названием файла конфигурации. В любом случае это не важно, можно пока так делать.

@cblp
Copy link
Member

cblp commented Oct 6, 2017

Reviewed 9 of 10 files at r2.
Review status: all files reviewed at latest revision, all discussions resolved.


Comments from Reviewable

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants