Input type #8

xsway · 2017-11-02T10:03:48Z

Hi!

I was considering using your parser to parse some wiki corpora. A quick question: what is the type of input for pre-trained models? Is it possible to give to your parser raw text and get the whole pipeline (tokenization, tagging, parsing) running, or do you require the pre-processed conll-style input with POS tags?

Thanks!

msklvsk · 2017-11-18T11:33:08Z

You have to pre-spit into sentences and tokens with UDPipe. That's what Stanford did for this parser:

That is, the input should be CoNLL-U-formatted. This parser/tagger will fill the corresponding columns in CoNLL-U.

Vimos · 2019-03-05T11:40:08Z

This better goes to the Readme.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Input type #8

Input type #8

xsway commented Nov 2, 2017

msklvsk commented Nov 18, 2017

Vimos commented Mar 5, 2019

Input type #8

Input type #8

Comments

xsway commented Nov 2, 2017

msklvsk commented Nov 18, 2017

Vimos commented Mar 5, 2019