Skip to content

Preserve Spanish « open and closing » double quote marks in the training data. #297

Open
@davidbaines

Description

@davidbaines

It seems that these Spanish « open and closing » double quote marks are removed from the training data.
A lot of pre and post processing is required to support them:

I did a find and replace to change them to English “open and closing quotes” prior to training.
Then replace them in the source text with English open and close double quote marks so that it matches those seen in training.

Then edit the draft which is complicated by the fact that it has straight English double quotes - so there's no distinction between the opening and closing marks. More care had to be taken changing those into opening and closing Spanish quotation marks. 

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    Status

    🔖 Ready

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions