Skip to content

Hyphens and spaces omitted in scraped words #3

Open
@dkalantzi

Description

@dkalantzi

Hello,

Thank you for this very useful resource. I've noticed two potential issues with the words in the scraped data:

  • Hyphens Omission: Hyphens from the original dictionary entries seem to be missing. For example, the first word in the txt is aalugalog, but the entry in the dictionary is aalug-alog.
  • Spaces Omission: Spaces between words in phrases from the original dictionary also seem to be missing, causing phrases to be scraped as single words. For example, patay na Bulan is scraped as pataynaBulan, “patay na hayop” is scraped as pataynahayop.

Kind regards,
Dimitra

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions