Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CSV module includes UTF-8 BOM when parsing lines #27

Open
j2salmingo opened this issue Aug 15, 2023 · 1 comment
Open

CSV module includes UTF-8 BOM when parsing lines #27

j2salmingo opened this issue Aug 15, 2023 · 1 comment

Comments

@j2salmingo
Copy link
Contributor

Some programs (namely, Notepad) will prepend a 3-byte signature to text files to indicate that it is UTF-8 encoded. Although not endorsed by the actual UTF standard, if it is there it causes problems with parsing as the 3 bytes will be included in the first field.

This may cause problems with validating the input.

@yashpatel6
Copy link
Collaborator

That's a good point, a lot of the Windows programs end up causing format clashes when transferred to Linux. It's a good thing to keep in mind though

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants