Special character \H removed from filepath #492

stephanedebove · 2024-10-28T00:06:39Z

Describe the bug

The convert_to_unicode() function treats a \H sequence inside a file path as a LaTeX accent command and converts it, which corrupts the path.

Code

This code:

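import bibtexparser
from bibtexparser.bparser import BibTexParser
from bibtexparser.customization import convert_to_unicode

with open(BIB_PATH, 'r', encoding='utf-8') as bib_file:
    parser = BibTexParser()
    parser.customization = convert_to_unicode
    bib_database = bibtexparser.load(bib_file, parser=parser)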

run on a .bib file containing this entry:

@article{Hagger2022,
  title = {Perceived Behavioral Control Moderating Effects in the Theory of Planned Behavior: {{A}} Meta-Analysis},
  file = {C:\Users\name\Documents\Zotero\storage\7J78GAC5\Hagger et al_2022_Perceived behavioral control moderating effects in the theory of planned.pdf}
}

removes the "\H" from the file path and puts an accent on the letter that follows, so the path becomes:

C:\Users\name\Documents\Zotero\storage\7J78GAC5a̋gger et al_2022_Perceived behavioral control moderating effects in the theory of planned.pdf
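
For context, \H is LaTeX's double acute (Hungarian umlaut) accent command, which is presumably why the "a" after it ends up accented. Below is a minimal sketch of that kind of substitution. The helper name naive_h_accent and its regular expression are illustrative assumptions, not bibtexparser's actual implementation, and the path is shortened for readability.

```python
import re
import unicodedata

# U+030B is the combining character LaTeX's \H accent corresponds to.
DOUBLE_ACUTE = "\u030b"


def naive_h_accent(text):
    """Hypothetical stand-in for an accent converter (NOT bibtexparser's code).

    Replaces \\H followed by a letter (optionally in braces) with that
    letter plus a combining double acute accent.
    """
    return re.sub(
        r"\\H\{?([A-Za-z])\}?",
        lambda match: match.group(1) + DOUBLE_ACUTE,
        text,
    )


# A Windows path whose last folder separator is followed by "H".
path = r"storage\7J78GAC5\Hagger et al.pdf"
mangled = naive_h_accent(path)

# The separator and the "H" are gone, and the "a" now carries the accent.
accent_name = unicodedata.name(DOUBLE_ACUTE)
```

Any converter that looks for LaTeX commands without knowing the field holds a path would have the same problem, since a backslash followed by a letter is both a valid Windows path fragment and a valid LaTeX command.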

Reproducing

Version: 1.4.2

Workaround

For now, I just rewrote the convert_to_unicode function to skip the file field:

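def convert_to_unicode(record):
    for val in record:
        if val == "file":
            continue  # leave file paths untouched
        # ... convert every other field as the library's function does
    return record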
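
A similar effect might be achieved without copying the library function, by wrapping any record customization so that selected fields are restored afterwards. The sketch below is only an illustration: skip_fields is a hypothetical helper, not part of bibtexparser, and it assumes the wrapped customization takes a record dict and returns a dict, as v1 customizations do.

```python
def skip_fields(customization, fields=("file",)):
    """Wrap a record customization so the named fields pass through unchanged.

    Hypothetical helper, not part of bibtexparser. Assumes `customization`
    accepts a dict and returns a dict (possibly the same object, mutated).
    """
    protected = frozenset(fields)

    def wrapper(record):
        # Save the protected values before the customization can alter them.
        saved = {key: record[key] for key in protected if key in record}
        result = customization(record)
        # Put the saved values back, overwriting any converted versions.
        result.update(saved)
        return result

    return wrapper
```

Under those assumptions it would be used as parser.customization = skip_fields(convert_to_unicode), leaving the file field exactly as written in the .bib file while other fields are still converted.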

MiWeiss added the v1 label Dec 19, 2024
