Encoding problem with reading files downloaded from brenda #7

MistyField · 2023-07-27T05:30:58Z

The .txt file downloaded from http:/www.brenda-enzymes.org is encoded in utf-8. But in the line 76 of parser.py it's:
with open(path_to_database, encoding="iso-8859-1") as file:
maybe it should be changed into:
with open(path_to_database, encoding="utf-8") as file:?
because if the file was decoded in iso-8859-1, the meta of the reaction may become
'#4# pH 7.0, 30Â°C, recombinantwild-type enzyme <6>; #4# pH 7.0, 30Â°C, recombinant free enzyme <8>'
but if it was changed into utf-8, it turns normal: '#4# pH 7.0, 30°C, recombinantwild-type enzyme <6>; #4# pH 7.0, 30°C, recombinant free enzyme <8>'

The text was updated successfully, but these errors were encountered:

Robaina · 2023-09-10T09:32:18Z

Hi @MistyField ,

thanks for noticing this and apologies for the late reply. I don't remember now why I used that particular encoding but you are right there. I'm planning to update BRENDApyrser to accommodate the "new" json format BRENDA released a while ago, see issue #9. It doesn't make much sense to continue parsing the txt file having now a json.

Best!

Robaina · 2023-09-15T17:45:39Z

Issue solved in PR #8

Robaina closed this as completed Sep 15, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Encoding problem with reading files downloaded from brenda #7

Encoding problem with reading files downloaded from brenda #7

MistyField commented Jul 27, 2023 •

edited

Loading

Robaina commented Sep 10, 2023

Robaina commented Sep 15, 2023

Encoding problem with reading files downloaded from brenda #7

Encoding problem with reading files downloaded from brenda #7

Comments

MistyField commented Jul 27, 2023 • edited Loading

Robaina commented Sep 10, 2023

Robaina commented Sep 15, 2023

MistyField commented Jul 27, 2023 •

edited

Loading