More 'asciify' documentation #929

gwern · 2014-09-03T23:22:17Z

The documentation for the asciify_paths option says

Convert all non-ASCII characters in paths to ASCII equivalents. For example, if your path template for singletons is singletons/$title and the title of a track is “Café”, then the track will be saved as singletons/Cafe.mp3.

This is clear enough for a Latin script (one would expect 'é' to be converted to 'e'), but it's unclear what this command would do for anything written in entirely different scripts or writing systems like Japanese kanji. (It says 'all' - does that mean they would be deleted since they have no ASCII equivalents?) I have no idea what it might do, and am too terrified to let this option anywhere near my files to figure it out empirically, so more documentation would be helpful for deciding whether to use this option.

The text was updated successfully, but these errors were encountered:

andriykohut · 2014-09-04T00:37:55Z

You can check out unidecode description, it's just doing transliteration:

function unidecode() takes Unicode data and tries to represent it in ASCII characters (i.e., the universally displayable characters between 0x00 and 0x7F), where the compromises taken when mapping between two character sets are chosen to be near what a human with a US keyboard would choose.

Something like:

from unidecode import unidecode

unidecode.unidecode(u'パイソン') # 'paison'
unidecode.unidecode(u'蠎') # 'mang'

sampsyo · 2014-09-04T00:39:39Z

Thanks for chiming in, @andriykohut.

@gwern, if you do some investigation here, could you please consider adding what you find to the docs? The Unidecode mapping is pretty straightforward and there's no reason we shouldn't spend a sentence or two giving more detail.

This was proposed in beetbox#929 but never dealt with, so after going through some confusion around this myself I figured it's about time.

sampsyo added the docs label Sep 4, 2014

sampsyo added the needinfo We need more details or follow-up from the filer before this can be tagged "bug" or "feature." label Sep 7, 2014

sampsyo closed this as completed Nov 25, 2014

emiham added a commit to emiham/beets that referenced this issue Sep 17, 2021

Add additional asciify documentation

e48d76a

This was proposed in beetbox#929 but never dealt with, so after going through some confusion around this myself I figured it's about time.

emiham mentioned this issue Sep 18, 2021

Add additional asciify documentation #4067

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More 'asciify' documentation #929

More 'asciify' documentation #929

gwern commented Sep 3, 2014

andriykohut commented Sep 4, 2014

sampsyo commented Sep 4, 2014

More 'asciify' documentation #929

More 'asciify' documentation #929

Comments

gwern commented Sep 3, 2014

andriykohut commented Sep 4, 2014

sampsyo commented Sep 4, 2014