Character escape sequences #14

Open
bripmccann opened this issue Mar 26, 2019 · 1 comment

Comments

bripmccann commented Mar 26, 2019

The character mappings mostly represent non-ASCII characters with hexadecimal or Unicode escape sequences rather than the characters themselves (e.g., æ is represented by \xe6 and ʌ by \u028c).

But there are non-escaped, non-ASCII IPA characters in the mappings under // edits arising from testing. The same is true for all the instances of /mʊmˈbaɪ/ in the HTML.

Are escape sequences helpful here? If so, should these unescaped characters be converted?

If not, could all the characters be unescaped? Testing locally, it seems to work fine that way. And it would make the code more readable.
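To illustrate the question above, here is a minimal JavaScript sketch of why both forms behave the same and how the escapes could be converted in bulk. It assumes the mappings live in ordinary JavaScript string literals; `unescapeNonAscii` is a hypothetical helper, not something from the project.

```javascript
// Escaped and literal forms are the same string once the file is decoded as UTF-8.
const escaped = { ae: "\xe6", turnedV: "\u028c" };
const literal = { ae: "æ", turnedV: "ʌ" };
console.log(escaped.ae === literal.ae, escaped.turnedV === literal.turnedV);

// Hypothetical helper: rewrite \xNN and \uNNNN escapes found in source text
// as the literal characters they stand for.
function unescapeNonAscii(sourceText) {
  const escapePattern = /\\(?:x([0-9a-fA-F]{2})|u([0-9a-fA-F]{4}))/g;
  return sourceText.replace(escapePattern, (match, hex2, hex4) => {
    const codeUnit = parseInt(hex2 || hex4, 16);
    // Keep ASCII escapes (e.g. \x27) as written so quotes stay escaped.
    return codeUnit < 0x80 ? match : String.fromCharCode(codeUnit);
  });
}
```

This sketch does not handle an escaped backslash directly before `x` or `u` (a literal `\\xe6` in the source), so any converted file would still need a manual review.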

Contributor

ssb22 commented Dec 12, 2019

The escape sequences came from Lexconvert. I wrote them as escape sequences because I was using an editor on an ASCII-only terminal at the time. I see no reason not to convert them all to UTF-8 if we can guarantee that UTF-8 will be used as the character encoding for the script.
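One possible way to check the UTF-8 condition before converting is to decode the script's bytes strictly and reject anything malformed. The sketch below assumes a standard Node.js build (with the global `TextDecoder` and its `fatal` option); `isValidUtf8` is a hypothetical name. On the page itself, declaring the encoding (for instance `<meta charset="utf-8">` in the HTML) is the usual complement to this check.

```javascript
// Hypothetical check: returns true only if the bytes decode cleanly as UTF-8.
function isValidUtf8(bytes) {
  try {
    // fatal: true makes decode() throw on malformed input instead of
    // silently substituting U+FFFD replacement characters.
    new TextDecoder("utf-8", { fatal: true }).decode(bytes);
    return true;
  } catch (err) {
    return false;
  }
}

// Usage sketch: bytes of a UTF-8 string pass; a lone Latin-1 byte for æ does not.
console.log(isValidUtf8(Buffer.from("/mʊmˈbaɪ/", "utf8")));
console.log(isValidUtf8(Uint8Array.of(0xe6)));
```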
