the output is not escaped on error #572

yegor256 · 2024-11-28T12:11:52Z

I see this in the log:

$ eo-phi-normalizer rewrite --rules 0.yml Foo.phi --single -o Bar.phi
eo-phi-normalizer: syntax error at line 1, column 1 due to lexer error
on input
?.org.eolang.bytes ( ?0 ? ? ? ? 00-00-00-00-00-00-1E-61 ? )

Here, I can't tell whether the problem is with the encoding or whether the input was indeed formatted as ?0 instead of α0. I suggest that you "escape" non-ASCII symbols in the output. Instead of printing UTF-8 characters as is, convert them to something like \u045e.

Maybe you can say "on input (non-ASCII symbols escaped)" instead of just "on input".
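
To make the suggestion concrete, here is a minimal sketch of this kind of escaping. It is written in Python purely for illustration and is not taken from eo-phi-normalizer; the function name `escape_unicode` is hypothetical, and the `\uXXXX` / `\UXXXXXXXX` output format is an assumption based on the `\u045e` example above.

```python
def escape_unicode(text: str) -> str:
    """Replace every non-ASCII character with a \\uXXXX-style escape."""
    out = []
    for ch in text:
        cp = ord(ch)
        if cp < 0x80:
            # Plain ASCII is kept as is.
            out.append(ch)
        elif cp <= 0xFFFF:
            # Basic Multilingual Plane: four hex digits.
            out.append(f"\\u{cp:04x}")
        else:
            # Code points beyond the BMP: eight hex digits.
            out.append(f"\\U{cp:08x}")
    return "".join(out)


if __name__ == "__main__":
    print(escape_unicode("\u03b10"))
```

With this scheme, a reader can tell from the error message alone whether the input contained a literal `?` or a non-ASCII symbol that the terminal failed to render.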


yegor256 · 2024-11-28T12:11:57Z

@deemp please, help

deemp · 2024-11-28T12:33:45Z

@yegor256, run export LC_ALL=C.UTF-8 before running this command.

yegor256 · 2024-11-28T13:03:50Z

@deemp yes, we know the workaround, but please make the output escaped :)

deemp · 2024-11-28T15:56:58Z

@yegor256,

Does normalizer render Unicode correctly in error messages with export ...?
Does normalizer render Unicode correctly in normal output without export ...? If it doesn't, then export ... is not a workaround, but a necessity. We can write it explicitly on command pages on the docs site.

yegor256 · 2024-11-28T16:17:47Z

@deemp yes, it works with the export, but I kindly ask you to implement this escaping feature because it will help users debug much faster

deemp · 2024-11-29T13:19:48Z

@yegor256, can you suggest how to distinguish when to print Unicode and when to escape?

I thought about:

Checking the LANG environment variable, but the locale may be set in other ways on different platforms.
Adding an option like --use-unicode-code-points that would always output unicode.
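
A third possibility, sketched here in Python for illustration only (not code from eo-phi-normalizer; `can_render` and `stream_can_render` are hypothetical helpers): instead of inspecting LANG, ask whether the output stream's encoding can represent the text at all, and escape only when it cannot.

```python
import sys


def can_render(text: str, encoding: str) -> bool:
    """Return True if `text` can be encoded losslessly with `encoding`."""
    try:
        text.encode(encoding)
        return True
    except (UnicodeEncodeError, LookupError):
        # LookupError covers encoding names Python does not know.
        return False


def stream_can_render(text: str, stream=sys.stdout) -> bool:
    """Check `text` against the encoding a text stream reports."""
    # Fall back to ASCII when the stream reports no encoding.
    encoding = getattr(stream, "encoding", None) or "ascii"
    return can_render(text, encoding)
```

This avoids depending on one particular environment variable, but it trusts the encoding that the stream reports, which may still differ from what the terminal actually displays.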

yegor256 · 2024-11-29T13:22:00Z

@deemp just always escape when you print this error message. Why not escape? It's an error message; it won't be parsed by any software, it will always be read by humans. Replace all 0x7f+ symbols with their mnemos, that's it.

deemp · 2024-11-29T13:36:47Z

@yegor256, it's inconvenient to read numbers when you can read Unicode characters. If the locale is set correctly, users may prefer to see Unicode.

yegor256 · 2024-11-29T13:38:57Z

@deemp I'm the primary user of this app :) I'm telling you, as a user, that error messages must be as unambiguous as possible. Unicode is more ambiguous than ASCII.

deemp · 2024-11-29T13:42:46Z

> I'm the primary user of this app

@yegor256, OK, I'll keep that in mind :) Let's escape.

deemp · 2024-11-29T15:12:30Z

@yegor256, here are representations of errors.

With escaping:

syntax error at line 1, column 1 before `\961'
on input
\961 \8614 \10214 t \8614 \958.\961.k.\961.t

With correctly set locale and without escaping:

syntax error at line 1, column 1 before `ρ'
on input
ρ ↦ ⟦ t ↦ ξ.ρ.k.ρ.t

Do you really prefer the option with escaping?
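
For reference, the escaped form shown above (a backslash followed by the decimal code point) can be reproduced with a few lines. This is a Python sketch for illustration, not the code from eo-phi-normalizer, and the name `escape_decimal` is hypothetical.

```python
def escape_decimal(text: str) -> str:
    """Replace each non-ASCII character with a backslash and its decimal code point."""
    return "".join(
        ch if ord(ch) < 0x80 else f"\\{ord(ch)}"
        for ch in text
    )


if __name__ == "__main__":
    # The sample input from the comment above.
    print(escape_decimal("\u03c1 \u21a6 \u27e6 t \u21a6 \u03be.\u03c1.k.\u03c1.t"))
```

Note that this simple scheme is not reversible in every case: an escaped character directly followed by a digit, or a literal backslash in the input, makes the result ambiguous. For an error message read by humans that may be acceptable.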

yegor256 · 2024-11-29T16:50:14Z

@deemp can you do both? show the original one and then print the escaped one?

deemp · 2024-11-29T16:57:58Z

> the original one

@yegor256, which one do you mean?

yegor256 · 2024-11-29T17:03:07Z

@deemp how many do you have? :) print them both

deemp · 2024-11-29T17:21:32Z

@yegor256, see #572 (comment)

yegor256 · 2024-11-30T04:54:24Z

@deemp please, print both outputs in case of error: 1) not escaped, and 2) escaped

deemp · 2024-12-02T19:24:58Z

@yegor256

Platform: Linux

Input program: ξ.a.b(c ↦ ⟦ Δ ⤍ 3F-FC ⟧)

Not escaped, export LANG=en_US.UTF-8:

eo-phi-normalizer: An error occurred when parsing the input program:
syntax error at line 1, column 1 before `'
on the input:
.a.b(c     3F-FC )

Not escaped, export LANG=C.UTF-8:

eo-phi-normalizer: An error occurred when parsing the input program:
syntax error at line 1, column 1 before `ξ'
on the input:
ξ.a.b(c ↦ ⟦ Δ ⤍ 3F-FC ⟧)

Non-ASCII escaped (Unicode characters replaced with their numbers), LANG doesn't matter:

eo-phi-normalizer: An error occurred when parsing the input program:
syntax error at line 1, column 1 before `\961'
on the input:
\961 \8614 \10214 t \8614 \958.\961.k.\961.t

yegor256 · 2024-12-03T03:10:24Z

@deemp the first option is not "escaping" but "removing" :) please, use option two and option three together
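
A sketch of what "option two and option three together" could look like, in Python for illustration only. The function names and the label "on the input (non-ASCII symbols escaped):" are assumptions pieced together from the messages quoted in this thread, not the actual output of eo-phi-normalizer.

```python
def escape_non_ascii(text: str) -> str:
    """Replace each non-ASCII character with a backslash and its decimal code point."""
    return "".join(ch if ord(ch) < 0x80 else f"\\{ord(ch)}" for ch in text)


def format_parse_error(detail: str, source: str) -> str:
    """Build an error message that shows the input both raw and escaped."""
    lines = [
        "An error occurred when parsing the input program:",
        detail,
        "on the input:",
        source,  # raw form: readable when the locale is set correctly
        "on the input (non-ASCII symbols escaped):",
        escape_non_ascii(source),  # escaped form: readable in any locale
    ]
    return "\n".join(lines)


if __name__ == "__main__":
    print(format_parse_error("syntax error at line 1, column 1", "\u03be.a.b"))
```

Printing both forms means the escaped line stays legible even when the terminal drops or mangles the characters in the raw line.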

deemp · 2024-12-03T19:27:04Z

@yegor256, in #590 I've implemented a way to always use option 2 regardless of the locale. eo-phi-normalizer will set the locale on its own, as well as the code page on Windows.

We'll soon make a release where this functionality is supported.
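
As a rough analogy for making the output independent of the locale, here is a Python sketch. It does not show how #590 is implemented and does not cover the Windows code page; `utf8_text_stream` is a hypothetical helper that wraps a binary stream so that text is always written as UTF-8.

```python
import io


def utf8_text_stream(binary_stream):
    """Wrap a binary stream in a text stream that always encodes as UTF-8."""
    return io.TextIOWrapper(
        binary_stream,
        encoding="utf-8",           # fixed encoding, independent of LANG / LC_ALL
        errors="backslashreplace",  # escape anything that cannot be encoded
        write_through=True,         # pass writes to the underlying stream immediately
    )
```

A program could, for example, wrap `sys.stdout.buffer` this way at startup and write all messages through the resulting stream.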

deemp linked a pull request Nov 29, 2024 that will close this issue

Add simple unicode escaping #590

Open

deemp mentioned this issue Nov 29, 2024

Add simple unicode escaping #590

Open

deemp removed a link to a pull request Dec 3, 2024

Add simple unicode escaping #590

Open
