Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bad codepoint range #423

Closed
ccleve opened this issue Oct 31, 2022 · 2 comments
Closed

Bad codepoint range #423

ccleve opened this issue Oct 31, 2022 · 2 comments

Comments

@ccleve
Copy link

ccleve commented Oct 31, 2022

I'm trying to compile the exact example here:

https://re2c.org/manual/manual_rust.html#encoding-support

I've copied unicode_categories.re into the test directory and attempted to compile the code using re2rust. I get this error:

tests/pipeline/tokenizers/re2c/unicode_categories.re:2:68: error: bad code point range: '0xF8 - 0x2C1'

What am I doing wrong?

Also: I see that unicode_categories.re is three years old at this point. Should it be regenerated with a more recent version of unicode?

@skvadrik
Copy link
Owner

What am I doing wrong?

Can you provide you command line? Did you forget --utf8 argument?

Also: I see that unicode_categories.re is three years old at this point. Should it be regenerated with a more recent version of unicode?

Yes, it should. We even had a project for rewriting the generator (#235 (comment)) but that somehow got stuck.

@ccleve
Copy link
Author

ccleve commented Oct 31, 2022

Yes, I did forget the --ut8 argument. Thank you, works now.

I'll take a look at the regeneration code. Maybe I can help.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants