-
Notifications
You must be signed in to change notification settings - Fork 24
Issues: NMZivkovic/BertTokenizers
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[BUG] BertUncasedBaseTokenizer ran forever with input "SixGe1−xH"
#28
opened Jun 20, 2024 by
darren-zdc
Word piece tokenizer never exits if a sub-word token doesn't exist
#26
opened Aug 1, 2023 by
matteocontrini
Looks for Vocabularies in source dir instead of (e.g.) bin/release/net6/
#21
opened May 16, 2023 by
ctwardy
Words surrounded by backwards quotation marks causing inaccurate tokenization results
#17
opened Feb 16, 2023 by
rghavimi
ProTip!
Exclude everything labeled
bug
with -label:bug.