-
Notifications
You must be signed in to change notification settings - Fork 586
Open
Description
Disclaimer
Issue
But there seem to be some characters that should be ignored in Urdu :
https://github.com/hermitdave/FrequencyWords/blob/master/content/2018/ur/ur_full.txt
See lines 3 and 5
What makes me think these are not words but really punctuation as someone who doesn't speak Urdu are the characters' name :
ARABIC COMMA and ARABIC FULL STOP
Potential fix
If you made sure this is not just me that has not enough knowledge of the language but a real issue, my fix would be to add
،
and
۔
in the ignored characters list
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels