JamSpell | Modern spell checking library - accurate | Natural Language Processing library
kandi X-RAY | JamSpell Summary
kandi X-RAY | JamSpell Summary
JamSpell is a spell checking library with following features:.
Support
Quality
Security
License
Reuse
Top functions reviewed by kandi - BETA
Currently covering the most popular Java, JavaScript and Python libraries. See a Sample of JamSpell
JamSpell Key Features
JamSpell Examples and Code Snippets
Community Discussions
Trending Discussions on JamSpell
QUESTION
I trained a model using the commands found here...
https://github.com/bakwc/JamSpell#train
There is no problem with the English text. But I need to train a similar model based on Hindi Corpus.
I have a file that can be replaced with sherlockholmes.txt
but I am not sure what should I refer to instead of alphabet_en.txt
.
Should I just collect all Unicode characters used in Hindi in a text file?
...ANSWER
Answered 2021-Jan-17 at 08:45Yes, following the example for English you are supposed to collect all of the characters that are used in the Hindi text of the corpus (here stored in sherlockholmes.txt
file).
I guess these characters help the algorithm figure out which characters compose words and which characters are not (e.g. punctuation).
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install JamSpell
en.tar.gz (35Mb)
fr.tar.gz (31Mb)
ru.tar.gz (38Mb)
Support
Reuse Trending Solutions
Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items
Find more librariesStay Updated
Subscribe to our newsletter for trending solutions and developer bootcamps
Share this Page