NLP Tools For Language Detection
by Sri Balaji J Updated: Jun 21, 2022
Solution Kit
Language identification can be an important step in a Natural Language Processing (NLP) problem. It involves trying to predict the natural language of a piece of text. It is important to know the language of the text before other actions (i.e. translation/ sentiment analysis) can be taken.
NLP packages
These are general NLP libraries that support language detection.
fastTextby facebookresearch
Library for fast text representation and classification.
fastTextby facebookresearch
HTML
24613
Version:v0.9.2
License: Permissive (MIT)
TextBlobby sloria
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
TextBlobby sloria
Python
8586
Version:0.7.0
License: Permissive (MIT)
polyglotby aboSamoor
Multilingual text (NLP) processing toolkit
polyglotby aboSamoor
Python
2166
Version:Current
License: Others (Non-SPDX)
spaCyby explosion
💫 Industrial-strength Natural Language Processing (NLP) in Python
spaCyby explosion
Python
26205
Version:v3.5.3
License: Permissive (MIT)
Language detection packages
These packages are specifically language detection. These make it faster and easier.
langdetectby Mimino666
Port of Google's language-detection library to Python.
langdetectby Mimino666
Python
1447
Version:1.0.8
License: Others (Non-SPDX)
pycld3by bsolomon1124
Python3 bindings for the Compact Language Detector v3 (CLD3)
pycld3by bsolomon1124
C++
121
Version:Current
License: Permissive (Apache-2.0)
fast-langidby ffreemt
fast-langidby ffreemt
Python
0
Version:Current
License: No License