pythainlp | Thai Natural Language Processing in Python | Natural Language Processing library
kandi X-RAY | pythainlp Summary
kandi X-RAY | pythainlp Summary
pythainlp is a Python library typically used in Artificial Intelligence, Natural Language Processing applications. pythainlp has no bugs, it has no vulnerabilities, it has build file available, it has a Permissive License and it has medium support. You can install using 'pip install pythainlp' or download it from GitHub, GitLab, PyPI.
PyThaiNLP is a Python package for text processing and linguistic analysis, similar to NLTK with focus on Thai language. PyThaiNLP เป็นไลบารีภาษาไพทอนสำหรับประมวลผลภาษาธรรมชาติ คล้ายกับ NLTK โดยเน้นภาษาไทย ดูรายละเอียดภาษาไทยได้ที่ README_TH.MD. Since PyThaiNLP 3.0, We will end support PyThaiNLP on Python 3.6. Python 3.6 users can use PyThaiNLP 2.3.2.
PyThaiNLP is a Python package for text processing and linguistic analysis, similar to NLTK with focus on Thai language. PyThaiNLP เป็นไลบารีภาษาไพทอนสำหรับประมวลผลภาษาธรรมชาติ คล้ายกับ NLTK โดยเน้นภาษาไทย ดูรายละเอียดภาษาไทยได้ที่ README_TH.MD. Since PyThaiNLP 3.0, We will end support PyThaiNLP on Python 3.6. Python 3.6 users can use PyThaiNLP 2.3.2.
Support
Quality
Security
License
Reuse
Support
pythainlp has a medium active ecosystem.
It has 819 star(s) with 244 fork(s). There are 43 watchers for this library.
There were 7 major release(s) in the last 6 months.
There are 33 open issues and 266 have been closed. On average issues are closed in 85 days. There are 2 open pull requests and 0 closed requests.
It has a neutral sentiment in the developer community.
The latest version of pythainlp is 5.0.4
Quality
pythainlp has 0 bugs and 0 code smells.
Security
pythainlp has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.
pythainlp code analysis shows 0 unresolved vulnerabilities.
There are 0 security hotspots that need review.
License
pythainlp is licensed under the Apache-2.0 License. This license is Permissive.
Permissive licenses have the least restrictions, and you can use them in most projects.
Reuse
pythainlp releases are available to install and integrate.
Deployable package is available in PyPI.
Build file is available. You can build the component from source.
Installation instructions, examples and code snippets are available.
Top functions reviewed by kandi - BETA
kandi has reviewed pythainlp and discovered the below as its top functions. This is intended to give you an instant insight into pythainlp implemented functionality, and help decide if they suit your requirements.
- Split text into sentences
- Tokenize text
- Applies postprocessors to segments
- Convert a list to a string
- Get the full path to the corpus
- Download a corpus catalog
- The path to the corpus database
- Return the corpus database URL
- Return a list of nlp characters
- Perform forward computation
- Correct the spelling of a word
- Get the transliteration dictionary
- Reads the corpus from a file
- Benchmark a reference sequence
- Correct a word
- Convert a custom dictionary into a list of tuples
- Parse the text into a single sentence
- Tag words
- Get the full path to a corpus
- Translates the text into a single string
- Loads the model
- Translate the given text
- Translate the input text
- Post - process words
- Augment a sentence
- Corrects the given list of words
- Set of THAI stopwords
Get all kandi verified functions for this library.
pythainlp Key Features
No Key Features are available at this moment for pythainlp.
pythainlp Examples and Code Snippets
No Code Snippets are available at this moment for pythainlp.
Community Discussions
Trending Discussions on pythainlp
QUESTION
how can i remove stopwords in dataframe (Python)
Asked 2020-Mar-12 at 08:12
this is my code and it doesn't work
...ANSWER
Answered 2020-Mar-12 at 08:12Assuming your stopwords
is a list and df['tokens']
is a list of words or tokens each.
Simple Method:
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install pythainlp
PyThaiNLP 2 requires Python 3.6+. Python 2.7 users can use PyThaiNLP 1.6. See 2.0 change log | Upgrading from 1.7 | Upgrading ThaiNER from 1.7
PyThaiNLP Get Started notebook | API document | Tutorials
Official website | PyPI | Facebook page
Who uses PyThaiNLP?
Model cards - for technical details, caveats, and ethical considerations of the models developed and used in PyThaiNLP
This will install the latest stable release of PyThaiNLP.
Stable release: pip install --upgrade pythainlp
Pre-release (near ready): pip install --upgrade --pre pythainlp
Development (likely to break things): pip install https://github.com/PyThaiNLP/pythainlp/archive/dev.zip
Some functionalities, like Thai WordNet, may require extra packages. To install those requirements, specify a set of [name] immediately after pythainlp:. For dependency details, look at extras variable in setup.py.
full (install everything)
attacut (to support attacut, a fast and accurate tokenizer)
benchmarks (for word tokenization benchmarking)
icu (for ICU, International Components for Unicode, support in transliteration and tokenization)
ipa (for IPA, International Phonetic Alphabet, support in transliteration)
ml (to support ULMFiT models for classification)
thai2fit (for Thai word vector)
thai2rom (for machine-learnt romanization)
wordnet (for Thai WordNet API)
PyThaiNLP Get Started notebook | API document | Tutorials
Official website | PyPI | Facebook page
Who uses PyThaiNLP?
Model cards - for technical details, caveats, and ethical considerations of the models developed and used in PyThaiNLP
This will install the latest stable release of PyThaiNLP.
Stable release: pip install --upgrade pythainlp
Pre-release (near ready): pip install --upgrade --pre pythainlp
Development (likely to break things): pip install https://github.com/PyThaiNLP/pythainlp/archive/dev.zip
Some functionalities, like Thai WordNet, may require extra packages. To install those requirements, specify a set of [name] immediately after pythainlp:. For dependency details, look at extras variable in setup.py.
full (install everything)
attacut (to support attacut, a fast and accurate tokenizer)
benchmarks (for word tokenization benchmarking)
icu (for ICU, International Components for Unicode, support in transliteration and tokenization)
ipa (for IPA, International Phonetic Alphabet, support in transliteration)
ml (to support ULMFiT models for classification)
thai2fit (for Thai word vector)
thai2rom (for machine-learnt romanization)
wordnet (for Thai WordNet API)
Support
Please do fork and create a pull request :)For style guide and other information, including references to algorithms we use, please refer to our contributing page.
Find more information at:
Reuse Trending Solutions
Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items
Find more librariesStay Updated
Subscribe to our newsletter for trending solutions and developer bootcamps
Share this Page