pythainlp | Thai Natural Language Processing in Python | Natural Language Processing library

 by   PyThaiNLP Python Version: 5.0.4 License: Apache-2.0

kandi X-RAY | pythainlp Summary

kandi X-RAY | pythainlp Summary

pythainlp is a Python library typically used in Artificial Intelligence, Natural Language Processing applications. pythainlp has no bugs, it has no vulnerabilities, it has build file available, it has a Permissive License and it has medium support. You can install using 'pip install pythainlp' or download it from GitHub, GitLab, PyPI.

PyThaiNLP is a Python package for text processing and linguistic analysis, similar to NLTK with focus on Thai language. PyThaiNLP เป็นไลบารีภาษาไพทอนสำหรับประมวลผลภาษาธรรมชาติ คล้ายกับ NLTK โดยเน้นภาษาไทย ดูรายละเอียดภาษาไทยได้ที่ README_TH.MD. Since PyThaiNLP 3.0, We will end support PyThaiNLP on Python 3.6. Python 3.6 users can use PyThaiNLP 2.3.2.
Support
    Quality
      Security
        License
          Reuse

            kandi-support Support

              pythainlp has a medium active ecosystem.
              It has 819 star(s) with 244 fork(s). There are 43 watchers for this library.
              There were 7 major release(s) in the last 6 months.
              There are 33 open issues and 266 have been closed. On average issues are closed in 85 days. There are 2 open pull requests and 0 closed requests.
              It has a neutral sentiment in the developer community.
              The latest version of pythainlp is 5.0.4

            kandi-Quality Quality

              pythainlp has 0 bugs and 0 code smells.

            kandi-Security Security

              pythainlp has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.
              pythainlp code analysis shows 0 unresolved vulnerabilities.
              There are 0 security hotspots that need review.

            kandi-License License

              pythainlp is licensed under the Apache-2.0 License. This license is Permissive.
              Permissive licenses have the least restrictions, and you can use them in most projects.

            kandi-Reuse Reuse

              pythainlp releases are available to install and integrate.
              Deployable package is available in PyPI.
              Build file is available. You can build the component from source.
              Installation instructions, examples and code snippets are available.

            Top functions reviewed by kandi - BETA

            kandi has reviewed pythainlp and discovered the below as its top functions. This is intended to give you an instant insight into pythainlp implemented functionality, and help decide if they suit your requirements.
            • Split text into sentences
            • Tokenize text
            • Applies postprocessors to segments
            • Convert a list to a string
            • Get the full path to the corpus
            • Download a corpus catalog
            • The path to the corpus database
            • Return the corpus database URL
            • Return a list of nlp characters
            • Perform forward computation
            • Correct the spelling of a word
            • Get the transliteration dictionary
            • Reads the corpus from a file
            • Benchmark a reference sequence
            • Correct a word
            • Convert a custom dictionary into a list of tuples
            • Parse the text into a single sentence
            • Tag words
            • Get the full path to a corpus
            • Translates the text into a single string
            • Loads the model
            • Translate the given text
            • Translate the input text
            • Post - process words
            • Augment a sentence
            • Corrects the given list of words
            • Set of THAI stopwords
            Get all kandi verified functions for this library.

            pythainlp Key Features

            No Key Features are available at this moment for pythainlp.

            pythainlp Examples and Code Snippets

            No Code Snippets are available at this moment for pythainlp.

            Community Discussions

            Trending Discussions on pythainlp

            QUESTION

            how can i remove stopwords in dataframe (Python)
            Asked 2020-Mar-12 at 08:12

            this is my code and it doesn't work

            ...

            ANSWER

            Answered 2020-Mar-12 at 08:12

            Assuming your stopwords is a list and df['tokens'] is a list of words or tokens each.
            Simple Method:

            Source https://stackoverflow.com/questions/60649764

            Community Discussions, Code Snippets contain sources that include Stack Exchange Network

            Vulnerabilities

            No vulnerabilities reported

            Install pythainlp

            PyThaiNLP 2 requires Python 3.6+. Python 2.7 users can use PyThaiNLP 1.6. See 2.0 change log | Upgrading from 1.7 | Upgrading ThaiNER from 1.7
            PyThaiNLP Get Started notebook | API document | Tutorials
            Official website | PyPI | Facebook page
            Who uses PyThaiNLP?
            Model cards - for technical details, caveats, and ethical considerations of the models developed and used in PyThaiNLP
            This will install the latest stable release of PyThaiNLP.
            Stable release: pip install --upgrade pythainlp
            Pre-release (near ready): pip install --upgrade --pre pythainlp
            Development (likely to break things): pip install https://github.com/PyThaiNLP/pythainlp/archive/dev.zip
            Some functionalities, like Thai WordNet, may require extra packages. To install those requirements, specify a set of [name] immediately after pythainlp:. For dependency details, look at extras variable in setup.py.
            full (install everything)
            attacut (to support attacut, a fast and accurate tokenizer)
            benchmarks (for word tokenization benchmarking)
            icu (for ICU, International Components for Unicode, support in transliteration and tokenization)
            ipa (for IPA, International Phonetic Alphabet, support in transliteration)
            ml (to support ULMFiT models for classification)
            thai2fit (for Thai word vector)
            thai2rom (for machine-learnt romanization)
            wordnet (for Thai WordNet API)

            Support

            Please do fork and create a pull request :)For style guide and other information, including references to algorithms we use, please refer to our contributing page.
            Find more information at:

            Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items

            Find more libraries
            Install
          • PyPI

            pip install pythainlp

          • CLONE
          • HTTPS

            https://github.com/PyThaiNLP/pythainlp.git

          • CLI

            gh repo clone PyThaiNLP/pythainlp

          • sshUrl

            git@github.com:PyThaiNLP/pythainlp.git

          • Stay Updated

            Subscribe to our newsletter for trending solutions and developer bootcamps

            Agree to Sign up and Terms & Conditions

            Share this Page

            share link

            Consider Popular Natural Language Processing Libraries

            transformers

            by huggingface

            funNLP

            by fighting41love

            bert

            by google-research

            jieba

            by fxsjy

            Python

            by geekcomputers

            Try Top Libraries by PyThaiNLP

            lexicon-thai

            by PyThaiNLPPython

            wisesight-sentiment

            by PyThaiNLPJupyter Notebook

            WangChanGLM

            by PyThaiNLPJupyter Notebook

            attacut

            by PyThaiNLPPython

            pythaiasr

            by PyThaiNLPPython