KeyBERT | Minimal keyword extraction with BERT | Natural Language Processing library

 by   MaartenGr Python Version: v0.7.0 License: MIT

kandi X-RAY | KeyBERT Summary

kandi X-RAY | KeyBERT Summary

KeyBERT is a Python library typically used in Artificial Intelligence, Natural Language Processing, Bert applications. KeyBERT has no bugs, it has no vulnerabilities, it has build file available, it has a Permissive License and it has medium support. You can download it from GitHub.

Although there are already many methods available for keyword generation (e.g., Rake, YAKE!, TF-IDF, etc.) I wanted to create a very basic, but powerful method for extracting keywords and keyphrases. This is where KeyBERT comes in! Which uses BERT-embeddings and simple cosine similarity to find the sub-phrases in a document that are the most similar to the document itself. First, document embeddings are extracted with BERT to get a document-level representation. Then, word embeddings are extracted for N-gram words/phrases. Finally, we use cosine similarity to find the words/phrases that are the most similar to the document. The most similar words could then be identified as the words that best describe the entire document. KeyBERT is by no means unique and is created as a quick and easy method for creating keywords and keyphrases. Although there are many great papers and solutions out there that use BERT-embeddings (e.g., 1, 2, 3, ), I could not find a BERT-based solution that did not have to be trained from scratch and could be used for beginners (correct me if I'm wrong!). Thus, the goal was a pip install keybert and at most 3 lines of code in usage.
Support
    Quality
      Security
        License
          Reuse

            kandi-support Support

              KeyBERT has a medium active ecosystem.
              It has 2419 star(s) with 276 fork(s). There are 27 watchers for this library.
              OutlinedDot
              It had no major release in the last 12 months.
              There are 27 open issues and 119 have been closed. On average issues are closed in 61 days. There are 1 open pull requests and 0 closed requests.
              It has a neutral sentiment in the developer community.
              The latest version of KeyBERT is v0.7.0

            kandi-Quality Quality

              KeyBERT has 0 bugs and 3 code smells.

            kandi-Security Security

              KeyBERT has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.
              KeyBERT code analysis shows 0 unresolved vulnerabilities.
              There are 0 security hotspots that need review.

            kandi-License License

              KeyBERT is licensed under the MIT License. This license is Permissive.
              Permissive licenses have the least restrictions, and you can use them in most projects.

            kandi-Reuse Reuse

              KeyBERT releases are available to install and integrate.
              Build file is available. You can build the component from source.
              Installation instructions, examples and code snippets are available.
              It has 489 lines of code, 26 functions and 19 files.
              It has medium code complexity. Code complexity directly impacts maintainability of the code.

            Top functions reviewed by kandi - BETA

            kandi has reviewed KeyBERT and discovered the below as its top functions. This is intended to give you an instant insight into KeyBERT implemented functionality, and help decide if they suit your requirements.
            • Embed documents
            • Embed a document using the tokenizer
            Get all kandi verified functions for this library.

            KeyBERT Key Features

            No Key Features are available at this moment for KeyBERT.

            KeyBERT Examples and Code Snippets

            Publications
            Pythondot img1Lines of Code : 67dot img1License : Permissive (Apache-2.0)
            copy iconCopy
            @inproceedings{reimers-2019-sentence-bert,
                title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
                author = "Reimers, Nils and Gurevych, Iryna",
                booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Nat  

            Community Discussions

            Trending Discussions on KeyBERT

            QUESTION

            KeyBERT package is not working on Google Colab
            Asked 2021-Jun-24 at 03:46

            I'm using KeyBERT on Google Colab to extract keywords from the text.

            ...

            ANSWER

            Answered 2021-Jun-24 at 03:46

            I couldn't reproduce this issue with the code you've provided but from the provided error message I believe you're just missing an 's' in the model name so just make sure that the model name is as follows:

            distilbert-base-nli-mean-tokens

            and not

            distilbert-base-nli-mean-token

            Also refer to this link for all models available for use.

            Source https://stackoverflow.com/questions/68107887

            Community Discussions, Code Snippets contain sources that include Stack Exchange Network

            Vulnerabilities

            No vulnerabilities reported

            Install KeyBERT

            Installation can be done using pypi:.

            Support

            For any new features, suggestions and bugs create an issue on GitHub. If you have any questions check and ask questions on community page Stack Overflow .
            Find more information at:

            Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items

            Find more libraries
            CLONE
          • HTTPS

            https://github.com/MaartenGr/KeyBERT.git

          • CLI

            gh repo clone MaartenGr/KeyBERT

          • sshUrl

            git@github.com:MaartenGr/KeyBERT.git

          • Stay Updated

            Subscribe to our newsletter for trending solutions and developer bootcamps

            Agree to Sign up and Terms & Conditions

            Share this Page

            share link

            Consider Popular Natural Language Processing Libraries

            transformers

            by huggingface

            funNLP

            by fighting41love

            bert

            by google-research

            jieba

            by fxsjy

            Python

            by geekcomputers

            Try Top Libraries by MaartenGr

            BERTopic

            by MaartenGrPython

            PolyFuzz

            by MaartenGrPython

            Concept

            by MaartenGrPython

            soan

            by MaartenGrPython

            cTFIDF

            by MaartenGrPython