spacymoji | 💙 Emoji handling and meta data | Icon library
kandi X-RAY | spacymoji Summary
spacymoji is a Python library typically used in User Interface and Icon applications. It has no reported bugs or vulnerabilities, a build file is available, it has a permissive license, and it has low support. You can install it with 'pip install spacymoji' or download it from GitHub or PyPI.
spaCy extension and pipeline component for adding emoji metadata to Doc objects. It detects emoji consisting of one or more Unicode characters, and can optionally merge multi-character emoji (combined pictures, emoji with skin tone modifiers) into one token. Human-readable emoji descriptions are added as a custom attribute, and an optional lookup table can be provided for your own descriptions. The extension sets the custom Doc, Token and Span attributes ._.is_emoji, ._.emoji_desc, ._.has_emoji and ._.emoji. You can read more about custom pipeline components and extension attributes in the spaCy documentation. Emoji are matched using spaCy's PhraseMatcher, and looked up in the data table provided by the emoji package.
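To see why merging multi-character emoji matters, it may help to look at how a single visible emoji can span several Unicode code points. The sketch below uses only Python's standard library and does not call spacymoji itself; the `describe` helper is a hypothetical name introduced for illustration.

```python
import unicodedata

# Thumbs up followed by a dark skin tone modifier:
# one visible emoji, but two separate code points.
thumbs_up_dark = "\U0001F44D\U0001F3FF"


def describe(text):
    """Return (code point, Unicode name) pairs for each character in text."""
    return [
        (f"U+{ord(ch):04X}", unicodedata.name(ch, "<unnamed>"))
        for ch in text
    ]


# Number of code points in the string, not the number of visible emoji.
print(len(thumbs_up_dark))
for code_point, name in describe(thumbs_up_dark):
    print(code_point, name)
```

Without a merge step, a tokenizer that splits on symbol characters could treat the base emoji and its modifier as two tokens, which is the situation the optional merging described above is meant to handle.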
Support
spacymoji has a low active ecosystem.
It has 175 stars and 20 forks. There are 17 watchers for this library.
It had no major release in the last 12 months.
There are 5 open issues and 7 have been closed. On average issues are closed in 271 days. There are no pull requests.
It has a neutral sentiment in the developer community.
The latest version of spacymoji is 3.1.0.
Quality
spacymoji has no bugs reported.
Security
spacymoji has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.
License
spacymoji is licensed under the MIT License. This license is Permissive.
Permissive licenses have the least restrictions, and you can use them in most projects.
Reuse
spacymoji releases are available to install and integrate.
Deployable package is available in PyPI.
Build file is available. You can build the component from source.
Installation instructions, examples and code snippets are available.
Top functions reviewed by kandi - BETA
kandi has reviewed spacymoji and identified the functions below as its top functions. This is intended to give you a quick overview of the functionality spacymoji implements and to help you decide whether it suits your requirements.
- Initialize the emoji.
- Apply tokenization.
- Create an emoji.
- Get the emoji description.
- Iterate over emoji tags.
- Return True if tokens have emoji.
spacymoji Key Features
No Key Features are available at this moment for spacymoji.
spacymoji Examples and Code Snippets
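from sklearn.feature_extraction.text import CountVectorizer

texts = ["hello :)"]
# Match words of two or more characters, or runs of ':' and ')',
# so that the emoticon ":)" survives as its own token.
cv = CountVectorizer(stop_words=None, token_pattern=r'(?u)\b\w\w+\b|[:)]+')
cv_fit = cv.fit_transform(texts)
# vocabulary_ maps each extracted token to its column index in cv_fit.
print(cv.vocabulary_)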
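The token pattern in the snippet above can be checked in isolation with Python's standard `re` module. This is a minimal sketch that assumes CountVectorizer's default behaviour of lowercasing the text and collecting every match of the pattern; `tokenize` is a hypothetical helper, not part of scikit-learn or spacymoji.

```python
import re

# Same pattern as the CountVectorizer snippet: words of two or more
# characters, or runs made up only of ':' and ')'.
TOKEN_PATTERN = re.compile(r"(?u)\b\w\w+\b|[:)]+")


def tokenize(text):
    """Lowercase the text and return every non-overlapping pattern match."""
    return TOKEN_PATTERN.findall(text.lower())


print(tokenize("hello :)"))
print(tokenize("hi :-)"))
```

Because the character class covers only ':' and ')', an emoticon such as ":-)" is split into separate pieces. The class would need to be extended to cover other emoticons.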
Community Discussions
Trending Discussions on spacymoji
QUESTION
Is it possible to parse emojis using spaCy?
Asked 2018-Sep-28 at 14:02
Is it possible to tokenize emojis like :), :(, ;~( properly using the spaCy Python library? E.g., if I run the following code:
ANSWER
Answered 2018-Feb-26 at 09:23
Yes, spaCy actually includes a pretty comprehensive list of text-based emoticons as part of its tokenizer exceptions. So using your example above and printing the individual tokens, the emoticon is tokenized correctly:
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install spacymoji
spacymoji requires spaCy v3.0.0 or higher. For spaCy v2.x, install spacymoji==2.0.0.
Support
For new features, suggestions, and bugs, create an issue on GitHub.
If you have any questions, search for existing answers or ask on the Stack Overflow community page.