quanteda.textmodels | Text scaling and classification models for quanteda | Machine Learning library
kandi X-RAY | quanteda.textmodels Summary
An R package adding text scaling models and classifiers for quanteda. Prior to quanteda v2, many of these were part of that package. Early development was supported by the European Research Council grant ERC-2011-StG 283794-QUANTESS. For more details, see
Community Discussions
Trending Discussions on quanteda.textmodels
QUESTION
I am new to quantitative text analysis, and I am attempting to extract the keywords associated with a particular classification category from the output of a naive Bayes classifier. I am running the example below (classifying movie reviews as either positive or negative). I want two vectors, one containing the keywords associated with the positive category and one containing those associated with the negative category. Am I right in saying I should be focusing on the 'Estimated Feature Scores' from the summary() output, and if so, how do I interpret these?
...
ANSWER
Answered 2020-Sep-14 at 15:26
If you just want to know the most negative and positive words, consider textstat_keyness() on a dfm created from the entire corpus, partitioned into positive and negative reviews. This does not create two word vectors, but a single word vector with a score indicating the strength of association with the negative or positive category.
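The following is a minimal sketch of that suggestion, not the original answer's code. The toy texts and the `sentiment` document variable are made up for illustration and stand in for the movie review corpus. It assumes a recent quanteda release, where textstat_keyness() is provided by the quanteda.textstats package. Splitting the single keyness table by the sign of the score is one possible way to get the two vectors the question asks for.

```r
library(quanteda)
library(quanteda.textstats)  # assumption: quanteda >= 3, where textstat_keyness() lives here

# Hypothetical toy data standing in for the movie reviews
txt <- c(d1 = "a wonderful moving film with a brilliant cast",
         d2 = "brilliant and wonderful from start to finish",
         d3 = "a dull boring film with a terrible script",
         d4 = "terrible acting and a boring plot")
corp <- corpus(txt, docvars = data.frame(sentiment = c("pos", "pos", "neg", "neg")))

# Build the dfm from the entire corpus and drop common stopwords
dfmat <- dfm(tokens(corp, remove_punct = TRUE))
dfmat <- dfm_remove(dfmat, stopwords("en"))

# Positive reviews form the target; all other documents form the reference
key <- textstat_keyness(dfmat, target = docvars(dfmat, "sentiment") == "pos")

# One table, one score per feature: the sign gives the direction of association
pos_words <- key$feature[key$chi2 > 0]
neg_words <- key$feature[key$chi2 < 0]
```

With a real corpus you would usually also filter on the p column of the result, since many features have scores close to zero.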
QUESTION
From a process like this:
...
ANSWER
Answered 2020-Jul-03 at 16:31
The object quant_dfm is not a data.frame, but rather an object of class dfm or document-feature matrix. Therefore, you cannot simply add a new column.
One approach might be to bind the topic proportion onto the document metadata:
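The code from the original answer is not reproduced here, so the following is only a sketch of the idea. The texts and the `topic_prop` values are hypothetical placeholders for whatever the topic model returned, with one value per document assumed. It stores them as a document variable with docvars() and then, if a plain data.frame is needed, converts the dfm and binds the metadata alongside it.

```r
library(quanteda)

# Hypothetical dfm standing in for quant_dfm
quant_dfm <- dfm(tokens(c(d1 = "taxes and public spending",
                          d2 = "schools and public health",
                          d3 = "taxes on health spending")))

# Hypothetical topic proportions, one per document (placeholder values)
topic_prop <- c(0.7, 0.2, 0.5)

# Attach the values to the document metadata rather than to the feature columns
docvars(quant_dfm, "topic_prop") <- topic_prop

# If a data.frame is required, convert the dfm and bind the docvars next to it
doc_df <- cbind(convert(quant_dfm, to = "data.frame"), docvars(quant_dfm))
```

Keeping the values in docvars means they stay aligned with the documents when the dfm is subset or grouped later.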
QUESTION
Is there any measure that could show whether the cleaning process has improved the quality of the text for text analysis techniques, for example for lsa from quanteda.textmodels?
Example from the package:
...
ANSWER
Answered 2020-Jun-18 at 18:00
In the upcoming version of quanteda (available on GitHub), textstat_summary() is added. You can use it to check how clean the texts are:
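The snippet from the original answer is missing here, so this is a hedged sketch of how such a check might look. It assumes a current release in which textstat_summary() is provided by the quanteda.textstats package, and the two example texts are made up. Comparing the summary before and after cleaning shows how many punctuation marks, numbers, symbols and URLs remain.

```r
library(quanteda)
library(quanteda.textstats)  # assumption: textstat_summary() is provided here in current releases

# Hypothetical noisy texts
txt <- c(d1 = "Visit https://example.com now!!! Only 100 left.",
         d2 = "Plain text, with 2 numbers: 3 and 4.")

toks_raw <- tokens(txt)
toks_clean <- tokens(txt, remove_punct = TRUE, remove_numbers = TRUE,
                     remove_symbols = TRUE, remove_url = TRUE)

# One row per document, with counts of tokens, types, puncts, numbers, urls, etc.
sum_raw <- textstat_summary(toks_raw)
sum_clean <- textstat_summary(toks_clean)

sum_raw
sum_clean
```

The summary describes how clean the input is. It does not by itself say whether a downstream model such as LSA performs better, which would need a separate evaluation.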
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported