newsmap | Newsmap : geographical news classifier | Natural Language Processing library
kandi X-RAY | newsmap Summary
Semi-supervised Bayesian model for geographical document classification. Newsmap automatically constructs a large geographical dictionary from a corpus to classify documents accurately. Currently, the newsmap package contains seed dictionaries in English, German, French, Spanish, Portuguese, Russian, Italian, Hebrew, Arabic, Japanese, and Chinese. The details of the algorithm are explained in Newsmap: semi-supervised approach to geographical news classification. newsmap has also been used in scientific research in various fields (Google Scholar).
newsmap Examples and Code Snippets
require(newsmap)
## Loading required package: newsmap
require(quanteda)
## Loading required package: quanteda
## Package version: 3.2.0.9000
## Unicode version: 13.0
## ICU version: 66.1
## Parallel computing: 6 of 6 threads used.
## See https://quan
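# Install the released version from CRAN
install.packages("newsmap")
# Or install the development version from GitHub
install.packages("devtools")
devtools::install_github("koheiw/newsmap")
# Download the example corpus to the home directory
download.file('https://www.dropbox.com/s/e19kslwhuu9yc2z/yahoo-news.RDS?dl=1', '~/yahoo-news.RDS')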
Community Discussions
Trending Discussions on newsmap
QUESTION
I have trained a newsmap model in the Newsmap package for quanteda in R and am trying to export the large dictionary it constructed based on my corpus (not the seed dictionary). I have tried this code, but it only gives me the 10 most associated terms per country in a list format, which I also fail to extract in order to form a dictionary object I can use in R.
...
ANSWER
Answered 2021-May-31 at 08:20
You only need to extract the names of the vectors, with the desired number of words passed to n.
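A minimal sketch of that idea, assuming the coefficients come back as a list with one named numeric vector per country. To keep the snippet runnable without a trained model, `coefs` is a stand-in with made-up words and scores; with a real model (here given the hypothetical name `model`) the list would come from `coef(model, n = ...)`.

```r
library(quanteda)

# Stand-in for coef(model, n = 1000): one named numeric vector per country.
# The words and scores below are made up for illustration only.
coefs <- list(
  US = c(washington = 9.1, congress = 7.4, senate = 6.8),
  JP = c(tokyo = 8.9, yen = 7.2, nikkei = 6.5)
)

# Keep only the names (the words) of each vector and wrap them in a
# quanteda dictionary, one key per country code.
dict <- dictionary(lapply(coefs, names))

# With a trained model this would become, under the same assumption:
# dict <- dictionary(lapply(coef(model, n = 1000), names))
```

The resulting object can be passed to tokens_lookup() like any other quanteda dictionary.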
QUESTION
I'm trying to do a tokens_lookup() with the Newsmap package on a 1980s German-language computer magazine from Switzerland. But countries had other names back then, so I need to include the Soviet Union, Czechoslovakia, etc. I also need to replace every German "ß" with "ss" (we don't use ß in Switzerland). How can I change values in a dictionary?
I have tried:
...
ANSWER
Answered 2020-Nov-27 at 00:26
You can find the original dictionary in YAML format in Newsmap's repository, so you can edit it in a text editor and load it using dictionary(file = "xxx.yml").
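The edit-and-reload approach might look roughly like the sketch below. The YAML excerpt is a hypothetical stand-in for the German seed dictionary file (the real file is much larger), and the keys SU and CS and their entries are assumptions added for illustration. The "ß" replacement is done on the file text before loading.

```r
library(quanteda)

# Hypothetical excerpt in the nested continent > region > country layout;
# SU and CS are illustrative keys for the historical countries.
yaml_lines <- c(
  "EUROPE:",
  "  EAST:",
  "    SU: [sowjetunion, udssr]",
  "    CS: [tschechoslowakei]",
  "  WEST:",
  "    GB: [gro\u00dfbritannien, london]"
)

# Swiss spelling: replace every "ß" (U+00DF) with "ss" before loading
yaml_lines <- gsub("\u00df", "ss", yaml_lines, fixed = TRUE)

# Write the edited text to a .yml file and load it as a dictionary
path <- tempfile(fileext = ".yml")
writeLines(yaml_lines, path)
dict <- dictionary(file = path)

# Look up country-level keys (third level of the nested dictionary)
toks <- tokens("Die Sowjetunion und Grossbritannien verhandeln in London.")
res <- tokens_lookup(toks, dict, levels = 3)
```

Editing the text before it is parsed keeps the change in one place, so the same gsub() call would apply to every entry in the full file.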
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported