citlab-article-separation-new | Modules used for separating articles
kandi X-RAY | citlab-article-separation-new Summary
kandi X-RAY | citlab-article-separation-new Summary
citlab-article-separation-new is a Python library. citlab-article-separation-new has no bugs, it has no vulnerabilities, it has build file available and it has low support. You can download it from GitHub.
This repository is part of the European Union's Horizon 2020 project NewsEye and is mainly used for separating articles in (historical) newspapers and similar documents. The purpose of the NewsEye project is to enable historians and humanities scholars to investigate a great amount of newspaper collections. The newspaper pages are digitized and are available as scanned images. To ensure efficient work, the data processing steps should be as automatic as possible. Generally, newspapers are structured into large numbers of articles. These usually contain a distinct piece of content or describe a certain topic and can mostly be understood without any context. Newspaper articles are crucial entities for historians and humanities scholars who focus on a specific research area and are only interested in articles related to that topic. Additionally, some natural language processing applications, like e.g. topic modeling or event detection, rely on a logical structuring of the underlying text, to be able to extract meaningful information. For this reason it is important to tackle the article separation (AS) task, which tries to form coherent articles, based on previously detected baselines and their respective text. In the following image a schematic overview of the overall AS workflow can be found.
This repository is part of the European Union's Horizon 2020 project NewsEye and is mainly used for separating articles in (historical) newspapers and similar documents. The purpose of the NewsEye project is to enable historians and humanities scholars to investigate a great amount of newspaper collections. The newspaper pages are digitized and are available as scanned images. To ensure efficient work, the data processing steps should be as automatic as possible. Generally, newspapers are structured into large numbers of articles. These usually contain a distinct piece of content or describe a certain topic and can mostly be understood without any context. Newspaper articles are crucial entities for historians and humanities scholars who focus on a specific research area and are only interested in articles related to that topic. Additionally, some natural language processing applications, like e.g. topic modeling or event detection, rely on a logical structuring of the underlying text, to be able to extract meaningful information. For this reason it is important to tackle the article separation (AS) task, which tries to form coherent articles, based on previously detected baselines and their respective text. In the following image a schematic overview of the overall AS workflow can be found.
Support
Quality
Security
License
Reuse
Support
citlab-article-separation-new has a low active ecosystem.
It has 2 star(s) with 0 fork(s). There are 2 watchers for this library.
It had no major release in the last 12 months.
citlab-article-separation-new has no issues reported. There are no pull requests.
It has a neutral sentiment in the developer community.
The latest version of citlab-article-separation-new is 2.0
Quality
citlab-article-separation-new has no bugs reported.
Security
citlab-article-separation-new has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.
License
citlab-article-separation-new does not have a standard license declared.
Check the repository for any license declaration and review the terms closely.
Without a license, all rights are reserved, and you cannot use the library in your applications.
Reuse
citlab-article-separation-new releases are available to install and integrate.
Build file is available. You can build the component from source.
Installation instructions, examples and code snippets are available.
Top functions reviewed by kandi - BETA
kandi's functional review helps you automatically verify the functionalities of the libraries and avoid rework.
Currently covering the most popular Java, JavaScript and Python libraries. See a Sample of citlab-article-separation-new
Currently covering the most popular Java, JavaScript and Python libraries. See a Sample of citlab-article-separation-new
citlab-article-separation-new Key Features
No Key Features are available at this moment for citlab-article-separation-new.
citlab-article-separation-new Examples and Code Snippets
No Code Snippets are available at this moment for citlab-article-separation-new.
Community Discussions
No Community Discussions are available at this moment for citlab-article-separation-new.Refer to stack overflow page for discussions.
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install citlab-article-separation-new
The Python modules in this repository are all tested with Python 3.6. The best way to use the modules is by creating a virtual environment and install the packages given in the requirements.txt file. The packages should work with TensorFlow 1.12 (pip install tensorflow==1.12) to TensorFlow 1.14 (pip install tensorflow==1.14).
Support
For any new features, suggestions and bugs create an issue on GitHub.
If you have any questions check and ask questions on community page Stack Overflow .
Find more information at:
Reuse Trending Solutions
Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items
Find more librariesStay Updated
Subscribe to our newsletter for trending solutions and developer bootcamps
Share this Page