KIT SOLUTION SOURCE
ASATYA | A Browser extension to Combat Disinformation Asatya is a browser extension that bundles a suite of tools to fight disinformation on the web. These inform the user of a news report's reliability, media bias, objectivity and even summarizes the report for the user's convenience.
VSCode and Jupyter Notebook are used for development and debugging. Jupyter Notebook is a web based interactive environment often used for experiments, whereas VSCode is used to get a typical experience of IDE for developers. Jupyter Notebook is used for our development. cmder is a console emulator package for Windows which supports bash.
EXPLORATORY DATA ANALYSIS
For extensive analysis and exploration of data, and to deal with arrays, these libraries are used. They are also used for performing scientific computation and data manipulation. pandas is a fast, powerful, flexible and easy to use open source data analysis and manipulation tool, built on top of the Python programming language.
Python 36647 Version:1.5.2 License: Permissive (BSD-3-Clause)
Web scraping is an automatic method to obtain large amounts of data from websites. Beautiful Soup is a Python package for parsing HTML and XML documents. It creates a parse tree for parsed pages that can be used to extract data from HTML, which is useful for web scraping. urllib3 is a powerful, user-friendly HTTP client for Python.
Python 3261 Version:1.26.13 License: Permissive (MIT)
Libraries in this group are used for analysis and processing of unstructured natural language. The data, as in its original form aren't used as it has to go through processing pipeline to become suitable for applying machine learning techniques and algorithms. The Natural Language Toolkit, or more commonly NLTK, is a suite of libraries and programs for symbolic and statistical natural language processing for English written in the Python programming language.
A REST API (also known as RESTful API) is an application programming interface (API or web API) that conforms to the constraints of REST architectural style and allows for interaction with RESTful web services. REST stands for representational state transfer. Flask is a web application framework written in Python. Flask is based on the Werkzeug WSGI toolkit and Jinja2 template engine. flask-cors is a Cross Origin Resource Sharing ( CORS ) support for Flask. TinyDB is a lightweight document oriented database optimized for your database applications. It's written in pure Python and has no external dependencies. The target are small apps that would be blown away by a SQL-DB or an external database server.
Extractive methods attempt to summarise articles by identifying the important sentences or phrases from the original text and stitch together portions of the content to produce a condensed version. Newspaper can extract and detect languages seamlessly, including metadata and can perform NLP algorithms on the same.
NEWS PARAMETER PREDICTION
We predict the veracity of news and use machine learning methods to find parameters like objectivity, bias and reliability. scikit learn includes simple and efficient tools for predictive data analysis . It is accessible to everybody, and reusable in various contexts
Joblib is a set of tools to provide lightweight pipelining in Python.
Aaryak Garg (github.com/Darthfire ) Arsh Kohli ( github.com/arshxyz ) Manan Suri ( github.com/MananSuri27 )
Open Weaver – Develop Applications Faster with Open Source