8 best Java Dataset libraries in 2024
by marketing.admin@openweaver.com Updated: Feb 1, 2023
Guide Kit
Java is an object-oriented programming language for applications and websites that was first released by Oracle in 1995. Data is a very important part of business. If a business does not have data, it will not be able to grow its revenue. In the past, businesses used to collect data manually from their users. Nowadays, companies use computer programs to gather data from their clients. These programs are called "datasets". Datasets are a structured collection of data which can be used for storing tabular, non-tabular and hierarchical data. Java ecosystem has many libraries and frameworks to help developers to manage data at scale. Data are the foundation of all research. They play a pivotal role in various fields such as data science and machine learning. Developers tend to use some of the following Java Dataset open source libraries: hollow - java library and toolset for disseminating in memory datasets; MiDaS - Code for robust monocular depth estimation described in "Ranftl et. al., Towards Robust Monocular De; mongolastic - dataset migration tool.
hollowby Netflix
Hollow is a java library and toolset for disseminating in-memory datasets from a single producer to many consumers for high performance read-only access.
hollowby Netflix
Java 1094 Version:v7.5.5 License: Permissive (Apache-2.0)
MiDaSby isl-org
Code for robust monocular depth estimation described in "Ranftl et. al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"
MiDaSby isl-org
Python 3028 Version:v3_1 License: Permissive (MIT)
chart-fxby GSI-CS-CO
A scientific charting library focused on performance optimised real-time data visualisation at 25 Hz update rates for data sets with a few 10 thousand up to 5 million data points.
chart-fxby GSI-CS-CO
Java 329 Version:11.2.6 License: Weak Copyleft (LGPL-3.0)
whylogs-javaby whylabs
Profile and monitor your ML data pipeline end-to-end
whylogs-javaby whylabs
Java 165 Version:v0.1.0 License: Permissive (Apache-2.0)
mongolasticby ozlerhakan
:traffic_light: A dataset migration tool from MongoDB to Elasticsearch and vice versa.
mongolasticby ozlerhakan
Java 136 Version:v1.4.4 License: Permissive (MIT)
bio2rdf-scriptsby bio2rdf
Scripts that Bio2RDF users have created to generate RDF versions of scientific datasets
bio2rdf-scriptsby bio2rdf
Java 107 Version:release3 License: Permissive (MIT)
NLIWODby dice-group
Collection of tools, utilities, datasets and approaches towards realising natural language interfaces for the Web of Data.
NLIWODby dice-group
Java 91 Version:0.0.1 License: Strong Copyleft (AGPL-3.0)
parsoby epam
lightweight Java library designed to read SAS7BDAT datasets
parsoby epam
Java 56 Version:v2.0.14 License: Permissive (Apache-2.0)