8 best Java Dataset libraries in 2023
by Open Weaver kits ✔ Updated: Feb 1, 2023
Java is an object-oriented programming language for applications and websites, first released by Sun Microsystems (now part of Oracle) in 1995. Data is central to most businesses: without it, a company has little basis for the decisions that grow revenue. Where businesses once collected data from their users by hand, they now rely on software to gather it. A dataset is a structured collection of data, and it can hold tabular, non-tabular, or hierarchical records. The Java ecosystem has many libraries and frameworks that help developers manage data at scale. Data is also the foundation of research and plays a pivotal role in fields such as data science and machine learning. Developers tend to use some of the following open source dataset libraries: hollow - Java library and toolset for disseminating in-memory datasets; MiDaS - code for robust monocular depth estimation (a Python project, per its listing below); mongolastic - dataset migration tool.
Hollow is a Java library and toolset for disseminating in-memory datasets from a single producer to many consumers for high-performance, read-only access.
Java 1086 Version:v7.5.0 License: Permissive (Apache-2.0)
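The single-producer, many-consumer, read-only pattern described above can be illustrated with a minimal sketch in plain Java. This is not Hollow's API: `SnapshotPublisher`, `publish`, and `read` are hypothetical names chosen for illustration, and the sketch works inside one JVM only, whereas Hollow distributes datasets across processes. It only shows the core idea of swapping in a complete immutable snapshot so readers never observe a half-updated dataset.

```java
import java.util.List;
import java.util.concurrent.atomic.AtomicReference;

/** Hypothetical holder (not part of Hollow): one producer publishes snapshots, many consumers read them. */
final class SnapshotPublisher<T> {
    // The current snapshot; replaced atomically so readers never see a partially built list.
    private final AtomicReference<List<T>> current = new AtomicReference<>(List.of());

    /** Called by the single producer: copies the records and publishes them as a new snapshot. */
    void publish(List<T> records) {
        current.set(List.copyOf(records)); // List.copyOf returns an unmodifiable copy
    }

    /** Called by any number of consumers: returns the latest read-only snapshot. */
    List<T> read() {
        return current.get();
    }
}

class SnapshotDemo {
    public static void main(String[] args) {
        SnapshotPublisher<String> publisher = new SnapshotPublisher<>();
        publisher.publish(List.of("record-1", "record-2"));

        // A consumer grabs the snapshot that is current at this moment.
        List<String> seenByConsumer = publisher.read();

        // The producer publishes a new version of the dataset.
        publisher.publish(List.of("record-1", "record-2", "record-3"));

        System.out.println("old snapshot size: " + seenByConsumer.size());
        System.out.println("new snapshot size: " + publisher.read().size());
    }
}
```

A consumer that already holds a snapshot keeps a consistent view of it, while later calls to `read()` return the newer version. The trade-off in this sketch is that every publish copies the whole list, which is acceptable for small datasets but not a substitute for a dedicated library.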
Code for robust monocular depth estimation described in "Ranftl et al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"
Python 2971 Version:v3_1 License: Permissive (MIT)
A scientific charting library focused on performance-optimised, real-time data visualisation at 25 Hz update rates for data sets ranging from a few tens of thousands up to 5 million data points.
Java 329 Version:11.2.6 License: Weak Copyleft (LGPL-3.0)
Profile and monitor your ML data pipeline end-to-end
Java 165 Version:v0.1.0 License: Permissive (Apache-2.0)
A dataset migration tool from MongoDB to Elasticsearch and vice versa.
Java 136 Version:v1.4.4 License: Permissive (MIT)
Scripts that Bio2RDF users have created to generate RDF versions of scientific datasets
Java 107 Version:release3 License: Permissive (MIT)
Collection of tools, utilities, datasets and approaches towards realising natural language interfaces for the Web of Data.
Java 91 Version:0.0.1 License: Strong Copyleft (AGPL-3.0)
A lightweight Java library designed to read SAS7BDAT datasets
Java 56 Version:v2.0.14 License: Permissive (Apache-2.0)