Flume Hive ElasticSearch
hived is a honeypot
Find big files.
Hive processing scripts from Beijing, 2013
dataqualityv2 by spanda2020
Shell 2 Updated: 6 y ago License: No License (No License)

cassandra by snapcrafters
Snap of Apache Cassandra, a highly-scalable partitioned row store
Shell 2 Updated: 5 y ago License: No License (No License)

DataTransfer by RajdeepBiswas
Generic HDFS data and Hive database transfer automation between any environments (Production/QA/Development) using Amazon S3 storage
Shell 2 Updated: 5 y ago License: No License (No License)

Hadoop-Setup by mickesv
Different configurations for setting up Hadoop
Shell 2 Updated: 3 y ago License: Permissive (MIT)

clickhouse-rpm by hnakamur
Shell 2 Updated: 5 y ago License: Permissive (MIT)

vagrant-spark2 by kadnan
Vagrant Box with Python 3.6.1, Apache Spark 2.1.1 with Scala 2.11.8 and PySpark (2.1.1).
Shell 2 Updated: 4 y ago License: Strong Copyleft (GPL-3.0)

kinesisWeatherStreamDemo by rossdstuart
Streams weather data from an API to Kinesis Firehose and aggregates it in Kinesis Analytics
Shell 2 Updated: 6 y ago License: No License (No License)

docker-zeppelin-cassandra by Project-EPIC
Zeppelin Docker image prepared to integrate with Cassandra and Spark
Shell 2 Updated: 7 y ago License: No License (No License)

phoenix-hbase by joao-parana
Container with Python, ZooKeeper, Hadoop, and HBase
Shell 2 Updated: 7 y ago License: No License (No License)

docker-hadoop by hanwentao
Hadoop in Docker containers
Shell 2 Updated: 7 y ago License: No License (No License)

Dsystem-Homework by 9231058
Dr. Pedram Distributed System Homework
Go 2 Updated: 3 y ago License: Strong Copyleft (GPL-3.0)

OneFS-HDFS-Tools by bonibruno
OneFS HDFS Scripts
Shell 2 Updated: 5 y ago License: No License (No License)

docker-cassandra-backup-to-s3 by UKHomeOffice
Shell 2 Updated: 6 y ago License: No License (No License)

hadoop2-single-node-app by felipeguerra19
Project for demonstrating and understanding Apache Hadoop 2
Shell 2 Updated: 7 y ago License: Permissive (MIT)

docker-zeppelin by samelamin
Shell 2 Updated: 6 y ago License: Permissive (Apache-2.0)

initcassandra by ansrivas
A super hacky way to initialize keyspaces and tables in Cassandra using docker-compose.
Shell 2 Updated: 6 y ago License: Permissive (MIT)

docker-hadoop-compiler by sangwonl
Dockerized Hadoop Compiler
Shell 2 Updated: 5 y ago License: No License (No License)

witdata by maxwit
A big data project for the AI/IoT age.
Shell 2 Updated: 6 y ago License: No License (No License)

hive-druid-benchmark by b-slim
Hive Druid integration benchmark
Shell 2 Updated: 5 y ago License: No License (No License)

vegeta by Code-Hex
Project to collect large amounts of vegetable data using IoT
Go 2 Updated: 3 y ago License: Strong Copyleft (GPL-3.0)

docker-for-hadoop by gtchaos
A repository for building a Hadoop cluster with Docker.
Shell 2 Updated: 5 y ago License: No License (No License)

sparkutils by gioenn
A collection of scripts to easily start HDFS and Spark clusters
Shell 2 Updated: 5 y ago License: No License (No License)

hadoop-common-shaded by gchq
A shaded version of org.apache.hadoop:hadoop-common: shades Jersey and excludes logging.
Shell 2 Updated: 5 y ago License: Permissive (Apache-2.0)

hadoop-hdfs-shaded by gchq
A shaded version of org.apache.hadoop:hadoop-hdfs: shades Jersey and excludes logging.
Shell 2 Updated: 5 y ago License: Permissive (Apache-2.0)

bigdata-pipeline-vagrant-virtualbox by mahedi-kaysar
This project aims to develop scripts for setting up big data analytics environments with technologies including Hadoop, YARN, Hive, Spark, HBase, and so on.
Shell 2 Updated: 7 y ago License: Permissive (Apache-2.0)

streamsets-hdfs-demo by zketley
An example integration of StreamSets and HDFS, using Docker
Shell 2 Updated: 3 y ago License: No License (No License)

marchionni_projects by LieberInstitute
Collaboration with Luigi Marchionni (JHU) and RIKEN with recount2 data
Shell 2 Updated: 4 y ago License: Permissive (MIT)

go-pio by tempura-shrimp
Golang SDK for Apache PredictionIO
Go 2 Updated: 4 y ago License: Permissive (Apache-2.0)

Shell_Scripts_Data_Movement by SriRavindranath
Hadoop utility scripts to automate, pull, push, and manage data in Hadoop, Hive, and RDBMS
Shell 2 Updated: 5 y ago License: No License (No License)

cloud-basher by keith-ratcliffe
Automated, pluggable standup/teardown of cloud schtuff
Shell 2 Updated: 5 y ago License: No License (No License)

spark-ignite-docker by tonycox
Standalone Spark with Ignite in-memory RDD sharding on Docker
Shell 2 Updated: 4 y ago License: No License (No License)

vagrant-spark by manjuraj
Vagrant VM running Spark
Shell 2 Updated: 4 y ago License: No License (No License)

modern-clickstream by heman-duraiswamy
Modern clickstream data architecture using NiFi + Hadoop components
Shell 2 Updated: 6 y ago License: No License (No License)

docker-ubuntu16-hadoop by prographer
Shell 2 Updated: 6 y ago License: No License (No License)

capi-rowcache by ppc64le
Plugin of RowCache for Apache Cassandra
Shell 2 Updated: 3 y ago License: Permissive (Apache-2.0)

Incremental-Data-Ingestion by shahrukhkhan489
Example of continuous ingestion of public dataset snapshots into Hive and indexing into Elasticsearch
Shell 2 Updated: 5 y ago License: No License (No License)

ansible-role-apache-spark by wtanaka
Ansible role to install Apache Spark
Shell 2 Updated: 5 y ago License: No License (No License)

hadoop-ansible by chibiegg
Shell 2 Updated: 5 y ago License: No License (No License)

docker-livy by elek
Dockerized Livy, REST server for Apache Spark
Shell 2 Updated: 4 y ago License: Permissive (Apache-2.0)

Sqoop by pavan8444
Import data from MySQL to HBase and MySQL to Hive
Shell 2 Updated: 3 y ago License: No License (No License)

cassandra-repair by olx-brasil
Shell script to schedule repairs on Cassandra nodes
Shell 2 Updated: 4 y ago License: No License (No License)

docker-spark by flokkr
Docker images with Apache Spark and advanced config loading
Shell 2 Updated: 4 y ago License: Permissive (Apache-2.0)

docker-bigdata by tataopop
Shell 2 Updated: 6 y ago License: No License (No License)

bigdatanote by mumingv
[Migrated 2020-10-30] Notes on learning big data
Shell 2 Updated: 3 y ago License: Proprietary (Proprietary)

bigtop_cluster by ibmsoe
Shell 2 Updated: 7 y ago License: Permissive (Apache-2.0)