pyAnalytics by DUanalytics
This repository is for teaching analytics using Python programming
Python
239
Updated: 4 y ago
License: No License (No License)
full-stack-data-science by amitkaps
Full Stack Data Science in Python
Jupyter Notebook
238
Updated: 4 y ago
License: Permissive (MIT)
miceforest by AnotherSamWilson
Multiple Imputation with LightGBM in Python
Python
237
Updated: 2 y ago
License: Permissive (MIT)
mriqc by nipreps
Automated Quality Control and visual reports for Quality Assessment of structural (T1w, T2w) and functional MRI of the brain
Python
236
Updated: 2 y ago
License: Permissive (Apache-2.0)
pyroad by amaargiru
Detailed Python developer roadmap
Jupyter Notebook
235
Updated: 2 y ago
License: Proprietary (Proprietary)
miloR by MarioniLab
R package implementation of Milo for testing for differential abundance in KNN graphs
R
228
Updated: 2 y ago
License: Strong Copyleft (GPL-3.0)
QuantFinanceBook by LechGrzelak
Quantitative Finance book
Python
227
Updated: 2 y ago
License: No License (No License)
pandas-workshop by stefmolin
An introductory workshop on pandas with notebooks and exercises for following along.
HTML
221
Updated: 2 y ago
License: Permissive (MIT)
10_Python_Pandas_Module by milaan9
Pandas is a high-level data manipulation tool developed by Wes McKinney. It is built on the NumPy package, and its key data structure is the DataFrame. DataFrames let you store and manipulate tabular data in rows of observations and columns of variables.
Jupyter Notebook
221
Updated: 2 y ago
License: Permissive (MIT)
scrapbook by nteract
A library for recording and reading data in notebooks.
Python
220
Updated: 4 y ago
License: Permissive (BSD-3-Clause)
DataWrangling by ben519
The ultimate reference guide to data wrangling with Python and R
R
220
Updated: 4 y ago
License: No License (No License)
autoimpute by kearnz
Python package for Imputation Methods
Python
218
Updated: 2 y ago
License: Permissive (MIT)
sand by kolaczyk
Statistical Analysis of Network Data with R, 2nd Edition
R
217
Updated: 4 y ago
License: No License (No License)
california-coronavirus-data by datadesk
The Los Angeles Times' open-source archive of California coronavirus data
Jupyter Notebook
214
Updated: 2 y ago
License: Proprietary (Proprietary)
DataGotham2013 by yhat
Python
211
Updated: 4 y ago
License: No License (No License)
your-first-kaggle-submission by mrdbourke
How to perform an exploratory data analysis on the Kaggle Titanic dataset and make a submission to the leaderboard.
Jupyter Notebook
208
Updated: 2 y ago
License: No License (No License)
python-for-data-science by blobcity
A collection of Jupyter Notebooks for learning Python for Data Science.
Jupyter Notebook
206
Updated: 4 y ago
License: Permissive (Apache-2.0)
kamu-cli by kamu-data
New generation decentralized data warehouse and streaming data pipeline
Rust
205
Updated: 2 y ago
License: Proprietary (Proprietary)
data-kit by lewagon
Become a Data Scientist on Le Wagon On Demand
Jupyter Notebook
203
Updated: 4 y ago
License: No License (No License)
rosetta by columbia-applied-data-science
Tools, wrappers, etc. for data science, with a concentration on text processing
Jupyter Notebook
203
Updated: 4 y ago
License: Proprietary (Proprietary)
IBM-Data-Science-Professional-Certification by Thomas-George-T
Learning materials, Quizzes & Assignment solutions for the entire IBM data science professional certification. Also included, a few resources that I found helpful.
Jupyter Notebook
203
Updated: 2 y ago
License: No License (No License)
elementary-lineage by elementary-data
Elementary is an open-source data observability framework for modern data teams, starting with data lineage.
Python
196
Updated: 3 y ago
License: Permissive (Apache-2.0)
pandas_cub by tdpetrou
Learn how to build a data analysis library from scratch
Python
194
Updated: 2 y ago
License: Permissive (BSD-3-Clause)
many-to-many-dijkstra by facebookresearch
A predictive model developed to identify medium-voltage electrical distribution grid infrastructure using publicly available data sources.
Python
192
Updated: 4 y ago
License: Permissive (MIT)
jupyter-datatables by CermakM
Jupyter Notebook extension leveraging pandas DataFrames by integrating DataTables and ChartJS.
JavaScript
192
Updated: 2 y ago
License: Permissive (MIT)
aeds2 by icei-pucminas
Code repository for the Algorithms and Data Structures II course
Java
192
Updated: 2 y ago
License: No License (No License)
BagofTricks-LT by zhangyongshun
A scientific and useful toolbox, which contains practical and effective long-tail related tricks with extensive experimental results
Python
191
Updated: 2 y ago
License: Permissive (MIT)
PandasSchema by multimeric
A validation library for Pandas data frames using user-friendly schemas
Python
184
Updated: 2 y ago
License: Strong Copyleft (GPL-3.0)
datascience-sp14 by amplab
Repository for data science course Spring 14
Shell
183
Updated: 4 y ago
License: No License (No License)
amazon-redshift-python-driver by aws
Redshift Python Connector. It supports Python Database API Specification v2.0.
Python
182
Updated: 2 y ago
License: Permissive (Apache-2.0)
DataScience_Interview_Questions by milaan9
My Solutions to 120 commonly asked data science interview questions.
Jupyter Notebook
181
Updated: 2 y ago
License: Permissive (MIT)
pmonitor by dspinellis
Progress monitor: monitor a job's progress
Shell
179
Updated: 4 y ago
License: Proprietary (Proprietary)
mlflow-example by mlflow
An example MLflow project
Python
178
Updated: 2 y ago
License: Permissive (Apache-2.0)
CAMELYON by ilikewind
A solution to the camelyon16 and camelyon17 challenges, also usable for your own WSI data project.
Python
174
Updated: 4 y ago
License: No License (No License)
matgenb by materialsvirtuallab
Jupyter notebooks demonstrating the utilization of open-source codes for the study of materials science.
Jupyter Notebook
174
Updated: 2 y ago
License: Permissive (BSD-3-Clause)
hyppo by neurodata
Python package for multivariate hypothesis testing
Python
171
Updated: 2 y ago
License: Permissive (MIT)
Python-Data-Cleaning-Cookbook by PacktPublishing
Python Data Cleaning Cookbook, published by Packt
Python
168
Updated: 2 y ago
License: Permissive (MIT)
fastverse by fastverse
An Extensible Suite of High-Performance and Low-Dependency Packages for Statistical Computing and Data Manipulation in R
R
166
Updated: 2 y ago
License: Strong Copyleft (GPL-3.0)
Python-Fundamentals by dlab-berkeley
D-Lab's 12 hour introduction to Python. Learn how to create variables and functions, use control flow structures, use libraries, import data, and more, using Python and Jupyter Notebooks.
Jupyter Notebook
159
Updated: 2 y ago
License: Proprietary (Proprietary)
Python-developer-roadmap by ErdemOzgen
Roadmap for becoming a Python developer.
Python
156
Updated: 2 y ago
License: Permissive (MIT)
introDataScience by iewaij
Notes on Data Science. Study notes on mathematical statistics, machine learning, and data programming.
Jupyter Notebook
155
Updated: 2 y ago
License: Strong Copyleft (CC-BY-SA-4.0)
rlist by renkun-ken
A Toolbox for Non-Tabular Data Manipulation
R
154
Updated: 4 y ago
License: Proprietary (Proprietary)
Python-Data-Structures-and-Algorithms by PacktPublishing
Python Data Structures and Algorithms, published by Packt
Python
153
Updated: 2 y ago
License: Permissive (MIT)
THOR by xl-sr
[BMVC'19] Tracking Holistic Object Representations
Python
153
Updated: 2 y ago
License: Permissive (MIT)
openHistorian by GridProtectionAlliance
The Open Source Time-Series Data Historian
TypeScript
153
Updated: 2 y ago
License: Permissive (MIT)
machine_learning_for_good by DeltaAnalytics
Machine learning fundamentals lesson in interactive notebooks
Jupyter Notebook
153
Updated: 4 y ago
License: Permissive (CC-BY-4.0)
0wned by mschwager
Code execution via Python package installation.
Python
149
Updated: 4 y ago
License: Strong Copyleft (GPL-3.0)
DPPy by guilgautier
Python toolbox for sampling Determinantal Point Processes
Python
148
Updated: 4 y ago
License: Permissive (MIT)
hypertools-paper-notebooks by ContextLab
Supporting notebooks and data from hypertools paper
Jupyter Notebook
146
Updated: 2 y ago
License: Permissive (MIT)
papermill-mlflow by eugeneyan
🧪 Simple data science experimentation & tracking with jupyter, papermill, and mlflow.
Jupyter Notebook
146
Updated: 2 y ago
License: No License (No License)