pyAnalytics by DUanalytics
A repository for teaching analytics using Python programming.
Python 239 | Updated: 3 y ago | License: No License
full-stack-data-science by amitkaps
Full Stack Data Science in Python
Jupyter Notebook 238 | Updated: 3 y ago | License: Permissive (MIT)
miceforest by AnotherSamWilson
Multiple Imputation with LightGBM in Python
Python 237 | Updated: 11 mo ago | License: Permissive (MIT)
mriqc by nipreps
Automated Quality Control and visual reports for Quality Assessment of structural (T1w, T2w) and functional MRI of the brain
Python 236 | Updated: 10 mo ago | License: Permissive (Apache-2.0)
pyroad by amaargiru
Detailed Python developer roadmap
Jupyter Notebook 235 | Updated: 1 y ago | License: Proprietary
miloR by MarioniLab
R package implementation of Milo for testing for differential abundance in KNN graphs
R 228 | Updated: 10 mo ago | License: Strong Copyleft (GPL-3.0)
QuantFinanceBook by LechGrzelak
Quantitative Finance book
Python 227 | Updated: 10 mo ago | License: No License
pandas-workshop by stefmolin
An introductory workshop on pandas with notebooks and exercises for following along.
HTML 221 | Updated: 1 y ago | License: Permissive (MIT)
10_Python_Pandas_Module by milaan9
Pandas is a high-level data manipulation tool developed by Wes McKinney. It is built on the NumPy package, and its key data structure is the DataFrame. DataFrames let you store and manipulate tabular data in rows of observations and columns of variables.
Jupyter Notebook 221 | Updated: 1 y ago | License: Permissive (MIT)
scrapbook by nteract
A library for recording and reading data in notebooks.
Python 220 | Updated: 3 y ago | License: Permissive (BSD-3-Clause)
DataWrangling by ben519
The ultimate reference guide to data wrangling with Python and R
R 220 | Updated: 3 y ago | License: No License
autoimpute by kearnz
Python package for Imputation Methods
Python 218 | Updated: 1 y ago | License: Permissive (MIT)
sand by kolaczyk
Statistical Analysis of Network Data with R, 2nd Edition
R 217 | Updated: 3 y ago | License: No License
california-coronavirus-data by datadesk
The Los Angeles Times' open-source archive of California coronavirus data
Jupyter Notebook 214 | Updated: 1 y ago | License: Proprietary
DataGotham2013 by yhat
Python 211 | Updated: 3 y ago | License: No License
your-first-kaggle-submission by mrdbourke
How to perform an exploratory data analysis on the Kaggle Titanic dataset and make a submission to the leaderboard.
Jupyter Notebook 208 | Updated: 11 mo ago | License: No License
python-for-data-science by blobcity
A collection of Jupyter Notebooks for learning Python for Data Science.
Jupyter Notebook 206 | Updated: 3 y ago | License: Permissive (Apache-2.0)
kamu-cli by kamu-data
New-generation decentralized data warehouse and streaming data pipeline
Rust 205 | Updated: 10 mo ago | License: Proprietary
data-kit by lewagon
Become a Data Scientist on Le Wagon On Demand
Jupyter Notebook 203 | Updated: 3 y ago | License: No License
rosetta by columbia-applied-data-science
Tools, wrappers, etc. for data science, with a concentration on text processing
Jupyter Notebook 203 | Updated: 3 y ago | License: Proprietary
IBM-Data-Science-Professional-Certification by Thomas-George-T
Learning materials, quizzes, and assignment solutions for the entire IBM data science professional certification. Also included are a few resources that I found helpful.
Jupyter Notebook 203 | Updated: 1 y ago | License: No License
elementary-lineage by elementary-data
Elementary is an open-source data observability framework for modern data teams, starting with data lineage.
Python 196 | Updated: 2 y ago | License: Permissive (Apache-2.0)
pandas_cub by tdpetrou
Learn how to build a data analysis library from scratch
Python 194 | Updated: 1 y ago | License: Permissive (BSD-3-Clause)
many-to-many-dijkstra by facebookresearch
A predictive model developed to identify medium-voltage electrical distribution grid infrastructure using publicly available data sources.
Python 192 | Updated: 3 y ago | License: Permissive (MIT)
jupyter-datatables by CermakM
Jupyter Notebook extension leveraging pandas DataFrames by integrating DataTables and ChartJS.
JavaScript 192 | Updated: 1 y ago | License: Permissive (MIT)
aeds2 by icei-pucminas
Code repository for the Algorithms and Data Structures II course
Java 192 | Updated: 11 mo ago | License: No License
BagofTricks-LT by zhangyongshun
A scientific and useful toolbox of practical, effective long-tail-related tricks, with extensive experimental results
Python 191 | Updated: 12 mo ago | License: Permissive (MIT)
PandasSchema by multimeric
A validation library for Pandas data frames using user-friendly schemas
Python 184 | Updated: 10 mo ago | License: Strong Copyleft (GPL-3.0)
datascience-sp14 by amplab
Repository for data science course Spring 14
Shell 183 | Updated: 3 y ago | License: No License
amazon-redshift-python-driver by aws
Redshift Python Connector. It supports Python Database API Specification v2.0.
Python 182 | Updated: 11 mo ago | License: Permissive (Apache-2.0)
DataScience_Interview_Questions by milaan9
My solutions to 120 commonly asked data science interview questions.
Jupyter Notebook 181 | Updated: 1 y ago | License: Permissive (MIT)
pmonitor by dspinellis
Progress monitor: monitor a job's progress
Shell 179 | Updated: 3 y ago | License: Proprietary
mlflow-example by mlflow
An example MLflow project
Python 178 | Updated: 1 y ago | License: Permissive (Apache-2.0)
CAMELYON by ilikewind
A solution to the CAMELYON16 and CAMELYON17 challenges that can also be applied to your own WSI data project.
Python 174 | Updated: 3 y ago | License: No License
matgenb by materialsvirtuallab
Jupyter notebooks demonstrating the utilization of open-source codes for the study of materials science.
Jupyter Notebook 174 | Updated: 10 mo ago | License: Permissive (BSD-3-Clause)
hyppo by neurodata
Python package for multivariate hypothesis testing
Python 171 | Updated: 11 mo ago | License: Permissive (MIT)
Python-Data-Cleaning-Cookbook by PacktPublishing
Python Data Cleaning Cookbook, published by Packt
Python 168 | Updated: 10 mo ago | License: Permissive (MIT)
fastverse by fastverse
An Extensible Suite of High-Performance and Low-Dependency Packages for Statistical Computing and Data Manipulation in R
R 166 | Updated: 1 y ago | License: Strong Copyleft (GPL-3.0)
Python-Fundamentals by dlab-berkeley
D-Lab's 12-hour introduction to Python. Learn how to create variables and functions, use control flow structures, use libraries, import data, and more, using Python and Jupyter Notebooks.
Jupyter Notebook 159 | Updated: 1 y ago | License: Proprietary
Python-developer-roadmap by ErdemOzgen
Roadmap for becoming a Python developer.
Python 156 | Updated: 11 mo ago | License: Permissive (MIT)
introDataScience by iewaij
Notes on Data Science. Study notes on mathematical statistics, machine learning, and data programming.
Jupyter Notebook 155 | Updated: 12 mo ago | License: Strong Copyleft (CC-BY-SA-4.0)
rlist by renkun-ken
A Toolbox for Non-Tabular Data Manipulation
R 154 | Updated: 3 y ago | License: Proprietary
Python-Data-Structures-and-Algorithms by PacktPublishing
Python Data Structures and Algorithms, published by Packt
Python 153 | Updated: 1 y ago | License: Permissive (MIT)
THOR by xl-sr
[BMVC'19] Tracking Holistic Object Representations
Python 153 | Updated: 1 y ago | License: Permissive (MIT)
openHistorian by GridProtectionAlliance
The Open Source Time-Series Data Historian
TypeScript 153 | Updated: 11 mo ago | License: Permissive (MIT)
machine_learning_for_good by DeltaAnalytics
Machine learning fundamentals lessons in interactive notebooks
Jupyter Notebook 153 | Updated: 3 y ago | License: Permissive (CC-BY-4.0)
0wned by mschwager
Code execution via Python package installation.
Python 149 | Updated: 3 y ago | License: Strong Copyleft (GPL-3.0)
DPPy by guilgautier
Python toolbox for sampling Determinantal Point Processes
Python 148 | Updated: 3 y ago | License: Permissive (MIT)
hypertools-paper-notebooks by ContextLab
Supporting notebooks and data from the hypertools paper
Jupyter Notebook 146 | Updated: 11 mo ago | License: Permissive (MIT)
papermill-mlflow by eugeneyan
🧪 Simple data science experimentation & tracking with Jupyter, papermill, and MLflow.
Jupyter Notebook 146 | Updated: 1 y ago | License: No License