OpenDS4All by odpi
OpenDS4All project, hosted by LF AI & Data
HTML
628
Updated: 2 y ago
License: Permissive (Apache-2.0)

Dora by NathanEpstein
Tools for exploratory data analysis in Python
Python
613
Updated: 2 y ago
License: No License (No License)

book_sample by rnorm
Another book on data science
Python
612
Updated: 4 y ago
License: No License (No License)

python_intro by Yorko
Jupyter notebooks in Russian. Introduction to Python, basic algorithms and data structures
Jupyter Notebook
605
Updated: 2 y ago
License: No License (No License)

Pandas-Cookbook by PacktPublishing
Pandas Cookbook, published by Packt
Jupyter Notebook
593
Updated: 2 y ago
License: Permissive (MIT)

OpenSource-RoadMap-DataScience by DataScienceResearchPeru
The path to a self-taught education in data science!
Jupyter Notebook
546
Updated: 2 y ago
License: Permissive (Apache-2.0)

learningPySpark by drabastomek
Code base for the Learning PySpark book (in preparation)
Jupyter Notebook
541
Updated: 4 y ago
License: Strong Copyleft (GPL-3.0)

fmriprep by nipreps
fMRIPrep is a robust and easy-to-use pipeline for preprocessing of diverse fMRI data. The transparent workflow dispenses with manual intervention, thereby ensuring the reproducibility of the results.
HTML
531
Updated: 2 y ago
License: Permissive (Apache-2.0)

pyprind by rasbt
PyPrind - Python Progress Indicator Utility
Python
529
Updated: 4 y ago
License: Permissive (BSD-3-Clause)

r-basic by joanby
Introductory course on descriptive statistics with RStudio
HTML
523
Updated: 2 y ago
License: No License (No License)

dataframe by Kotlin
Structured data processing in Kotlin
Kotlin
512
Updated: 2 y ago
License: Permissive (Apache-2.0)

pycortex by gallantlab
Pycortex is a Python-based toolkit for surface visualization of fMRI data
JavaScript
490
Updated: 2 y ago
License: Permissive (BSD-2-Clause)

Julia-DataFrames-Tutorial by bkamins
A tutorial on the Julia DataFrames package
Jupyter Notebook
485
Updated: 2 y ago
License: Permissive (MIT)

datascience by data-8
A Python library for introductory data science
Jupyter Notebook
481
Updated: 2 y ago
License: Permissive (BSD-3-Clause)

collapse by SebKrantz
Advanced and Fast Data Transformation in R
C
470
Updated: 2 y ago
License: Proprietary (Proprietary)

zmPDSwR by WinVector
Example R scripts and data for "Practical Data Science with R" 1st edition by Nina Zumel and John Mount (Manning Publications)
HTML
457
Updated: 4 y ago
License: Proprietary (Proprietary)

introducao-a-data-science by alura-cursos
Content of the first part of Alura's introduction to data science course
Jupyter Notebook
450
Updated: 2 y ago
License: No License (No License)

datawig by awslabs
Imputation of missing values in tables.
JavaScript
446
Updated: 2 y ago
License: Permissive (Apache-2.0)

data-science-template by khuyentran1401
Template for a data science project
Python
446
Updated: 2 y ago
License: No License (No License)

production-data-science by FilippoBovo
Production Data Science: a workflow for collaborative data science aimed at production
Jupyter Notebook
440
Updated: 2 y ago
License: No License (No License)

dataframe-js by Gmousse
A JavaScript library providing a new data structure for data scientists and developers
JavaScript
438
Updated: 2 y ago
License: Permissive (MIT)

fivethirtyeight by rudeboybert
R package of data and code behind the stories and interactives at FiveThirtyEight
R
426
Updated: 4 y ago
License: Proprietary (Proprietary)

PrettyPandas by HHammond
A Pandas Styler class for making beautiful tables
Python
406
Updated: 2 y ago
License: Permissive (MIT)

blogs by tomasonjo
Jupyter notebooks that support my graph data science blog posts at https://bratanic-tomaz.medium.com/
Jupyter Notebook
406
Updated: 2 y ago
License: No License (No License)

rpy2 by rpy2
Interface to use R from Python
Python
402
Updated: 2 y ago
License: Strong Copyleft (GPL-2.0)

object_playground by jamesshore
A tool for visualizing and experimenting with JavaScript object relationships.
JavaScript
399
Updated: 4 y ago
License: Proprietary (Proprietary)

120-DS-Interview-Questions by JifuZhao
My Answer to 120 Data Science Interview Questions
Jupyter Notebook
392
Updated: 4 y ago
License: No License (No License)

modelr by tidyverse
Helper functions for modelling
R
382
Updated: 3 y ago
License: Strong Copyleft (GPL-3.0)

pyspark-tutorials by UrbanInstitute
Code snippets and tutorials for working with social science data in PySpark
Jupyter Notebook
379
Updated: 2 y ago
License: Proprietary (Proprietary)

Python-Lectures by rajathkmp
IPython Notebooks to learn Python
Jupyter Notebook
364
Updated: 4 y ago
License: No License (No License)

Data-Engineering-with-Python by PacktPublishing
Data Engineering with Python, published by Packt
Python
361
Updated: 2 y ago
License: Permissive (MIT)

sidetable by chris1610
sidetable builds simple but useful summary tables of your data
Python
360
Updated: 2 y ago
License: Permissive (MIT)

python by fabianabarca
Python tutorials for data analysis in the course IE0405, "Probabilistic Models of Signals and Systems", at the Universidad de Costa Rica.
Jupyter Notebook
359
Updated: 2 y ago
License: Permissive (CC0-1.0)

Data-Science-Hacks by kunalj101
Data Science Hacks is a collection of tips and tricks to help you become a better data scientist. The hacks are for all levels, beginner to advanced, and cover Python, Jupyter Notebook, pandas, and more.
Jupyter Notebook
351
Updated: 2 y ago
License: Strong Copyleft (GPL-3.0)

mice by amices
Multivariate Imputation by Chained Equations
R
343
Updated: 2 y ago
License: Strong Copyleft (GPL-2.0)

pgmpy_notebook by pgmpy
Short tutorial on Probabilistic Graphical Models (PGM) and pgmpy
Jupyter Notebook
341
Updated: 2 y ago
License: Permissive (MIT)

SoccermaticsForPython by Friends-of-Tracking-Data-FoTD
This repo is dedicated to people getting started with Python using the concepts derived from the book Soccermatics (Sumpter 2016)
Python
336
Updated: 2 y ago
License: Permissive (MIT)

PyDESeq2 by owkin
A Python implementation of the DESeq2 pipeline for bulk RNA-seq DEA.
Python
336
Updated: 2 y ago
License: Permissive (MIT)

kaggle-talkingdata-visualization by adilmoujahid
Source code for blog post: Interactive Data Visualization of Geospatial Data using D3.js, DC.js, Leaflet.js and Python
JavaScript
332
Updated: 4 y ago
License: No License (No License)

Erlemar.github.io by Erlemar
Data science portfolio
Jupyter Notebook
321
Updated: 4 y ago
License: No License (No License)

nipy by nipy
Neuroimaging in Python: fMRI analysis package
Python
297
Updated: 4 y ago
License: Permissive (BSD-3-Clause)

introduction-to-python-for-computational-science-and-engineering by fangohr
Book: Introduction to Python for Computational Science and Engineering
Jupyter Notebook
296
Updated: 3 y ago
License: Proprietary (Proprietary)

bigvis by hadley
Exploratory data analysis for large datasets (10-100 million observations)
C++
282
Updated: 4 y ago
License: No License (No License)

rust-dataframe by nevi-me
A Rust DataFrame implementation, built on Apache Arrow
Rust
276
Updated: 4 y ago
License: Permissive (Apache-2.0)

ml-arsenal-public by liaopeiyuan
Full pipeline of a data science competition (public version)
Python
266
Updated: 4 y ago
License: No License (No License)

pandasticsearch by onesuper
An Elasticsearch client exposing a DataFrame API
Python
263
Updated: 4 y ago
License: Permissive (MIT)

Annotated data.

jupyter-tips-and-tricks by jbwhit
Using Project Jupyter for data science.
Jupyter Notebook
247
Updated: 4 y ago
License: Permissive (MIT)

personal_data_science_projects by robsalgado
Jupyter Notebook
246
Updated: 2 y ago
License: No License (No License)

cartoframes by CartoDB
CARTO Python package for data scientists
Python
244
Updated: 2 y ago
License: Permissive (BSD-3-Clause)