Annotated data.
OpenDS4All by odpi
OpenDS4All project, hosted by LF AI & Data
HTML 628 Updated: 10 mo ago License: Permissive (Apache-2.0)
Dora by NathanEpstein
Tools for exploratory data analysis in Python
Python 613 Updated: 11 mo ago License: No License (No License)
book_sample by rnorm
Another book on data science
Python 612 Updated: 3 y ago License: No License (No License)
python_intro by Yorko
Jupyter notebooks in Russian. Introduction to Python, basic algorithms and data structures
Jupyter Notebook 605 Updated: 1 y ago License: No License (No License)
Pandas-Cookbook by PacktPublishing
Pandas Cookbook, published by Packt
Jupyter Notebook 593 Updated: 1 y ago License: Permissive (MIT)
OpenSource-RoadMap-DataScience by DataScienceResearchPeru
A path to a self-taught education in Data Science!
Jupyter Notebook 546 Updated: 1 y ago License: Permissive (Apache-2.0)
learningPySpark by drabastomek
Code base for the Learning PySpark book (in preparation)
Jupyter Notebook 541 Updated: 3 y ago License: Strong Copyleft (GPL-3.0)
fmriprep by nipreps
fMRIPrep is a robust and easy-to-use pipeline for preprocessing of diverse fMRI data. The transparent workflow dispenses with manual intervention, thereby ensuring the reproducibility of the results.
HTML 531 Updated: 10 mo ago License: Permissive (Apache-2.0)
pyprind by rasbt
PyPrind - Python Progress Indicator Utility
Python 529 Updated: 3 y ago License: Permissive (BSD-3-Clause)
r-basic by joanby
Introductory course on descriptive statistics with R Studio
HTML 523 Updated: 1 y ago License: No License (No License)
dataframe by Kotlin
Structured data processing in Kotlin
Kotlin 512 Updated: 10 mo ago License: Permissive (Apache-2.0)
pycortex by gallantlab
Pycortex is a Python-based toolkit for surface visualization of fMRI data
JavaScript 490 Updated: 11 mo ago License: Permissive (BSD-2-Clause)
Julia-DataFrames-Tutorial by bkamins
A tutorial on the Julia DataFrames package
Jupyter Notebook 485 Updated: 1 y ago License: Permissive (MIT)
datascience by data-8
A Python library for introductory data science
Jupyter Notebook 481 Updated: 1 y ago License: Permissive (BSD-3-Clause)
collapse by SebKrantz
Advanced and Fast Data Transformation in R
C 470 Updated: 10 mo ago License: Proprietary (Proprietary)
zmPDSwR by WinVector
Example R scripts and data for "Practical Data Science with R" 1st edition by Nina Zumel and John Mount (Manning Publications)
HTML 457 Updated: 3 y ago License: Proprietary (Proprietary)
introducao-a-data-science by alura-cursos
Content for the first part of Alura's introductory Data Science course
Jupyter Notebook 450 Updated: 11 mo ago License: No License (No License)
datawig by awslabs
Imputation of missing values in tables.
JavaScript 446 Updated: 1 y ago License: Permissive (Apache-2.0)
data-science-template by khuyentran1401
Template for a data science project
Python 446 Updated: 11 mo ago License: No License (No License)
production-data-science by FilippoBovo
Production Data Science: a workflow for collaborative data science aimed at production
Jupyter Notebook 440 Updated: 1 y ago License: No License (No License)
dataframe-js by Gmousse
A JavaScript library providing a new data structure for data scientists and developers
JavaScript 438 Updated: 1 y ago License: Permissive (MIT)
fivethirtyeight by rudeboybert
R package of data and code behind the stories and interactives at FiveThirtyEight
R 426 Updated: 3 y ago License: Proprietary (Proprietary)
PrettyPandas by HHammond
A Pandas Styler class for making beautiful tables
Python 406 Updated: 10 mo ago License: Permissive (MIT)
blogs by tomasonjo
Jupyter notebooks that support my graph data science blog posts at https://bratanic-tomaz.medium.com/
Jupyter Notebook 406 Updated: 10 mo ago License: No License (No License)
rpy2 by rpy2
Interface to use R from Python
Python 402 Updated: 10 mo ago License: Strong Copyleft (GPL-2.0)
object_playground by jamesshore
A tool for visualizing and experimenting with JavaScript object relationships.
JavaScript 399 Updated: 3 y ago License: Proprietary (Proprietary)
120-DS-Interview-Questions by JifuZhao
My Answers to 120 Data Science Interview Questions
Jupyter Notebook 392 Updated: 3 y ago License: No License (No License)
modelr by tidyverse
Helper functions for modelling
R 382 Updated: 2 y ago License: Strong Copyleft (GPL-3.0)
pyspark-tutorials by UrbanInstitute
Code snippets and tutorials for working with social science data in PySpark
Jupyter Notebook 379 Updated: 1 y ago License: Proprietary (Proprietary)
Python-Lectures by rajathkmp
IPython Notebooks to learn Python
Jupyter Notebook 364 Updated: 3 y ago License: No License (No License)
Data-Engineering-with-Python by PacktPublishing
Data Engineering with Python, published by Packt
Python 361 Updated: 11 mo ago License: Permissive (MIT)
sidetable by chris1610
sidetable builds simple but useful summary tables of your data
Python 360 Updated: 11 mo ago License: Permissive (MIT)
python by fabianabarca
Python tutorials for data analysis in the course IE0405, "Modelos Probabilísticos de Señales y Sistemas" (Probabilistic Models of Signals and Systems), at the Universidad de Costa Rica.
Jupyter Notebook 359 Updated: 10 mo ago License: Permissive (CC0-1.0)
Data-Science-Hacks by kunalj101
Data Science Hacks consists of tips and tricks to help you become a better data scientist. Data science hacks are for everyone, from beginner to advanced, and cover Python, Jupyter Notebook, pandas, and so on.
Jupyter Notebook 351 Updated: 1 y ago License: Strong Copyleft (GPL-3.0)
mice by amices
Multivariate Imputation by Chained Equations
R 343 Updated: 12 mo ago License: Strong Copyleft (GPL-2.0)
pgmpy_notebook by pgmpy
Short tutorial on Probabilistic Graphical Models (PGM) and pgmpy
Jupyter Notebook 341 Updated: 1 y ago License: Permissive (MIT)
SoccermaticsForPython by Friends-of-Tracking-Data-FoTD
This repo is dedicated to people getting started with Python using the concepts derived from the book Soccermatics (Sumpter 2016)
Python 336 Updated: 1 y ago License: Permissive (MIT)
PyDESeq2 by owkin
A Python implementation of the DESeq2 pipeline for bulk RNA-seq DEA.
Python 336 Updated: 10 mo ago License: Permissive (MIT)
kaggle-talkingdata-visualization by adilmoujahid
Source code for blog post: Interactive Data Visualization of Geospatial Data using D3.js, DC.js, Leaflet.js and Python
JavaScript 332 Updated: 3 y ago License: No License (No License)
Erlemar.github.io by Erlemar
Data science portfolio
Jupyter Notebook 321 Updated: 3 y ago License: No License (No License)
nipy by nipy
Neuroimaging in Python FMRI analysis package
Python 297 Updated: 3 y ago License: Permissive (BSD-3-Clause)
introduction-to-python-for-computational-science-and-engineering by fangohr
Book: Introduction to Python for Computational Science and Engineering
Jupyter Notebook 296 Updated: 2 y ago License: Proprietary (Proprietary)
bigvis by hadley
Exploratory data analysis for large datasets (10-100 million observations)
C++ 282 Updated: 3 y ago License: No License (No License)
rust-dataframe by nevi-me
A Rust DataFrame implementation, built on Apache Arrow
Rust 276 Updated: 3 y ago License: Permissive (Apache-2.0)
ml-arsenal-public by liaopeiyuan
Full pipeline of a data science competition (public version)
Python 266 Updated: 3 y ago License: No License (No License)
pandasticsearch by onesuper
An Elasticsearch client exposing a DataFrame API
Python 263 Updated: 3 y ago License: Permissive (MIT)
jupyter-tips-and-tricks by jbwhit
Using Project Jupyter for data science.
Jupyter Notebook 247 Updated: 3 y ago License: Permissive (MIT)
personal_data_science_projects by robsalgado
Jupyter Notebook 246 Updated: 1 y ago License: No License (No License)
cartoframes by CartoDB
CARTO Python package for data scientists
Python 244 Updated: 1 y ago License: Permissive (BSD-3-Clause)