Scikit-learn (Sklearn) is the most useful and robust library for machine learning in Python. It provides a selection of efficient tools for machine learning and statistical modeling including classification, regression, clustering, and dimensionality reduction via a consistence interface in Python.
Classification
In Classification, the output variable must be a discrete value. The task of the classification algorithm is to map the input value(x) with the discrete output variable(y).
cnn-text-classification-tfby dennybritz
Convolutional Neural Network for Text Classification in Tensorflow
cnn-text-classification-tfby dennybritz
Python
5574
Version:Current
License: Permissive (Apache-2.0)
Chinese-Text-Classification-Pytorchby 649453932
Chinese text classification, TextCNN, TextRNN, FastText, TextRCNN, BiLSTM_Attention, DPCNN, Transformer, based on pytorch, out of the box.
Chinese-Text-Classification-Pytorchby 649453932
Python
4459
Version:Current
License: Permissive (MIT)
pytorch-classificationby bearpaw
Classification with PyTorch.
pytorch-classificationby bearpaw
Python
1579
Version:Current
License: Permissive (MIT)
Regression
In Regression, the output variable must be of continuous nature or real value. The task of the regression algorithm is to map the input value (x) with the continuous output variable(y).
regression-jsby Tom-Alexander
Curve Fitting in JavaScript.
regression-jsby Tom-Alexander
JavaScript
894
Version:Current
License: Permissive (MIT)
cypress-visual-regressionby mjhea0
Module for adding visual regression testing to Cypress
cypress-visual-regressionby mjhea0
JavaScript
310
Version:Current
License: Permissive (MIT)
Visual-Regression-Trackerby Visual-Regression-Tracker
Backend and Frontend application for tracking differences via image comparison
Visual-Regression-Trackerby Visual-Regression-Tracker
Shell
491
Version:4.20.7
License: Permissive (Apache-2.0)
Clustering
A way of grouping the data points into different clusters, consisting of similar data points. The objects with the possible similarities remain in a group that has less or no similarities with another group.
moaby Waikato
MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detection and recommender systems) and tools for evaluation.
moaby Waikato
Java
537
Version:2021.07.0
License: Strong Copyleft (GPL-3.0)
image-similarity-clusteringby zegami
This project allows images to be automatically grouped into like clusters using a combination of machine learning techniques.
image-similarity-clusteringby zegami
Python
177
Version:Current
License: Permissive (MIT)
ml-email-clusteringby anthdm
Email clustering with machine learning
ml-email-clusteringby anthdm
Python
150
Version:Current
License: Permissive (MIT)
Dimensionality reduction
It is a way of converting the higher dimensions dataset into lesser dimensions dataset ensuring that it provides similar information.
feature-selectorby WillKoehrsen
Feature selector is a tool for dimensionality reduction of machine learning datasets
feature-selectorby WillKoehrsen
Jupyter Notebook
2080
Version:Current
License: Strong Copyleft (GPL-3.0)
deeptimeby deeptime-ml
Python library for analysis of time series data including dimensionality reduction, clustering, and Markov model estimation
deeptimeby deeptime-ml
Python
562
Version:v0.4.4
License: Weak Copyleft (LGPL-3.0)
siamesenetwork-tensorflowby ardiya
Using siamese network to do dimensionality reduction and similar image retrieval
siamesenetwork-tensorflowby ardiya
Jupyter Notebook
260
Version:Current
License: Permissive (MIT)
Model selection
Model selection is the process of selecting one final machine learning model from among a collection of candidate machine learning models for a training dataset.
yellowbrickby DistrictDataLabs
Visual analysis and diagnostic tools to facilitate machine learning model selection.
yellowbrickby DistrictDataLabs
Python
4016
Version:v1.5
License: Permissive (Apache-2.0)
ATMby HDI-Project
Auto Tune Models - A multi-tenant, multi-data system for automated machine learning (model selection and tuning).
ATMby HDI-Project
Python
509
Version:v0.2.2
License: Permissive (MIT)
backbone.collectionViewby rotundasoftware
Easily render backbone.js collections. In addition to managing model views, this class supports automatic selection of models in response to clicks, reordering models via drag and drop, and more.
backbone.collectionViewby rotundasoftware
JavaScript
175
Version:Current
License: Permissive (MIT)
Preprocessing
Data preprocessing is a process of preparing the raw data and making it suitable for a machine learning model. It is the first and crucial step while creating a machine learning model.
keras-preprocessingby keras-team
Utilities for working with image data, text data, and sequence data.
keras-preprocessingby keras-team
Python
1022
Version:1.1.0
License: Others (Non-SPDX)
python-wsi-preprocessingby deroneriksson
Python Whole Slide Image Preprocessing
python-wsi-preprocessingby deroneriksson
Python
201
Version:Current
License: No License
imagededupby idealo
😎 Finding duplicate images made easy!
imagededupby idealo
Python
4497
Version:v0.3.1
License: Permissive (Apache-2.0)