Scikit-learn (sklearn) is one of the most widely used machine learning libraries in Python. It provides efficient tools for machine learning and statistical modeling, including classification, regression, clustering, and dimensionality reduction, through a consistent Python interface.
Classification
In classification, the output variable is a discrete value, such as a class label. The task of a classification algorithm is to map an input value (x) to a discrete output variable (y).
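As a rough illustration of mapping inputs (x) to a discrete label (y), the sketch below fits a scikit-learn classifier on the bundled iris dataset. The choice of dataset, of `LogisticRegression`, and of the split parameters are assumptions made for the example, not something the listing prescribes.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Inputs X are flower measurements; targets y are discrete class labels (0, 1, 2).
X, y = load_iris(return_X_y=True)

# Hold out a quarter of the data to check how well the mapping generalizes.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Assumed model choice: any scikit-learn classifier exposes the same fit/predict calls.
clf = LogisticRegression(max_iter=1000)
clf.fit(X_train, y_train)

predictions = clf.predict(X_test)      # one discrete label per test sample
accuracy = clf.score(X_test, y_test)   # fraction of correctly predicted labels
print(f"Test accuracy: {accuracy:.2f}")
```

Swapping in another classifier should only require changing the line that constructs `clf`.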
cnn-text-classification-tf by dennybritz
Convolutional Neural Network for Text Classification in TensorFlow
Python 5574 Version:Current License: Permissive (Apache-2.0)
Chinese-Text-Classification-Pytorch by 649453932
Chinese text classification, TextCNN, TextRNN, FastText, TextRCNN, BiLSTM_Attention, DPCNN, Transformer, based on PyTorch, out of the box.
Python 4459 Version:Current License: Permissive (MIT)
pytorch-classification by bearpaw
Classification with PyTorch.
Python 1579 Version:Current License: Permissive (MIT)
Regression
In regression, the output variable is continuous, that is, a real value. The task of a regression algorithm is to map an input value (x) to a continuous output variable (y).
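A minimal sketch of that mapping, assuming synthetic data generated from a known line (y = 3x + 2 plus noise) so the fitted values can be checked by eye. The data, the noise level, and the use of `LinearRegression` are illustrative assumptions.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split

# Synthetic data (an assumption for illustration): y = 3x + 2 with Gaussian noise.
rng = np.random.default_rng(0)
X = rng.uniform(0, 10, size=(200, 1))
y = 3.0 * X[:, 0] + 2.0 + rng.normal(0.0, 1.0, size=200)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

reg = LinearRegression()
reg.fit(X_train, y_train)

y_pred = reg.predict(X_test)     # continuous predictions, one per test sample
r2 = reg.score(X_test, y_test)   # coefficient of determination on held-out data
print(f"slope={reg.coef_[0]:.2f}, intercept={reg.intercept_:.2f}, R^2={r2:.3f}")
```

The fitted slope and intercept should land close to the values used to generate the data, which is a quick sanity check that the model recovered the underlying relationship.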
regression-js by Tom-Alexander
Curve Fitting in JavaScript.
JavaScript 894 Version:Current License: Permissive (MIT)
cypress-visual-regression by mjhea0
Module for adding visual regression testing to Cypress
JavaScript 310 Version:Current License: Permissive (MIT)
Visual-Regression-Tracker by Visual-Regression-Tracker
Backend and Frontend application for tracking differences via image comparison
Shell 491 Version:4.20.7 License: Permissive (Apache-2.0)
Clustering
Clustering is a way of grouping data points into clusters of similar data points. Objects with similar characteristics stay in the same group, which has few or no similarities with the other groups.
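One way to picture this grouping is k-means on synthetic points. The sketch below assumes three well-separated blobs and `KMeans` as the algorithm; both are choices made for the example, and other clustering algorithms would follow the same pattern.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

# Synthetic 2-D points drawn around three centers (assumed for illustration).
X, _ = make_blobs(n_samples=300, centers=3, cluster_std=0.6, random_state=0)

# No labels are given to the algorithm; it groups points by distance alone.
kmeans = KMeans(n_clusters=3, n_init=10, random_state=0)
labels = kmeans.fit_predict(X)

print("Cluster sizes:", [int((labels == k).sum()) for k in range(3)])
print("Cluster centers:\n", kmeans.cluster_centers_)
```

Note that the number of clusters is passed in by hand here; in practice it is usually unknown and has to be estimated.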
moa by Waikato
MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detection and recommender systems) and tools for evaluation.
Java 537 Version:2021.07.0 License: Strong Copyleft (GPL-3.0)
image-similarity-clustering by zegami
This project allows images to be automatically grouped into like clusters using a combination of machine learning techniques.
Python 177 Version:Current License: Permissive (MIT)
ml-email-clustering by anthdm
Email clustering with machine learning
Python 150 Version:Current License: Permissive (MIT)
Dimensionality reduction
Dimensionality reduction converts a higher-dimensional dataset into a lower-dimensional one while preserving as much of the original information as possible.
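As a small sketch of the idea, principal component analysis (PCA) can project the four-feature iris dataset onto two components. PCA and the iris data are assumptions chosen for illustration; the explained variance ratio gives a rough measure of how much information survives the projection.

```python
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA

# The original dataset has 4 features per sample.
X, _ = load_iris(return_X_y=True)

# Project onto the 2 directions that capture the most variance.
pca = PCA(n_components=2)
X_reduced = pca.fit_transform(X)

retained = pca.explained_variance_ratio_.sum()
print(f"Shape before: {X.shape}, after: {X_reduced.shape}")
print(f"Share of variance retained: {retained:.2%}")
```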
feature-selector by WillKoehrsen
Feature selector is a tool for dimensionality reduction of machine learning datasets
Jupyter Notebook 2080 Version:Current License: Strong Copyleft (GPL-3.0)
deeptime by deeptime-ml
Python library for analysis of time series data including dimensionality reduction, clustering, and Markov model estimation
Python 562 Version:v0.4.4 License: Weak Copyleft (LGPL-3.0)
siamesenetwork-tensorflow by ardiya
Using siamese network to do dimensionality reduction and similar image retrieval
Jupyter Notebook 260 Version:Current License: Permissive (MIT)
Model selection
Model selection is the process of choosing one final machine learning model from a collection of candidate models for a given training dataset.
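One common way to compare candidates is cross-validation: score each model on the same folds and keep the one with the best mean score. The sketch below assumes three arbitrary candidate classifiers and 5-fold cross-validation on iris; the candidates and the selection criterion are illustrative, not a recommendation.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# Hypothetical pool of candidate models to choose from.
candidates = {
    "logistic_regression": LogisticRegression(max_iter=1000),
    "k_nearest_neighbors": KNeighborsClassifier(n_neighbors=5),
    "decision_tree": DecisionTreeClassifier(random_state=0),
}

# Mean 5-fold cross-validated accuracy for each candidate.
mean_scores = {
    name: cross_val_score(model, X, y, cv=5).mean()
    for name, model in candidates.items()
}

# Select the candidate with the highest mean score.
best_name = max(mean_scores, key=mean_scores.get)
for name, score in mean_scores.items():
    print(f"{name}: {score:.3f}")
print("Selected:", best_name)
```

In a real project the selected model would then be refit on the full training set and checked once on data that was not used during selection.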
yellowbrick by DistrictDataLabs
Visual analysis and diagnostic tools to facilitate machine learning model selection.
Python 4016 Version:v1.5 License: Permissive (Apache-2.0)
ATM by HDI-Project
Auto Tune Models - A multi-tenant, multi-data system for automated machine learning (model selection and tuning).
Python 509 Version:v0.2.2 License: Permissive (MIT)
backbone.collectionView by rotundasoftware
Easily render backbone.js collections. In addition to managing model views, this class supports automatic selection of models in response to clicks, reordering models via drag and drop, and more.
JavaScript 175 Version:Current License: Permissive (MIT)
Preprocessing
Data preprocessing is the process of preparing raw data and making it suitable for a machine learning model. It is the first, crucial step in building a machine learning model.
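A minimal sketch of such preparation, assuming a tiny made-up table with one missing value and features on very different scales. The two steps shown (mean imputation, then standardization) are just one plausible pipeline; which steps are appropriate depends on the data and the model.

```python
import numpy as np
from sklearn.impute import SimpleImputer
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Made-up raw data: the second column has a missing value and a much larger scale.
raw = np.array([
    [1.0, 200.0],
    [2.0, np.nan],
    [3.0, 400.0],
    [4.0, 600.0],
])

# Step 1 fills missing values with the column mean;
# step 2 rescales each column to zero mean and unit variance.
preprocess = make_pipeline(SimpleImputer(strategy="mean"), StandardScaler())
clean = preprocess.fit_transform(raw)

print(clean)
```

Wrapping the steps in a pipeline means the same fitted transformations can later be applied to new data with `preprocess.transform`.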
keras-preprocessing by keras-team
Utilities for working with image data, text data, and sequence data.
Python 1022 Version:1.1.0 License: Others (Non-SPDX)
python-wsi-preprocessing by deroneriksson
Python Whole Slide Image Preprocessing
Python 201 Version:Current License: No License
imagededup by idealo
😎 Finding duplicate images made easy!
Python 4497 Version:v0.3.1 License: Permissive (Apache-2.0)