Getting started with Predictive Analysis

share link

by Sri Balaji J dot icon Updated: Jun 13, 2022

technology logo
technology logo

Solution Kit Solution Kit  

Scikit-learn (Sklearn) is the most useful and robust library for machine learning in Python. It provides a selection of efficient tools for machine learning and statistical modeling including classification, regression, clustering, and dimensionality reduction via a consistence interface in Python.

Features of scikit learn are : Supervised Learning algorithms − Almost all the popular supervised learning algorithms, like Linear Regression, Support Vector Machine (SVM), Decision Tree etc., are the part of scikit-learn. Unsupervised Learning algorithms − On the other hand, it also has all the popular unsupervised learning algorithms from clustering, factor analysis, PCA (Principal Component Analysis) to unsupervised neural networks. Clustering − This model is used for grouping unlabeled data. Dimensionality Reduction − It is used for reducing the number of attributes in data which can be further used for summarisation, visualisation and feature selection. Ensemble methods − As name suggest, it is used for combining the predictions of multiple supervised models. Feature extraction − It is used to extract the features from data to define the attributes in image and text data. Feature selection − It is used to identify useful attributes to create supervised models. Some libraries apart from scikit learn are :

Classification

In Classification, the output variable must be a discrete value. The task of the classification algorithm is to map the input value(x) with the discrete output variable(y).

Python doticonstar image 5574 doticonVersion:Currentdoticon
License: Permissive (Apache-2.0)

Convolutional Neural Network for Text Classification in Tensorflow

Support
    Quality
      Security
        License
          Reuse

            cnn-text-classification-tfby dennybritz

            Python doticon star image 5574 doticonVersion:Currentdoticon License: Permissive (Apache-2.0)

            Convolutional Neural Network for Text Classification in Tensorflow
            Support
              Quality
                Security
                  License
                    Reuse
                      Python doticonstar image 4459 doticonVersion:Currentdoticon
                      License: Permissive (MIT)

                      Chinese text classification, TextCNN, TextRNN, FastText, TextRCNN, BiLSTM_Attention, DPCNN, Transformer, based on pytorch, out of the box.

                      Support
                        Quality
                          Security
                            License
                              Reuse

                                Chinese-Text-Classification-Pytorchby 649453932

                                Python doticon star image 4459 doticonVersion:Currentdoticon License: Permissive (MIT)

                                Chinese text classification, TextCNN, TextRNN, FastText, TextRCNN, BiLSTM_Attention, DPCNN, Transformer, based on pytorch, out of the box.
                                Support
                                  Quality
                                    Security
                                      License
                                        Reuse
                                          Python doticonstar image 1579 doticonVersion:Currentdoticon
                                          License: Permissive (MIT)

                                          Classification with PyTorch.

                                          Support
                                            Quality
                                              Security
                                                License
                                                  Reuse

                                                    pytorch-classificationby bearpaw

                                                    Python doticon star image 1579 doticonVersion:Currentdoticon License: Permissive (MIT)

                                                    Classification with PyTorch.
                                                    Support
                                                      Quality
                                                        Security
                                                          License
                                                            Reuse

                                                              Regression

                                                              In Regression, the output variable must be of continuous nature or real value. The task of the regression algorithm is to map the input value (x) with the continuous output variable(y).

                                                              regression-jsby Tom-Alexander

                                                              JavaScript doticonstar image 894 doticonVersion:Currentdoticon
                                                              License: Permissive (MIT)

                                                              Curve Fitting in JavaScript.

                                                              Support
                                                                Quality
                                                                  Security
                                                                    License
                                                                      Reuse

                                                                        regression-jsby Tom-Alexander

                                                                        JavaScript doticon star image 894 doticonVersion:Currentdoticon License: Permissive (MIT)

                                                                        Curve Fitting in JavaScript.
                                                                        Support
                                                                          Quality
                                                                            Security
                                                                              License
                                                                                Reuse
                                                                                  JavaScript doticonstar image 310 doticonVersion:Currentdoticon
                                                                                  License: Permissive (MIT)

                                                                                  Module for adding visual regression testing to Cypress

                                                                                  Support
                                                                                    Quality
                                                                                      Security
                                                                                        License
                                                                                          Reuse

                                                                                            cypress-visual-regressionby mjhea0

                                                                                            JavaScript doticon star image 310 doticonVersion:Currentdoticon License: Permissive (MIT)

                                                                                            Module for adding visual regression testing to Cypress
                                                                                            Support
                                                                                              Quality
                                                                                                Security
                                                                                                  License
                                                                                                    Reuse

                                                                                                      Visual-Regression-Trackerby Visual-Regression-Tracker

                                                                                                      Shell doticonstar image 491 doticonVersion:4.20.7doticon
                                                                                                      License: Permissive (Apache-2.0)

                                                                                                      Backend and Frontend application for tracking differences via image comparison

                                                                                                      Support
                                                                                                        Quality
                                                                                                          Security
                                                                                                            License
                                                                                                              Reuse

                                                                                                                Visual-Regression-Trackerby Visual-Regression-Tracker

                                                                                                                Shell doticon star image 491 doticonVersion:4.20.7doticon License: Permissive (Apache-2.0)

                                                                                                                Backend and Frontend application for tracking differences via image comparison
                                                                                                                Support
                                                                                                                  Quality
                                                                                                                    Security
                                                                                                                      License
                                                                                                                        Reuse

                                                                                                                          Clustering

                                                                                                                          A way of grouping the data points into different clusters, consisting of similar data points. The objects with the possible similarities remain in a group that has less or no similarities with another group.

                                                                                                                          moaby Waikato

                                                                                                                          Java doticonstar image 537 doticonVersion:2021.07.0doticon
                                                                                                                          License: Strong Copyleft (GPL-3.0)

                                                                                                                          MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detection and recommender systems) and tools for evaluation.

                                                                                                                          Support
                                                                                                                            Quality
                                                                                                                              Security
                                                                                                                                License
                                                                                                                                  Reuse

                                                                                                                                    moaby Waikato

                                                                                                                                    Java doticon star image 537 doticonVersion:2021.07.0doticon License: Strong Copyleft (GPL-3.0)

                                                                                                                                    MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detection and recommender systems) and tools for evaluation.
                                                                                                                                    Support
                                                                                                                                      Quality
                                                                                                                                        Security
                                                                                                                                          License
                                                                                                                                            Reuse
                                                                                                                                              Python doticonstar image 177 doticonVersion:Currentdoticon
                                                                                                                                              License: Permissive (MIT)

                                                                                                                                              This project allows images to be automatically grouped into like clusters using a combination of machine learning techniques.

                                                                                                                                              Support
                                                                                                                                                Quality
                                                                                                                                                  Security
                                                                                                                                                    License
                                                                                                                                                      Reuse

                                                                                                                                                        image-similarity-clusteringby zegami

                                                                                                                                                        Python doticon star image 177 doticonVersion:Currentdoticon License: Permissive (MIT)

                                                                                                                                                        This project allows images to be automatically grouped into like clusters using a combination of machine learning techniques.
                                                                                                                                                        Support
                                                                                                                                                          Quality
                                                                                                                                                            Security
                                                                                                                                                              License
                                                                                                                                                                Reuse
                                                                                                                                                                  Python doticonstar image 150 doticonVersion:Currentdoticon
                                                                                                                                                                  License: Permissive (MIT)

                                                                                                                                                                  Email clustering with machine learning

                                                                                                                                                                  Support
                                                                                                                                                                    Quality
                                                                                                                                                                      Security
                                                                                                                                                                        License
                                                                                                                                                                          Reuse

                                                                                                                                                                            ml-email-clusteringby anthdm

                                                                                                                                                                            Python doticon star image 150 doticonVersion:Currentdoticon License: Permissive (MIT)

                                                                                                                                                                            Email clustering with machine learning
                                                                                                                                                                            Support
                                                                                                                                                                              Quality
                                                                                                                                                                                Security
                                                                                                                                                                                  License
                                                                                                                                                                                    Reuse

                                                                                                                                                                                      Dimensionality reduction

                                                                                                                                                                                      It is a way of converting the higher dimensions dataset into lesser dimensions dataset ensuring that it provides similar information.

                                                                                                                                                                                      feature-selectorby WillKoehrsen

                                                                                                                                                                                      Jupyter Notebook doticonstar image 2080 doticonVersion:Currentdoticon
                                                                                                                                                                                      License: Strong Copyleft (GPL-3.0)

                                                                                                                                                                                      Feature selector is a tool for dimensionality reduction of machine learning datasets

                                                                                                                                                                                      Support
                                                                                                                                                                                        Quality
                                                                                                                                                                                          Security
                                                                                                                                                                                            License
                                                                                                                                                                                              Reuse

                                                                                                                                                                                                feature-selectorby WillKoehrsen

                                                                                                                                                                                                Jupyter Notebook doticon star image 2080 doticonVersion:Currentdoticon License: Strong Copyleft (GPL-3.0)

                                                                                                                                                                                                Feature selector is a tool for dimensionality reduction of machine learning datasets
                                                                                                                                                                                                Support
                                                                                                                                                                                                  Quality
                                                                                                                                                                                                    Security
                                                                                                                                                                                                      License
                                                                                                                                                                                                        Reuse

                                                                                                                                                                                                          deeptimeby deeptime-ml

                                                                                                                                                                                                          Python doticonstar image 562 doticonVersion:v0.4.4doticon
                                                                                                                                                                                                          License: Weak Copyleft (LGPL-3.0)

                                                                                                                                                                                                          Python library for analysis of time series data including dimensionality reduction, clustering, and Markov model estimation

                                                                                                                                                                                                          Support
                                                                                                                                                                                                            Quality
                                                                                                                                                                                                              Security
                                                                                                                                                                                                                License
                                                                                                                                                                                                                  Reuse

                                                                                                                                                                                                                    deeptimeby deeptime-ml

                                                                                                                                                                                                                    Python doticon star image 562 doticonVersion:v0.4.4doticon License: Weak Copyleft (LGPL-3.0)

                                                                                                                                                                                                                    Python library for analysis of time series data including dimensionality reduction, clustering, and Markov model estimation
                                                                                                                                                                                                                    Support
                                                                                                                                                                                                                      Quality
                                                                                                                                                                                                                        Security
                                                                                                                                                                                                                          License
                                                                                                                                                                                                                            Reuse
                                                                                                                                                                                                                              Jupyter Notebook doticonstar image 260 doticonVersion:Currentdoticon
                                                                                                                                                                                                                              License: Permissive (MIT)

                                                                                                                                                                                                                              Using siamese network to do dimensionality reduction and similar image retrieval

                                                                                                                                                                                                                              Support
                                                                                                                                                                                                                                Quality
                                                                                                                                                                                                                                  Security
                                                                                                                                                                                                                                    License
                                                                                                                                                                                                                                      Reuse

                                                                                                                                                                                                                                        siamesenetwork-tensorflowby ardiya

                                                                                                                                                                                                                                        Jupyter Notebook doticon star image 260 doticonVersion:Currentdoticon License: Permissive (MIT)

                                                                                                                                                                                                                                        Using siamese network to do dimensionality reduction and similar image retrieval
                                                                                                                                                                                                                                        Support
                                                                                                                                                                                                                                          Quality
                                                                                                                                                                                                                                            Security
                                                                                                                                                                                                                                              License
                                                                                                                                                                                                                                                Reuse

                                                                                                                                                                                                                                                  Model selection

                                                                                                                                                                                                                                                  Model selection is the process of selecting one final machine learning model from among a collection of candidate machine learning models for a training dataset.

                                                                                                                                                                                                                                                  yellowbrickby DistrictDataLabs

                                                                                                                                                                                                                                                  Python doticonstar image 4016 doticonVersion:v1.5doticon
                                                                                                                                                                                                                                                  License: Permissive (Apache-2.0)

                                                                                                                                                                                                                                                  Visual analysis and diagnostic tools to facilitate machine learning model selection.

                                                                                                                                                                                                                                                  Support
                                                                                                                                                                                                                                                    Quality
                                                                                                                                                                                                                                                      Security
                                                                                                                                                                                                                                                        License
                                                                                                                                                                                                                                                          Reuse

                                                                                                                                                                                                                                                            yellowbrickby DistrictDataLabs

                                                                                                                                                                                                                                                            Python doticon star image 4016 doticonVersion:v1.5doticon License: Permissive (Apache-2.0)

                                                                                                                                                                                                                                                            Visual analysis and diagnostic tools to facilitate machine learning model selection.
                                                                                                                                                                                                                                                            Support
                                                                                                                                                                                                                                                              Quality
                                                                                                                                                                                                                                                                Security
                                                                                                                                                                                                                                                                  License
                                                                                                                                                                                                                                                                    Reuse

                                                                                                                                                                                                                                                                      ATMby HDI-Project

                                                                                                                                                                                                                                                                      Python doticonstar image 509 doticonVersion:v0.2.2doticon
                                                                                                                                                                                                                                                                      License: Permissive (MIT)

                                                                                                                                                                                                                                                                      Auto Tune Models - A multi-tenant, multi-data system for automated machine learning (model selection and tuning).

                                                                                                                                                                                                                                                                      Support
                                                                                                                                                                                                                                                                        Quality
                                                                                                                                                                                                                                                                          Security
                                                                                                                                                                                                                                                                            License
                                                                                                                                                                                                                                                                              Reuse

                                                                                                                                                                                                                                                                                ATMby HDI-Project

                                                                                                                                                                                                                                                                                Python doticon star image 509 doticonVersion:v0.2.2doticon License: Permissive (MIT)

                                                                                                                                                                                                                                                                                Auto Tune Models - A multi-tenant, multi-data system for automated machine learning (model selection and tuning).
                                                                                                                                                                                                                                                                                Support
                                                                                                                                                                                                                                                                                  Quality
                                                                                                                                                                                                                                                                                    Security
                                                                                                                                                                                                                                                                                      License
                                                                                                                                                                                                                                                                                        Reuse

                                                                                                                                                                                                                                                                                          backbone.collectionViewby rotundasoftware

                                                                                                                                                                                                                                                                                          JavaScript doticonstar image 175 doticonVersion:Currentdoticon
                                                                                                                                                                                                                                                                                          License: Permissive (MIT)

                                                                                                                                                                                                                                                                                          Easily render backbone.js collections. In addition to managing model views, this class supports automatic selection of models in response to clicks, reordering models via drag and drop, and more.

                                                                                                                                                                                                                                                                                          Support
                                                                                                                                                                                                                                                                                            Quality
                                                                                                                                                                                                                                                                                              Security
                                                                                                                                                                                                                                                                                                License
                                                                                                                                                                                                                                                                                                  Reuse

                                                                                                                                                                                                                                                                                                    backbone.collectionViewby rotundasoftware

                                                                                                                                                                                                                                                                                                    JavaScript doticon star image 175 doticonVersion:Currentdoticon License: Permissive (MIT)

                                                                                                                                                                                                                                                                                                    Easily render backbone.js collections. In addition to managing model views, this class supports automatic selection of models in response to clicks, reordering models via drag and drop, and more.
                                                                                                                                                                                                                                                                                                    Support
                                                                                                                                                                                                                                                                                                      Quality
                                                                                                                                                                                                                                                                                                        Security
                                                                                                                                                                                                                                                                                                          License
                                                                                                                                                                                                                                                                                                            Reuse

                                                                                                                                                                                                                                                                                                              Preprocessing

                                                                                                                                                                                                                                                                                                              Data preprocessing is a process of preparing the raw data and making it suitable for a machine learning model. It is the first and crucial step while creating a machine learning model.

                                                                                                                                                                                                                                                                                                              keras-preprocessingby keras-team

                                                                                                                                                                                                                                                                                                              Python doticonstar image 1022 doticonVersion:1.1.0doticon
                                                                                                                                                                                                                                                                                                              License: Others (Non-SPDX)

                                                                                                                                                                                                                                                                                                              Utilities for working with image data, text data, and sequence data.

                                                                                                                                                                                                                                                                                                              Support
                                                                                                                                                                                                                                                                                                                Quality
                                                                                                                                                                                                                                                                                                                  Security
                                                                                                                                                                                                                                                                                                                    License
                                                                                                                                                                                                                                                                                                                      Reuse

                                                                                                                                                                                                                                                                                                                        keras-preprocessingby keras-team

                                                                                                                                                                                                                                                                                                                        Python doticon star image 1022 doticonVersion:1.1.0doticon License: Others (Non-SPDX)

                                                                                                                                                                                                                                                                                                                        Utilities for working with image data, text data, and sequence data.
                                                                                                                                                                                                                                                                                                                        Support
                                                                                                                                                                                                                                                                                                                          Quality
                                                                                                                                                                                                                                                                                                                            Security
                                                                                                                                                                                                                                                                                                                              License
                                                                                                                                                                                                                                                                                                                                Reuse

                                                                                                                                                                                                                                                                                                                                  python-wsi-preprocessingby deroneriksson

                                                                                                                                                                                                                                                                                                                                  Python doticonstar image 201 doticonVersion:Currentdoticon
                                                                                                                                                                                                                                                                                                                                  no licences License: No License (null)

                                                                                                                                                                                                                                                                                                                                  Python Whole Slide Image Preprocessing

                                                                                                                                                                                                                                                                                                                                  Support
                                                                                                                                                                                                                                                                                                                                    Quality
                                                                                                                                                                                                                                                                                                                                      Security
                                                                                                                                                                                                                                                                                                                                        License
                                                                                                                                                                                                                                                                                                                                          Reuse

                                                                                                                                                                                                                                                                                                                                            python-wsi-preprocessingby deroneriksson

                                                                                                                                                                                                                                                                                                                                            Python doticon star image 201 doticonVersion:Currentdoticonno licences License: No License

                                                                                                                                                                                                                                                                                                                                            Python Whole Slide Image Preprocessing
                                                                                                                                                                                                                                                                                                                                            Support
                                                                                                                                                                                                                                                                                                                                              Quality
                                                                                                                                                                                                                                                                                                                                                Security
                                                                                                                                                                                                                                                                                                                                                  License
                                                                                                                                                                                                                                                                                                                                                    Reuse

                                                                                                                                                                                                                                                                                                                                                      imagededupby idealo

                                                                                                                                                                                                                                                                                                                                                      Python doticonstar image 4497 doticonVersion:v0.3.1doticon
                                                                                                                                                                                                                                                                                                                                                      License: Permissive (Apache-2.0)

                                                                                                                                                                                                                                                                                                                                                      😎 Finding duplicate images made easy!

                                                                                                                                                                                                                                                                                                                                                      Support
                                                                                                                                                                                                                                                                                                                                                        Quality
                                                                                                                                                                                                                                                                                                                                                          Security
                                                                                                                                                                                                                                                                                                                                                            License
                                                                                                                                                                                                                                                                                                                                                              Reuse

                                                                                                                                                                                                                                                                                                                                                                imagededupby idealo

                                                                                                                                                                                                                                                                                                                                                                Python doticon star image 4497 doticonVersion:v0.3.1doticon License: Permissive (Apache-2.0)

                                                                                                                                                                                                                                                                                                                                                                😎 Finding duplicate images made easy!
                                                                                                                                                                                                                                                                                                                                                                Support
                                                                                                                                                                                                                                                                                                                                                                  Quality
                                                                                                                                                                                                                                                                                                                                                                    Security
                                                                                                                                                                                                                                                                                                                                                                      License
                                                                                                                                                                                                                                                                                                                                                                        Reuse

                                                                                                                                                                                                                                                                                                                                                                          See similar Kits and Libraries