sklearn2pmml | Python library for converting Scikit-Learn pipelines to PMML | Machine Learning library

by jpmml Python Version: 0.109.0 License: AGPL-3.0

X-Ray Key Features Code Snippets(10)Community Discussions(5)Vulnerabilities Install Support

kandi X-RAY | sklearn2pmml Summary

sklearn2pmml is a Python library typically used in Artificial Intelligence, Machine Learning applications. sklearn2pmml has no bugs, it has no vulnerabilities, it has build file available, it has a Strong Copyleft License and it has low support. You can install using 'pip install sklearn2pmml' or download it from GitHub, PyPI.

Python library for converting Scikit-Learn pipelines to PMML

Support

Quality

Security

License

Reuse

Support

sklearn2pmml has a low active ecosystem.

It has 653 star(s) with 106 fork(s). There are 29 watchers for this library.

It had no major release in the last 12 months.

There are 14 open issues and 331 have been closed. On average issues are closed in 22 days. There are no pull requests.

It has a neutral sentiment in the developer community.

The latest version of sklearn2pmml is 0.109.0

Quality

sklearn2pmml has 0 bugs and 0 code smells.

Security

sklearn2pmml has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.

sklearn2pmml code analysis shows 0 unresolved vulnerabilities.

There are 0 security hotspots that need review.

License

sklearn2pmml is licensed under the AGPL-3.0 License. This license is Strong Copyleft.

Strong Copyleft licenses enforce sharing, and you can use them when creating open source projects.

Reuse

sklearn2pmml releases are not available. You will need to build from source code and install.

Deployable package is available in PyPI.

Build file is available. You can build the component from source.

Installation instructions are not available. Examples and code snippets are available.

sklearn2pmml saves you 1045 person hours of effort in developing the same functionality from scratch.

It has 2860 lines of code, 311 functions and 31 files.

It has low code complexity. Code complexity directly impacts maintainability of the code.

Top functions reviewed by kandi - BETA

kandi has reviewed sklearn2pmml and discovered the below as its top functions. This is intended to give you an instant insight into sklearn2pmml implemented functionality, and help decide if they suit your requirements.

Convert a sklearn pipeline to pmml
Return the java version
Dump object to disk
Return a list of the classpath
Fit the model
Compute mask for missing values
Count the frequencies in the given mask
Cast X to dtype
Make an Xgboost dataframe mapper
Return True if dtype is a categorical
Check if dtype is categorical
Transform valid values in X
Replace missing values
Construct a tree from training data
Return a boolean mask for the input data
Apply function to each row of rows
Check if a GBDDT is supported
Transform X
Replace invalid values
Check if the given gbdDT is supported
Compute the duration of the time series
Convert X to int
Apply the apply transformation
Apply the step
Apply function to data
Compute time series

Get all kandi verified functions for this library.

sklearn2pmml Key Features

No Key Features are available at this moment for sklearn2pmml.

sklearn2pmml Examples and Code Snippets

How to do Onehotencoding in Sklearn Pipeline

Python

Lines of Code : 12

License : Strong Copyleft (CC BY-SA 4.0)

Copy

mapper = DataFrameMapper(
    [(d, LabelBinarizer()) for d in dummies]
)

mapper = DataFrameMapper(
    [(d, LabelEncoder()) for d in dummies]
)

lm = PMMLPipeline([("mapper", mapper),
                   ("onehot",

How can I do HyperParameter Tuning for PMML model in python?

Python

Lines of Code : 12

License : Strong Copyleft (CC BY-SA 4.0)

Copy

pipeline = PMMLPipeline([
  ("tuned-rf", GridSearchCV(RandomForestClassifier(..), param_grid = {..}))
])
pipeline.fit(X, y)

pipeline = PMMLPipeline([
  ("rf", RandomForestClassifier(..))
])
gridsearch = GridSearchC

Custom function in sklearn2pmml PMMLPipeline

Python

Lines of Code : 19

License : Strong Copyleft (CC BY-SA 4.0)

Copy

from sklearn.preprocessing import FunctionTransformer
import numpy as np
from sklearn2pmml import make_pmml_pipeline

# fake data with 7 columns
X = np.random.rand(10,7)

n_rows = X.shape[0]

def custom_function(X):
    #averiging 4 first

Custom function in sklearn2pmml PMMLPipeline

Python

Lines of Code : 12

License : Strong Copyleft (CC BY-SA 4.0)

Copy

from sklearn_pandas import DataFrameMapper
from sklearn2pmml.preprocessing import Aggregator

pipeline = PMMLPipeline([
  ("mapper", DataFrameMapper([
    (["stiffness_1", "stiffness_2", "stiffness_3", "stiffness_4"], Aggregator(function =

verbose parameter not working in PMML pipeline

Python

Lines of Code : 3

License : Strong Copyleft (CC BY-SA 4.0)

Copy

pipeline = PMMLPipeline([('classifier', clf)])
pipeline.fit(x, y, classifier__gbdt__verbose = 2)

How to convert python xgboost model into the pmml?

Python

Lines of Code : 7

License : Strong Copyleft (CC BY-SA 4.0)

Copy

pmml_pipeline = PMMLPipeline([
  ("regressor", XGBRegressor())
])
tuner = GridSearchCV(pmml_pipeline, ...)
tuner.fit(X, y)
sklearn2pmml(tuner.best_estimator_, "xgbregressor-pipeline.pmml")

How to Build PMML from Logistic Regression Coefficients

Python

Lines of Code : 16

License : Strong Copyleft (CC BY-SA 4.0)

Copy

from sklearn.linear_model import LogisticRegression
from skleanr2pmml import sklearn2pmml
from sklearn2pmml.pipeline import PMMLPipeline

logreg = LogisticRegression()
logreg.classes_ = numpy.asarray([, ])
logreg.coef_ = numpy.asarray([])

How to generate an ifelse transformer in a PMML file?

Python

Lines of Code : 2

License : Strong Copyleft (CC BY-SA 4.0)

Copy

transformer = ExpressionTransformer("1 if X['Y'] < 1 else X[1]")

Is there a way to import a pmml file into python?

Python

Lines of Code : 11

License : Strong Copyleft (CC BY-SA 4.0)

Copy

org.dmg.pmml.PMML pmml = loadFromFile(..)
org.dmg.pmml.Visitor mfUpdater = new org.jpmml.model.visitors.AbstractVisitor(){
  @Override
  public VisitorAction visit(MiningField miningField){
    miningField.setInvalidValueTreatment(InvalidV

Unable to create PMMLPipeline

Python

Lines of Code : 20

License : Strong Copyleft (CC BY-SA 4.0)

Copy

pipeline = PMMLPipeline([("mapper", DataFrameMapper([
                                                 (["balanced_data.type",
                                                   "balanced_data.amount",

Community Discussions

Trending Discussions on sklearn2pmml

DataType of InputField is double although in the PMMLPipeline it is string

PMML can not be created because the number of input features is not specified

pmml4s model.predict() returns array instead of single value

Error writing XGBoost Classifier to pmml with sklearn2pmml

How to make predictions with Syncfusion PMML from a Neural Network trained with sklearn MLPClassifier?

QUESTION

DataType of InputField is double although in the PMMLPipeline it is string

Asked 2020-May-25 at 11:27

I am exporting a PMMLPipeline with a categorical string feature day_of_week as a PMML file. When I open the file in Java and list the InputFields I see that the data type of day_of_week field is double:

...

ANSWER

Answered 2020-May-25 at 11:27

You can assist SkLearn2PMML by providing "feature type hints" using sklearn2pmml.decoration.CategoricalDomain and sklearn2pmml.decoration.ContinuousDomain decorators (see here for more details).

In the current case, you should prepend a CategoricalDomain step to the pipeline that deals with categorical features:

Source https://stackoverflow.com/questions/61999765

QUESTION

PMML can not be created because the number of input features is not specified

Asked 2020-May-25 at 07:56

I cannot convert the following pipeline to pmml because "the number of input features is not specified".

A minimal example pipeline that reproduces the error is:

...

ANSWER

Answered 2020-May-25 at 07:56

What's the point of creating a single-step sklearn2pmml.pipline.PMMLPipeline based on a sklearn.pipeline.Pipeline?

Leave out this no-op, and the conversion should succeed:

Source https://stackoverflow.com/questions/61997983

QUESTION

pmml4s model.predict() returns array instead of single value

Asked 2020-May-07 at 14:02

I used sklearn2pmml to serialize my decision tree classifier to a pmml file. I used pmml4s in java to deserialize the model and use it to predict.

Iuse the code below to make a prediction over a single incoming value. This should return either 0/1/2/3/4/5/6.

...

ANSWER

Answered 2020-May-07 at 08:10

It is certainty of model for each class. In your case it means that it's 4 with probability 94.5% or 5 with probability 5.5% In simple case, if you want to receive value, you should pick index for the maximal value.

However you might use this probabilities to additional control logic, like thresholding when decision is ambiguous (two values with probability ~0.4, etc.)

Source https://stackoverflow.com/questions/61652525

QUESTION

Error writing XGBoost Classifier to pmml with sklearn2pmml

Asked 2020-Feb-21 at 15:32

I want to save my XGBoost model as pmml using sklearn2pmml. I'm using Python V3.7.3 with Sklearn 0.20.3 & sklearn2pmml V0.53.0. My data is mainly binary, with just 3 columns of continuous data, I'm running my notebook in Databricks and convert my Spark dataframe to a pandas dataframe. Code snippet below

...

ANSWER

Answered 2020-Feb-21 at 15:32

It is probably a XGBoost package version issue. The SkLearn2PMML package expects the label encoder (XGBClassifier._le attribute) to be a "normal" Scikit-Learn label encoder class (sklearn.preprocessing.(label|_label).LabelEncoder), but in your case it's something different (xgboost.compat.XGBoostLabelEncoder).

In which XGBOost package version was this xgboost.compat.XGBoostLabelEncoder introduced? It's either some very old, or very new thing.

In any case, please open a feature request with the JPMML-SkLearn project here to have this issue sorted out.

Source https://stackoverflow.com/questions/60340231

QUESTION

How to make predictions with Syncfusion PMML from a Neural Network trained with sklearn MLPClassifier?

Asked 2020-Jan-24 at 04:19

I trained a model in Python using sklearn.neural_network.MLPClassifier (0.20.3) and saved it in PMML format using sklearn2pmml (0.48.0). The saved PMML model works as expected when loaded in Java using org.jpmml:pmml-evaluator:1.4.14.

I now want to load the PMML model and make predictions in C# using the Syncfusion package:

...

ANSWER

Answered 2020-Jan-23 at 11:49

We have checked sample PMML file using NeuralNetworkModelEvaluator and we couldn’t reproduce the issue. Can you share your PMML file to check our side and provide you the solution sooner.

Also, we would suggest you to try the below code,

Source https://stackoverflow.com/questions/59853972

Community Discussions, Code Snippets contain sources that include Stack Exchange Network

Vulnerabilities

No vulnerabilities reported

Install sklearn2pmml

You can install using 'pip install sklearn2pmml' or download it from GitHub, PyPI.
You can use sklearn2pmml like any standard Python library. You will need to make sure that you have a development environment consisting of a Python distribution including header files, a compiler, pip, and git installed. Make sure that your pip, setuptools, and wheel are up to date. When using pip it is generally recommended to install packages in a virtual environment to avoid changes to the system.

Support

For any new features, suggestions and bugs create an issue on GitHub. If you have any questions check and ask questions on community page Stack Overflow .

Find more information at: