shapley | Compute Shapley-Shorrocks value decompositions | Machine Learning library

by elbersb R Version: Current License: Non-SPDX

X-Ray Key Features Code Snippets(2)Community Discussions(10)Vulnerabilities Install Support

kandi X-RAY | shapley Summary

shapley is a R library typically used in Artificial Intelligence, Machine Learning, Deep Learning applications. shapley has no bugs, it has no vulnerabilities and it has low support. However shapley has a Non-SPDX License. You can download it from GitHub.

The Shapley value is a concept from game theory that quantifies how much each player contributes to the game outcome (Shapley 1953). The concept, however, has many more use cases: it provides a method to quantify the importance of predictors in regression analysis or machine learning models, and can be used in a wide variety of decomposition problems (Shorrocks 2013). Most implementations focus on one narrow use case, although the algorithm for the Shapley value decomposition is always the same – it is just the concrete value function that varies. This package provides a simple algorithm for the Shapley value decomposition, and also supports hierarchical decomposition using the Owen value. The key advantage of the Shapley decomposition framework is the connection with counterfactuals: Once appropriate counterfactuals for each combination of factors have been identified, the method will produce an appropriate decomposition.

Support

Quality

Security

License

Reuse

Support

shapley has a low active ecosystem.

It has 5 star(s) with 1 fork(s). There are 2 watchers for this library.

It had no major release in the last 6 months.

There are 0 open issues and 1 have been closed. On average issues are closed in 8 days. There are no pull requests.

It has a neutral sentiment in the developer community.

The latest version of shapley is current.

Quality

shapley has 0 bugs and 0 code smells.

Security

shapley has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.

shapley code analysis shows 0 unresolved vulnerabilities.

There are 0 security hotspots that need review.

License

shapley has a Non-SPDX License.

Non-SPDX licenses can be open source with a non SPDX compliant license, or non open source licenses, and you need to review them closely before use.

Reuse

shapley releases are not available. You will need to build from source code and install.

Installation instructions are not available. Examples and code snippets are available.

Top functions reviewed by kandi - BETA

kandi's functional review helps you automatically verify the functionalities of the libraries and avoid rework.
Currently covering the most popular Java, JavaScript and Python libraries. See a Sample of shapley

Get all kandi verified functions for this library.

shapley Key Features

No Key Features are available at this moment for shapley.

shapley Examples and Code Snippets

Deep learning example with DeepExplainer (TensorFlow/Keras models)

pypi

Lines of Code : 16

License : No License

Copy

# ...include code from https://github.com/keras-team/keras/blob/master/examples/mnist_cnn.py

import shap
import numpy as np

# select a set of background examples to take an expectation over
background = x_train[np.random.choice(x_train.shape[0], 10

Natural language example (transformers)

pypi

Lines of Code : 12

License : No License

Copy

import transformers
import shap

# load a transformers pipeline model
model = transformers.pipeline('sentiment-analysis', return_all_scores=True)

# explain the model on two sample inputs
explainer = shap.Explainer(model) 
shap_values = explainer(["W

Community Discussions

Trending Discussions on shapley

Manipulating an R dataframe consisting of vector and list columns to 'long' format

Creating a loop to utilise it in R package GameTheory

Shapely point object and Geopandas data frame yield empty output

Retrieve Model Results for Shapley values from GridSearch CV

Given a polygon & line in It: Find position on line where to split the polygon with perpendicular line to get subpolygon with cetain area

In Leave One Out Cross Validation, How can I Use `shap.Explainer()` Function to Explain a Machine Learning Model?

JS array returning element at index i+1 instead of index i element

How to understand Shapley value for binary classification problem?

shap.Explainer constructor error asking for undocumented positional argument

'tensorflow_federated' has no attribute 'NamedTupleType

QUESTION

Manipulating an R dataframe consisting of vector and list columns to 'long' format

Asked 2022-Jan-27 at 22:02

Please see the attached dput. I would need to transform the dataframe in question to a form that consists of five columns: Area, Group, Seats, Votes (%) and ShapleyShubik. The number of rows per certain area should be dependent on the number of Groups within that Area. I believe this desired end result is somewhat like of what is referenced as 'long format' of data.

...

ANSWER

Answered 2022-Jan-27 at 22:02

It looks like you're fairly new to SO; welcome to the community! To get the best answers quickly, it's always best to make your question reproducible. You've got the data here, but not the libraries.

Either way, I think I can help. This is using several of the packages called with tidyverse.

Source https://stackoverflow.com/questions/70884134

QUESTION

Creating a loop to utilise it in R package GameTheory

Asked 2022-Jan-26 at 19:18

I'm using R package GameTheory to calculate Shapley-Shubik power indices. The command itself is very simple, ShapleyShubik(quota, y, Names = NULL)

where quota is the minimum amount of votes to pass a vote, y seats of each party (number of) and Names are labels for the parties. It is simple to use 'manually', but I would like extend my usage to automate it to iterate through a vast amount of data that is compiled in an dataframe DF.

My dataframe DF includes four columns: AREA, PARTY LABEL, PARTYSEATS and MAJORITY:

...

ANSWER

Answered 2022-Jan-26 at 19:18

disclaimer I have not worked with this package before, please take a careful look at the results.

Preparing the data

Source https://stackoverflow.com/questions/70868699

QUESTION

Shapely point object and Geopandas data frame yield empty output

Asked 2021-Oct-17 at 09:56

I have a Geopandas df 'districts' of all districts in Paris and a shapley point object 'eiffel_tower' of the Eiffel Tower. When I execute

...

ANSWER

Answered 2021-Oct-17 at 09:56

Your geolocation for the Eiffel Tower seems not to match the districts. The first value is much smaller (255422) then the one of the dirstricts (451922, ...).

To be sure about it, perform district.geometry.bounds, which will return minx, miny, maxx and maxy, then you can check by hand and probably you will see, that the location of the Eiffel Tower is not in this area.

Guessing: My first thought was, that you have a typo in eiffel_tower = Point(255422.6, 6250868.9) and it should be eiffel_tower = Point(455422.6, 6250868.9).

Source https://stackoverflow.com/questions/69595838

QUESTION

Retrieve Model Results for Shapley values from GridSearch CV

Asked 2021-Aug-06 at 06:22

I have tuned a model using GridSearchCV. Now I would like to calculate the Shapley values and visualize them. The difficulty is that the shap package excepts a model, not the GridSearch Results. Likewise it does not like when I pass it the best_estimator_ attribute. It says the model is not supported. How can I get the Shapley values from the GridSearchCV or something to calculate the Shapley values. One of my columns is categorical, hence the need for preprocessing. Since I have the best_params from the Grid Search I could run the model as an xgboost_regressor model, but it has been a while since doing this without preprocessing.

...

ANSWER

Answered 2021-Aug-06 at 06:22

You need to fit both the preprocessor and the best model from the grid search to the data before calculating the Shap values, see the code below for an example.

Source https://stackoverflow.com/questions/68674211

QUESTION

Given a polygon & line in It: Find position on line where to split the polygon with perpendicular line to get subpolygon with cetain area

Asked 2021-Jul-12 at 17:00

Inputs:

Polygon (you can imagine this as a street: long and relatively narrow)
Line: the line is assumed to lie within the polygon and to run along the full length of the polygon
Required Area: The area the resulting output sub-polygon must have

Outputs:

Subpolygon of the input polygon with an area of the required area from input.

Example image of inputs and outputs: input polygon (blue), input line (red), line where the polygon is cut to produce output polygon (green)

The input polygon is cut into two pieces at some point along the given line, with a line that is (if possible) perpendicular to the line.

I hope its clear what I mean - it's already rather difficult to alone describe the problem.

I'm using the shapley geometry library (for python). Polygons are described as a set of points that represent the outer boundary and optionally also sets of point that describe holes inside the polygon. Lines are described as a list of points.

...

ANSWER

Answered 2021-Jul-12 at 17:00

You could consider a binary search along the red polyline.

Calculate the total length (L) of the red polyline (= sum of all segment lengths)
Assume we have a function that can calculate the point (p) and normal (n) corresponding to a value v along the polyline, where 0 <= v <= L.
Assume we have a function that can calculate the result of splitting the input polygon given a line defined by a point p and direction vector n.
Perform a binary search, starting with left = 0, right = L, splitting the polygon at the line defined by mid(left, right) and comparing the resulting areas against the target.

Here's a sketch of the solution:

Source https://stackoverflow.com/questions/68348779

QUESTION

In Leave One Out Cross Validation, How can I Use `shap.Explainer()` Function to Explain a Machine Learning Model?

Asked 2021-Jun-23 at 08:02

Background of the Problem

I want to explain the outcome of machine learning (ML) models using SHapley Additive exPlanations (SHAP) which is implemented in the shap library of Python. As a parameter of the function shap.Explainer(), I need to pass an ML model (e.g. XGBRegressor()). However, in each iteration of the Leave One Out Cross Validation (LOOCV), the ML model will be different as in each iteration, I am training on a different dataset (1 participant’s data will be different). Also, the model will be different as I am doing feature selection in each iteration.

Then, My Question

In LOOCV, How can I use shap.Explainer() function of shap library to present the performance of a machine learning model? It can be noted that I have checked several tutorials (e.g. this one, this one) and also several questions (e.g. this one) of SO. But I failed to find the answer of the problem.

Thanks for reading!

Update

I know that in LOOCV, the model found in each iteration can be explained by shap.Explainer(). However, as there is 250 participants' data, if I apply shap here for each model, there will be 250 output! Thus, I want to get a single output which will present the performance of the 250 models.

...

ANSWER

Answered 2021-Jun-23 at 08:02

You seem to train model on a 250 datapoints while doing LOOCV. This is about choosing a model with hyperparams that will ensure best generalization ability.

Model explanation is different from training in that you don't sift through different sets of hyperparams -- note, 250 LOOCV is already overkill. Will you do that with 250'000 rows? -- you are rather trying to understand which features influence output in what direction and by how much.

Training has it's own limitations (availability of data, if new data resembles the data the model was trained on, if the model good enough to pick up peculiarities of data and generalize well etc), but don't overestimate explanation exercise either. It's still an attempt to understand how inputs influence outputs. You may be willing to average 250 different matrices of SHAP values. But do you expect the result to be much more different from a single random train/test split?

Note as well:

However, in each iteration of the Leave One Out Cross Validation (LOOCV), the ML model will be different as in each iteration, I am training on a different dataset (1 participant’s data will be different).

In each iteration of LOOCV the model is still the same (same features, hyperparams may be different, depending on your definition of iteration). It's still the same dataset (same features)

Also, the model will be different as I am doing feature selection in each iteration.

Doesn't matter. Feed resulting model to SHAP explainer and you'll get what you want.

Source https://stackoverflow.com/questions/68095052

QUESTION

JS array returning element at index i+1 instead of index i element

Asked 2021-Mar-16 at 16:50

I’m learning JavaScript and I want to implement Gale Shapley algorithm, not exactly that but something similar. A Player orders his choices between available choices and each Choice choose the best ranked players according to the limit of it’s place. Then the rest of players loose their first choices and their second choice become their first one, and the process restart.

...

ANSWER

Answered 2021-Mar-16 at 16:50

In your algorithm function, you run this loop, just before you call doSort()

Source https://stackoverflow.com/questions/66659555

QUESTION

How to understand Shapley value for binary classification problem?

Asked 2021-Feb-05 at 20:39

I am very new to shapley python package. And I am wondering how should I interpret the shapley value for the Binary Classification problem? Here is what I did so far. Firstly, I used a lightGBM model to fit my data. Something like

...

ANSWER

Answered 2021-Feb-03 at 17:54

Let's run LGBMClassifier on a breast cancer dataset:

Source https://stackoverflow.com/questions/66018154

QUESTION

shap.Explainer constructor error asking for undocumented positional argument

Asked 2021-Jan-09 at 09:54

I'm using the python shap package to better understand my machine learning model. (From the documentation: "SHAP (SHapley Additive exPlanations) is a game theoretic approach to explain the output of any machine learning model." Below is a small reproducible example of the error I'm getting:

...

ANSWER

Answered 2021-Jan-09 at 09:54

The init signature of Impute is:

Source https://stackoverflow.com/questions/65621290

QUESTION

'tensorflow_federated' has no attribute 'NamedTupleType

Asked 2020-Dec-28 at 03:37

I am following this code https://github.com/BUAA-BDA/FedShapley/tree/master/TensorflowFL and trying to run the file same_OR.py

I also place input file "initial_model_parameters.txt" and data folder "MNIST_data" in same folder

...

ANSWER

Answered 2020-Dec-28 at 03:37

tff.NamedTupleType was renamed to tff.StructType in TFF version 0.16.0 (release notes).

Two options:

Install a pre-0.16.0 version of TFF: this should be doable with pip install tensorflow_federated=0.15.0.
Update the code: the error should go away after replacing the tff.NamedTupleType with tff.StructType in the snippet:

Source https://stackoverflow.com/questions/65471612

Community Discussions, Code Snippets contain sources that include Stack Exchange Network

Vulnerabilities

No vulnerabilities reported

Install shapley

You can download it from GitHub.

Support

For any new features, suggestions and bugs create an issue on GitHub. If you have any questions check and ask questions on community page Stack Overflow .

Find more information at: