Sparkov | Markov Chain based fraud detection system in Spark
kandi X-RAY | Sparkov Summary
kandi X-RAY | Sparkov Summary
Sparkov is a Python library typically used in Financial Services, Banks, Payments applications. Sparkov has no bugs, it has no vulnerabilities and it has low support. However Sparkov build file is not available. You can download it from GitHub.
This code utilizes data generated by our Data Generation Tool in order to detect fraud in sliding windows of credit card transactions. The Spark-based portion of the code loads .csv files of credit card transactions and generates state-transition matricies for every user (and user segment/profile) based on their transaction history. The generation of the matricies, as well as aggregation of various user and segment-level statistics are all performed via Spark and are distributed in nature. The results (matricies and aggregates) are stored in a Redis instance after being calculated. Kafka is utilized to listen for incoming streaming transactions (transaction_listener_AWS.py) and the listener utilizes the pre-calculated aggregates / matricies stored in Redis to evaluated incoming transactions for probabilities of fraud. If the probability of fraud is over a given threshold, the transaction information is sent (via Kafka) to another listener (fraud_listener_AWS.py) which records the data to a local .csv file, as well as creates an updated map of the United States with location of fraud via Folium / Leaflet.js. Streaming data is usually simualted by generating a test dataset via the data generation process, that is independent of the data used in the state-transition / aggregation process (though both sets must share the same customer file). This implementation is designed to be run on Amazon Web Services Elastic MapReduce (EMR), and utilizes AWS ElasticCache for a Redis instance. Map visualization is served on the Hadoop namenode via the apache2 server (/var/www/html). This process will not run without opening a number of ports to your Namenode instance, so plan to review your EMR security policy.
This code utilizes data generated by our Data Generation Tool in order to detect fraud in sliding windows of credit card transactions. The Spark-based portion of the code loads .csv files of credit card transactions and generates state-transition matricies for every user (and user segment/profile) based on their transaction history. The generation of the matricies, as well as aggregation of various user and segment-level statistics are all performed via Spark and are distributed in nature. The results (matricies and aggregates) are stored in a Redis instance after being calculated. Kafka is utilized to listen for incoming streaming transactions (transaction_listener_AWS.py) and the listener utilizes the pre-calculated aggregates / matricies stored in Redis to evaluated incoming transactions for probabilities of fraud. If the probability of fraud is over a given threshold, the transaction information is sent (via Kafka) to another listener (fraud_listener_AWS.py) which records the data to a local .csv file, as well as creates an updated map of the United States with location of fraud via Folium / Leaflet.js. Streaming data is usually simualted by generating a test dataset via the data generation process, that is independent of the data used in the state-transition / aggregation process (though both sets must share the same customer file). This implementation is designed to be run on Amazon Web Services Elastic MapReduce (EMR), and utilizes AWS ElasticCache for a Redis instance. Map visualization is served on the Hadoop namenode via the apache2 server (/var/www/html). This process will not run without opening a number of ports to your Namenode instance, so plan to review your EMR security policy.
Support
Quality
Security
License
Reuse
Support
Sparkov has a low active ecosystem.
It has 7 star(s) with 2 fork(s). There are 1 watchers for this library.
It had no major release in the last 6 months.
Sparkov has no issues reported. There are no pull requests.
It has a neutral sentiment in the developer community.
The latest version of Sparkov is current.
Quality
Sparkov has no bugs reported.
Security
Sparkov has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.
License
Sparkov does not have a standard license declared.
Check the repository for any license declaration and review the terms closely.
Without a license, all rights are reserved, and you cannot use the library in your applications.
Reuse
Sparkov releases are not available. You will need to build from source code and install.
Sparkov has no build file. You will be need to create the build yourself to build the component from source.
Top functions reviewed by kandi - BETA
kandi has reviewed Sparkov and discovered the below as its top functions. This is intended to give you an instant insight into Sparkov implemented functionality, and help decide if they suit your requirements.
- Load data from files .
- Evaluate a transaction .
- Process a line of transition probabilities .
- generate a map for importing
- configure spark configuration
- Computes the miss probability of a list of sequences .
- Push value to a list .
- Initialize redis
- Generate a list of values from an iterable .
- Populate the last 4 transactions
Get all kandi verified functions for this library.
Sparkov Key Features
No Key Features are available at this moment for Sparkov.
Sparkov Examples and Code Snippets
No Code Snippets are available at this moment for Sparkov.
Community Discussions
No Community Discussions are available at this moment for Sparkov.Refer to stack overflow page for discussions.
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install Sparkov
You can download it from GitHub.
You can use Sparkov like any standard Python library. You will need to make sure that you have a development environment consisting of a Python distribution including header files, a compiler, pip, and git installed. Make sure that your pip, setuptools, and wheel are up to date. When using pip it is generally recommended to install packages in a virtual environment to avoid changes to the system.
You can use Sparkov like any standard Python library. You will need to make sure that you have a development environment consisting of a Python distribution including header files, a compiler, pip, and git installed. Make sure that your pip, setuptools, and wheel are up to date. When using pip it is generally recommended to install packages in a virtual environment to avoid changes to the system.
Support
For any new features, suggestions and bugs create an issue on GitHub.
If you have any questions check and ask questions on community page Stack Overflow .
Find more information at:
Reuse Trending Solutions
Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items
Find more librariesStay Updated
Subscribe to our newsletter for trending solutions and developer bootcamps
Share this Page