data-anonymization | Want to use production data for testing | DB Client library
kandi X-RAY | data-anonymization Summary
kandi X-RAY | data-anonymization Summary
Afraid of using production data due to privacy issues? Data Anonymization is a tool that helps you build anonymized production data dumps which you can use for performance testing, security testing, debugging and development.
Support
Quality
Security
License
Reuse
Top functions reviewed by kandi - BETA
Currently covering the most popular Java, JavaScript and Python libraries. See a Sample of data-anonymization
data-anonymization Key Features
data-anonymization Examples and Code Snippets
Community Discussions
Trending Discussions on data-anonymization
QUESTION
I am trying to mask data in such a way that referential integrity is not compromised.
My table Customer has this data:
Customer table
...ANSWER
Answered 2020-Sep-01 at 18:32There are two things that seem important for you
- anonymity
- referential integrity
For both of your requirements that solution from the blog article you linked is a bad choice.
Anonymity
Just hashing does not provide anonymity. The article also mentions (but it is not in the code) you probably at least want to add a salt.
Just an example:
A number like 211
will be af9fad5f
as a CRC32 hash. If the person you share your data with sees this 8char(32bit) alphanumeric string it probably will assume that this might be a CRC32 hash. The good thing with hashes is you can not easily calculate back starting from af9fad5f
to 211
. The bad thing is, most short words/ hashes are already precalculated and easy to look up in what is called a rainbow table (e.g. https://md5hashing.net/hash/crc32/af9fad5f).
This basically means everybody could just look up the "clear text" behind the crc32 hashes. (same for all other hashes). Adding a salt prevents this. (this salt must of course be kept secret!)
Referential Integrity
The referential integrity is kept. 211
will be always be af9fad5f
as a CRC32 hash - this is static and there is no random effect to it. So the Product_ID would stay the same for all your tables. Which is what you need.
But just to be sure I would use SHA256 instead of CRC32. In CRC32 everything will be mapped to a 8chars alphanumeric (32bit). If you have quite a lot of data - there is some chance of hash collisions. This means two numbers/ids in the same table actually having the same hash. With SHA256 this is next to impossible.
Overall I think using the anonymizer package seems ok. (it is not actively maintained - but functionality seems to be ok)
QUESTION
I want to mask sensitive information on multiple columns in a table named my_table
using ProxySQL.
I've followed this tutorial to successfully mask a single column named column_name
in a table using the following mysql_query_rules
:
ANSWER
Answered 2019-Mar-18 at 05:27Using flagIN
, flagOUT
, and apply
allows me to mask data on multiple columns.
Here's the final mysql_query_rules
I have:
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install data-anonymization
Support
Reuse Trending Solutions
Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items
Find more librariesStay Updated
Subscribe to our newsletter for trending solutions and developer bootcamps
Share this Page