Svetlana-W/gdpr_obsuscator_s_wise


GDPR Obfuscator

A Python tool for GDPR-compliant data obfuscation in AWS S3.

Overview

This tool intercepts personally identifiable information (PII) in data being ingested into AWS. It creates obfuscated copies of files that keep the original structure, so the data can be used for analysis in line with GDPR.
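As a rough illustration of what "obfuscated copy with the original structure" can mean for a CSV file, here is a minimal sketch using only the standard library. The function name `obfuscate_csv`, the `***` mask, and the sample columns are assumptions made for this example; they are not taken from the project's code.

```python
import csv
import io


def obfuscate_csv(text: str, pii_fields: list[str], mask: str = "***") -> str:
    """Return a copy of the CSV text with every value in pii_fields masked.

    Hypothetical helper for illustration only; assumes well-formed rows
    whose columns match the header.
    """
    reader = csv.DictReader(io.StringIO(text))
    fieldnames = reader.fieldnames or []
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    for row in reader:
        for field in pii_fields:
            # Only mask columns that exist; unknown field names are ignored.
            if field in row:
                row[field] = mask
        writer.writerow(row)
    return out.getvalue()


sample = "student_id,name,email_address\n1,Jane Doe,jane@example.com\n"
print(obfuscate_csv(sample, ["name", "email_address"]))
```

The header row and column order are unchanged; only the values in the listed PII columns are replaced.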

Features

  • Processes CSV files (MVP)
  • Supports JSON and Parquet files (extended functionality)
  • Handles files up to 1MB
  • AWS S3 integration
  • Command-Line Interface (CLI)
  • GDPR-compliant data anonymization

Installation

  1. Clone the repository:
git clone https://github.com/Svetlana-W/gdpr_obsuscator_s_wise.git
cd gdpr_obsuscator_s_wise

Usage

# As a Library
from core_obfuscator import GDPRObfuscator

obfuscator = GDPRObfuscator()
config = {
    "file_to_obfuscate": "s3://my-bucket/data.csv",
    "pii_fields": ["name", "email_address"]
}
result = obfuscator.obfuscate(config)

# As a CLI
python src/cli.py config.json --output-bucket my-bucket --output-key output/data.csv
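
The CLI takes a `config.json` argument. Its schema is not documented here, so the sketch below assumes it uses the same two keys as the library config (`file_to_obfuscate` and `pii_fields`). It also shows one way to split the `s3://` URI into a bucket and key. `split_s3_uri` is a hypothetical helper, not part of the project.

```python
import json
from urllib.parse import urlparse

# Assumption: the CLI config file uses the same keys as the library config.
config = {
    "file_to_obfuscate": "s3://my-bucket/data.csv",
    "pii_fields": ["name", "email_address"],
}


def split_s3_uri(uri: str) -> tuple[str, str]:
    """Split an s3://bucket/key URI into (bucket, key). Hypothetical helper."""
    parsed = urlparse(uri)
    if parsed.scheme != "s3" or not parsed.netloc:
        raise ValueError(f"not an S3 URI: {uri!r}")
    return parsed.netloc, parsed.path.lstrip("/")


bucket, key = split_s3_uri(config["file_to_obfuscate"])
print(bucket, key)

# Text that could be saved as config.json and passed to the CLI.
config_text = json.dumps(config, indent=2)
print(config_text)
```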
