Skip to content

Commit f6df129

Browse files
committed
presidio added
1 parent 5cfc73d commit f6df129

File tree

6 files changed

+79
-0
lines changed

6 files changed

+79
-0
lines changed
Lines changed: 63 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,63 @@
1+
---
2+
draft: false
3+
title: Presidio fully managed open source service | OctaByte.io
4+
meta:
5+
cover: /images/development/dev-tools/presidio/screenshot-1.png
6+
description: Presidio is an open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) in text, images, and structured data using advanced NLP and customizable pipelines.
7+
keywords: Presidio, PII detection, data redaction, data anonymization, sensitive data masking, open-source data privacy, GDPR compliance, HIPAA, NLP PII, data privacy framework, personally identifiable information, data protection tools, Kubernetes PII detection, Python anonymization library
8+
breadcrumb:
9+
- name: Home
10+
url: /
11+
- name: Software Catalog
12+
url: /fully-managed-open-source-services
13+
- name: Development
14+
url: /fully-managed-open-source-services/development
15+
- name: Dev tools
16+
url: /fully-managed-open-source-services/development/dev-tools
17+
- name: Presidio
18+
url: /fully-managed-open-source-services/development/dev-tools/presidio
19+
content:
20+
id: presidio
21+
name: Presidio
22+
title: Open-Source Framework for PII Detection, Redaction & Anonymization
23+
logo: /images/development/dev-tools/presidio/logo.png
24+
website: https://microsoft.github.io/presidio/
25+
iframe_website: /website/development/dev-tools/presidio
26+
screenshots:
27+
- /images/development/dev-tools/presidio/screenshot-1.png
28+
- /images/development/dev-tools/presidio/screenshot-2.jpg
29+
---
30+
31+
## Overview
32+
33+
Presidio is a powerful open-source framework designed to help organizations safeguard sensitive data by detecting and anonymizing personally identifiable information (PII) across various formats, including text, images, and structured datasets. Built with privacy and flexibility in mind, Presidio supports both predefined and custom PII recognizers using natural language processing (NLP), regular expressions, rule-based logic, and checksums.
34+
35+
Its modular architecture allows seamless integration with external PII detection models, while supporting multiple deployment options—from Python scripts and PySpark jobs to containerized environments like Docker and Kubernetes. With robust customization features, Presidio lets developers tailor recognition and anonymization workflows to meet industry-specific compliance and privacy needs.
36+
37+
Whether you're looking to secure customer data, anonymize documents for machine learning, or comply with regulations like GDPR and HIPAA, Presidio provides a scalable and developer-friendly solution for PII protection.
38+
39+
## Features
40+
41+
- ### Predefined & Custom PII Recognizers
42+
43+
Use out-of-the-box recognizers or define your own using NER, regex, rule-based logic, and checksums with multilingual support.
44+
45+
- ### Modular Architecture
46+
47+
Connect to external PII detection models or extend Presidio with custom pipelines and components.
48+
49+
- ### Flexible Usage Options
50+
51+
Deploy via Python, PySpark, Docker containers, or Kubernetes clusters to match your infrastructure.
52+
53+
- ### Data Format Versatility
54+
55+
Detect and anonymize PII in text, images, and structured datasets—ideal for enterprise-scale processing.
56+
57+
- ### Customizable Anonymization
58+
59+
Tailor the anonymization strategy using redaction, masking, or transformation methods to fit specific compliance needs.
60+
61+
- ### Scalable & Developer-Friendly
62+
63+
Built for scalability and extensibility, Presidio integrates smoothly into existing data workflows and machine learning pipelines.
Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,10 @@
1+
---
2+
draft: false
3+
title: "Presidio - Open-Source Framework for PII Detection, Redaction & Anonymization"
4+
content:
5+
id: presidio
6+
name: Presidio
7+
website: https://microsoft.github.io/presidio/
8+
proxy: true
9+
description: "Presidio is an open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) in text, images, and structured data using advanced NLP and customizable pipelines."
10+
---

data/services.yaml

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1651,6 +1651,12 @@ catalog_list:
16511651
url: /development/dev-tools/repman
16521652
category: dev-tools
16531653
description: Repman is a powerful Packagist proxy and private package host for PHP Composer, enabling seamless dependency management with GitHub, GitLab, and Bitbucket integration.
1654+
- id: presidio
1655+
title: Presidio
1656+
image: /images/development/dev-tools/presidio/logo.png
1657+
url: /development/dev-tools/presidio
1658+
category: dev-tools
1659+
description: Presidio is an open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) in text, images, and structured data using advanced NLP and customizable pipelines.
16541660
- id: pritunl
16551661
title: Pritunl
16561662
image: /images/development/network/pritunl/logo.png
Loading
Loading
Loading

0 commit comments

Comments
 (0)