
HydroXai/pii-masker
📦 Open Source ProjectHydroXai
An open-source AI tool for high-precision PII detection and masking using DeBERTa-v3.
PII Masker addresses the critical need for data privacy in AI-driven applications. By leveraging the DeBERTa-v3 model, it achieves superior accuracy in recognizing PII compared to traditional regex-based methods. The tool is engineered for scalability, allowing it to process large datasets efficiently. Its architecture is modular, enabling seamless integration into modern data stacks, including vector databases like Milvus and RAG-based systems. Developers can utilize the simple Python API to implement automated masking, ensuring that sensitive user data is scrubbed before being processed by LLMs or stored in databases. This proactive approach to data security helps organizations maintain regulatory compliance while still benefiting from the power of advanced AI models.
💡Highlights
- ├─Powered by DeBERTa-v3 architecture
- ├─High-precision PII detection
- └─Seamless RAG and agent integration
🎯For
- ├─Data Engineers
- ├─AI Security Researchers
- └─Backend Developers