New: the PII Masking 2M European Release is here

Our Datasets

Open-source and enterprise PII masking datasets for training privacy-preserving NLP models, ranging from foundational research data to production-grade multilingual European collections.

Need custom data?

We can generate bespoke PII datasets tailored to your domain, locale requirements, and entity types. Reach out to discuss your needs.