PIIMB measures zero-shot PII masking: a model's ability to mask any PII out of the box, without fine-tuning or label customization.
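
To make the term concrete, here is a minimal sketch of what masking means at the text level. It is not PIIMB's evaluation code: the `mask_spans` helper, the `[MASKED]` placeholder, and the hardcoded character spans (standing in for a model's predictions) are all illustrative assumptions.

```python
from typing import List, Tuple


def mask_spans(text: str, spans: List[Tuple[int, int]], placeholder: str = "[MASKED]") -> str:
    """Replace each (start, end) character span in `text` with `placeholder`.

    Hypothetical helper for illustration; overlapping or touching spans are
    merged so each masked region yields a single placeholder.
    """
    # Merge overlapping or adjacent spans after sorting by start offset.
    merged: List[Tuple[int, int]] = []
    for start, end in sorted(spans):
        if merged and start <= merged[-1][1]:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))

    # Rebuild the text, swapping each merged span for the placeholder.
    pieces = []
    cursor = 0
    for start, end in merged:
        pieces.append(text[cursor:start])
        pieces.append(placeholder)
        cursor = end
    pieces.append(text[cursor:])
    return "".join(pieces)


# Assumed example: the spans below play the role of a model's PII predictions.
text = "Contact Jane Doe at jane@example.com."
predicted_spans = [(8, 16), (20, 36)]  # "Jane Doe", "jane@example.com"
print(mask_spans(text, predicted_spans))  # Contact [MASKED] at [MASKED].
```

In a zero-shot setting, the spans would come from a model used as-is, with no fine-tuning and no benchmark-specific label set.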

⚠️ This benchmark is still in early development — test datasets, metrics, and evaluation methodology are likely to change. Suggestions are very welcome!