PIIMB measures zero-shot PII masking: a model's ability to mask any PII out-of-the-box, without fine-tuning or label customization.
โ ๏ธ This benchmark is still in early development โ test datasets, metrics, and evaluation methodology are likely to change. Suggestions are very welcome!