Private data detection
The translation API checks input terms for private or personal data before returning model translations. The check is designed to prevent patient or professional identifiers from being sent through the translation workflow.
Detection behavior
The API uses Presidio-style entity detection on each submitted source term.
When sensitive content is found, the request returns HTTP 422 with the
standard message:
{
"error": "Unprocessed Entity: term includes sensitive data",
"message": "term includes sensitive data"
}
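A client can recognise this rejection by checking the status code together with the `message` field. The following is a minimal sketch, not the API's own client code: the helper name `is_private_data_rejection` is hypothetical, and it assumes the response body is a JSON object shaped like the example above.

```python
import json

# Message text taken from the example response above.
PRIVATE_DATA_MESSAGE = "term includes sensitive data"


def is_private_data_rejection(status_code: int, body: str) -> bool:
    """Return True when a response looks like the private-data rejection.

    Hypothetical helper: assumes the body is a JSON object with a
    "message" field, as in the example response.
    """
    if status_code != 422:
        return False
    try:
        payload = json.loads(body)
    except json.JSONDecodeError:
        # A 422 without a parseable JSON body is some other kind of failure.
        return False
    return isinstance(payload, dict) and payload.get("message") == PRIVATE_DATA_MESSAGE
```

Matching on the `message` field rather than on the status code alone keeps the check from misclassifying other 422 responses, if the service produces any.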
For multi-term requests, each term is checked independently. Terms without
private data can still be translated by the batch translation path, while terms
with private data are blocked and receive the private-data message in their
output. If at least one term is blocked, the HTTP status is 422.
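The per-term rule can be sketched as follows. This is an illustration under stated assumptions, not the service's implementation: `contains_private_data` and `translate` are hypothetical callables standing in for the detector and the batch translation path, the result dictionaries are an invented shape, and the 200 status for a fully clean batch is an assumption, since this document only states the 422 case.

```python
from typing import Callable, Dict, List, Tuple

PRIVATE_DATA_MESSAGE = "term includes sensitive data"


def process_batch(
    terms: List[str],
    contains_private_data: Callable[[str], bool],  # hypothetical detector
    translate: Callable[[str], str],               # hypothetical translator
) -> Tuple[int, List[Dict[str, str]]]:
    """Check each term independently and derive one HTTP status for the batch."""
    results: List[Dict[str, str]] = []
    any_blocked = False
    for term in terms:
        if contains_private_data(term):
            # Blocked terms carry the private-data message instead of a translation.
            any_blocked = True
            results.append({"term": term, "output": PRIVATE_DATA_MESSAGE})
        else:
            results.append({"term": term, "output": translate(term)})
    # 422 as soon as one term is blocked; 200 otherwise is an assumption.
    status = 422 if any_blocked else 200
    return status, results
```

Because the status reflects the whole batch, a caller that wants the successful translations from a 422 response has to inspect the per-term outputs rather than discard the response.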
Remote API summary
The following synthetic examples were tested against the remote mTLS endpoint
https://anstranslation2.ddns.net/translate. All names and identifiers are
test values, not real patient data.
| Category | Requested terms | HTTP status | Result |
|---|---|---|---|
| Safe medical control | | | All translated |
| Name, first name, maiden name | | | All blocked |
| Email address | | | Blocked |
| Phone number | | | Blocked |
| French social security number / NIR | | | Translated |
| US SSN-style social security number | | | Translated |
| Patient identifier / IPP | | | Translated |
| Mutuelle / complementary insurance number | | | Blocked |
| RPPS / ADELI professional identifier | | | One blocked, one translated |
| Messaging identifier | | | All translated |
| Date and place of birth | | | Blocked |
| Passport / identity card number | | | All translated |
| Driving licence / fiscal number | | | All translated |
| IBAN, RIB, credit card | | | All blocked |
| Bank account number | | | Translated |
| Mixed safe and sensitive batch | | | One translated, one blocked |
| Disease names resembling people names | | | Four translated, two blocked |
| Eponym medical syndromes | | | Three translated, two blocked |
| Medical terms with initials and codes | | | All translated |
| Mixed disease control and explicit person name | | | One translated, one blocked |
Detailed term results
This section lists only the submitted terms and the observed result for each term. It intentionally omits the full request and response bodies.
| Requested term | Result | Test group |
|---|---|---|
| | Translated | Safe medical control |
| | Translated | Safe medical control |
| | Blocked | Name, first name, maiden name |
| | Blocked | Name, first name, maiden name |
| | Blocked | Email address |
| | Blocked | Phone number |
| | Translated | French social security number / NIR |
| | Translated | US SSN-style social security number |
| | Translated | Patient identifier / IPP |
| | Blocked | Mutuelle / complementary insurance number |
| | Blocked | RPPS / ADELI professional identifier |
| | Translated | RPPS / ADELI professional identifier |
| | Translated | Messaging identifier |
| | Translated | Messaging identifier |
| | Blocked | Date and place of birth |
| | Translated | Passport / identity card number |
| | Translated | Passport / identity card number |
| | Translated | Driving licence / fiscal number |
| | Translated | Driving licence / fiscal number |
| | Blocked | IBAN, RIB, credit card |
| | Blocked | IBAN, RIB, credit card |
| | Translated | Bank account number |
| | Translated | Mixed safe and sensitive batch |
| | Blocked | Mixed safe and sensitive batch |
| | Translated | Disease names resembling people names |
| | Blocked | Disease names resembling people names |
| | Translated | Disease names resembling people names |
| | Translated | Disease names resembling people names |
| | Translated | Disease names resembling people names |
| | Blocked | Disease names resembling people names |
| | Translated | Eponym medical syndromes |
| | Translated | Eponym medical syndromes |
| | Blocked | Eponym medical syndromes |
| | Blocked | Eponym medical syndromes |
| | Translated | Eponym medical syndromes |
| | Translated | Medical terms with initials and codes |
| | Translated | Medical terms with initials and codes |
| | Translated | Medical terms with initials and codes |
| | Translated | Medical terms with initials and codes |
| | Translated | Medical terms with initials and codes |
| | Translated | Mixed disease control and explicit person name |
| | Blocked | Mixed disease control and explicit person name |
Interpretation
The detector blocks obvious private data such as synthetic names, email addresses, phone numbers, birth information, and the IBAN, RIB, and credit card examples. Several identifier formats were not blocked in this remote test: the NIR-like value, the US SSN-style value, the IPP-like value, one of the two RPPS/ADELI values, both messaging identifiers, the passport/ID values, the driving licence/fiscal values, and the standalone bank account number. By contrast, the mutuelle number was blocked.
The disease-name tests show an important false-positive risk: medical eponyms
or misspelled disease names can be detected as PERSON. For production use,
these cases should be handled with a terminology allowlist before private-data
blocking, or reviewed manually when a clinical term is blocked as PERSON.
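
One way to apply the allowlist idea is to discard a PERSON detection when the whole term is a known clinical term. The sketch below is an illustration under assumptions, not the service's logic: `detect_entities` is a hypothetical callable that returns entity-type labels for a term (a Presidio-style analyzer could sit behind it), and the allowlist is a placeholder for a curated terminology.

```python
from typing import Callable, Iterable, Set


def normalize(term: str) -> str:
    """Lowercase and collapse whitespace so allowlist lookups are stable."""
    return " ".join(term.lower().split())


def should_block(
    term: str,
    detect_entities: Callable[[str], Iterable[str]],  # hypothetical detector
    clinical_allowlist: Set[str],                     # normalized clinical terms
) -> bool:
    """Block a term unless its only finding is PERSON on an allowlisted term."""
    entities = set(detect_entities(term))
    if not entities:
        return False
    if entities == {"PERSON"} and normalize(term) in clinical_allowlist:
        # Known clinical term: treat the PERSON hit as a false positive.
        return False
    # Any other entity type, or a PERSON hit on an unknown term, still blocks.
    return True
```

For example, with `clinical_allowlist = {normalize("Parkinson disease")}` (an illustrative entry, not one of the tested terms), a PERSON-only detection on that term is no longer blocked, while a term that also triggers another entity type still is. An exact-match lookup like this does not cover misspelled disease names, so those remain candidates for manual review.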