How We Built TranslateMed: Quality and Verification at Every Step

When we set out to build TranslateMed, we made a decision early on that has shaped everything since: medical code translation should not be guesswork.

AI language models are impressive. They can summarize, explain, and contextualize medical documents better than most tools available today. But ask an AI to map a German ICD-10-GM code to its US ICD-10-CM equivalent, and you’ll get a confident answer that may be subtly wrong in a way that costs a patient a claim denial — or worse, misleads a clinician.

We decided the core of TranslateMed would be verified, auditable medical coding data where every mapping is either confirmed by authoritative sources or explicitly flagged as an approximation. Here’s what that means in practice.

Why Verified Mappings for Medical Codes

Medical coding is not a fuzzy problem. When a German discharge summary contains I10.00 (essential hypertension, no hypertensive crisis), the correct US equivalent is I10. Not a range of possibilities — that specific code, with an annotation that the crisis status was lost in translation.

When a dental record from Japan uses FDI notation #26 for a tooth, the correct US Universal Notation equivalent is #14. There is no ambiguity. The tooth is the tooth.

AI inference introduces probabilistic error into a domain that has exact answers most of the time. Our evaluation data confirms that verified mappings consistently outperform AI-only approaches on both standard code pairs and challenging edge cases — rare codes, partial matches, and system-specific extensions that don’t cleanly map to any equivalent.

For cases where no verified mapping exists, we use AI-assisted approaches. But we always know which category a given mapping falls into, and we surface that information to the clinician. The goal is transparency, not false confidence.

Country-Level Context

Every healthcare system starts with an understanding of how that country’s medical record ecosystem works. Not just the code sets, but the clinical context: how diagnoses are documented, what coding conventions are standard, which system-specific features exist that have no equivalent elsewhere.

Germany’s ICD-10-GM adds an exclamation mark suffix (!) to optional secondary codes — a syntactic convention that doesn’t exist in the WHO base or any other national variant. Japan’s medical system integrates Kampo (traditional herbal medicine) through a separate coding track that sits alongside conventional diagnosis coding. India’s records frequently mix WHO ICD-10 with AYUSH (Ayurveda, Yoga, Unani, Siddha, Homeopathy) documentation in ways that require context to interpret correctly.

Without understanding these system-level conventions, code mapping is unreliable.

Verified Code Mappings

The core of TranslateMed covers direct mappings between code systems: ICD-10-CM to ICD-10-GM, FDI dental notation to Universal/ADA notation, ATC drug codes to NDC/PZN identifiers, and so on.

These mappings are built from authoritative sources: official mapping tables from national health authorities, published documentation, and clinical coding standards. Where official mappings exist, we use them. Where they don’t — and in many cases they don’t — we document the closest valid equivalent and the nature of the gap.

Every mapping has a confidence type: exact (the codes are definitionally equivalent), approximate (the target code is the best available equivalent with some specificity loss), or unmapped (no valid equivalent exists). This classification is surfaced in every translation output.

Translation Corridor Depth

Different country pairs have different translation challenges. The Germany-to-US corridor, for example, has unusually high complexity around drug safety alerts. Several medications that are standard German formulary drugs — metamizole (Novalgin), nimesulide — are either banned or strictly controlled in the US. A German prescription that a US pharmacist might encounter needs an immediate safety context, not just a code conversion.

The India-to-US corridor has different complexity: a high volume of generic drug names that don’t correspond to US brand names, and a large AYUSH documentation component that US providers may have never encountered. TranslateMed addresses these practical translation challenges with clinical depth specific to each country pair.

Notation Systems and Medical Terminology

Some coding systems are shared across countries but implemented differently. Dental notation is the clearest example: the FDI (International) system, Universal/ADA (US and Canada), and Palmer notation (UK, parts of Europe) all describe the same 32 permanent teeth and 20 primary teeth using completely different numbering schemes.

FDI #26 is Universal #14 is Palmer 6 (upper left first molar). The mapping is exact and non-controversial. But dental records frequently mix systems, abbreviate, or use shorthand that requires system-level context to parse correctly.

TranslateMed also addresses linguistic translation challenges specific to medical vocabulary. “Angina” in German medical usage typically refers to tonsillitis (angina tonsillaris), not chest pain. “Admission” in the UK NHS context carries administrative and clinical implications that differ from US hospital admission. These false cognates and terminology divergences sit above pure code mapping — and getting them wrong can have clinical consequences.

Verification and Quality

Building TranslateMed required structured verification grounded in authoritative sources.

Every mapping in the production system has been reviewed against authoritative national health authority publications and clinical coding standards. Approximate mappings are documented as approximate, with the reason for the gap noted. TranslateMed currently spans over 20 countries with thousands of validated code mappings, and we maintain traceability so that every production mapping can be traced back to its origin.

Safety First

Medical translation carries patient safety implications that generic translation never does.

We build safety flags into TranslateMed at every level where clinical risk is present.

Drug Safety Alerts

Several high-frequency drugs in one healthcare system are banned, restricted, or have different risk profiles in another. TranslateMed includes drug-level safety flags that are surfaced in the translation output when these situations arise:

Metamizole (Novalgin, Dipyrone): Common OTC/prescription analgesic in Germany, Mexico, and India. Withdrawn from US market in 1977 due to agranulocytosis risk. A German prescription for Novalgin arriving at a US pharmacy needs an immediate alert, not just a name translation.
Nimesulide: NSAID available in Europe, India, and other markets. Not approved by the FDA. Associated with hepatotoxicity concerns that led to restrictions in multiple jurisdictions.
Codeine for pediatric use: Approved dosing and age restrictions vary significantly across healthcare systems. What is standard pediatric pain management in one system may be contraindicated in another.

Controlled Substance Differences

Scheduling categories vary by country. A drug that is Schedule IV in the US (moderate abuse potential, prescribable) may be an unscheduled OTC item in Germany, or vice versa. TranslateMed flags controlled substance classification differences so that the receiving clinician or pharmacist understands the regulatory context, not just the chemical identity.

OTC and Prescription Switches

Many drugs are prescription-only in the US and OTC in Europe, or the reverse. Ibuprofen at higher doses requires a prescription in Germany; lower-dose formulations are OTC. Several antihistamines that are OTC in the US are prescription-only in other markets. These are not pharmacologically trivial distinctions — they affect what patients can obtain, at what doses, and under what supervision.

Why Not AI-Only?

AI language models trained on medical text have impressive pattern-matching capabilities across medical codes. They can often produce plausible mappings for codes they’ve seen during training.

The problem is “often plausible” is not good enough for clinical use.

AI models hallucinate. They interpolate. They may produce confident, formatted-looking output for a code pair where no mapping exists — or where the mapping has important caveats that the model didn’t learn or didn’t surface.

More specifically: medical coding standards update annually. ICD-10-GM is updated every January 1st. ICD-10-CM updates take effect every October 1st. A language model trained on data through a certain cutoff date will be unaware of new codes, deprecated codes, and changed mappings that have been published since. Verified, timestamped mappings have known provenance. A language model does not.

We do use AI — for edge cases where no verified mapping exists, for reading clinical narrative, for generating translation summaries. But verified mappings are the first, authoritative source, and they outperform AI-only approaches on every metric we’ve measured.

Continuous Improvement

Healthcare coding standards change every year. Regulatory decisions about drug approvals, withdrawals, and scheduling changes happen continuously. We update our verified medical coding data as these changes occur, with source-referenced updates so that the provenance chain is always maintained.

We also track translation cases where coverage is lacking — these represent genuine gaps that get prioritized for the next update cycle.

How TranslateMed Uses Verified Data

Every TranslateMed translation runs the source document’s detected codes against verified mappings first. If a validated mapping exists, it’s used. If only an approximate mapping exists, the output is annotated with the confidence level and the nature of the gap. If no mapping exists, the case is routed to AI-assisted handling, and the output is clearly marked as AI-proposed.

The result is a translated document where the confidence level of every mapping is explicit — not a black-box output where the clinician has to guess which parts can be trusted and which parts need verification.

That transparency is the point. Medical translation is a tool for clinicians, not a replacement for clinical judgment. A well-annotated approximate mapping that prompts the right question is more useful than a confident wrong answer.

TranslateMed is available via API and web interface. For teams handling high-volume cross-border records, Team and Enterprise plans include full country coverage, bulk processing, and dedicated support.