In the largest machine learning study of Pompe disease to date, Volv Global demonstrates that clinician-defined endpoints can be tracked and novel disease features discovered in US claims data across 3,549 patients
In brief
New research presented at ISPOR Global 2026 in Philadelphia demonstrates that machine learning can map clinician-defined endpoints to real-world claims data in Pompe disease and surface disease manifestations beyond pre-specified frameworks. The study, conducted by Volv Global in collaboration with Sanofi, was conducted in a US administrative claims database.
Pompe disease is a rare, chronically debilitating metabolic disorder in which enzyme replacement therapy has now extended patient survival, bringing new long-term manifestations not captured by endpoints established earlier in its treatment history. Many clinically meaningful endpoints do not map to routine healthcare codes, leaving a gap between what patients experience and what the evidence base reflects – with consequences for disease monitoring, HTA submissions, and trial design.
The research addresses that gap through three sequenced methodological contributions:
Volv Global's proprietary machine learning methodology was applied across all three components, providing a reproducible framework applicable across rare diseases where treatment advances have outpaced existing evidence frameworks.
"Rare disease evidence frameworks are often frozen at the moment of first approval," said Christopher Rudolf, CEO and Founder of Volv Global. "This research demonstrates that machine learning can systematically close that gap – confirming what clinicians know, independently discovering what the data reveals, and making this a tractable problem for any team building an evidence strategy in a disease where treatment has changed the clinical picture. This is precisely the work Volv Global exists to do."
On the study
Patients were identified in a US administrative claims database using confirmed diagnosis and/or treatment records. The control population comprised patients with mimic disease codes and no Pompe disease history in the preceding seven years. Of 67 pre-specified clinical endpoints, 46 were successfully mapped to claims codes; the 21 unmapped endpoints reflect the limits of administrative claims data coding, and are themselves informative for teams assessing the feasibility of claims-based real-world evidence strategies. All findings are based on retrospective analysis; prospective validation has not been conducted and is not claimed. Research conducted in collaboration with Sanofi. To the abstract