A Google Health executive walked into a room full of health IT founders and said something that stopped everyone cold. AI developed on licensed data has no future. Primary data ownership is the only defensible moat in health IT. I brought that statement directly to Dr. Yin Ho — and her response shaped everything about how Symetrique thinks about data.
The Statement That Changed My Thinking
Most health IT AI companies today are building on licensed data. They access datasets from EHR vendors, clearinghouses, payer networks, and data aggregators — paying for the right to run their models on someone else's asset. The AI is theirs. The data is not.
This is a fragile position. The moment a data supplier changes terms, raises prices, decides to compete directly, or simply loses market position — the company built on top of their data is exposed. There is no moat. There is only a licensing agreement.
The Google Health executive's statement was not just a competitive strategy observation. It was a warning about the fundamental architecture of health IT AI — and which companies will still be standing in ten years.
Dr. Ho's Response — Validated and Extended
When I brought the Google Health executive's statement to Dr. Yin Ho, her response was immediate and direct.
Unfortunately that is true. The problem is you cannot develop an AI model on data that is not accurate, not quality level, or not representative.
She validated the thesis completely. But she did not stop there. She extended it in a direction I had not fully considered — and in doing so, she identified the most important white space in health IT data strategy today.
Large companies have volume. The EHR vendors, the clearinghouses, the payer networks — they have accumulated data at a scale that is difficult for entrepreneurs to match. And many of these organizations are reluctant to share. As Dr. Ho put it, entrepreneurs are almost forced to partner with organizations that may not want to share their data. That is the current reality of the marketplace.
But here is where the conversation took a turn.
Ownership of data still belongs to the patient. How can patients pull their data together and then provide permission to use their data for other purposes?
The Twelve Step Problem — Why Patients Cannot Access Their Own Records
This is the moment in the interview that stopped me completely. Dr. Ho did not just identify the opportunity. She mapped the obstacle with a specificity I had never heard articulated so clearly.
Putting together a complete patient record today requires navigating a twelve-step gauntlet that most patients simply cannot complete. Dr. Ho walked through it step by step:
First — knowing everywhere you have been seen over your entire medical history. Second — knowing which EHR system each of those physicians uses. Third — understanding how your data is stored in each system and how accessible it is. Fourth — navigating different permission and consent forms for each system. Fifth — understanding the statute of limitations each provider imposes — some will only release 90 days of records, others 120 days. Sixth — recognizing that even inside a patient portal, you will not get all of your data. Seventh — figuring out how to string all of these fragmented records together into a coherent format. Eighth — understanding what format each system exports data in. Ninth — determining whether the compiled record is actually complete. Tenth — navigating what happens when you have been seen across state lines. Eleventh — dealing with the different state laws governing data access in each jurisdiction. Twelfth — reconciling records from multiple different kinds of health systems that may not interoperate at all.
The challenge for a patient to put together their record is already an example of how hard it is for anyone to put it together in such a way that it is a large enough volume to run an AI model on.
The Opportunity Hiding Inside the Problem
Dr. Ho's twelve-step map of the patient record challenge is not just a description of the problem. It is a blueprint for the solution. Every step in that gauntlet is an opportunity to build something — a tool, a service, a platform — that makes the next step easier.
The founder who builds the infrastructure that helps patients compile their most complete longitudinal record — navigating the permissions, the formats, the state laws, the portal limitations — that founder builds the most defensible primary data asset in health IT. Not by acquiring data from institutions. By empowering patients to exercise rights they already legally have.
As Dr. Ho framed it: this might be a moment in time to think about not only empowering patients to access their data, but how to start putting together better, more complete records on behalf of the patient — so that you have the most complete set of records that exists. That is the opportunity. That is the moat.
What This Means for the AI Wrapper Problem
The Google Health executive's warning about AI on licensed data is a warning about commoditization. As foundation models improve and become accessible to everyone, the AI layer itself becomes less differentiated. The question is not whether your model is better. The question is whether your data is irreplaceable.
Licensed data is replaceable — by definition. Your competitor can license the same dataset. Primary patient data built on consent, trust, and genuine value exchange is not replaceable. It cannot be scraped, synthesized, or licensed away. It compounds over time as more patients join, as more data flows in, and as the platform proves its value to the people at the center of it.
The Symetrique Perspective
At Symetrique this is the north star that guides every data strategy decision we make. We are not building on licensed data. We are building toward primary data ownership — with patients as genuine partners, not data sources. The moat is not the data. It is the trust.
What This Means for Founders
If you are building an AI product in health IT, ask yourself one question: if your primary data supplier changed terms tomorrow, would your product still exist? If the answer is no — you are building on licensed data and you do not have a moat. You have a dependency.
The path to primary data ownership in healthcare runs through patients. It requires building genuine trust, delivering genuine value, and creating a consent architecture that patients understand and choose. That is harder than licensing a dataset. It is also the only way to build something that lasts.
About Symetrique
Symetrique is a healthcare analytics and intelligence company serving pharma and biotech, payers, health systems, and medical device companies. We combine healthcare analytics, commercial analytics, market research, and real world evidence into an integrated service offering — giving our clients the complete intelligence picture they need to make faster, more confident decisions.
Our approach is built on a simple belief: data without context is noise. Symetrique brings together the structured and unstructured data layers of healthcare — claims, clinical records, physician observations, and patient journeys — and applies AI to turn them into actionable intelligence across the full commercial and clinical lifecycle.
Whether you are a pharma team navigating a competitive landscape, a payer making formulary decisions, a health system optimizing care pathways, or a medical device company building your market strategy — Symetrique delivers the evidence and analytics that move you from questions to answers.
This blog is part of our ongoing series — Conversations at the Edge of Health IT — featuring insights from leading voices in healthcare, technology, and life sciences. Subscribe at www.thesymetrique.com to receive each new episode directly.
www.thesymetrique.com | Madhav Kathikar, Founder & CEO | 224-566-9880
About Rushing Headlong
Rushing Headlong: Health IT's Legacy and the Road to Responsible AI is written by Dr. S. Yin Ho, MD, MBA. It has a foreword by J.D. Kleinke and an endorsement from Stéphane Bancel, CEO of Moderna. Available on Amazon.