Blog

News, guides, and insights from the Verial team.

Hospital IT leaders reviewing AI governance dashboards at a conference

HIMSS26's Agentic AI Gap Is an Eval Problem

HIMSS26 showed health systems deploying agents faster than they can audit them. The fix isn't more governance theater; it's independent simulation.

Kevin Huang · April 19, 2026 · 8 min read

Clinician reviewing FHIR integration logs on a hospital workstation

guides

FHIR Sandbox Alternatives: Epic, HAPI, Cerner, Verial

Epic has 18 patients. HAPI has no auth. Here is how six FHIR sandboxes compare for testing healthcare AI agents in realistic conditions.

Stan Liu · April 14, 2026 · 9 min read

Healthcare operations dashboard monitoring AI agent performance

guides

Post-Deploy Monitoring for Healthcare AI Agents [2026 Guide]

What to actually monitor after a healthcare AI agent goes live: FHIR writes, HL7 ACKs, portal sessions, IVR completion, PHI signals, and kill switches.

Kevin Huang · April 14, 2026 · 10 min read

Healthcare developer testing FHIR integration on laptop

guides

Sim-to-Prod: One API for Healthcare AI Test and Live

Plaid and Stripe flip one credential between sandbox and production. Healthcare AI agents don't get that luxury. Here's how to build the canonical API layer that closes the sim-to-prod gap.

Kevin Huang · April 14, 2026 · 9 min read

Hospital leadership reviewing AI vendor evaluation data

insights

The Agent RFP: How Hospitals Should Evaluate AI in 2026

Slide decks and 3-month pilots can't tell you if an AI agent survives your workflows. Here's how the agent RFP replaces slideware with sim-based bakeoffs.

Kevin Huang · April 14, 2026 · 8 min read

Clinician reviewing data on a hospital workstation

insights

Why MedAgentBench and HealthBench Miss Real-World Bugs

HealthBench and MedAgentBench test clinical accuracy. Production agents fail on portal navigation, state management, and recovery. Here's the gap.

Stan Liu · April 14, 2026 · 8 min read

Dark server room with rows of illuminated servers and blue lighting

engineering

The FHIR Sandbox Problem: Why Test Environments Fail

Epic gives you 18 test patients. HAPI gives you no auth. Neither gives you realistic data. Here's why healthcare AI teams keep shipping code that breaks in production.

Stan Liu · April 10, 2026 · 6 min read

Doctor using a tablet with digital health technology interface in a hospital setting

insights

Why Healthcare AI Agents Need Sandbox Environments

Healthcare AI agents interact with EHRs, payer portals, and phone systems. Testing them against production is risky and slow. Sandboxes solve this.

Stan Liu · April 10, 2026 · 5 min read

Laboratory test tubes and scientific equipment in a testing environment

guides

How to Test Healthcare AI Agents Before Go-Live

A four-layer framework for testing healthcare AI agents across FHIR, voice, portals, and claims workflows before your first production patient.

Stan Liu · April 9, 2026 · 8 min read

Wooden gavel resting on a legal book, representing regulatory compliance

guides

CMS-0057 for Developers: Testing Prior Auth APIs

CMS-0057-F requires payers to support FHIR prior auth by January 2027. How to build and test CRD, DTR, and PAS workflows before the deadline.

Kevin Huang · April 8, 2026 · 8 min read

Person wearing a headset at a desk, representing call center operations

insights

Why Voice AI Agents Break on Real Healthcare IVR Calls

A strategic overview of voice AI testing in healthcare: why voice agents fail in production and what a deterministic test environment must include.

Stan Liu · April 7, 2026 · 7 min read

Medical researcher analyzing data on a computer screen in a laboratory

engineering

Synthetic Patient Data Beyond Synthea

Synthea generates population-level data. Healthcare AI agents need scenario-specific patients with clinical coherence. Here's what's missing and how to fill the gap.

Kevin Huang · April 6, 2026 · 9 min read

Abstract illustration of an AI brain formed by circuit board patterns on a blue background

insights

Building an OpenAI Gym for Healthcare AI Agents

Healthcare AI agents need training environments like RL agents need gyms. Deterministic, resettable, parallelizable environments for iterating on agent behavior.

Stan Liu · April 5, 2026 · 8 min read

Close-up of a computer monitor displaying lines of programming code

guides

FHIR R4 Testing Guide: Edge Cases and Vendor Gotchas

US Core conformance, must-support fields, vendor-specific quirks, and the edge cases that break your FHIR integration in production.

Stan Liu · April 3, 2026 · 12 min read

Ship control room with multiple monitors and navigation screens, representing multi-interface systems

engineering

Testing Healthcare AI Across FHIR, Voice, and Portals

Real healthcare workflows span FHIR servers, payer phone lines, insurance portals, and claims. Testing them in isolation misses the failures that matter.

Kevin Huang · April 2, 2026 · 8 min read

Technology conference audience watching a presentation on a large screen

insights

Why Healthcare AI Companies Need a Sandbox, Not Just a Demo

A demo shows your agent works once. A sandbox proves it works across hundreds of scenarios, edge cases, and failure modes.

Stan Liu · April 1, 2026 · 10 min read

Digital security concept with code and encryption on a dark screen

guides

HIPAA-Compliant AI Testing with Synthetic Data

Synthetic data eliminates HIPAA risk in AI development while providing more realistic testing than de-identified PHI subsets.

Kevin Huang · March 31, 2026 · 8 min read

Yellow caution tape warning sign against a blurred background

engineering

6 Ways Prior Auth AI Agents Fail in Production [2026]

Prior auth agents fail at portal login, form mapping, document upload, status polling, denial parsing, and payer quirks. A failure-mode catalog with test patterns.

Kevin Huang · March 29, 2026 · 7 min read

Vintage IBM personal computer on a desk, representing legacy healthcare technology still in use

engineering

HL7v2 Is Not Dead: Why Your AI Agent Still Needs It

FHIR gets the attention, but 95%+ of US hospitals still run HL7v2 ADT messaging. Your healthcare AI agent needs to handle both protocols.

Stan Liu · March 28, 2026 · 6 min read

Close-up of network cables plugged into a switch or router

guides

Connecting an AI Agent to a FHIR R4 Sandbox

Walk through connecting a healthcare AI agent to a FHIR sandbox environment, from authentication to reading patient data to writing results back.

Kevin Huang · March 27, 2026 · 11 min read

Person signing paperwork on a desk with documents and forms

guides

Testing Prior Auth Agents with Simulated Payer Portals

How to test prior auth AI agents against simulated payer portals: login flows, form mapping, document upload, and status tracking without hitting production systems.

Stan Liu · March 26, 2026 · 6 min read

Office telephone on a desk, representing IVR phone systems

guides

Voice Agent IVR Testing: A Practical How-To Guide [2026]

A step-by-step guide to building IVR test scenarios for healthcare voice agents: DTMF, speech recognition, hold handling, and representative interactions.

Kevin Huang · March 25, 2026 · 7 min read

Hospital room with medical monitors, IV pumps, and clinical equipment displays

guides

HL7v2 ADT Testing Guide: Simulated Hospital Feeds [2026]

A practical guide to testing healthcare AI agents against HL7v2 ADT feeds: MSH parsing, MLLP transport, test scenarios, and realistic admission events.

Stan Liu · March 24, 2026 · 7 min read

Stack of white document folders and binders representing claims file processing

guides

SFTP Claims Testing with Synthetic 837 and 835 Files

Revenue cycle AI agents process claims via SFTP. How to test 837 submissions and 835 remittance parsing with synthetic X12 data.

Kevin Huang · March 23, 2026 · 9 min read

Laptop screen showing a colorful data visualization dashboard with charts and graphs

engineering

Synthea Alternatives for Healthcare AI Testing

Synthea generates population-level synthetic data. If you need scenario-specific patients, vendor-shaped FHIR bundles, or multi-interface test data, here are your options.

Kevin Huang · March 21, 2026 · 10 min read

Two professionals shaking hands in a business agreement

insights

How to Evaluate Healthcare AI Vendor Testing

Health systems buying AI should ask how vendors test their agents. Here's a framework for evaluating testing rigor before signing a contract.

Kevin Huang · March 19, 2026 · 8 min read

Rocket launching into a clear sky, symbolizing a product go-live moment

guides

Healthcare AI Go-Live Checklist: Pre-Production Validation

A checklist for clinical informatics and IT teams validating healthcare AI agents before they touch production systems and real patients.

Stan Liu · March 18, 2026 · 9 min read

Analytics dashboard displaying charts and growth metrics on a laptop screen

insights

Healthcare AI Pilot ROI: Metrics That Actually Matter

Most healthcare AI pilots measure the wrong things. Here's how to structure a pilot that generates evidence your governance committee will accept.

Kevin Huang · March 17, 2026 · 8 min read