About
I'm a data platform engineer with 10+ years of experience designing and building the systems that move, transform, and surface data at scale — across AI, fintech, media, and high-scale analytics environments.
These days I'm drawn to the intersection of data engineering and AI: agentic pipelines, schema governance with human-in-the-loop oversight, and evaluation frameworks that make it safe to iterate quickly on complex transformations.
I care about building platform foundations that teams can actually trust — reliable, observable, and easy to extend, so engineers spend less time firefighting and more time building.
Experience
Staff Data Engineer · Figure
2021 – PresentArchitected and owned the company's core data platform — a multi-stage lakehouse with standardized ingestion, automated schema evolution, and self-serve onboarding for new data sources. Later led the development of agentic systems that apply LLMs to propose and validate data transformations end-to-end, turning a manual, error-prone process into one that scales without proportional headcount.
Software Engineer · Microsoft
2020 – 2021Defined the engineering patterns and infrastructure standards for a large-scale data platform migration, establishing the technical foundation the team built on. Drove the performance work that made the new platform a clear step forward — not just a lateral move.
Lead Data Engineer · Vice Media
2017 – 2020Led the modernization of Vice Media's data infrastructure — introducing the orchestration tooling, building the company's foundational data lakes, and driving the migration away from legacy warehouses. Left the data org with a platform that could scale with the business rather than against it.
Projects
Agentic Evaluation Framework
Built an evaluation system that leverages agents to automatically seed test data into Iceberg tables, propose schema and transformation changes, execute full pipelines, and validate outputs. This approach made it dramatically faster and safer to iterate on complex data transformations. The verifiable output loop also unlocked agentic coding workflows, giving AI agents a reliable signal to iterate against.
Unified Data Platform
Designed and built a platform to unify data scattered across 100+ microservices into a single, queryable warehouse. The system automated schema evolution, infrastructure provisioning, and onboarding — so adding a new service went from a week of manual work to a two-hour self-serve process.
Custom Event Tracking Platform
Replaced a costly third-party event tracking platform with a custom-built system on AWS — Kinesis for ingestion, API Gateway and Lambda for collection, and Redshift for storage and analysis. Gave the analytics team full ownership of their event data pipeline while dramatically cutting infrastructure spend.