Skip to content
View zaherkarp's full-sized avatar

Block or report zaherkarp

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
zaherkarp/README.md

Hi, I'm Zaher

I build and govern production analytics systems in regulated healthcare — pipelines, semantic layers, and data products that clinical, operational, and executive teams can trust and act on.

I started as a news writer and book editor, then moved into healthcare research and analytics. That background still shapes how I work: clear questions, defensible methods, and results that change decisions.

Currently: Lead Data Engineer at Baltimore Health Analytics, working on Medicare Advantage quality and performance analytics.


What I work on

  • Production data pipelines — EHR data modeling across Epic, Cerner, Veradigm, and athenahealth; ETL/ELT on AWS and Databricks; stored procedures built to survive edge cases in regulated populations
  • Analytics governance — Semantic layers, metric definitions, and dbt-versioned models that give stakeholders one version of truth
  • Value-based care analytics — ACO, MSSP, HEDIS, and Medicare Stars performance tracking; benchmarking and significance testing against CMS methodology
  • Platform reliability — Observability, incident response, and cost optimization across cloud data infrastructure

Selected work

Payer Quality Analytics Platform — Lead data engineering for a SaaS Medicare Advantage analytics platform. Designed stored procedure architecture for regulatory measure calculation, CMS specification validation, and plan-level performance reporting. (MariaDB, Python, Ruby on Rails)

Embedded Refills and Care Gaps — Designed the data model and daily cohort refresh pipeline; standardized legacy SQL into shared stored procedures (~70% reduction in codebase), improving maintainability and cross-customer deployment. (AWS Redshift, dbt, Airflow, FHIR, Tableau, Power BI)

Revenue and Program Performance — Built benchmarking and trend tracking for ACO/MSSP/HEDIS/Stars programs; reduced storage costs ~50% and ETL load time by 24+ hours through infrastructure redesign; governed dashboards enabled 7× user growth and eliminated 400+ manual hours per quarter. (S3, Redshift, dbt, SQL, Python, Tableau)


Research and writing

Published on accountable care organization shared savings, clinic design and team efficiency, and EHR optimization in primary care. Presented at state and national conferences on population health, community-engaged mental health research, and quality and safety improvement — audiences ranging from clinical teams to health system leadership.


Stack

Data and cloud: SQL, Python (pandas, PySpark, Jinja, boto), R, dbt, Airflow, Git, AWS (S3, Redshift), Databricks, Perl, Ruby
BI and visualization: Tableau, Power BI, QuickSight, Sisense for Cloud Data Teams
Observability: Grafana, DataDog, SumoLogic
Healthcare data: Epic Clarity, Cerner, Veradigm/Allscripts, athenahealth, FHIR, HL7, CMS technical specifications, Public Use Files

Pinned Loading

  1. skillsprout skillsprout Public

    MVP web application that uses O*NET occupation skill data to help users discover job transition opportunities based on current skills and experiences

    Python

  2. ecds-shock-index ecds-shock-index Public

    Reproducible analytics framework for modeling how shifts in Electronic Clinical Data Systems (ECDS) measure performance can impact Medicare Advantage Star Ratings

    Python

  3. hedis-state-briefing hedis-state-briefing Public

    Narrative, single-page briefing wall that explains how Stars / HEDIS operations are shifting by state

    Python

  4. medicare-advantage-insight-engine medicare-advantage-insight-engine Public

    Free, local Medicare Advantage news insight monitor that fetches public sources, scores items for analytic relevance, and posts structured alerts to a webhook endpoint (Teams-compatible or generic)

    Python

  5. deterministic-media-metadata-generator deterministic-media-metadata-generator Public

    A deterministic, script-driven tool for building an Obsidian media library from a markdown list. Parses, normalizes, deduplicates, and enriches media entries with Wikidata metadata.

    Python

  6. zaherkarp.github.io zaherkarp.github.io Public

    Astro