Skip to content
View yullieyang's full-sized avatar

Block or report yullieyang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please donโ€™t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
yullieyang/README.md

Hi, I'm Yullie Yang ๐Ÿ‘‹

I am a Data Scientist & Quantitative Analyst specializing at the unique intersection of advanced financial risk forecasting and production-grade Generative AI framework evaluation. Currently pursuing my Master of Applied Science in Computer Science at the University of Pennsylvania.

๐Ÿ› ๏ธ Core Expertise

  • AI & LLM Core: LLM & RAG Evaluation, Agentic Workflows (LangChain, LangGraph), Prompt Engineering, Synthetic Data Generation, AI Trust & Safety
  • Data & Analytics: Statistical Analysis, A/B Testing & Experimentation (CUPED), Metric Design, Quant Model Validation, Stress Testing, Python, SQL, R

๐Ÿš€ Featured Projects (Check out my pinned repos below!)

  • Developed an adversarial evaluation platform using synthetic model-failure datasets to stress-test LLM workflows.
  • Built a Streamlit validation dashboard benchmarking precision, recall, and confidence to map hallucination patterns.
  • Tech Stack: Python, LLM APIs, Streamlit, LangGraph
  • Spearheaded a simulated feed-ranking A/B test integrating SRM checks, power/MDE analysis, and CUPED variance reduction.
  • Analyzed segment-level treatment effects to translate statistical insights into data-driven launch plans.
  • Tech Stack: Python, pandas, scipy, statsmodels

๐Ÿ“ˆ Career Snapshot

  • Quantitative Analyst @ CoStar Group (Current)
  • Data Scientist / Consultant @ Guidehouse
  • Graduate Intern @ Federal Reserve Board of Governors

๐Ÿ“ซ How to reach me: yullieyang@gmail.com | LinkedIn

Pinned Loading

  1. r-macro-trade-commodity-forecast r-macro-trade-commodity-forecast Public

    Reproducible R workflow: FRED macro/trade/commodity panel, auto.arima forecasts for net exports, real GDP, and WTI, FX pass-through regression.

    R

  2. llm-research-workflow-assistant llm-research-workflow-assistant Public

    Responsible AI workflow prototype for research QA, documentation, and human-in-the-loop review.

    Python

  3. cre_stress_test cre_stress_test Public

    Production-style CRE credit-risk modeling pipeline โ€” Python package + R/auto.arima companion + SQLAlchemy persistence + Streamlit dashboard. FRED, Google Mobility, Boston Zoning. pytest + CI.

    HTML

  4. agentic-ai-evaluation-platform agentic-ai-evaluation-platform Public

    Applied research study of LLM-based QA agents on synthetic model-monitoring anomaly review: scenario-labelled synthetic data, deterministic baselines, schema-constrained agent + reviewer, calibratiโ€ฆ

    Python

  5. product-ab-experiment product-ab-experiment Public

    End-to-end A/B experimentation analysis: North Star metric, SRM check, power/MDE, CUPED, guardrails, segmentation, and ARIMA forecasting. Reproducible.

    Python