Dev by DuesselbergAdrian · Pull Request #26 · SmartGridsML/genai_project

DuesselbergAdrian · 2026-02-25T20:33:39Z

No description provided.

Add CV document parser + /applications/parse upload endpoint

…st ID middleware

feat: config.py + llm_service.py | test for llm_service.py

Add /applications/generate orchestration with Redis caching, request

…LM + test

Day2 person a

code still references settings.timeout_seconds, changed as per fix

A Day 3: JD analysis + grounded cover letter endpoints

…tem that verifies cover letters against extracted CV facts

feat(day4A): implemented the Auditor

## What Was Built ### 1. Golden Test Dataset (test_cases.json) - 8 comprehensive test cases: 1 happy path + 7 edge cases - Covers sparse CVs, skill mismatches, quantitative verification, international characters - Each test case includes expected hallucination rates and metrics ### 2. Automated Evaluation Pipeline (evaluation_suite.py) - End-to-end evaluation of cover letter generation pipeline - Tracks quality metrics: hallucination rate, confidence, support ratio - Tracks performance metrics: P50/P95/P99 latency - Tracks cost metrics: token usage, API costs - MLflow integration for experiment tracking - CLI with filtering options ### 3. Prometheus Production Metrics (prometheus_metrics.py) - 15+ production-ready metrics - Four Golden Signals: Latency, Traffic, Errors, Saturation - Auto-instrumentation via decorators - LLM-specific metrics: hallucination rate, token usage, API costs ### 4. Comprehensive Test Suite (test_evaluation_suite.py) - 14 unit tests covering all functionality - Tests initialization, execution, metrics, reporting, MLflow logging - All tests passing ✅ ### 5. Documentation - EVALUATION_SUITE_USAGE.md: Quick start guide - demo_evaluation.py: Demo script (no API keys needed) - Pedagogical guides: evaluation strategies & production metrics ## Import Fixes - Updated imports from `app.*` to `backend.app.*` for proper module execution - Fixed in: auditor.py, fact_extractor.py, prompts.py, llm_service.py ## Testing - All 14 evaluation suite tests passing - Evaluation suite CLI functional - Demo script verified

## Implementation Refactored LLM service to support automatic fallback from OpenAI to Google Gemini using LangChain, providing increased reliability and cost optimization. ### Changes 1. **LLM Service (llm_service.py)** - Added LangChain integration (ChatOpenAI, ChatGoogleGenerativeAI) - Implemented automatic OpenAI → Gemini fallback on errors - Enhanced observability with provider tracking - Cost: ~90% savings with Gemini fallback ($0.01 vs $0.10) 2. **Configuration (config.py)** - Added gemini_api_key field - Added gemini_model configuration (gemini-2.5-flash) 3. **Tests (test_llm_service.py)** - Updated for LangChain mocking - Added fallback mechanism test - All 56 tests passing 4. **Markdown Stripping** - Already in auditor.py and fact_extractor.py - Handles Gemini's code-wrapped JSON responses ### Testing Evaluation suite tested with real API calls: - ✅ Success Rate: 100% - ✅ Hallucination Rate: 0% - ✅ Cost per request: $0.01 (Gemini) vs $0.10 (OpenAI) - ✅ All fallback transitions logged to MLflow ### Dependencies Added - langchain - langchain-openai - langchain-google-genai

Person B Day 4: CV enhancement, results endpoint, structured logging

Implement Day 5 Person A: Evaluation Suite & Production Metrics

- - Configures pytest to add project root to Python path (pythonpath = .) - Now pytest works directly without needing python -m pytest - Sets default test paths and options

Fix/imports

Document Generation + Download Endpoints + CI Pipeline DONT MERGE

- Load balancer configuration - Environment variables management - S3 for document storage - CloudWatch logs configuration - Prometheus + Grafana dashboards - Alert rules

Day6_PersonA: - Terraform for AWS ECS

Frontend foundation: Vite + React + TypeScript + Tailwind

Results UI scaffold + backend contract alignment (WIP)

…olicies, ALB routing

SmartGridsML and others added 30 commits December 15, 2025 15:54

Initial commit

c0967e2

Update README.md

18b6e66

Update README.md

8af92e9

Update README.md

d0da64e

Update README.md

e18005f

updated projected strcuture

dd41722

added docs, infra , frontend + mlops folders

13448a6

Add FastAPI app with /health, /metrics and docker compose stack

82cef6e

Add FastAPI health/metrics endpoints and backend Docker setup

80c0ff8

Add CV document parser and /applications/parse upload endpoint

e5fe5c7

Merge pull request #9 from SmartGridsML/person-b-day2-parser

1267d91

Add CV document parser + /applications/parse upload endpoint

Add /applications/generate orchestration with Redis caching and reque…

280e99a

…st ID middleware

feat: config.py + llm_service.py | test for llm_service.py

0cd5170

Merge pull request #14 from SmartGridsML/a_d1

0c323ab

feat: config.py + llm_service.py | test for llm_service.py

Merge branch 'main' into person-b-day3-generate

edf3545

Merge pull request #13 from SmartGridsML/person-b-day3-generate

a4a541f

Add /applications/generate orchestration with Redis caching, request

Add JD analysis and grounded cover letter LLM endpoints (Day 3 Person A)

46fcecc

feat: fact extractor - Extracts structured facts from CV text using L…

f825b15

…LM + test

feat: versioned prompts

7d0562a

updated schemas with Facts, Job Analysis

20cf389

updated response Format in llm_service.py

ac9815e

Merge pull request #16 from SmartGridsML/day2_personA

77f0f72

Day2 person a

Merge branch 'main' into person-a-day3-jd-cover

72cd410

Update llm_service.py

bd1c817

code still references settings.timeout_seconds, changed as per fix

Merge pull request #15 from SmartGridsML/person-a-day3-jd-cover

5b26f8f

A Day 3: JD analysis + grounded cover letter endpoints

feat(day4A): implemented the **Auditor** - our anti-hallucination sys…

053750c

…tem that verifies cover letters against extracted CV facts

Merge pull request #17 from SmartGridsML/day4_personA

3c13378

feat(day4A): implemented the Auditor

fix: backend.app.* import structure in tests

537dcaf

Fix import-time side effects, make LLMService lazy, Docker test setup

58b3be6

SmartGridsML and others added 28 commits January 20, 2026 15:18

Merge pull request #20 from SmartGridsML/feature/personb-day4

5bec0a8

Person B Day 4: CV enhancement, results endpoint, structured logging

Merge branch 'main' into day5_personA

638d0fb

Merge pull request #18 from SmartGridsML/day5_personA

b4d3560

Implement Day 5 Person A: Evaluation Suite & Production Metrics

fix: Corrupted Code in llm_service.py (lines 325-337) | imports

0b17134

addedd pytest.ini ;

451aa85

- - Configures pytest to add project root to Python path (pythonpath = .) - Now pytest works directly without needing python -m pytest - Sets default test paths and options

fixed import on test

6f2e2c4

Merge pull request #21 from SmartGridsML/fix/imports

62835df

Fix/imports

Fix tests container PYTHONPATH by mounting project root

c01a5ee

Add CI with tests, linting, and type checks

2d8bc62

Comment formatting tests

f43f024

Adding pytest to dev requirements

a218b09

Adding requirements.txt

f2f5fe1

adding gemini key

04e4efa

Stub tests downloads

2618f04

Updated CI (Redis and Postgres)

4924afd

Merge pull request #22 from SmartGridsML/feature/personb-day5

d2d2225

Document Generation + Download Endpoints + CI Pipeline DONT MERGE

Day6_PersonA: - Terraform for AWS ECS

04700ab

- Load balancer configuration - Environment variables management - S3 for document storage - CloudWatch logs configuration - Prometheus + Grafana dashboards - Alert rules

Merge pull request #23 from SmartGridsML/day6_personA

d1a3ef2

Day6_PersonA: - Terraform for AWS ECS

fix(frontend): type-only imports for TS build

a4ff0fd

Merge pull request #24 from SmartGridsML/day6-frontend-foundation

c791699

Frontend foundation: Vite + React + TypeScript + Tailwind

Frontend page

8fd1ff5

Merge pull request #25 from SmartGridsML/day7-results-ui

35133ab

Results UI scaffold + backend contract alignment (WIP)

fix(docker): fix backend module path and add frontend service

30f197f

refactor(llm): remove dead LLM_BASE_URL config

8fbfbe8

fix(frontend): switch to Tailwind v4 import syntax, delete artifact file

49995a0

feat(terraform): add frontend ECS service, ECR repos with lifecycle p…

0869888

…olicies, ALB routing

fix: docker build path, frontend redesign, state flow fixes

479a54b

DuesselbergAdrian closed this Feb 25, 2026

DuesselbergAdrian force-pushed the dev branch from 1b310c3 to 479a54b Compare February 25, 2026 20:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dev#26

Dev#26
DuesselbergAdrian wants to merge 60 commits into
mainfrom
dev

DuesselbergAdrian commented Feb 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

DuesselbergAdrian commented Feb 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants