Improve all samples with cache-awareness, add 4 new samples, fix SDK versions, and prepare repo for public sharing by leestott · Pull Request #546 · microsoft/Foundry-Local

leestott · 2026-03-23T23:37:37Z

Summary

This PR improves every existing sample across all languages (C#, JavaScript, Python, Rust) with cache-awareness and visual feedback, adds 4 brand-new samples, fixes SDK version inconsistencies across the repo, and addresses repo hygiene issues for public sharing readiness.

93 files changed — 67 new files, 26 modified files.

What's Changed

1. New Samples (4)

`samples/js/local-cag/` — Context-Augmented Generation (12 files)

Offline CAG-powered support agent for gas field engineers. Pre-loads domain documents (valve inspections, PPE requirements, emergency shutdown procedures, etc.) directly into the context window — no vector database, no embeddings, no retrieval pipeline needed.

Express web server with streaming chat UI
Full document context pre-loading at startup
Model auto-selection with cache awareness
Domain-specific gas field safety documentation included

`samples/js/local-rag/` — Retrieval-Augmented Generation (11 files)

Offline RAG-powered support agent using SQLite + term-frequency vectors for document retrieval. Demonstrates the full RAG pipeline running 100% locally.

Document ingestion with chunking (npm run ingest)
SQLite-backed vector store with term-frequency ranking
Express web server with streaming chat UI
Same gas field domain docs for direct comparison with CAG approach

`samples/python/agent-framework/` — Microsoft Agent Framework Integration (24 files)

Full-featured agent framework sample showing Foundry Local as the LLM backend for agentic AI workflows.

5 interactive demos: weather tools, code reviewer, math agent, sentiment analyzer, multi-agent debate
Tool calling with automatic function dispatch
RAG pipeline with document ingestion
Flask web UI with streaming responses
Orchestrator pattern for multi-step reasoning
Comprehensive README with architecture diagrams

`samples/cs/whisper-transcription/` — ASP.NET Core Whisper Transcription (13 files)

Production-quality audio transcription service using Foundry Local's Whisper model via WinML.

ASP.NET Core Minimal API with proper service architecture
Drag-and-drop audio upload UI
Real-time recording via MediaRecorder API
Health checks for Foundry service availability
Error handling middleware
Clean separation: FoundryModelService, TranscriptionService, FoundryHealthCheck

2. Cache-Awareness Improvements (All Existing Samples)

Every existing sample was updated to check the local model cache before attempting downloads. This provides:

Visual feedback — users see whether their model is already cached or needs downloading
Faster startup — skips unnecessary download operations
Better UX — clear progress indicators with ✓ Model already cached or ⏳ Downloading...

C# samples updated (6 files):

AudioTranscriptionExample/Program.cs
FoundryLocalWebServer/Program.cs
HelloFoundryLocalSdk/Program.cs
ModelManagementExample/Program.cs
ToolCallingFoundryLocalSdk/Program.cs
ToolCallingFoundryLocalWebServer/Program.cs

JavaScript samples updated (7 files):

audio-transcription-example/app.js
copilot-sdk-foundry-local/src/app.ts and src/tool-calling.ts
langchain-integration-example/app.js
native-chat-completions/app.js
tool-calling-foundry-local/src/app.js
web-server-example/app.js

Python samples updated (4 files):

hello-foundry-local/src/app.py
summarize/summarize.py
functioncalling/fl_tools.ipynb
functioncalling/README.md

Notebooks updated (1 file):

rag/rag_foundrylocal_demo.ipynb — significant rewrite with cache detection, clearer cell structure, and improved RAG pipeline

3. SDK API Correctness Fixes (7 files)

Validated all samples against the latest public SDK APIs (JS SDK sdk/js/src, Python SDK sdk_legacy/python, C# SDK sdk/cs/src) and fixed:

File	Issue	Fix
`js/local-cag/src/modelSelector.js`	Used private `selectedVariant._modelInfo`	Switched to public `model.variants` / `variant.modelInfo` / `model.isCached`
`js/local-rag/src/chatEngine.js`	`progress * 100` yielded 0–10000 (SDK reports 0–100)	Changed to `Math.round(progress)` for display, `progress / 100` for normalized value
`python/summarize/summarize.py`	`load_model(cached_models[0].id)` inconsistent with alias pattern	Changed to `load_model(cached_models[0].alias)`
`python/agent-framework/foundry_boot.py`	Fragile `str(m)` substring match for model ID resolution	Replaced with `manager.get_model_info(alias).id`
`python/agent-framework/web.py`	`drain()` buffered all SSE events before yielding	Replaced with incremental `__anext__()` loop for real-time streaming
`cs/whisper-transcription/TranscriptionService.cs`	`CancellationToken.None` hardcoded	Threaded `CancellationToken` through method and into all async calls
`cs/whisper-transcription/FoundryModelService.cs`	`progress % 10 == 0` unreliable for float	Replaced with `Math.Floor(progress / 10)` threshold bucket approach

4. Review Feedback Fixes — Round 2 (3 files)

File	Issue	Fix
`cs/whisper-transcription/FoundryModelService.cs`	`InitializeAsync()` not thread-safe — concurrent ASP.NET requests could double-initialize	Added `SemaphoreSlim` with double-check locking pattern
`python/summarize/README.md`	Claimed default model is `phi-4-mini` but code uses first cached model	Aligned README with actual behavior
`js/local-rag/README.md`	Claimed "TF-IDF" throughout but implementation uses raw term-frequency (no IDF)	Replaced all "TF-IDF" references with "term-frequency"

5. Review Feedback Fixes — Round 3 (7 files)

File	Issue	Fix
`python/agent-framework/README.md`	Troubleshooting referenced `FLASK_PORT` env var that doesn't exist in code	Changed to `--port <number>` CLI flag which matches `__main__.py`
`js/local-rag/package.json`	`"tfidf"` keyword misleading — implementation is term-frequency only	Changed keyword to `"term-frequency"`
`python/agent-framework/web.py`	`asyncio.new_event_loop()` without `set_event_loop()` — breaks on Python 3.10+	Added `asyncio.set_event_loop(loop)` after creation, clears in `finally` block
`cs/whisper-transcription/FoundryModelService.cs`	`EnsureModelReadyAsync` lacked `CancellationToken`	Added `CancellationToken ct = default` parameter, threaded through `IsCachedAsync(ct)`, `DownloadAsync(..., ct)`, `LoadAsync(ct)`
`cs/whisper-transcription/TranscriptionService.cs`	Caller didn't pass `ct` to `EnsureModelReadyAsync`	Now passes `ct` from `TranscribeAsync`
`js/local-cag/src/config.js`	`host` hardcoded to `"127.0.0.1"` despite README documenting `HOST` env var	Changed to `process.env.HOST \|\| "127.0.0.1"`
`js/local-rag/src/config.js`	All config values hardcoded — `FOUNDRY_MODEL`, `PORT`, `HOST` env vars documented but not read	Added `process.env.FOUNDRY_MODEL`, `parseInt(process.env.PORT, 10)`, `process.env.HOST` with sensible defaults

6. SDK Version Fixes

File	Before	After	Issue
`samples/js/local-cag/package.json`	`^0.9.0`	`^0.5.1`	Version 0.9.0 doesn't exist on npm
`samples/js/local-rag/package.json`	`^0.9.0`	`^0.5.1`	Version 0.9.0 doesn't exist on npm
`samples/js/copilot-sdk-foundry-local/package.json`	`"latest"`	`^0.5.1`	Unpinned — could break at any time
`samples/js/chat-and-audio-foundry-local/package.json`	`"latest"`	`^0.5.1`	Unpinned — could break at any time
`samples/js/electron-chat-application/package.json`	(missing)	`^0.5.1`	`foundry-local-sdk` not listed despite `import` in `main.js`
`samples/python/summarize/requirements.txt`	`>=0.3.1`	`>=0.5.1`	Outdated min version
`samples/python/hello-foundry-local/requirements.txt`	(file missing)	Created with `>=0.5.1`	No requirements.txt existed at all

7. Repo Hygiene

SUPPORT.md — Replaced the default GitHub template (contained TODO and REPO MAINTAINER: INSERT INSTRUCTIONS HERE placeholders) with actual content pointing to GitHub Issues, docs, and samples.

Validation Performed

Check	Result
SDK API correctness — validated all samples against latest SDK source in `sdk/js/src`, `sdk_legacy/python`, `sdk/cs/src`	✅ 7 issues fixed
Thread safety — FoundryModelService.InitializeAsync uses SemaphoreSlim	✅ Fixed
CancellationToken propagation — EnsureModelReadyAsync threads ct through all async calls	✅ Fixed
Event loop safety — web.py sets event loop for Python 3.10+ compatibility	✅ Fixed
Env var consistency — config.js files in local-cag and local-rag read documented env vars	✅ Fixed
README accuracy — all READMEs match actual implementation behavior	✅ Fixed
Security scan — searched all samples for hardcoded secrets, API keys, tokens	✅ Clean — all `api_key` references are programmatic
SDK version consistency — cross-referenced every dependency file against published SDK versions	✅ Fixed (7 issues resolved)
.gitignore coverage — verified no build artifacts can be committed	✅ Comprehensive
README coverage — checked every sample has documentation	✅ 35 README.md files
License files — verified legal files present	✅ All present
No TODO/FIXME in shipping code	✅ Clean
No committed build artifacts	✅ Clean
C# compile errors — checked TranscriptionService.cs and FoundryModelService.cs	✅ No errors

SDK Version Matrix (Current State)

Language	Package	Version	Source
C#	`Microsoft.AI.Foundry.Local`	0.9.0	Central `Directory.Packages.props`
C#	`Microsoft.AI.Foundry.Local.WinML`	0.9.0	Central `Directory.Packages.props`
JavaScript	`foundry-local-sdk`	^0.5.1	All sample `package.json` files
Python	`foundry-local-sdk`	>=0.5.1	All sample `requirements.txt` files
Rust	`foundry-local-sdk`	0.1.0	Path reference to `sdk/rust/`

Notes

4 JS samples (native-chat-completions, web-server-example, audio-transcription-example, langchain-integration-example) are intentionally single-file with no package.json — their READMEs instruct users to npm install manually.
Rust samples all use path = "../../../sdk/rust" which always resolves to the latest local SDK.
Python functioncalling notebook uses ! pip install foundry-local-sdk without version pin — standard for notebooks.

…ions, and prepare repo for public sharing

vercel · 2026-03-23T23:37:42Z

@leestott is attempting to deploy a commit to the MSFT-AIP Team on Vercel.

A member of the Team first needs to authorize it.

Copilot

Pull request overview

This PR updates the repository’s samples to be more “cache-aware” (skip redundant model downloads and provide clearer progress UX), adds several new end-to-end samples (JS local CAG/RAG, Python agent framework, C# Whisper transcription), and tightens repo hygiene/version consistency in preparation for public sharing.

Changes:

Added new JS offline CAG and offline RAG samples with web UIs + model init progress reporting.
Added a new Python “agent-framework” sample (multi-agent orchestration + Flask SSE UI) and smoke tests.
Updated multiple existing samples/notebooks/docs to use cache checks, clearer lifecycle steps, and pinned SDK versions (plus SUPPORT.md refresh).

Reviewed changes

Copilot reviewed 93 out of 93 changed files in this pull request and generated 7 comments.

Show a summary per file

File	Description
samples/rag/rag_foundrylocal_demo.ipynb	Updates notebook to use Foundry Local C# SDK lifecycle + SDK-managed endpoint.
samples/rag/README.md	Documents SDK-based lifecycle and removes hardcoded endpoint/variant guidance.
samples/python/summarize/summarize.py	Adds cache-aware model selection/download UX for summarize CLI.
samples/python/summarize/requirements.txt	Bumps minimum `foundry-local-sdk` version.
samples/python/summarize/README.md	Adds feature notes for cache-awareness + UX improvements.
samples/python/hello-foundry-local/src/app.py	Adds cache-check + explicit lifecycle steps before streaming chat.
samples/python/hello-foundry-local/requirements.txt	Adds missing requirements file with SDK + OpenAI deps.
samples/python/hello-foundry-local/README.md	Adds cache-aware feature notes + clarifies run steps.
samples/python/functioncalling/fl_tools.ipynb	Adds explicit lifecycle (start/cache/download/load) before tool-calling demo.
samples/python/functioncalling/README.md	Fixes notebook link + adds prerequisites/features.
samples/python/agent-framework/tests/test_smoke.py	Adds smoke tests for imports, doc loading, env override, demo registry.
samples/python/agent-framework/src/app/web.py	Flask web UI + SSE endpoints for orchestrator + demos.
samples/python/agent-framework/src/app/tool_demo.py	Standalone tool-calling validation for direct + LLM-driven tools.
samples/python/agent-framework/src/app/orchestrator.py	Implements sequential/concurrent/hybrid orchestration as async generators.
samples/python/agent-framework/src/app/foundry_boot.py	Bootstrapper for Foundry Local endpoint/model selection + env override.
samples/python/agent-framework/src/app/documents.py	Loads/chunks local docs into retriever context.
samples/python/agent-framework/src/app/demos/weather_tools.py	Adds multi-tool weather demo.
samples/python/agent-framework/src/app/demos/sentiment_analyzer.py	Adds sentiment/emotion/key-phrase tools demo.
samples/python/agent-framework/src/app/demos/registry.py	Central demo registry for web UI listing/routing.
samples/python/agent-framework/src/app/demos/multi_agent_debate.py	Adds multi-agent debate demo.
samples/python/agent-framework/src/app/demos/math_agent.py	Adds math/tools demo (includes expression evaluation).
samples/python/agent-framework/src/app/demos/code_reviewer.py	Adds code review tools demo.
samples/python/agent-framework/src/app/demos/init.py	Exposes demos + registry helpers for import/registration.
samples/python/agent-framework/src/app/agents.py	Agent factories + shared tool functions.
samples/python/agent-framework/src/app/main.py	CLI entry (web/cli modes) + orchestrator runner.
samples/python/agent-framework/src/app/init.py	Defines package root.
samples/python/agent-framework/requirements.txt	Declares runtime dependencies for the new sample.
samples/python/agent-framework/pyproject.toml	Packaging metadata + deps + dev extras (pytest).
samples/python/agent-framework/data/orchestration_patterns.md	Sample docs for retriever context.
samples/python/agent-framework/data/foundry_local_overview.md	Sample docs for retriever context.
samples/python/agent-framework/data/agent_framework_guide.md	Sample docs for retriever context.
samples/python/agent-framework/README.md	Full sample documentation + quickstart + structure.
samples/python/agent-framework/.env.example	Environment template for model/docs/log level.
samples/js/web-server-example/app.js	Adds cache check + progress bar before downloading models.
samples/js/tool-calling-foundry-local/src/app.js	Adds cache check + progress bar before downloading models.
samples/js/native-chat-completions/app.js	Adds cache check + reusable progress bar for model download.
samples/js/local-rag/src/vectorStore.js	New SQLite-backed TF store with inverted index + caching.
samples/js/local-rag/src/server.js	New Express server with SSE status + chat + upload + ingestion.
samples/js/local-rag/src/prompts.js	System prompts for gas-field RAG agent (full + compact).
samples/js/local-rag/src/ingest.js	New ingestion script to chunk + index docs into SQLite.
samples/js/local-rag/src/config.js	Config for model, chunking, paths, and server settings.
samples/js/local-rag/src/chunker.js	Front-matter parsing + chunking + cosine similarity helpers.
samples/js/local-rag/src/chatEngine.js	Initializes SDK/model + retrieval + streaming/non-streaming responses.
samples/js/local-rag/package.json	New package manifest for local-rag sample.
samples/js/local-rag/docs/valve-inspection.md	Domain doc for RAG ingestion.
samples/js/local-rag/docs/pressure-testing.md	Domain doc for RAG ingestion.
samples/js/local-rag/docs/ppe-requirements.md	Domain doc for RAG ingestion.
samples/js/local-rag/docs/gas-leak-detection.md	Domain doc for RAG ingestion.
samples/js/local-rag/docs/emergency-shutdown.md	Domain doc for RAG ingestion.
samples/js/local-rag/README.md	New sample documentation (setup/ingest/architecture).
samples/js/local-cag/src/server.js	New Express server for CAG sample + init status SSE.
samples/js/local-cag/src/prompts.js	System prompts for gas-field CAG agent (full + compact).
samples/js/local-cag/src/modelSelector.js	Auto model selection based on RAM + caching preference.
samples/js/local-cag/src/context.js	Loads docs + keyword scoring + builds selected context per query.
samples/js/local-cag/src/config.js	Config for model selection, RAM budget, server, and context size.
samples/js/local-cag/src/chatEngine.js	Initializes SDK/model + injects preloaded context per query.
samples/js/local-cag/package.json	New package manifest for local-cag sample.
samples/js/local-cag/docs/valve-inspection.md	Domain doc for CAG startup context.
samples/js/local-cag/docs/pressure-testing.md	Domain doc for CAG startup context.
samples/js/local-cag/docs/ppe-requirements.md	Domain doc for CAG startup context.
samples/js/local-cag/docs/gas-leak-detection.md	Domain doc for CAG startup context.
samples/js/local-cag/docs/emergency-shutdown.md	Domain doc for CAG startup context.
samples/js/local-cag/README.md	New sample documentation (setup/architecture/config).
samples/js/langchain-integration-example/app.js	Adds cache check + progress bar before downloading models.
samples/js/electron-chat-application/package.json	Adds missing `foundry-local-sdk` dependency.
samples/js/copilot-sdk-foundry-local/src/tool-calling.ts	Pins SDK version + cache-aware model download.
samples/js/copilot-sdk-foundry-local/src/app.ts	Pins SDK version + cache-aware model download.
samples/js/copilot-sdk-foundry-local/package.json	Pins `foundry-local-sdk` version.
samples/js/chat-and-audio-foundry-local/package.json	Pins `foundry-local-sdk` version.
samples/js/audio-transcription-example/app.js	Adds cache check + progress bar before downloading models.
samples/cs/whisper-transcription/wwwroot/styles.css	New UI styling for Whisper transcription sample.
samples/cs/whisper-transcription/wwwroot/index.html	New drag/drop UI for uploading and transcribing audio.
samples/cs/whisper-transcription/wwwroot/app.js	Client-side upload/transcribe/copy + health polling.
samples/cs/whisper-transcription/nuget.config	Adds package source mapping for Foundry packages.
samples/cs/whisper-transcription/appsettings.json	Adds Foundry config (model alias, log level).
samples/cs/whisper-transcription/WhisperTranscription.csproj	New ASP.NET Core project for transcription service.
samples/cs/whisper-transcription/Services/TranscriptionService.cs	Implements streaming transcription via Foundry SDK audio client.
samples/cs/whisper-transcription/Services/FoundryOptions.cs	Options binding for model alias + logging.
samples/cs/whisper-transcription/Services/FoundryModelService.cs	Initializes Foundry manager + cache-aware download + load.
samples/cs/whisper-transcription/README.md	New sample documentation + endpoints + setup.
samples/cs/whisper-transcription/Program.cs	Minimal API endpoints + swagger + error middleware.
samples/cs/whisper-transcription/Middleware/ErrorHandlingMiddleware.cs	Centralized exception-to-JSON error handling.
samples/cs/whisper-transcription/Health/FoundryHealthCheck.cs	Health check that validates model availability.
samples/cs/GettingStarted/src/ToolCallingFoundryLocalWebServer/Program.cs	Adds explicit cache check + download progress bar.
samples/cs/GettingStarted/src/ToolCallingFoundryLocalSdk/Program.cs	Adds explicit cache check + download progress bar.
samples/cs/GettingStarted/src/ModelManagementExample/Program.cs	Adds explicit cache check + download progress bar.
samples/cs/GettingStarted/src/HelloFoundryLocalSdk/Program.cs	Adds explicit cache check + download progress bar.
samples/cs/GettingStarted/src/FoundryLocalWebServer/Program.cs	Adds explicit cache check + download progress bar.
samples/cs/GettingStarted/src/AudioTranscriptionExample/Program.cs	Adds explicit cache check + download progress bar.
SUPPORT.md	Replaces template placeholders with real support guidance.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

samples/js/local-cag/src/modelSelector.js

samples/python/summarize/summarize.py

samples/python/agent-framework/src/app/foundry_boot.py

samples/cs/whisper-transcription/Services/TranscriptionService.cs

samples/js/local-rag/src/chatEngine.js

samples/python/agent-framework/src/app/web.py

samples/cs/whisper-transcription/Services/FoundryModelService.cs

Copilot

Pull request overview

Copilot reviewed 93 out of 93 changed files in this pull request and generated 4 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

samples/python/summarize/README.md

samples/cs/whisper-transcription/Services/FoundryModelService.cs

samples/js/local-rag/README.md

samples/cs/whisper-transcription/Services/FoundryModelService.cs

… claims - FoundryModelService.cs: add SemaphoreSlim for thread-safe InitializeAsync to prevent concurrent callers from double-initializing in ASP.NET - summarize/README.md: align docs with code (uses first cached model, not phi-4-mini default) - local-rag/README.md: replace 'TF-IDF' with 'term-frequency' throughout since the implementation uses raw term-frequency maps without IDF weighting

Copilot

Pull request overview

Copilot reviewed 93 out of 93 changed files in this pull request and generated 9 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

samples/python/agent-framework/README.md

samples/js/local-rag/package.json

samples/python/agent-framework/src/app/web.py

samples/cs/whisper-transcription/Services/FoundryModelService.cs

samples/js/local-cag/src/config.js

samples/js/local-rag/src/config.js

samples/python/agent-framework/src/app/orchestrator.py

samples/cs/whisper-transcription/Program.cs

samples/cs/whisper-transcription/Services/TranscriptionService.cs

…onToken, README accuracy

Copilot

Pull request overview

Copilot reviewed 93 out of 93 changed files in this pull request and generated 7 comments.

Comments suppressed due to low confidence (1)

samples/python/agent-framework/src/app/web.py:177

api_demo_run() creates a new event loop but doesn't call asyncio.set_event_loop(loop) (and doesn't clear it). For consistency with api_run() and to avoid libraries failing due to missing current event loop, set/clear the loop in a try/finally around run_until_complete.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

samples/python/agent-framework/src/app/web.py

samples/js/local-cag/src/modelSelector.js

samples/cs/whisper-transcription/Program.cs

samples/js/local-rag/src/chatEngine.js

samples/python/agent-framework/src/app/orchestrator.py

samples/python/functioncalling/fl_tools.ipynb

Copilot

Pull request overview

Copilot reviewed 94 out of 94 changed files in this pull request and generated 3 comments.

Comments suppressed due to low confidence (1)

samples/python/agent-framework/src/app/web.py:183

SSE responses for /api/demo/<demo_id>/run are returned with only mimetype="text/event-stream". For consistent real-time streaming (especially behind proxies), add the usual SSE headers (Cache-Control: no-cache, Connection: keep-alive, and optionally X-Accel-Buffering: no) to this Response as well.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

samples/js/local-cag/README.md

samples/js/tool-calling-foundry-local/src/app.js

samples/python/agent-framework/src/app/web.py

Copilot

Pull request overview

Copilot reviewed 94 out of 94 changed files in this pull request and generated 4 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

samples/python/summarize/summarize.py

samples/python/hello-foundry-local/src/app.py

samples/cs/whisper-transcription/Middleware/ErrorHandlingMiddleware.cs

Copilot

Pull request overview

Copilot reviewed 94 out of 94 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

samples/python/functioncalling/fl_tools.ipynb

samples/js/local-rag/src/chatEngine.js

samples/js/local-cag/src/chatEngine.js

Copilot

Pull request overview

Copilot reviewed 94 out of 94 changed files in this pull request and generated 6 comments.

Comments suppressed due to low confidence (1)

samples/python/agent-framework/src/app/web.py:173

prompt = data.get(...).strip() will raise an exception if the JSON field is not a string, leading to a 500 instead of a 400. Add a type check/coercion before .strip() and return a validation error for non-string input.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

samples/python/agent-framework/src/app/demos/sentiment_analyzer.py

samples/python/agent-framework/src/app/demos/math_agent.py

samples/python/agent-framework/src/app/demos/code_reviewer.py

samples/python/agent-framework/src/app/demos/multi_agent_debate.py

samples/python/agent-framework/src/app/web.py

samples/python/agent-framework/src/app/demos/weather_tools.py

Copilot

Pull request overview

Copilot reviewed 94 out of 94 changed files in this pull request and generated 2 comments.

Comments suppressed due to low confidence (7)

samples/js/local-rag/src/chatEngine.js:1

The async generator can hang indefinitely if completeStreamingChat(...) rejects, because done is only set in .then(...) and the waiting loop relies on done to terminate. Set done = true and resolve any pending waiter in a .catch(...) or .finally(...), and consider capturing the error to rethrow after flushing buffered chunks so SSE clients don’t stall on failures.
samples/js/local-cag/src/chatEngine.js:1
Same streaming failure-mode as the Local RAG engine: if completeStreamingChat(...) throws/rejects, the generator can wait forever because done is only set in .then(...). Add a .catch(...)/.finally(...) that sets done = true and releases the waiter, and propagate the error (e.g., by storing it and throwing after the loop) so callers can send an SSE error event.
samples/python/agent-framework/src/app/foundry_boot.py:1
In the external-endpoint override path, model_id is set to self.alias. If the remote OpenAI-compatible endpoint expects a model ID (not an alias), downstream clients (e.g., OpenAIChatClient(model_id=conn.model_id)) can fail unexpectedly. A more robust contract is to allow specifying FOUNDRY_MODEL_ID (or reuse MODEL_ALIAS vs MODEL_ID explicitly) and keep model_alias/model_id distinct.
samples/rag/README.md:1
The README snippet doesn’t handle the GetModelAsync(...) not-found case; if it returns null, the next line will throw. Update the documentation snippet to use the same defensiveness as the notebook code (null-coalescing throw or explicit check) so copy/paste users don’t hit a NullReferenceException.
samples/python/agent-framework/src/app/web.py:1
traceback is imported but never used in this module. Removing unused imports reduces lint noise and keeps the sample easier to maintain.
samples/python/agent-framework/src/app/tool_demo.py:1
asyncio is imported but not used anywhere in this file. Removing it avoids confusion about event loop usage in this demo.
samples/python/agent-framework/src/app/orchestrator.py:1
logging (and re) are currently unused in this module (log is defined but never referenced). Removing unused imports/variables will keep the sample clean and reduce warnings for users running linters.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

samples/cs/whisper-transcription/nuget.config

samples/cs/whisper-transcription/WhisperTranscription.csproj

Copilot

Pull request overview

Copilot reviewed 94 out of 94 changed files in this pull request and generated 1 comment.

Comments suppressed due to low confidence (8)

samples/js/local-rag/src/chatEngine.js:1

If completeStreamingChat rejects/throws, done is never set and notify() is never called, which can leave the async generator blocked indefinitely waiting for more data. Attach a rejection handler that (1) records the error, (2) sets done = true, (3) calls notify(), and then have the generator either yield an error event or rethrow after draining buffered chunks.
samples/js/local-cag/src/chatEngine.js:1
Same streaming failure-mode as the RAG engine: if completeStreamingChat rejects, done never flips to true and the generator can hang forever. Add a .catch(...) path that marks completion and propagates the error (e.g., by storing it and throwing it from the generator after waking it).
samples/js/local-rag/src/chatEngine.js:1
This assumes catalog.getModel(...) always returns a model object. If the alias is invalid or the SDK returns null/undefined, this will throw on this.model.alias with a less actionable error. Prefer an explicit null-check and throw a clear error that includes the requested alias (and optionally suggests listing available models).
samples/js/local-rag/src/server.js:1
fs.writeFileSync (and mkdirSync) blocks the Node.js event loop, so large uploads or slow disks can stall all concurrent requests. Prefer the async fs.promises equivalents (await mkdir + writeFile) or stream to disk to keep the server responsive under load.
samples/rag/README.md:1
GetModelAsync(...) can return null; the subsequent model.IsCachedAsync() would then throw a null-reference exception. Update the snippet to explicitly handle the null case (e.g., throw with a clear message) so the README code is copy/paste safe.
samples/python/summarize/summarize.py:1
The PR description states summarize.py was changed to load by alias (load_model(cached_models[0].alias)), but the updated code loads by cached model ID. Either update the PR description to match the implementation, or switch the code back to alias-based loading if that’s the intended/required SDK pattern.
samples/python/agent-framework/src/app/web.py:1
Using module-level mutable globals for _conn/_docs makes the app hard to reason about in multi-app/test scenarios and can lead to cross-test interference (e.g., calling create_app twice overwrites shared state). Store these on the Flask app instance (app.config / g) or close over them inside create_app so each app instance is isolated.
samples/python/agent-framework/src/app/demos/math_agent.py:1
Even with builtins disabled and a restricted character set, eval still permits potentially expensive computations (e.g., very large exponentiation via , or huge integers) that can cause CPU/memory DoS when invoked via tool-calling. Consider explicitly rejecting '' and extremely long inputs (length/digit-count limits), or replace eval with an AST-based expression evaluator that only supports the intended operators with bounded complexity.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

samples/cs/whisper-transcription/Program.cs

Copilot

Pull request overview

Copilot reviewed 94 out of 94 changed files in this pull request and generated 3 comments.

Comments suppressed due to low confidence (11)

samples/python/summarize/summarize.py:1

download_model(args.model)may cache a specific resolved variant, but the code setsmodel_name = model_info.id(from the catalog lookup) rather than using the actual downloaded variant’s id. If resolution differs,load_model(model_name)can fail or load the wrong variant. Prefer capturing the return value ofdownload_model(...)(or re-readinglist_cached_models()` after download) and loading by that returned cached variant id.
samples/python/hello-foundry-local/src/app.py:1
After download_model(alias), the code sets model_id = model_info.id instead of using the id of the downloaded cached variant (similar to the summarize sample). If the download resolves to a different variant than model_info.id, load_model(model_id) may not load what was downloaded. Prefer using the returned value from download_model(alias) (if available) to set model_id, or re-query cached models and pick the cached variant id.
samples/python/hello-foundry-local/src/app.py:1
After download_model(alias), the code sets model_id = model_info.id instead of using the id of the downloaded cached variant (similar to the summarize sample). If the download resolves to a different variant than model_info.id, load_model(model_id) may not load what was downloaded. Prefer using the returned value from download_model(alias) (if available) to set model_id, or re-query cached models and pick the cached variant id.
samples/python/agent-framework/src/app/foundry_boot.py:1
The class docstring claims the bootstrapper “download → load”, but the implementation only constructs FoundryLocalManager(self.alias) and reads get_model_info. If the SDK constructor doesn’t guarantee download+load semantics in all environments, this is misleading and can cause hard-to-debug runtime failures. Either (a) make the bootstrapper explicitly cache-check/download/load (mirroring the cache-aware flows used elsewhere in the PR), or (b) update the docstring/comments to accurately describe what’s guaranteed here.
samples/python/agent-framework/src/app/foundry_boot.py:1
The class docstring claims the bootstrapper “download → load”, but the implementation only constructs FoundryLocalManager(self.alias) and reads get_model_info. If the SDK constructor doesn’t guarantee download+load semantics in all environments, this is misleading and can cause hard-to-debug runtime failures. Either (a) make the bootstrapper explicitly cache-check/download/load (mirroring the cache-aware flows used elsewhere in the PR), or (b) update the docstring/comments to accurately describe what’s guaranteed here.
samples/python/agent-framework/src/app/foundry_boot.py:1
The class docstring claims the bootstrapper “download → load”, but the implementation only constructs FoundryLocalManager(self.alias) and reads get_model_info. If the SDK constructor doesn’t guarantee download+load semantics in all environments, this is misleading and can cause hard-to-debug runtime failures. Either (a) make the bootstrapper explicitly cache-check/download/load (mirroring the cache-aware flows used elsewhere in the PR), or (b) update the docstring/comments to accurately describe what’s guaranteed here.
samples/js/local-cag/src/server.js:1
Unlike the local-rag server’s status SSE, the local-cag status SSE connections are never closed when initialization completes. If users refresh or open multiple tabs, these long-lived connections can accumulate unnecessarily. Consider ending and removing connected SSE clients once state.stage === "ready" (or likewise on terminal error) and clearing the set to avoid resource/connection pressure.
samples/python/agent-framework/src/app/web.py:1
The traceback import is unused in this module. Removing it helps keep the sample minimal and avoids implying stack traces are intended to be surfaced.
samples/python/agent-framework/src/app/tool_demo.py:1
asyncio, Console, and console aren’t used in this module. Removing unused imports/variables reduces noise and avoids suggesting output is routed through Rich when it isn’t.
samples/python/agent-framework/src/app/tool_demo.py:1
asyncio, Console, and console aren’t used in this module. Removing unused imports/variables reduces noise and avoids suggesting output is routed through Rich when it isn’t.
samples/python/agent-framework/src/app/tool_demo.py:1
asyncio, Console, and console aren’t used in this module. Removing unused imports/variables reduces noise and avoids suggesting output is routed through Rich when it isn’t.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

samples/cs/whisper-transcription/Program.cs

samples/cs/whisper-transcription/wwwroot/app.js

Copilot

Pull request overview

Copilot reviewed 94 out of 94 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-03-26T06:27:17Z

samples/js/local-rag/src/server.js

+const engine = new ChatEngine();
+
+// ── API: Chat (non-streaming) ──
+app.post("/api/chat", async (req, res) => {
+  try {
+    const { message, history, compact } = req.body;
+    if (!message || typeof message !== "string") {
+      return res.status(400).json({ error: "message is required" });
+    }
+
+    if (compact !== undefined) engine.setCompactMode(!!compact);
+
+    const result = await engine.query(
+      message,
+      Array.isArray(history) ? history : []
+    );
+    res.json(result);
+  } catch (err) {
+    console.error("[API] Error:", err.message);
+    res.status(500).json({ error: "Internal server error" });
+  }
+});


/api/chat, /api/chat/stream, /api/upload, and /api/docs can be called before engine.init() completes (the server starts listening first). In that window engine.getStore() is still null and engine.query*() will fail because the model/chatClient aren’t initialized, causing 500s or crashes. Add a requireReady middleware (similar to the local-cag sample) that returns 503 while engineReady is false, and apply it to all routes that depend on the initialized engine/store.

Copilot · 2026-03-26T06:27:18Z

samples/js/local-rag/src/chatEngine.js

+    // Load the model into memory
+    this._emitStatus("loading", `Loading ${this.modelAlias} into memory...`);
+    await this.model.load();
+
+    // Create the native chat client with performance settings pre-configured
+    this.chatClient = this.model.createChatClient();
+    this.chatClient.settings.temperature = 0.1; // Low for deterministic, safety-critical responses
+    this._emitStatus("ready", `Model ready: ${this.modelAlias}`);
+
+    // Open the local vector store
+    this.store = new VectorStore(config.dbPath);
+    const count = this.store.count();
+    this._emitStatus("ready", `Vector store ready: ${count} chunks indexed.`);
+


ChatEngine.init() emits status with phase: "ready" twice (once right after creating the chat client and again after opening the vector store). Because the server/UI treat phase === "ready" as “fully initialized” (and close the SSE stream / enable chat), the first emission can signal readiness before initialization is actually complete. Use a non-terminal phase for intermediate steps (e.g., model_ready, store_ready) and emit ready only once at the end of init().

Copilot · 2026-03-26T06:27:18Z

samples/python/agent-framework/src/app/foundry_boot.py

+        # Resolve alias to the actual model ID via the SDK's catalog API
+        model_info = manager.get_model_info(self.alias)
+        model_id = model_info.id if model_info else self.alias
+


FoundryLocalBootstrapper.bootstrap() silently falls back to using model_id=self.alias when get_model_info(self.alias) returns None. If the alias is wrong or missing from the catalog, this hides the configuration error and pushes the failure later into agent execution. Prefer failing fast here (e.g., raise a ValueError with a clear message) so misconfiguration is surfaced at startup.

Improve samples with cache-awareness, add 4 new samples, fix SDK vers…

b6c9e49

…ions, and prepare repo for public sharing

Copilot AI review requested due to automatic review settings March 23, 2026 23:37

Copilot started reviewing on behalf of leestott March 23, 2026 23:38 View session

Copilot AI reviewed Mar 23, 2026

View reviewed changes

leestott marked this pull request as draft March 24, 2026 21:38

leestott requested a review from Copilot March 24, 2026 23:42

Update

78206d0

Copilot started reviewing on behalf of leestott March 24, 2026 23:43 View session

leestott closed this Mar 24, 2026

leestott reopened this Mar 24, 2026

leestott marked this pull request as ready for review March 24, 2026 23:47

Copilot AI reviewed Mar 24, 2026

View reviewed changes

leestott requested a review from Copilot March 24, 2026 23:52

Copilot started reviewing on behalf of leestott March 24, 2026 23:55 View session

Copilot AI reviewed Mar 24, 2026

View reviewed changes

fix: address round-3 review issues — env vars, event loop, Cancellati…

acf06fc

…onToken, README accuracy

leestott requested a review from Copilot March 25, 2026 00:11

Copilot started reviewing on behalf of leestott March 25, 2026 00:12 View session

Copilot AI reviewed Mar 25, 2026

View reviewed changes

update

050fbed

leestott requested a review from Copilot March 25, 2026 03:06

Copilot started reviewing on behalf of leestott March 25, 2026 03:07 View session

Copilot AI reviewed Mar 25, 2026

View reviewed changes

samples/js/local-cag/README.md Outdated Show resolved Hide resolved

samples/js/tool-calling-foundry-local/src/app.js Show resolved Hide resolved

samples/python/agent-framework/src/app/web.py Outdated Show resolved Hide resolved

update

e373a2b

leestott requested a review from Copilot March 25, 2026 03:58

Copilot started reviewing on behalf of leestott March 25, 2026 03:59 View session

Copilot AI reviewed Mar 25, 2026

View reviewed changes

Update

10d78b3

update

26908ec

leestott requested a review from Copilot March 25, 2026 14:31

Copilot started reviewing on behalf of leestott March 25, 2026 14:32 View session

Copilot AI reviewed Mar 25, 2026

View reviewed changes

samples/python/functioncalling/fl_tools.ipynb Show resolved Hide resolved

samples/js/local-rag/src/chatEngine.js Outdated Show resolved Hide resolved

samples/js/local-cag/src/chatEngine.js Outdated Show resolved Hide resolved

Update

16cfcac

leestott requested a review from Copilot March 25, 2026 14:43

Copilot started reviewing on behalf of leestott March 25, 2026 14:44 View session

Copilot AI reviewed Mar 25, 2026

View reviewed changes

update

bf1b5ca

leestott requested a review from Copilot March 25, 2026 16:29

Copilot AI reviewed Mar 25, 2026

View reviewed changes

samples/cs/whisper-transcription/nuget.config Show resolved Hide resolved

samples/cs/whisper-transcription/WhisperTranscription.csproj Outdated Show resolved Hide resolved

leestott added 2 commits March 25, 2026 09:49

update

a60d8e0

update

0a82b17

leestott requested a review from Copilot March 25, 2026 17:01

Copilot started reviewing on behalf of leestott March 25, 2026 17:08 View session

Copilot AI reviewed Mar 25, 2026

View reviewed changes

samples/cs/whisper-transcription/Program.cs Outdated Show resolved Hide resolved

update

c6c1cab

leestott requested a review from Copilot March 25, 2026 17:18

Copilot AI reviewed Mar 25, 2026

View reviewed changes

samples/cs/whisper-transcription/Program.cs Show resolved Hide resolved

samples/cs/whisper-transcription/Program.cs Show resolved Hide resolved

samples/cs/whisper-transcription/wwwroot/app.js Outdated Show resolved Hide resolved

Copilot started reviewing on behalf of leestott March 25, 2026 17:28 View session

Copilot started reviewing on behalf of leestott March 25, 2026 17:34 View session

leestott added 2 commits March 25, 2026 23:09

Update

781c743

update

8e60d15

leestott requested a review from Copilot March 26, 2026 06:21

Copilot started reviewing on behalf of leestott March 26, 2026 06:22 View session

Copilot AI reviewed Mar 26, 2026

View reviewed changes

Conversation

leestott commented Mar 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

What's Changed

1. New Samples (4)

samples/js/local-cag/ — Context-Augmented Generation (12 files)

samples/js/local-rag/ — Retrieval-Augmented Generation (11 files)

samples/python/agent-framework/ — Microsoft Agent Framework Integration (24 files)

samples/cs/whisper-transcription/ — ASP.NET Core Whisper Transcription (13 files)

2. Cache-Awareness Improvements (All Existing Samples)

3. SDK API Correctness Fixes (7 files)

4. Review Feedback Fixes — Round 2 (3 files)

5. Review Feedback Fixes — Round 3 (7 files)

6. SDK Version Fixes

7. Repo Hygiene

Validation Performed

SDK Version Matrix (Current State)

Notes

Uh oh!

vercel bot commented Mar 23, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

leestott commented Mar 23, 2026 •

edited

Loading

`samples/js/local-cag/` — Context-Augmented Generation (12 files)

`samples/js/local-rag/` — Retrieval-Augmented Generation (11 files)

`samples/python/agent-framework/` — Microsoft Agent Framework Integration (24 files)

`samples/cs/whisper-transcription/` — ASP.NET Core Whisper Transcription (13 files)