Performance: Zero-allocation metrics in AudioSegmentProcessor by ysdede · Pull Request #244 · ysdede/keet

ysdede · 2026-04-08T16:45:10Z

What changed

Added an optional out parameter to AudioSegmentProcessor.getStats() and getStateInfo(). If provided, the methods will mutate the passed object instead of allocating a new one. AudioEngine was updated to initialize and cache these objects, passing them down in the handleAudioChunk hot path.

Why it was needed (bottleneck evidence)

Profiling revealed that AudioSegmentProcessor.getStats() and getStateInfo() were allocating 3 and 1 objects respectively per call. Since these are called for every single audio chunk in AudioEngine.handleAudioChunk (multiple times per second), this generated significant unnecessary GC churn. Our script measuring 1,000,000 calls showed ~120MB of heap allocation and ~146ms overhead.

Impact

Zero allocations for metrics querying during handleAudioChunk execution. The heap allocation delta drops from several MBs per stream session down to 0 bytes for this specific codepath.

How to verify

Run bun test src/lib/audio/AudioSegmentProcessor.test.ts to ensure VAD behavior is preserved.
The logic can be proven with a profiler measuring heapUsed around a high frequency loop of handleAudioChunk (or getStats(out)) compared to a baseline.

PR created automatically by Jules for task 17590775678211104803 started by @ysdede

Summary by Sourcery

Introduce optional zero-allocation metric queries in audio processing hot paths to reduce GC churn and heap allocations.

New Features:

Add optional output parameter to AudioSegmentProcessor.getStats() and getStateInfo() to allow reuse of caller-provided objects for metrics and state snapshots.

Enhancements:

Cache reusable stats and state info objects in AudioEngine and use them in handleAudioChunk and related methods to avoid per-call allocations during audio processing.

Summary by CodeRabbit

Refactor
- Optimized audio processing performance through object caching and efficient memory allocation in statistics and state retrieval operations.

AudioSegmentProcessor.getStats() and getStateInfo() were allocating 3 and 1 objects respectively per call. These are called every 80ms chunk in AudioEngine.handleAudioChunk, causing unnecessary high-frequency GC churn (allocating ~120MB per million calls). This patch modifies the methods to accept an optional pre-allocated `out` object and adds cached objects to AudioEngine to eliminate allocations entirely in the hot path.

google-labs-jules · 2026-04-08T16:45:11Z

👋 Jules, reporting for duty! I'm here to lend a hand with this pull request.

When you start a review, I'll add a 👀 emoji to each comment to let you know I've read it. I'll focus on feedback directed at me and will do my best to stay out of conversations between you and other bots or reviewers to keep the noise down.

I'll push a commit with your requested changes shortly after. Please note there might be a delay between these steps, but rest assured I'm on the job!

For more direct control, you can switch me to Reactive Mode. When this mode is on, I will only act on comments where you specifically mention me with @jules. You can find this option in the Pull Request section of your global Jules UI settings. You can always switch back!

New to Jules? Learn more at jules.google/docs.

For security, I will only act on instructions from the user who triggered this task.

qodo-code-review · 2026-04-08T16:45:25Z

Review Summary by Qodo

Zero-allocation metrics in AudioSegmentProcessor

✨ Enhancement

Walkthroughs

Description

• Implement optional out parameter in AudioSegmentProcessor.getStats() and getStateInfo() to
  enable object reuse
• Cache metric objects in AudioEngine to eliminate allocations in hot path
• Pass cached objects through all metric query calls in handleAudioChunk and related methods
• Reduce heap allocation from ~120MB per million calls to zero for metrics querying

Diagram

flowchart LR
  A["AudioSegmentProcessor<br/>getStats/getStateInfo"] -->|"optional out param"| B["Mutate passed object<br/>instead of allocating"]
  C["AudioEngine<br/>constructor"] -->|"initialize once"| D["cachedStats<br/>cachedStateInfo"]
  D -->|"reuse in hot path"| E["handleAudioChunk<br/>getSignalMetrics<br/>isSpeechActive"]
  E -->|"zero allocations"| F["Reduced GC churn"]

File Changes

1. src/lib/audio/AudioEngine.ts ✨ Enhancement +12/-5

Cache and reuse metric objects in AudioEngine

• Added two cached object fields (cachedStats and cachedStateInfo) for zero-allocation metric
 queries
• Initialize cached objects once in constructor
• Updated all metric query calls to pass cached objects: getStats(this.cachedStats) and
 getStateInfo(this.cachedStateInfo)
• Modified calls in getSignalMetrics(), isSpeechActive(), handleAudioChunk(), and
 getSegmentsForVisualization()

src/lib/audio/AudioEngine.ts

2. src/lib/audio/AudioSegmentProcessor.ts ✨ Enhancement +30/-2

Add optional out parameter for object reuse

• Added optional out parameter to getStats() method to accept pre-allocated object for mutation
• Implement object reuse logic: mutate all fields in out object if provided, otherwise create new
 object
• Added optional out parameter to getStateInfo() method with similar mutation pattern
• Preserve backward compatibility by returning new objects when out is not provided

src/lib/audio/AudioSegmentProcessor.ts

coderabbitai · 2026-04-08T16:45:26Z

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: f2e35dc3-7fee-4666-9bda-4fa427a44516

📥 Commits

Reviewing files that changed from the base of the PR and between 474dbe6 and 6d865f3.

📒 Files selected for processing (2)

src/lib/audio/AudioEngine.ts
src/lib/audio/AudioSegmentProcessor.ts

📝 Walkthrough

Walkthrough

AudioEngine now caches stats and state objects returned from AudioSegmentProcessor methods, passing these cached objects as optional parameters to subsequent method calls instead of repeatedly allocating new ones. AudioSegmentProcessor methods updated to accept optional output parameters for in-place mutation while maintaining backward compatibility.

Changes

Cohort / File(s)	Summary
AudioSegmentProcessor Updates `src/lib/audio/AudioSegmentProcessor.ts`	Added optional `out` parameters to `getStats()` and `getStateInfo()` methods to support in-place mutation of pre-allocated objects, reducing allocations when callers provide cached instances.
AudioEngine Caching `src/lib/audio/AudioEngine.ts`	Initialized cached stats and state objects in constructor and reused them across multiple calls to `getStats(cachedStats)` and `getStateInfo(cachedStateInfo)` in `handleAudioChunk()`, `getSignalMetrics()`, `isSpeechActive()`, and `getSegmentsForVisualization()`.

Possibly related PRs

refactor(audio): harden speech stats cache safety #173: Modifies AudioSegmentProcessor's internal speech-stats caching and invalidation logic, touching the same class being extended with out-parameters in this PR.
Performance: Optimize AudioSegmentProcessor allocations #184: Directly enables allocation-free reuse by adding optional out-parameters to AudioSegmentProcessor's getStats/getStateInfo methods that this PR depends on.

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~10 minutes

Poem

🐰 Hop, hop! No more alloc today,
Cached objects in the fray,
Reuse with care, mutations in place,
Memory whispers—a joyful race! 🎵

🚥 Pre-merge checks | ✅ 3

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title accurately summarizes the main change: implementing zero-allocation metrics through optional output parameters in AudioSegmentProcessor, addressing the core performance optimization objective.
Docstring Coverage	✅ Passed	No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

📝 Generate docstrings

Create stacked PR
Commit on current branch

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch perf/audio-processor-zero-alloc-17590775678211104803

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

qodo-code-review · 2026-04-08T16:45:27Z

Code Review by Qodo

🐞 Bugs (2) 📘 Rule violations (0) 📎 Requirement gaps (0) 🎨 UX Issues (0)

🐞\ ⚙ Maintainability (2)

1. Stats out branch drifts 🐞 ⚙

Description

AudioSegmentProcessor.getStats(out) manually copies fields into out, while the no-out branch
uses a spread; if CurrentStats gains fields later, only one branch will update them and callers
using out can observe stale/missing values.

Code

src/lib/audio/AudioSegmentProcessor.ts[R529-552]

+    getStats(out?: CurrentStats): CurrentStats {
        const stats = this.state.currentStats;
+        if (out) {
+            out.noiseFloor = stats.noiseFloor;
+            out.snr = stats.snr;
+            out.snrThreshold = stats.snrThreshold;
+            out.minSnrThreshold = stats.minSnrThreshold;
+            out.energyRiseThreshold = stats.energyRiseThreshold;
+
+            if (!out.silence) out.silence = { avgDuration: 0, avgEnergy: 0, avgEnergyIntegral: 0 };
+            out.silence.avgDuration = stats.silence.avgDuration;
+            out.silence.avgEnergy = stats.silence.avgEnergy;
+            out.silence.avgEnergyIntegral = stats.silence.avgEnergyIntegral;
+
+            if (!out.speech) out.speech = { avgDuration: 0, avgEnergy: 0, avgEnergyIntegral: 0 };
+            out.speech.avgDuration = stats.speech.avgDuration;
+            out.speech.avgEnergy = stats.speech.avgEnergy;
+            out.speech.avgEnergyIntegral = stats.speech.avgEnergyIntegral;
+            return out;
+        }
+
        return {
            ...stats,
            silence: { ...stats.silence },

Evidence

CurrentStats is a structured type (including nested silence/speech summaries). The new out
code path enumerates fields manually, but the allocation path spreads ...stats, so the two
branches can easily diverge when CurrentStats changes, producing inconsistent snapshots depending
on whether callers pass out.

src/lib/audio/AudioSegmentProcessor.ts[40-48]
src/lib/audio/AudioSegmentProcessor.ts[529-555]

Agent prompt

The issue below was found during a code review. Follow the provided context and guidance below and implement a solution

### Issue description
`getStats(out)` has two different implementations (manual field copy vs object spread). This is brittle: if `CurrentStats` is extended or refactored, the `out` path can silently stop updating new fields while the allocation path still includes them.

### Issue Context
This PR intentionally adds an `out` parameter to reduce allocations in the hot path; we can keep that benefit while removing branch drift risk.

### Fix Focus Areas
- src/lib/audio/AudioSegmentProcessor.ts[525-555]

### Suggested approach
- Introduce a small internal helper like `copyCurrentStats(src, dst)` that copies *all* scalar fields and updates nested `silence`/`speech` summaries.
- Use that helper in both branches (and optionally in any other place that needs a consistent snapshot), so future `CurrentStats` changes require updating one place.

ⓘ Copy this prompt and use it to remediate the issue with your preferred AI generation tools

2. StateInfo type duplicated 🐞 ⚙

Description

getStateInfo repeats the same object shape in both the out parameter and return type, making it
easy for the two to drift when fields change.

Code

src/lib/audio/AudioSegmentProcessor.ts[R561-571]

+    getStateInfo(out?: { inSpeech: boolean; noiseFloor: number; snr: number; speechStartTime: number | null }): { inSpeech: boolean; noiseFloor: number; snr: number; speechStartTime: number | null } {
+        if (out) {
+            out.inSpeech = this.state.inSpeech;
+            out.noiseFloor = this.state.noiseFloor;
+            out.snr = this.state.currentStats.snr;
+            out.speechStartTime = this.state.speechStartTime;
+            return out;
+        }
        return {
            inSpeech: this.state.inSpeech,
            noiseFloor: this.state.noiseFloor,

Evidence

The method signature now embeds the same structural type twice, increasing the chance of future
edits updating one occurrence but not the other.

src/lib/audio/AudioSegmentProcessor.ts[557-575]

Agent prompt

The issue below was found during a code review. Follow the provided context and guidance below and implement a solution

### Issue description
`getStateInfo` duplicates the state info object type in multiple positions (parameter and return). This is error-prone during future maintenance.

### Issue Context
The PR adds `out` for zero-allocation. We can keep the API while defining a single shared type alias.

### Fix Focus Areas
- src/lib/audio/AudioSegmentProcessor.ts[557-575]

### Suggested approach
- Define a `type StateInfo = { inSpeech: boolean; noiseFloor: number; snr: number; speechStartTime: number | null }` near the method (or near other interfaces).
- Update the signature to `getStateInfo(out?: StateInfo): StateInfo`.

ⓘ Copy this prompt and use it to remediate the issue with your preferred AI generation tools

ⓘ The new review experience is currently in Beta. Learn more

sourcery-ai

Hey - I've left some high level feedback:

Consider extracting the getStateInfo return type into a named interface/type alias so the method signature and cachedStateInfo declaration don’t duplicate the same inline object shape in multiple places, which will make future changes easier and less error-prone.
The getStats(out) implementation conditionally allocates nested silence/speech objects on first use; if you want strictly zero allocations in the hot path, you could initialize these sub-objects once when creating cachedStats instead of lazily inside getStats.
Given that getStats/getStateInfo now have mutating behavior when out is provided, you might want to document (or enforce via typing) that the same out object should not be shared across concurrent calls to avoid subtle data races in multi-consumer scenarios.

Prompt for AI Agents

Please address the comments from this code review:

## Overall Comments
- Consider extracting the `getStateInfo` return type into a named interface/type alias so the method signature and `cachedStateInfo` declaration don’t duplicate the same inline object shape in multiple places, which will make future changes easier and less error-prone.
- The `getStats(out)` implementation conditionally allocates nested `silence`/`speech` objects on first use; if you want strictly zero allocations in the hot path, you could initialize these sub-objects once when creating `cachedStats` instead of lazily inside `getStats`.
- Given that `getStats`/`getStateInfo` now have mutating behavior when `out` is provided, you might want to document (or enforce via typing) that the same `out` object should not be shared across concurrent calls to avoid subtle data races in multi-consumer scenarios.

Sourcery is free for open source - if you like our reviews please consider sharing them ✨

_{Help me be more useful! Please click 👍 or 👎 on each comment and I'll use the feedback to improve your reviews.}

gemini-code-assist

Code Review

This pull request introduces zero-allocation metric queries in the AudioEngine by allowing the AudioSegmentProcessor to accept optional pre-allocated objects for stats and state information. This reduces garbage collection overhead during frequent audio processing updates. The review suggests using Object.assign for cleaner property copying in the processor and recommends defining a named interface for the StateInfo type to improve maintainability and readability.

gemini-code-assist · 2026-04-08T16:49:14Z

+            out.silence.avgDuration = stats.silence.avgDuration;
+            out.silence.avgEnergy = stats.silence.avgEnergy;
+            out.silence.avgEnergyIntegral = stats.silence.avgEnergyIntegral;


To make this code more concise and easier to maintain if the StatsSummary interface changes, you could use Object.assign to copy the properties.

Suggested change

out.silence.avgDuration = stats.silence.avgDuration;

out.silence.avgEnergy = stats.silence.avgEnergy;

out.silence.avgEnergyIntegral = stats.silence.avgEnergyIntegral;

Object.assign(out.silence, stats.silence);

gemini-code-assist · 2026-04-08T16:49:14Z

+            out.speech.avgDuration = stats.speech.avgDuration;
+            out.speech.avgEnergy = stats.speech.avgEnergy;
+            out.speech.avgEnergyIntegral = stats.speech.avgEnergyIntegral;


Similarly here, Object.assign can make the code more concise and maintainable.

Suggested change

out.speech.avgDuration = stats.speech.avgDuration;

out.speech.avgEnergy = stats.speech.avgEnergy;

out.speech.avgEnergyIntegral = stats.speech.avgEnergyIntegral;

Object.assign(out.speech, stats.speech);

sourcery-ai bot reviewed Apr 8, 2026

View reviewed changes

gemini-code-assist bot reviewed Apr 8, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Performance: Zero-allocation metrics in AudioSegmentProcessor#244

Performance: Zero-allocation metrics in AudioSegmentProcessor#244
ysdede wants to merge 1 commit intomasterfrom
perf/audio-processor-zero-alloc-17590775678211104803

ysdede commented Apr 8, 2026 •

edited by coderabbitai bot

Loading

Uh oh!

google-labs-jules bot commented Apr 8, 2026

Uh oh!

qodo-code-review bot commented Apr 8, 2026

Uh oh!

coderabbitai bot commented Apr 8, 2026 •

edited

Loading

Walkthrough

Changes

Possibly related PRs

Estimated code review effort

Poem

Uh oh!

qodo-code-review bot commented Apr 8, 2026 •

edited

Loading

Uh oh!

sourcery-ai bot left a comment

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Apr 8, 2026

Uh oh!

gemini-code-assist bot Apr 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ysdede commented Apr 8, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changed

Why it was needed (bottleneck evidence)

Impact

How to verify

Summary by Sourcery

Summary by CodeRabbit

Uh oh!

google-labs-jules bot commented Apr 8, 2026

Uh oh!

qodo-code-review bot commented Apr 8, 2026

Review Summary by Qodo

Walkthroughs

File Changes

Uh oh!

coderabbitai bot commented Apr 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Possibly related PRs

Estimated code review effort

Poem

Uh oh!

qodo-code-review bot commented Apr 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Code Review by Qodo

Uh oh!

sourcery-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Apr 8, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Apr 8, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

ysdede commented Apr 8, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Apr 8, 2026 •

edited

Loading

qodo-code-review bot commented Apr 8, 2026 •

edited

Loading