Performance: Optimize energy aggregation to O(1) running sum in AudioSegmentProcessor by ysdede · Pull Request #247 · ysdede/keet

ysdede · 2026-04-11T16:34:15Z

What changed

Replaced speechEnergies: number[] and silenceEnergies: number[] inside ProcessorState with O(1) primitives: speechEnergySum, speechEnergyCount, silenceEnergySum, and silenceEnergyCount. The reduce operation previously used to calculate averages has been replaced with a simple mathematical division (sum / count).

Why it was needed

During long speech or silence segments, the arrays grew unbounded, causing unnecessary garbage collection pressure and triggering O(N) array iteration to calculate energy averages via reduce. During streaming high-frequency processing in AudioSegmentProcessor, maintaining large arrays of primitives severely penalized throughput and memory usage.

Impact

In synthetic benchmarks, continuous chunk processing throughput dramatically improved, preventing out-of-memory overhead or massive performance loss via GC pauses. Memory churn from continuous chunk handling in the VAD algorithm was entirely eliminated for this specific hot path.

How to verify

Run bun test src/lib/audio/AudioSegmentProcessor.test.ts to ensure no changes were introduced to detection accuracy or logic. Run standard profiling on the app when processing audio, and the specific .reduce() call and underlying float array allocations in AudioSegmentProcessor will no longer appear in DevTools traces.

PR created automatically by Jules for task 976545635379442013 started by @ysdede

Summary by Sourcery

Optimize audio segment energy tracking to use O(1) running sums instead of accumulating per-frame energies in arrays.

Enhancements:

Replace per-chunk speech and silence energy arrays with running sum and count fields to compute averages without allocations or O(N) reductions.
Initialize and reset processor state to use scalar accumulators for speech and silence energy statistics.
Document the learning and action around replacing array-based accumulation with running sums in high-frequency audio streams in the project notes.

Summary by CodeRabbit

Performance Improvements

Optimized audio segment analysis within the voice activity detection system to reduce memory overhead and improve efficiency. Updated internal accumulation mechanisms now minimize garbage collection pressure, decrease system resource consumption, and enhance responsiveness when processing real-time audio streams.

…SegmentProcessor

google-labs-jules · 2026-04-11T16:34:17Z

👋 Jules, reporting for duty! I'm here to lend a hand with this pull request.

When you start a review, I'll add a 👀 emoji to each comment to let you know I've read it. I'll focus on feedback directed at me and will do my best to stay out of conversations between you and other bots or reviewers to keep the noise down.

I'll push a commit with your requested changes shortly after. Please note there might be a delay between these steps, but rest assured I'm on the job!

For more direct control, you can switch me to Reactive Mode. When this mode is on, I will only act on comments where you specifically mention me with @jules. You can find this option in the Pull Request section of your global Jules UI settings. You can always switch back!

New to Jules? Learn more at jules.google/docs.

For security, I will only act on instructions from the user who triggered this task.

qodo-code-review · 2026-04-11T16:34:29Z

Review Summary by Qodo

Optimize energy aggregation to O(1) running sum in AudioSegmentProcessor

✨ Enhancement

Walkthroughs

Description

• Replace energy arrays with O(1) running sum accumulators
• Eliminate unbounded array growth in speech/silence tracking
• Remove O(N) reduce operations for average calculations
• Reduce garbage collection pressure during streaming audio processing

Diagram

flowchart LR
  A["Energy Arrays<br/>speechEnergies[]<br/>silenceEnergies[]"] -->|"Replace with"| B["Running Accumulators<br/>sum + count"]
  B -->|"Eliminates"| C["Array Growth<br/>GC Pressure"]
  B -->|"Replaces"| D["reduce() O(N)<br/>Iteration"]
  C -->|"Improves"| E["Throughput &<br/>Memory Usage"]
  D -->|"Improves"| E

File Changes

1. src/lib/audio/AudioSegmentProcessor.ts ✨ Enhancement +22/-12

Replace energy arrays with running sum accumulators

• Replaced speechEnergies: number[] and silenceEnergies: number[] with speechEnergySum,
 speechEnergyCount, silenceEnergySum, and silenceEnergyCount primitives in ProcessorState
 interface
• Updated all energy accumulation points to use addition and increment instead of array push
 operations
• Changed average energy calculation from .reduce() to simple division (sum / count)
• Updated startSpeech() and startSilence() methods to initialize accumulators instead of arrays
• Updated reset() method to initialize accumulators to 0

src/lib/audio/AudioSegmentProcessor.ts

2. .jules/bolt.md 📝 Documentation +3/-0

Document energy accumulation optimization pattern

• Added learning note about array accumulation overhead in high-frequency audio streams
• Documented the pattern of using O(1) running accumulators instead of arrays for stream average
 calculations
• Captured the performance optimization insight for future reference

.jules/bolt.md

coderabbitai · 2026-04-11T16:34:29Z

📝 Walkthrough

Walkthrough

The PR optimizes Voice Activity Detection's audio processing by replacing array-based energy accumulation with running sum/count accumulators in AudioSegmentProcessor, reducing garbage collection overhead and eliminating redundant array iterations during averaging calculations.

Changes

Cohort / File(s)	Summary
Documentation `.jules/bolt.md`	Added dated note documenting VAD hot-path performance issue: replacing array accumulation (`push` + `reduce`) with O(1) running accumulators (`sum`/`count` variables).
Audio Processing Optimization `src/lib/audio/AudioSegmentProcessor.ts`	Refactored `ProcessorState` to replace `speechEnergies` and `silenceEnergies` arrays with running sum/count accumulators (`speechEnergySum`, `speechEnergyCount`, `silenceEnergySum`, `silenceEnergyCount`). Updated energy calculations, initialization, and reset logic accordingly.

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~12 minutes

Possibly related PRs

Performance: Optimize AudioSegmentProcessor allocations #184: Also modifies AudioSegmentProcessor.ts to reduce allocations in the hot audio-processing path through similar accumulator-based optimizations and stats object mutations.
refactor(audio): harden speech stats cache safety #173: Edits AudioSegmentProcessor's internal stats and averaging logic to eliminate array allocations and restructure summary computation patterns.

Poem

🐰 Arrays were piling up, frame by frame, so slow,
But running sums and counts now steal the show,
No garbage, no waste, just O(1) cheer,
The VAD hot-path sings a faster song, crystal clear! 🎵

🚥 Pre-merge checks | ✅ 3

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title 'Performance: Optimize energy aggregation to O(1) running sum in AudioSegmentProcessor' directly and specifically matches the main change: replacing energy arrays with O(1) running accumulators in AudioSegmentProcessor to improve performance.
Docstring Coverage	✅ Passed	No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

📝 Generate docstrings

Create stacked PR
Commit on current branch

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch perf-vad-energy-accumulator-976545635379442013

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

qodo-code-review · 2026-04-11T16:34:30Z

Code Review by Qodo

🐞 Bugs (0) 📘 Rule violations (0) 📎 Requirement gaps (0)

Great, no issues found!

Qodo reviewed your code and found no material issues that require review

ⓘ The new review experience is currently in Beta. Learn more

sourcery-ai

Hey - I've reviewed your changes and they look great!

Sourcery is free for open source - if you like our reviews please consider sharing them ✨

_{Help me be more useful! Please click 👍 or 👎 on each comment and I'll use the feedback to improve your reviews.}

gemini-code-assist

Code Review

This pull request optimizes the AudioSegmentProcessor by replacing array-based energy accumulation with running sums and counts, which reduces garbage collection overhead and improves performance in high-frequency audio processing paths. Review feedback identifies that the silenceEnergySum and silenceEnergyCount variables are currently dead code and should be removed. Additionally, a logic error was found where energy values are double-counted during segment splits, leading to inaccurate average energy calculations.

gemini-code-assist · 2026-04-11T16:36:36Z

+    silenceEnergySum: number;
+    silenceEnergyCount: number;


The silenceEnergySum and silenceEnergyCount properties appear to be dead code. While they are correctly updated and reset throughout the class, they are never read or used to calculate any statistics (unlike their speech counterparts). Since this is a performance-critical hot path, removing these unused accumulators would reduce unnecessary operations and simplify the state object.

gemini-code-assist · 2026-04-11T16:36:36Z

+                this.state.speechEnergySum += energy;
+                this.state.speechEnergyCount++;


A logic error causes the current chunk's energy to be double-counted when a proactive segment split occurs. When a split is triggered (around line 211), startSpeech is called, which initializes speechEnergySum with the current energy and sets speechEnergyCount to 1. However, the execution then continues to this else block, where the same energy is added again and the count is incremented to 2. This results in an incorrect average energy calculation for the first part of the split segment. Adding a check to ensure the energy is only added if it wasn't already initialized in the same frame fixes this.

References

Ensure that logic transitions do not lead to double-counting or redundant state updates in the same processing frame.

coderabbitai

Actionable comments posted: 1

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: 788d7812-8dfb-43b3-a240-0aabd669f89a

📥 Commits

Reviewing files that changed from the base of the PR and between 474dbe6 and f3c27c8.

📒 Files selected for processing (2)

.jules/bolt.md
src/lib/audio/AudioSegmentProcessor.ts

coderabbitai · 2026-04-11T16:37:38Z

+                this.state.speechEnergySum += energy;
+                this.state.speechEnergyCount++;


⚠️ Potential issue | 🟡 Minor

Avoid split-frame speech energy double counting.

When proactive splitting happens, startSpeech(currentTime, energy) seeds the new segment with the current chunk, and the same chunk is then added again in the generic in-speech continuation path. This inflates avgEnergy/energyIntegral for split segments.

💡 Proposed fix (track whether current frame was already seeded)

@@ - const segments: ProcessedSegment[] = []; + const segments: ProcessedSegment[] = []; + let seededSpeechEnergyThisFrame = false; @@ // Start new segment immediately this.startSpeech(currentTime, energy); + seededSpeechEnergyThisFrame = true; @@ } else { // Continue in current state if (this.state.inSpeech) { - this.state.speechEnergySum += energy; - this.state.speechEnergyCount++; + if (!seededSpeechEnergyThisFrame) { + this.state.speechEnergySum += energy; + this.state.speechEnergyCount++; + } } else { this.state.silenceEnergySum += energy; this.state.silenceEnergyCount++; } }

Also applies to: 341-342

Performance: Optimize energy aggregation to O(1) running sum in Audio…

f3c27c8

…SegmentProcessor

sourcery-ai bot reviewed Apr 11, 2026

View reviewed changes

gemini-code-assist bot reviewed Apr 11, 2026

View reviewed changes

coderabbitai bot reviewed Apr 11, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Performance: Optimize energy aggregation to O(1) running sum in AudioSegmentProcessor#247

Performance: Optimize energy aggregation to O(1) running sum in AudioSegmentProcessor#247
ysdede wants to merge 1 commit intomasterfrom
perf-vad-energy-accumulator-976545635379442013

ysdede commented Apr 11, 2026 •

edited by coderabbitai bot

Loading

Uh oh!

google-labs-jules bot commented Apr 11, 2026

Uh oh!

qodo-code-review bot commented Apr 11, 2026

Uh oh!

coderabbitai bot commented Apr 11, 2026 •

edited

Loading

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Poem

Uh oh!

qodo-code-review bot commented Apr 11, 2026 •

edited

Loading

Uh oh!

sourcery-ai bot left a comment

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Apr 11, 2026

Uh oh!

gemini-code-assist bot Apr 11, 2026

Uh oh!

coderabbitai bot left a comment

Uh oh!

coderabbitai bot Apr 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

		this.state.speechEnergySum += energy;
		this.state.speechEnergyCount++;

Conversation

ysdede commented Apr 11, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changed

Why it was needed

Impact

How to verify

Summary by Sourcery

Summary by CodeRabbit

Uh oh!

google-labs-jules bot commented Apr 11, 2026

Uh oh!

qodo-code-review bot commented Apr 11, 2026

Review Summary by Qodo

Walkthroughs

File Changes

Uh oh!

coderabbitai bot commented Apr 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Poem

Uh oh!

qodo-code-review bot commented Apr 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Code Review by Qodo

Great, no issues found!

Uh oh!

sourcery-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Apr 11, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Apr 11, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Apr 11, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

ysdede commented Apr 11, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Apr 11, 2026 •

edited

Loading

qodo-code-review bot commented Apr 11, 2026 •

edited

Loading