
Add PortConfigLoader for on-demand GPU tracing via TCP#1280

Open
sanrise wants to merge 1 commit into pytorch:main from sanrise:export-D91278956

@sanrise (Contributor) commented Mar 3, 2026

Summary:
This implements Phase 1.1 of the lightweight on-demand GPU tracing system for Kubernetes environments. It provides an alternative to DaemonConfigLoader that receives trace configs via a TCP port instead of IPC Fabric, enabling GPU tracing in K8s pods without requiring a full Dynolog daemon.

The PortConfigLoader implements the IDaemonConfigLoader interface with a simple JSON protocol (PING/TRACE commands) and includes a socket abstraction layer (ISocket) for testability. Unit tests use MockSocket to enable deterministic testing without binding to real ports.
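
To illustrate the testing approach described above, here is a minimal sketch of how a socket abstraction lets a loader be exercised without binding a port. Everything in it is an assumption made for illustration: the method names on `ISocket` and `MockSocket`, the `SketchPortLoader` class, the `"cmd"` field, and the response bodies are hypothetical and are not taken from the diff. The substring matching stands in for real JSON parsing.

```cpp
#include <deque>
#include <optional>
#include <string>
#include <utility>
#include <vector>

// Hypothetical socket abstraction; the real ISocket in the PR may differ.
class ISocket {
 public:
  virtual ~ISocket() = default;
  // Returns the next request, or nullopt when nothing is pending.
  virtual std::optional<std::string> receive() = 0;
  virtual void send(const std::string& response) = 0;
};

// Test double: replays scripted requests and records every response.
class MockSocket : public ISocket {
 public:
  void enqueue(std::string request) {
    incoming_.push_back(std::move(request));
  }
  std::optional<std::string> receive() override {
    if (incoming_.empty()) {
      return std::nullopt;
    }
    std::string request = std::move(incoming_.front());
    incoming_.pop_front();
    return request;
  }
  void send(const std::string& response) override {
    sent_.push_back(response);
  }
  const std::vector<std::string>& sent() const {
    return sent_;
  }

 private:
  std::deque<std::string> incoming_;
  std::vector<std::string> sent_;
};

// Hypothetical stand-in for the loader: answers PING, stores TRACE requests.
class SketchPortLoader {
 public:
  explicit SketchPortLoader(ISocket& socket) : socket_(socket) {}

  // Handles at most one pending request; returns true if one was handled.
  bool pollOnce() {
    std::optional<std::string> request = socket_.receive();
    if (!request) {
      return false;
    }
    // Naive substring checks, used here only to keep the sketch dependency-free.
    if (request->find(R"("PING")") != std::string::npos) {
      socket_.send(R"({"status":"ok"})");
    } else if (request->find(R"("TRACE")") != std::string::npos) {
      pendingConfig_ = *request;
      socket_.send(R"({"status":"accepted"})");
    } else {
      socket_.send(R"({"status":"error"})");
    }
    return true;
  }

  // Returns the stored TRACE request and clears it, so it is delivered once.
  std::string takeConfig() {
    std::string config = std::move(pendingConfig_);
    pendingConfig_.clear();
    return config;
  }

 private:
  ISocket& socket_;
  std::string pendingConfig_;
};
```

A test under these assumptions would enqueue a request on a `MockSocket`, call `pollOnce()`, and then inspect `sent()` and `takeConfig()`. Because the mock replays a fixed script, the outcome does not depend on port availability or network timing.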

Differential Revision: D91278956

@meta-cla bot added the "cla signed" label Mar 3, 2026
@meta-codesync bot commented Mar 3, 2026

@sanrise has exported this pull request. If you are a Meta employee, you can view the originating Diff in D91278956.

@sanrise force-pushed the export-D91278956 branch from cc24c80 to abba53d on March 3, 2026 02:17