[RFC][lldb][NVGPU] introducing "shadow functions" to cuda-lldb by zhyty · Pull Request #94 · clayborg/llvm-project

zhyty · 2026-04-09T00:28:16Z

Summary

By default, disable breakpoint locations in host-side kernel wrapper functions on the CPU side once an associated NVIDIA GPU target exists. This is targeting CUDA programs where source-level breakpoints can otherwise resolve to the host launch wrapper instead of the actual device kernel.

These locations are not deleted. LLDB still creates them, but disables them by default so users can explicitly re-enable them if they really want to stop in the host launch path.

What are "shadow functions"?

When nvcc compiles CUDA source, a __global__ kernel typically has host-side launch machinery associated with it. In practice, source breakpoints may resolve to that host wrapper path because the host binary can contain symbol and line-table information for it.

Using shadow_functions.cu as an example:

my_kernel(int) is the host-visible wrapper entry point.
That wrapper quickly transitions into a __device_stub_ helper such as __device_stub__Z9my_kerneli(int), which performs the host-side CUDA launch boilerplate.
The actual GPU instructions for the kernel are not in the host .text for my_kernel(int) or __device_stub__...; they live in device code embedded in the binary.

From the user's point of view, my_kernel is "the kernel". From the host CPU symbol table's point of view, it is a wrapper around launch machinery. This PR treats those CPU-side wrapper locations as "shadow functions" and disables host breakpoint locations there when a GPU target is present.

How does this PR identify shadow functions?

The implementation no longer precomputes wrapper address ranges or maintains interval maps. Instead, it answers the question at breakpoint-location handling time using the owning symbol context and the module's indexed symbol lookup.

For a native breakpoint location:

Resolve a SymbolContext for the location using function and symbol scope.
Prefer the concrete owning function name, falling back to the owning symbol name when needed.
Construct the expected device-stub base name by prefixing that name with __device_stub_.
Query the same module with Module::FindFunctionSymbols(..., eFunctionNameTypeBase, ...).

If the module has a matching __device_stub_ function symbol, LLDB treats the native location as a host-side shadow wrapper and disables that native breakpoint location.

This matches the important property we care about: a CPU-side wrapper is identified by the presence of the corresponding CUDA device-stub symbol in the same module, and the lookup uses the symbol table index instead of scanning pre-recorded address ranges.

Plugging Into LLDB's Lifecycle

There are two integration points:

When a GPU plugin target is associated with an existing native target, Target::SetGPUPluginTarget walks the native target's current breakpoint locations and asks the GPU platform to inspect each one. This handles breakpoints that already existed before the GPU target was created.
When a new native breakpoint location is created later, BreakpointLocationList::AddLocation checks associated GPU plugin targets and lets each platform decide whether that location should be disabled.

The platform hook used by both paths is Platform::HandleNativeBreakpointLocation. In the NVIDIA implementation, PlatformNVGPU::HandleNativeBreakpointLocation resolves the symbol context for the location, checks whether it is a shadow function, and disables it if so.

Why this design?

This version is simpler than the earlier interval-map approach:

No module-level shadow-function bookkeeping.
No need to track wrapper address ranges.
No cleanup problem for stale interval-map entries on unload.
The lookup uses existing symbol table indexing for function-name searches.

It also keeps the user-visible behavior we want: the host breakpoint location still exists and is visible in LLDB, but it is disabled by default once the GPU target provides a better device-side interpretation.

Test Plan

lldb-dotest is unreliable in this setup, so I used llvm-lit directly:

$BUILD_DIR/bin/llvm-lit -v --test-output=all \
  $LLVM_PROJECT_ROOT/lldb/test/API/gpu/nvidia/shadow_functions/TestNVGPUShadowFunctions.py

The test covers:

Name breakpoints in CUDA kernels are not left enabled on the CPU target.
File/line breakpoints in CUDA kernels are not left enabled on the CPU target.
The behavior holds for multiple kernels in the same test binary.
Breakpoints created after GPU target creation are also filtered correctly.

TODOs

Ideally, we handle dlclose by re-enabling the shadow function host side breakpoint locations. We're deferring this to a future change.

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

on GPU target creation Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

clayborg

So this does follow what NVidia does within GDB. Though the cost can be quite high and cause a lot of time wasted processing and identifying all shadow functions even though we might set a breakpoint in a few of them. We only care about the identifying which breakpoints are in shadow functions.

A good solution would only check each breakpoint location to see if it is a shadow breakpoint and disable it. We don't need to parse everything and make a huge map where 99% of the contents will never be accessed and making this map will cause delays in starting the debug sessions.

clayborg · 2026-04-15T18:59:05Z

 protected:
  StatsDuration m_create_time;
  StatsDuration m_load_core_time;
+  StatsDuration m_shadow_function_identification_time;


remove and make a virtual platform method to get statistics for a platform. The default Platform::GetStatistics() should get the plug-in name only:

"platform": { "name": "nvgpu", }

Subclasses should override this and call the base class and add any key/value pairs that make sense for the platform itself.

clayborg

Things to fix:

rvemo

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

fixes Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

bool we weren't really making use of it anyway. not sure if we'd need a return in the future, but no need for now. Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

Tom Yang added 9 commits April 8, 2026 17:10

fix logging template vars in PlatformNVGPU

f697de4

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

claude attempt at book-keeping shadow functions

d470a2c

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

only search for shadow functions in host modules, and search all modules

0403040

on GPU target creation Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

disable future breakpoints if shadow function

f6a96d0

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

allow subsequent IdentifyShadowFunctions calls

0ef92b1

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

use intervalmap for shadow function tracking

793de1c

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

move shadow function search to after GPU process successfully connects

4799625

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

some light refactoring

d9dd4dd

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

add Target API to iterate through all GPU plugin targets, refactoring

4acc53e

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

zhyty requested a review from walter-erquinigo April 9, 2026 00:28

agontarek self-requested a review April 9, 2026 20:49

agontarek reviewed Apr 9, 2026

View reviewed changes

Comment thread lldb/source/Plugins/Platform/NVGPU/PlatformNVGPU.cpp Outdated

agontarek reviewed Apr 9, 2026

View reviewed changes

Comment thread lldb/source/Plugins/Platform/NVGPU/PlatformNVGPU.cpp Outdated

agontarek reviewed Apr 9, 2026

View reviewed changes

Comment thread lldb/source/Plugins/Platform/NVGPU/PlatformNVGPU.cpp Outdated

Tom Yang added 3 commits April 14, 2026 17:20

track shadow function processing time in statistics

96c3ada

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

change wrapper function retrieval to use more precise API

d563fa3

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

remove asserts

c26bcdd

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

zhyty marked this pull request as ready for review April 15, 2026 02:01

zhyty requested a review from clayborg April 15, 2026 16:12

clayborg reviewed Apr 15, 2026

View reviewed changes

clayborg requested changes Apr 15, 2026

View reviewed changes

agontarek force-pushed the meta-nvidia branch from 041e469 to 23ad52d Compare April 17, 2026 20:02

zhyty force-pushed the meta-nvidia branch from 23ad52d to b4c306e Compare April 17, 2026 20:34

Tom Yang added 4 commits April 20, 2026 00:53

greg's suggested changes to shadow function

f9616f9

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

add another kernel function to shadow functions test

0499266

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

documentation for Platform::HandleNativeBreakpointLocation, minor style

be932b9

fixes Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

make Platform::HandleNativeBreakpointLocation return void instead of

fff2ab6

bool we weren't really making use of it anyway. not sure if we'd need a return in the future, but no need for now. Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

zhyty requested a review from clayborg April 21, 2026 06:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RFC][lldb][NVGPU] introducing "shadow functions" to cuda-lldb#94

[RFC][lldb][NVGPU] introducing "shadow functions" to cuda-lldb#94
zhyty wants to merge 16 commits into
clayborg:meta-nvidiafrom
zhyty:shadow-functions-for-pr

zhyty commented Apr 9, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

clayborg left a comment

Uh oh!

Uh oh!

Uh oh!

clayborg Apr 15, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

clayborg left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

zhyty commented Apr 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

What are "shadow functions"?

How does this PR identify shadow functions?

Plugging Into LLDB's Lifecycle

Why this design?

Test Plan

TODOs

Uh oh!

Uh oh!

Uh oh!

Uh oh!

clayborg left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

clayborg Apr 15, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

clayborg left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

zhyty commented Apr 9, 2026 •

edited

Loading