Skip to content

[ROCm] Check native BF16 support for AMP#51

Open
austin1997 wants to merge 1 commit into
ROCm:paddle_hackthonfrom
austin1997:rocm-bf16-capability-check
Open

[ROCm] Check native BF16 support for AMP#51
austin1997 wants to merge 1 commit into
ROCm:paddle_hackthonfrom
austin1997:rocm-bf16-capability-check

Conversation

@austin1997

Copy link
Copy Markdown

PR Category

Environment Adaptation

PR Types

Improvements

Description

当前 AMP 的 BF16 可用性判断在 ROCm 构建下直接返回 True,可能导致不具备 native BF16 的 AMD GPU 进入 BF16 autocast 路径。

本 PR 将 BF16 capability 判断下沉到 C++/PHI GPU info:CUDA 保留 compute capability 与 runtime 版本判断;ROCm 基于 gcnArchName 仅对 gfx90a、gfx94*、gfx95* 返回支持。Python autocast 统一调用 core.is_bfloat16_supported,并补充单测覆盖 ROCm 分支。

是否引起精度变化

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant