[AIROCMLIR-49] LLVM Upstream Merge - April 2026#2239
Conversation
|
@mirza-halilcevic Github is unable to render such large changes.
|
The diff files are too large to be uploaded as attachments (in gigabytes). If needed we can generate the diff with: |
The diff files have historically only been a couple of hundred lines for both llvm and mlir. If we are seeing the file size be a couple of gb then it seems like something went wrong. |
|
Also, when it comes time to merge in this PR can we try to have a clean git history as much as possible? I think having a single commit for bumping the LLVM version, then another commit for any external changes that we need to apply, and then finally a third commit for any rocMLIR changes that were needed because of the upstream bump. |
I meant git diff between upstream SHA and with this branch and not against the develop branch |
d4d1212 to
c371694
Compare
|
@umangyadav @justinrosner diff files can be found in the PR description |
…6a98
f5f2faf16a98 [amd] Add to CODEOWNERS for device-libs/hipcc (#1649)
403e9af59db7 merge main into amd-staging (#1645)
25fcb507e040 [AMDGPU] Add -amdgpu-gfx1250-b0-specific option (#1644)
2a255ac32fc4 merge main into amd-staging
772c38832056 [Comgr] Add COMGR_STATIC_LLVM option for static LLVM linking (#1640)
ec15263cb898 [lld][WebAssembly] Convert weak-alias tests to assembly. NFC (#184667)
90c6e6374b95 [clang-tidy] Fix readability-else-after-return for if statements appear in unbraced switch case labels (#181878)
d8e86a5cc0d5 [dsymutil] Add option to filter debug map objects by allow/disallow-list (#182083)
f26ff8d86fd8 [LLDB] Allow symbols added by linker scripts to be examined. (#184679)
933e5368ba02 [DebugInfo] Don't specify target triple for cross-cu-linkonce-distinct.ll (#184685)
7822c770cf48 merge main into amd-staging (#1642)
97275e57b4fc [hipcc] Add HIP_CLANG_LAUNCHER for launching Clang through a wrapper executable (#1490)
585928482419 [MemProf] Add stack IDs to MemProfUse optimization remarks (#184670)
5c08616df765 [DAG] isKnownToBeAPowerOfTwo - Power of 2 value is known to be power of 2 after BSWAP/BITREVERSE (#182207)
28b840815c22 [HLSL] groupshared variables should be implicitly extern and should not be initialized (#184459)
f986523c5937 [CIR] Fix Codegen for Complex & Scalar comparisons (#184006)
ade43a54d441 [WebAssembly] MC support for acquire-release atomics (#183656)
bb78a0a47083 [CIR] Fix __builtin_va_start handling (#184654)
149e62c29bf8 [clang-doc] Add DAG directive to MD "All files" test (#184671)
56ed7485b75e [CIR][NFC] Remove unnecessary call to clangCmpToCIRCmp (#184217)
116b44526724 device-libs: Use frexp builtin instead of frexp_exp + frexp_mant (#1469)
63074da25d16 [DebugInfo][DwarfDebug] Move emission of globals from beginModule() to endModule() (5/7) (#184219)
6e1aee4276bb [AMDGPU] Select v_bfe_u32 for i8/i16 (and (srl x, c), mask) (#182446)
d62fbb6a9664 [SPIRV] Update the global registry when expanding function pointer (#183873)
a212ebd471b1 ValueTracking: Handle constant structs in computeKnownFPClass (#184192)
20902f0b721b ValueTracking: Teach computeKnownFPClass to look at bitcast + integer max (#184073)
34541e5a5400 [HLSL] Add WaveActiveAllEqual functions (#183634)
75b2ea57d5f4 [Clang][UnsafeBufferUsage] Warn about two-arg string_view constructors. (#180471)
2032960b5de1 [flang][NFC] Converted five tests from old lowering to new lowering (part 24) (#184538)
a7914aecb2db [RISCV] Allow unsigned immediates for pli.h, pli.dh, pli.w (#184554)
64c0f624cd07 [lldb] Make the PluginManager thread safe (#184452)
9dc65372aa61 [clang-tidy] Don't report unnamed params for misc-const-correctness (#184388)
df52bb4c32e2 [LoopUnrollPass] Don't use clang specific syntax in optimization remarks (#182430)
0baf5a0496a5 Revert "Silence -Wunused-parameter warnings in Unwind-wasm.c" (#175776)
fbc3a312d69d [mlir][xevm] Remove unnecessary attach target pass. (#184432)
2c95b8d5185b AMDGPU: Clean up print handling of AMDGPUTargetID (#184643)
2b3e30d4c470 [CodeGen] Treat hasOrderedMemoryRef as implying arbitrary loads or stores (#182000)
d9d6b16cc622 ValueTracking: Handle ConstantDataSequential in computeKnownFPClass (#184191)
ff0220d236de [flang][acc] Allow orphaned acc cache directive (#184448)
937bf9cef69f [bazel] Fix parse_headers in bolt (#184648)
56a53550d317 [DebugInfo] Emit DW_AT_const_value for constexpr array static members (#182442)
b80248a0ea35 [clang-doc] Add a Mustache Markdown generator (#177221)
c2db12daa171 [AIX] Sort relocations in XCOFF object writer. (#180807)
302860455663 [bazel] Add target for `clang-nvlink-wrapper` (#184644)
9105d9c24949 [lld][Hexagon] Fix findMaskR8 missing duplex support (#183936)
d326a76d9a1f merge main into amd-staging
9cd054b0bb3e [AArch64] Add lowering for misc NEON intrinsics (#183050)
b9f1199581e7 InstCombine: Support extractvalue in SimplifyDemandedFPClass (#184171)
b0b583475a03 [DAG] Improved handling of ISD::ROTL and ISD::ROTR in isKnownToBeAPowerOfTwo (#182744)
a14f9f822f48 [mlir][xegpu] Add support for accessing the default order of a layout. (#184451)
18226e7e2eb8 [RISCV] Lower i8/i16/i32 scalable vector ISD::CLMUL/CLMULH with Zvbc32e. (#184465)
77f1480f7ef1 [SPIRV] Fix return value of runOnModule for SPIRVPrepareFunctions (#184636)
668d09b2846d [clang][Modules] Fixing unexpected warnings triggered by a PCH and a module with config macros (#177078)
4b3a9246a086 [bazel] Fix more parse_headers cases in lldb (#184534)
8f8590e691fe [OpenMP][AIX] Add libpthreads for -fopenmp (#184629)
39d5aea6df90 [OpenACC] Replace terminators with scf.yield in wrapMultiBlockRegionWithSCFExecuteRegion (#184458)
613a5c555ebf [mlir][vector] Replace OneDimMultiReductionToTwoDim with OneDimMultiReductionToReduction (#184241)
7b72b5fde414 [bazel] Fix building lldb without libedit (#184535)
8486d893bd79 [SelectionDAG] Fix -Wunused-variable after #179318 (#184623)
a40e83b29ce9 [libsycl] Add sycl::queue stub (#184110)
53aa77092ea7 [flang] Fix distribution build of Fortran builtin/intrinsic modules. (#184204)
b0f953f0f395 merge main into amd-staging (#1638)
e8e8d30b229a [Hexagon] Use __HVX_IEEE_FP__ to guard protos that need -mhvx-ieee-fp (#184422)
f55080da988a [flang][OpenMP] Avoid implicit default mapper on pointer captures (#184382)
247a9bfc26ad [mlir][AMDGPU] Add folders for memref aliases to TDM base creation (#184567)
a3eb13b5bfd1 [X86] remove unnecessary movs when %rdx is an input to mulx (#184462)
ded64d2417d4 [DTU] fix dominator tree update eliding reachable nodes (#177683)
b28ec5ad1808 [mlir][Func] Fix FuncOp verifier ordering via hasRegionVerifier (#184612)
e5a6a0f10856 [SPIRV] Fix global emission for modules with no functions (#183833)
c123642824fd [CI] Install binutils-dev in pre-merge container (#184608)
33be2d0e7ab2 [AArch64] Update clmul tests after #184403 (#184611)
c9ca768c88cc [mlir][shape] Fix crash when shape.lib array references undefined symbol (#184613)
56e0b6af1d39 [mlir][affine] Fix crash in vectorizeAffineLoopNest test utility for reduction loops (#184617)
c370f5af6c8d [VPlan] Preserve IsSingleScalar for hoisted predicated load. (#184453)
50653e5a0d1d [tosa] : Enhance tosa.slice folding for dynamic dims. (#184615)
11c11ec2e9fd [clang][Lex] Preserve MultipleIncludeOpt state in Lexer::peekNextPPToken (#183425)
5c2740784224 [analyzer] Suppress optin.cplusplus.VirtualCall warnings in system headers (#184183)
073de3b80375 [SPIRV] Rename `selectSelectDefaultArgs` to `selectBoolToInt` (#184120)
0cbba3ed5f12 [flang-rt] Fix incorrect condition for removing backtrace (#184610)
c6bb6a7e4254 [LV] Add `-force-target-supports-masked-memory-ops` option (#184325)
71de1e47c012 Reapply "[AArch64] Wrap integer SCALAR_TO_VECTOR nodes in bitcasts (#172837)" (#183380) (#184403)
21c1ba16edc0 [TableGen] Complete the support for artificial registers (#183371)
c2e22e3b797d [clang][cmake] Add option to control hmaptool installation (#172725)
47766d7f8c39 [AMDGPU][Clang][Doc] Add documentation for WMMA builtins (#183939)
1b3545117df0 [mlir][irdl] Fix crash in TypeOp/AttributeOp verify on empty sym_name (#184598)
05fdd5383967 [Clang] Fix the lambda context for constraint evaluation (#184319)
c985dec6c26d [psdb][Linux] enable aomp smoke test (#1626)
0af2d43e0641 [Clang] Warn if both of `dllexport`/`dllimport` and `exclude_from_explicit_instantiation` are specified (#183515)
5cf09a68a63d [AArch64][ISel] Use vector register for scalar CLMUL (#183282)
98ed41718b0f [LV] Transform tests for early-exit with stores (#183288)
8bb41c929f3a AMDGPU: Fix copy of Triple (#184594)
095e1694d9c0 [clang] Turn misc copy-assign to move-assign (#184144)
c2784e11cc44 [Flang][OpenMP] DEFAULT(NONE) error checking on implicit references (#182214)
1f4074b771be [mlir][llvm] Fix SROA crash on empty LLVM struct types (#184596)
75320d07f881 merge main into amd-staging
0a1e39517b22 [nfc][analyzer][test][z3] Replace "REQUIRES: no-z3" with "UNSUPPORTED: z3" (#184349)
ee8184573f03 Revert "[flang] make lowering to scf.while default" (#184592)
9c2829f2e188 [mlir][Func] Use getMutableSuccessorOperands() in FuncOp verifier (#184589)
7f044944e43e [MLIR][Arith][Vector] Reject i0 integer type in arith and vector ops (#183589)
ee92ac2343f6 [mlir][nvgpu] Fix crash in optimize-shared-memory pass with vector element types (#179111)
943eb6fd958e [LV] Use make_early_inc_range in handleFindLastReductions (#184340)
d0f50d55746a [AMDGPU] Remove DX10_CLAMP and IEEE bits from gfx1170 (#182107)
de5e081a8339 [flang][NFC] Converted five tests from old lowering to new lowering (part 23) (#184533)
8ac00ba7f9f1 [mlir][SCFToEmitC] Fix crash when scf.while carries a memref loop variable (#183944)
f1aa7c3c5fc9 [mlir][cf] Canonicalize block args with uniform incoming values (#183966)
f702ee89c1d7 [VPlan] Fix partially uninitialized accesses after 17aaa0e590a7. (#184583)
2aab31a94e7b [X86] combine-fcopysign.ll - extend test coverage to all x86-64/x86-64-v2/x86-64-v3/x86-64-v4 levels (#184579)
177211a99f70 [AArch64] Generate test checks (NFC) (#184582)
6bdf076137d0 [clang] Predefine `_MSVC_TRADITIONAL` in MSVC compatibility mode (#184278)
1582dd9c31d5 [lldb] Change more uses of AppendMessageWithFormat to AppendMessageWithFormatv (#184337)
d737cd505562 [clang-tools-extra] Turn misc copy-assign into move-assign (#184146)
756d068ead7e [MLIR][Python][Transform] Expose PatternDescriptorOpInterface to Python (#184331)
9cc0df99de85 [clang-repl] Create virtual files for `input_line_N` buffers (#182044)
c62d5f35b678 [AArch64] Avoid folding sign-extend of vector extracts into ALU ops (#183522)
14af5be5da77 [lldb] Add arithmetic binary subtraction to DIL (#184017)
b6761b287f6b [clang-tidy][NFC] Add missing Option tests in `bugprone` [1/N] (#184015)
732f66eccc24 libclc: Reimplement amdhsa workitem functions (#184571)
31f69d333e49 [libc] Fix integration test args/env in LibcTest lit format (#184438)
3d52f0c539d8 [SPIR-V] Don't consider a function be a builtin just by checking name (#182776)
a636928bb4db [SelectionDAG] Add expansion for llvm.convert.from.arbitrary.fp (#179318)
1a7502592f0f [ARM] Generate test checks (NFC) (#184574)
e7db3f1d3df6 [DSE] Handle provenance when eliminating tautological assignments
b86f24fd0ed4 [InstCombine] make `foldBinOpIntoSelectOrPhi` fold on all operands (#183692)
8fcb60aa47e7 [libc++][NFC] Introduce __data() to std::string to replace std::__to_address(__get_pointer()) (#178212)
c9355cc121df [ELF] Move ArmCmseSGSection into Arch/ARM.cpp (#184570)
3bb4a506c590 [WebAssembly] Print type signature and table for call_indirect (#179120)
52dd63d3caa3 [mlir] Add option to ignore commutativity in OperationEquality (#181507)
2be2926b56ea [Loads] Allow replacement of null with ptr in `canReplacePointersIfEqual`
7fbc0734f5b3 [clang][bytecode] Fix a few comment typos (#184561)
9a821584a516 [clang][bytecode] Fix a mishap in HasPtrField calculation (#184557)
deb70a6d643d [InstCombine] Don't strip leading zero index for overaligned vector GEP (#184364)
c612c98fa6dc [VPlan] Add const to VPPredicator methods. nfc (#184359)
ae4386775663 [DA] Remove consistent flag from Dependence class (#181608)
5b156a4372ac [AMDGPU] Add half vector support for table-driven libcall optimization (#178638)
6b59ad6e8d88 [mlir][linalg] Data layout propagation test schedule (#184151)
6fae863eba8a [X86][APX] Add a few pseudo opcodes support EGPR (#184550)
debb2514ea7f [MC] Fuse relaxation and layout into a single forward pass (#184544)
cd01e6526af6 [ELF] Add target-specific relocation scanning for LoongArch (#182236)
027447c61724 [MC][test] Add relax-branch-align.s demonstrating unnecessary branch relaxation (#184551)
dd8d5ffe0d08 [RISCV] Sink instructions so AVL dominates in RISCVVLOptimizer (#184155)
348f4fb9e00a [DA] Add tests that represent edge cases for the Weak Zero SIV tests (NFC) (#183735)
f5f0930c4715 [GVN] Fix crash when svcount is used with globals-aa (#184347)
5f29cdc175f6 [RISCV] Remove OperandType OPERAND_SIMM10_UNSIGNED. Rename OPERAND_SIMM8_UNSIGNED->OPERAND_SIMM8 (#184540)
4b85c1301fb6 [clang-tidy] Fix false positive in readability-redundant-preprocessor for builtin checks (#181734)
a6bbe463ce67 [clang][CIR] Pass VFS to command-line parsing (#184226)
502df3320553 [Flang][OpenMP] Fix unintended write-back shown in SWDEV-579431 (#1594)
e808a7f844ca [RISCV][GISel] Replace buildInstr with BuildMI (#183714)
e6b9d816f6a2 [Hexagon] Ignore formatting of generated proto files (#184427)
dcbc5de7f85f [lldb][NFC] Add missing include to LZMA.h (#184536)
6e1ab3a4310d [Serialization] Stop demote var definition as declaration (#172430) (#177117) (#184287)
7fb92cdf5fa6 [Benchmark] Fix warnings around usage of __COUNTER__ (#184524)
98c46261d926 [TargetLowering][PowerPC] Don't unroll vector CLMUL when MUL is not supported. (#184238)
1c434928d26c [bazel] Remove old zlib config variable (#184527)
3a85d99a1606 [bazel] Fix building lldb with zlib disabled (#184525)
5de659a44310 merge main into amd-staging (#1632)
928505c98345 [lld][WebAssembly] Convert more tests to assembly. NFC (#184418)
3b4d5ffe847c [MLIR][XeGPU] Add blocking and subgroup to lane distribution support for ConvertLayout operation (#183837)
45dbce3a3a3e [lldb] Wrap LLDBLog Initialize/Terminate in a class (NFC) (#184469)
53fbbaa577f2 [lldb] Fix Initialization/Termination for all log channels (#184467)
699563e0da93 [NFC] Don't replicate hasKernelCallingConv. (#184464)
5e5f7efd7706 [lldb] Expose block equality with SBBlock. (#184222)
60d729fdb226 [flang] Fix test breakage from recent preprocessor change (#184455)
630b9570d199 [mlir][math] Add constant folding for math.rsqrt (#184443)
ece4b759327c [lldb] Add C source output mode to formatter_bytecode.py (#184242)
76568dc89916 [LoopUnrollPass] Add `const` to parameters in `computeUnrollCount` (NFC) (#184058)
0c04d019f0a6 [NFC] [Doc] Fix text codeblock being declared llvm (#184461)
62144f48d43f [flang] make lowering to scf.while default (#184234)
393bbd55201a [gn build] Port commits (#184454)
0c9734f12055 [NFC] [doc] fix invalid comment syntax in IR (#184457)
87a4b36fbe7f [WebAssembly] Use MVT::i32 instead of i1 in performAnyAllCombine (#183866)
e71f327b4605 [X86] support reserve r8~r15 on X86_64 (#180242)
1b633d6d6d75 [Clang] Permit floating point and pointer values in most atomic ops (#183843)
28638f519783 [lldb] Remove Debugger::{FindTargetWithProcessID, FindTargetWithProcess} (#184446)
f4e64ceb4bd8 [lldb] AArch64 register 33 is not cpsr (#183860)
685a65a7f03d [clang-tidy] Add zeyi2 as maintainer (#183883)
9264159ae1df [lldb] Fix the GoogleTest teardown in the DAP unit tests (#184262)
5b144c0aec63 [AMDGPU] Add suffix _d4 to tensor load/store with 4 groups D#, NFC (#184176)
f00a05496471 merge main into amd-staging
1953b87a31a9 [CIR][CodeGen] Upstream support for `__builtin_isinf_sign` (#183977)
5f8065ef63e2 merge main into amd-staging (#1631)
89a4bcf02349 [CIR] Split cir.binop into separate per-operation binary ops (#184227)
c4ea6cc3f736 [lldb] Remove call_once wrappers around PluginManager::RegisterPlugin (#184273)
6b5c55ef169c [lldb] Fix 10 year old leak of `g_debugger_list_ptr` (#184259)
fe76fd292cc3 [AMDGPU][SIInsertWaitcnts][NFC] Call applyWaitcnt() in a loop (#184426)
fdc4a982f5d6 [AMDGPU] Add dereferenceable retAttr to a call to llvm.amdgcn.implicitarg.ptr (#182206)
dc1e3e5dbf78 [X86] getFauxShuffleMask - add ISD::ROTL/ROTR handling (#184417)
dc44bcafe08e [flang-rt] Fix NVPTX builds erroneously using backtrace support (#184415)
df1a53ae2424 Disable leak sanitizer test on ppc. (#184414)
4b06e8388559 [Github][CI] Bump CI containers to LLVM v22.1.0 (#184375)
80a1cf4f8058 clang: Add builtin header for amdhsa abi (#181993)
9d0c62c3ddb1 [X86] known-never-zero.ll - improve demandedelts test coverage for #183227 (#184411)
375d65ee8de7 [CIR] Implement EH lowering to Itanium form and LLVM IR (#184386)
5586d93a87ef [NFC] [HWASan] more meaningful BB names in use-after-scope test (#183867)
b4dfa43cb8ae [RISCV] Fix type inference ambiguity in SwapSysReg pattern (#184305)
8272546f6910 [HLSL][SPIRV] Fix `faceforward` pattern matcher logic (#183630)
17aaa0e590a7 [VPlan] Use bitfield to store Cmp predicates and GEP wrap flags. (NFC) (#181571)
899080a87ad9 [Analysis][DXILResource] Correct bound computation (#184198)
b5baf5e062b2 [CIR] Implement func-ptr/void-ptr addition/subtraction/inc/dec. (#184254)
c7c16573b8f3 [CIR] Synchronize CIR with recent changes to atomic ops (#184416)
a5ca0ec16bdd [libc++] Update documentation for _executeWithFakeConfig (#184420)
2d4c8e0d0fa2 [OpenMP][clang] Indirect and Virtual function call mapping from host to device (#184412)
03bd4ef4ecf9 [CIR] Handle vtable pure and deleted virtual functions (#183862)
6893d277575d [flang][acc] Improve clause validity check around do concurrent (#184389)
c5039c184827 [NFC] Refactor the SelectionDAG::getMemcmp etc with a existing helper function getRuntimeCallSDValueHelper (#184200)
e379ad78203b [LifetimeSafety] Use per-container invalidation rules to fix false positives (#183000)
80acaccbe644 [RISCV] Promote i8/i16/i32 scalable vector CLMUL to i64 CLMUL with Zvbc. (#184265)
f42b8a18d904 [sanitizer][Fuchsia] Define interceptor for reallocarray on Fuchsia (#184410)
ac950786b13e merge main into amd-staging
637bb0e37747 [WebAssembly][FastISel] Call materializeLoadStoreOperands in load fold (#184203)
90febba9c4ec [X86] vector-shuffle-combining-xop.ll - tests showing failure to combine shuffles with non-uniform rotates (#184397)
a34d56dee94b [AArch64] Fix relative vtable PLT/GOTPCREL specifiers to use MCSpecifierExpr (#184393)
ea79bcfcc579 [flang][OpenMP] Fix lowering of LINEAR iteration variables (#183794)
d0dd37124979 [MLIR][Canonicalization] Add shape_cast folding patterns (#183061)
6b040b0dee9c [HIP] Fix -save-temps with the new offload driver (#184385)
7161bd94fded [mlir][mpi] fixing 184189 build failures (#184399)
56b5af76cf3c [bazel][mlir] Fix Bazel build for a232b5b (#184394)
b926acfb341b [flang] remove unused variable (NFC) (#184293)
c1bba5ba023a [VPlan][NFC] Remove unnecessary explicit copy constructors (#183863)
7a310b4c5a06 [mlir][linalg] Upstream PackOp/UnPackOp's generateScalarImplementation. (#182838)
bf680bdf1349 [clang-tidy] Fix yet another false positive in `readability-redundant-typename` (#184301)
7b7c8b2eb3f1 [libc] Extend check-libc-lit to cover include, integration, and all src tests (#184366)
200600a06c20 [ELF] Move PPC32Got2Section into Arch/PPC.cpp (#184383)
616656bc5e1a [ELF] Move MIPS synthetic sections into Arch/Mips.cpp (#184384)
a8a2f2fe9976 [MLIR][XeGPU] Remove fold alias pass in xegpu (#182802)
f95662d159dc Revert "[OpenMP][clang] Indirect and Virtual function call mapping from host to device" (#184378)
b6f389e005d7 [clang-doc] Improve complexity of Index construction (#182621)
9081ac255a8b [DirectX][ResourceAccess] Resolve resource handles at access (#182106)
640ba7b05e75 [Github] Bump clang-format/clang-tidy to v22.1.0 (#184374)
b33c7db8eb63 [clang-doc] Add basic benchmarks for library functionality (#182620)
c72d2e503caf [Comgr] Keep LLVM temporary files if AMD_COMGR_SAVE_LLVM_TEMPS=1. (#1543)
779d76c9effd [AArch64] Add basic NPM support for LoadStoreOptimizer. (#184090)
b44dba97d059 [mlir] Install '.pdll' files along with the header files (#183855)
bb2b957c53b0 [Thumb2] Use BXAUT instruction if available (#183056)
829da4927bf1 [CIR][AArch64] Add lowering for vaba_* and vabd_* builtins (#183595)
a232b5b96f67 [mlir][shard, mpi] Adding Shard/MPI reduce_scatter and simplification (#184189)
5f8f1e2afe99 [CIR] Fix unreachable block generation in EH flattening (#184268)
f82f8cf8d498 [ELF] Add TargetInfo::initTargetSpecificSections hook (#184292)
3f1d968db946 [mlir][IR] Add variadic `getParentOfType` overloads (#184071)
e68f696fdae0 [CI][SPIRV][NFC] Remove unneccessary mkdir from workflow (#184353)
6cc42b39556d [libc] Various GPU allocator tweaks and optimizations (#184368)
d61b45cd409d [Clang] Generate ptr and float atomics without integer casts (#183853)
aef962708fe5 Reapply "[SPIRV][NFCI] Use unordered data structures for SPIR-V extensions (#184162)
02b2a1e8fe7f Fix `assignValueToReg` function's argument (#184354)
358f47772023 [Clang] Fix clang crash for fopenmp statement(for) inside lambda function (#146772)
e10655eb1dfc [X86] known-never-zero.ll - add sdiv/udiv vector test coverage for #183047 (#184350)
43503c44c8d0 [NFC][AArch64] isPureCmp is a duplicate of canAdjustCmp, so remove the duplicate (#183568)
81396ebc51c4 [AMDGPU] Generate more swaps (#184164)
e570faa87ed3 [SPIR-V][HIP] Disable SPV_KHR_untyped_pointers (#183530)
acb8a6df1991 [AArch64] Fix type mismatch in bitconvert + vec_extract patterns (#183549)
c9d065abc158 [X86] Add i256 shift / funnel shift coverage to match i512 tests (#184346)
5b976c930189 [libc][sys] add header and functions for sys ipc (#182700)
c782e2d40572 [SPIRV] Don't emit service function basic block names (#184206)
bbde3e3b59c8 [VPlan] Preserve IsSingleScalar for sunken predicated stores. (#184329)
1eeb2eccf8b2 [clang-tidy] Handle specialization of user-defined type in `bugprone-std-namespace-modification` (#183984)
33864efe461e [lld] Turn misc copy-assign to move-assign (#184145)
534d6e887ff8 [Analysis][NFC] Store CallbackVH in vector, not in map (#184323)
97043e50ad41 [mlir][Vector][GPU] Distribute expanding `shape_cast` ops (#183830)
cec32683498a merge main into amd-staging
de69348f80f5 [Reland] [APINotes] Refactor APINotesReader to propagate llvm::Error (#184212)
fa6eef837831 Revert "Avoid maxnum(sNaN, x) optimizations / folds (#170181)" (#184125)
703649554da8 [DAG] isKnownNeverZero - add ISD::OR DemandedElts handling (#183228)
d908184487b9 [AArch64] Limit support to f32 and f64 in performSelectCombine (#184315)
ec7f3503f8d0 [MLIR] Make test-block-is-in-loop pass a module pass (#184036)
a368bd4049db [CIR][CUDA]: Handle duplicate mangled names (#183976)
ee8259dcca82 [mlir][sparse] Fix use-after-free crash in SparseSpaceCollapsePass (#184001)
f934db36aa26 [mlir][sparse] Reject dense level after non-unique level in encoding verifier (#184157)
6884ff014277 [mlir][sparse] Fix crash in ForeachRewriter for rank-0 dense tensors (#183903)
8d082c7c3144 [mlir][sparse] Fix crash in sparse_tensor.new with unsupported element type (#183898)
bbd5b1d3bd07 [mlir][VectorToXeGPU] Fix crash on memref with non-scalar element type (#183905)
a8a5242bb2dc [mlir][XeGPU] Fix crash in wg-to-sg type converter on non-XeGPU tensors (#183914)
7856e9876808 [mlir][XeGPU] Fix crash in getUArch when no chip target attribute is set (#183912)
ecec7920c636 [mlir][func] Move return-type verification from ReturnOp to FuncOp (#184153)
263a22e86556 [mlir][xegpu] Fix crash in XeGPUPropagateLayout when module has llvm.func (#183899)
6ee48f2ce747 [RISCV] Remove VL != 1 restriction in RISCVVLOptimizer (#184298)
36cced2b8244 [NFC][AArch64] Refactor Arm llvm-mca tests (#183294)
245887e343d3 [X86] Added sincos vector lib codegen test coverage (#183702)
0b36d4265e30 [AArch64] Add vector expansion support for ISD::FCBRT when using ArmPL (#183750)
03a9ebc8974b [DAG] isKnownNeverZero - add ISD::UADDSAT/UMAX/UMIN DemandedElts handling and tests (#183992)
5e814e26dd72 [mlir][llvm] Fix crash in LLVM inliner when callee has no recognized terminator (#183949)
d1c563beee79 [lldb] Don't link TestingSupport as a component (#184310)
4d3bdc0f8947 [lldb] Use AppendMessageWithFormatv in ComandObjectWatchpoint (#184128)
91e73b93e881 [MLIR][XeGPU] Allow uniform vectors in layout conflict resolution (#183756)
8879ff136c73 Support unnamed functions in MIR parser (#183018)
b4fffcd8e415 [NFC][Docs] Add documentation for NVPTX conversion intrinsics (#175536)
5d8c6c198dde [LangRef] Mention allocation elision (#177592)
b4743b2641b6 [VPlan] Introduce VPlan::get(Zero|AllOnes) (NFC) (#184085)
39f2740facea [AMDGPU] IGroupLP: Avoid repeating reachability checks in greedy algorithm (#182463)
09217ba90459 [lldb] Disable shared build for TestTemplateArgs,TestEvents,TestTypeList (#184304)
c4e2f79c22d2 [AArch64][GlobalISel] Limit srem by const of small sizes. (#184066)
92bd6eee4db3 [libc] Reland add getc, ungetc, fflush to enable libc++ iostream on baremetal (#183556)
0933b634c6a2 [AMDGPU] IGroupLP: Refactor SchedGroup::initSchedGroup (NFC) (#184122)
eb1e808fdb44 [IR] Mark reduction intrinsics as nocreateundeforpoison (#184173)
9cda40735a17 [clang][Sema] Fix initialization of GRO when GRO-return type mismatches (CWG2563) (#179156)
b67536954eb1 [Clang][NFCI] Make unchanged global state const (#183478)
ecb694de65e8 [Clang][NFCI] Initialize PredefinedNames immediately (#183295)
a631c3f4077c [mlir][spirv] Expand verifier testing for spirv.Tosa ops (#184112)
7fb5a02dcda1 [CMake][AST] Add PCH (#183358)
0fff939c1aa9 [mlir][linalg] Lower unpack - capture handle to created copy op (#183744)
d59a267ad541 device-libs: Use generic fshr builtin instead of alignbit (#1468)
78ac964c47cb [RISCV][NFC] Prepare for Short Forward Branch of branches with immediates (#182456)
f67c2cd75e25 [RISCV] Handle Zvabd and XRivosVizip EEWs in RISCVVLOptimizer (#184117)
0504af9e3bf7 [llvm] Turn misc copy-assign to move-assign (#184143)
e4def2d11fb5 [AMDGPU] Make the options consistent across 3 RA pipelines(NFC) (#184190)
d20395cfa3bb [LegalizeVectorOps][RISCV][PowerPC][AArch64][X86] Enable the clmul/clmulr/clmulh expansion code. (#184257)
4ea39c43e133 [LIT] Use forward slashes in substitutions when LLVM_WINDOWS_PREFER_FORWARD_SLASH is set (#179865)
75b0cf39b2f8 [RISCV] Add scalar saturating add/sub operations for i32 for RV64P (#184062)
84d0f8766de2 [RISCV] Alphabetize riscv_files in clang/lib/Headers/CMakeLists.txt. NFC (#184024)
30fc31aa71fa [NFC][TableGen] Add deleted copy operations for RAII guard classes (#184168)
eba4a76597dd [CFI] Expand test to include minimal runtime (#183646)
da8929bd2404 [bazel][mlir][acc] Port e63e55cae8ce29150f38a758555d9cc712a1cf4c (#184289)
a85dbcfe016d [clang][bytecode] Reject non-VarDecl DeclRefExprs (#184141)
5a53fce8582b [RISCV] Extends RISCVMoveMerger to merge GPRPairs independent of even/odd pair instruction order. (#183657)
198f85ea7c17 [clang][bytecode] Fix newly added pfp test (#184137)
b23438661c10 [OpenMP][clang] Indirect and Virtual function call mapping from host to device (#159857)
572a0e45c637 AMDGPU: Remove "MBUF" from "loadMBUFScalarOperandsFromVGPR" (#184282)
6d25af00ac47 [utils] use annotations from __future__ in lit (#184225)
768240d01952 [AMDGPU] Insert readfirstlane for uniform VGPR arguments (#178198)
e63e55cae8ce [mlir][acc] Add ACCRecipeMaterialization pass and reduction ops (#184252)
582586d36e09 [psdb] staging nightly build status notifier with more details
92aa2d36f020 [Github] Respect LLVM_VERSION when building windows container (#184231)
52f32d780fa2 [Github] Bump Github Runner to v2.332.0 (#184230)
8decfb8a90df [mlir][emitc] Do not convert illegal types to emitc (#156222)
2407564cbfa1 [Clang] Add missing extension cl_intel_split_work_group_barrier declaration (#184269)
a6fa21c5aabb [CIR] Upstream basic CodeGen tests from incubator (#183998)
82319d74aae4 [RISCV] Update Andes45 vector reduction scheduling info (#182980)
4f91d0b322a8 [libc++] Give proper names to a few benchmarks (#183333)
0ced81f7eabc [NFC][OpenMP] Remove redundant prints in `target` regions from tests added in #184260. (#184266)
1d1c83ad7397 Reland "[OpenMP][Offload] Handle `present/to/from` when a different entry did `alloc/delete`." (#184260)
d4d18248fde6 [lldb] Terminate the LLDB Log in SystemInitializerCommon::Terminate (#184261)
03e2af7a65ea [CIR] Fix bitfield store locations for assignment codegen (#184005)
0f8aa9610c0a [lldb][NFC] Whitespace cleanup in RegisterContextMinidump_ARM64 Breaking out the whitespace changes turned up in a separate contentful PR.
743428688fb0 [flang] Recognize compiler directives after expansion in comment (#183626)
4e3e4f25bc4a [WASM] add CheckWasmTableElement helper (#181172)
6719ec1e9512 [Coroutines] Replace struct alloca frame with byte array and ptradd (#178359)
a4f9d43eef7f [alpha.webkit.NoDeleteChecker] Add a test for unsafe function override (#184208)
23f21f3e277d [CIR] Implement function/call attribute parsing (#184185)
abb228af20c9 [CIR] Fix handling of cleanup scopes inside a try body (#183869)
c433ae7e2e57 Revert "Add a test that we recover from a crashing breakpoint condition."
a14d8b2e36d4 [CIR] Upstream vtable thunk handling (#183629)
49c3cd15e8b4 Add a test that we recover from a crashing breakpoint condition.
78f259fcc14b [MLIR] mlir_levelzero_runtime: remove dependency on LLVM (#182942)
4995b2b8591d [Github] Enable long paths in windows CI Container (#184224)
4f50a725fa19 [clang][clang-scan-deps] Add LangOptions::AllowLiteralDigitSeparator to fix #88896 (#184235)
42a0fbc2c792 Revert "[OpenMP][Offload] Handle `present/to/from` when a different entry did `alloc/delete`." (#184240)
5156147824be [libc] Declare reallocarray in stdlib.h / malloc.h (#184223)
4a9e0812c506 [flang] Allow acc cache directive inside acc routine (#184213)
3c43fc16b73b [clang][deps] Remove the `finalize()` API for by-module-name scans (#184232)
895597a1f579 [Github][bazel] Run `buildifier --mode=diff` on error (#184233)
6dcaffaa1757 [psdb] use linux-mi325-4gpu-ossci-rocm resource label
526a4d4d8a6a [LAA] Always use DepCands when grouping runtime checks. (#91196)
f52a2035548f Revert "[mlir][acc] Replace terminators with scf.yield in wrapMultiBlockRegionWithSCFExecuteRegion (#183758)" (#184228)
fab5681686a1 comgr: Add new path to automatically embed from the resource directory (#1476)
183d02d257f6 [clang] NFC: remove unused / untested workaround in pack deduction (#183875)
ea7ff48c3108 [DominanceFrontier] Support multiple root nodes for post-dom (#181257)
61310cd72dd2 [Github] Remove force build from windows container
533f16fe8969 [clang-tidy][NFC] Add `findTokenInRange` and reuse it (#183941)
8107c71511b3 [RISCV] Put Large Code Model Constant Pools in .text (#151393)
1a7060a7b07c [OpenMP][Offload] Handle `present/to/from` when a different entry did `alloc/delete`. (#165494)
501c6fda951b [CMake] Propagate dependencies to OBJECT libraries in add_llvm_library (re-land) (#184201)
ebe3c1ee991c [flang] Remove usage of the `DependencyConsumer::finish()` API (#184229)
5ae64c620750 [Clang][Sema][Builtins] Check argument count for `__builtin_allow_sanitize_check` (#183927)
8a9049198d18 [clang] Replace `finish()` with destructors for `DiagnosticConsumer` (#183831)
a4d786630c47 [lldb][ARM] Support thread local variables on ARM Linux (#181315)
03773c3b06b2 [APINotes][NFC] Fix typos and header comment errors (#183811)
386a3afa553f [mlir] Fix typos that propagate downstream. NFC. (#184220)
8e6e9cb8c203 [HLSL][NFC] Move SemaHLSL resource tests to Resources subdir (#183386)
f7176ee33662 [bazel][mlir][acc] Port 12f4eb2156559c2f8c99fa7dc3b59cb4fef1389d: scf.yield (#184216)
ed524ba0d458 [llvm] Avoid resolving `.incbin` during symbol collection (#172920)
ed085573f402 [SemaHLSL] Warn when a local resource is re-assigned to non-unique global resource (#182101)
0797a10cc537 [MLIR][XeVM] Rewrite llvm.alloca if addr_space is 3 (#183417)
96a02c5eb53c Revert "[APINotes] Refactor APINotesReader to propagate llvm::Error " (#184211)
9aff7b6347f1 [HIP] Fix wrong triple being passed to offload-bundler (#184195)
24873cb95574 [SelectionDAG] Pass DemandedElts to isKnownNeverZero for extend nodes (#183624)
12f4eb215655 [mlir][acc] Replace terminators with scf.yield in wrapMultiBlockRegionWithSCFExecuteRegion (#183758)
973f7606fba4 [HWASan] [MTE] support double lifetime.end in same BB
0cabe933812f [HLSL] Reintroduce dx.disable_optimizations to set DisableOptimization Shader Flag (#180069)
fb6038d93781 [DAG] isKnownNeverZero - add ISD::SRA/SRL DemandedElts handling and tests (#183577)
b3c4d44c4423 [lldb] Batch breakpoint step-over for threads stopped at the same BP (#183412)
a171b8d4d523 [NVPTX] Refactor NVPTXLowerArgs and move helpers to NVPTXUtilities (#183686)
5ff5a1f14761 Revert "[CMake] Use keyword signature in two additional callsites (#1… (#184186)
2846cb31e045 merge main into amd-staging (#1615)
ae363d50ad29 [HLSL][Matrix] Make Matrix InitListExprs and AST row-major order, and respect /Zpr and /Zpc in codegen (#182904)
28d294e080b3 [flang] Let -fdisable-real-10 affect only user code (#183870)
d723d14e4c34 [flang][runtime] Emit "Infinity" rather than "Inf" when required (#183359)
dd3d727c88b5 Revert "[llvm-ir2vec] Adding Inst Embeddings Map API to ir2vec python bindings" (#184179)
f486fc95db20 [clang-tidy] Nominate myself as a maintainer (#183173)
307d912378ac [clang][analyzer] Add taintedness to argv (#178054)
ab5205c916a7 [llvm][DebugInfo] Emit DW_LNAME_Assembly for DWARFv6 assembly CUs (#183897)
573a54120207 [clang-tidy][NFC] Use singe mock string header in tests (#183996)
9d1fd9ec1eb8 [AMDGPU] Extend DS loop wait optimization with flush point tracking (#175658)
447eba88c8d7 [lldb][Target] Allow eLanguageTypeAssembly to use ScratchTypeSystemClang (#183771)
fd578f7c5c98 [libomp] Fix hwloc include for non-standard paths (#184087)
41fc9b98459c [LAA] Fix recordAnalysis receiving null Instruction pointer (#183512)
cf8597bd3b87 [clang][Modules] Handle relocated modules during implicit module builds (#181836)
d7eec97bd83f [APINotes] Refactor APINotesReader to propagate llvm::Error (#183812)
148b10be8ad3 [flang][OpenMP] Support custom mappers in target update to/from clauses (#169673)
f3e8508ac771 [clang][ssaf] Add `JSONFormat` serialization support for `LUSummary` and `LUSummaryEncoding`
9ae143149b6f [mlir][bazel] Fix build after moving AMX into X86 in #183717. (#184165)
95832c9bfd7c [LinkerWrapper] Fix a bunch of minor issues and typos (#183679)
82d747e49142 [X86] known-never-zero.ll - add additional demanded elts vector test coverage (#184159)
b11a424e0582 [flang] Inline trivial scalar allocatable assignments in HLFIR-to-FIR (#183177)
11576569336d [lldb][Process/FreeBSDKernelCore] Fix RegisterContext for arm64 (#183947)
f5e8e98a4ef4 [mlir][VectorOps] Fold extract on constant_mask (#183780)
3357e487cf0e [clang/APINotes] Fix assertion crash in addObjCMethod for protocol DesignatedInit methods (#183799)
64139516e5c2 [X86] known-never-zero.ll - remove unnecessary declarations (#184142)
bd02c1712322 [X86] known-never-zero.ll - add shift right vector test coverage for #183577 (#184140)
a8fb8eb49f00 AMDGPU: Stop copying triple into AMDGPUSubtarget (#184147)
24ac5987b482 [bazel][libc] Add missing dep (#184152)
dbacb148dc41 [bazel][libc] Enable layering_check for libc/BUILD.bazel (#183822)
4f84347b2e7e [llvm-ir2vec] Adding Inst Embeddings Map API to ir2vec python bindings (#180140)
bc0af9901b51 [TableGen] Allow specification of underlying type for GenericEnum (#183769)
dfcbf6c70e70 [CVP] Stop CVP constant propagation from destroying `llvm.assume` (#183688)
070683157766 [mlir][bazel] Fix build after changes from #183856. (#184134)
da6b2db1a6e1 Revert "[VPlan] Remove unused VPExpandSCEVRecipe before expansion" (#184108)
8da1bb891e00 Reapply "[AMDGPU] Elide bitcast fold i64 imm to build_vector" (#160325) (#184114)
919ae1cd2f46 [lldb-dap] Skip return_variable_with_children on arm64 (#184132)
644f07cef5dc [CIR] Use `-verify` on clang/test/CIR/CodeGen/nonzeroinit-struct.cpp (#183910)
bb42c74b05c6 [clang-tidy] Add fixit capability to performance-use-std-move linter (#184072)
cca5bb52f37a [OpenMP] Use CreatePtrDiff() (#184127)
eb8f17162973 [clang][test] Add missing FileCheck pipe in n1311.c (#183965)
e0fa4952fd78 [ARM] Format ARMLoadStoreOptimizer Pass classes. NFC
1175046d14b9 [libc] Fix GPU loader propagation to lit test infrastructure (#184105)
c5e5c9735a33 [MLIR][MemRef] Validate linear size before lowering allocs (#179155)
19be8d60662b [mlir][tosa] Fix crash in TosaInferShapes when while_loop carries sparse tensors (#183943)
977355be38d1 [mlir][tosa] Disallow inferable dim in reshape/slice validation (#182472)
4e8be20faa1c [clang][test] Add multi-dim-array diagnostic test for multi-dimensional array function passing (#183847)
9d5ca5282d13 [IR] Return bool from replaceUsesWithIf() (#184107)
4a907a526dd7 [CMake] Add LLVM_ENABLE_WARNING_SUPPRESSIONS to toggle warning suppressions (#183439)
4af885c0c13c [AArch64] Fix performZExtUZPCombine() DAG combine (#183765)
bcc272b3220f [LV] Remove DataAndControlFlowWithoutRuntimeCheck. NFC (#183762)
ce79fb371245 [InstCombine] Always fold nonnull assumptions into operand bundles (#169923)
cd0eb16a11f8 [AArch64] Add maybe_unused to DstTy in assert. NFC
60fec80bdcb3 Revert "[VPlan] Remove unused VPExpandSCEVRecipe before expansion" (#184108)
88693c49d9ac [NFC][analyzer][test][z3] Move test cases requiring Z3 to the `Analysis/z3/` subdirectory (#183724)
87cbea6cdc9a [openmp][cmake][NFCI] Avoid non-eval uses of ${var} (#182267)
b39247c391c4 [AMDGPU] Fix typo "PGRM" in variable name. NFC. (#184104)
24d21ca03cd2 [flang][OpenMP] Fix counting generated nests (#183957)
0a53c0b9c360 merge main into amd-staging
d2b6a5a3f6e8 [LLVM][NVPTX] Fix infinite legalization loop in tcgen05.st (#183012)
82a1905c4bd3 InstCombine: Pass SimplifyQuery through SimplifyDemandedFPClass (#184096)
52df4a19599b [AMDGPU] Fix typos "SPGR" / "VPGR" in comments
dd871f55f018 [flang] Use CHECK-DAG to check constants (NFC) (#184097)
47383919d111 [AArch64][NFC] Remove unused parameters for `performORCombine` (#184075)
6cce18b9f518 [LoopIdiomVectorize] Avoid wrapping in find_first_of loops. (#180570)
c1d82e2f3e83 [mlir][reducer] Use LDBG in opt-reduction-pass (NFC) (#184026)
00af181a4edd [mlir][emitc] Fix crash in form-expressions when identity cast is folded (#183894)
482a7718a8d8 [DAG] visitCLMUL - fold (clmul x, c_pow2) -> (shl x, log2(c_pow2)) (#184049)
e44fd05035a3 [mlir][x86] Move AMX dialect into X86 dialect (#183717)
e3b01e132908 [lldb] Fix wchar addition tests in DIL (#184082)
13751c87076b [AArch64] Vectorise llvm.pow using vector intrinsic for ArmPL library (#183319)
730587d3be6c [DAG] isKnownNeverZero - add DemandedElts for ISD::SMIN/SMAX (#184054)
61faf7d3db72 [AArch64][GlobalISel] Use GPR for illegal fconstants and extend < 32 bit GPR constants to 32 bits (#178692)
0c89071fa33f github-automation.py: Fix mis-indented statement (#149653)
925ec952ddd8 [llvm][DebugInfo][test] dwarf-asm-multiple-sections.s: refine FileCheck checks
f46aca9bf84e [AArch64] Combine (and/or X, (dup (not Y))) -> (bic/orn X, (dup Y)) (#175739)
900f70258b90 [lldb] Indent option help with ANSI cursor codes when possible. (#183558)
3ad43f2d1c03 [LangRef] Clarify nsz semantics (#180906)
96113ac416e7 [Clang] Use llvm.ptrmask to mask out thumb bit (#183535)
037fd6eaaf45 [AMDGPU] Add VINTERP encoding to gfx13 (#182481)
86b07a79a9c3 [AArch64] Remove -aarch64-load-store-renaming=true from test. NFC
c7e1ec97b979 [flang][OpenMP] Implicitly capture variables in enclosing task for nested firstprivate (#183770)
4c2ac846bb77 [mlir][spirv] Add Element Binary Logical operators to TOSA Ext Inst Set (#183703)
36c6c689dc31 [compiler-rt][ARM] Fix conditions for strict-mode FP testing (#183507)
4922ab9915b0 [RISCV] Relax codegen predicates for HINT-based instructions (#179872)
0704b68a027a [gold] Fix test
c62c00c52405 [VPlan] Remove unused VPExpandSCEVRecipe before expansion (#181329)
51d9b40b0d09 [AArch64] Remove iXLen from sve-lrint.ll. NFC
f7b1107bf564 [IVDescriptors] Remove function FMF attribute check for FP min/max reduction (#183523)
265c1f483398 [LV] Add debug print for TTI.MaxInterleaveFactor (NFC) (#183309)
1c3327561977 [mlir][spirv] Introduce a base class for spirv.TOSA convolution ops (#183751)
7fbbbd7893d4 merge main into amd-staging (#1612)
14bcb1a00954 [BOLT] Make sure IOAddressMap exist before lookup (NFC) (#183184)
b4b32e88dde6 [BOLT][instr] Disable stderr diagnostic output when targeting Android (#183185)
3270bbf04cba [BOLT][instr] Make instrumentation counter reset thread safe (#183186)
b8d0bb2ddc77 [WebKit checkers] Trivial function analysis ignores some nodelete annotation (#183970)
6d82f143dee1 [clang-tidy] New performance linter: performance-use-std-move (#179467)
f1620e44412f [OpenCL] Enable __cl_clang_function_scope_local_variables for AMDGPU and NVPTX targets (#183892)
90eb27e56e3e merge main into amd-staging
ab1d59e72524 [clang-format] Allow InheritParentConfig to accept a directory (#182791)
52a9eb37db83 [Github] Add TODO around actions/attest
8fff1c042d14 Update actions/attest-build-provenance action to v4 (#184051)
686987a540bc ValueTracking/AMDGPU: handle mbcnt in computeKnownBitsFromOperator (#183229)
e95dabef96f4 [MLIR][Python] Support attribute definitions in Python-defined dialects (#183907)
8774da8f2f4d [MLIR][XeGPU] Preserve anchor layouts in recoverTemporaryLayout (#182186)
53a6db6a3eba merge main into amd-staging (#1611)
81872e7049ea [NFC] Fix check lines for `clang/test/CodeGenOpenCL/cl-uniform-wg-size.cl` on Darwin (#184042)
e6aafae828e0 [Polly] Update isl to isl-0.27-86-gcf471c16 (#184044)
d947f8f699eb [clang][Sema] fix crash on __type_pack_element with dependent packs (GH180307) (#180407)
f05d2e8a3998 [AMDGPU] Make uniform-work-group-size a valueless attribute (#183925)
e2ef93fc5750 [NFC] Remove `clang/test/CodeGenOpenCL/.gdb_history` (#184038)
d9ca61b6e7b9 Revert "[NFC][Clang] Auto generate check lines for `clang/test/CodeGenOpenCL/cl-uniform-wg-size.cl`" (#184035)
a06dcc7ccf38 merge main into amd-staging
789bf51f0ce6 [SLP]Do not consider condition with multiple uses and negate predicate as a candidate for inversed select
cf1e76835feb [clang-tidy][NFC] Don't call `getLangOpts` in `isLanguageVersionSupported` (#184029)
d1d2a1ed76a6 [SLP][NFC]Add a test with the incorrect compare, extracted from the transformed vector
dddd06be8c3e [NFC][Clang] Auto generate check lines for `clang/test/CodeGenOpenCL/cl-uniform-wg-size.cl` (#183926)
f62adea305d6 [ProfCheck] Exclude new GVN test
3cf53f684d51 [LV] Handle sunk reverse VPInstruction in planContainsAdditionalSimps.
1dc85c60410f [clang-tidy][NFC] Add `getCommentsInRange` utility (#183940)
48209b6777be [DAG] isKnownToBeAPowerOfTwo - add ISD::EXTRACT_VECTOR_ELT handling (#183924)
a13afe84bb42 [SLP][NFC]Add more bitcast/bswap tests with immediate loads, NFC
3041c90718df [mlir][tensor] Remove hard-coded types from `ConstantOpExtractSliceFolder` (#184013)
e4301c48fdd9 [bazel] Fix windows stack space on llvm driver link (#182998)
451529778d42 [clang] fix common type calculation for l-values of 'void' type (#183972)
02c7a6cd7f35 [SLP][NFC]Add tests for bitcasts/bswaps with large target type
262be3b7cbd8 merge main into amd-staging (#1610)
ae7916539918 [clang][NFC][diagnostics] Remove several uses of `getCustomDiagID()` (#172532)
320220e48b8f [VPlan] Support arbitrary predicated early exits. (#182396)
9730d3128435 [SLP]Fix types for reductions in revec
7b26069828aa [VPlan] Pass ForceTargetInstructionCost insted of NumOccurences.
a6e7c38ea631 [SLP]Do not vectorize select nodes with scalar and vector conditions
49b77e3b4555 [VectorCombine] Fold sign-bit check for multiple vectors (#182911)
3bdee9b5576b [GVN] Forward store values through select addresses in findDominatingValue (#183316)
2c9720972e90 [mlir][python] Add stable ABI (abi3) support (#183856)
0ba4f13b264a [mlir][test] Fix crash in ReifyBoundOp with invalid 'type' attribute (#184004)
785490e9db54 [MLIR] Remove `let constructor = ` from mlir/include/mlir/Transforms/Passes.td (#183950)
7629c5cc32cc merge main into amd-staging
b7e20442d5ed [MLIR][ODS] Fix AllElementCountsMatch crash on dynamic shaped types (#183948)
2cb2fe7f2a1d [mlir][scf] Fix crash in ForOp verifier when body block has no arguments (#183946)
9801e752024f [ARM] tADDrSPi no side effects change (#183071)
74c0ee7e72bf [TTI] Remove TargetLibraryInfo from IntrinsicCostAttributes (NFC) (#183764)
6fa90a3f7e89 [MLIR][SymbolTable] Fix crash when SymbolTable is built on unverified IR (#183945)
d68d47db7b5a [ARM] Explicitly mark certain instructions as having no side effects. (#182771)
0f63db5c665b Attributor: Avoid calling identifyDefaultAbstractAttributes on declarations (#182663)
5768ee2dcdad [clang-repl] fix CleanUpPTU by removing decl according to C implicitly FuncitonDecl. (#178648)
d74c6b1176e9 [mlir][IR] Generalize `DenseElementsAttr` to custom element types (#183920)
9ffa08f097d4 [mlir][NFC] Fix typo in property predicate tests (#183987)
b872179bebe7 [AArch64][test] Add i256 codegen baseline tests (#183587)
d412b04a883c [UBSan] Wrap Location variants in anonymous union (#168866)
10b1b7857b05 [ASan] Mark recent integration tests as accordingly for MSVC (#135889)
b2ce908a48e0 [compiler-rt][CMake] Fix build when specifying --stdlib= (with 2 dashes) (#136111)
bf4ed7903aee [clang-tidy][NFC] Use singe mock vector header in tests (#183963)
dce9aaf48638 [Revert_patches.txt] cleanup (#1608)
c35a726ca979 [Clang][TableGen] Sort undocumented builtins after documented ones in generated docs (#183938)
da8d18190530 [libc][math] Cleanup shared/math (#183971)
4673cecc89d1 [MLIR][Python] Add support of `convert_region_types` and the bf integration test (#183664)
910988b9f15c [AMDGPU] Stop treating AMDGPU_CS_ChainPreserve as a module entry funtion (#183718)
7838bdfaaaea Amd/compiler/rlieberm/reland pch (#1607)
e3bad32ceb12 merge main into amd-staging (#1606)
0b423a5b2738 [MLIR] Fix invalid test after improving the error message (NFC)
2c3d5f958f22 [clang] use typo-corrected name qualifier for expressions (#183937)
20f36a2ff10f [MLIR][GPU] Improve error message on invalid pass option
b72d8ac98c6e [DAG] isKnownNeverZero - add ISD::EXTRACT_VECTOR_ELT handling (#183961)
df616fbe1c90 [lldb][lldb-dap] Correctly format lldb warnings in the debug console (#173852)
a0fb4f670848 [lldb] Add BytecodeSection class to formatter_bytecode.py (#183876)
a34fe9d5354d windows namspace ambiguity: remove using llvm
d89528150c26 [CMake][CodeGen] Add PCH (#183346)
b3af477f9862 [CMake][IR] Add PCH (#183303)
59ba10b9d38a [mlir][spirv] Fix crash when spirv.struct member type is not a SPIR-V type (#183942)
4a602c03ea05 [lldb][Process/FreeBSDKernelCore] Add riscv64 support (#180670)
3e05ab6322cb [ThinLTO] Reduce the number of renaming due to promotions (#183793)
e317f424557c [SLP]Recalculate dependencies for the buildvector schedule node, if they have copyable node
5ed875a06cb0 [lldb][lldb-server] Fix zip file lookup ignoring last entry in the zip file (#173966)
061714cd8c01 merge main into amd-staging
bf52cf2ee677 [Revert_patches.txt] cleanup (#1605)
3034c0966931 [clang-format] bugfix: Whitesmiths with IndentAccessModifiers (#182432)
4d724c074dd1 [X86] known-never-zero.ll - add tests showing failure to handle ISD::EXTRACT_VECTOR_ELT nodes (#183934)
1909e43a4adc [mlir][GPU] Fix crash in WarpExecuteOnLane0Op::verify with wrong terminator (#183930)
712f9637b278 [SimplifyLibCalls] Avoid simplifying pow(x, 2.0) -> x * x with math-e… (#1601)
baed2c8a31a3 merge main into amd-staging (#1600)
2430410b7d87 [lldb][Process/FreeBSDKernelCore] Add ppc64le support (#180669)
4a93b9a1b1ec [ARM] Lower strictfp vector fp16 rounding operations similar to default mode (#183700)
a6ceae48f56c [AMDGPU] Assert non-array alloca does have a size (#183834)
3d086f573dc4 [CIR] Implement ImplicitValueInitExpr for ComplexType (#183836)
7585ab05d6fb [AMDGPU] Enable shift64 hazard recognition for gfx9 (#183839)
d5a8f1eda29a [X86] known-pow2.ll - add tests showing failure to handle ISD::EXTRACT_VECTOR_ELT nodes (#183918)
5b64aeb409ec Revert "[mlir][IR] Generalize `DenseElementsAttr` to custom element types" (#183917)
2342db00ab4d [CMake] Use keyword signature in two additional callsites (#183889)
225b56e742fe [mlir][VectorToLLVM] Fix crash in VectorInsertOpConversion with dynamic index (#183783)
2f7c947946f4 Precommit tests: strictfp rounding vector f16 intrinsics (#183699)
e655c36c16c1 [mlir][IR] Generalize `DenseElementsAttr` to custom element types (#183891)
72525fb4ee37 [VPlan] Materialize UF after unrolling (NFCI).
94ebc8a95baf [LV] Remove duplicated IV expression sinking tests. (NFC)
e61d49ab51aa merge main into amd-staging
0b61f15f2e13 [AArch64] Add fcvt-i256 test cases. NFC
903acc2762d5 [AArch64][PAC] Emit `!dbg` locations in `*_vfpthunk_` functions (#179688)
ba0b395d3f25 [OpenMP] Remove NVPTX local addrspace on parameters (#183195) (#1598)
b3be782c4d14 [mlir][affine] Fix crash in linearize_index fold when multi-index is ub.poison (#183816)
f05b705dd3ce [mlir] Fix crash in testNoSkipErasureCallbacks on empty blocks (#183757)
245621408d03 Restore #125407, Make covmap tolerant of nested Decisions (#183073)
7370091a43e5 [mlir][test-ir-visitors] Fix noSkipBlockErasure crash with block args used across blocks (#183828)
c8e211c2a8b2 [mlir][tensor] Fix crash in expand_shape fold with dynamic result type (#183785)
b2c92bca2e66 [llvm-mc][dwarf] Bump supported version to DWARF 6 (#183779)
3403aac73418 [CMake][LLVM] Disable PCH on Clang for file with custom flags too (#183813)
9b1f7845227e [ARM][MVE] Add SLI and SRI recognition. (#183471)
8f0928252bbe [llvm][DebugInfo] Bump DWARFListTable maximum DWARF version (#183859)
ce3460e00272 [llvm][DebugInfo] Bump DWARFDebugLine maximum DWARF version (#183841)
c40b0b2235e5 [llvm][DebugInfo] Bump DWARFContext maximum DWARF version (#183838)
ab2908ed21e7 [LV] Add tail-folding & required scalar epilogue tests for IG narrowing.
55f9cf33fc14 RISCVMCAsmInfo: Remove redundant `UseAtForSpecifier = false`. NFC (#183890)
a3f9f6a82374 merge main into amd-staging (#1599)
1ff1e5f10a5c InstCombine: Stop applying nofpclass from use nofpclass attribute (#183835)
702e4ec5f705 [lldb/test] Skip TestDelayInitDependency on remote platforms (#183885)
3b30dcddd973 [Driver] Add -Wa,--reloc-section-sym= to control section symbol conversion (#183472)
27d654c4c4e6 [AMDGPU] Fix piggybacking after commute in AMDGPULowerVGPREncoding (#183778)
bed89970c3df AArch64: Replace @plt/%gotpcrel in data directives with %pltpcrel %gotpcrel (#155776)
ce6a3d98cc3e [clang-tidy] Teach `misc-unused-using-decls` that exported using-decls aren't unused (#183638)
04484e4c8fa7 [amd/device-libs] __builtin_elementwise_max ...
fe76e9004b5b [CodeGen] Allow `-enable-ext-tsp-block-placement` and `-apply-ext-tsp-for-size` passed together (#183642)
d72e95bab071 [CIR] Use `-verify` on clang/test/CIR/CodeGenHLSL/matrix-element-expr-load.hlsl (#182817)
0b88ee12dd88 [CIR] Infrastructure and MemorySpaceAttrInterface for Address Spaces (#179073)
53e538a99179 merge main into amd-staging
6f9c68d32074 [VPlan] Don't adjust trip count for DataAndControlFlowWithoutRuntimeCheck (#183729)
b281cdc5b244 [psdb] use latest rock CI backend
5f22decefac0 Clang: Deprecate float support from __builtin_elementwise_max (#180885)
62cfe1659edf [libc][math][c23] implement C23 `acospif` math function (#183661)
fb6b470caedc [libc][math] Refactor floor family to header-only (#182194)
a8d37d3cce19 [Flang][OpenMP] Unxfail omptarget-record-type-with-ptr-member-host.mlir (#1596)
e884a8cbcc51 merge main into amd-staging (#1597)
8bd8d8e6debe [AMDGPU] Remove extra pipes from load-saddr-offset-imm.ll (#183874)
5395d2668968 Revert "[WebAssembly] Incorporate SCCs into WebAssemblyFixIrreducibleControlFlow (#181755)" (#183872)
342e44603dc2 [AMDGPU][SIInsertWaitcnts] Move VCCZ workaround code out of the way (#182619)
795cfaea9cc8 [CIR][NFC] Move some builtin tests to the CodeGenBuitins folder (#183607)
7a5a92d27f68 Manual update of LLVM_MAIN_REVISION to 570809 (#1595)
12e1075b6495 [SLP]Fix operand reordering when estimating profitability of operands
fd9421cccd0b [lldb] Fix sys.path manipulation failure in formatter_bytecode.py (#183868)
e3c045415ae5 [CMake] Propagate dependencies to OBJECT libraries in `add_llvm_library` (#183541)
136ba6e208b2 [Hexagon] Define __HVX_IEEE_FP__ when -mhvx-ieee-fp is enabled (#183829)
dc520a5f493a [mlir][GPU] Add ValueBoundsOphinterface to gpu.subgroup_broadcast (#183848)
c78f37fdebd8 [CIR] Fix dominance problems with values defined in cleanup scopes (#183810)
07891ab5901c [cmake] Disable -Wdangling-pointer on GCC 12+ (#183593)
329c52c1004f [lldb] Change the way the shlib directory helper is set (#183637)
788625757ea4 [NFC] Fix use-after-free: track TargetLibraryAnalysis in BasicAAResult invalidation (#183852)
89d42b316a10 merge main into amd-staging
e35fc30cb8f5 Fix `BuiltinTypeMethodBuilder` uninitialized pointer (#183814)
0a9b5d52188f [libc++] Forward find* algorithms to find_if (#179938)
c5588becb8dd [lldb] Add skip shared build to more API tests
abbba22f4566 [lldb] Add synthetic support to formatter_bytecode.py (#183804)
7ad2c6db54a0 [mlir][arith] Add `exact` to `index_cast{,ui}` (#183395)
73d655a598d7 [VPlan] Support unrolling/cloning masked VPInstructions.
a0f79991dc3a merge main into amd-staging (#1592)
8f268e63e484 [Offload] Remove unused data type (#183840)
cdd431318318 [mlir][LLVM] Let decomposeValue/composeVale pad out larger types (#183825)
d7e037c8383e Revert "[VPlan] Remove manual region removal when simplifying for VF and UF. (#181252)"
94bd8b9444be [NFC] [MTE] add test for duplicated lifetime end
63ab568070c7 [NFC] [HWASan] add test for duplicated lifetime end
c2f66f2a940e [WebAseembly] Fix -Wunused-variable in #181755
a71ded3861aa [BOLT][AArch64] Add a unittest for compare-and-branch inversion. (#181177)
1073951bdb8e [mlir][cf] Fix crash in simplifyBrToBlockWithSinglePred when branch operand is a block argument of its successor (#183797)
d0afaeadecd0 [clang][modulemap] Lazily load module maps by header name (#181916)
977702ccc40d [clang] fix crash when casting a parenthesized unresolved template-id (#183633)
2c98566900f0 Revert "[Metal][HLSL] Add support for dumping reflection" (#183818)
02ebe23163c0 [ASan] Document limitations of container overflow checks (#183590)
fff2f0ba78fe [AMDGPU] Handle GFX1250 hazards between WMMA and VOPD (#183573)
fc153b1e254f [alpha.webkit.NoDeleteChecker] Check if each field is trivially destructive (#183711)
ca04a70891fb [libc][math] Refactor bf16sub family to header-only (#182115)
6f612cfbd921 [clang] stop error recovery in SFINAE for narrowing in converted constant expressions (#183614)
d1f4f9453c78 [flang] Fix explanatory messages for generic resolution error (#183565)
4f05592bc01c [Driver][SYCL] Add tests for -Xarch_<arch> option forwarding to SYCL JIT compilation. (#178025)
3d889c464eb1 [clang-format] Fix SpaceBeforeParens with explicit template instantiations (#183183)
df5bee6afc79 [CIR] Implement TryOp flattening (#183591)
8ce2b9cbc2bc [Clang][ItaniumMangle] Fix recursive mangling for lambda init-captures (#182667)
ee6f5f386f95 [InstCombine] Replace alloca with undef size with poison instead of null (#182919)
25d709e72c97 [SystemZ] Emit external aliases for indirect function descriptors in the ADA section (#183443)
cf28f23f1013 [SLP] Reject duplicate shift amounts in matchesShlZExt reorder path (#183627)
282a2b77c358 [clang][ssaf] Add `JSONFormat` support for `TUSummaryEncoding`
403fd7679f80 [SlotIndexes] Further pack indices to improve spill placement time (#182640)
dce48f2653cb [OpenMP] Enable internalization of 'ockl.bc' for OpenMP (#183685)
c05e323be7ca [WebAssembly] Incorporate SCCs into WebAssemblyFixIrreducibleControlFlow (#181755)
852c6ef5aca6 [mlir][LLVM] Let decomposeValue/composeValue handle aggregates (#183405)
48eb40bee024 [lldb-dap] Adjust VariableReferenceStorage lifetime management. (#183176)
ca0e7d31d05b [flang] [flang-rt] Addition of the Fortran 2023 TOKENIZE intrinsic. (#181030)
6301243a5d69 Reapply "[ValueTracking] Propagate sign information out of loop" (#182512)
c49460bae76c [flang-rt] Enable more runtime functions for the GPU target (#183649)
67a51ea34d25 [NFC][POWER] add Pre-Commit test case for Inefficient std::bit_floor(x) (#183363)
6e7c7131b2c3 [psdb] enable rock CI windows build for debug branches
0d95dda1eeee [LoopInfo] Preserve profile information in makeLoopInvariant (#174171)
c3b3f4195219 [SystemZ] Emit external aliases required for indirect symbol handling support (#183442)
1269a74db9ff [bazel] Enable `parse_headers` for llvm/BUILD.bazel (#183680)
179c25eaefe6 [MTE] [HWASan] support more complicated lifetimes
cd50a3074bdf Revert "[ThinLTO] Reduce the number of renaming due to promotions (#178587)" (#183782)
d2c545266b8b [RISCV] Use getCopyFromReg in unit test to match comment. NFC (#183199)
55d62abadbc5 [lldb] Add arithmetic binary addition to DIL (#177208)
5661ed60e37d [mlir][vector] Fix crashes in MaskOp::fold and CanonializeEmptyMaskOp (#183781)
dc26edd9b660 [ASan] Enable Internalization for 'asanrtl.bc' in Driver (#182825)
7f0a343a8ec4 [flang] Implement -grecord-command-line for Flang (#181686)
bad56dbb2385 [libsycl] Add sycl::context stub (#182826)
35f8ca8b76c6 [flang][NFC] Converted five tests from old lowering to new lowering (part 22) (#183681)
de4a1a77e147 [clang][modules] Prevent deadlock in module cache (#182722)
061762385805 [SPIR-V] Fix non-deterministic compiler output for debug type pointer (#182773)
d1da7f6ee5d7 [clang-scan-deps] Add test for symlink-aliased module map PCM reuse across incremental scans (#183328)
a703d91091ec [lldb-dap] Improve test performance for 'cancel' request. (#183632)
729602e81009 Revert "[SPIRV][NFCI] Use unordered data structures for SPIR-V extensions" (#183774)
975dba28633d [ThinLTO] Reduce the number of renaming due to promotions (#178587)
bb9122b3a558 [RevPatch] update PCH list of reverts
c7a20151621b Revert "[CMake][IR] Add PCH (#183303)"
9c53215d2131 [VPlan] Remove manual region removal when simplifying for VF and UF. (#181252)
4b10a4c17781 [mlir] Enable specifying bytecode producer in mlir-opt. (#182846)
e4a97cd05272 merge main into amd-staging
2265d3240f23 [pdb] Fix libc++ strict-weak-ordering assertion failures from gsiRecordCmp (#183749)
ed05f7012fe9 [mlir][vector] Rename `ReduceMultiDimReductionRank` -> `FlattenMultiReduction` (NFC) (#183721)
401163e3d2fa [psdb] enable rock CI windows build for debug branches
11a92a9305a7 [SystemZ] Add indirect reference bit XATTR REFERENCE(INDIRECT) for indirect symbol handling support (#183441)
4eab75e21fdf [SLP][NFC] Precommit test for zext reorder with duplicate shifts (#183748)
53656d1a2fad [clang][DebugInfo] Rename _vtable$ to __clang_vtable (#183617)
abe0c46e03cf merge main into amd-staging (#1586)
d8956d7796bb [SPIRV][NFCI] Use unordered data structures for SPIR-V extensions (#183567)
ef05d0610940 [lldb][Process/FreeBSDKernelCore] Implement DoWriteMemory() (#183553)
bcd8819aee05 [mlir][transforms] Fix crash in remove-dead-values when function has non-call users (#183655)
620425a88438 [mlir][tensor] Fix crash in tensor.from_elements fold with non-scalar element types (#183659)
f55b86258c91 [mlir][Python] Drop Python <=3.9 compatibility path (#183416)
370273382035 [SelectionDAG] Fix CLMULR/CLMULH expansion (#183537)
7cc27e28db97 [MLIR][Vector] Enhance shape_cast unrolling support in case the target shape is [1, 1, ..1] (#183436)
9c2a3ca4949e [MLIR] Fix OpenACC parser crash with opaque pointers (#183521)
a8a6613cc423 [AMDGPU][Scheduler] Fix compilation fail in EXPENSIVE_CHECKS (#183745)
7402312ae12d [NFC][SPIRV] Fix compile warnings (#183725)
9210d701cbf0 [MIR] Error on signed integer in getUnsigned (#183171)
bf3ab0d873bf [AMDGPU][Scheduler] Add `GCNRegPressure`-based methods to `GCNRPTarget` (#182853)
20df251af50b [LLVM][Runtimes] Add 'llvm-gpu-loader' to dependency list (#183601)
dc2ec04342de [gn] port 3490d28c8cab
8a0be0bc3772 [X86] Fold XOR of two vgf2p8affineqb instructions with same input (#179900)
48a9a2fd20a7 [Flang][OpenMP] Fix close map flag propagation for derived types in USM (#1557)
b2fdc435c823 merge main into amd-staging
2f4624613d05 [analyzer] Fix crash in MallocChecker when a function has both ownership_returns and ownership_takes (#183583)
e3dda81e2a80 [flang][OpenMP] Add `is_range<R>` trait to detect classes with begin/end, NFC (#183615)
fc69531254ca [LLVM][ExecutionEngine] Add vector ConstantInt/FP support to getConstantValue(). (#182538)
d8671280d4bf [VPlan] Add nuw to unrolled canonical IVs (#183716)
3676ae43bff9 [NFC][SPIRV] Remove dead code from `SPIRVPostLegalizer.cpp` (#183585)
6b91049f44d2 [Clang] support C23 constexpr struct member access in constant expressions (#182770)
4d169f38cab5 [LangRef] Clarify in vscale_range that vscale is a power-of-two without the attribute (#183689)
1a6bd39fd498 [flang] Use CHECK-DAG to check constants (NFC) (#183687)
14f73345ff0c [mlir][dataflow] Fix crash in IntegerRangeAnalysis with non-constant loop bounds (#183660)
c5c0fe663c7b [VPlan] Remove non-power-of-2 scalable VF comment. NFC (#183719)
98825908fc51 [mlir][affine] Fix crash in linearize_index fold when basis is ub.poison (#183650)
e7bc02d9a49f [SCEV] Always return true for isKnownToBeAPowerOfTwo for SCEVVScale (#183693)
49f4232a7d73 [AMDGPU] Remove unused CmpLGOp instruction (#180195)
b9f2a489607c [MemorySSA] Make `getBlockDefs` and `getBlockAccesses` return a non-const list (NFC)
5e30ff9e70be [lldb][test] Re-enable TestDyldLaunchLinux.py for Linux/Arm (#181221)
1afd7d40afe3 [AMDGPU] Support i8/i16 GEP indices when promoting allocas to vectors (#175489)
250ebfc30688 [X86] regenerate fcopysign test checks (#183710)
d6fcf47a8934 [libc++] Fix vector::append_range growing before the capacity is reached (#183264)
5e1d99158e0b [X86] stack-align.ll - regenerate test checks with no address scrubing (#183712)
294cf1f6b49b [X86] fnabs.ll - regenerate test checks and add AVX512 test coverage (#183709)
10b48e41e7d7 [InstCombine] Combine extract from get_active_lane_mask where all lanes inactive (#183329)
7a5ba652f08b [AArch64] optimize vselect of bitcast (#180375)
9e95cff5155a [AArch64] Add vector expansion support for ISD::FPOW when using ArmPL (#183526)
28cbc682a911 [NFC][analyzer] Remove NodeBuilders: part I (#183354)
4147cd29e1f2 [WebAssembly][FastISel] Emit signed loads for sext of i8/i16/i32 (#182767)
f71bd1c74fe8 [clang][bytecode] Add `Record::hasPtrField()` (#183513)
d43213fe8012 Revert "[VPlan] Don't drop NUW flag on tail folded canonical IVs (#183301)" (#183698)
16aa1900ef8f [clang][bytecode][NFC] Print more info in Pointer::operator<< (#183691)
c690414f8369 [clang][bytecode][NFC] Refactor visitDeclRef() (#183690)
a1f83ba1b6a7 [LV] NFCI: Move extend optimization to transformToPartialReduction. (#182860)
4a0f451cbd01 merge main into amd-staging (#1581)
5af5bd4f9867 [AMX][NFC] Match pseudo name with isa (#182235)
058705bf76af [Clang][NFCI] Make program state GDM key const pointer (#183477)
92704064e585 [VectorCombine][X86] Ensure we recognise free sign extends of vector comparison results (#183575)
a5bbedf522d4 [LV] Convert test to UTC. NFC
b0b3e3e1c7f6 [VPlan] Don't drop NUW flag on tail folded canonical IVs (#183301)
192acd6d536c [Clang][AMDGPU] Change __fp16 to _Float16 in GFX1250 WMMA/SWMMAC builtin definitions (#183493)
a107c1ccf18b merge main into amd-staging
32134a64b195 [mlirbc] Switch generator to enable write's with failures. (#182464)
ed8f080737de [Clang][docs] Fix proposal number typo for P1847R4 (#183671)
d471646607a6 Amd/dev/rlieberm/reland driver new (#1578)
f5bf00681c99 merge main into amd-staging (#1580)
86b99eff8c4d Revert "[Sema] Fix crash on invalid operator template-id (#181404)" (#183682)
07007b7c8d9e [lldb] Don't add remap entries for empty segments (#183651)
77600cbd9798 [MLIR][XeGPU] XeGPU Layout adds support for fractional-subgroup-size vector (#183434)
f30dfe7de4c3 Revert "[mlir-tblgen] Remove `namespace {}` around OpDocGroup (#182721)" (#183458)
b354b206d3be [SafeStack] Allow -fsanitize-minimal-runtime with -fsanitize=safestack (#183644)
5929c9040fac [mlir][vector] Fix fold result for empty vector.mask with no results (#180345)
8d5b74db2d8f [DenseMap] Add memory barrier for sanitizers in getInlineBuckets/getLargeRep
8f9c926868d1 Revert "AMDGPU: Fix runtime unrolling when cascaded GEPs present (#14… (#183641)
c056d7c5d6ea [Sema] Fix crash on invalid operator template-id (#181404)
46b6c9744f84 [LoopUnrollAndJam] Update test unroll-and-jam.ll (NFC) (#183520)
c1d33452468d [MLIR][Presburger][NFC] Don't add empty regions when unioning PWMA functions (#182468)
c78a4986b055 [RevPatch] PCH and Openmp
8224f11a2735 Revert "[CMake][CodeGen] Add PCH (#183346)"
6da19c1e02bd Revert "[OpenMP] Remove NVPTX local addrspace on parameters (#183195)"
b28ad9cb96c2 [llvm-dwp] Fix typo in --help
408209275e63 [LoopUnrollAndJam] Update test dependencies.ll (NFC) (#183509)
decb5d3ff6a1 [CIR] Remove branch through cleanup fixups (#182953)
361e2359860e [MLIR][Python] Support op adaptor for Python-defined operations (#183528)
9b708b003274 [mlir][arith-to-spirv] Fix null dereference when converting trunci/extui with tensor types (#183654)
99c463512a04 [MLIR] Do not abort on invalid --mlir-debug-counter values (#181751)
d149830b98f8 [AMDGPU] Pre-Commit tests for handle mbcnt in computeKnownBitsFromOperator (#178607)
26b4c25b8bce [flang][cuda] Add support for cudaStreamDestroy (#183648)
5e6f0c45a851 [Clang][Hexagon] Add QURT as recognized OS in target triple (#183622)
7c022af37ef2 [scudo] Add reallocarray C wrapper. (#183385)
00b3ce6b5abc merge main into amd-staging
7e39b280e860 [libc][math] Refactor nextafter family to header-only (#181673)
f2baaeb747b9 merge main into amd-staging (#1577)
20ec9a9bb725 build: correct `MSVC` and Windows mixup for `CLANG_BUILD_STATIC` (#183609)
e55945556a1e [scudo] Change header tagging for the secondary allocator (#182487)
2fc0733805e3 [AArch64] Decompose FADD reductions with known zero elements (#167313)
e92dd71f44c3 [RISCV] Add Defs = VXSAT to P extension instructions. (#183455)
c6db35fd343e [mlir][xegpu] Retain order attribute during load + transpose optimization. (#183608)
6bc9ba786d0f [Hexagon] Fix memory type for vgather intrinsics (#183563)
10abb231d6b4 [flang] Update the Flang Community Call to the new MS Teams series (#183576)
d5e501725e31 Reapply "[VPlan] Use VPInstructionWithType for Load in VPlan0 (NFC)"
46c06a34f1de [VPlan] Fixup C++ unit te…
6c9f97d to
9f3690c
Compare
…fd2a
36e42eaffd2a merge main into amd-staging (#1842)
be6a92d605e6 merge main into amd-staging
076226f378df [ELF] Separate relative and non-relative dynamic relocations (#187959)
5567572c44a0 [clang-tidy][NFC] Remove optimized container implementations in `misc-no-recursion` (#187630)
6dabcef0b3ff [MLIR][IRDL][Python] Fix error while composing `irdl.any_of` and `irdl.base` (#187914)
7482655a6b57 [clang] On Windows, silence warning when building with MSVC (#187937)
e1286d963eeb [mlir] Deterministic containers in BytecodeWriter (#187819)
f1bc5a216026 merge main into amd-staging (#1841)
98f84f9bf2df [clangd] Code completion for declaration of class method (#165916)
ce288f444102 [MLIR][XeGPU] Add distribution patterns for vector insert & extract ops in sg to wi pass (#184665)
8d64f56892b5 [Clang] Honour [[maybe_unused]] on private fields (#187940)
b2ba79578b35 [SLP]Fix patterns for compile time blow up with ordered reductions
c079372099cf [VPlan] Add m_VPPhi pattern matcher and use in removeDeadRecipes (NFC).
651482267dc2 [gn] port a2c0c436999
0e7a8ac6c15d [gn build] Port c6ba0e00161e
c58f32215d49 [gn build] Port 78729251fbb2
49b5d35f9d71 merge main into amd-staging
ea489fe12ee9 [llvm][ADT] Add PointerUnion benchmarks. NFC. (#187874)
ca3d04561ba8 [llubi][Github] Add llubi labelling support (#187877)
45039dfac472 [clang] On Windows, silence warning in `CFGBackEdgesTest` with MSVC (#187939)
33a14cbe35c1 [RISCV] Add guard to prevent GPRPair merge on targets without Zdinx or P (#186600)
88f830aed8ac [SLP]Do not try to reduced instruction, marked for deletion in previous attempts
34bc5d580b73 [MC,clang] Fix -Wa,--noexecstack not emitting .note.GNU-stack (#187880)
2566961cefad [clang-tidy] use-after-move: Support null_after_move annotations (#186903)
6eb5ac52ca56 [SystemZ] Remove custom lowering of f16 IS_FPCLASS (#187532)
b1cf9b0835d2 [Clang] Support constexpr for AVX512 compress intrinsics (#187656)
7a0c3401ffbf merge main into amd-staging (#1839)
26ee552ef583 [libc++] Fix includes in `<string>` for no-`wchar_t` modes (#187650)
720abd76e71f [clang][AST] Fix assertion in getFullyQualifiedType for DecltypeType (#187725)
4c4925f1a259 [LLD] [ELF] Make {bti,gcs}-report=none silence warnings from force-bti/gcs=always (#186343)
b4084bd21322 [clang] Detect pointee mutations in placement new expressions (#187508)
a0d550856342 [X86] Prefer branchless code with sbb for abdu (#187783)
997597d20dd3 merge main into amd-staging
5324c23d6c4e [llvm][DebugInfo] Use formatv instead of format in DWARFDebugLoc (#186800)
5b71607f384e [mlir][linalg] Specialize transform op - emit category ops (#187506)
a5472086ce24 [clang-tidy] False negatives readability-redundant-parantheses member of struct (#187054)
24546d96445a [clang][CodeGen] Use FieldDecl::getFieldIndex() in VisitOffsetOfExpr (#187826)
81b7a5daef22 merge main into amd-staging (#1837)
6f6adfbca46c [orc-rt] Add Session::tryCreateService convenience function. (#187640)
3258d361cbc5 [Clang] Use stable_sort in VerifyDiagnosticsConsumer. (#187827)
a9deba8f5c17 merge main into amd-staging
f014202dac32 [clang-format][NFC] Remove redundant parens enclosing braced list
2bbb4491a6fd [clang-format] Merge case alignment into AlignTokens (#187811)
aa62224a73b1 [libc][math] Refactor sqrtbf16 function header-only (#187849)
1120c971c849 [NFC][AMDGPU] Set output to null for llvm/test/MachineVerifier/AMDGPU/invalid-vop3-source-modifiers.mir (#187888)
09ca32edeefa merge main into amd-staging (#1836)
eea589f951e1 [libc][math] Qualify log with constant evaluation support (#184745)
1f1d3167b203 [clang-format] Correctly annotate Java lambda/sychronized blocks (#187842)
adcb17b47641 [Clang][HLSL] Fix -Wunused-variable
2be28d65fbf1 [gn] "port" 0ec9f7ebbdf72a94
1075a2fa7ed0 [Instcombine] Write Instcombine pass to strength reduce lock xadd to lock sub (#184715)
d3c7f23d2f3f [VPlan] Remove unused VPlan friend declaration from VPValue (NFC)
2702570686d7 [Bazel] Port 0ec9f7ebbdf72a94e0e2c0872d4d11fc8d0efa6b
b7d97d9e8dfa [BOLT] Remove outdated assertion from local symtab update logic (#187409)
6204bc78fdc5 merge main into amd-staging
f146677396ea [TargetLowering] Refactor expandDIVREMByConstant to share more code. NFC (#187582)
34203a53b60d [RISCV][Docs] Removed 'specified in' text from SiFive custom instruction links. NFC (#187817)
df9eb79970c0 [Clang][AMDGPU] Lower `__amdgpu_texture_t` to `<8 x i32>` instead of ptr adrspace(0) (#187774)
d818fa4c55c2 [MLIR][Python] Make init parameters follow the field definition order (#186574)
93d256b076b1 [llvm-pdbutil] Hash type records in yaml2pdb (#187593)
c1df6937bac1 [TargetLowering] Use legally typed shifts to split chunks in expandDIVREMByConstant. (#187567)
7d7cd745af22 [libc][math][c23] Add atanbf16 function (#184019)
82eee26ccccd [libclang/python] Fix Type.get_offset annotation (#187841)
4d058aeb08ca [lldb] Fix LLVMSupportHTTP linkage against libLLVM (#187848)
a2c37b3d474d merge main into amd-staging (#1834)
616240369e11 [SLP]Do not consider copyable node with SplitVectorize parent
acf9eede2302 [Object] Fix issues in BBAddrMap.h (#187704)
656fce889c65 [libclang/python] export libclang version to the bindings (#86931)
3b9106149c68 [lldb] Fix linking liblldb in a dylib build after 39d6bb21804d21ab
db143fb2b9e6 [SLP][NFC]Use block number instead of pointer for stable sorting, NFC
2d01df187f1c [CIR] Fix reference alignment to use pointee type (#186667)
43db30f5b223 merge main into amd-staging
1f9c54a15a87 [clang][AST] Preserve qualifiers in getFullyQualifiedType for AutoType (#187717)
b3c1098211c8 [lldb][test] PlatformTest: fix comment
64c4e529a956 [mlir][SPIRV] Add alignment calculation to support `PhysicalStorageBuffer` with vector types (#187698)
ac84b01655fa merge main into amd-staging (#1832)
1087db5b1f80 [clang-tidy] Speed up `bugprone-suspicious-semicolon` (#187558)
d7fe9c87ce9a [clang-format] Handle sizeof in C compound literals (#187633)
f0a652e351d8 [NFC][clang] Remove dead code in HandleCXXModuleDirective (#187737)
91dbaf5d89c0 [NVPTX] Print param space sub-qualifiers where supported (#187350)
a4ddd8faf8fc merge main into amd-staging
4cba4ad8d73b AMDGPU/GlobalISel: RegBankLegalize rules for pops_exiting_wave_id (#187778)
36798cfa860f [AMDGPU][GlobalISel][NFC] Change mbcnt test to use new-reg-bank-select (#187772)
8eccc21e47fb [libclc] Replace llvm-dis with llvm-nm in check-external-funcs.test (#187190)
2b47497f6e0a [libc][docs][NFC] Restructure Getting Started guide and update Build Concepts. (#187701)
b9d276748eca [lldb] Support arm64e Objective-C signing in the expression evaluator (#187765)
3d4dd4b94418 [lldb][test] Don't build PlatformLocateSafePathTest in non-asserts builds (#187829)
367da15a11c5 [MLIR][XeGPU] Enhance XeGPU lane layout to support "wrap-around" distribution (#186958)
0ae9aaf53973 [InstCombine] When rewriting GEPs, check that the types match. (#186886)
0ec9f7ebbdf7 [lldb] Add mechanism for auto-loading Python scripts from pre-configured paths (#187031)
6891a6ef0e24 [compiler-rt] Add bitmask to fix warning (#187812)
e1f885549095 Manual update of LLVM_MAIN_REVISION to 573671
bae6a2a76b63 [Runtimes] Fix Unix Makefiles race between runtimes-build and EXTRA_TARGETS (#187634)
335a2d0e7e5a [clang] fix error: cannot compile this l-value expression yet (#187755)
368f38b9fcfe [AMDGPU][SIInsertWaitcnts][NFC] SGPRInfo: Move score selection logic closer (#186518)
a60b3a83cf32 [libc] Fix function prototypes for <threads.h> C11 header. (#187808)
cf5f47b270b7 merge main into amd-staging (#1829)
51c158b5d29b [clang-doc] Fix incorrectly rendered specialization args in HTML (#187761)
720615f49586 [SandboxVec][DAG] Fix DAG update when user is scheduled (#187148)
8cc0124508a5 [scudo] Make the default for size/align checks to not die. (#187799)
7a5431eee3c2 [lldb][bytecode] Fix Update() and failing test (#187795)
485eda9cb32e [mlir][tosa] Fix crash in slice op folder when input values are not iterable (#187339)
a2615482d178 [libc][annex_k] Add constraint_handler_t. (#163239)
d514a4aa6a25 [HLSL][Matrix] Support row-major `transpose` and `mul` by inserting matrix memory layout transformations (#186898)
2b78c71cb5aa [lldb] Support -fptrauth-indirect-gotos in the expression evaluator (#187562)
79f3104e09e8 [lldb] Fix warning style for SymStore symbol locator (#187776)
343b566b5746 [TargetLowering] Move the MULH/MUL_LOHI legality checks to the beginning of BuildSDIV/UDIV. NFCI (#187780)
3eecb98b3702 [TargetLowering] Separate some of the control for the i32->i64 optimization out of BuildUDIVPattern. (#187739)
2d3b8cecfbcf [dsymutil] Require AArch64 backend in asm-line-tables.test (#187797)
a99dbc5be598 [HLSL] Add binding attributes to resources from structs (#184731)
9b30151594a7 [lldb] Support PointerAuthAuthTraps in the expression evaluator (#187612)
82f18b02d9fe [Clang] Rename OffloadArch::UNUSED to UNUSED_ to avoid macro collisions (#174528)
8c49c53a7f38 [dsymutil] Preserve DWARF debug info for assembly files (#187575)
b260861b388e [SLP]Update values after ordered vectorization
44c6a0acb70f [MLIR][XeGPU] Fix dpas f16 output layout (#184419)
1b44e34b18e8 [scudo] Add free_sized and free_aligned_sized (#186881)
66f06f54cb4d [mlir][acc] Sink constants into acc.compute_region when creating (#187777)
bd3b06b0a797 [AMDGPU][GlobalISel] Add RegBankLegalize rules for amdgcn.class (#178827)
bb369f1c30a1 [libc][x86] Add Non-temporal code path for large memcpy (#187108)
827ddb205144 [AMDGPU][SIInsertWaitcnts] Add test functions in waitcnt-wcg-attributes.mir (#186504)
dd30239f3315 [AMDGPU] Add basic verification for source modifiers (#186733)
498dd13f7228 Add VDS encoding for gfx13 (#187693)
950eaaabe318 [Clang] Use stable_sort for UnqualUsingDirectiveSet for determinism in ambiguity notes (#187750)
cfc94a6fd7fa [flang][OpenMP] Introduce `WithReason<T>` for nest/sequence properties (#187563)
78b651a2cba9 [RISCV] Fix the pipe used by `fmv.x.<fp>/<fp>.x` in SiFive7 sched model (#187740)
63c9573f1c6b [LSR] Add regression test for unnecessary phi introduction (#187751)
9f76933b07fa merge main into amd-staging
9431920bfaee [llvm] Silence llvm-debuginfod-find/headers-winhttp.test on Windows bots temporarily (#187753)
07896d44a320 [OpenMP] Emit aggregate kernel prototypes and remove libffi dependency (#186261)
60db764b90b1 [Bazel] Port a2c0c43699917bb26a3eb20fefcbf29ff120ce70
0ec6e1d21e83 [CIR] Address Space support for GlobalOps (#179082)
4a5da64759ae [CIR][NFC] Minor cleanups to missing feature markers (#187754)
bc6a265e3be2 [offload] Use flang-rt for test feature requirements (#187733)
eaeca6d2fb88 [clang] fix #187352 breakage on 32-bit platforms (#187741)
94e366ef2060 [SLP] Initial support for ordered reductions
5717524c02c2 [mlir][acc] Use index for acc.par_width results (#187734)
65d84ea12725 [lldb] Update the calling convention of BytecodeSyntheticChildren::Update (#182155)
d8e1f50780e1 [lldb][bytecode] Document invocation in compiler output (#187547)
e835f8b687f6 [flang][OpenACC] Fix reduction init value for minnumf/minimumf/maxnumf/maximumf (#187647)
965ee6c91f7f [FIRToMemRef] copy ACC Variable Name attribute (#187724)
537a8cc745ed [IR][NFC] Fix MSVC deprecation warnings about BranchInst (#187702)
5a14e4f231bb [flang] Implement SPLIT intrinsic subroutine with tests (#185584)
68a9e9ca3e93 [GlobalISel] Add G_ABDU and G_ABDS to computeKnownBits. (#186822)
a2c0c4369991 [clang][ssaf] Consolidate tools and shared utilities under `clang/tools/clang-ssaf/`
ca54948d0b7a [ASan] Fix missed poisoned suffix in first granule in __asan_region_is_poisoned (#187466)
81e3364e7608 [OpenMP] Emit aggregate kernel prototypes and remove libffi dependency (#1810)
d7dbba55bff5 [llvm] Run headers-winhttp.test only if the Python side of it works (#187727)
18f7e625bd78 Revert "[AMDGPU] Generate more swaps" (#187723)
98eaa95baeb7 [flang][OpenMP] Store bad ExecutionPartConstruct in LoopSequence (#187556)
97a1a7020881 [SPIR-V] Fix SPV_INTEL_long_composites continued instruction handling (#187262)
c3e7b4556e40 [offload] Define flang-rt as an available test feature (#187732)
d8104bfc9e9d [SPIR-V] Add `llvm.spv.named.boolean.spec.constant` (#187420)
bf57f910abbe [Clang] Fix -Wunused-variable
e1347d197723 [SPIR-V] Fix linker error after #187685 (#187722)
063109f758aa [NewPM] Adds a port for AArch64MIPeepholeOpt (#187515)
bb070ea56b3e [SPIRV][NFC] Update `SPV_INTEL_function_pointers` tests to check `spirv-val` output (#182549)
93d7583f4f2d [AMDGPU] Update features for gfx1170 (#186107)
2bb0fa46a816 [SLP]Prefer copyable over alternate
20768a957654 [ACC] Use ExistingOps strictness in ACCSpecializeForDevice for non-specialized functions (#187645)
6d45f6dfb7e1 [clang-tidy] Generate valid JSON for characters that require escaping (#187454)
8cbf3f3c393d [GlobalISel] Fold `a bitwiseop (~b +/- c)` -> `a bitwiseop ~(b -/+ c)` (#181725)
a2446e25f05a reland 2 SLP PRs (#1812)
a6cc2f5e07b8 [AMDGPU] Remove `_e32` patterns for i32 saturated conversions (#187715)
c6ba0e00161e [clang][ssaf] Add whole-program analysis execution layer
0d251db0f1d9 [NVPTX] Split NVVM annotation query helpers out of NVPTXUtilities (NFC) (#187349)
376907e09303 [NFC][clang][analyzer] Complete enumeration in emitCrossTUDiagnostics (#187646)
f064a9979ff5 [DAGCombine] Optimize away cond ? 1 : 0 post-legalization (#186771)
fa49ad564bae [libc++] Fix random_shuffle signature in C++03 mode with frozen headers (#186443)
0506c03802b1 AMDGPU/GlobalISel: RegBankLegalize rules for readlane, writelane (#187386)
7cc46928d9d3 [ARM] Add a phase ordering test for multiple reductions. NFC
702582182d4d [llc] Flatten SkipModule branch and sink defs to their use(NFC) (#187661)
f58b6754a1f5 [clang][ModulesDriver] Fix build failure with Xcode 14 (#187713)
f5e2238a3e14 [MLIR][XeGPU] Enhance multi-reduction layout propagation rules (#186308)
4c60d48e1bbc [clang] Don't dllexport inherited constructors with -fno-dllexport-inlines (#187684)
a3db68a97b2c [llvm] Restrict llvm-debginfod-find test to localhost to fix winhttp case (#187705)
d339d0053ce2 [gn] port a021a93e5320d8
17d2890e7e41 [gn] port 7bf871c39f739
4a1e9f73103f [CIR] Make the -save-temps flag emit .cir and .mlir files (#186814)
40a6180cea1b [Flang] - Fix AliasAnalysis to preserve Allocate source kind through box loads (#187152)
646c4a00ee38 merge main into amd-staging (#1823)
78729251fbb2 [SPIR-V] Fix isAggregateType function implementation (#187685)
6feced2a7cc2 Fix select-best-vf-tripcount.ll buildbot failure
52452aa447d2 [CFG] Support CycleInfo in isPotentiallyReachable() (#187681)
a70419505471 [AMDGPU] Shrink S_MOV_B64 to S_MOV_B32 during rematerialization (#184333)
9ab77fa8daf2 [gn] port aa3465793a250
838354d9d805 [X86] Use GFNI for vXi8 per-element shifts (#89644)
d3a8666ec82d [mlir][spirv] Add reduction ops in TOSA Ext Inst Set (#187278)
bd3ba6042b3b [AMDGPU] Remove unused forward declaration of GCNSubtarget (#187695)
e3959a9c502e [NFC][Object] Move BBAddrMap related types to a shared header (#187268)
35ebb8c37077 [AMDGPU] Saturate at i16 for f16 to i1/i8 conversion (#187467)
da8d0ab20287 [flang][NFC] Converted five tests from old lowering to new lowering (part 36) (#187628)
19b0c68ee012 [VPlan] Skip epilogue vectorization if dead after narrowing IGs. (#187016)
2600c723e101 [libc][NFC] Fix typo in file.cpp (#91192) (#187688)
a6a34333a113 [analyzer] Don't rule out symbolic pointer pointing to stack (#187080)
bdc8d9293d28 [OFFLOAD] Add GPU wrappers for headers currently supported by SPIRV built libc (#181913)
1dfd268f1008 [VPlan] Simplify mul x, -1 -> sub 0, x (#187551)
b6accfa0b46f [LV] Regen induction-ptrcasts test with UTC (NFC) (#187678)
39d6bb21804d [lldb] Add HTTP support in SymbolLocatorSymStore (#186986)
22f5b8db125c libclc: Update acos (#187666)
43b4028d54e9 [RevPat] remove SLPs
6a29e02059b1 Merge branch 'amd-staging' into amd/dev/rlieberm/relandSLP
a021a93e5320 Revert "Reapply [clang][analyzer] Format macro expansions" (#186614)
f5b00daaae86 merge main into amd-staging
214bc4db9cc9 [X86][AVX10.2] Canonicalize narrow FP_TO_{S,U}INT_SAT (#186786)
277bd13cc6fc [analyzer] Fix logic in CallEvent::getReturnValueUnderConstruction (#187020)
172c0bbfbd56 [clang-tidy] Fix alphabetical order check for multiline doc entries and whitespace handling (#186950)
66bc5652bde0 [BAZEL] Add missing affine python enum gen (#187669)
21f439f13250 [LoopRotate] Use SCEV exit counts to improve rotation profitability (#187483)
14de6dafee4d [SPIR-V] Support global variable annotations in llvm.global.annotations (#187241)
e6789f94b9cb [AMDGPU] Introduce ASYNC_CNT on GFX1250 (#185810)
895c281515fb [AArch64][GlobalISel] Remove fallback for scalar usqadd/suqadd intrinsics (#187513)
4376bf27c1ef [clang-tidy] Fix "effective" -> "efficient". (#187536)
4b17135d14aa [LV] Simplify `matchExtendedReductionOperand()` (NFCI) (#185821)
78f267f01d93 Reapply "[clang][bytecode] Allocate local variables in `InterpFrame` … (#187644)
ab28384cd3e1 [ExpandMemCmp] Remove unused TM/TLI dependency (#187660)
d97adc4314a6 [X86] Perform i128/i256/i512 BITREVERSE on the FPU (#187502)
689afb5ecd15 Windows release build: Add checksum verification for downloaded source archives (#187113)
69cd746bd2f1 [llc] Add -mtune option (#186998)
4df296733da9 [lldb] Implement llvm::formatv overload for Stream::operator << (#187462)
facc82de4f21 [clang][cir] Adding myself in CODEOWNERS for CIRGenBuiltinAArch64.cpp (#187570)
c8dd82916bc3 libclc: Override cbrt for AMDGPU (#187560)
edbe8277c104 libclc: Use log intrinsic for half and float cases for amdgpu (#187538)
a5de509e4ecd libclc: Rewrite log implementation as gentype inc file (#187537)
441790b31f06 [AArch64] Use an unknown size for memcpy ops with non-constant sizes. (#187445)
421bf13e4bf1 libclc: Update trigpi functions (#187579)
a971089cb817 [LV] Explain why a less profitable VF was chosen (NFCI) (#187469)
7f8e23613685 libclc: Implement sin and cos with sincos (#187571)
090c40545f0f libclc: Replace flush_if_daz implementation (#187569)
5599d6018758 [Coroutines][NFC] Elide coro.free based on frame instead of coro.id (#187627)
a4f97f0d9019 [CIR][AMDGPU] Add module flags for AMDGPU target using amendOperation of CIRDialectLLVMIRTranslationInterface (#186073)
e42c16f12de2 [NFC][clang] Add HandleModuleName to avoid redundant module name handling code (#184014)
416935e29f5a [orc-rt] Add BootstrapInfo argument to ControllerAccess::connect. (#187635)
2a10df7dadf4 merge main into amd-staging (#1820)
f5e28768dc7b [clang-format] Fix an AllowShortNamespacesOnASingleLine bug (#187451)
984417de3f37 [lldb][doc] Add cross-compilation guide for FreeBSD (#186216)
2bef931c4349 Reland "[flang][openacc] Prevent SimplifyArrayCoorOp from folding rebox used by ACC data entry ops (#187616)" (#187626)
c3f381ccfe4b [mlir-python] Fix duplicate EnumAttr builder registration across dialects. (#187191)
fa2df7e853d7 [flang][NFC] Converted five tests from old lowering to new lowering (part 35) (#187407)
caf079fbb2f7 [clang][headers][endian.h] add some common extensions (#187565)
19ced5ad8248 [orc-rt] Redesign Session to provide a clearer lifecycle. (#187496)
2e88fe7021ad Revert "[flang][openacc] Prevent SimplifyArrayCoorOp from folding rebox used by ACC data entry ops" (#187625)
bf1db77fc87c Revert "[clang][bytecode] Allocate local variables in `InterpFrame` tail storage" (#187410)
3012c60d227a merge main into amd-staging
89d8fe9d08c3 [LoongArch] Ensure .dwo sections do not contain relocations (#187429)
828da6176c49 [M68k] Fix pipeline.ll test after CodeGenPrepare analysis change (#187617)
979048e04dc6 [flang][openacc] Prevent SimplifyArrayCoorOp from folding rebox used by ACC data entry ops (#187616)
bf748ea653d3 [offload] - Remove standalone build in favor of 'runtimes' (#1817)
f08b0cc1f67b merge main into amd-staging (#1819)
5881ce66b121 [Bazel] Port 7efcd6198c8d15a1ab2ae2b30a3aa8b5168ef9cc
0e53fbcc43b5 [Bazel] Port 81e8a1e59ee28e5403d8a78874c37f853d18d4fb
9e54ca12e91c [AtomicExpandPass] Remove AtomicExpandUtils.h (NFC) (#187609)
edf0fb81326a [GISEL][NFC] Remove a useless assert in constrainSelectedInstRegOperands (#187592)
366da1252b2c [libclc] Restore previous generic fmod implementation (#187470)
df85f45bf1c1 [libc][docs][NFC] Add Build Concepts and consolidate patterns (#187490)
2f076c383ebb [libclang/python] Deprecate _CXUnsavedFile, introduce UnsavedFile instead (#187412)
3991dcbbd55d [gn] port 81e8a1e59ee28e54
8ccda467e100 [gn] port 02451f54d642ae
8771fd92f641 [gn build] Port d18a784d4106
7a544217fe54 [gn build] Port b17db271d030
d7dc03bc5bde [gn build] Port 4f298d4efa2f
13d99cecf03c [gn build] Port 39b6a4d84ad8
efc11383c9be [CIR] Add lowering for bool attributes (#187590)
9eb852c0627b [CIR] Implement global variable replacement in global view (#186168)
3fdc82c9ab13 [flang][openacc][cuda] Fix order of clause processing for host_data directive (#187600)
0e605742be6d [raw_socket_stream] Fix a file descriptor leak when connect failed (#187574)
537e7b586e15 [clang][DependencyScanning] Fix misplaced Driver includes (NFC) (#187599)
ec3a7192cc7f [X86] bitreverse.ll - add additional i128/i256/i512 GFNI test coverage for #187502 (#187552)
cbab7e65a720 [AMDGPU] Minor cleanups in offload plugin and AMDGPUEmitPrintf. NFC. (#187587)
61b9fc1d4225 [CIR] Upstream CUDA mangling test with LLVM and OGCG verification (#184444)
9cb1e372ddfa [Clang][AMDGPU] Minor driver cleanups. NFC. (#187586)
7efcd6198c8d [libc] Modular printf option (float only) (#147426)
4e19eee8a61a [NFC] Annotate CommentFlag with underlying type (#186560)
e895a80b5d65 [lldb][TypeSystem] Add CompilerType::IsMemberDataPointerType (#187172)
59bc629bf37a [AMDGPU] Fix decoding of SETREG MSBs (#187578)
33cfe2843b58 [DirectX] Fix TypedBuffer load shader flag mismatch (#187393)
f276ad429091 [VPlan] Make sure Inductions outlive returned VPlan.
25f0d08bafa4 [clang] add x86_64 baremetal triple include search paths (#183453)
81e8a1e59ee2 [clang][modules-driver] Add dependency scan and dependency graph (#152770)
33f7d655a65a merge main into amd-staging
b03e3d1c262c [lldb] Fix Python 2 prints in the docs (#187553)
a9ca888fc4f4 [IR][CAPI] Mark LLVMIsABranchInst as deprecated (#187576)
10068ee79b1b Regen CodeGen/voidptr-vaarg.c
c5aefc77534b [flang] Downgrade an overly strict error to a warning (#187524)
101799100b73 [X86][GISEL] Port X86PostLegalizerCombiner to npm (#182787)
964bf036a105 [RevPat] record layout revert
963848e93fb9 Revert "[Clang][CodeGen] Restore isEmptyFieldForLayout for empty class handling"
11b439c5c5a0 [DTLTO] Speed up temporary file removal in the ThinLTO backend (#186988)
9ae3077ae921 [clang][modules] Remove `Module::ASTFile` (#185994)
2632ffeab1b4 [libc][stdio] Fix standard streams in overlay mode. (#187522)
adbb1227174f [libc] Implement iswprint entrypoint (#185251)
8e1e371561e7 [IR][NFC] Mark BranchInst as deprecated (#187314)
ae6fbd0fb753 [mlir][linalg] Fix vectorizer generating invalid vector.gather for 0-D tensor.extract (#187085)
7d188c5118d2 merge main into amd-staging (#1815)
fd3cf1c1604e [LV] Move dereferenceability check from Legal to VPlan (NFC) (#185323)
486bd960993d [libc++][NFC] Remove redundant guard for `__cpp_lib_destroying_delete` (#187473)
88cbac0495b4 [libc++] Unify python shebangs (#187258)
b6543c98d7e4 [TargetLowering] Make sure LL/LH are always initialized in expandDIVREMByConstant
721775bf88e1 clang-c/Index.h: Fix typoed comment (#144219)
4df2725a2e05 [AMDGPU][AMDGPUBaseInfo] Replace Waitcnt members with array (#182927)
f66bd8e81a0b [LLVM] Add flatten function attribute to LLVM IR and implement recursive inlining in AlwaysInliner (#174899)
90d3944c4ad1 [CIR] Implement VisitCXXStdInitializerListExpr for 2 ptr layout (#186679)
7d76a3122dde [SLP]Improve analysis for the shl-based reduced values with copyables (#185485)
a52a3f6c7a04 AMDGPU/GlobalISel: RegBankLegalize rules for s_sleep_var, s_prefetch (#187382)
584c83cb1527 [Clang][AMDGPU] Add clang builtins for buffer format load/store intrinsics (#187064)
4f298d4efa2f [lldb] Support arm64e in the expression evaluator (#186001)
05ae66851da5 [lldb][bytecode] Swift output is conditional on >=6.3 (#187544)
f334194167b7 [BasicAA] Use KnownBits trailing zeros to boost GCD in modular aliasing check (#187297)
e405a1195716 [lldb][debugserver] Get the size of the shared cache in mapped VM (#187419)
d96722b66022 [LLVM] Improve IR parsing and printing for target memory locations (#176968)
467cf7caeda8 [SandboxIR] Implement UncondBrInst and CondBrInst (#187196)
7925ef6df83c [clang] fix crash related to missing source locations for converted template arguments (#187352)
1418f80d5c4d [mlir][tensor] Forward concat insert_slice destination into DPS provider (#183490)
300a19d83e29 [X86] bitcnt-big-integer.ll - add vXi128/vXi256 ctpop test coverage for #187447 (#187533)
de9746cdfeb4 [clang-doc] Add individual target for unit tests (#185695)
1d9762a07765 [RISCV] Add scheduling models for `sifive-x160` and `sifive-x180` (#187089)
23d8651de3f8 [lldb][bytecode] Remove tracking of stack temps in compiler (#187401)
555caa18762f [clang-tidy] Fix `readability-else-after-return` breaking code by deleting too many characters (#187437)
ffa8ba8ce2d1 [NFC][LAA] Minor stylistic/comments improvements (#185510)
dfafee7a4782 [SCEV] Convert more interfaces to use SCEVUse (NFC). (#185045)
65f6a346a96a [NFC][analyzer] Eliminate IndirectGotoNodeBuilder (#187343)
3f5d6bdd2ac0 AMDGPU/GlobalISel: RegBankLegalize rules for buffer atomic add/sub (#187405)
b20d21aaf164 [C23] Downgrade WG14 N3037 implementation status to partial (#187495)
422dabeb4e82 [RISCV] SFB with Immediates to QC.MVccI (#186555)
55e6683fe4a2 [NFC][AMDGPU] Move SWMMAC features into specific target feature sets (#187394)
9044b0f17162 [DebugInfo][CodeView] Support `S_DEFRANGE_REGISTER_REL_INDIR` (#186410)
4ea9c1a4564a [LICM] Mark load function as willreturn in test (NFC)
1d854bd51bf0 AMDGPU/GlobalISel: RegBankLegalize rules for s_sendmsg (#187361)
bdeb18a74e59 [llc] Enable -mattr=help regardless of -mattr order (#187269)
2ec08b31941d [LSV] Added check for mismatched GEP strides in getConstantOffsetComplexAddrs (#186671)
4cbb67a96219 [AMDGPU] Use empty() instead of size() comparisons. NFC. (#187424)
d0caa41c51ce [GISel] import pattern `(A-(B-C)) to A+(C-B)` (#181676)
9050794e06cf [SLP]Improve reductions for copyables/split nodes
593683f9a0eb [OpenACC][NFC] Generalize wrapMultiBlockRegionWithSCFExecuteRegion (#187359)
83b378b38196 Unsupported: llvm/test/Transforms/LoopVectorize/Sparc/no-vectorize.ll
e2c9dde1a5f3 AMDGPU/GlobalISel: RegBankLegalize rules for s_ttracedata (#187342)
c9f6ad8d4299 [libc++][docs][NFC] Update Open XL supported version to 17.1.4 (#176112)
a693970f0793 [LICM] Regenerate test checks (NFC)
b7776ccebe9d [CIR] Add support for array new with ctor init (#187418)
d18a784d4106 [compiler-rt] Define GPU specific handling of profiling functions (#185763)
923cc2d43b41 [AMDGPU] Fix alias handling in module splitting functionality (#187295)
d8a83a11231f [NFC][SPIR-V] Disable tests failed after spirv-val update (#187028)
d049eef4b5ab [DAG] Use value tracking to detect or_disjoint patterns and add a add_like pattern matcher (#187478)
4199bb1a8149 [AMDGPU] Simplify loop in AMDGPULowerVGPREncoding::handleCoissue. NFC. (#187511)
c5c0b8348e6c [mlir][memref] Rewrite scalar `memref.copy` through reinterpret_cast into load/store (#186118)
c63ce62f7cf6 [NFC][AMDGPU] New test for untested case in SILowerI1Copies (#186127)
2754e35f7347 [mlir][EmitC] Support pointer-based memrefs in load/store lowering (#186828)
201d3547cce1 [AMDGPU] Clean up `LowerFP_TO_INT_SAT` in AMDGPUTargetLowering (#187486)
e1aef9e22748 [libc++] Fix missing availability check for visionOS in apple_availability.h (#187015)
70bb9e24526a [CycleInfo] Index using block numbers instead of pointers (#187500)
5ae5f9df42f7 [DA] Check nsw flags for addrecs in the Exact SIV test (#186387)
bc2a8ef6f567 [lldb][NativePDB] Remove cantFail uses (1 out of ?) (#187158)
989ea0e2d726 [MLIR][XeGPU] Lowering 2-Dimensional Reductions of N-D Tensors into Chained 1-D Reductions (#186034)
8ca7a336fb10 [SCEV] Generate test checks (NFC)
cf92512e0968 [DebugInfo] Add Verifier check for local imports in CU's imports field (#187118)
807377492e7c [MemorySSA] Fix EXPENSIVE_CHECKS build
cdaf29f84dd0 Revert "[LV] Simplify and unify resume value handling for epilogue vec." (#187504)
b55f6dbb35f1 [clang][ssaf] Improve layout of `clang-ssaf-format --list` by adding a separator between name and description
153c230446ca [PDB] Fix and simplify module index lookup (#179869)
ef4f87425c51 [analyzer] Fix [[clang::suppress]] for friend function templates with namespace-scope forward-declarations (#187043)
da92bc06ff47 [mlir][acc] Support call target handling for bind(name) (#187390)
44e306ecdb02 [Clang] Correctly link and handle PGO options on the GPU (#185761)
b227fab5a602 [NFC][LV] Introduce enums for uncountable exit detail and style (#184808)
bed9fa2de54a [libc][sys/sem] Add sys v sem headers and syscall wrapper implementation (#185914)
0e7262407ca6 [offload] - Remove standalone build in favor of 'runtimes' (#170693)
e8556ff6b664 [NFC] Remove fractional part of costs in maxbandwidth-regpressure.ll (#187498)
b91c5a7701e1 [AMDGPU] Test saturated f32 to i8 conversion on vectors (#187487)
068176a50371 [Analysis] Remove LLVM_ABI annotations from llvm/lib/Analysis/BranchProbabilityInfo.cpp which cause build errors (#187388)
e3415da3cd12 [Flang][OpenMP] Permit THREADPRIVATE variables in EQUIVALENCE statements (#186696)
a32d2695c38f [bazel] Gate GPU parsers behind llvm_targets (#187213)
a3e3fed088bd [CodeGen] Declare MachineCycleInfo in headers (#187494)
2e2bcf785519 [AMDGPU] Remove unused forward declaration
dddf01cc1457 [RISCV] Relax out of range Zibi conditional branches (#186965)
76f725257137 [FastISel] generate FAKE_USE for llvm.fake.use (#187116)
d641186cb61e [clang-cl] test that `-Xlinker` works, update supported options docs (#187395)
18ed1a9414b2 [X86] Add bitrevese/bswap i128/i256/i512 test coverage for #187353 (#187492)
78a8f0097796 Revert "[VPlan] Create header phis once regions have been created (NFC)."
289c58823150 [X86] Optimize load-trunc-store for v4i16/v2i32/v2i16 vectors (#186676)
1078a1dabd68 Lowering `~x | (x - 1)` to `~blsi(x)` (#186722)
49a5192e5d70 [CycleInfo] Don't store top-level cycle per block (#187488)
7d02ca610b9c [mlir][LLVM] add llvm.fake.use to LLVM dialect (#187026)
796b218edd35 [LegalizeTypes] Expand UDIV/UREM by constant via chunk summation (#146238)
582fa7875374 [SLP]Do not match buildvector node, if current node is part of its combined nodes
a0b5025752ea Revert "[SLP] Loop aware cost model/tree building"
191c84b822e0 [VPlan] Permit derived IV in isHeaderMask (#187360)
6aeeae676ac4 [SPARC][Tests] Add lit.local.cfg to SPARC LoopVectorize tests (#187489)
2babd1709e69 merge main into amd-staging
b029b9879749 [X86] Add i128 bit manipulation pattern test coverage (#187480)
23af867e6d93 [SPARC] Add TTI implementation for getting register numbers and widths (#180660)
c3e7624ac4bd [clang] Add implicit std::align_val_t to std namespace DeclContext for module merging (#187347)
f104b7355ce8 [NFC][SPIRV] Run `spirv-val` on tests related to `SPV_ALTERA_arbitrary_precision_integers` (#187464)
76638021257f [LLVM][DAGCombiner] Limit extract_subvec(extract_subvec()) combine to vectors of the same type. (#187334)
c58d62857e4f [STLForwardCompat] Switch transformOptional from direct call to invoke (#186333)
d29c6a34255e [TabelGen] Use ID{n-m} for outer let statements (#187436)
eaf04be3417c [SPIR-V] Complete SPV_INTEL_16bit_atomics extension support (#184312)
e122a2d53193 [flang][OpenMP] Remove extraneous semicolon, NFC (#187468)
0d05c882ce99 [Support] Use block numbers for LoopInfo BBMap (#103400)
333ac33be6f4 [Analysis][NFC] Include LoopInfoImpl only in source file (#187459)
4262045ba98d [DebugInfo] Fix segfault in constructSubprogramScopeDIE with null subprogram type (#184299)
621d40e1827e Revert "[clang-tidy] [Modules] Skip checking decls in clang-tidy" (#187461)
2bb6b5902877 [libsycl] add USM alloc/free functions (#184111)
d518f8ff6740 [MemorySSA] Fix handling of cross-iteration dependencies for calls (#187291)
1f8da277148f libclc: Really implement half trig functions (#187457)
1ba5b6e875d1 libclc: Stop implementing sincos as separate sin and cos (#187456)
7fad49b186c8 [AArch64][SVE] Prefer FMOV for scalar insert into first element of zero. (#187236)
6e8ca5edde05 libclc: Fix nextafter with -cl-denorms-are-zero (#187358)
85e9ac589819 libclc: Add canonicalize utility functions (#187357)
9b7c437033b2 libclc: Update f64 trig functions (#187455)
0960f0b8feb3 libclc: Really implement denormal config checks (#187356)
c800afd65e12 [clang-tidy] [Modules] Skip checking decls in clang-tidy (#145630)
a54c1490611c libclc: Invert subnormal checks (#187355)
07ff1a63dac5 [CompilationDatabase] Treat .cppm file as C++ in InterpolatingCompilationDatabase (#187446)
ae66911399b7 [lldb][Platform][NFC] Move SanitizedScriptingModuleName into ScriptInterpreter (#187229)
ca5e4bcc5715 [NFC] [clangd] [Modules] Leave more log for failing cases (#187448)
bdfd9725afd1 libclc: Move subnormal config file to clc (#187354)
1c3c349cf8a3 [clang-format] Fix stale .lock files in git-clang-format (#187379)
3e90e1a26fd1 [docs][QualGroup] Update Qualification WG sync-up schedule and calendar links (#186011)
e3198dbe59ab libclc: Move FLT_MIN gentype macros (#187272)
9e6ce65962c4 libclc: Fix vector float tan (#187387)
79042e701b7b [clang-format] Add LeaveAll to the BreakAfterAttributes option (#187204)
f554fcfd0b75 Revert "[flang][openacc][cuda] Add implicit device attribute for use_device unconditionally" (#187438)
55607559203c [flang][OpenACC] Fix crash on invalid clauses in WAIT and ATOMIC constructs (#187263)
4db2ce4d546f [libc][math] Refactor dadd family to header-only (#182142)
2e6740541ba9 [clang][headers][endian.h] include_next in freestanding (#187380)
8c1896d9067b [clang-tidy][NFC] Compare nodes by pointer instead of by ID in `readability-else-after-return` (#187363)
e9bf455ae9a5 [SLP] Loop aware cost model/tree building
160ac07cbd4a [SLP]Add external uses estimations into tree throttling
7335ebf5f8ca merge main into amd-staging (#1809)
f1c8b9b4aad9 [Clang] Fix assertion when __block is used on global variables in C mode (#183988)
291359be687e [SelectionDAG] Move the call to BuildExactSDIV and BuildExactUDIV to the top of BuildSDIV/BuildUDIV. (#187378)
310766bb389f merge main into amd-staging (#1808)
3fdec1c9f946 [clang] Enable exceptions in CWG2486 test (#187195)
015e3d2b2092 [compiler-rt] Unify python shebangs (#187285)
d434d82010be [MLGO] Modernize type annotations in mlgo-utils (#187408)
39b6a4d84ad8 [HLSL] Add globals for resources embedded in structs (#184281)
8176bc0e9b53 [HLSL][SPIRV] Use 0 to represent unbounded arrays on shader flags (#187174)
77066a3d33db [SandboxVec][SeedCollection] Aux pass argument for enabling different types (#155079)
80034dd582d3 [libc][annex_k] Add rsize_t (#163238)
53f8f3b01794 Reland [LV] Replace remaining LogicalAnd to vp.merge in EVL optimization. (#184068) (#187199)
42b75ed85fcb [libc][math] Refactor bf16divf128 to Header Only (#186641)
19fbdf9a1ecd merge main into amd-staging
fc2c965394b4 [Passes] Remove redundant semicolon from PassRegistry.def
fb36a54ef6bc [lldb] Rename formatv verbose log call, misc log cleanups [NFC] (#186951)
fb39a5d6afe1 [flang] Better handling of ALLOCATED(pointer) error (#186622)
0d01afffe123 [Utils] Format git-llvm-push
3f36e7030f30 [AsmPrinter] Only warn about unsupported remarks section if requested (#187362)
2c0d210d2c27 merge main into amd-staging
f4199fa99626 [Utils] Add --use-gh-cli-token flag
3cf80812f007 [llvm-remarkutil] filter: Add --sort and --dedupe flags (#187338)
3d881804bbd3 [Clang][OpenMP] Move declare simd codegen into OMPIRBuilder (#186030)
319d3c056b9e [mlir][llvmir][OpenMP] Translate affinity clause in task construct to llvmir (#182223)
a67c3b746836 [NewPM] Adds a port for AArch64ExpandPseudo (#187332)
ca07ca0314bf [flang] Fix confusing explanatory error message (#187341)
8e062105593b [libc][annex_k] Add errno_t (#163094)
abdcde9bbc9b [SLP] Loop aware cost model/tree building
89657f726f44 [libc][docs][NFC] Documentation consolidation and de-duplication (#187385)
04e86fbff043 [llvm-jitlink] [test] Add an XFAIL for a JITLink test on MinGW (#186980)
a70f82fab606 [clang][FlowSensitive] Do a quick check and bail early for massive CFGs (#186808)
2a89e249a293 [flang] [flang-rt] Subscript overrun could occur in namelists during a READ command. (#176959)
d70ebc84acd6 [lldb][bytecode] Compile pick ops using unsigned literal (#187376)
227ced1b1eb4 [flang] Use integer arith.max/min operations for max/min lowering. (#186466)
752ccf718b71 [flang][openacc][cuda] Add implicit device attribute for use_device unconditionally (#186844)
e044c4ad81f0 [AMDGPU] Add target features for SWMMAC instructions (#185785)
3de7814b8d2e [MLIR][XeVM] Update HandleVectorExtractPattern (#186247)
a9181e8f9d3f [CIR] Fix CFG flattening for loops with cleanup in special regions (#187369)
c95af4078379 [MLIR][XeVM] Add truncf and mma_mx op. (#180055)
0b49adc32c8d [AMDGPU] Rename AMDGPUMachineFunction to AMDGPUMachineFunctionInfo. NFC. (#187276)
fce100e26e7e [VPlan] Fix masked_cond expansion.
d1e625c93f85 [clang-tidy] `bugprone-unchecked-optional-access`: Add support for GTest asserts like `ASSERT_TRUE` and `ASSERT_FALSE` (#186363)
5a5c3176ef24 [MLIR][Python] Add optional emit reset to exportSMTLIB (#187366)
360fab623d3f [RISCV] Fix IDiv/IRem scheduling data for RV32 cores that use the SiFive7 model (#187331)
2be4a9b1b208 [LV] Add predicated early-exit tests showing poison prop issue. (NFC)
d226f1b16136 [AMDGPU] Regenerate codegen tests to check extra stuff at end of line (#187325)
96299d8d4d3c [flang] Disable trampoline test for PPC (NFC) (#187194)
d4b86e561752 [LSR] skip ephemeral IV users when collecting IV chains (#187282)
c630b09af7dc [CIR][NFC] Remove NYI checks in ternary with cleanup (#186870)
03f488a00255 [AsmPrinter][MTE] Support memtag-globals for all AArch64 targets (#187065)
da47ede4b272 [AArch64] Fix register scavenger crash when merging MTE stack tags (#186934)
eabcfcee0803 [HLSL][DXIL][SPIRV] QuadReadAcrossX intrinsic support (#184360)
17158b2ab16d [InstCombine] Fix comment in SimplifyDemandedUseBits (NFC) (#187126)
e9799e51ed32 [lldb-dap] Improve support for variables with anonymous fields and types (#186482)
0ea2e5813f96 [VPlan] Account for early-exit dispatch blocks when updating LI. (#185618)
b2c2422c2ea9 [CIR] Upstream ThreeWayCmpOp (#169963)
81950f6de421 [mlir][GPU] Bump static bound on cluster IDs (#187106)
480eba33e294 [lldb][PrefixMap] follow up fixes to #187145 (#187337)
872247c702ff [NVPTX] Split Param address space into EntryParam and DeviceParam (NFC) (#186636)
befaa35212db [libc++] Fix passing through object to comparisons in __tree (#186341)
3e09538a4268 [libc++] Expand test coverage for converting comparators in associative containers (#187133)
a33e9e5047c9 Move the call frame edges log messages to the verbose channel. (#187324)
a2891ff85c6c Reapply "[LoopUnroll] Remove computeUnrollCount()'s return value" (#187104)
f8db5db9586c [flang] Fix fir.call setCalleeFromCallable (#187124)
ba231aaaa0d1 [clang-doc] Enclose documented entities in a card (#185121)
d54da6897375 [llvm-remarkutil] filter: Add --exclude flag (#187163)
716646895747 [flang][acc] Handle deduplicated use_device (part 2) (#187305)
b11a603a45e1 [mlir][Transform] Fix crash in SequenceOp::getEffects when body region is empty (#185063)
d8545a486857 [LoopFusion] Use DA by default for dependence analysis (#187309)
253616de7e8f [libc][docs] Generate configure.rst in the build directory (#187266)
81d3f04f2990 [NFC] Fix mve-reg-pressure-spills.ll test (#187316)
6e86ee2c23d7 [clang][modules] Stop uniquing implicit modules via `FileEntry` (#185765)
4e500bd00150 Revert "[SLP] Loop aware cost model/tree building"
2caba086abeb [ASan] Fix overflow and last byte handling in __asan_region_is_poisoned (#183900)
b17db271d030 [clang][Headers] add endian.h (#186032)
9dd2e3792a9f [DAGCombiner] Move the XORHandle in rebuildSetCC inside the while loop. (#187189)
9b0c2a135e07 [NFC] Update `LoopVectorize/predicator.ll` test (#187125)
7a9299f7f11a [RISCV] Rename add_like pattern -> riscv_add_like (#187306)
16585af33b4d [mlir][acc] Fix bindNameValue for RoutineOp (#187307)
55cee50e6b16 [AMDGPU] Use native instructions for f16 to u16/i16 saturated conversion (#186769)
13a093b2b250 [VPlan] Compute cost for predicated loads/stores to invariant address. (#181572)
1b8db068ed27 [PrefixMap] Teach lldb to auto-load compilation-prefix-map.json (#187145)
9418cdbccab9 [llvm-remarkutil] filter: Support multiple input files (#187162)
63b44decb547 [clang][bytecode] Allocate local variables in `InterpFrame` tail storage (#185835)
c9c057564475 [Nightly][Infra] Enable the target gfx950 (#1796)
f60934412000 AMDGPU/GlobalISel: RegBankLegalize rules for ds_add/sub_gs_reg_rtn (#185991)
0f5d8a960f7c [Bazel] Fixes dd9dd1d (#187310)
4c745df8bc90 [MLIR][LLVMIR][NFC] Drop uses of BranchInst (#187304)
bfedc2aa7668 [Polly][NFC] Drop uses of BranchInst (#187301)
dd9dd1d2f39c [mlir][bufferization] Fix crash in promote-buffers-to-stack for nested memrefs (#186426)
96ec23096ce7 [tools][examples][NFC] Drop remaining uses of BranchInst (#187293)
60c102036acf [AArch64][AsmParser] Add MC support for %dtprel() relocation (#186599)
b37f8a54a0b1 [flang] Fix extra "./" prefix in source file paths (#186212)
a083e19efeb1 [VPlan] Add the cost of spills when considering register pressure (#179646)
b5ef9e29c5fe [MLIR][XeGPU] Avoid crashing on `gpu.func` missing `gpu.return` (#186330)
dc0ba9b4531f [flang][acc] Add missing dependency for checking CUF attributes (#187292)
9871ad1c8a54 [VPlan] Rename DataLayout -> DL
88f8a2bdb25b [flang][NFC] Converted five tests from old lowering to new lowering (part 34) (#187175)
fe09f74e1cf6 merge main into amd-staging (#1797)
080bc2572896 [IR][NFCI] Remove *WithoutDebug (#187240)
9f4a1ebb2054 [flang] Add const-qualified version of parser::Messages::messages() (#187250)
3a3f863e34ae [X86] sse-minmax.ll - add baseline SSE2 test coverage (#187283)
5e3202749ac1 [MLIR] Fix crash in FrozenRewritePatternSet when PDL lowering is skipped by debug counter (#186159)
bf46a95f2ca2 [VPlan] Use target's index type for {First,Last}ActiveLane instead of i64 (#186361)
fc569dafd767 [libc++] Refactor __is_transparent_v to make it clear what it depends on (#186419)
a4ef581c714b [InstCombine] RAUW for proven zero-indexed GEPs rather than cloning for a specific user (#185053)
7a2193cd190b [Offload] Add CMake alias for CI (#186099)
b85cf95aad71 [mlir][acc] Move acc routine functions into GPU module (#187161)
2ef6a669afe9 [NFC][AArch64] fix triple used in test (#187275)
6261cb4487f1 [SLP] Loop aware cost model/tree building
e1c81fa24eeb [gn build] Port e4a2d9cd8a63
c051449e27df [gn build] Port d0d0a665c238
e9b95ce4a728 [gn build] Port 6b3cf50d958c
6ef0b80d9d8e [gn build] Port 681f1a5ee987
0f6cd5c1838c [gn build] Port 55db533b74fe
c94403dbd336 [gn build] Port 45fe4bbdde13
d783723a584a [compiler-rt] Update runtime build script to detect RPC XDR header for AIX (#186977)
dfc02b7744c5 [gn] port c1f6fd24aa637d6a
77667d7c5bd5 [flang] Fix the CHECK: directive to ensure flagging RWE (NFC) (#187186)
c0064f744c01 [libc][math] Fix missing underflow exception in DyadicFloat::generic_as (#186734)
fa8d3c810f36 [NewPM] Add port for AArch64DeadRegisterDefinitionsPass (#187180)
073d019c450d [OpenMP] Use the standard fences now that they are supported (#187138)
1db809655578 [libc][math] Improve hypotf performance. (#186627)
2a8168ddaf4d [CIR] Add support for arrays-of-pointer-to-member-data (#186887)
1b904e948afb [CIR][NFC] Unify the 'null data member attr' getters (#186876)
138cae4a08d2 [CIR][NFC] Split the CXXABI 'TypeConverter' into its own type. (#186874)
de3f57399c66 [CIR] Fix bug where block after-unreachable wasn't CXXABILowered (#186869)
dafadf53a0a7 Fix MSVC "not all control paths return a value" warning. NFC. (#187265)
b15fa374fff9 libclc: Improve float trig function handling (#187264)
d7dbf1bd641f [mlir][gpu] Fix typo in documentation (#156619)
9b8532dd2aaa libclc: Clean up sincos macro usage (#187260)
2ecd001215f5 libclc: Use select function instead of ?: for some fp selects (#187253)
2f44f6998345 [flang][OpenMP] Use OmpDirectiveSpecification for range/depth queries, NFC (#187109)
a9605a92bdb8 [clangd] Support suppressions for driver diagnostics (#182912)
fd47fbe87e7f [lldb] Do not use mkdir -p in makefile on Windows (#187244)
cf89c33e6deb [OpenMP] Map const-qualified target map variables as 'to'. (#185918)
c949c9be610d [AArch64][llvm] Make SBZ/SBO insns warn not fail when disassembling (#187068)
596269288899 merge main into amd-staging
2563006f3178 [Clang][NFC] Drop uses of BranchInst (#187242)
f1b82dcd99bc [Bazel] Fixes c1f6fd2 (#187146)
f52b2616f4d8 [mlir][vector] Use non-native runner in gather.mlir test (#187243)
3f649d0537b3 [AArch64] Use SVE/NEON FMLAL top/bottom instructions (#186798)
60dc4c70fc44 [CycleInfo] Use block numbers for dfs numbering (NFC) (#187062)
2915519efd91 [orc-rt] Move CallViaSession into Session, add comments. (#187238)
003ec3e0a161 [NFC][AArch64] add tests for `is_fpclass` (#187231)
49f9b4b44a66 [LV] Add test for diff checks with ptrtoint subtract. (NFC)
c374678d2955 [orc-rt] Rename Session setController/detachFromController. NFC. (#187235)
671ccfea2767 [mlir][reducer] Add eraseAllOpsInRegion function to reduction-tree pass (#185892)
95824ca6b92b [Frontend/OpenMP][NFC] Drop uses of BranchInst (#186393)
6bfb44f32016 [orc-rt] Add ShutdownRequested flag to Service::onDetach. (#187230)
7404a5dbe0ca [PowerPC] Preserve load output chain in vcmpequb combine (#187010)
2734c46153cf [DAG] Add back SelectionDAG::dump() without parameter (#187001)
f9d2d8beeff0 [clang] Enable '-verify-directives' mode in C++ DR tests (#187219)
fef74e1c005d [mlir][spirv] add ExecutionModeIdOp (#186241)
8c4f4e8a05ab [LifetimeSafety] Track origins through array subscript and array-to-pointer decay (#186902)
4f6379069eb2 [mlir][tosa][tosa-to-linalg] Fix rescale with double rounding failing validation (#184787)
23a0c9f55826 [lldb] Skip file cleanup to avoid permission issue in API test (#187227)
f7763570e558 [VPlan] Improve code in VPlanRecipes using VPlanPatternMatch (NFC) (#187130)
30c962c9b701 [Instrumentation][nsan] Add maximumnum to NSAN (#186345)
25abe22ed852 [X86] Improve handling of i512 SRA(MSB,Amt) "highbits" mask creation (#187141)
9cb9081049a4 [mlir][vector] Extend vector.gather e2e test (#187071)
570c388685e2 [llvm][utils] Give git-llvm-push u+x permissions (#187211)
a8ff7e13c3be [NFCI] [Serialization] Deduplicate DeclID properly (#187212)
e762078424b2 [VPlan] Use auto return in VPlanPatternMatch (NFC) (#187210)
b83fd4dc5937 [AArch64][GlobalISel] Fix uqadd/sub with scalar operands (#186999)
81ba8b2aa0e3 merge main into amd-staging (#1795)
9b2fe0c885d1 [X86] Remove extranous I in comment. NFC (#187209)
ea8fb06f2443 [atomicrmw] fminimumnum/fmaximumnum support (#187030)
fdd2437af3cd [lldb] Avoid permission issue in API test with SHARED_BUILD_TESTCASE (#187072)
ec1c08a29145 [DA] Regenerate assertions for the tests (NFC) (#187207)
b3fdcac90d9d [AArch64] Remove vector REV16, use BSWAP instead (#186414)
77ad2c2a9cfd [DA] Add test that represents an edge case for the Exact SIV test (NFC) (#186389)
0f622c507ecc [orc-rt] Add TaskGroup for tracking completion of a set of tasks. (#187205)
76d5704633c7 [NFC][PowerPC] Update check lines to include power 9 label (#187193)
fbd24677963a [AMDGPU] DPP implementations for Wave Reduction (#185814)
7899b26e88f5 [lldb-dap] Allow expressions in setVariable request (#185611)
63dd9966d319 [orc-rt] Capture a Session& in SimpleNativeMemoryMap, fix TODOs. (#187200)
351501799ab4 [CodeGen] Improve `getLoadExtAction` and friends (#181104)
9a2f23e1a40b [CodeGen] Use separate MBB number for analyses (#187086)
14b42335c641 [orc-rt] Publish controller interface from SimpleNativeMemoryMap ctor. (#187198)
51fd033521b6 [BOLT] Enable compatibility of instrumentation-file-append-pid with instrumentation-sleep-time (#183919)
950292535e8d [orc-rt] De-duplicate some test helper APIs. (#187187)
dc8fd02b6237 [clang] Reshuffle compiler options in C++ DR tests
ee0ac7443e4d [mlir][x86] Lower packed type vector.contract to AMX dot-product (#182810)
2890f9883cb9 [OFFLOAD] Improve handling of synchronization errors in L0 plugin and reenable tests (#186927)
038c8d3f4f23 [DA] Rewrite formula in the Weak Zero SIV tests (#183738)
5f6cd9b92324 [DA] Fix overflow in symbolic RDIV test (#185805)
709ef15d7431 [NFC][PowerPC] Pre-commit to optimize bswap64 builtin for power8 (#181776)
3a7568311010 [libclang/python] Add type annotations to the TranslationUnit class (#180876)
60b11a479239 merge main into amd-staging
70665c665a08 [clang] Update C++ DR status page
8187875b5802 [clang][Driver][Darwin] Use `xcselect` for `*-apple-darwin*` targets too (#186683)
e0f74e65a4df merge main into amd-staging (#1792)
ffcb5745ab54 [orc-rt] Add BootstrapInfo: info for controller session bootstrap. (#187184)
c61d11df4003 [clang][RISCV] Add RequiredFeatures for zvknha and zvknhb (#186993)
3a1d5b5b8ce5 [X86] Support reserving EDI on x86-32 (#186123)
495c518b96cb [FMV][AIX] Implement target_clones (cpu-only) (#177428)
3661bf74cdc5 [Clang][Modules] Add regression test for #179178 (#187173)
aa2defc147c3 [X86][APX] Remove patterns for ArithBinOp (#187018)
c4137a6c0f63 [orc-rt] Relax addUnique assertion to match error condition.
13665f0d8930 [AMDGPU] Set gfx1250 default to B0
350385e7923e [libclc][NFC] Change include style from <...> to "..." (#186537)
ae9b5a4bcad8 [clang] Add `-verify-directives` cc1 flag (#179835)
dbdf1accf55d [orc-rt] Rename SimpleSymbolTable::addSymbolsUnique, relax error cond… (#187171)
1fee51c40b47 [WebAssembly] Fold sign-extending shifts into signed loads in FastISel (#185906)
8b265cf27025 [NVPTX][AutoUpgrade] atom.load intrinsics should be autoupgraded to monotonic atomicrmw for NVPTX (#187140)
3eb8b788b7df Revert "[LV] Replace remaining LogicalAnd to vp.merge in EVL optimization." (#187170)
9b5db0468438 Fix build issue starts_with
c727cd9a4b21 [orc-rt] Rename ControllerInterface to SimpleSymbolTable. NFCI. (#187164)
52089f895eb5 [LV] Replace remaining LogicalAnd to vp.merge in EVL optimization. (#184068)
a0656aba831a merge main into amd-staging
20c0e362b856 [AMDGPU] Change A0/B0 commentary to only affect gfx1250 (#1790)
c493feb71106 [Clang][Driver] Deprecate -parallel-jobs= in favor of --offload-jobs=
d49701bba36f [CIR] Implement abstract conditional operator handling for aggregates (#186284)
f6f57f0da6bf [CIR] Add handling for nounwind attributes (#187096)
d9eba8b3550d [lldb][Module] Don't try to locate scripting resources without a ScriptInterpreter (#187091)
3482480087ed [orc-rt] Add an ExecutorProcessInfo field to Session. (#187155)
1337dc9a5e1a [clang][OpenMP] Remove -fopenmp-allow-kernel-io flag
e7326007960b [orc-rt] Require non-empty triples and power-of-two page sizes. (#187151)
f2605193689f [IR2Vec] Remove redundant death test for invalid TypeID (#187143)
c78c5df960fb [mlir][spirv] Add comparison and elementwise ternary ops in TOSA Ext Inst Set (#186356)
ce805223940b [orc-rt] Add ExecutorProcessInfo APIs. (#187147)
1a359488d2be merge main into amd-staging (#1787)
c1f6fd24aa63 Reapply "[clang][ssaf] Add UnsafeBufferUsage summary extractor for functions (#182941)" (#186899)
838b1ccdd90b [libc] Add a smaller b36_char_to_int (#180841)
d17ce9a6fbe4 [ROCDL] Align mfma op description examples with the actual op (#186949)
5bda6166684f [AMDGPU][NFC] Remove kernarg_segment_ptr regbankselect test (#186029)
d13eb6d16e18 AMDGPU/GlobalISel: RegBankLegalize rules for s_setreg (#186872)
8f891a1bb3d7 [dsymutil] Fall back to compatible triple in BinaryHolder (#186893)
2fd9ee05c6be merge amd-main into amd-staging (#1785)
b03b58be38f9 [NVPTX] Fix scoped atomic when given runtime values (#185883)
cbedaa83e182 [lldb] Upstream arm64e support in ValueObject (#186906)
07542af9215c [flang][OpenMP][CUDA] Place privatized device allocatable descriptors in managed memory (#187114)
5706070eb809 [AMDGPU][GlobalISel] Switch tests to new reg-bank-select and refresh checks (#186506)
e7e45cdcec91 [spirv][mlir] Add myself to CODEOWNERS (#187115)
dd4f5c69c539 [CIR] Fix missing RegionBranchTerminatorOpInterface declarations (#187112)
99b93b586c5d [AMDGPU] fold a call to implictarg.ptr to a poison with no-implicitarg-ptr (#186925)
abb7288c1e2a AMDGPU/GlobalISel: RegBankLegalize rules for bswap, cvt_ubyte, rcp (#187093)
29f6bdb65b72 AMDGPU/GlobalISel: RegBankLegalize rules for wave_reduce_umax/umin (#186528)
015601bf04ce [clang][Driver][SPIRV] Fix assertion when using -emit-llvm (#186824)
5293760dd110 [mlir][llvmir] Fix crash when a CallSiteLoc has a UnknownLoc callee (#186860)
673002f32576 [libc][math] Fix bazel build for fmaf16 (#187111)
31fd58d8c91a [PSDB][Infra] Temporary test removal for gfx94x due to MI325 machine shortage
6b2e347ae12b [libc]: implement 'iswpunct' entrypoint (#186968)
d4afb1bfed53 [flang][OpenMP] Remove unused function declaration, NFC (#187101)
f0e699a35c8b [libc][math] Fix fma bazel build (#187107)
2ef41cca149c [clang-format] Fix Macros configuration not working with try/catch expansions (#184891)
af67e30a6303 [SLP][NFC] Refactor BinOpSameOpcodeHelper BIT enum (#187067)
bed5e7dc2018 [libc][math] Refactor fmaf16 implementation to header-only in src/__support/math folder. (#163977)
e6f0ec83438b [libc][math] Refactor fmaf implementation to header-only in src/__support/math folder. (#163970)
f5d83fb40400 [mlir][GPU] Set nsw/nuw when expanding out subgroup ID (#187099)
d0d1f0b7af32 [libc][math] Refactor fma implementation to header-only in src/__support/math folder. (#163968)
3d0e7e04c80b [z/OS] Recognize EBCDIC archive magic (#186854)
996b62231ca8 [bazel] NFC: reformat mlir & libc bazel files (#187094)
aef7e5768e78 [DirectX] Fix assertion in PointerTypeAnalysis with empty global_ctors (#179034)
385aeb24bca8 Revert "[LoopUnroll] Remove `computeUnrollCount()`'s return value " (#187035)
803828f4aa62 [mlir][GPU] Refactor, improve constant size information handling (#186907)
da86e036abca [Bazel] Fixes ebb3309 (#187090)
7477045d3d83 [lldb] Fix build on Linux when SEGV_PKUERR is undefined (#186963)
f0dfa36815dc [mlir][spirv] Add spirv.GroupNonUniformBroadcastFirst Op (#185818)
18c8b8d81da2 [Clang] Add __ob_trap support for implicit integer sign change (#185772)
ebb3309975c8 [libc] Refactor core Linux syscalls to use syscall_wrappers (#185983)
4b9693a4231b Revert "[libc] Avoid host header collisions in full builds (-nostdinc)" (#187079)
aa14eb8421f5 revert fc648683cd75 - [SLP]Add external uses estimations into tree throttling
c3219f5222a4 [mlir][spirv] Fix struct.mlir for stricter spirv-val variable-pointer rules (#186974)
037c2095e6bd Add hybrid function ordering support (#186003)
0769dde7a23b Removed Hardcoded SM Number from Mlir Test (#186917)
b5614bc21cb5 [green-dragon] fix Python and Swig flags (#187052)
74a5efa3318e [CIR] Split BinOpOverflowOp into separate overflow-checked ops (#186653)
f28ef689961e [X86] getMaskNode - perform pre-truncation of oversized scalar mask sources (#187063)
2531b1692cd2 [mlir][bytecode] Use getChecked<T>() in bytecode reading to avoid crashes (#186145)
19fd930bf00c [flang][OpenMP][CUDA] Set allocator_idx on privatized allocatable device array descriptors (#186945)
b686f5e62e09 [SandboxVec][BottomUpVec] Fix crash caused by Cmps with different operand types (#186550)
9d94bdace1b1 [mlir][Interfaces][NFC] Add early exit to MakeRegionBranchOpSuccessorInputsDead (#186325)
333f6abe30ed Reland Support float8_e3m4 and float8_e4m3 in np_to_memref (#186453) (#186833)
1800651c86c0 [flang] Lower anint with math.round (#186039)
3c391665ec1f [lldb] Fix user-after-free in CommandInterpreter (#187032)
9c7e203be3e5 [flang] Fix ignore_tkr(c) passing descriptor instead of base address for non-descriptor dummies (#186894)
79d1a2c41856 [AMDGPU] Standardize on using AMDGPU::getNullPointerValue. NFC. (#187037)
adf458cbac43 [lldb] Add additional logging to wait_for_file_on_target (#186915)
8b7c0c42edfe merge main into amd-staging
0af9058e6881 merge amd-main into amd-staging
810ba55de915 [CycleInfo] Support forward declarations (#187029)
832c95948c80 [NewPM] Port for AArch64ConditionOptimizer (#186941)
b04b9e58aa4c [gn] port 55b271ddc1fd968
6f68daa42cab [InstCombine] Recognize non-negative subtraction patterns (#182597)
19c04ce0d58f [X86] Fix fcmp+select to min/max lowering (#185594)
cb3e9eec5fe0 [clang] DeducedTypes deduction kind fix and improvement (#186727)
fec11e3e5f26 [libc++] Add scripts defining two LNT runners for libc++ (#187050)
abd5b6964e74 [X86] Fold compress(splat(x),splat(x),mask) -> splat(x) (#187042)
bed77a1d9bf4 [libc] Avoid host header collisions in full builds (-nostdinc) (#187025)
60f478a1599f Add Zstandard to Windows release build (#186772)
e8a03bb043ea [CodGen] Port UnpackMachineBundles to new pass manager (#184918)
25981794438b [AMDGPU][GlobalISel][NFC] Group RegBankLegalize intrinsic rules (#186912)
fcefee017b8a AMDGPU: llvm.amdgcn.ds.add/sub.gs.reg.rtn are sources of divergence (#186883)
a93560d13eab [lldb][PlatformDarwin][test] Move Platform test utilities into common header for re-use (#187036)
6d107523b14b [analyzer] Fix [[clang::suppress]] for nested templates (#183727)
c91a9b8d1048 [libc] Add Jeff Bailey to Maintainers.rst (#186662)
65bf05a494c9 [Instrumentation][nsan] Intrinsic tests + bugfixes (#186803)
d20315f15432 [RISCV] Select (sext_inreg (sra X, C), i8/i16) as slli+srai. (#186956)
5d97341c1329 merge main into amd-staging (#1778)
3ee7caa27387 [flang][OpenMP] Use the LoopSequence-based checks (#185300)
ca15db1cd509 [lldb] Fix permission issue in API test on lldb-x86_64-win (#187021)
db4f8f7af09b [SPIR-V] Add support for arbitrary precision integer constants in instruction printer (#185306)
a2e21b67f2f3 device-libs: Replace nextafter implementations (#1727)
3eec4d5b683a device-libs: Remove correctly_rounded_sqrt control libraries (#1724)
52e9e828985f [NFC][AArch64] ConditionOptimizer: refine cmp/cond instruction update code (#186724)
3ae428ff3a52 [libc++][NFC] Rename the template parameter of __make_transparent (#186435)
43ec60eee5f9 Reland "[DomTree] Assert non-null block for pre-dom tree" (#187005)
0f1ec17f29f1 [AMDGPU][GlobalISel] Add RegBankLegalize rules for atomic fmin/fmax (#182824)
6e17b2ef33b8 [CIR][AArch64] Upstream NEON shift left builtins (#186406)
240bc0a7ad74 [AMDGPU] Remove R600TargetTransformInfo dependency on AMDGPUTargetLowering. NFC. (#187014)
05f2b89f4459 [NFC][analyzer] Update some incorrect doc-comments (#186852)
a114bbe4cb7d [ValueTracking] fadd never produces subnormal with no underflow (#186985)
2859621ddbb3 [Bazel] Port 429e9717 (#187019)
a1a714b8b87e [MLIR][Interfaces] Make `getMutableSuccessorOperands` overridable on `ReturnLike` ops (#186832)
9f4fbe86a592 [lldb] Add pointer arithmetics for addition and subtraction to DIL (#184652)
dc5c6d008f48 [sancov] add -fsanitize-coverage=trace-pc-entry-exit (#185972)
0eefb2682bf8 [libc++] Build the library with C++26 (#181021)
e31db655fd51 [NFC][analyzer] Improve computeObjectUnderConstruction (#186186)
bec0f40ef2ee [SPIR-V] Handle spirv.MemoryModel metadata (#186138)
f335bd9685b3 [Flang][OpenMP] Add semantic support for OpenMP Loop Interchange and permutation clause in Flang (#183435)
818efd5c9541 [SPIR-V] Handle undef aggregate initializers for global variables (#186785)
34fa16afff1d [LifetimeSafety] Exclude basic_string::insert from capturing methods (#186989)
9a42e5ba6fd8 [mlir][tosa] Remove 'Pure' trait from operations that are not speculatable (#185700)
2b5e30262777 [lldb][windows] fix TestReplaceDLL.py reruns (#187002)
e1baf3a99bdc [AMDGPU] Remove AMDGPUCallLowering dependency on AMDGPUTargetLowering. NFC. (#187008)
3be7b2fc9da7 [X86] Improve handling of i512 SHL(-1,Amt) + SRL(-1,Amt) "mask shifts" (#186806)
5de7c865dc82 [X86][APX] Enable NDD tunings (#186049)
6c9407a2f064 [Bazel] Port 9e43b35 (#187011)
bc54aeff7445 [LAA] Add tests with missed aliasing invariant load/store. (NFC)
fdbc015abc9f [lldb][PlatformDarwin][NFC] Move logic to emit warning on invalid/conflicting Python script names into helper function (#185669)
bc190619eb41 [Bazel] Port 55b271d (#187007)
a78d1d9a8b0c [mlir][vector] Add missing tests (nfc) (#186990)
04cc7523ed6a [mlir][bufferization] Fix crash with copy-before-write + bufferize-function-boundaries (#186446)
a26077ee5f4b [NFC][NVPTX] Fix tcgen05.mma PTX instruction encoding (#186602)
22840d33d768 merge main into amd-staging
055322c38af1 [mlir] Fix crash in diagnostic verifier for unmatched @unknown expectations (#186148)
b861a289d799 [WebAssembly] combine `bitmask` with `setcc <X>, 0, setlt` (#179065)
67e47fb5317c [mlir][gpu] Add SymbolUserOpInterface to launch_func op (#173277)
f1a7c7e772f9 [MIR] Support symbolic INLINEASM extra-info flags (#186818)
962b304f6130 [LLVM] Make -use-constant-fp-for-scalable-splat the default. (#186422)
429e9717e232 [mlir][arith][NFC] Use type parser instead of hard-coding type keywords (#186753)
c69187622fdf [orc-rt] Update SPS wrapper names to reflect new namespace. NFCI. (#186994)
612d80348f79 [orc-rt] Move SPS controller interface funcs into their own headers. (#186991)
e904d559c5ae [mlir][bytecode] Fix crashes when reading bytecode with unsupported types (#186354)
35118457abeb [flang][NFC] Converted five tests from old lowering to new lowering (part 33) (#186943)
f15852ce21ef [AArch64] Remove promotion cost for fixed-length bfloat arith with +sve-b16b16 (#186378)
740f1b56c925 [lldb][PlatformDarwin] Reword warning when locating scripting resources from dSYM (#185666)
b14eea0b2313 [libclc] Fix check-libclc dependency on llvm-dis (#186978)
df03e1a3724a [MIR] Fix printing INLINEASM dialects. (#186797)
9b5084f894cb [clang][win] Define vector deleting dtor body for declared-only dtor if needed (#185653)
fdcb1f4ab19f [Clang] Make members with exclude_from_explicit_instantiation never be exported or imported (#185140)
6f966fb5dade [LV] Add select instruction to VPReplicateRecipe::computeCost (#186825)
527496bb10f5 libclc: Improve large float trig reduction (#186984)
107b113b67d2 libclc: Use small trig reduction for nan (#186983)
a0d6e97142bd libclc: Use frexp and ldexp in trig reduction instead of bit hacking (#186982)
77ba0d9e244a libclc: Update pow functions (#186890)
6dcd70d10377 [AMDGPU] Use AMDGPULaneMaskUtils in SILowerI1Copies (#186170)
fae024aca954 libclc: Move edge case handling of trig functions (#186429)
56d7920c093f [VPlan] Factor collectGroupedReplicateMemOps (NFC) (#186820)
7887ac6e7f2e [libc][docs] Update clang-tidy checks page (#185923)
1588f083694b [OpenMP][OMPT] Add missing `error` entry to device tracing record union (#185683)
19460ff85976 libclc: Use fshr builtin in sincos helpers (#186427)
b5e825ec3839 [DA] Add test for the Weak Crossing SIV test misses dependency (NFC) (#186355)
9e43b35befa4 [clang][ssaf] Add --ssaf-list-{extractor,format} flags (#185428)
c64d9af7b5f5 [llvm-link] Add more detail to `--internalize` description (#170397)
096371b7e334 libclc: Use struct for ep pair (#186973)
7c2aef4b58fc Reland "[lldb] Initial plugin and test for SymbolLocatorSymStore" (#185658)
b091331f0605 [orc-rt] Fix stale file comment. NFC.
b2442a20a946 [NFC][SPIRV] New test for untested SPIRVInstructionSelector case (#186069)
63ebca6a50ab Add zlib to Windows release build (#186630)
91b928f91936 [VPlan] Create header phis once regions have been created (NFC).
356717656324 merge main into amd-staging (#1774)
d9066944d7d3 merge main into amd-staging (#1773)
3dc46e9fffe8 [lldb] Use clang_cl_host to build `vbases.test` (#186857)
3d421d59ad24 [DA] Refactor the signature of the Exact SIV test (NFCI) (#186386)
234aacf8e895 [C++20] [Modules] Diagnose for duplicated definition in the same module (#186959)
f1b16eaeddbb [orc-rt] Hold `const void*` rather than `void*` in ControllerInterface. (#186954)
33cfc6ba610a [CI][libclc] Enable libclc in premerge CI with single target (#186104)
b7843a241102 [RISCV][NFC] Remove duplicate setTargetDAGCombine registrations (#186928)
6439500cefaa [clang][bytecode] Clean up CondScope after while loop (#186816)
51b3b9b03907 [LV] Optimize x && (x && y) -> x && y (#185806)
69b83274578b [orc-rt] Add ControllerInterface symbol table. (#186947)
376f41439375 AArch64: Look through copies in CCMP converter.
f572cc0e7edf [RISCV] Fold (WADDAU -C, -1, rs1, 0) -> (WSUBU rs1, C) where C > 0 (#186638)
711e8846714d [revPat] two reverts
b005ff76f367 [ValueTracking] frem in computeKnownFPClass can not return +/-Inf (#186748)
193d26743b12 Revert "[Reland][IR] Add initial support for the byte type (#186888)"
b988242a2542 merge main into amd-staging
d97a7a13997e Revert "[Clang][OpenMP] Move declare simd codegen into OMPIRBuilder (#186030)"
4fdd4404744d Merge commit '4c63b28bb971' into amd/merge/upstream_merge_20260316230220
a8edc5355799 [mlir][Interfaces][NFC] Improve time complexity of RegionBranchOpInterface canonicalization patterns (#186114)
827012111082 [llvm-ir2vec] Refactoring the ir2vec python bindings testing (#180664)
2c8e855a7a97 merge main into amd-staging
88f1ec9a70bf [clang][OpenMP] Parse/Sema for OpenMP 6.0 declare_target 'local' clause (#186281)
38eebe843b41 [AMDGPU] Add s_sethalt to hasUnwantedEffectsWhenEXECEmpty (#186745)
4abb927bacf3 [libclc][CMake] Use clang/llvm-ar on Windows (#186726)
fe1f51250234 [Clang][docs][test] Add N3517, N3652, and N3715 according to N3783 (#185566)
47970a4428d6 [flang][PPC] Update vector tests with nuw nsw (NFC) (#186879)
83965ad10c1e [clang] use canonical arguments for checking function tem…
…ug is" This reverts commit 8d4dc98.
|
Github is unable to render changes for me. I can not see "approval" button either. I've gone through diff files and changes look good to me. Consider this PR approved from me. @justinrosner @pabloantoniom can you guys please check changes related to backend/AsyncLoadToLDS diffs etc. ? |
I'm also unable to see the approve button... is this a known problem because so many files have been updated? I don't recall this happening on past upstream merges Update: I refreshed a few times and it worked for me. |
|
One minor comment reported by AI Agent: |
…ream-merge-jan-26
addIllegalDialect call).
…2c2f
eaf55a972c2f merge main into amd-staging (#2315)
aa65d53f62e3 merge main into amd-staging
1348766d1d68 [SLP]Initial support for non-power-of-2 vectorization
0ad0e899d456 [libc++] Remove full header path from assertion messages (#190060)
2f28e1db535b [libc++] Implement P1899 `ranges::stride_view` (#65200)
3e10b2fe2169 [clang] Fix incorrect register information for AVR (#193940)
61f311d93ed0 [AVR] Fix a bug in printing assembly operand with extra code (#193964)
5c73c7a3a057 [lldb] Propose MultiBreakpoint extension to GDB Remote (#192910)
4ed36386a276 [asan] API for getting multiple pointer ranges (#181446)
6443e9b8a5bc [clang] Tests for CWG1670 and CWG1878: `auto` in conversion functions (#187850)
0ec82abfd2c1 [clang] Enable part of CWG2598 test in C++20 mode (#189310)
55af9c26151b [libc][math] Refactor dsub family to header-only (#182160)
3bbfa3e5f07a [LoongArch] Combine rounded vector shifts to VSRLR/VSRAR (#192921)
80fc7f447952 merge main into amd-staging (#2312)
a693efcc40b1 [RISCV][GlobalISel] Support RISC-V specific inline asm constraints: 'I', 'J', 'K' and 'S' (#193765)
3dc4fd6dd411 [compiler-rt][sanitizer] Remove linux/scc.h (#194116)
320a5154ecd3 [LoongArch] Add tests for vector shift-right-and-round combines (#192920)
aeea3191d416 [MC] Change MCContext::getTargetOptions to return a reference. NFC (#194112)
5e09af5f30a7 [clang][bytecode] Reject inc/dec on non-numbers (#193954)
7bbfee35775e [clang][bytecode][NFC] Add record/union names in Descriptor::dump() (#194002)
90ec315dcb31 merge main into amd-staging
84b0809a84f4 [GIsel] Add constant-folding for bit-counting ops (#194010)
2428fbb613be [NFC][ThinLTO] Remove JumpTableToSwitchPass from the test (#194103)
7059fc556bfe Revert "[Clang][CodeGen] Report when an alias points to an incompatible target" (#194106)
c3df8f8c8337 [SPIRV] Add 64 bit lowering for bitreverse (#193068)
69797daef12f [revPat] revert [flang][OpenMP] Move branching verification to semantic checks (#193324)
34f75f040637 Revert "[flang][OpenMP] Move branching verification to semantic checks (#193324)"
6dd373f8aaab [sanitizer] Relax pthread_join tests for different glibc versions (#194100)
bd1c30811723 [Clang][CodeGen] Report when an alias points to an incompatible target (#192397)
e66613f38124 [CSKY] Fix build after #191460 (#194102)
f478220dd062 [LoongArch] Add support for vector FP_EXTEND from vxf32 to vxf64 (#164746)
4a7ae4b900fd Revert "Reland: [LowerTypeTests] Add debug info to jump table entries" (#194095)
87317d39f44a [compiler-rt][WebAssembly] Use an int as CMP_RESULT (#194093)
0c472c140158 [lldb] Handle partial memory region coverage in IRMemoryMap::FindSpace (#194001)
a1b9a3d6ce63 merge main into amd-staging
a041024da2f9 Revert "[compiler-rt] Improve ubsan-minimal runtime for GPU use (#193597)"
44638abac851 merge main into amd-staging
e38b8da23b0b [RISCV][P-ext] Remove dead code from LowerOperation handling of ISD::STORE. NFC (#194088)
ec9d7d18bdfe Revert "[llvm-profgen] Add support for ETM trace decoding" (#194087)
ecdcd40e233b [DirectX] Emit `dx.precise` metadata when fast math is not present (#192526)
b4c1e1a14e47 [RISCV] Expand fcanonicalize on vector types (#193842)
5064b936bec6 [clang][deps] Always initialize module cache out params (#194082)
e3bd61890e68 [llvm-profgen] Add support for ETM trace decoding (#191584)
4e4b91c6b690 merge main into amd-staging (#2308)
cf183f4509a3 Manual update of LLVM_MAIN_REVISION to 577912
327f027f108e [offload] Fix compilation (#194081)
ca934b892fdd [dsymutil] Report error when section offsets exceed DWARF32 limit (#193867)
5536a4c7122e [LFI][AArch64] Add rewrites for control flow (#192602)
d1c9b4a53975 [MLIR][XeVM] Update API usage. Some OpenCL APIs are not supported. (#193320)
d0c91de53e3b [clang][NFC] Linux/Windows Multilib Include Path Tests (#193869)
d14866f029e9 gn build: Port a4538a3ad902
2b43da5ac0fa [NewPM] Port for AArch64StackTaggingPreRA (#194021)
b49855fc5684 [AMDGPU][MC] Allow the nolds modifier (#185129)
7ea78deff2d1 Revert "workflows/issue-release-workflow: Use GitHub app for generating tokens" (#194058)
ebbaa93e005e [llvm] Implement the BPF ABI (#194031)
da2c4a9efe99 [clang] Add constant evaluation support for CK_ToUnion. (#193370)
5b570d1b3b1d [NFC][MLIR] Use `getIntrinsicSignature` to verify overloaded intrinsics (#194035)
fa2588e3110f [NFC][NSAN] Use `getIntrinsicSignature` instead of `matchIntrinsicSignature` (#194025)
bfa88a8d3c3b [libc] Implement wcscoll (#192778)
0da7c12e43bf [AMD-GPU] Fix smfmac builtin target (#193999)
ef739b97b108 [AMDGPU] Correct gfx950 smfmac sparse index verifier (#193541)
7ec8037f3243 workflows/issue-release-workflow: Use GitHub app for generating tokens (#193825)
39e02c141d03 [Offload][AMDGPU] Use ROCr API for APU check (#193887)
833025532955 [lldb] Fix build logic in TestPtrAuthExpressions.py (#193847)
a47eca0ae99f [lldb] Rewrite make rules for TestFileBreakpointsSameCUName.py (#193871)
9805381e5f01 [libc] Move mul_overflow to math_extras.h (#194033)
6fb343619009 [dsymutil] Handle DW_OP_GNU_push_tls_address in markEverythingAsKept (#193870)
f098aa3b5b58 [lldb][Darwin] debugserver expedite new binary info, lldb use (#192754)
32eb90bfdf9f [clang][deps] Keep module cache in memory (#192347)
ad5a7609df9b [SLP]Do not cache sentinel position for SplitVectorize nodes
125f69d71ad1 [libc][math] Refactor fabs family to header-only (#182173)
719c38062a3b [NFC][lld] Avoid hex address case sensitivity in fill-trap tests (#194037)
7de8b11607bf [LoopUnroll] Make optimization remarks more precise (#190714)
ed02685dd70f Revert "AMDGCN: Skip -fgnuc-version version for amdgcn (temporary wor… (#1961)
b08ec97d37cc [LoopVectorize] Don't replace widen with replicate for ExtractValueInst (#193404)
d4ba0194f52b [mlir] Add analysis filter in dataflow solver (#192998)
ec5862a28e59 Re-apply: workflows: Use main-branch-only environment when using ISSUE_SUBSCRIBER_TOKEN (#179990) (#193801)
72ca372fa7c9 Revert "AMDGPU: Implement getInstSizeVerifyMode" (#194026)
25ec1baf2eb9 [CIR] Fix remaining (part 2) FlattenCFG rewriter contract violations (#192503)
dc41953559a6 [MLIR][NVVM] Add `nvvm.ex2` OP (#193790)
2948f9a784e8 [CIR] Add `__attribute__((annotate(...)))` support (#193329)
a3285a1a14db clang: Check -Xarch compatibility using Triple parsed architecture. (#189651)
600efe3dd9bb AMDGPU: Implement getInstSizeVerifyMode (#191461)
7b336dcfce1d [lldb] Allow forks to occur in expression evaluation (#184815)
f24bfb8967cb [SPIR-V][NewPM] Register IR-level passes with the new pass manager (#193660)
a497f90dc091 [compiler-rt] Improve ubsan-minimal runtime for GPU use (#193597)
3f3c26039f13 [DirectX] Resolve unreachable default branches in switch statements (#193592)
b785dc42c7b1 [CIR] Update uses of no-prototype GetGlobalOp (#193868)
589d337d3d44 [SLP] Update analyzeRtStrideCandidate() to correctly handle types widen than i8 (including revectorization) (#191878)
8e4f0ce69853 [HLSL] Remove support for user-defined constructors and destructors (#193375)
a1a4da0416ad [libc][math] Refactor ddiv family to header-only (#182149)
b06f62f7fc82 [llvm] Introduce TargetInfo (#190730)
323c3da8dcb8 [flang] [flang-rt] Implement AT edit descriptor for Fortran 202X with appropriate handling and tests (#189157)
b3cc1929966e [NFC][Clang][ByteCode] Apply rule of three to Context and EvalIDScope (#193856)
61bfd7db9f55 [CIR] Tolerate identical source and destination in cir.copy (#193852)
7c043b7a571c [MLIR][XeGPU][VectorToXeGPU] Fixed lowering of transfer_read/write for rank > 2 (#193308)
0dc6e8c41e5f [AMDGPU][NFC] Refactor TryGetMCExprValue into evaluateMCExprs helper (#193859)
63e36755decf [DirectX] Denote `dx.resource.getpointer` with `IntrInaccessibleMemOnly` and `IntrReadMem` (#193593)
df359d81659a [SLP] Skip FMulAdd conversion for alt-shuffle FAdd/FSub nodes (#193960)
1ba6cc0e318b [clang][CIR] Add lowering for vcvtd_n_ and vcvts_n_ conversion intrinsics (#190961) (#193273)
cbda767c2a41 [ThinLTO] Reduce the number of renaming due to promotions in distribu… (#188074)
e8f32abba9a1 CodeGen: Fix double counting bundles in inst size verification (#191460)
1823355d06b8 [ARM] Fold SELECT (AND(X,1) == 0), C1, C2 -> XOR(C1,AND(NEG(AND(X,1)),XOR(C1,C2)) in Thumb1 (#185898)
a614cd391a40 [lldb-dap][windows] fix a race condition in runInTerminal mode (#193773)
0b449f66927f [flang] Fix abort on invalid -fdo-concurrent-to-openmp value. (#193929)
deb84db5b405 [DirectX] Apply DXIL op fnattrs to declarations (#193622)
eb17a2e19c85 [libclc] Make sure PACKAGE_VERSION is set for libclc (#193966)
1bd6f6636a39 Revert "[PreISelIntrinsicLowering] Expand binary elementwise intrinsics (#193552) (#193580) (#193990)
0f861ec33ae6 [MLIR][NVVM] Add `nvvm.cos` OP (#193792)
0642d03c76b3 [MLIR][NVVM] Remove ptx version for consistency (#193991)
cd4ac81779e0 [SLP] Add new test for widened strided loads of > i8 width (#193901)
94cdc55d8a77 [MLIR][XeGPU] Remove use-by-broadcast-only restriction for ShapeCast op in Wg-to-Sg distribution pass (#193640)
87a9cbaed1d6 [compiler-rt][TySan] Add Hexagon target support (#191603)
b0931483280d [libc][math] Refactor dmul family to header-only (#182151)
25a035d94486 [clang][bytecode] Reject float-to-int casts on non-numbers (#193968)
08992737d231 [VPlan] Use early continue in ::buildVPlansWithVPRecipes (NFC). (#193979)
4837b0a476eb [LifetimeSafety] Suppress suggestion/inference for moved loans (#193899)
785d7246bf16 [AMDGPU][Disassembler] Permit unneeded VOPD3 operands to be non-zero (#193974)
c92bf56cd7bb [lldb][AArch64][Linux] Rename "por" register to "por_el0" (#193983)
52534a1e1207 Revert "[C++20] [Modules] Don't profiling the callee of CXXFoldExpr (#190732)" (#193975)
f1f2022f4f18 [libc][docs] Add sys/uio.h implementation status (#122006) (#193980)
2168f4b3d3bf [flang][NFC] Converted five tests from old lowering to new lowering (part 48) (#193889)
c49b1773b223 [clangd] [C++20] [Modules] Introduce GC for clangd built modules (#193973)
771440f5bb2e [libc][docs] Add dlfcn.h implementation status (#122006) (#193972)
6c7d16c0bfd3 [libclc] Use 'LLVM_DEFAULT_TARGET_TRIPLE' instead of 'LLVM_RUNTIMES_TARGET' (#193969)
8e5b38383f1e [flang][OpenMP] Rename dirSpec to spec in openmp-parsers.cpp, NFC (#193967)
8baf33522df3 [BOLT][AArch64] Refuse to run JTFootprintReduction pass (#193946)
aca5d1ed27f7 [LifetimeSafety] Remerge "Add support for `new`/`delete`" (#193776)
97b7cee34583 [CIR] Introduce LocalInitOp, & lower static locals (#193576)
ee4d927dfc01 [mlir][tosa] Fix integer bilinear (quantized) tosa.resize lowering to use floordivsi (#193821)
b5d253cf31c2 [flang][NFC] Converted five tests from old lowering to new lowering (part 47) (#193886)
48e0e16886f9 [lldb-dap] extend env when testing reverse request (#193743)
629f81599d6e [X86] freeze-binary.ll - regenerate to show VPADD constant asm comments (#193953)
7b68b4b2c196 [lldb][docs] Document AArch64 Linux Permission Overlay support (#184119)
d3f4fc750db9 [AArch64][clang] Fix typos in `arm_sve.td` (NFC) (#192981)
bb3d25167abe [libc][docs][POSIX] Add sys/select.h implementation status (#122006) (#193948)
4c66205b070c [lldb][Linux] Add overlay and effective permissions to "memory region" (#184115)
a528529064db [clang][bytecode] Allow constexpr-unknown values in GetPtrBase{,Pop} (#193903)
e2170a0f18cd [VPlan] Remove unused LVer arg from tryToBuildVplanWithVPRecipes (NFC). (#193950)
e3443a1189b9 [libc] add `pthread_cond_*` public interfaces (#193656)
17861903e6bb [LoongArch] Custom legalize vector_shuffle to `xvshuf4i.d` (#164213)
e7edfd81ca1a [X86] Regenerate vector ext tests to show VPADD constant asm comments (#193942)
c0844b7b65b4 [X86] known-never-zero.ll - regenerate to show VPADD constant asm comments (#193943)
0844fdfd7255 [clang][bytecode] Start lifetime when activating pointers (#192589)
c59c19bf5921 [MachineSSAUpdater][AMDGPU] Add faster version of MachineSSAUpdater class. (#145722)
0571ce414ec0 [flang][OpenMP] Move branching verification to semantic checks (#193324)
8141a4351c5e [flang][OpenMP] Make OpenMPLoopConstruct inherit from OmpBlockConstruct (#193823)
48e65b6c4337 [AArch64][GlobalISel] Add a variant of gi_extract_high_v8bf16 (#193345)
a227dc7c0980 [DAG] visitIS_FPCLASS - fold to constant when result is fully determined by KnownFPClass (#193737)
e6db81282c7f merge main into amd-staging
f64e8d1567ad [AMDLIBM] Remove the mapping of the deleted vector call (#193760)
a48159df9ce5 [AArch64][llvm] Remove support for FEAT_MPAMv2_VID (#193191)
cedcf1876cb1 [Clang][Sema] Change `ExtnameUndeclaredIdentifiers` to MapVector. (#193924)
90020486aa89 [X86] masked div/rem tests - fix avx512 and add sse4/avx2 test coverage (#193933)
3fa0ac20705b Reland "[lldb][Linux] Read memory protection keys for memory regions (#193934)" (#193936)
cc0913eacf88 [clang][bytecode] Fix `MemberExpr`s with a static member (#193902)
9df51a964ddb [MLIR][NVVM] Add `nvvm.sin` OP (#193775)
23ea7363ff76 [AAEval] Print ModRefInfo for atomic operations (#193935)
390a29ea8339 Revert "[lldb][Linux] Read memory protection keys for memory regions" (#193934)
e2159f8e3c1a [LangRef] Allow monotonic & seq_cst accesses to inter-operate with other accesses (#189014)
b1af74c44790 [flang][OpenMP] Remove duplicate code block in MapInfoFinalization (#2306)
69724c88e3e0 [lldb][Linux] Read memory protection keys for memory regions (#182246)
faf37adb3b33 [Clang] Use const std::string & in ClangOptionDocEmitter. NFC. (#193926)
ca136926c125 Fix formatting of changes in recent redefine_extname changes. (#189938)
f780e46d6e0e [llvm][ExpandMemCmp] Avoid making copy of loop value (#193915)
8685fb10938f [mlir][math] Add constant folding for `math.fpowi` (#193761)
839a22f449b3 [Flang] Add `INLINEALWAYS` Compiler Directive (#192674)
666e2af1d575 merge main into amd-staging (#2305)
170f030c22c5 [mlir][math] Use APFloat::SemanticsToEnum in constant folding (#193914)
3ec9bbc3da9c [DSE] Merge two test files and generate checks (NFC) (#193922)
297fb9377a6a [LICM] Generate test checks (NFC) (#193921)
4852c5b159c8 [ARM][MC] Gate Thumb hvc alias on virtualization (#193532)
c55a73c44e4a [lldb] Remove full stop from AppendErrorWithFormat format strings (part 1) (#193750)
340ba1191cfc [SPIRV] Do not add aliasing decorations to OpAtomicStore/OpAtomicLoad (#193779)
bd469e8a1d47 [SDAG] Minor cleanup to TargetLowering::expandFP_ROUND. NFC (#193793)
347aa3f6fbcc [GISel] Disable opt_brcond_by_inverting_cond combine at O0 (#193417)
efffb04c2b7b [DirectX] Fix DILocalVariable (#192573)
3de31988417d [DirectX] Replace non-const count of DISubrange with -1 (#192576)
8b3eac05aa74 [DirectX] Convert DICompileUnit versioned language (#192574)
eb6eb9fa0a0c [DirectX] Convert debug values to old style (#192162)
6114cbb611c8 [DirectX] Fix debug dump of ValueEnumerator (#191251)
6c16fc8a1a1d [lldb][test] Remove full stop from expected error messages (#193748)
248192d5bef1 [RISCV] Add bf16 tests for interleave and deinterleave (#193720)
e4f1530edcc9 [flang][debug] generate llvm.fake.use for arguments at -g and O0 (#187044)
fc9f14e42422 [libc] Switch check-libc from CTest to lit (#193798)
65e766dfda81 [libc] Honour LIBC_GPU_TEST_JOBS in lit test runs (#193797)
7758ee59e7a2 [libc] Fix implicit conversion warning in mktime_test (#193504)
0d332848bf28 [SPIRV] Lower load/store atomic to OpAtomicLoad/OpAtomicStore (#185696)
b5483871391b [LV] Simplify live-out extraction for first-order recurrence phis when tail folding (#176108)
99c9a1f566df [mlir][EmitC] Add tests for arith.max/min float/signed int conversions (#190160)
b565800d99a4 [lldb] Add regression test for stale Symbol pointer crash in statusline (#193854)
e7e85a744871 [IR] Remove pointer arguments from loop.dependence.{war|raw}.mask (#188248)
347f1ac86d54 [MLIR][Vector] Add fastmath attribute to vector.contract (#192788)
a7368c3b48f8 [NFC][Clang][docs] Clarify the status of P1949R7 (unicode identifiers) (#193483)
3041708a17b3 [Tooling][clang-tools-extra] Consume CommonOptionsParser errors in tools (#193675)
aadf3959eb0d [libcxx][Github] Add generic-llvm-libc config to CI (#193822)
ab27c601b01a [Comgr] Add end-to-end LIT coverage for amd_comgr_hotswap_rewrite (#2291)
5d4b17e963b1 [Clang][SPIRV] Add getSRetAddrSpace() for SPIRV (#193875)
104ee2aed28d [NFC] [clangd] [C++20] [Modules] Add a test for testing transtive change detection (#193888)
70e2e7e63f98 merge main into amd-staging
dbaa12a89f45 [AMDGPU] Add MC tests for scalar operands for packed fp32 instructions (#193866)
eef81b7a0a63 [lldb/test] Fix TestModuleLoadedNotifys duplicate module check (#193846)
70fcb235250b [lldb/test] Fix TestCompletion on Windows after realpath change (#193878)
28d2537af2b6 [clangd] [C++20] [Modules] Introduce persistent cache for clangd built module file (#193883)
af166f419fb9 [LoongArch][NFC] Pre-commit tests for vector fpext from vxf32 to vxf64 (#164740)
1249cb6aea88 [clang-scan-deps] Fixes an assertion in clang-scan-deps (#193619)
61b0de5f14a3 [RISCV] Remove codegen for vp_fneg, vp_fma. NFC (#193214)
67e1411de836 [VPlan] Fold lhs | (headermask && rhs) -> vp.merge rhs, true, lhs, evl (#193511)
e3ab3688e1c6 [X86][COFF] Enable basic-block-address-map emission (#191347)
6e2f5e9679cd [OpenCL] Diagnose error for zero-length array (#193163)
528e673fec47 [Clang][CodeGen] Fix sret lifetime marker AS mismatch after #186275 (#193850)
969247cc47a3 [libclc] Allow testing unresolved symbols on multiple libraries (#193647)
f3192382c336 [libclc][CMake] Remove CMAKE_C_COMPILER_ID check (#186717)
a95a1c40edba [LazyValueInfo] Support vector types in ICmp condition handling (#192900)
c6b998443589 [NFC][Clang][Sema] Apply rule of three to Sema helper classes (#193835)
deb238e224b7 [gn build] Port 3081d52d8242 (#193862)
2611f151c3a4 [gn build] Port a4538a3ad902 (#193863)
9b7b83b3499b [gn build] Port d137e6601f1c (#193864)
0cd635ca4504 [gn build] Port d64dd5a2afea (#193865)
4f877e47e69b [gn build] Port 2039a51881bb (#193861)
6b4cdb036471 Revert "[gn] port 40fcd2517a110 (#193293)" (#193860)
2a74f30cc20e [CIR] Add coroutine cleanup handling and update co_return semantics (#189281)
0bdcf4ee4dd5 Revert "Reapply "[clang][modules-driver] Add support for C++ named modules and `import std`" (#193857)
0d1bf3ac8eb1 [HLSL][NFC] Refactor worklist loop in HLSLEmitter.cpp to use index-based iteration (#193638)
3174c94eaf20 Revert "[flang][cuda] Preserve fir.rebox captured by cuf.kernel in SimplifyArrayCoorOp" (#193855)
8e736e102bd5 merge main into amd-staging
9473873906b6 Reapply "[clang][modules-driver] Add support for C++ named modules and `import std`" (#193815)
689dc6c58c01 [CIR] Handle boolean expression as array indexes (#193814)
2709f4872c66 [Flang][OpenMP] Support for parallel regions in Generic kernels (II) (#2276)
dca61067836e [mlir][xegpu] Add support for `vector.transfer_read/write` on SLM buffers (#192757)
28027f8ffee1 [MachineOutliner] Do not allow debug instructions to affect liveness computations. (#192336)
6364bf68058b [lldb] Remove unused ValueObject::IsBaseClass(uint32_t &depth) (NFC) (#193849)
c7c48e51a43d merge main into amd-staging (#2299)
8b96c2104e74 [flang] Add comparison operators for c_devptr (#192687)
464392e9d3b5 [TySan] add internal interface support (#192413)
b756be64cad6 [flang][cuda] Preserve fir.rebox captured by cuf.kernel in SimplifyArrayCoorOp (#193837)
42077db9afb5 [lldb/test] Fix TestOSIndSYM for Darwin embedded platforms (#193839)
281f993aafa6 [CIR] Add nonnull on returns and pointer params (#188281)
3cd4a795d9fd clang: Avoid hardcoding some offload triple strings (#193811)
8ebc7307fa4a [llvm-rc] Add support for MIPS machine (#193830)
7ed9d965f29e [AArch64][PreISelIntrinsicLowering] Adjust tests to include -march=+sve (#193833)
81b827f2e71a [Comgr] Add agent-config files (CLAUDE.md, .cursor/rules/comgr.mdc) (#2301)
de82b4790943 [Clang] Fix sret AS for non-trivial-copy returns. (#186275)
6b31a99ee4f5 Revert "[Darwin] Remove linker version checks for objc_msgSend selector stubs (#193637)" (#193828)
3baafed3e779 [NFC][offload][OpenMP] Fix kernel replay documentation (#193832)
ecefc4a2ec2b [VPlan] Shallow-traverse vector-loop in dropPoisonGen (NFC) (#193635)
6d826cb602f8 [flang] Add parser support for Fortran 2023 conditional arguments (F2023 R1526-R1528) (#191303)
0b255fe83f17 [mlir][canonicalize] Add filter-dialects option (#193041)
ad4cd22cebf5 [libcxx] Use debug() instead of note() for substitutions (#193667)
c9014d34522b [PreISelIntrinsicLowering] Use index type for index in intrinsic expansion (#193807)
b96263ce6c68 [HLSL] Update global array convergence test (#193380)
bd8b9934f7c7 [SPIR-V] Fix half precision OpConstant for log10/exp10 lowering (#193730)
1e690a2c16fd [HLSL][DXIL][SPIRV] Added DeviceMemoryBarrier() and AllMemoryBarrier() intrinsics (#190633)
b53aeab1d237 [X86] Add test coverage for #193700 (#193819)
36d19f50db3f [MLIR][Mem2Reg] Ensure dominance of default value in regions (#193708)
a6ab955369ae [Darwin] Remove linker version checks for objc_msgSend selector stubs (#193637)
f63bd03c1db0 [CIR] Handle CK_UserDefinedConversion and related casts in emitCastLValue (#193611)
44a1d740333b [lld][WebAssembly] Always initialize fixed `__tls_base` in single threaded mode (#193563)
0bdaf63d0159 [mlir] Enhance error messages for attribute type mismatch in properties (#193758)
86230d50912c [CIR] Implement VLA cast for ComplexType (#193583)
1b6c29ad5cfb [X86] resolveTargetShuffleInputsAndMask - match repeated vector sources through bitcasts (#193810)
d2553595793f [NFC][AMDGPU] Remove `amdgpu-link-time-lds` module flag (#193806)
df1c7ebac75f [lldb] Speculative fix for crash in Function::GetCallEdges() (#193636)
3e432057bfca [CIR] Add restrict→noalias on non-builtin pointer params (#191483)
6d67286e5cf1 [lldb/test] Fix TestDataFormatterObjCNSBundle.py following 8212cab4128d (NFC) (#193816)
8c1081531323 [scudo] Adjust PROT_MTE page count for secondary allocator (#192202)
c26c714d53f0 AMDGPU: Use preferred --target=triple flag in documentation (#193817)
80a00e1de18f [AMDGPU] Implement amdgpu.dot op (#193371)
9bda9bdf142b [CIR] Allow multi-block ctor regions on GlobalOp (#193596)
6fe957a8423f [clang][lit] Don't substitute cir-opt if it's not enabled (#193665)
c71780cf5879 Triple: Add constructor from enum entries (#190632)
3ec98d6c15bc [CIR] Implement handling for destroying delete (#193607)
ca90ff511250 [CIR] Handle negative offsets in pointer constants (#193624)
56fd2c016fd4 [CIR] Upstream __builtin_astype int_to_ptr (#193519)
e3a65e9175cb [ProfCheck] Add test from #193580 to xfail list (#193799)
d707870ceb1a [libcxx] XFAIL some tests for LLVM libc
e4c44c6cc903 [libcxx][Github] Build container images in separate jobs (#193346)
f650fba569c4 [libcxx] Add Testing Configuration for LLVM libc
dc34d163d8c9 Re-apply "[AMDGPU][Scheduler] Use MIR-level rematerializer in rematerialization stage (#189491)" (#192443)
18e4f3be2d49 [flang][OpenMP] Add parallel loop to loop directive parser set (#193621)
4a74a4346c34 [flang][cuda] Flatten memref descriptors in GPU kernel argument packing (#193651)
a1d11348aba4 [PreISelIntrinsicLowering] Expand binary elementwise intrinsics (#193552) (#193580)
712b058c1222 [lit] Fix `progress-bar.py` flaky test (#193741)
254fcbeface8 [offload][OpenMP] Add basic documentation for kernel record replay (#193699)
df77a292371c [offload] Fix envar description in docs (#193642)
5506829f7fd7 Remove unused parameter; NFC (#193767)
d060b496ebf7 [flang] Route elemental CHARACTER MIN/MAX OPTIONAL cases through custom lowering (#191244)
84eb64d64f78 [llubi] Implement intrinsics for integer arithmetic/bit manipulation (#193702)
44d1832283ca Reland #2 "[STLExtras] Add a template for detecting whether a type has an equality comparison operator" (#177415)
87a8d40fdd44 [HLSL] Add codegen for accessing resource members of a struct (2nd merge attempt) (#193584)
0459273e347a [ConstantRange] Expand makeAllowedICmpRegion to use samesign to give tighter range (#174355)
9f75001f924a [lldb] Fix flaky TestRunLocker by using lldb.target instead of lldb.frame (#193788)
7779ee8c0e95 [flang] Support polymorphic types in conditional expressions (#192684)
ee3ed4a0f688 [InstCombine] Fold neg arg in hyperbolic lib functions (#193586)
6f999388adeb Reland "[Lit][NFC] Refactor shell environment functionality and in-process builtins from TestRunner.py into new modules" (#193759)
dd8c77765646 [RISCV][GlobalISel] Add intial support for inline asm (#193314)
2248253e7f97 [PowerPC] fixed issue "Failure to optimize (x == 0) ? 0xFF : 0 to addic+subfe instead of cntlzw+srwi+neg" (#190606)
cf2b30aa2aca [libc] Honor per-test timeout in lit test format (#193772)
321db053f50d Revert "[AArch64][GlobalISel] Do not run the Localizer at -O0 (#177359)" (#193781)
7df553349756 [libc++] Implement `ranges::fold_left_first` and `ranges::fold_left_first_with_iter` (#180214)
023e2e6c9d99 [DAGCombiner] Fold bswap of single-byte-known-nonzero value to a shift (#193473)
f7828886920c [X86] Regenerate bit integer tests to show VPADD constant asm comments (#193763)
602cc92c19c1 [Hexagon] Add SafeStack runtime libcall to HexagonSystemLibrary (#191673)
d6ebdf4a1989 [NFC][Clang][Analyses] Fix AccessPath to have deleted copy assignment (#193639)
37cd9addde72 [TySan] Fix size type mismatch in instrumentMemInst for 32-bit targets (#191601)
2d84862fb3f4 [CodeGenPrepare] Drop nuw on gep unmerging if the new index is negative (#193488)
183168aa5035 [OpenMP][OMPIRBuilder] Convert cmpxchg memory order to C ABI constants (#193536)
a54364a7dc4a [DAGCombine] Relax restriction on (bswap shl(x,c)) combine (#193679)
ef82b673fb56 [RISCV] Pass Subtarget to CC_RISCVAssign2XLen. NFC (#193609)
f99880ef8cc6 [libc][NFC] Fix typo in GPU test warning message (#193762)
4300a3967bd2 [Comgr] Fix hotswap asm parser SourceMgr crash on bad input (#2295)
91b0fbc6bc1e [X86] Use getTargetVShiftByConstNode helper to reduce code duplication. NFCI. (#193736)
f19f3cde7fa5 [NFC][AMDGPU] Make code consistent in MCResourceInfo::gatherResourceInfo (#193735)
a829194012f2 [mlir][vector] Generalize castAwayContractionLeadingOneDim (#187312)
bced9f751e89 [MergeICmps] Check for libfunc emittability (#193764)
838fcbb9aa8a merge main into amd-staging
2f5fe2cf312c [analyzer] Fix typo in ExprEngine.h (#193535)
19d97727aafb [LLVM][CodeGen] Ensure SystemZTDCPass::convertFCmp only accepts scalar floating point types. (#193738)
e9ef76d7e27e [LifetimeSafety] Simplify `AccessPath` root `PointerUnion` (#193520)
47523f7f079b [clang][docs] open details of C++{17,14,11} implementation by default (#193141)
72b061857e0b [clang][docs] fix typo; NFC (#193648)
c32d2d1f2951 [flang] Add the MLIR pass pipelines for dumping (#183144)
a88516baa735 [lldb/test] Update remaining `filecheck` call sites to use `filecheck_log` (NFC) (#193654)
f6c4280ea906 [libc][docs] Add poll.h POSIX header documentation (#122006) (#193734)
8212cab4128d [lldb/test] Relax NSBundle formatter test for Darwin embedded platforms (#193659)
0ff393f5dac1 [NFC][Target] Fixed rule-of-three for RegisterTargetPassConfigCallback class (#193470)
ff125ae1bae7 [ConstantFolding] Constant fold nextafter and nexttoward (#168794)
2a5126748fb5 [AMDGPU] Fix s_cselect scc clobber issue (#193498)
49dad1672430 [flang] Ignore -fno-realloc-lhs for polymorphic allocatable LHS with warning (#192697)
41d05aef9c63 Revert "[Lit][NFC] Refactor shell environment functionality and in-process builtins from TestRunner.py into new modules (Reopened)" (#193740)
9b986d49d6ca [Lit][NFC] Refactor shell environment functionality and in-process builtins from TestRunner.py into new modules (Reopened) (#177358)
1ee288a2b51d [SPIR-V] Combine storage class bit with atomic memory semantics (#193696)
5a9d0cf7190f [PowerPC] Add mnemonics to paddis (#179979)
1881b1cf436d [Bazel] Fixes cd26e99 (#193729)
ba0e4af50fd4 [CodeGen][NFC] Do not iterate in DCE unless needed (#193355)
1bcdf716ae23 [AMDGPU] Add a sched group mask for LDSDMA instructions (#190872)
ea7a1782fbea [clang-tidy][cmake] clangTransformer cmake fix
4997815e883e merge main into amd-staging (#2297)
1b325745f134 [X86][GISel] lower GOT-relative G_GLOBAL_VALUEs (#181983)
a9731960adc6 [AMDGPU] misched: avoid subregister dependencies (#140255)
d28eeaa99735 [LangRef] Make volatile loads non-willreturn (#192992)
cd26e990ebed [mlir][memref][NVGPU] Move NVGPU ops to IndexedAccessOpInterface (#190430)
25ad2ee86da1 [mlir][IntegerRangeAnalysis] Don't unsoundly update constant lattice (#193546)
01291a8ea59a [LoongArch] Custom legalize vector_shuffle to `xvextrins` (#164375)
43aa40ddc67c [flang][OpenMP] Remove OmpEndLoopDirective from PFT (#193602)
263e4f22fd14 [NFC][LLVM] Simplify IIT encoding for scalable vectors (#191737)
b9a2e843d9b2 [NFC][SPIR-V] Add urem, srem, and snegate tests for integer arithmetic (#193170)
148f5509e6f2 [lldb-dap] Make Breakpoint ids unique. (#193526)
eb29a502b6e8 [Clang] Fix constexpr __builtin_(add|sub|mul)_overflow bugs (#192568)
3f3b50054165 [LV][NFC] Remove more unnecessary passes from RUN lines (#193686)
4645dc7bac06 [NFC][AArch64] Regenerate ldst-opt.ll checks to use update_llc_test_checks (#193712)
e407fc3f3bca [AArch64][GlobalISel] Do not run the Localizer at -O0 (#177359)
d64dd5a2afea [LV] Factor out VF-independent code from cost model (NFC). (#192426)
cd050a0fe32d [Mips] Support mips1 and singlethread ATOMIC_FENCE (#190129)
12411c1e6ac5 [clang][bytecode] 0 bitwidth IntAP values also use one word (#193224)
882527f9f5ae [NFC][SPIR-V] Remove dead non-intrinsic path in selectAtomicCmpXchg (#193692)
4f0af7adaac9 [ISel][AArch64] Add CodeGen support for partial sub reductions. (#186809)
d19e954b83cb [LLVM] Make -use-constant-fp-for-fixed-length-splat the default. (#193264)
1f332ae4f1b3 Fix -Wformat diagnostic after #190965 (#193704)
b7e2f7838974 [Comgr] Fix Windows build: use LLVM_ATTRIBUTE_WEAK for hotswap stubs
7e9561ffd11d [SystemZ] Enable LoopVectorizer interleaving for vectorized loops. (#184306)
e3ebeeca9531 [Coverage] Skip coverage mapping for consteval member functions (#190870)
fc843236738b AMDGPU: Set transient stack alignment to 4 (#193517)
848be2d800db Add missed variable change from a refactor (#193684)
de281fe22055 [SystemZ] Implement getCFInstrCost(). (#191017)
51325ba110fb [clang-tidy] Fix FP in bugprone-exception-escape for bodyless non-throwing functions (#192658)
57c10b0b8032 Add C++20 diagnostic to macro-braces-recovery.cpp. (#192654)
d0a38203bf2f [NFC] Remove assert from AArch64TargetLowering::LowerCTTZ. (#193474)
124fd5997e5f [DA] Remove monotonicity-related code and tests (#193697)
7425ab9d9577 [AArch64] Fix `shufflevector` miscompilation on `aarch64_be` (#193076)
378cd9a307c2 [libc++] Avoid using ranges::upper_bound in <format> (#186781)
de9830fe40bf [InstCombine] Treat sdiv as udiv in foldICmpDivConstant when both operands are non-negative (#188731)
5502053d5786 [MemoryDependenceAnalysis] Disambiguate visited state in non-local pointer dep tracking (#193220)
7239415b6f29 [X86] Add crash test coverage for #193475 (#193690)
2aadaae9a0ba [LLD][MinGW] Introduce --native-def argument (#193598)
1bf0787a1638 [lldb] Remove trailing newlines from AppendErrorWithFormat calls (part 3) (#193527)
9baca0126178 [mlir][tensor] Consolidate tensor fold patterns and rename related file (#192820)
6a06c8bdcbda [BOLT][AArch64] Refuse to run ThreeWayBranch pass (#193252)
5673b0215c06 [RISCV][MC] Remove tautological CHECK-UNKNOWN disassembly checks (NFC) (#193682)
5a45fbb35ea3 [llvm][RISCV] Split LMUL=8 fixed vector fcmp for zvfhmin and zvfbfmin (#193424)
8e2c42b3bcb5 [LV][NFC] Stop running DCE pass in tests (#193521)
4209849cd3db [lldb-dap] Add valueLocationReference for member function pointers (#186837)
52e5d65561c5 [AArch64][GlobalISel] Add fpext bf16 legalization. (#193342)
019cf510ba39 [RISCV][NFC] Rename isZipEven/isZipOdd to isPairEven/isPairOdd (#193674)
3f6aa4dd8052 [CodeGen][NFC] Remove InsertPt since it's always the same as MI (#193668)
582db3c2371d Revert "[clang][modules-driver] Add support for C++ named modules and `import std`" (#193677)
9152f212208b [compiler-rt] Set CMAKE_INSTALL_MESSAGE to NEVER for custom libcxx (#193666)
dd13552783ce Reland: [LowerTypeTests] Add debug info to jump table entries (#193670)
c1ff819a184e [mlir][LLVMIR] Extend FP array-splat constant lowering (#192378)
739c45916d4c [clang][modules-driver] Add support for C++ named modules and `import std` (#193312)
e3b3706ceeea Revert "[compiler-rt][asan] Add asan checks for __builtin_assume_dereferencable" (#193655)
b25ccddf84d0 [libc] Readd instructions on building kernel headers from sources
85c13cea0ba6 [Flang][Semantics] Allow EVENT_TYPE, LOCK_TYPE and NOTIFY TYPE to be deallocate (#192940)
26cc17f4bcde [libc] Drop elf.h include from dl_phdr_info header
cc91dbbd274d [clang][modules-driver] Reject module definitions in non-module inputs (#193629)
968e34e09ab5 AMDGPU/GlobalISel: RegBankLegalize AGPR support and gfx908 MFMA rules (#192603)
642d0167de9c [WebAssembly] Fix wide bitmask fallback in performBitcastCombine (#190915)
46bb3789ef6c [ScheduleDAG] Avoid duplicate worklist entries in ComputeDepth/ComputeHeight. NFC (#192023)
a3f1035b48a0 Revert "[LowerTypeTests] Add debug info to jump table entries" (#193663)
66beeecd4694 [NFC][GlobalISel] Use move capture for SmallVector in LegalityPredicates lambdas (#193464)
2a7313a2ff42 [LV] Relax OutOfLoopUses check in `getMinMaxRecurrence()` (#189906)
60cd34d22175 [SPIR-V] Fix OpTypeImage capability requirements for Vulkan (#192626)
2fe8966d029b [SPIR-V][docs] Document supported extensions missing from SPIRVUsage.rst (#193449)
abb4ff508e8f [LowerTypeTests] Add debug info to jump table entries (#192736)
fafcafd6eb0f merge main into amd-staging (#2292)
c74951c6c307 Revert "Reapply "[JTS][Passes] Enable JTS By Default" (#193409)" (#193649)
2855525c4a1f [ELF] Handle INCLUDE like a call stack (#193427)
96bc719fbad5 [flang] Add Flang Community Call notes for 4/22/2026 (#193575)
50916c4319a1 [CIR][RISCV] Support zbb builitin codegen (#188932)
80f540b6e312 merge main into amd-staging
09602502a9fd [PSDB][Linux] add render group access to aomp smoke test container
b2ae992193e3 [RISCV][CodeGen] Add initial CodeGen support of vpair{e,o} (#192918)
06a7d41eb12b [flang] Disable copy-out to INTENT(IN) args (#192382)
1a772bc616cf [X86] Improve FREEZE node elimination for SETCC operations (#192362)
a1a40cb725f4 [lldb/test] Fix shared library symlinks for remote testing (#189177)
0d0595b50f84 [SPIR-V] Encode Atomic metadata as UserSemantic string decoration (#193019)
46e09c516ffc [ExpandMemCmp] Pre-collect memcmp calls to improve compile time (#193415)
793bdd859789 [libc][CndVar] reimplmement conditional variable with FIFO ordering (#192748)
ccc608f11937 [DirectX] Implement lowering of Texture Load and Texture .operator[] (#193343)
417f5bc95296 [NFC][sanitizer_common] Fix getpw_getgr.cpp test for large groups (#193625)
cdbb6704a4ce [SLP][NFC] Precommit test for strided store revectorization (#191569)
90209202d336 [CIR][NFC] Delete unnecessary errorNYI call in emitDelegateCallArg (#193608)
819aabfe1ba1 [lldb] Update filecheck_log to use direct input (NFC-ish) (#193618)
2ca5abe3f593 [SPIR-V] Handle ASM with multiple outputs (#187128)
2c7b820bc8fd Ensure that the Synthetic children of a ValueObject are managed by their parents ClusterManager (#192561)
9f5e0ac8a8f2 [libc] Add some more segment type macros
802de7ebd18e [offload] Allow replay repetitions and report basic timing (#193388)
e68d91afdff3 [NFC][SPIRV] Introduce function to handle 64 bits overflow (#193088)
107701b3e619 Revert "Reapply "[SimplifyCFG] Reuse function comdat for switch lookup table"" (#193582)
8a12b26feb0c Revert "[libc] Replace check-libc with lit-based test execution" (#193610)
19b40f71fd70 [SPIR-V] Add SPV_AMD_weak_linkage extension (#193307)
4b44e2039c78 [fuzzer] Set target_cflags instead of target_flags in lit config (#191510)
083cab66fa7c [SLP] Precommit tests for strided store reordering (#193565)
e4e8bcb54da5 [RISCV] Expand vp.and, vp.or, vp.xor (#193542)
7431a4f984be [SLP]Fix dominance for multi-use copyable scalars in scheduled bundle
24be43f5c5f1 [VPlan] Pick correct insert point after creating canonical IV. (#193587)
1bec68a29602 [RISCV] Remove codegen for vp_abs (#193533)
dec3b1fea9b9 [lldb] Fix empty backtraces for scripted threads with no artificial frames (#193387)
ed2f5f42ee26 AMDGPU: Skip last corrections in afn f64 reciprocal (#183696)
59596d789eac [AArch64][GlobalISel] Add hadd-combine globalisel test coverage. NFC (#193591)
b9b472c8740a [NFC] Add check lines to concepts-out-of-line-def.cpp to fix failure (#193579)
698dce153ab5 [flang] Fix inline transfer for unsigned integer types (#193570)
fed79d4c91b5 [RISCV] Expand vp.inttoptr, vp.ptrtoint (#193530)
97015ad916c4 [HLSL] Disallow `volatile` keyword (#193322)
0062071c2759 [CIR] Fix a dangling reference to a replaced global (#193561)
6ef1b80feff5 [BOLT] Fix null pointer dereference in DWP processing with split DWARF (#191474)
5d9a1c172bae [lldb] Eliminate linear scan in SetSectionLoadAddress (#193560)
1c6ab1136504 Add expand-fp-math.ll to profcheck-xfail.txt (#193577)
1fe66f66d239 [llvm-objdump][offload] Fix offload bundle decompressing (#192729)
ff87dca5c5b4 merge main into amd-staging
99b369246cb0 Revert "[offload] Fix synchronization when record replay is enabled (#193291)"
335f9f95bc0b [HLSL] Reuse temporaries of aggregate types in list initialization (#191605)
1aad4a25a917 [PreISelIntrinsicLowering] Expand all unary elementwise intrinsics (#193552)
e2f66182fa60 [clang][Modules] Avoid checking for duplicating module definitions when a module does not have a valid definition location (#193534)
fb024337497f [Clang][AST] Introduce `ExplicitInstantiationDecl` to preserve source info and fix diagnostic locations (#191658)
cfa133a90772 merge main into amd-staging
50d7c990d9b3 [flang][OpenMP] Support user-defined declare reduction with derived types (#190288)
ebf14ed8b8e0 [CIR] Fix lowering of strings in constant array attributes (#193553)
0dbf7373b51e [LangRef] inline asm: the instructions are treated opaquely (#157080)
55762f305866 IR: Allow !fpmath metadata on homogeneous float structs (#193537)
be529fc55f2f [SLP]Fix scheduling of copyable bundle with commutative op used outside parent PHI
80efad535e81 [CIR] Support guard COMDAT for weak linkage in LoweringPrepare (#193274)
37be0841b30b Reland: [MemProf] Dump inline call stacks as optimization remarks (#193545)
da7ee36ad521 Revert "[clang] fix matching constrained out-of-line definitions of class specialization member function templates" (#193558)
8f2935c2ebe3 Loosen check for clang version string in test to work when setting CLANG_VENDOR. (#192961)
38874e1897e3 [GlobalISel] Change SSUBO to do (LHS < RHS) XOR (RESULT < 0) (#191744)
9e649076b9d6 [libc] Replace check-libc with lit-based test execution (#184163)
6e4fb52144fe [VPlan] Use MaxRuntimeStep in materializeVectorTC to simplify middle br. (#193067)
4f1be838a9f1 [compiler-rt] [Darwin] Enable arm64e tests on macOS (#193391)
440872232bbe [NFC][MachineBlockHashInfo] Add static asserts to guard agains hash_16_bytes changes (#192862)
91fe498ccc31 Revert "[SelectionDAG] Salvage debuginfo when combining load and z|s ext instrs. (#188544)" (#193554)
5d0143187437 [lldb] Scope symbol lookups to specific modules in ObjC/SystemRuntime plugins (#193379)
c3c8e40b6cb1 [Runtimes] Allow HandleLibc.cmake to be called multiple times (#193540)
d9bbb902fe8f [LegalizeTypes][DAG] Use SHL(X,1) instead of ADD(X,X) for variable vector indices for extraction/insertion legalization (#188277)
8f1b0f632756 [lldb] Decorate tests that use threading (#193117)
eb427a4cbbed [libc][NFC] Fix minor RPC warnings (#192997)
18bd7e409217 [Bazel] Fixes e52df04 (#193548)
2f5ccd4aaa3e [MLIR][XeGPU] Do not use ocloc lib if LLVM_BUILD_LLVM_DYLIB is ON (#193259)
b0166e7a2094 [libc] Fix .params file generation for integration tests (#193544)
bd09b03b1b09 [NFC][ADT] Make a few functions constexpr (#193302)
7a633290d6b1 Revert "[Support][JSON] Use `std::unordered_map` for object storage" (#193549)
7136a4b39b05 [ELF] Factor linker-script dispatch loops into helpers. NFC (#193547)
d2673ad6b0eb [RISCV] Expand vp.fshl, vp.fshr (#193225)
e52df047f762 [Support][JSON] Use `std::unordered_map` for object storage (#171230)
c3bd0c12943c [lldb] add terminfo name (#191740)
fb3ab402c1dc [lldb/test] Fix BacktraceRecording path for Darwin embedded devices (NFC) (#193436)
f1c4db6aef43 [SelectionDAG] Change SSUBO to do (LHS < RHS) XOR (RESULT < 0) (#191747)
836c77bc0282 [libc][docs][NFC] Rename Maintainers.rst to Maintainers.md (#191882)
94b9accfab9e [RISCV] Remove codegen for vp.fcmp (#193529)
5ea5b9eb8eec [profcheck] Fix assert in getInitializer call on global with no initializer (#193514)
8c82aa0a5070 [lldb] Log clang module loads (#193389)
cdc0a9073733 Revert "[ASan][Windows] Fix memmove/memcpy interception on x64" (#193524)
bb092120f1f0 [Hexagon] Non-pie default on hexagon-unknown-elf (#193184)
2072474a24c4 [OpenMP][OMPIRBuilder] Support complex types in atomic update/capture (#191490)
fb6f1bde00c4 [ItaniumDemangle] Strip __alloc_token_ to transparently demangle allocation functions (#191048)
42ef1321e6cb [MLIR][BUILD]: Fix for 36331abd8cbb630fc174e182f1580e7cdefd2616 (#193523)
9a63d044471d [lldb] Fix inappropriate uses of LLDB_INVALID_IMAGE_TOKEN (#193365)
fc910693c344 [AMDGPU] comgr: add HotSwap B0-to-A0 policy and public API (3/3) (#2203)
3950da0bc764 [RISCV] Add isKnown method to VSETVLIInfo. NFC (#193406)
58f3d7810211 [RISCV][P-ext] Custom legalize vector (setne X, allzeros) and (setgt X, allones) (#193360)
8fc58340444b [GIsel] Use changeElementType for cond types in LegalizerHelper (#193049)
36331abd8cbb [mlir] targeted verification for transform "inlining" (#192956)
6f115abef8c9 [lldb] Remove unused ExpressionPathOptions: NoFragileObjcIvar, NoSyntheticArrayRange (NFC) (#193336)
9d704b490d5a [flang][OpenMP] Remove unused member, NFC (#193512)
93419bf99ede [AArch64][ISel] Use TripleOpVT in LowerVECTOR_INTERLEAVE (#193506)
b13867d5146b [compiler-rt] Initial support for compiler-rt builtins on SPIRV64 target (#192897)
20005a09f950 [flang] Update LIT test for big-endian platform (NFC) (#193309)
8144c14a742f [lldb] Fix assert frame recognizer for non-macOS Apple platforms (#193435)
d154ccd8d4a1 [AArch64][ISel] Add lowering for fixed-width deinterleave3 (#192972)
54fcd8620313 [libc] Add struct sockaddr_storage (on linux) (#192978)
e1ab08a5bd26 [RISCV] Functional llvm.vector.reduce.mul on scalable types (#193094)
59bf8960f28d [lldb] Remove trailing newlines from AppendErrorWithFormat calls (part 2) (#193168)
8f51fe418509 arm: fix float to integer conversion with `+mve` (#193319)
b77a894afa2e [clang][bytecode] print array root state in Pointer::print() (#193494)
1364f522a773 [LoopInterchange] Fix out-of-bounds accesses in tests (NFC) (#193272)
4dbb7ee833d5 [DAG] visitFREEZE - revisit frozen node after merging with unfrozen uses (#188206)
ba767d0bbbde [MachineCopyPropagation][NFC] Refactor EliminateSpillageCopies (#192609)
85dc81166ac3 [X86] Add TODO for nsw+nuw handling to (add (add X, Y), X) --> add(add(X, X), Y) (#193503)
475639a10ecd Add SPIRV to excluded profcheck targets (#193509)
20a7d26297c2 [SPIRV] Fix legalization of zero-sized intrinsic globals (#192730)
c8b526f76b63 [bolt] AArch64: Fix TLSDESC to LE relaxation by mold (#190370)
eda6c60a051f [offload] Get kernel argument sizes from Level Zero (#192487)
f5e80c985804 [Flang] Add SIMD Compiler Directive (#192969)
a623bd913507 [lldb] Add full stop to "memory tag" help (#193505)
f66b3baed254 [libc][math] small typo (#193349)
94278b30b957 [flang] Move ResolveAccParts and ResolveOmpParts into better location… (#193497)
385e7eaafedd [clang] isConvertingBoolWithCmp0 - fix MSVC "not all control paths return a value" warning. (#193495)
124f14cbf7d3 [Passes] Remove Os/Oz from pass option listings (#193491)
c6e90081a7bc [InstCombine] Remove support for volatile in phi of load transform (#193154)
2e0112ff197d Enable disable LSPs in extension (#191957)
e7fd6fe12faa [NFC][SPIR-V] Consolidate OpVariable insertion point logic into getOpVariableMBBIt (#193433)
7f703cabf728 [MLIR][AsmParser] Fix non-deterministic SSA value completion order under LLVM_REVERSE_ITERATION (#192150)
39865a002e6b Revert "[lldb][test] Add support for building Wasm test inferiors" (#193493)
1b7de19f86ee [mlir][vector] Prevent masked transfer read/write identity folding (#192966)
90fa375b8c18 [X86][NFC] Reorganize fadd, fsub, fmul and fdiv selection tests (#193012)
b3319caafa4c [LLVM]Codegen][X86] Add vector ConstantInt/FP support to CollectConstantBits. (#193249)
3bb7c2c6f799 [MLIR][BUILD] Fix for c1cff89b (#193489)
4d18c1061dad [VPlan] Prefer checking opcode over underlying value (NFC) (#193463)
e43f3232f21f [VPlan] Permit licm-sinking recipes with no users (#189957)
58127f3ebffd [OpenMP] Fix OpenMP device subdir installation w/ multilibs (#193378)
62ae7e4786d9 [ASan][Windows] Fix memmove/memcpy interception on x64 (#192060)
64c9a758394c Revert "[LifetimeSafety] Add support for `new`/`delete`" (#193482)
2039a51881bb [libc++][ranges] P2164R9: Implements `views::enumerate` (#73617)
61e5c13fba2b [X86] Add baseline tests for #144231 (#193484)
d137e6601f1c [libc++] Remove apple_availability.h (#192851)
cd60aed5f8d7 [flang][OpenMP] Move directive deprecation check to semantic checks (#192796)
9673f1f8ce8b [SPIR-V] Handle [N x i8] byte addressing in SPIRVEmitIntrinsics (#192994)
8e1095ff01c1 [AArch64] Only prefer partial reductions if cost is lower. (#191369)
a166f0b2c7c4 [AMDGPU] performSraCombine - SRA(X,BW-1) - don't freeze HI operand for single (repeated) shift (#193468)
567583cbfee5 [Clang][SystemZ] Fix unwanted unsequenced volatile accesses in codegen tests (#190212)
b965d52bbbcc [LLVM][GlobalISel] Remove unnecessary comment (#193333)
5659f86af5ab [clang] Implement -fstrict-bool (#160790)
2a3639cd085f [DAG] computeKnownFPClass - add ISD::EXTRACT_VECTOR_ELT handling (#190307)
6fcc4d701da8 Revert "[Clang] Diagnose UB and emit error when identifier has both internal and external linkage" (#193462)
17be5a7dee8d [debugserver][NFCI] Factor out logic handling breakpoint packets (#192912)
5fc5c1120230 [clang-tidy][readability-identifier-length] Add a line count threshold (without std::transform_reduce) (#193276)
6d097d240dd6 [clang] Suppress glibc C11 extension warning in `c-index-test` (#193335)
b2f3532e9fb1 Revert "[Bazel] Fixes 8e56a89" (#193459)
c1cff89bdcea [mlir][GPU] Refactor GPUOps lowering (#188905)
b313bb714528 [Clang][AArch64] Lower NEON fcvtz{u/s} intrinsics into fpto{u/s}i.sat (#191365)
a5f7f4962751 [mlir][linalg] Fix crash when folding tensor.cast into unpack using static packed shape for inner tiles (#188000)
34a8d497d29a [Bazel] Fixes 8e56a89 (#193450)
1d9775f68440 [LV] Change VPLane::getAsRuntimeExpr to use constant 64-bit indices (#193206)
f5f107e01778 [LLVM][SelectionDAG] Reduce chances of a split VSETCC being rewidened. (#191438)
8f6598133b37 [clangd][test] Fix test failures when LLVM_WINDOWS_PREFER_FORWARD_SLASH is ON (#193160)
808af6fd987e [LLVM][ConstantFolding] Use correct type when flushing denormals. (#193254)
d368c5728fcd [DA] Remove unnecesasry SCEV negation operation (NFCI) (#193447)
582958c4337f Revert "[clang][ssaf][NFC] Rework how the Force linker anchors are defined and used" (#193451)
8fea7910b0b4 [CIR] Fix __builtin_clz/__builtin_ctz poison_zero to respect target (#192865)
c37764cc00f2 [IRBuilder][NFC] Add `CreateFAbs` helper (#193421)
8e56a89c8f28 [clang][ssaf][NFC] Rework how the Force linker anchors are defined and used (#189409)
3e7c207ebe35 [Flang][OpenMP] Fix DEFAULT(NONE) check for Cray pointers in nested OpenMP directives (#190764)
7a154470f51f [Flang] bug: preprocessor increases backslash to double backslash (#191512)
86b9775612f8 [Passes] Remove Os and Oz optimization pipelines (#191363)
34a917a53e51 [mlir][spirv] Add SPV_EXT_FP8 type support to SPIR-V TOSA ops (#193199)
d24ebe3f00b3 [Support] Add std::string overload for llvm::sys::path::native (#193228)
d3ee88b18e22 [lldb] Fix pexpect detection with LLDB_ENFORCE_STRICT_TEST_REQUIREMENTS (#193444)
2711d8a50a13 [X86] Recognise vectors with zeros in all upper elements to improve VMOVS*Z folding (#193263)
0da0163d3ace [SelectionDAG] Preserve poison in IS_FPCLASS folds (#193246)
4aecd0454574 [libcxx][test] Skip cas_non_power_of_2.pass.cpp in Picolibc build (#191415)
7af4283bc96c [NFC][SPIR-V] Use getScalarOrVectorComponent{Count,Type} instead of raw operand access (#193410)
9ec6788421ac [lldb] Add HTTPS tests for SymbolLocatorSymStore (#192274)
35480b22737f [SPIRV] Migrate NSDI emission from a machine pass to DebugHandlerBase (#191212)
22bb938f873f [CIR][AArch64] Lower NEON vminv intrinsics (#192901)
83b4c5cd71c8 [NVPTX] Add intrinsics for narrow-fp to bf16 conversions (#191376)
cd145989585e [AMDGPU][NFC] Eliminates the redundant code in the AMDGPUTargetMachine.cpp (#193169)
5ef29d1d8bb5 [ADT] Add predicate based match support to StringSwitch (#188046)
f4cc934dcbca [LV] NFCI: Create VPExpressions in transformToPartialReductions. (#182863)
9435160a040b [MLIR][NVVM] Update SM version requirements of Ops (#192257)
a6d14db61db4 [clang][bytecode] Fix DefaultInitExpr base pointer in IndirectFieldDecls (#193149)
1edcd7473fb1 [clangd] [Modules] Refactor cache to support duplicated module name (#193413)
a61de4b50c04 [mlir][spirv][nfc] Clean up FP8 and BF16 SPIR-V type tests (#193196)
bfd6ea0241ec [ELF] Improve allocateHeaders tests (#193419)
61f9516af963 merge main into amd-staging (#2283)
cfa67454e9d4 [clangd] Add go-to-definition support for fields in offsetof expressions (#192953)
efa0f22883b5 [RISCV][MC] Emit ISA mapping symbols on .option arch/rvc/norvc/pop (#193123)
fde2e27a017c [clang][modules] Fix false positive -Wweak-vtables in named modules (#193136)
bb762095af80 Reapply "[JTS][Passes] Enable JTS By Default" (#193409)
0f4fb3b2426e [NFC] [MC] Fixed rule-of-five for MCPseudoProbeDecoder class (#193181)
554edb289bb2 [ELF,test] Cover empty INCLUDE inside MEMORY { ... } (#193411)
ee06802dc4a8 [JTS] Correctly handle all zero profile values in VP metadata (#193402)
dee5769870ce [lldb] Fix potential TestAlwaysRunThreadNames flakiness (#193405)
bf7ced3bc901 [lldb] Remove ENABLE_THREADS from Makefiles that don't need it (#193363)
d0bb0c837876 [ELF,test] Convert INCLUDE tests to split-file pattern (#193403)
b3a5d146fc20 [lldb] Doxygenify comments in AppleObjCRuntimeV2 (NFC) (#193401)
eb4296f98ad7 [llvm-mc][AsmMatcherEmitter] Fix the minimum ConversionTable entry size (#191977)
a843c699cc89 Revert "[JTS][Passes] Enable JTS By Default" (#193399)
52914600bc77 [revPat] update revert_patches
c643aa496ba7 merge main into amd-staging
f4bf7297963b [lldb] Add exe_ctx to examples commands (#193347)
e02c089a857e Revert "[compiler-rt][asan] Add asan checks for __builtin_assume_dereferencable (#190871)"
3aeb3c191d36 merge main into amd-staging
a680361bcdaa [clang-tidy] Suggest materializing temporary ranges in readability-use-anyofallof (#185791)
b5f7bc55573c [Bazel] Fixes 3081d52 (#193376)
2531a6730ddd [clang][DebugInfo] Set linkage name for dynamic initializer/destructor debug info (#189794)
653030b4c627 [PDB] Refactor cache strategy for function symbol lookups (#188927)
523c26f9c2c7 [clang-tidy][NFC] Add a unittest for checking list.rst (#193134)
3081d52d8242 [MC][debug_frame] Fix a bug in MCDwarfFrameEmitter::emit() so that per-function CIE can be generated when CIEs are different (#192727)
506c3f17b450 [clang-tidy] Fix false positive in readability-convert-member-functions-to-static for const overloads (#191712)
f8ab74283f74 [runtimes] Enable Fortran only with explicit CMAKE_Fortran_COMPILER (#193332)
20c8f4ca8e50 merge main into amd-staging
820654dca4f8 [UnsafeBufferUsage][SSAF] Change -Wunsafe-buffer-usage API for SSAF-based analysis (#191934)
03bfba583532 [AMDGPU] comgr: add HotSwap MC/LLVM infrastructure (2/3) (#2202)
f2e4fcd134d1 [NFC][LowerTypeTests] Add AArch64 and X86 jump table tests with debug info (#192735) (#193358)
0903c314622a [Extractor] Use function return for the one and only output (#191824)
1cbd27f1ddb0 [ConstantFolding] Increase folding limit for vector loads to 128 bytes (#192775)
4d83691e290b [lldb] Fix a couple of return type / return value mismatches (#191464)
d87ac8716018 [offload] Fix synchronization when record replay is enabled (#193291)
404609d013f6 [runtimes] Enable Fortran only with explicit CMAKE_Fortran_COMPILER (#193332)
5892e34a9613 [BoundsSafety][NFC] Move LateParsedAttribute outside Parser class; move LateParsedAttrList to DeclSpec.h (#192145)
b48d8a54e29f Support loader arguments in GPU hermetic tests (#193341)
54c1b3050cb9 [libcxx][Github] Bump container version (#193351)
fc0f32196d27 [libc][NFC] Remove trailing whitespace from LLVMLibCTestRules.cmake (#193350)
7014ce846164 [bazel][lldb] Fix missing dep in ScriptedProcess (#193348)
2b63e5e75dba [LFI][libunwind] Avoid writing to reserved registers on the `aarch64_lfi` target (#192739)
daade8e56f82 [CI] Fix cross-project-tests dependencies (#193323)
d623ee801ce4 [SSAF] Fix -Wunused-variable (#193344)
d7d2c0ca6afd [bazel][clang] Fix build for #191932 (#193337)
e07f4b2d54f3 [lldb/DWARF] Support 5-component Swift version in DW_AT_producer (#193305)
d76111a9650d [libcxx][Github] Bump Github Runner to 2.334.0 (#193339)
9a6b93d5388d [X86] Regenerate vector shifts tests to reduce diffs in #188206 (#193325)
a1d0a0246161 [mlir][func] Avoid to create duplicate symbol during conversion (#192342)
2ed87bad8581 [clang-format] Add c++23 and 26 to the configuration (#193327)
e0b4a7063f78 [compiler-rt][profile] Use runtimes-libc-headers in the GPU runtimes build (#192814)
92958a0631c4 AMDGPU/GlobalISel: RegBankLegalize rules for DS barrier arrive atomics (#192767)
5ee4c51c1a9c [SSAF][Analyses] Add an AST visitor for the contribution model (#191933)
368ee151c774 [bazel][lldb] Add target for new plugin (#193316)
d4650168f0ea [lldb] Directly access object variable in GetObjectPointerValueObject (NFC) (#193120)
f29d0b4329d9 [CIR] Cache isSafeToConvert results to avoid redundant record layout … (#193122)
bffb208404fd [libunwind] Add SME detection for ZA test on OpenBSD / FreeBSD (#193148)
20ce456138fa [LegalizeTypes][RISCV] Don't widen expandload or compresstore with VP_LOAD/VP_STORE. (#193294)
6688763f44a3 [libc] Improve lit test discovery and execution (#192993)
27cc83383d19 [cross-project-tests] Add llvm-modextract as a dependency (#193296)
0bbd61a33960 [Github] Bump Github Runner Version to 2.334.0 (#193318)
99e4f6a05f59 [lldb] Add synthetic variable support to Get*VariableList. (#181501)
e7b103798d0d [CIR] add pairwise-addition-and-widen support (#191845)
b1175088531d [LifetimeSafety] Add support for `new`/`delete` (#192504)
7318bc734a5b [Clang][AMDGPU] Use unsigned for D# (#193310)
21dcb13f6d67 [flang][acc] Update fir.convert rematerialization handling (#193301)
346480e0abb5 [AArch64] Add more scalar testing for hadd patterns. NFC (#193313)
24464f6c2c36 [RISCV][GlobalISel] Lower G_MEMCPY_INLINE (#192671)
f9437779f447 [Attributor] Use trivial no side effects check in isAssumedSideEffectFree (#193303)
760bc124c189 Reapply "[JTS][Passes] Enable JTS By Default" (#193300)
f1baa17f7920 [libc] Add wcsxfrm (#191692)
a4538a3ad902 [NFC][SSAF] Extract common code in Analyses to a shared file (#191932)
11515959b571 [BOLT] Fix stream position before appendPadding in writeEHFrameHeader (#193126)
7bf48ec95c7c [AArch64] Remove xtn.ll test. NFC (#193306)
8bebc5847663 merge main into amd-staging (#2278)
ae9cb64645fd [VPlan] Clean up VPWidenPHIRecipe constructor (NFC). (#193297)
f1b65b96aa50 [PowerPC] Fix ADJCALLSTACKUP and ADJCALLSTACKDOWN def (#184696)
50241dcd08c8 [AMDGPU] Reland "Fixed verifier crash because of multiple live range components." (#190719) (#193286)
6a9ed459ce22 [JTS] Add a temporary workaround for multiple zero GUIDs (#193292)
b0fe500e7842 [lldb] Make global lookup in DIL configurable by caller (#192592)
bf8cf4b7b31b [compiler-rt][asan] Add asan checks for __builtin_assume_dereferencable (#190871)
1e2175ec4df7 [Clang] Diagnose UB and emit error when identifier has both internal and external linkage (#192116)
facb9ab34ffb [LV] Remove IV use restrictions for epilogue vectorization. (#190552)
0a8ed875075b [clang][deps] Simplify scanner VFS (#190843)
6997cc8c0f84 [docs] Add missing command line options to llvm-profgen.rst (#192890)
8cc12bcf730e [clang][bytecode][HLSL] Complete the HLSL aggregate splat and elementwise cast implementations, and enable the new constant interpreter on all HLSL tests with static asserts (#189126)
dd5632f51d3f [gn] port 40fcd2517a110 (#193293)
d4e2850a8412 Update [Github] Update GHA Dependencies (#176676)
99457c368586 [CIR] Upstream VectorType __builtin_astype (#192859)
9c2e67721adf [X86][GlobalISel] Ignore non-vregs in regbank mapping (#182880)
5299e00a73f7 [RISCV][TableGen] Use ArrayRef instead of vector&. NFC (#193267)
0a59b51a783a [RISCV] Add a getTargetStreamer helper to RISCVAsmPrinter. NFC (#193250)
bde6226911f6 [Github] Set persist-credentials in libclang-python-tests.yml (#193282)
4cdd9883943d [Clang] Allow VDBPSADBW intrinsics in constexpr (#188887)
bddd3d32bc64 [lld/mac] For catalyst outputs, tolerate implicitly linking against arm64e mac tbd files (#193065)
d794e04651f3 [Clang][PowerPC] Add DMF crypto builtins for extended mnemonics (#185961)
06b85c8bb2ed [SSAF][UnsafeBufferUsage] Make UnsafeBufferUsageExtractor a registered ASTConsumer (#191931)
849de61619cc [APINotes][unsafe-buffer-usage] Add [[clang::unsafe_buffer_usage]] support in APINotes (#189775)
0d45876e43a8 [ROCDL] Add dot intrinsics to rocdl (#193129)
40fcd2517a11 [lldb][test] Add support for building Wasm test inferiors (#192872)
8d21e4e692bc AMDGPU/GlobalISel: RegBankLegalize rules for BVH intersect ray (#192583)
febd3de07dba [clang] Get the directory identity from `ModuleCache` instead of `FileManager` (#193070)
c7eea85b8046 Revert "[llvm-cov] Fix error propagation in CoverageMapping::load() (… (#193266)
981a9e5acb49 AMDGPU/GlobalISel: RegBankLegalize rules for amdgcn_ballot (#193105)
5f33bbeb8375 [clang] Exclude trailing colons from param command names (#192598)
4f2d572349c4 [clang][NFC] Prevent scope pollution from repeat type specifiers (#193144)
10f8205c6f7e [SLP]Fix stale deps for operands of non-scheduled expanded-binop parents
6c5b4a73fb25 [lldb] Move GetTypeSystemFromCU to DILEval.cpp (NFC) (#193245)
655f38fad8bf [gn build] Port b799d7e8f8bc (#193262)
e86ed67d9f24 [LV] Improve code around all_of, any_of (NFC) (#193150)
57409d7558f1 [gn build] Port acc3f73113ab (#193261)
002b2dc6b2a0 [gn build] Port 9b8635f3247d (#193260)
adf12074a580 [gn build] Port 4acbf997891c (#193258)
93dce0bf4332 [VPlan] Strip null-check in partial-red casts (NFC) (#193162)
980ddce138a3 [CIR] Implement variably modified type parameter handling (#193072)
8226604fbbc9 [libc][math] Implement a code-size optimized version of powf. (#190984)
53a33fa6ec9a Add missing comment (#193050)
9576adbb96c7 [RISCV] Expand vp.frem (#193218)
9a1f716941b2 AMDGPU: ds.atomic.barrier.arrive.rtn.b64 is a source of divergence (#192765)
b8386954ac11 [flang][OpenACC] Limit hoisting out of compute operations. (#193099)
b799d7e8f8bc [libc++] Implement `std::constant_wrapper` (#191695)
e39e73c91ceb [IR] Avoid redundant TrackingMDRef reassignments and DebugLoc copies (#193018)
edd6797bd0f4 [LLVM][BUILD] Fix for #177158 (#193238)
e535fbfb9851 [clang] Add typed variants for C23 stdbit.h builtins (#192718)
3ee56ef44322 [runtimes] Ensure INSTALLed directory exists (#193243)
66cd11f73ef1 [bazel][mlir] Port a1dfc8d64e1faa752f020a8212782362b179416d (#193241)
e7103645c2e9 [compiler-rt] Implement __clear_cache for Hexagon (#188411)
406dc4e34008 [Hexagon] Handle FK_Data_8 fixups in ELF object writer (#192149)
2b76ec744d78 llvm/test: Fix incremental bots after revert of #190719 (#193234)
347dc1321ed5 Reapply "[SimplifyCFG] Reuse function comdat for switch lookup table" (#193229)
7a7d5936532a [LoopVectorize] Add minsize attribute to test (NFC) (#193223)
95a960114e92 [lldb] Skip memory region probing in FindSpace when process can't JIT (#193124)
665f5c0ee89d [lldb][docs] Add FOSDEM talks to the links page (#193015)
e23c053d25fa Fixed issue of use after move (#193175)
583f2949a841 [SLP] Normalize copyable operand order to group loads for better vectorization
e4afaa1fcfcc [RISCV] Further improved exact VLEN lowering for mul reductions (#192688)
d841a9383d5d [SPIR-V] Deduce argument types before doing GEP (#193046)
6a6f3b07e607 [SLP][NFC]Add a test with non-reordable operands of non-commutative copyables, NFC
216bccbee137 [SPIR-V] Handle constant expression uses of PushConstant globals (#193005)
1a0269771e0f [LIT] Add -nostdinc so system headers aren't searched with implicit module maps (#192125)
074965c22152 merge main into amd-staging
a2011b113acd [LV][RISCV][NFC] Update strided-accesses.ll to UTC version 6 (#193211)
e5925fb3a7fe [NFC][llvm-objdump] Use CHECK-NEXT in MachO tests (#192696)
dc73cabfa38c [X86][AVX10.2] Skip FP2I/I2FP customizations for bf16 (#193137)
8abcce069978 [LoopVectorize] Generate test checks (NFC) (#193216)
941e8ef04ebb [mlir][arith] Add support for `arith.flush_denormals` emulation (#192660)
1566b6344a2b [X86][clang-cl] Make AVX10.2 map to the same target-cpu as AVX10.1 (#193147)
9c2d9448238d [DAG] Reassociate (add (add X, Y), X) --> add(add(X, X), Y) (#162242)
1697b964ffcf [runtimes] Protect use of undefined CMAKE_Fortran_COMPILER (#193210)
d629a221707e [Polly] Disable PCH reuse for unit tests (#193209)
300285ed5f4f [CIR][NFCI] Remove 'isConstant' from getCIRLinkageForX (#193100)
f6f39c6fc172 [LV] Add test for interaction between interleaved and strided load. nfc (#192990)
a976a72c12c5 [AMDGPU] Multi dword spilling for unaligned tuples (#183701)
b7cfcfe03deb [llvm-cov] Fix error propagation in CoverageMapping::load() (#193197)
037a48aa4b9a [InstCombine] fold fabs(uitofp(i16 a) - uitofp(i16 b)) < 1.0 to a == b (#191378)
7134ce5d7215 Revert "[clang-tidy][NFC] add numeric include for transform_reduce" (#193200)
744279b9f173 [mlir][arith] Add `arith.flush_denormals` operation (#192641)
95c583697192 [AMDGPU] Add legalizer rule support for AMDGPU's regbank fminimumnum and fmaximumnum (#192719)
a1dfc8d64e1f [mlir] Add option to run CSE between greedy rewriter iterations (#193081)
ed34ee3a728d [mlir] Assert region is within config scope in RegionPatternRewriteDriver (#193177)
797fc5dde02d [AMDGPU] Prefer mul24 over mad24 on SDWA targets (#193033)
78cb9fbbb08f [DAG] Add Srl combine for extracting last element of BUILD_VECTOR (#181412)
3de6b5c685b3 [mlir][spirv] Fix Float8EXT type conversion legality (#192466)
044e21f04311 [SystemZ] Fix wrong mask for float vec_insert (#192967)
cf1f7c533425 [Attributor] Regenerate test checks (NFC) (#193192)
8e132f78bfb0 [runtimes][CMake] Move Fortran support code from flang-rt (#171610)
af5fb3870a00 [Attributor] Clarify volatile null pointer behavior (NFCI) (#193190)
47918c2c0a88 [CIR] Make array decay and get_element op perserve address spaces (#192361)
c2139f13606f Revert "[SLP] Normalize copyable operand order to group loads for better vectorization"
b3647eb0830f Revert "[clang-tidy][readability-identifier-length] Add a line count threshold" (#193182)
3600cd824d5a [AMDGPU] Unmark wave reduce intrinsics for constant folding (#193142)
853d7c9b2347 AMDGPU/GlobalISel: RegbankLegalize rules for merge-like opcodes (#193026)
fc7c25738635 [libc++] Fix any.cpp not compiling with the minimum header version >= 7 (#193183)
45db5e46b2ef [RISCV][NFC] Remove unused RISCVExtBit (#193153)
d1f4b79ec888 [LICM] Remove unnecessary check during store hoisting (#187529)
b460f296d6dd [RISCV] Remove codegen for vp_sqrt (#191837)
337ad44a3e58 [llvm] Errorize DebuginfodFetcher for inspection at call-sites (#191191)
9584e9c9b269 [LLVM][CodeGen][SVE] Implement custom lowering for insert_vector_elt_nxv1i1. (#192494)
a47551f22099 [lldb][windows] fix script interpreter file parsing (#193006)
a99dd8344ea9 [LV][NFC] Remove unnecessary extra passes from some tests (#193155)
b78a0a02c181 [AArch64][SelectionDAG] Improve codegen for and(sext(Op), splat(1)) (#192405)
bf24d742d942 [RISCV][NFC] Use IfDefEmitter in RISCVTargetDefEmitter (#193151)
5cc7956a2542 [RISCV] Remove codegen for vp_fadd, vp_fmul (#191842)
06e70f60c9cc [flang][debug] Handle USE statements inside modules (#186184)
49f159faa6b0 [clangd] [C++20] [Modules] Read module mappings from commands (#193158)
3db991b5c287 [clang-tidy][NFC] add numeric include for transform_reduce (#193165)
dac2cb9a5a73 [LLVM][BUILD] Fix for #192887 (#193167)
357d61fe48dd [MIR] Always print symbolic INLINEASM operands (#192991)
b550a5e09420 [libc] Fix riscv32 build after #192927 (#193152)
ab4283959fd1 [LICM] Remove unnecessary call arg in test (NFC) (#193159)
e7a2cf1243ba [mlir][SPIR-V] Lower boolean vector reductions (#192267)
b00e3a098681 [libc++] Fix numeric_limits::digits and digits10 for _BitInt(N) (#193002)
615678b37d73 [Coroutines] Add verifier checks for llvm.coro.begin and llvm.coro.id (#192887)
174783d157cf merge main into amd-staging (#2270)
68a27a07be36 AMDGPU/GlobalISel: RegbankLegalize rules for G_BITCAST (#193025)
5dab433d5bdc [lldb] Remove trailing newlines from AppendErrorWithFormat calls (#192965)
766607ca643e [lldb] Add EXPORT to lldb-tblgen (#192610)
59a9aa30e21e [LV] Add flag to always force a scalable VF when feasible. (#182467)
a35e21861f91 [lldb] Fix ambiguous call to create_directories in TraceIntelPTBundleSaver (#191967) (#192025)
547c3ad159fd AMDGPU/GlobalISel: RegbankLegalize rules for undef and constants (#193024)
3c88abe3206b [clang-tidy][readability-identifier-length] Add a line count threshold (#185319)
163d0b1b697c [ConstantMerge] don't merge constants with COMDAT…
Motivation
Pulled in ~6 months of upstream changes, up to commit
eaf55a972c2f174bc99390f8f6701ae816645100.Technical Details
AMDGPU
Structural Reorganization
AMDGPU.tdinto:AMDGPUBase.td,AMDGPUOps.td,AMDGPUAttrs.td,AMDGPUEnums.td,AMDGPUTypes.td.Op Rename
amdgpu.scaled_ext_packed816→amdgpu.scaled_ext_packed_matrixfirstScaleLane:IntIsOneOf<[0, 1]>→IntIsOneOf<[0, 16]>firstScaleByte:IntMaxValue<2>→IntMaxValue<3>New Operations
amdgpu.sparse_mfma— Sparse MFMA (smfmac) on gfx942+. Operands:sourceA,sourceB,destC,sparseIdx. Attrs:m,n,k,cbsz,abid.amdgpu.scaled_wmma— Scaled WMMA on gfx1250+. Operands:sourceA,sourceB,destC,scaleA,scaleB. Shapes: 16×16×128, 32×16×128.ds_barrier_init,ds_barrier_poll_state,ds_async_barrier_arrive,ds_barrier_arrive,ds_barrier_state_phase,ds_barrier_state_pending_count,ds_barrier_state_init_count,ds_barrier_state_phase_parity.make_dma_base_gather,make_dma_base_scatter— descriptor-building for gather/scatter.New Types
TDMGatherBaseType,TDMScatterBaseTypefor TDM descriptor results.ROCDL
New Operations
cluster.id.x/y/z,wave.id.s.nopwithcountattribute.s.get.named.barrier.state,s.wakeup.barrier.global.load.tr4.b64,global.load.tr8.b64,global.load.tr6.b96,global.load.tr8.b128) and DS variants.cluster.load.async.to.lds.b8/b32/b64/b128.global.prefetch,flat.prefetch.ds.atomic.barrier.arrive.rtn.b64,ds.atomic.async.barrier.arrive.b64.rocdl.tanh,sin,cos,rcp,exp,exp2,log,sqrt,rsq.MFMA/WMMA Rework
ROCDL_Mfma_IntrOpwith explicitABType,CDType(no more variadic$args).ROCDL_Mfma_Scale_IntrOpfor scaled MFMA,ROCDL_Smfmac_IntrOpfor sparse MFMA.ROCDL_WMMA_IntrOp,ROCDL_WMMA_Opsel_IntrOp, etc.) with explicit operands/attrs.wmma.scale.f32.16x16x128.f8f6f4,wmma.scale16.f32.16x16x128.f8f6f4.Barrier API Changes
BarrierInitOp:$id→$memberCnt.ROCDL.barrierdeprecation notice; prefergpu.barrier.New Type
ROCDLGlobalBuffer:LLVM_PointerInAddressSpace<1>.AMDGPUToROCDL / GPUToROCDL
New Lowerings
SparseMFMAOpLoweringforamdgpu.sparse_mfma→ ROCDL smfmac (gfx942+).ScaledWMMAOpLoweringforamdgpu.scaled_wmma(gfx1250+).ScaledExtPackedMatrixOpLowering(gfx1250+).ds_barrier_*ops.make_dma_base_gather/scatteron gfx1250+.MemoryCounterWaitOp: Newtensorcounter andWaitTensorcntOp.API Changes
convertMFMAVectorOperand→packSmallFloatVectorOperand.castMFMAScaleOperand→castScaleOperand(supportsvector<8xi8>→ i64).gpu.barrierLoweringPat<GPU_BarrierOp, ROCDL_BarrierOp>replaced byGPUBarrierOpLoweringthat handlesmemfenceaddress spaces and chipset-specific behavior.Shared Infrastructure
amdgpu::populateCommonGPUTypeAndAttributeConversions()centralizes GPU→ROCDL type/attribute conversions (used by both NVVM and ROCDL paths).Buffer Descriptor (gfx1250+)
makeBufferRsrcfor new flag layout.getNumRecords: withboundsCheck=false, returns(1<<45)-1.Upstream Patches
1. Missing type converter for ConvertMemrefStore
Affected test:
Dialect/MHAL/emulate-narrow-type.mlirRoot cause: The upstream
memref::populateMemRefNarrowTypeEmulationPatternsnow registersConvertMemrefStoreviapatterns.insert<>()without the type converter, while all other patterns usepatterns.add<>(typeConverter, ...). This means theConvertMemrefStoreconversion pattern does not have a type converter associated, so operand adaptation may not work correctly.The error is:
failed to legalize operation 'memref.store' that was explicitly marked illegalformemref.storeonmemref<8xi4>.Fix: Fix is to add the
ConvertMemrefStorepattern with the type converter associated.Diff Files
diff -rup llvm-project/llvm rocMLIR/external/llvm-project/llvmllvm-diff.txt
diff -rup llvm-project/mlir rocMLIR/external/llvm-project/mlirmlir-diff.txt
Test Plan
External Tests
check-llvmcheck-mlirCI
Submission Checklist