PR for tstellar/llvm-project#856 #857

tstellar · 2022-03-04T06:53:47Z

resolves #856

The first change is the clarification of rewrite-based semantics, and the fact that when doing the rewrite, all of the instructions involved need to have the rewrite. This is not a change in semantics: there is wide agreement that this behavior is true for most flags. But it is necessary to clarify this, and also clarify that there is a fundamental difference between a flag like `nnan` and a flag like `contract`. Note that several InstCombine transforms do not correctly check this behavior at the moment. The second change is a specific clarification of the rewrites performed by arcp. These rewrites capture what is necessary to enable the transformations that currently require just arcp, none of which are using the flag incorrectly right now.

The codegen for non-inlined version is "quite ugly" as it emits some checks to make sure the initialization routine is properly executed because the compiler does not see how the TLS object is initialized. This leads to some `callq 0x0` junk in the final outputs. For codegen details, see https://godbolt.org/z/rb5qYj3vY.

This is convenient when operating on vectors in a bulk, bit-manipulating fashion. I plan to use this in a future change.

@jasonmolenda

llvm#100443) In llvm#98403 I enabled the SBSaveCoreOptions object, which allows users via the scripting API to define what they want saved into their core file. As the first option I've added a threadlist, so users can scan and identify which threads and corresponding stacks they want to save. In order to support this, I had to add a new method to `Process.h` on how we identify which threads are to be saved, and I had to change the book keeping in minidump to ensure we don't double save the stacks. Important to @jasonmolenda I also changed the MachO coredump to accept these new APIs.

Prevously, if INT_MIN was passed as a wildcard width to a printf conversion the parser would attempt to negate it to get the positive width (and set the left justify flag), but it would underflow and the width would be treated as 0. This patch corrects the issue by instead treating a width of INT_MIN as identical to -INT_MAX. Also includes docs changes to explain this behavior and adding b to the list of int conversions.

This patch set the `allocator_idx` attribute for allocatable descriptor that have specific CUDA attribute.

…header." (llvm#101752) Reverts llvm#100837 The test Modules/builtin-vararg.c is failing on AArch64 build bots: - https://lab.llvm.org/buildbot/#/builders/190/builds/3117 - https://lab.llvm.org/buildbot/#/builders/65/builds/2302 - https://lab.llvm.org/buildbot/#/builders/154/builds/2288 Revert to get the bots back to green.

…m#100952)

…angExpressionParser constructor (llvm#101669) We plan to eventually use the Clang driver to initialize the `CompilerInstance`. This should make refactorings of this code more straightforward. **Changes**: * Introduced `SetupLangOpts` and `SetupImportStdModuleLangOpts` * Called them from `ClangExpressionParser::ClangExpressionParser`

This reverts commit 92fbc96. The patch also affected ARM and caused an assertion failure during CurDAG->Legalize (llvm#100723 (comment)).

Due to the slightly non-standard interface that returns a pointer rather than just an integer, the __syscall() utility cannot be used on all architectures. This change is required for example to use the sanitizers on Arm Morello. Pull Request: llvm#84438

This syncs the list of supported sanitizers with the matching code in clang (`FreeBSD::getSupportedSanitizers()`), Reviewed By: emaste, DimitryAndric Pull Request: llvm#84280

llvm.module.flag => llvm.module.flags

Fixes llvm#101285 (comment).

This PR has following changes/fixes to XeGPU definition: - Fix type print format for atomic_rmw - removed 2D support for MaskType - Update LoadNd definition - Add 1D TensorDesc support - Replaced vnni_axis attribute with packed attribute - Update DPAS op definition, limiting A to 2D vector, and B to either 2D/3D vector.

This adjusts the DXILOpBuilder API in a couple of ways: 1. Remove the need to call `getOverloadTy` before creating Ops 2. Introduce `tryCreateOp` to parallel `createOp` but propagate errors 3. Introduce specialized createOp methods for each DXIL Op This will simplify usage of the builder in upcoming changes, and also allows us to propagate errors via DiagnosticInfo rather than using fatal errors. Pull Request: llvm#101250

…header. (llvm#100837)" Fix the false warning > incompatible pointer types passing 'va_list' (aka '__builtin_va_list') to parameter of type 'struct __va_list_tag *' [-Wincompatible-pointer-types] The warning is wrong because both in the function declaration and at the call site we are using `va_list`. When we call `ASTContext::getBuiltinVaListDecl` at a specific moment, we end up re-entering this function which causes creating 2 instances of `BuiltinVaListDecl` and 2 instances of `VaListTagDecl` but the stored instances are unrelated to each other because of the call sequence like getBuiltinVaListDecl CreateX86_64ABIBuiltinVaListDecl VaListTagDecl = TagA indirectly call getBuiltinVaListDecl CreateX86_64ABIBuiltinVaListDecl VaListTagDecl = TagB BuiltinVaListDecl = ListB BuiltinVaListDecl = ListA Now we have `BuiltinVaListDecl == ListA` and `VaListTagDecl == TagB`. For x86_64 '__builtin_va_list' and 'struct __va_list_tag *' are compatible because '__builtin_va_list' == '__va_list_tag[1]'. But because we have unrelated decls for VaListDecl and VaListTagDecl the types are considered incompatible as we are comparing type pointers. Fix the error by creating `BuiltinVaListDecl` before `ASTReader::InitializeSema`, so that during `ASTContext::getBuiltinVaListDecl` ASTReader doesn't try to de-serialize '__builtin_va_list' and to call `ASTContext::getBuiltinVaListDecl` again. Reland with the requirement to have x86 target to avoid errors like > error: unable to create target: 'No available targets are compatible with triple "x86_64-apple-darwin"' rdar://130947515

…tor (llvm#100952)" This reverts commit ab819d7.

If there is a type T which can be converted to both float and double etc but itself is not specialized for __numeric_type, and it is called for math functions eg. fma, it will cause ambiguity with test function of __numeric_type. Since test is not template, this error is not bypassed by SFINAE. This is a design flaw of __numeric_type. This patch fixes clang wrapper header to use SFINAE to avoid such ambiguity. Fixes: SWDEV-461604 Fixes: llvm#101239

tstellar mentioned this pull request Mar 4, 2022

Test label remove #856

Open

tstellar force-pushed the main branch from fcab177 to 44aa611 Compare March 11, 2022 04:17

tstellar force-pushed the release/14.x branch 5 times, most recently from 2f5ccde to 842e936 Compare April 27, 2022 17:18

tstellar force-pushed the main branch 2 times, most recently from 2fbfaf9 to ccb58e1 Compare May 17, 2022 05:23

tstellar force-pushed the main branch from ccb58e1 to 27ffa35 Compare May 25, 2022 20:46

tstellar force-pushed the main branch 10 times, most recently from bc9b8fb to 3d8bbc8 Compare July 8, 2022 03:35

tstellar force-pushed the release/14.x branch from 842e936 to f28c006 Compare July 8, 2022 04:25

tstellar force-pushed the main branch 2 times, most recently from d3382b7 to 4b5ec5b Compare July 8, 2022 07:50

tstellar force-pushed the main branch from 4b5ec5b to 1dbf92e Compare July 15, 2022 17:44

tstellar force-pushed the main branch 4 times, most recently from d845a5e to 250ae95 Compare July 28, 2022 17:51

tstellar force-pushed the main branch from c19cd4c to 33df4d3 Compare August 9, 2022 21:41

tstellar force-pushed the main branch from 8861308 to d6b1b06 Compare February 14, 2023 21:48

jcranmer-intel and others added 22 commits August 2, 2024 15:39

ADT: Add non-const overload of PackedVector::raw_bits() (llvm#101742)

5edb493

This is convenient when operating on vectors in a bulk, bit-manipulating fashion. I plan to use this in a future change.

[flang][cuda] Set the allocator on fir.embox operation (llvm#101722)

bbdb1e4

This patch set the `allocator_idx` attribute for allocatable descriptor that have specific CUDA attribute.

[NFC][asan] Report DynInitUnPoison (llvm#101586)

72a514f

[Attributor][AMDGPU] Enable AAIndirectCallInfo for AMDAttributor (llv…

ab819d7

…m#100952)

Revert "[AMDGPU] Always lower s/udiv64 by constant to MUL (llvm#100723)"

0b92e70

This reverts commit 92fbc96. The patch also affected ARM and caused an assertion failure during CurDAG->Legalize (llvm#100723 (comment)).

[compiler-rt] Update AllSupportedArchDefs.cmake for FreeBSD

1ae837a

This syncs the list of supported sanitizers with the matching code in clang (`FreeBSD::getSupportedSanitizers()`), Reviewed By: emaste, DimitryAndric Pull Request: llvm#84280

[llvm][docs] Correct named metadata name in example code block (NFC)

35a509d

llvm.module.flag => llvm.module.flags

[libc][math][c23] Fix getpayloadf128 smoke test on RV32 (llvm#101720)

e96687a

Fixes llvm#101285 (comment).

Revert "[Attributor][AMDGPU] Enable AAIndirectCallInfo for AMDAttribu…

874cd10

…tor (llvm#100952)" This reverts commit ab819d7.

[NFC][ModuleUtils] Rename test function (llvm#101750)

88ef76c

Debug

216c611

tstellar force-pushed the main branch from d8e4904 to 216c611 Compare August 2, 2024 22:42

tstellar added 6 commits August 2, 2024 16:05

Setup python

f197921

Setup python

96427a2

Setup python

ee7f633

test

e658999

Fix

dc5c734

Fix

7075172

tstellar closed this Aug 31, 2024

This pull request was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PR for tstellar/llvm-project#856 #857

PR for tstellar/llvm-project#856 #857

tstellar commented Mar 4, 2022

PR for tstellar/llvm-project#856 #857

PR for tstellar/llvm-project#856 #857

Conversation

tstellar commented Mar 4, 2022