llvm-project

Commit Graph

Author	SHA1	Message	Date
David Stuttard	7940888c59	[AMDGPU] Intrinsic to expose s_wait_event for export ready Differential Revision: https://reviews.llvm.org/D138216	2022-11-28 11:26:15 +00:00
Bjorn Pettersson	076cda0aaa	[clang][CodeGen] Switch tests to use opt -passes	2022-11-28 12:12:49 +01:00
Ayke van Laethem	131cddcba2	[AVR] Fix broken bitcast for aliases in non-zero address space This was triggered by some code in picolibc. The minimal version looks like this: double infinity(void) { return 5; } extern long double infinityl() __attribute__((__alias__("infinity"))); These two declarations have a different type (not because of the 'long double', which is also 'double' in IR, but because infinityl has variadic parameters). This led to a crash in the bitcast which assumed address space 0. Differential Revision: https://reviews.llvm.org/D138681	2022-11-27 15:27:42 +01:00
Roman Lebedev	25f01d593c	Revert "[SROA] `isVectorPromotionViable()`: memory intrinsics operate on vectors of bytes (take 2)" TableGen is still getting miscompiled on PPC buildbots. Sent a mail with request for help. This reverts commit `3c4d2a0396`.	2022-11-27 00:00:06 +03:00
Roman Lebedev	3c4d2a0396	[SROA] `isVectorPromotionViable()`: memory intrinsics operate on vectors of bytes (take 2) This is a recommit of `cf624b23bc`, which was reverted in `5cfc22cafe`, because the cut-off on the number of vector elements was not low enough, and it triggered both SDAG SDNode operand number assertions, and caused compile time explosions in some cases. Let's try with something really REALLY conservative first, just to get somewhere, and try to bump it (to 64/128) later. FIXME: should this respect TTI reg width * num vec regs? Original commit message: Now, there's a big caveat here - these bytes are abstract bytes, not the i8 we have in LLVM, so strictly speaking this is not exactly legal, see e.g. https://github.com/AliveToolkit/alive2/issues/860 ^ the "bytes" "could" have been a pointer, and loading it as an integer inserts an implicit ptrtoint. But at the same time, InstCombine's `InstCombinerImpl::SimplifyAnyMemTransfer()` would expand a memtransfer of 1/2/4/8 bytes into integer-typed load+store, so this isn't exactly a new problem. Note that in memory, poison is byte-wise, so we really can't widen elements, but SROA seems to be inconsistent here. Fixes #59116.	2022-11-26 23:19:15 +03:00
Tomas Matheson	a6aaa969f7	[AArch64] Assembly support for FEAT_LRCPC3 This patch implements assembly support for the 2022 A-Profile Architecture extension FEAT_LRCPC3. FEAT_LRCPC3 is AArch64 only and introduces new variants of load/store instructions with release consistency ordering. Specs for individual instructions can be found here: https://developer.arm.com/documentation/ddi0602/2022-09/Base-Instructions/ This feature is optionally available from v8.2a and therefore not enabled by default. Contributors: Lucas Prates Sam Elliot Son Tuan Vu Tomas Matheson Differential Revision: https://reviews.llvm.org/D138579	2022-11-25 18:59:07 +00:00
Alex Richardson	54ad4d2dd1	Drop redundant pipe to opt -instnamer in clang tests This used to be required, but the difference between asserts/!asserts builds no longer exists for %clang_cc1 (only for %clang), so they pass just fine without this flag.	2022-11-25 11:34:55 +00:00
Balazs Benics	097ce76165	[analyzer] Deprecate FAM analyzer-config, recommend -fstrict-flex-arrays instead By default, clang assumes that all trailing array objects could be a FAM. So, an array of undefined size, size 0, size 1, or even size 42 is considered as FAMs for optimizations at least. One needs to override the default behavior by supplying the `-fstrict-flex-arrays=<N>` flag, with `N > 0` value to reduce the set of FAM candidates. Value `3` is the most restrictive and `0` is the most permissive on this scale. 0: all trailing arrays are FAMs 1: only incomplete, zero and one-element arrays are FAMs 2: only incomplete, zero-element arrays are FAMs 3: only incomplete arrays are FAMs If the user is happy with consdering single-element arrays as FAMs, they just need to remove the `consider-single-element-arrays-as-flexible-array-members` from the command line. Otherwise, if they don't want to recognize such cases as FAMs, they should specify `-fstrict-flex-arrays` anyway, which will be picked up by CSA. Any use of the deprecated analyzer-config value will trigger a warning explaining what to use instead. The `-analyzer-config-help` is updated accordingly. Depends on D138657 Reviewed By: xazax.hun Differential Revision: https://reviews.llvm.org/D138659	2022-11-25 10:24:56 +01:00
Balazs Benics	3648175839	[analyzer] Consider single-elem arrays as FAMs by default According to my measurement in https://reviews.llvm.org/D108230#3933232, it seems like there is no drawback to enabling this analyzer-config by default. Actually, enabling this by default would make it consistent with the codegen of clang, which according to `-fstrict-flex-arrays`, assumes by default that all trailing arrays could be FAMs, let them be of size undefined, zero, one, or anything else. Speaking of `-fstrict-flex-arrays`, in the next patch I'll deprecate the analyzer-config FAM option in favor of that flag. That way, CSA will always be in sync with what the codegen will think of FAMs. So, if a new codebase sets `-fstrict-flex-arrays` to some value above 0, CSA will also make sure that only arrays of the right size will be considered as FAMs. Reviewed By: xazax.hun Differential Revision: https://reviews.llvm.org/D138657	2022-11-25 10:24:56 +01:00
Volodymyr Sapsai	0314ba3acb	[modules] Fix marking `ObjCMethodDecl::isOverriding` when there are no overrides. Incorrect `isOverriding` flag triggers the assertion `!Overridden.empty()` in `ObjCMethodDecl::getOverriddenMethods` when a method is marked as overriding but we cannot find any overrides. When a method is declared in a category and defined in implementation, we don't treat it as an override because it is the same method with a separate declaration and a definition. But with modules we can find a method declaration both in a modular category and a non-modular category with different memory addresses. Thus we erroneously conclude the method is overriding. Fix by comparing canonical declarations that are the same for equal entities coming from different modules. rdar://92845511 Differential Revision: https://reviews.llvm.org/D138630	2022-11-24 14:26:02 -08:00
Nathan James	15e76eed0c	[clang] Add [is\|set]Nested methods to NamespaceDecl Adds support for NamespaceDecl to inform if its part of a nested namespace. This flag only corresponds to the inner namespaces in a nested namespace declaration. In this example: namespace <X>::<Y>::<Z> {} Only <Y> and <Z> will be classified as nested. This flag isn't meant for assisting in building the AST, more for static analysis and refactorings. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D90568	2022-11-24 12:44:35 +00:00
Ben Dunbobbin	437ccf5af9	[windows-itanium] Propagate DLL storage class to Initialisation Guard Variables Initialisation Guard Variables should take their DLL storage class from the guarded variable. Otherwise, there will be a link error if the compiler inlines a reference to the guard variable into another module but that guard variable is not exported from the defining module. This is required for platforms such as PlayStation and windows-itanium, that are aiming for source compatibility with MSVC w.r.t. dllimport/export annotations, given Clang's existing design which allows for inlining of a dllimport function as long as all the variables/functions referenced are also marked dllimport. A similar change exists for the MSVC ABI: https://reviews.llvm.org/D4136. I have added a run test for windows-itanium for this issue to the build recipe: https://reviews.llvm.org/D88124. Differential Revision: https://reviews.llvm.org/D138463	2022-11-24 00:23:17 +00:00
WuXinlong	0dbc52a0ab	Add MC support of RISCV Zcd Extension This patch add the instructions of Zcd extension. Zcd is a subset of C Ext which include the double-precision floating-point instructions (c.fld, c.fldsp, c.fsd, c.fsdsp). Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D134177	2022-11-24 05:48:06 +08:00
Fangrui Song	fa7bc386ec	[modules] Support zstd in .pcm file Extend SM_SLOC_BUFFER_BLOB_COMPRESSED to allow zstd, which is much faster (compression/decompression) than zlib with a similar compression ratio. An alternative is to add a value beside SM_SLOC_BUFFER_BLOB_COMPRESSED, but reusing SM_SLOC_BUFFER_BLOB_COMPRESSED slightly simplifies the implementation and leads to better diagnostics when a slightly older Clang consumes zstd compressed blob. Compressing AST takes a small portion of WriteAST, so we can pick a higher compression level. Compiling a relatively large .pcm (absl endian) with -fmodules-embed-all-files, zstd level 9 has comparable performance with zlib-chromium level 6 (default), but provides smaller output (5809156 => 5796016). Higher zstd levels will make "Compress AST" notably slower and do not provide significant more size saving. ``` 2.219345 Total ExecuteCompiler 0.746799 Total Frontend 0.736862 Total Source 0.339434 Total ReadAST 0.165452 Total WriteAST 0.043045 Total Compress AST 0.008236 Total ParseClass 0.00633 Total InstantiateClass 0.001887 Total isPotentialConstantExpr 0.001808 Total InstantiateFunction 0.001535 Total EvaluateForOverflow 0.000986 Total EvaluateAsRValue 0.000536 Total EvaluateAsBooleanCondition 0.000308 Total EvaluateAsConstantExpr 0.000156 Total EvaluateAsInt 3.4e-05 Total EvaluateKnownConstInt 8e-06 Total EvaluateAsInitializer 0 Total PerformPendingInstantiations ``` Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D137885	2022-11-23 11:27:49 -08:00
Alexey Kreshchuk	2cea4c2395	Do not suggest taking the address of a const pointer to get void* It's more likely the user needs a const cast, but probably not sure enough that we should suggest that either - so err on the side of caution and offer no suggestion. Fixes pr58958 Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D138426	2022-11-23 18:43:06 +00:00
Hans Wennborg	907baeec49	Revert "Make -fsanitize=scudo use scudo_standalone. Delete check-scudo." It broke the build, see comments on code review. > Leaves the implementation and tests files in-place for right now, but > deletes the ability to build the old sanitizer-common based scudo. This > has been on life-support for a long time, and the newer scudo_standalone > is much better supported and maintained. > > Also patches up some GWP-ASan wording, primarily related to the fact > that -fsanitize=scudo now is scudo_standalone, and therefore the way to > reference the GWP-ASan options through the environment variable has > changed. > > Future follow-up patches will delete the original scudo, and migrate all > its tests over to be part of the scudo_standalone test suite. > > Reviewed By: vitalybuka > > Differential Revision: https://reviews.llvm.org/D138157 This reverts commit `ab1a5991fe`.	2022-11-23 16:07:07 +01:00
Balazs Benics	93b98eb399	[analyzer] getBinding should auto-detect type only if it was not given Casting a pointer to a suitably large integral type by reinterpret-cast should result in the same value as by using the `__builtin_bit_cast()`. The compiler exploits this: https://godbolt.org/z/zMP3sG683 However, the analyzer does not bind the same symbolic value to these expressions, resulting in weird situations, such as failing equality checks and even results in crashes: https://godbolt.org/z/oeMP7cj8q Previously, in the `RegionStoreManager::getBinding()` even if `T` was non-null, we replaced it with `TVR->getValueType()` in case the `MR` was `TypedValueRegion`. It doesn't make much sense to auto-detect the type if the type is already given. By not doing the auto-detection, we would just do the right thing and perform the load by that type. This means that we will cast the value to that type. So, in this patch, I'm proposing to do auto-detection only if the type was null. Here is a snippet of code, annotated by the previous and new dump values. `LocAsInteger` should wrap the `SymRegion`, since we want to load the address as if it was an integer. In none of the following cases should type auto-detection be triggered, hence we should eventually reach an `evalCast()` to lazily cast the loaded value into that type. ```lang=C++ void LValueToRValueBitCast_dumps(void p, char (array)[8]) { clang_analyzer_dump(p); // remained: &SymRegion{reg_$0<void * p>} clang_analyzer_dump(array); // remained: {{&SymRegion{reg_$1<char ()[8] array>} clang_analyzer_dump((unsigned long)p); // remained: {{&SymRegion{reg_$0<void p>} [as 64 bit integer]}} clang_analyzer_dump(__builtin_bit_cast(unsigned long, p)); <--------- change #1 // previously: {{&SymRegion{reg_$0<void * p>}}} // now: {{&SymRegion{reg_$0<void * p>} [as 64 bit integer]}} clang_analyzer_dump((unsigned long)array); // remained: {{&SymRegion{reg_$1<char ()[8] array>} [as 64 bit integer]}} clang_analyzer_dump(__builtin_bit_cast(unsigned long, array)); <--------- change #2 // previously: {{&SymRegion{reg_$1<char ()[8] array>}}} // now: {{&SymRegion{reg_$1<char (*)[8] array>} [as 64 bit integer]}} } ``` Reviewed By: xazax.hun Differential Revision: https://reviews.llvm.org/D136603	2022-11-23 15:52:11 +01:00
Benjamin Kramer	5cfc22cafe	Revert "[SROA] `isVectorPromotionViable()`: memory intrinsics operate on vectors of bytes" This reverts commit `cf624b23bc`. It triggers crashes in clang, see the comments on github on the original change.	2022-11-23 13:11:16 +01:00
Stefan Gränitz	63d65d3764	Reland "[CGObjC] Add run line for release mode in test arc-exceptions-seh.mm (NFC)" This reverts commit `a37807ac8a`. Differential Revision: https://reviews.llvm.org/D137942	2022-11-23 11:35:22 +01:00
WuXinlong	16bf359a3f	Add MC support of RISCV Zcf Extension This patch add the instructions of Zcf extension. Zcf is a subset of C Ext which include the single-precision floating-point instructions. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D134176	2022-11-23 15:09:02 +08:00
Fangrui Song	987b49395c	[PPC] Undefine __ppc64__ to match GCC GCC only defines `__ppc64__` for darwin while the darwin support has been removed from llvm-project. The existence of `__ppc64__` makes some software think we are compiling for big-endian PowerPC Mac; also it lures users to write code which is not portable to GCC. It is straightforward if a distro wants to keep the macro: add `-D__ppc64__=1` to a Clang configuration file. Reviewed By: thesamesam, nemanjai Differential Revision: https://reviews.llvm.org/D137511	2022-11-22 17:01:39 -08:00
Ayke van Laethem	a8efcb96e6	[AVR][Clang] Implement __AVR_HAVE_*__ macros These macros are defined in avr-gcc and are useful when working with assembly. For example, startup code needs to copy the contents of .data from flash to RAM, but should use elpm (instead of lpm) on devices with more than 64kB flash. Without __AVR_HAVE_ELPM__, there is no way to know whether the elpm instruction is supported. This partially fixes https://github.com/llvm/llvm-project/issues/56157. Differential Revision: https://reviews.llvm.org/D137572	2022-11-23 01:21:09 +01:00
Roman Lebedev	cf624b23bc	[SROA] `isVectorPromotionViable()`: memory intrinsics operate on vectors of bytes Now, there's a big caveat here - these bytes are abstract bytes, not the i8 we have in LLVM, so strictly speaking this is not exactly legal, see e.g. https://github.com/AliveToolkit/alive2/issues/860 ^ the "bytes" "could" have been a pointer, and loading it as an integer inserts an implicit ptrtoint. But at the same time, InstCombine's `InstCombinerImpl::SimplifyAnyMemTransfer()` would expand a memtransfer of 1/2/4/8 bytes into integer-typed load+store, so this isn't exactly a new problem. Note that in memory, poison is byte-wise, so we really can't widen elements, but SROA seems to be inconsistent here. Fixes #59116.	2022-11-23 02:38:25 +03:00
Nancy Wang	b888cafcbc	XFAIL hidden-duplicates.m for AIX and zOS as this is failing on the build bots: https://lab.llvm.org/buildbot/#/builders/214/builds/4442/steps/6/logs/FAIL__Clang__hidden-duplicates_m and is being investigate under https://reviews.llvm.org/D130327.	2022-11-22 16:26:06 -05:00
Yingchi Long	2ec79afd89	[Sema] check InitListExpr format strings like {"foo"} Adds InitListExpr case in format string checks. e.g. int sprintf(char __restrict, const char __restrict, ...); int foo() { char data[100]; constexpr const char* fmt2{"%d"}; // no-warning sprintf(data, fmt2, 123); } Fixes: https://github.com/llvm/llvm-project/issues/58900 Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D137839	2022-11-23 04:58:19 +08:00
Mitch Phillips	ab1a5991fe	Make -fsanitize=scudo use scudo_standalone. Delete check-scudo. Leaves the implementation and tests files in-place for right now, but deletes the ability to build the old sanitizer-common based scudo. This has been on life-support for a long time, and the newer scudo_standalone is much better supported and maintained. Also patches up some GWP-ASan wording, primarily related to the fact that -fsanitize=scudo now is scudo_standalone, and therefore the way to reference the GWP-ASan options through the environment variable has changed. Future follow-up patches will delete the original scudo, and migrate all its tests over to be part of the scudo_standalone test suite. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D138157	2022-11-22 12:08:30 -08:00
Stefan Gränitz	a37807ac8a	Revert "[CGObjC] Add run line for release mode in test arc-exceptions-seh.mm (NFC)" This reverts commit `01023bfcd3`. The extended test now triggers undefined behavior: ``` /b/sanitizer-aarch64-linux-bootstrap-ubsan/build/llvm-project/llvm/lib/Transforms/ObjCARC/ObjCARCOpts.cpp:577:41: runtime error: load of value 180, which is not a valid value for type 'bool' #0 0xaaaae3333a30 in hasCFGChanged /b/sanitizer-aarch64-linux-bootstrap-ubsan/build/llvm-project/llvm/lib/Transforms/ObjCARC/ObjCARCOpts.cpp:577:41 #1 0xaaaae3333a30 in llvm::ObjCARCOptPass::run(llvm::Function&, llvm::AnalysisManager<llvm::Function>&) /b/sanitizer-aarch64-linux-bootstrap-ubsan/build/llvm-project/llvm/lib/Transforms/ObjCARC/ObjCARCOpts.cpp:2494:26 ... ```	2022-11-22 20:51:09 +01:00
Mariya Podchishchaeva	0ffcd243a4	[Driver][Test] Fix pic.c when CLANG_DEFAULT_PIE_ON_LINUX set to OFF	2022-11-22 13:38:18 -05:00
Sami Tolvanen	5a3d6ce956	[Clang][Driver] Add KCFI to SupportsCoverage Allow `-fsanitize=kcfi` to be enabled with `-fsanitize-coverage=` modes such as `trace-{pc,cmp}`. Link: https://github.com/ClangBuiltLinux/linux/issues/1743 Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D138458	2022-11-22 18:20:04 +00:00
Matt Arsenault	2edafe8393	clang/HIP: Add new header test for math IR gen The current header testing is pretty thin. This is in preparation for a series of patches to replace many builtin implementations. I did try to stress everything in this header, but skipped a few things. Mostly I didn't understand why we have various language version checks which skip defining some things. It doesn't seem right to have any of these if guards on __cplusplus, __HIPCC_RTC__, and __OPENMP_AMDGCN__.	2022-11-22 11:12:00 -05:00
Yaxun (Sam) Liu	056ebadf5c	[HIP] Fix lld failure when devie object is empty When -fgpu-rdc is used for linking relocatable objects, clang driver launches clang-offload-bundler to extract a device relocatable object from each input relocatable object file and passes the extracted files to lld. The input relocatable object file could either come from HIP program or C++ program. The relocatable object file from C++ program does not contain device relocatable objects, therefore clang-offload-bundler extracts an empty file and passes it to lld. lld treates empty file as linker script. When there is no object input file to lld, lld will emit error: target emulation unknown: -m or at least one .o file required This patch adds "elf64_amdgpu" to lld so that lld always know the target no matter whether there are object input files or not. Reviewed by: Artem Belevich, Fangrui Song Differential Revision: https://reviews.llvm.org/D138221	2022-11-22 10:38:42 -05:00
Ties Stuij	cb261e30fb	[AArch64][clang] implement 2022 General Data-Processing instructions This patch implements the 2022 Architecture General Data-Processing Instructions They include: Common Short Sequence Compression (CSSC) instructions - scalar comparison instructions SMAX, SMIN, UMAX, UMIN (32/64 bits) with or without immediate - ABS (absolute), CNT (count non-zero bits), CTZ (count trailing zeroes) - command-line options for CSSC Associated with these instructions in the documentation is the Range Prefetch Memory (RPRFM) instruction, which signals to the memory system that data memory accesses from a specified range of addresses are likely to occur in the near future. The instruction lies in hint space, and is made unconditional. Specs for the individual instructions can be found here: https://developer.arm.com/documentation/ddi0602/2022-09/Base-Instructions/ contributors to this patch: - Cullen Rhodes - Son Tuan Vu - Mark Murray - Tomas Matheson - Sam Elliott - Ties Stuij Reviewed By: lenary Differential Revision: https://reviews.llvm.org/D138488	2022-11-22 14:23:12 +00:00
Youling Tang	ac84798570	[scudo] Add loongarch64 support for scudo Enable scudo on LoongArch64 on both clang side and compiler-rt side. Reviewed By: SixWeining Differential Revision: https://reviews.llvm.org/D138350	2022-11-22 22:02:31 +08:00
Fahad Nayyar	cdfb65e5e7	[Clang][Sema] Added space after ',' in a warning This change fixes a typo in a warning message. rdar://79707705	2022-11-22 13:26:40 +00:00
Stefan Gränitz	01023bfcd3	[CGObjC] Add run line for release mode in test arc-exceptions-seh.mm (NFC) In release mode `arc-exceptions-seh.mm` fails. It needs `-enable-objc-arc-opts=false` to skip ObjC ARC optimizations. Reviewed By: triplef Differential Revision: https://reviews.llvm.org/D137942	2022-11-22 13:43:08 +01:00
Stefan Gränitz	9a9d636cae	[CGObjC] Open cleanup scope before SaveAndRestore CurrentFuncletPad and push CatchRetScope early Pushing the `CatchRetScope` early causes cleanups for catch parameters to be emitted in the basic block of the catch handler instead of the `catchret.dest` block. This is important because the latter is not part of the catchpad and this caused code truncations due to ARC PreISel intrinsics in WinEHPrepare. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D137939	2022-11-22 12:02:53 +01:00
KAWASHIMA Takahiro	3a95d7d098	[clang] Fix -fp-model={strict\|precise} to disable -fapprox-func `-fapprox-func` should be disabled by `-fp-model={strict\|precise}`, as well as other fast-math flags. See the last changes in `clang/test/Driver/fp-model.c`. Probably this route (`case options::OPT_ffp_model_EQ`) was forgot to update in D106191 and D114564. There is no appropriate reason not to disable the flag. This commit also updates other regression tests, which are not directly related to this bug, for consistency with other fast-math flags. Differential Revision: https://reviews.llvm.org/D138109	2022-11-22 13:04:26 +09:00
David Blaikie	9df8ba631d	pr59000: Clarify packed-non-pod warning that it's pod-for-the-purposes-of-layout	2022-11-22 00:02:09 +00:00
Saleem Abdulrasool	b78d5380da	parse: process GNU and standard attributes on top-level decls We would previously reject valid input where GNU attributes preceded the standard attributes on top-level declarations. A previous attribute handling change had begun rejecting this whilst GCC does honour this layout. In practice, this breaks use of `extern "C"` attributed functions which use both standard and GNU attributes as experienced by the Swift runtime. Objective-C deserves an honourable mention for requiring some additional special casing. Because attributes on declarations and definitions differ in semantics, we need to replicate some of the logic for detecting attributes to declarations to which they appertain cannot be attributed. This should match the existing case for the application of GNU attributes to interfaces, protocols, and implementations. Take the opportunity to split out the tooling tests into two cases: ones which process macros and ones which do not. Special thanks to Aaron Ballman for the many hints and extensive rubber ducking that was involved in identifying the various places where we accidentally dropped attributes. Differential Revision: https://reviews.llvm.org/D137979 Fixes: #58229 Reviewed By: aaron.ballman, arphaman	2022-11-21 22:34:50 +00:00
Thomas Lively	ae96b5bd2d	[WebAssembly] Update relaxed-simd instruction names Including builtin and intrinsic names. These should be the final names for the proposal. https://github.com/WebAssembly/relaxed-simd/blob/main/proposals/relaxed-simd/Overview.md Reviewed By: aheejin, maratyszcza Differential Revision: https://reviews.llvm.org/D138249	2022-11-21 12:40:15 -08:00
Nathan Sidwell	eff9d72b9b	[clang] NFC: Robustify sret test regex Replace old-style, brittle, grep with new-fangled FileCheck technology. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D137941	2022-11-21 14:20:47 -05:00
John Brawn	9e3264ab20	[FPEnv] Enable strict fp for AArch64 in clang The AArch64 target now has the necessary support for strict fp, so enable it in clang. Differential Revision: https://reviews.llvm.org/D138143	2022-11-21 16:02:54 +00:00
Nico Weber	281a5c7ef1	[llvm,polly,clang] Stop setting config.enable_shared in most places Clang's lit.cfg.py reads this to add an "enable-shared" feature that three of clang's lit tests use. Nothing else reads enable_shared, so remove it from most lit.site.cfg.py.in files. Differential Revision: https://reviews.llvm.org/D138301	2022-11-21 08:54:14 -05:00
Christopher Di Bella	7994e5144a	Revert "Revert "[clang-tblgen][NFC] renames Diagnostic.Text to Diagnostic.Summary"" This reverts commit `196edb9f3f`.	2022-11-21 04:55:19 +00:00
Christopher Di Bella	196edb9f3f	Revert "[clang-tblgen][NFC] renames Diagnostic.Text to Diagnostic.Summary" This reverts commit `eb3f7880a2`.	2022-11-21 04:35:41 +00:00
Christopher Di Bella	eb3f7880a2	[clang-tblgen][NFC] renames Diagnostic.Text to Diagnostic.Summary The [Improving Clang's Diagnostics RFC][1] identifies eight broad fields for Clang to surface, two of which are text-based. Since the current diagnostics more closely map to the diagnostic summary (or headline), we should rename them to ensure that there's no confusion when Diagnostic.Reason is introduced in the near future. [1]: https://discourse.llvm.org/t/rfc-improving-clang-s-diagnostics/62584 Reviewed By: aaron.ballman, erichkeane Differential Revision: https://reviews.llvm.org/D135820	2022-11-21 03:44:37 +00:00
gonglingqin	c2ec455f18	[LoongArch] Add intrinsics for ibar, break and syscall Diagnostics for intrinsic input parameters have also been added. Differential Revision: https://reviews.llvm.org/D138094	2022-11-21 09:31:26 +08:00
Alex Richardson	0745b0c035	Fix incorrect cast in VisitSYCLUniqueStableNameExpr Clang language-level address spaces and LLVM pointer address spaces are not the same thing (even though they will both have a numeric value of zero in many cases). LangAS is a enum class to avoid implicit conversions, but `eba69b59d1` avoided the compiler error by adding a `static_cast<>`. While touching this code, simplify it by using CreatePointerBitCastOrAddrSpaceCast() which is already a no-op if the types match. This changes the code generation for spir64 to place the globals in the sycl_global addreds space, which maps to `addrspace(1)`. Reviewed By: bader Differential Revision: https://reviews.llvm.org/D138284	2022-11-19 11:43:17 +00:00
yronglin	80f444646c	[CodeGen][ARM] Fix ARMABIInfo::EmitVAAarg crash with empty record type variadic arg Fix ARMABIInfo::EmitVAAarg crash with empty record type variadic arg Open issue: https://github.com/llvm/llvm-project/issues/58794 Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D138137	2022-11-19 15:14:10 +08:00
Jennifer Yu	9d90cf2fca	[OPENMP5.1] Initial support for message clause.	2022-11-18 17:59:23 -08:00
Fazlay Rabbi	56c1660170	[OpenMP] Initial parsing/sema for 'strict' modifier with 'num_tasks' clause This patch gives basic parsing and semantic analysis support for 'strict' modifier with 'num_tasks' clause of 'taskloop' construct introduced in OpenMP 5.1 (section 2.12.2) Differential Revision: https://reviews.llvm.org/D138328	2022-11-18 16:26:47 -08:00
Doru Bercea	9e595e911e	[Clang][OpenMP] Add support for default to/from map types on target enter/exit data	2022-11-18 16:12:35 -06:00
Aaron Ballman	7fc57d7c97	Add more tests for C DRs and update the status page	2022-11-18 14:15:41 -05:00
Krzysztof Parzyszek	f01b68e4be	[Hexagon] Add checks for immediate arguments for remaining builtins Checks for builtins for the following instructions were aded: V6_v6mpyhubs10 V6_v6mpyhubs10_vxx V6_v6mpyvubs10 V6_v6mpyvubs10_vxx V6_vlutvvbi V6_vlutvvb_oracci V6_vlutvwhi V6_vlutvwh_oracci	2022-11-18 11:09:41 -08:00
Krzysztof Parzyszek	7c476697e2	[Hexagon] Add clang flags for v71, v71t, v73	2022-11-18 09:39:47 -08:00
Xing Xue	fa7477eb87	[Clang][CodeGen][AIX] Map __builtin_frexpl, __builtin_ldexpl, and __builtin_modfl to 'double' version lib calls in 64-bit 'long double' mode Summary: AIX library functions frexpl(), ldexpl(), and modfl() are for 128-bit IBM long double, i.e. __ibm128. Other *l() functions, e.g., acosl(), are for 64-bit long double. The AIX Clang compiler currently maps builtin functions __builtin_frexpl(), __builtin_ldexpl(), and __builtin_modfl() to frexpl(), ldexpl(), and modfl() in 64-bit long double mode which results in seg-faults or incorrect return values. This patch changes to map __builtin_frexpl(), __builtin_ldexpl(), and __builtin_modfl() to double version lib functions frexp(), ldexp() and modf() in 64-bit long double mode. Reviewed by: hubert.reinterpretcast, daltenty Differential Revision: https://reviews.llvm.org/D137986	2022-11-18 11:36:56 -05:00
Matt Jacobson	326393ae65	[Driver] exclude recently added tests from Windows	2022-11-18 05:30:42 -05:00
Alexander Shaposhnikov	f102fe7304	Revert "Revert "[opt][clang] Enable using -module-summary/-flto=thin with -S/-emit-llvm"" This reverts commit `7f608a2497` and removes the dependency of Object on IRPrinter.	2022-11-18 08:58:31 +00:00
Mikhail Goncharov	7f608a2497	Revert "[opt][clang] Enable using -module-summary/-flto=thin with -S/-emit-llvm" This reverts commit `34ab474348`. as it has introduced circular dependency lib - analysis	2022-11-18 09:25:45 +01:00
Matt Jacobson	ba7a1d9e4a	[Driver] move FreeBSD header search path management to the driver This matches OpenBSD, and it supports Swift's use of clang for its C interop functionality. Recent changes to Swift use AddClangSystemIncludeArgs() to inspect the cc1 args; this doesn't work for platforms where cc1 adds standard include paths implicitly. See: <`cf3354222d`> Also clean up InitHeaderSearch, making it clearer which targets manage header search paths in the driver. Differential Revision: https://reviews.llvm.org/D138183	2022-11-18 02:29:49 -05:00
Alexander Shaposhnikov	34ab474348	[opt][clang] Enable using -module-summary/-flto=thin with -S/-emit-llvm Enable using -module-summary with -S (similarly to what currently can be achieved with opt <input> -o - \| llvm-dis). This is a recommit of `ef9e62469`. Test plan: ninja check-all Differential revision: https://reviews.llvm.org/D137768	2022-11-18 05:04:07 +00:00
Fazlay Rabbi	ab9eac762c	[OpenMP] Initial parsing/sema for 'strict' modifier with 'grainsize' clause This patch gives basic parsing and semantic analysis support for 'strict' modifier with 'grainsize' clause of 'taskloop' construct introduced in OpenMP 5.1 (section 2.12.2) Differential Revision: https://reviews.llvm.org/D138217	2022-11-17 20:59:07 -08:00
Chuanqi Xu	d584468581	[C++20] [Modules] Don't emit macro definitions with named module It is meaningless to emit macro definitions for named modules. With some small experiments, the size of the module for the named modules reduced 2%~4% after this patch.	2022-11-18 11:11:17 +08:00
Chuanqi Xu	4a7be42d92	[C++20] [Modules] Remove unmaintained Header Module Currently there is a -emit-header-module mode, which can combine several headers together as a module interface. However, this breaks our assumption (for standard c++ modules) about module interface. The module interface should come from a module interface unit. And if it is a header, it should be a header unit. And currently we have no ideas to combine several headers together. So I think this mode is an experimental one and it is not maintained and it is not used. So it will be better to remove them. Reviewed By: Bigcheese, dblaikie, bruno Differential Revision: https://reviews.llvm.org/D137609	2022-11-18 10:39:33 +08:00
Qiu Chaofan	cab9c02bd9	[Clang] Fix behavior of -ffp-model option when overriden -ffp-model=strict -ffp-model=fast will still enable strict exception handling behavior, therefore clang still emits constrained FP operations in IR. -ffp-model=fast -ffp-model=strict emits two warnings: one for strict overriding fast, the other for strict overriding strict, which is confusing. Reviewed By: zahiraam Differential Revision: https://reviews.llvm.org/D137618	2022-11-18 10:34:41 +08:00
Volodymyr Sapsai	a65d5309d5	[ODRHash] Detect duplicate `ObjCProtocolDecl` ODR mismatches during parsing. When during parsing we encountered a duplicate `ObjCProtocolDecl`, we were always emitting an error. With this change we accept * when a previous `ObjCProtocolDecl` is in a hidden [sub]module; * parsed `ObjCProtocolDecl` is the same as the previous one. And in case of mismatches we provide more detailed error messages. rdar://93069080 Differential Revision: https://reviews.llvm.org/D130327	2022-11-17 18:31:32 -08:00
Volodymyr Sapsai	dcb71b5e1d	[ODRHash] Hash `ObjCPropertyDecl` and diagnose discovered mismatches. Differential Revision: https://reviews.llvm.org/D130326	2022-11-17 17:22:03 -08:00
Brad Smith	6536a67338	[Linux] Revert `1e56821bac` The glibc issue mentioned in #47994 has been fixed upstream.	2022-11-17 19:48:01 -05:00
David Blaikie	a72d8d7041	Update lambda mangling test to C++17 add some side effects so some (I guess guaranteed?) constant folding doesn't happen, keeping the test testing the things it was testing.. The test passes with 14, 17, and 20 - so let's just leave the version off so it might be able to be updated/used in C++20 when the default changes to C++20 in the future.	2022-11-18 00:24:40 +00:00
Jennifer Yu	1e054e6b52	[OPENMP5.1] Initial support for severity clause Differential Revision:https://reviews.llvm.org/D138227	2022-11-17 16:05:02 -08:00
Doru Bercea	98bfd7f976	Fix declare target implementation to support enter.	2022-11-17 17:35:53 -06:00
Craig Topper	c9320bc871	[X86] Use correctly sized floating point literals in *zero_ps/pd. This avoids depending on int->float or double->float conversion. Improving codegen with #pragma STDC FENV_ACCESS ON. Really we should improve constant folding somewhere, but this was a cheap and easy improvement. Fixes PR59052.	2022-11-17 14:28:52 -08:00
Vaibhav Yenamandra	7b6fe711b2	Refactor StaticAnalyzer to use `clang::SarifDocumentWriter` Refactor StaticAnalyzer to use clang::SarifDocumentWriter for serializing sarif diagnostics. Uses clang::SarifDocumentWriter to generate SARIF output in the StaticAnalyzer. Various bugfixes are also made to clang::SarifDocumentWriter. Summary of changes: clang/lib/Basic/Sarif.cpp: * Fix bug in adjustColumnPos introduced from prev move, it now uses FullSourceLoc::getDecomposedExpansionLoc which provides the correct location (in the presence of macros) instead of FullSourceLoc::getDecomposedLoc. * Fix createTextRegion so that it handles caret ranges correctly, this should bring it to parity with the previous implementation. clang/test/Analysis/diagnostics/Inputs/expected-sarif: * Update the schema URL to the offical website * Add the emitted defaultConfiguration sections to all rules * Annotate results with the "level" property clang/lib/StaticAnalyzer/Core/SarifDiagnostics.cpp: * Update SarifDiagnostics class to hold a clang::SarifDocumentWriter that it uses to convert diagnostics to SARIF.	2022-11-17 14:47:02 -05:00
Roman Lebedev	8adfa29706	[Pipelines] Introduce SROA after (final, run-time) loop unrolling Now that we are done with loop unrolling, be it either by LoopVectorizer, or LoopUnroll passes, some variable-offset GEP's into alloca's could have become constant-offset, thus enabling SROA and alloca promotion, yet we don't capitalize on that, which is surprizing. While it would be good to not introduce one more SROA invocation, but instead move the one from `PassBuilder::buildFunctionSimplificationPipeline()`, the existing test coverage says that is a bad idea, though it would be fine compile-time wise: https://llvm-compile-time-tracker.com/compare.php?from=b150d34c47efbd8fa09604bce805c0920360f8d7&to=5a9a5c855158b482552be8c7af3e73d67fa44805&stat=instructions So instead, i add yet another SROA run. I have checked, and it needs to be at least after said final loop unrolling. This is still fine compile-time wise: https://llvm-compile-time-tracker.com/compare.php?from=70324cd88328c0924e605fa81b696572560aa5c9&to=fb489bbef687ad821c3173a931709f9cad9aee8a&stat=instructions I've encountered this in a real code, `SROA-after-final-loop-unrolling.ll` has been reduced from https://godbolt.org/z/fsdMhETh3 Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D136806	2022-11-17 21:31:30 +03:00
Jonas Paulsson	858f347c17	[Clang, SystemZ] Add support for option -mno-pic-data-is-text-relative. Add support for this GCC option which has the purpose of disallowing text relative accesses of module local symbols with PIC. In effect, this changes the code model to "medium". Reviewed By: uweigand, efriedma, MaskRay Differential Revision: https://reviews.llvm.org/D137044	2022-11-17 11:32:32 -05:00
Alex Brachet	0dff945bbc	Fix debug-info test	2022-11-17 16:02:54 +00:00
Matt Jacobson	bbe6bd724a	Trivial fix to failing test on FreeBSD This file can't use C99-style comments.	2022-11-17 00:20:23 -05:00
Serge Pavlov	1ddd586308	[clang] Missed rounding mode use in constant evaluation Integer-to-float conversion was handled in constant evaluator with default rounding mode. This change fixes the behavior and the conversion is made using rounding mode stored in ImplicitCastExpr node. Differential Revision: https://reviews.llvm.org/D137719	2022-11-17 12:05:28 +07:00
Joshua Batista	083d949f38	[HLSL] add sin library function This change exposes the sin library function for HLSL, excluding long, int, and long long doubles. Sin is supported for all scalar, vector, and matrix types. Long and long long double support is missing in this patch because those types don't exist in HLSL. Int is missing because the sin function only works on floating type arguments. The full documentation of the HLSL sin function is available here: https://docs.microsoft.com/en-us/windows/win32/direct3dhlsl/dx-graphics-hlsl-sin Reviewed By: python3kgae Differential Revision: https://reviews.llvm.org/D138161	2022-11-16 18:29:50 -08:00
Ben Shi	84ef723573	[clang] Fix wrong ABI of AVRTiny. A scalar which exceeds 4 bytes should be returned via a stack slot, on an AVRTiny device. Reviewed By: aykevl Differential Revision: https://reviews.llvm.org/D138125	2022-11-17 08:38:44 +08:00
Eli Friedman	0fcb26c5b6	[clang] Fix __try/__finally blocks in C++ constructors. We were crashing trying to convert a GlobalDecl from a CXXConstructorDecl. Instead of trying to do that conversion, just pass down the original GlobalDecl. I think we could actually compute the correct constructor/destructor kind from the context, given the way Microsoft mangling works, but it's simpler to just pass through the correct constructor/destructor kind. Differential Revision: https://reviews.llvm.org/D136776	2022-11-16 15:13:33 -08:00
Richard Smith	9e52db1827	When we run out of source locations, try to produce useful information indicating why we ran out.	2022-11-16 14:36:16 -08:00
Tom Honermann	3e25ae605e	[Clang] Correct when Itanium ABI guard variables are set for non-block variables with static or thread storage duration. Previously, Itanium ABI guard variables were set after initialization was complete for non-block declared variables with static and thread storage duration. That resulted in initialization of such variables being restarted in cases where the variable was referenced while it was still under construction. Per C++20 [class.cdtor]p2, such references are permitted (though the value obtained by such an access is unspecified). The late initialization resulted in recursive reinitialization loops for cases like this: template<typename T> struct ct { struct mc { mc() { ct<T>::smf(); } void mf() const {} }; thread_local static mc tlsdm; static void smf() { tlsdm.mf(); } }; template<typename T> thread_local typename ct<T>::mc ct<T>::tlsdm; int main() { ct<int>::smf(); } With this change, guard variables are set before initialization is started so as to avoid such reinitialization loops. Fixes https://github.com/llvm/llvm-project/issues/57828 Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D135919	2022-11-16 16:31:35 -05:00
Joshua Batista	9777e98a61	[HLSL] add cos library function This change exposes the cos library function for HLSL, excluding long, int, and long long doubles. Cos is supported for all scalar, vector, and matrix types. Long and long long double support is missing in this patch because those types don't exist in HLSL. Int is missing because the cos function only works on floating type arguments. The full documentation of the HLSL cos function is available here: https://docs.microsoft.com/en-us/windows/win32/direct3dhlsl/dx-graphics-hlsl-cos Reviewed By: python3kgae Differential Revision: https://reviews.llvm.org/D134921	2022-11-16 12:54:11 -08:00
Louis Dionne	9621b1776a	[clang] Don't include C++ Standard Library headers when -nostdinc is used This is a follow-up to `53c98d85a`, which made the same change but only for GNU. It seems that we should try to provide a consistent behavior across all targets. This fixes an issue where clang/test/Driver/nostdincxx.cpp would start failing on non-GNU targets because that test was too loose in its checks. It would only check that 'file not found' was part of the error message, but didn't ensure that the file we had not found was <vector>. Differential Revision: https://reviews.llvm.org/D138062	2022-11-16 15:25:32 -05:00
Zequan Wu	29c9287917	[AST] Fix class layout when using external layout under MS ABI. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D137806	2022-11-16 11:17:50 -08:00
Erich Keane	6c38ffc7b6	[Concepts] Fix friend-checking to include NTTPs More work for temp.friend p9, this fixes a previous bug where we didn't properly consider a friend to depend on the enclosing template if it only did so via an NTTP.	2022-11-16 07:33:15 -08:00
Michał Górny	b3f94fe1c3	[clang][Driver] allow tilde in user config dir This patch allows users to configure clang with option e.g. `-DCLANG_CONFIG_FILE_USER_DIR=~/.config/clang` or invoke clang with `--config-user-dir=~/.config/clang`. Patch merged on behalf of @paperchalice (LJC) Differential Revision: https://reviews.llvm.org/D136940	2022-11-16 13:23:25 +01:00
Ties Stuij	983f63f7f0	[AArch64][ARM] add Armv8.9-a/Armv9.4-a identifier support For both ARM and AArch64 add support for specifying -march=armv8.9a/armv9.4a to clang. Add backend plumbing like target parser and predicate support. For a summary of Amv8.9/Armv9.4 features, see: https://community.arm.com/arm-community-blogs/b/architectures-and-processors-blog/posts/arm-a-profile-architecture-2022 For detailed information, consult the Arm Architecture Reference Manual for A-profile architecture: https://developer.arm.com/documentation/ddi0487/latest/ People who contributed to this patch: - Keith Walker - Ties Stuij Reviewed By: tmatheson Differential Revision: https://reviews.llvm.org/D138010	2022-11-16 10:20:14 +00:00
gonglingqin	ddbb21bdb5	[LoongArch] Add immediate operand validity check for __builtin_loongarch_dbar Differential Revision: https://reviews.llvm.org/D137809	2022-11-16 14:47:45 +08:00
Serge Pavlov	1f67dc8b7c	[Driver] Enable nested configuration files Users may partition parameters specified by configuration file and put different groups into separate files. These files are inserted into the main file using constructs `@file`. Relative file names in it are resolved relative to the including configuration file and this is not convenient in some cases. A configuration file, which resides in system directory, may need to include a file with user-defined parameters and still provide default definitions if such file is absent. To solve such problems, the option `--config=` is allowed inside configuration files. Like `@file` it results in insertion of command-line arguments but the algorithm of file search is different and allows overriding system definitions with user ones. Differential Revision: https://reviews.llvm.org/D136354	2022-11-16 13:32:27 +07:00
Akira Hatanaka	81e33602f7	[Sema] Use the value category of the base expression when creating an ExtVectorElementExpr This fixes a bug where an lvalue ExtVectorElementExpr was created when the base expression was an ObjC property dot operator. This reverts `220d08d942`. Differential Revision: https://reviews.llvm.org/D138058	2022-11-15 17:14:48 -08:00
Akira Hatanaka	063a43b4fd	[ObjC] Fix an assertion failure in EvaluateLValue Look through parentheses when determining whether the expression is a @selector expression.	2022-11-15 14:41:28 -08:00
Ben Langmuir	dcfb25078c	[clang][deps] Remove checks that were just for exhaustiveness Instead of checking all the paths, just ensure the one we care about is correct. On a particular platform one of the paths seems to have been more canonical than we were expecting, which is fine.	2022-11-15 14:22:23 -08:00
Jennifer Yu	628fdc3f57	[OPENMP]Initial support for at clause Error directive is allowed in both declared and executable contexts. The function ActOnOpenMPAtClause is called in both places during the parsers. Adding a param "bool InExContext" to identify context which is used to emit error massage. Differential Revision: https://reviews.llvm.org/D137851	2022-11-15 14:06:50 -08:00
Ben Langmuir	05ec16d90d	[clang][deps] Avoid leaking modulemap paths across unrelated imports Use a FileEntryRef when retrieving modulemap paths in the scanner so that we use a path compatible with the original module import, rather than a FileEntry which can allow unrelated modules to leak paths into how we build a module due to FileManager mutating the path. Note: the current change prevents an "unrelated" path, but does not change how VFS mapped paths are handled (which would be calling getNameAsRequested) nor canonicalize the path. Differential Revision: https://reviews.llvm.org/D137989	2022-11-15 13:59:26 -08:00
Shafik Yaghmour	9332ddfba6	[Clang] Extend the number of case Sema::CheckForIntOverflow covers Currently Sema::CheckForIntOverflow misses several case that other compilers diagnose for overflow in integral constant expressions. This includes the arguments of a CXXConstructExpr as well as the expressions used in an ArraySubscriptExpr, CXXNewExpr and CompoundLiteralExpr. This fixes https://github.com/llvm/llvm-project/issues/58944 Differential Revision: https://reviews.llvm.org/D137897	2022-11-15 12:07:03 -08:00
Ayke van Laethem	09ab9d4d11	[AVR][Clang] Implement __AVR_ARCH__ macro This macro is defined in avr-gcc, and is very useful especially in assembly code to check whether particular instructions are supported. It is also the basis for other macros like __AVR_HAVE_ELPM__. Differential Revision: https://reviews.llvm.org/D137521	2022-11-15 15:29:37 +01:00
Ayke van Laethem	d2563775cd	[AVR][Clang] Move family names into MCU list This simplifies the code by avoiding some special cases for family names (as opposed to device names). Differential Revision: https://reviews.llvm.org/D137520	2022-11-15 15:29:37 +01:00
Chuanqi Xu	cb2289f392	[C++20] [Modules] Attach implicitly declared allocation funcitons to global module fragment [basic.stc.dynamic.general]p2 says: > The library provides default definitions for the global allocation > and deallocation functions. Some global allocation and > deallocation > functions are replaceable ([new.delete]); these are attached to > the global module ([module.unit]). But we didn't take this before and the implicitly generated functions will live in the module purview if we're compiling a module unit. This is bad since the owning module will affect the linkage of the declarations. This patch addresses this. Closes https://github.com/llvm/llvm-project/issues/58560	2022-11-15 17:21:48 +08:00
Michele Scandale	b7d7c448df	Fix `unsafe-fp-math` attribute emission. The conditions for which Clang emits the `unsafe-fp-math` function attribute has been modified as part of `84a9ec2ff1ee97fd7e8ed988f5e7b197aab84a7`. In the backend code generators `"unsafe-fp-math"="true"` enable floating point contraction for the whole function. The intent of the change in `84a9ec2ff1ee97fd7e8ed988f5e7b197aab84a7` was to prevent backend code generators performing contractions when that is not expected. However the change is inaccurate and incomplete because it allows `unsafe-fp-math` to be set also when only in-statement contraction is allowed. Consider the following example ``` float foo(float a, float b, float c) { float tmp = a * b; return tmp + c; } ``` and compile it with the command line ``` clang -fno-math-errno -funsafe-math-optimizations -ffp-contract=on \ -O2 -mavx512f -S -o - ``` The resulting assembly has a `vfmadd213ss` instruction which corresponds to a fused multiply-add. From the user perspective there shouldn't be any contraction because the multiplication and the addition are not in the same statement. The optimized IR is: ``` define float @test(float noundef %a, float noundef %b, float noundef %c) #0 { %mul = fmul reassoc nsz arcp afn float %b, %a %add = fadd reassoc nsz arcp afn float %mul, %c ret float %add } attributes #0 = { [...] "no-signed-zeros-fp-math"="true" "no-trapping-math"="true" [...] "unsafe-fp-math"="true" } ``` The `"unsafe-fp-math"="true"` function attribute allows the backend code generator to perform `(fadd (fmul a, b), c) -> (fmadd a, b, c)`. In the current IR representation there is no way to determine the statement boundaries from the original source code. Because of this for in-statement only contraction the generated IR doesn't have instructions with the `contract` fast-math flag and `llvm.fmuladd` is being used to represent contractions opportunities that occur within a single statement. Therefore `"unsafe-fp-math"="true"` can only be emitted when contraction across statements is allowed. Moreover the change in `84a9ec2ff1ee97fd7e8ed988f5e7b197aab84a7` doesn't take into account that the floating point math function attributes can be refined during IR code generation of a function to handle the cases where the floating point math options are modified within a compound statement via pragmas (see `CGFPOptionsRAII`). For consistency `unsafe-fp-math` needs to be disabled if the contraction mode for any scope/operation is not `fast`. Similarly for consistency reason the initialization of `UnsafeFPMath` of in `TargetOptions` for the backend code generation should take into account the contraction mode as well. Reviewed By: zahiraam Differential Revision: https://reviews.llvm.org/D136786	2022-11-14 20:40:57 -08:00
Roman Lebedev	b2fbafc911	[NFC][Clang] Autogenerate checklines in a test being affected by a patch	2022-11-15 03:51:24 +03:00
Fangrui Song	77bf0df376	Revert "[opt][clang] Enable using -module-summary/-flto=thin with -S/-emit-llvm" This reverts commit `bf8381a8bc`. There is a layering violation: LLVMAnalysis depends on LLVMCore, so LLVMCore should not include LLVMAnalysis header llvm/Analysis/ModuleSummaryAnalysis.h	2022-11-14 15:51:03 -08:00
Alexander Shaposhnikov	bf8381a8bc	[opt][clang] Enable using -module-summary/-flto=thin with -S/-emit-llvm Enable using -module-summary with -S (similarly to what currently can be achieved with opt <input> -o - \| llvm-dis). This is a recommit of `ef9e62469`. Test plan: ninja check-all Differential revision: https://reviews.llvm.org/D137768	2022-11-14 23:24:08 +00:00
Alexander Shaposhnikov	8c15c17e3b	Revert "[opt][clang] Enable using -module-summary/-flto=thin with -S/-emit-llvm" This reverts commit `ef9e624694` for further investigation offline. It appears to break the buildbot llvm-clang-x86_64-sie-ubuntu-fast.	2022-11-14 21:31:30 +00:00
Alexander Shaposhnikov	ef9e624694	[opt][clang] Enable using -module-summary/-flto=thin with -S/-emit-llvm Enable using -module-summary with -S (similarly to what currently can be achieved with opt <input> -o - \| llvm-dis). Test plan: ninja check-all Differential revision: https://reviews.llvm.org/D137768	2022-11-14 21:11:07 +00:00
Philip Reames	780c539844	[RISCV] Implement assembler support for XVentanaCondOps This change provides an implementation of the XVentanaCondOps vendor extension. This extension is defined in version 1.0.0 of the VTx-family custom instructions specification (https://github.com/ventanamicro/ventana-custom-extensions/releases/download/v1.0.0/ventana-custom-extensions-v1.0.0.pdf) by Ventana Micro Systems. In addition to the technical contribution, this change is intended to be a test case for our vendor extension policy. Once this lands, I plan to use this extension to prototype selection lowering to conditional moves. There's an RVI proposal in flight, and the expectation is that lowering to these and the new RVI instructions is likely to be substantially similar. Differential Revision: https://reviews.llvm.org/D137350	2022-11-14 09:01:54 -08:00
Luca Di Sera	5c67cf0a7f	Add clang_CXXMethod_isMoveAssignmentOperator to libclang The new method is a wrapper of `CXXMethodDecl::isMoveAssignmentOperator` and can be used to recognized move-assignment operators in libclang. An export for the function, together with its documentation, was added to "clang/include/clang-c/Index.h" with an implementation provided in "clang/tools/libclang/CIndex.cpp". The implementation was based on similar `clang_CXXMethod.*` implementations, following the same structure but calling `CXXMethodDecl::isMoveAssignmentOperator` for its main logic. The new symbol was further added to "clang/tools/libclang/libclang.map" to be exported, under the LLVM16 tag. "clang/tools/c-index-test/c-index-test.c" was modified to print a specific tag, "(move-assignment operator)", for cursors that are recognized by `clang_CXXMethod_isMoveAssignmentOperator`. A new regression test file, "clang/test/Index/move-assignment-operator.cpp", was added to ensure whether the correct constructs were recognized or not by the new function. The "clang/test/Index/get-cursor.cpp" regression test file was updated as it was affected by the new "(move-assignment operator)" tag. A binding for the new function was added to libclang's python's bindings, in "clang/bindings/python/clang/cindex.py", adding a new method for `Cursor`, `is_move_assignment_operator_method`. An accompanying test was added to `clang/bindings/python/tests/cindex/test_cursor.py`, testing the new function with the same methodology as the corresponding libclang test. The current release note, `clang/docs/ReleaseNotes.rst`, was modified to report the new addition under the "libclang" section. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D137246	2022-11-14 15:21:36 +01:00
Muhammad Omair Javaid	0b94525ddc	Revert "[libclang] Expose completion result kind in `CXCompletionResult`" This reverts commit `97105e5bf7`. It breaks clang-armv8-quick buildbot: https://lab.llvm.org/buildbot/#/builders/245/builds/761	2022-11-14 12:27:11 +04:00
Chuanqi Xu	360c5fe54c	[C++20] [Modules] Emit Macro Definition in -module-file-info action It is helpful to know whih macro definition is emitted in the module file without openning it directly. And this is not easy to be tested with the lit test. So this patch add the facility to emit macro definitions in `-module-file-info` action. And this should be innnocent for every other cases.	2022-11-14 13:28:26 +08:00
Arthur Eubanks	e564f5153f	[clang][test] Avoid UB in overload.cl	2022-11-13 14:02:24 -08:00
Roman Lebedev	6daa005b90	[NFC][Clang] Add some codegen tests for https://github.com/llvm/llvm-project/issues/58798	2022-11-13 02:39:52 +03:00
Fangrui Song	e588d23a91	Add back single quotes when dontcall attribute was split into dontcall-error/dontcall-warn Single quotes were accidentally dropped in D110364.	2022-11-11 22:59:51 -08:00
Joseph Huber	0f7e863154	[LinkerWrapper] Perform device linking steps in parallel This patch changes the device linking steps to be performed in parallel when multiple offloading architectures are being used. We use the LLVM parallelism support to accomplish this by simply doing each inidividual device linking job in a single thread. This change required re-parsing the input arguments as these arguments have internal state that would not be properly shared between the threads otherwise. By default, the parallelism uses all threads availible. But this can be controlled with the `--wrapper-jobs=` option. This was required in a few tests to ensure the ordering was still deterministic. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D136701	2022-11-11 13:46:33 -06:00
Alex Brachet	28e07984c3	Revert "[Clang][AArch64][Darwin] Enable GlobalISel by default for Darwin ARM64 platforms." This reverts commit `f64802e8d3`.	2022-11-11 19:40:08 +00:00
Zahira Ammarguellat	91628f0616	The handling of 'funsafe-math-optimizations' doesn't update the 'MathErrno' flag. But the driver checks for 'fno-math-errno' before passing 'funsafe-math-optimizations' to the FE. In GCC, the option 'funsafe-math-optimizations' doesn't affect the 'fmath-errno' flag. This patch aligns clang with GCC. '-ffast-math' sets the FPContract to 'fast'. But 'funsafe-math-optimizations' the driver doesn't consider the FPContract when handling the option. Unfortunately there are places in the BE that interpret unsafe math mode as allowing FMA. This patch makes -ffast-math' and 'funsafe-math-optimizations' behave similarly in regard to the setting of the FPContract. Differential Revision: https://reviews.llvm.org/D137578	2022-11-11 10:24:12 -05:00
Timm Bäder	99d3ead44c	[clang][Interp] Protect Record creation against infinite recursion This happens only in error cases, but we need to handle it anyway. Differential Revision: https://reviews.llvm.org/D136831	2022-11-11 08:38:06 +01:00
Timm Bäder	0dcfd0ce02	[clang][Interp] Support alignof() Support alignof() and __alignof() expressions. Fixes #58816 Differential Revision: https://reviews.llvm.org/D137240	2022-11-11 08:38:06 +01:00
Timm Bäder	7863646fd2	[clang][Interp] DerivedToBase casts Differential Revision: https://reviews.llvm.org/D137545	2022-11-11 08:38:06 +01:00
Joshua Batista	a5d14f757b	Add builtin_elementwise_sin and builtin_elementwise_cos Add codegen for llvm cos and sin elementwise builtins The sin and cos elementwise builtins are necessary for HLSL codegen. Tests were added to make sure that the expected errors are encountered when these functions are given inputs of incompatible types. The new builtins are restricted to floating point types only. Reviewed By: craig.topper, fhahn Differential Revision: https://reviews.llvm.org/D135011	2022-11-10 23:30:27 -08:00
gonglingqin	da34aff90d	[Clang][LoongArch] Implement __builtin_loongarch_crc_w_d_w builtin and add diagnostics This patch adds support to prevent __builtin_loongarch_crc_w_d_w from compiling on loongarch32 in the front end and adds diagnostics accordingly. Reference: https://github.com/gcc-mirror/gcc/blob/master/gcc/config/loongarch/larchintrin.h#L175-L184 Depends on D136906 Differential Revision: https://reviews.llvm.org/D137316	2022-11-11 09:16:57 +08:00
Weining Lu	6890b9b71e	Add missing changes for "[Clang][LoongArch] Handle -march/-m{single,double,soft}-float/-mfpu options" Some changes in D136146 were lost by an accidentally sumbit. So recover them.	2022-11-11 09:06:53 +08:00
Egor Zhdan	97105e5bf7	[libclang] Expose completion result kind in `CXCompletionResult` This allows clients of libclang to check whether a completion result is a keyword. Previously, keywords had `CursorKind == CXCursor_NotImplemented` and it wasn't trivial to distinguish a keyword from a pattern. This change moves `CodeCompletionResult::ResultKind` to `clang-c` under a new name `CXCompletionResultKind`. It also tweaks `c-index-test` to print the result kind instead of `NotImplemented`, and adjusts the tests for the new output. rdar://91852088 Differential Revision: https://reviews.llvm.org/D136844	2022-11-10 16:16:36 -08:00
Anastasia Stulova	790cbaafc9	[OpenCL] Fix diagnostics with templates in kernel args. Improve checking for the standard layout type when diagnosing the kernel argument with templated types. The check doesn't work correctly for references or pointers due to the lazy template instantiation. Current fix only improves cases where nested types in the templates do not depend on the template parameters. Differential Revision: https://reviews.llvm.org/D134445	2022-11-10 18:55:12 +00:00
gonglingqin	85f08c4197	[Clang][LoongArch] Implement __builtin_loongarch_dbar builtin Differential Revision: https://reviews.llvm.org/D136906	2022-11-10 17:27:44 +08:00
Weining Lu	60e5cfe2a4	[Clang][LoongArch] Define more LoongArch specific built-in macros Define below macros according to LoongArch toolchain conventions [1]. * `__loongarch_grlen` * `__loongarch_frlen` * `__loongarch_lp64` * `__loongarch_hard_float` * `__loongarch_soft_float` * `__loongarch_single_float` * `__loongarch_double_float` Note: 1. `__loongarch__` has been defined in earlier patch. 2. `__loongarch_arch` is not defined because I don't know how `TargetInfo` can get the arch name specified by `-march`. 3. `__loongarch_tune` will be defined in future. [1]: https://loongson.github.io/LoongArch-Documentation/LoongArch-toolchain-conventions-EN.html Depends on D136146 Differential Revision: https://reviews.llvm.org/D136413	2022-11-10 17:27:29 +08:00
Weining Lu	135a9272a4	[Clang][LoongArch] Handle -march/-m{single,double,soft}-float/-mfpu options This patch adds options -march, -msingle-float, -mdouble-float, -msoft-float and -mfpu for LoongArch. Clang options `msingle_float` and `mdouble_float` are moved from `m_mips_Features_Group` to `m_Group` because now more than targets use them. Reference: https://github.com/loongson/LoongArch-Documentation/blob/main/docs/LoongArch-toolchain-conventions-EN.adoc TODO: add -mtune. Differential Revision: https://reviews.llvm.org/D136146	2022-11-10 17:27:28 +08:00
Matt Jacobson	dd9f7963e4	[ObjC] avoid crashing when emitting synthesized getter/setter and ptrdiff_t is smaller than long On targets where ptrdiff_t is smaller than long, clang crashes when emitting synthesized getters/setters that call objc_[gs]etProperty. Explicitly emit a zext/trunc of the ivar offset value (which is defined to long) to ptrdiff_t, which objc_[gs]etProperty takes. Add a test using the AVR target, where ptrdiff_t is smaller than long. Test failed previously and passes now. Differential Revision: https://reviews.llvm.org/D112049	2022-11-10 02:10:30 -05:00
Michael Francis	e07a7040d9	[Test][AIX][pg] Add 32-bit linker invocation tests Differential Review: https://reviews.llvm.org/D137372	2022-11-09 15:02:45 -05:00
Michael Francis	dc9846ce98	[Test][AIX][p] Add 64-bit linker invocation tests Differential Review: https://reviews.llvm.org/D137373	2022-11-09 14:51:40 -05:00
Tomasz Kamiński	2fb3bec932	[analyzer] Fix crash for array-delete of UnknownVal values. We now skip the destruction of array elements for `delete[] p`, if the value of `p` is UnknownVal and does not have corresponding region. This eliminate the crash in `getDynamicElementCount` on that region and matches the behavior for deleting the array of non-constant range. Reviewed By: isuckatcs Differential Revision: https://reviews.llvm.org/D136671	2022-11-09 15:06:46 +01:00
Victor Campos	9d1ff787e5	[AArch64] Add support for the Cortex-X3 CPU Cortex-X3 is an Armv9-A AArch64 CPU. This patch introduces support for Cortex-X3. Technical Reference Manual: https://developer.arm.com/documentation/101593/latest Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D136589	2022-11-09 11:33:48 +00:00
OCHyams	4b6b2b1a42	Reapply: [Assignment Tracking][7/*] Add assignment tracking functionality to clang Reverted in `98fa95492f`. The Assignment Tracking debug-info feature is outlined in this RFC: https://discourse.llvm.org/t/ rfc-assignment-tracking-a-better-way-of-specifying-variable-locations-in-ir This patch plumbs the AssignmentTrackingPass (AKA declare-to-assign), added in the previous patch in this set, into the optimisation pipeline from clang. clang/test/CodeGen/assignment-tracking/assignment-tracking.cpp is the main test for this patch. Note: while clang (with the help of the declare-to-assign pass) can now emit Assignment Tracking metadata, the llvm middle and back ends don't yet understand it. Reviewed By: jmorse Differential Revision: https://reviews.llvm.org/D132226	2022-11-09 09:28:41 +00:00
Freddy Ye	84a18a260e	[X86] Support -march=sierraforest, grandridge, graniterapids. Reviewed By: skan, pengfei, MaskRay Differential Revision: https://reviews.llvm.org/D137153	2022-11-09 16:56:03 +08:00
Animesh Kumar	0f8e7b4329	[OpenMP] Add map clause to the LIT test on use_device_addr clause As per the OpenMP Spec, "A list item in a use_device_addr clause must have a corresponding list item in the device data environment" . Therefore a `map` clause is added which will make sure that the respective list items are mapped to the device data environment before the `use_device_addr` clause is specified. The CHECK lines are also modified based on this change. Differential Revision: https://reviews.llvm.org/D134974	2022-11-09 12:23:39 +05:30
Fangrui Song	8c2c62282f	[Driver] Refactor err_drv_unsupported_option_argument call sites to use llvm::opt::Arg::getSpelling For `-foo=bar`, getSpelling return `-foo=` which is exactly what we need from the diagnostic. Drop `-` from the err_drv_unsupported_option_argument template. This change makes `--` long option diagnostics more convenient. Reviewed By: lenary Differential Revision: https://reviews.llvm.org/D137659	2022-11-08 14:39:09 -08:00
David Green	f0e6c403c2	[AArch64] Allow users-facing feature names in clang target attributes D133848 added support for the GCC format of target("..") attributes. The supported formats to match gcc are: // "arch=<arch>" - parsed to features as per -march=.. // "cpu=<cpu>" - parsed to features as per -mcpu=.., with CPU set to <cpu> // "tune=<cpu>" - TuneCPU set to <cpu> // "+feature", "+nofeature" - Add (or remove) feature. We also support the existing formats, previously accepted by clang, for compatibility with the existing code and intrinsics code: // "feature", "no-feature" - Add (or remove) feature. The clang formats would accept and use internal feature names ("fullfp16"/"neon"/"sve") as opposed to the user facing names ("fp16"/"simd"/"sve"). Usually they use the same names, but can be different for cases like fp, fullfp16 and mte (among others). This patch makes the clang format also except the user facing names, by parsing the features through getArchExtFeature. There is a fallback if the name is not recognized (like "fullfp16"), where we add the existing string which should then be checked later for consistency. This allows the internal names to be used as before, so long as they are recognized as internal names. (Note that we currently don't have an implementation of isValidFeatureName. The backend will currently give an error like "'-sid' is not a recognized feature for this target (ignoring feature)." This should be improved in a later patch once an implementation of isValidFeatureName in clang is present). Differential Revision: https://reviews.llvm.org/D137617	2022-11-08 19:30:26 +00:00
OCHyams	98fa95492f	Revert "[Assignment Tracking][7/*] Add assignment tracking functionality to clang" This reverts commit `28f9636edd`. Bot failure: https://lab.llvm.org/buildbot/#/builders/109/builds/50251	2022-11-08 18:43:05 +00:00
OCHyams	28f9636edd	[Assignment Tracking][7/*] Add assignment tracking functionality to clang The Assignment Tracking debug-info feature is outlined in this RFC: https://discourse.llvm.org/t/ rfc-assignment-tracking-a-better-way-of-specifying-variable-locations-in-ir This patch plumbs the AssignmentTrackingPass (AKA declare-to-assign), added in the previous patch in this set, into the optimisation pipeline from clang. clang/test/CodeGen/assignment-tracking/assignment-tracking.cpp is the main test for this patch. Note: while clang (with the help of the declare-to-assign pass) can now emit Assignment Tracking metadata, the llvm middle and back ends don't yet understand it. Reviewed By: jmorse Differential Revision: https://reviews.llvm.org/D132226	2022-11-08 17:49:08 +00:00
Alexander Kornienko	a5c18fcf6e	Revert "[clang] Instantiate alias templates with sugar" This reverts commit `279fe6281d`, which causes non-linear compilation time growth. See https://reviews.llvm.org/D136565#3914755	2022-11-08 17:19:54 +01:00
Rageking8	94738a5ac3	Fix duplicate word typos; NFC This revision fixes typos where there are 2 consecutive words which are duplicated. There should be no code changes in this revision (only changes to comments and docs). Do let me know if there are any undesirable changes in this revision. Thanks.	2022-11-08 07:21:23 -05:00
Bjorn Pettersson	5f9a82683d	[clang][test] Use opt -passes=<name> instead of opt -name Updated the RUN line in several test cases to use the new PM syntax opt -passes=<pipeline> instead of the deprecated syntax opt -pass1 -pass2 This was not a complete cleanup in clang/test. But just a swipe using some simple search-and-replace. Mainly for RUN lines involving -mem2reg, -instnamer and -early-cse.	2022-11-08 12:15:42 +01:00
Thomas Preud'homme	f3a86a23c1	[Test] Fix driverkit-path.c with lib64 dir Reviewed By: yln Differential Revision: https://reviews.llvm.org/D137484	2022-11-08 09:54:47 +00:00
Tobias Hieta	aa99b607b5	[clang][pdb] Don't include -fmessage-length in PDB buildinfo As discussed in https://reviews.llvm.org/D136474 -fmessage-length creates problems with reproduciability in the PDB files. This patch just drops that argument when writing the PDB file. Reviewed By: hans Differential Revision: https://reviews.llvm.org/D137322	2022-11-08 10:05:59 +01:00
haoyuintel	13f83365cd	[Driver] Add -fsample-profile-use-profi This patch enable `-sample-profile-use-profi` in Clang frontend as user-facing feature. By using this patch, we can use the cflag of `-fsample-profile-use-profi` instead of `-mllvm -sample-profile-use-profi`. Reviewed By: hans, MaskRay Differential Revision: https://reviews.llvm.org/D136846	2022-11-08 15:51:38 +08:00
Sami Tolvanen	41ce74e6e9	[Clang][Sema] Add -Wincompatible-function-pointer-types-strict Clang supports indirect call Control-Flow Integrity (CFI) sanitizers (e.g. -fsanitize=cfi-icall), which enforce an exact type match between a function pointer and the target function. Unfortunately, Clang doesn't provide diagnostics that help developers avoid function pointer assignments that can lead to runtime CFI failures. -Wincompatible-function-pointer-types doesn't warn about enum to integer mismatches if the types are otherwise compatible, for example, which isn't sufficient with CFI. Add -Wincompatible-function-pointer-types-strict, which checks for a stricter function type compatibility in assignments and helps warn about assignments that can potentially lead to CFI failures. Reviewed By: aaron.ballman, nickdesaulniers Differential Revision: https://reviews.llvm.org/D136790	2022-11-07 23:05:28 +00:00
Amara Emerson	f64802e8d3	[Clang][AArch64][Darwin] Enable GlobalISel by default for Darwin ARM64 platforms. Differential Revision: https://reviews.llvm.org/D137269	2022-11-07 15:04:09 -08:00
Grace Jennings	86674f66cc	[HLSL] Added HLSL this as a reference This change makes `this` a reference instead of a pointer in HLSL. HLSL does not have the `->` operator, and accesses through `this` are with the `.` syntax. Tests were added and altered to make sure the AST accurately reflects the types. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D135721	2022-11-07 13:50:08 -08:00
Matthias Braun	cafe50daf5	Explicitly initialize opaque pointer mode in CodeGenAction Explicitly call `LLVMContext::setOpaquePointers` in `CodeGenAction` before loading any IR files. With this we use the mode specified on the command-line rather than lazily initializing it based on the contents of the IR. This helps when using `-fthinlto-index` which may end up mixing files with typed and opaque pointer types which fails when the first file happened to use typed pointers since we cannot downgrade IR with opaque pointer types to typed pointer types. Differential Revision: https://reviews.llvm.org/D137475	2022-11-07 12:31:28 -08:00
Nikita Popov	6ebca03021	[Clang] Update test after wasm intrinsics attribute change (NFC) I missed this test in `d35fcf0e97`.	2022-11-07 17:42:35 +01:00

1 2 3 4 5 ...

48149 Commits