* Replace getUserCost with getInstructionCost, covering all cost kinds.
* Remove getInstructionLatency; it is not implemented by any backend, and folding its functionality into getUserCost (now getInstructionCost) makes it easier for targets to handle all the cost kinds with their existing cost callbacks.
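As a hedged sketch (illustrative, not taken from the patch), a pass can now route every cost kind through the single entry point:
```
// Illustrative sketch: with the unified API, all cost kinds go through
// getInstructionCost instead of separate getUserCost/getInstructionLatency
// calls.
#include "llvm/Analysis/TargetTransformInfo.h"
#include "llvm/IR/Instruction.h"
using namespace llvm;

InstructionCost costOf(const Instruction &I, const TargetTransformInfo &TTI,
                       TargetTransformInfo::TargetCostKind Kind) {
  // Kind may be TCK_RecipThroughput, TCK_Latency, TCK_CodeSize, or
  // TCK_SizeAndLatency; one callback now covers them all.
  return TTI.getInstructionCost(&I, Kind);
}
```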
Original Patch by @samparker (Sam Parker)
Differential Revision: https://reviews.llvm.org/D79483
NewGVN tables are not cleared out between the initial run of NewGVN and the
verification. In case of the phi-of-ops optimization, OpSafeForPHIOfOps goes
out of sync between the two runs: an operand might not be safe for one basic
block, but it might be safe for one of its successors, in which case the
operand is added to the OpSafeForPHIOfOps map. In the verification phase, we
reuse OpSafeForPHIOfOps without updating it again. As a result, the operand is
considered safe for the phi-of-ops optimization even in cases where it is not.
This patch fixes that problem.
Fixes issue #53807.
Reviewed By: asbirlea
Differential Revision: https://reviews.llvm.org/D130910
In D131869 we noticed that we jump through some hoops because we parse the
tolerance option used in MisExpect.cpp into a 64-bit integer. This is
unnecessary, since the value can only be in the range [0, 100).
This patch changes the underlying type to a 32-bit integer, from where it is
parsed in Clang through to its use in LLVM.
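A minimal sketch of the LLVM side, where the flag name and description are illustrative, not the exact code:
```
// Hedged sketch: the tolerance only ever holds values in [0, 100),
// so a 32-bit option suffices.
#include "llvm/Support/CommandLine.h"
#include <cstdint>
using namespace llvm;

static cl::opt<uint32_t> MisExpectTolerance(
    "misexpect-tolerance", cl::init(0),
    cl::desc("Tolerance, as a percentage in [0, 100), applied when checking "
             "profile counts against llvm.expect hints"));
```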
Reviewed By: jloser
Differential Revision: https://reviews.llvm.org/D131935
If a function only has a few instructions, instrumentation can significantly increase the size and performance overhead of that function. Add the `-pgo-function-size-threshold` option to select a size threshold so these small functions are not instrumented.
A similar option `-fxray-instruction-threshold=<N>` is used for XRay to reduce binary size overhead [1].
[1] https://www.llvm.org/docs/XRay.html
Reviewed By: MaskRay
Differential Revision: https://reviews.llvm.org/D131816
Currently, Clang does not take advantage of the 0 initialisation under finite-math assumptions.
For example:
```
double f_prod = 0;
double arr[1000];
for (size_t i = 0; i < 1000; i++) {
  f_prod *= arr[i];
}
```
Clang will ignore that `f_prod` is set to zero, so the product can only ever be zero, and it will generate assembly that iterates over the loop.
Reviewed By: fhahn, spatel
Differential Revision: https://reviews.llvm.org/D131672
We don't have a dominator tree in this pass, so we
can't bail out sooner by checking for unreachable
code, but this is a minimal fix for the example in
issue #56875.
Currently, we try to vectorize values feeding into stores only if the
slp-vectorize-hor-store option is provided. We can safely enable
vectorization of the value operand of a single store in the basic block
if the operand value is used only in that store.
It should enable extra vectorization and should not increase compile
time significantly.
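A hedged illustration (not taken from the test suite) of the kind of code this enables:
```
// Illustrative example: the value stored to q[0] is a reduction tree over
// p[0..3] and has no users other than the store, so SLP may now consider
// vectorizing it even without -slp-vectorize-hor-store.
void storeReduction(double *q, const double *p) {
  q[0] = (p[0] + p[1]) + (p[2] + p[3]);
}
```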
Fixes https://github.com/llvm/llvm-project/issues/51320
Differential Revision: https://reviews.llvm.org/D131894
This should fix these build bot errors:
Step 6 (build-check-mlir-build-only) failure: build (failure)
C:\buildbot\mlir-x64-windows-ninja\llvm-project\llvm\lib\Transforms\Scalar\EarlyCSE.cpp(124): error C2220: the following warning is treated as an error
C:\buildbot\mlir-x64-windows-ninja\llvm-project\llvm\lib\Transforms\Scalar\EarlyCSE.cpp(124): warning C4996: 'llvm::Optional<llvm::fp::ExceptionBehavior>::getValue': Use value instead.
C:\buildbot\mlir-x64-windows-ninja\llvm-project\llvm\lib\Transforms\Scalar\EarlyCSE.cpp(129): warning C4996: 'llvm::Optional<llvm::RoundingMode>::getValue': Use value instead.
C:\buildbot\mlir-x64-windows-ninja\llvm-project\llvm\lib\Transforms\Scalar\EarlyCSE.cpp(1386): warning C4996: 'llvm::Optional<llvm::fp::ExceptionBehavior>::getValue': Use value instead.
C:\buildbot\mlir-x64-windows-ninja\llvm-project\llvm\lib\Transforms\Scalar\EarlyCSE.cpp(1388): warning C4996: 'llvm::Optional<llvm::RoundingMode>::getValue': Use value instead.
Previously we would only CSE constrained FP intrinsics when they used the
default floating-point environment; now non-default environments are handled
as well. An exception behavior of "strict" is still not allowed, since we are
not allowed to remove any traps in that case.
There are no restrictions on CSE across function calls inside a function.
Differential Revision: https://reviews.llvm.org/D112256
Using the legacy PM for the optimization pipeline is deprecated and in
the process of being removed. This is a small step in that direction.
For an example of migrating to the new PM:
853b57fe80
The basic patterns look like this:
https://alive2.llvm.org/ce/z/MDj9EC
The tests also include a use of the overflow value; without that extra use,
existing folds already reduce the pattern.
This was noted as a missing IR fold in:
926e7312b2
Hopefully, this makes it easier to implement a backend
fix because we should get the same IR regardless of
whether the source used builtins or inline code.
(A | ?) | (A ^ B) --> (A | ?) | B
https://alive2.llvm.org/ce/z/dbNQw4
This extends the existing transform to peek through
another 'or' instruction for the common operand.
This is the underlying missing fold that should allow
issue #56711 and issue #57120 to reduce even more.
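As a standalone sanity check of the underlying Boolean identity (plain C++, not LLVM code):
```
// Exhaustively verify (a | c) | (a ^ b) == (a | c) | b over 3-bit values;
// a ^ b only contributes the bits of b once a is already OR'd in.
#include <cassert>
#include <cstdint>

int main() {
  for (uint8_t a = 0; a < 8; ++a)
    for (uint8_t b = 0; b < 8; ++b)
      for (uint8_t c = 0; c < 8; ++c)
        assert(((a | c) | (a ^ b)) == ((a | c) | b));
}
```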
Allows for even more savings in the binary image while simultaneously removing the name of the offending stack variable.
Depends on D131631
Reviewed By: vitalybuka
Differential Revision: https://reviews.llvm.org/D131728
We're seeing non-determinism with loading sample profiles. It seems to
be related to the order in which we merge FunctionSamples in
promoteMergeNotInlinedContextSamples(). Use a MapVector to iterate over
NonInlinedCallSites in the order entries were inserted.
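A minimal sketch of why MapVector restores determinism, assuming nothing about the surrounding profile-loader code:
```
// MapVector keeps a vector of keys alongside the map, so iteration follows
// insertion order on every run, unlike hash-based containers whose order
// can vary between runs.
#include "llvm/ADT/MapVector.h"
#include "llvm/ADT/StringRef.h"

int sumInInsertionOrder(llvm::MapVector<llvm::StringRef, int> &M) {
  int Sum = 0;
  for (auto &KV : M) // deterministic: first-inserted entry comes first
    Sum += KV.second;
  return Sum;
}
```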
Reviewed By: wenlei, davidxl
Differential Revision: https://reviews.llvm.org/D131592
The goal is to reduce the size of MSan binaries built with track-origins by
making the variable name locations constant, which will allow the linker to
compress them.
Follows: https://reviews.llvm.org/D131415
Reviewed By: vitalybuka
Differential Revision: https://reviews.llvm.org/D131631
Contextual knowledge may be used to prove invariance of some conditions.
For example, in this case:
```
; %len >= 0
guard(%iv = {start,+,1}<nuw> <s %len)
guard(%iv = {start,+,1}<nuw> <u %len)
```
the 2nd check always fails if `start` is negative and always passes otherwise.
It looks like there are more opportunities of this kind still to be
implemented.
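A standalone check of that reasoning (plain C++, illustrative only):
```
// With len >= 0: if iv is non-negative, the signed and unsigned checks
// agree; if iv is negative, its unsigned value is >= 2^31 > len, so the
// unsigned check always fails.
#include <cassert>
#include <cstdint>

void checkGuardEquivalence(int32_t iv, int32_t len) {
  assert(len >= 0);
  if (iv >= 0)
    assert((iv < len) == (uint32_t(iv) < uint32_t(len)));
  else
    assert(!(uint32_t(iv) < uint32_t(len)));
}
```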
Differential Revision: https://reviews.llvm.org/D129753
Reviewed By: apilipenko
Closing https://github.com/llvm/llvm-project/issues/56329
The problem happens when we try to simplify the suspend points. We might
break the assumption that the final suspend lives in the last slot of
Shape.CoroSuspends. This patch tries to maintain that assumption and fixes
the problem.
We manage to iteratively achieve this result with no extra
uses, and the reassociate pass can also do this, but this
pattern falls through the cracks in the example from
issue #57053.
Other sanitizers (ASan, TSan, see added tests) already handle
memcpy.inline and memset.inline by not relying on InstVisitor to turn
the intrinsics into calls. Only MSan instrumentation currently does not
support them due to missing InstVisitor callbacks.
Fix it by actually making InstVisitor handle Mem*InlineInst.
While the mem*.inline intrinsics promise no calls to external functions
as an optimization, for the sanitizers we need to break this guarantee
since access into the runtime is required either way, and performance
can no longer be guaranteed. All other cases, where generating a call is
incorrect, should instead use no_sanitize.
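A hedged sketch of what the InstVisitor side enables, assuming the visitor delegates the inline variants to their base-class callbacks as this patch arranges:
```
// MemCpyInlineInst derives from MemCpyInst, so once InstVisitor dispatches
// llvm.memcpy.inline here, an instrumentation pass sees both variants in
// one callback and can emit its runtime call for either.
#include "llvm/IR/InstVisitor.h"
#include "llvm/IR/IntrinsicInst.h"
using namespace llvm;

struct MemIntrinsicCounter : InstVisitor<MemIntrinsicCounter> {
  unsigned NumMemCpys = 0;
  void visitMemCpyInst(MemCpyInst &I) { ++NumMemCpys; }
};
```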
Fixes: https://github.com/llvm/llvm-project/issues/57048
Reviewed By: vitalybuka, dvyukov
Differential Revision: https://reviews.llvm.org/D131577
This is done by calling __msan_set_alloca_origin and providing the location of the variable by using the call stack.
This is preparatory work for dropping variable names when track-origins is enabled.
Reviewed By: vitalybuka
Differential Revision: https://reviews.llvm.org/D131205
The relevant property of allocation functions here is their uniqueness (in
the sense of disjoint provenance), which is encoded by the noalias return
attribute.
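For illustration, this is the property that allocator attributes communicate at the source level (the allocator name is hypothetical):
```
// Hedged illustration: marking an allocator with __attribute__((malloc))
// lets Clang attach the noalias return attribute, i.e. each call returns
// storage whose provenance is disjoint from all other pointers.
__attribute__((malloc)) void *my_alloc(unsigned long n);

int disjoint(unsigned long n) {
  void *p = my_alloc(n);
  void *q = my_alloc(n);
  return p != q; // distinct successful allocations never alias
}
```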
Differential Revision: https://reviews.llvm.org/D130225
After D121595 was committed, I noticed regressions associated with
vectorisation of small trip count loops by tail folding with scalable
vectors. As a solution for those issues, I propose to introduce a minimal
trip count threshold value.
Differential Revision: https://reviews.llvm.org/D130755
The RelLookupTableConverter pass currently only supports 64-bit
pointers. This is currently enforced using an isArch64Bit() check
on the target triple. However, we consider x32 to be a 64-bit target,
even though the pointers are 32-bit. (And independently of that
specific example, there may be address spaces with different pointer
sizes.)
As such, add an additional guard for the size of the pointers that
are actually part of the lookup table.
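A hedged sketch of the added guard (names are illustrative, not the exact code):
```
// Query the width of pointers in the global's address space via the
// DataLayout instead of trusting Triple::isArch64Bit(); on x32 the triple
// is 64-bit but pointers are 32 bits wide.
#include "llvm/IR/DataLayout.h"
#include "llvm/IR/GlobalVariable.h"
#include "llvm/IR/Module.h"
using namespace llvm;

static bool has64BitTablePointers(const GlobalVariable &GV) {
  const DataLayout &DL = GV.getParent()->getDataLayout();
  return DL.getPointerSizeInBits(GV.getAddressSpace()) == 64;
}
```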
Differential Revision: https://reviews.llvm.org/D131399
With profile data, non-trivial LoopUnswitch will only be applied to non-cold loops, as unswitching cold loops may gain little benefit while significantly increasing code size.
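A minimal sketch of the kind of check involved, with names assumed for illustration:
```
// With a profile summary available, treat a loop as unprofitable to
// unswitch non-trivially when its header is cold.
#include "llvm/Analysis/BlockFrequencyInfo.h"
#include "llvm/Analysis/LoopInfo.h"
#include "llvm/Analysis/ProfileSummaryInfo.h"
using namespace llvm;

static bool skipNonTrivialUnswitch(const Loop &L, ProfileSummaryInfo *PSI,
                                   BlockFrequencyInfo *BFI) {
  return PSI && PSI->hasProfileSummary() && BFI &&
         PSI->isColdBlock(L.getHeader(), BFI);
}
```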
Reviewed By: aeubanks, asbirlea
Differential Revision: https://reviews.llvm.org/D129599
We are seeing significant performance loss when an alloca fails to get promoted
to a register. I have observed that this is due to the common type found when
attempting to rewrite partition users being unviable for promotion, whereas if
we had continued looking for a type, we would have found a subtype in the
original allocated type that would have enabled promotion. Thus, first check
whether the initial common type found is viable for promotion, and if not,
continue looking instead of stopping with the initial common type found.
Reviewed By: arsenm
Differential Revision: https://reviews.llvm.org/D128073
When EarlyCSE tries to common vector masked loads/stores, it first checks that
they have the same base operand and then assumes that this is enough for the
mask types to be equal. This is true for typed pointers but false for opaque
ones: two loads of different vector sizes from the same base pointer `%b` look
identical, since both operate on `ptr %b`. (With typed pointers, `%b` was cast
to the appropriate vector pointer type, so the bases were different.)
Change the assert into a return from the lambda `isSubmask` so this
transformation works properly with opaque pointers.
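A hedged sketch of the shape of the fix (illustrative, not the exact code):
```
// Where the helper previously asserted that both masks had the same fixed
// vector width, it now reports failure, since opaque pointers make
// mismatched widths reachable from the same base pointer.
#include "llvm/IR/DerivedTypes.h"
#include "llvm/IR/Value.h"
using namespace llvm;

static bool sameMaskWidth(const Value *M1, const Value *M2) {
  auto *V1 = dyn_cast<FixedVectorType>(M1->getType());
  auto *V2 = dyn_cast<FixedVectorType>(M2->getType());
  return V1 && V2 && V1->getNumElements() == V2->getNumElements();
}
```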
Reviewed By: nikic
Differential Revision: https://reviews.llvm.org/D131251