llvm-project

Commit Graph

Author	SHA1	Message	Date
Kazu Hirata	7760971741	Forward-declare raw_ostream (NFC) This patch adds forward declarations of raw_ostream to those header files that are relying on the forward declaration of raw_ostream in llvm/include/llvm/ADT/Optional.h. I'm planning to move operator<< for Optional<T> and std::optional<T> from Optional.h to llvm/include/llvm/Support/raw_ostream.h. Once I do so, we no longer need to forward-declare raw_ostream in Optional.h.	2022-12-04 21:43:10 -08:00
Fangrui Song	4b1b9e22b3	Remove unused #include "llvm/ADT/Optional.h"	2022-12-05 04:21:08 +00:00
Kazu Hirata	595f1a6aaf	[llvm] Use std::nullopt instead of None in comments (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-04 19:47:13 -08:00
Fangrui Song	f4c16c4473	[MC] llvm::Optional => std::optional https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-04 21:36:08 +00:00
Fangrui Song	bac974278c	CodeGen/CommandFlags: Convert Optional to std::optional	2022-12-03 18:38:12 +00:00
Krzysztof Parzyszek	8c7c20f033	Convert Optional<CodeModel> to std::optional<CodeModel>	2022-12-03 12:08:47 -06:00
Kazu Hirata	20cde15415	[Target] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-02 20:36:06 -08:00
Krzysztof Parzyszek	86fe4dfdb6	TargetTransformInfo: convert Optional to std::optional Recommit: added missing "#include <cstdint>".	2022-12-02 11:42:15 -08:00
Krzysztof Parzyszek	4e12d1836a	Revert "TargetTransformInfo: convert Optional to std::optional" This reverts commit `b83711248c`. Some buildbots are failing.	2022-12-02 11:34:04 -08:00
Krzysztof Parzyszek	b83711248c	TargetTransformInfo: convert Optional to std::optional	2022-12-02 11:27:12 -08:00
Valery Pykhtin	5ce3273ebf	[AMDGPU] Scheduler: Don't revert the schedule if the register pressure isn't changed for a region This one-linear fix improves compilation time for about ~40% on ASAN enabled code. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D136069	2022-12-02 15:59:38 +01:00
Jay Foad	0affe0c8cf	Revert "[AMDGPU] Speedup GCNDownwardRPTracker::advanceBeforeNext" This reverts commit `2d09bec169`. It was causing assertion failures in some out-of-tree tests.	2022-12-02 14:18:16 +00:00
Valery Pykhtin	2d09bec169	[AMDGPU] Speedup GCNDownwardRPTracker::advanceBeforeNext The function makes liveness tests for the entire live register set for every instruction it passes by. This becomes very slow on high RP regions such as ASAN enabled code. Instead only uses of last tracked instruction should be tested and this greatly improves compilation time. This patch revealed few bugs in SIFormMemoryClauses and PreRARematStage::sinkTriviallyRematInsts which should be fixed first. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D136267	2022-12-02 09:05:22 +01:00
Jay Foad	3d9e226081	[AMDGPU] Use s_cmp instead of s_cmpk Don't bother pre-shrinking "s_cmp_lg_u32 reg, 0" to s_cmpk_lg_u32 because 0 is already an inline constant so the s_cmpk form is no smaller. This is just for consistency with the surrounding code and to simplify a downstream patch. Differential Revision: https://reviews.llvm.org/D138993	2022-11-30 18:02:39 +00:00
Sebastian Neubauer	53efce3064	[AMDGPU] Remove todo about vector types D138205 added all the new vector types, so the todo is fixed now. Differential Revision: https://reviews.llvm.org/D139002	2022-11-30 14:13:12 +01:00
Pierre van Houtryve	a88deb4b65	[AMDGPU] Use aperture registers instead of S_GETREG Fixes a longstanding TODO in the codebase where we were using S_GETREG + shift to do something that could simply be done with an inline constant (register). Patch based on D31874 by @kzhuravl Depends on D137767 Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D137542	2022-11-30 12:25:10 +00:00
Dmitry Preobrazhensky	f2d589ea46	[AMDGPU][MC][GFX11] Disable non-VGPR src operands for VOP3_DPP variants of fmac instructions Differential Revision: https://reviews.llvm.org/D138710	2022-11-30 14:48:00 +03:00
Jay Foad	2b97b069e4	[AMDGPU] Fix location of line break in VOPC instruction table	2022-11-30 11:40:36 +00:00
Nicolai Hähnle	4e79764f0a	AMDGPU: Fixup tests	2022-11-30 12:37:40 +01:00
Nicolai Hähnle	b7f44f7cf9	AMDGPU: Remove ImagePSV and move images to addrspace 7 Following up on the removal of BufferPSV in commit `43b86bf992` ("AMDGPU: Remove BufferPseudoSourceValue") It is unclear what exactly the right address space for images should be. They seem morally closest to buffers, so that's what I went with. In practical terms, address space 7 is better than address space 0 because it can't alias with LDS. Differential Revision: https://reviews.llvm.org/D138949	2022-11-30 11:32:34 +01:00
Thomas Symalla	851176c7f7	[AMDGPU] Remove AMDGPUISelDAGToDAG::isKnownNeverNaN This function removes the mentioned function, as it only does two checks which are already implemented as part of SelectionDAG::isKnownNeverNaN - which is called there. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D138938	2022-11-30 07:58:32 +01:00
Matt Arsenault	c08d55623d	AMDGPU: Fix creating illegal f16 fp_class We were missing legality checks. The device library build was broken for targets without f16 support. Technically the first pattern isn't tested by this patch; it only triggers with the isBeforeLegalize check in performAndCombine removed. I'm not sure how to trick this into appearing post-legalization.	2022-11-29 18:24:30 -05:00
Benjamin Kramer	3a86931f56	AMDGPU: Remove unused variables. NFC	2022-11-29 22:57:00 +01:00
Ron Lieberman	ca856fff1c	Revert "enable code-object-version=5" very sorry wrong repo. This reverts commit `d882ba7aea`.	2022-11-29 15:21:09 -06:00
Nicolai Hähnle	43b86bf992	AMDGPU: Remove BufferPseudoSourceValue The use of a PSV for buffer intrinsics is misleading because it may be misinterpreted as all buffer intrinsics accessing the same address in memory, which is clearly not true. Instead, build MachineMemOperands without a pointer value but with an address space, so that address space-based alias analysis can still work. There is a lot of test churn because previously address space 4 (constant address space) was used as an address space for buffer intrinsics. This doesn't make much sense and seems to have been an accident -- see the change in AMDGPUTargetMachine::getAddressSpaceForPseudoSourceKind. Differential Revision: https://reviews.llvm.org/D138711	2022-11-29 22:15:11 +01:00
Ron Lieberman	d882ba7aea	enable code-object-version=5	2022-11-29 15:11:57 -06:00
Brendon Cahoon	b32a5666a8	[AMDGPU] Unify uniform return and divergent unreachable blocks This patch fixes a "failed to annotate CFG" error in SIAnnotateControlFlow. The problem occurs when there are divergent and uniform unreachable/return blocks in the same region. In this case, AMDGPUUnifyDivergentExitNodes does not create a unified block so the region contains multiple exits. StructurizeCFG does not work properly when there are multiple exits, so the neccessary CFG transformations do not occur along divergent control flow. Subsequently, SIAnnotateControlFlow processes the path to the divergent exit block, but may only partially process blocks along a unform control flow path to another exit block. This patch fixes the bug by creating a single exit block when there is a divergent exit block in the function. Differential revision: https://reviews.llvm.org/D136892	2022-11-29 13:25:56 -06:00
Jay Foad	0daa3df3a5	[AMDGPU] Use GCNSubtarget::hasInstPrefetch instead of generation check. NFC.	2022-11-29 17:36:33 +00:00
Mateja Marjanovic	595a08847a	[AMDGPU] Add support for new LLVM vector types Add VReg, AReg and SReg on AMDGPU for bit widths: 288, 320, 352 and 384. Differential Revision: https://reviews.llvm.org/D138205	2022-11-29 17:02:04 +01:00
Dmitry Preobrazhensky	9b8eb5fa8e	[AMDGPU][MC][GFX11] Correct op_sel handling for permlane*16 Differential Revision: https://reviews.llvm.org/D137969	2022-11-29 18:45:22 +03:00
Dmitry Preobrazhensky	869fc7eabd	[AMDGPU][MC][MI100+] Enable VOP3 variants of dot2c/dot4c/dot8c opcodes Differential Revision: https://reviews.llvm.org/D138494	2022-11-29 17:38:18 +03:00
Stanislav Mekhanoshin	28eb9ed3bb	[AMDGPU] Fine tune LDS misaligned access speed Differential Revision: https://reviews.llvm.org/D124219	2022-11-28 16:12:02 -08:00
Janek van Oirschot	322966f8f8	[AMDGPU] Add llvm.is.fpclass intrinsic to existing SelectionDAG fp class support and introduce GlobalISel implementation for AMDGPU Uses existing SelectionDAG lowering of the llvm.amdgcn.class intrinsic for llvm.is.fpclass	2022-11-28 16:00:36 -05:00
Matt Arsenault	94f73fd6f8	AMDGPU: Code simplification for ctor/dtor lowering Move the shared global variable lookup into the function.	2022-11-28 15:39:50 -05:00
David Stuttard	7940888c59	[AMDGPU] Intrinsic to expose s_wait_event for export ready Differential Revision: https://reviews.llvm.org/D138216	2022-11-28 11:26:15 +00:00
Ivan Kosarev	536b8c5377	[AMDGPU][AsmParser] Remove the now-unused OptionalOperand structure. Became unused in https://reviews.llvm.org/D138492 .	2022-11-28 11:00:05 +00:00
Guillaume Chatelet	702126aec5	[NFC] Add helper method to ensure min alignment on MCSection Follow up on D138653. Differential Revision: https://reviews.llvm.org/D138686	2022-11-28 10:00:34 +00:00
Bjorn Pettersson	99c47d9e31	Remove TargetMachine::adjustPassManager Since opt no longer supports to run default (O0/O1/O2/O3/Os/Oz) pipelines using the legacy PM, there are no in-tree uses of TargetMachine::adjustPassManager remaining. This patch removes the no longer used adjustPassManager functions. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D137796	2022-11-28 10:24:16 +01:00
Kazu Hirata	589725f6e8	[llvm] Use std::size (NFC) std::size, introduced in C++17, allows us to directly obtain the number of elements of an array.	2022-11-26 13:47:32 -08:00
Kazu Hirata	aad2d272bf	[Utils] Use std::optional in AMDGPUBaseInfo.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-25 22:43:00 -08:00
Kazu Hirata	09e0aeaaaa	[AMDGPU] Use std::optional in SIPeepholeSDWA.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-25 22:40:00 -08:00
Kazu Hirata	96cb34ab5a	[AsmParser] Use std::optional in AMDGPUAsmParser.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-25 22:36:37 -08:00
Kazu Hirata	ebe3281793	[AMDGPU] Use std::optional in AMDGPUReleaseVGPRs.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-25 22:30:26 -08:00
Kazu Hirata	959c9cc7ac	[AMDGPU] Use std::optional in AMDGPUInstructionSelector.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-25 22:23:09 -08:00
Dmitry Preobrazhensky	a0a0dc6f8b	[AMDGPU][MC] Refactor MC Code Emitter to avoid using magic values Differential Revision: https://reviews.llvm.org/D138594	2022-11-25 18:02:00 +03:00
Guillaume Chatelet	6c09ea3fdd	[Alignment][NFC] Use Align in MCStreamer::emitValueToAlignment Differential Revision: https://reviews.llvm.org/D138674	2022-11-24 16:09:44 +00:00
Guillaume Chatelet	e647b4f519	[reland][Alignment][NFC] Use the Align type in MCSection Differential Revision: https://reviews.llvm.org/D138653	2022-11-24 13:19:18 +00:00
Guillaume Chatelet	3467f9c7d6	Revert D138653 [Alignment][NFC] Use the Align type in MCSection" This breaks the bolt project. This reverts commit `409f0dc4a4`.	2022-11-24 12:42:30 +00:00
Guillaume Chatelet	409f0dc4a4	[Alignment][NFC] Use the Align type in MCSection Differential Revision: https://reviews.llvm.org/D138653	2022-11-24 12:32:58 +00:00
Ivan Kosarev	fce7a7aa9f	[AMDGPU][AsmParser] Refine parsing instruction operands. Eliminates the need for working around optional and token operands being mistakenly parsed as expressions. Reviewed By: dp Differential Revision: https://reviews.llvm.org/D138492	2022-11-24 10:50:26 +00:00

1 2 3 4 5 ...

7433 Commits