llvm-project

Commit Graph

Author	SHA1	Message	Date
zhoujingya	294b5b4e5e	[VENTUS][fix] Remove illegal VMV_X_S/VFMV_F_S instructions' definition and patterns https://github.com/THU-DSP-LAB/llvm-project/issues/31	2023-10-10 17:30:47 +08:00
zhoujing	2d8205dd54	[VENTUS][fix] Kernel function arguments are not divergent path	2023-08-21 16:14:21 +08:00
zhoujing	d48134f86e	[VENTUS][RISCV][fix] Add divergent analysis for function arguments	2023-06-27 11:44:50 +08:00
Aries	e6b7935c89	[Ventus] ABI and stack adjustment. Remove all SGPRs(except ra) from callee saved register set, as they are mainly used in kernel function. Unify the stack to use TP only, we will emit customized instructions for SP use which should not be considered as stack according to LLVM codegen infrastructure(only 1 stack is allowed). By unifying the stack to TP based, it is much easiler for the backend codegen.	2023-06-21 13:08:02 +08:00
Aries	a7b9ee473b	[NFC] Fix comment style	2023-06-19 13:14:31 +08:00
zhoujing	2c5cbdd187	[VENTUS][RISCV][fix] Add more divergence ananlysis Referenced from AMDGPU	2023-06-15 22:34:40 +08:00
Aries	438f1c92c4	Fix some build warnings	2023-01-19 09:45:27 +08:00
Aries	dee3135130	Drafting divergent related code, not working yet.	2022-12-19 18:11:34 +08:00
Aries	894931f522	More clean up and fix build error.	2022-12-19 10:10:28 +08:00
Aries	521e83631d	Roughly cleaned RVV instruction selection.	2022-12-19 09:40:05 +08:00
Aries	f1eff7fcfe	Very very early step to remove RVV features from code base.	2022-12-16 17:33:54 +08:00
Krzysztof Parzyszek	86fe4dfdb6	TargetTransformInfo: convert Optional to std::optional Recommit: added missing "#include <cstdint>".	2022-12-02 11:42:15 -08:00
Krzysztof Parzyszek	4e12d1836a	Revert "TargetTransformInfo: convert Optional to std::optional" This reverts commit `b83711248c`. Some buildbots are failing.	2022-12-02 11:34:04 -08:00
Krzysztof Parzyszek	b83711248c	TargetTransformInfo: convert Optional to std::optional	2022-12-02 11:27:12 -08:00
Krzysztof Parzyszek	26424c96c0	Attributes: convert Optional to std::optional	2022-12-02 08:15:45 -06:00
Philip Reames	73eacf94e0	[RISCV] Incorporate LMUL into costs for arithmetic and shuffles This reuses the routine implemented in `0e6f0b7` to implement several existing TODOs. Many of the operations scale linearly with LMUL; this change represents that in the cost model. Differential Revision: https://reviews.llvm.org/D139039	2022-12-01 10:46:27 -08:00
Philip Reames	7d82c99403	[RISCV][TTI] Account for constant materialization cost when costing arithmetic operations At the IR level, we generally assume that constants are free to materialize. However, for RISCV due to some quirks of the ISA, materializing arbitrary constants can be rather expensive. We frequently fallback to constant pool loads. We've been slowly moving in the direction of modeling the cost of the remat as part of the instruction cost. This has the effect of disincentivizing vectorization - mostly SLP - when we'd have to materialize an expensive constant. We need better modeling of which constants are expensive and not, but the moment let's be consistent with how we model arithmetic and memory instructions. The difference between the two is that arithmetic can sometimes fold a splat operation which stores can not. Differential Revision: https://reviews.llvm.org/D138941	2022-11-30 07:20:51 -08:00
ShihPo Hung	0e6f0b7cc3	[RISCV] Add cost model for fixed broadcast shuffle This patch adds basic broadcast shuffle costs in order to enable SLP vectorization. And adds `getLMULCost` to consider reciprocal throughput for different LMUL. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D137276	2022-11-30 04:58:52 -08:00
Philip Reames	db07d79ab0	[RISCV] Add cost model for integer and float vector arithmetic instructions. This patch implements getArithmeticInstrCost for RISCV, supports cost model for integer and float vector arithmetic instructions. Differential Revision: https://reviews.llvm.org/D133552 (Original patch by jacquesguan. Subset by me with todos added.)	2022-11-28 09:04:38 -08:00
Philip Reames	d2cf0bd78d	[RISCV] Add callback plumbing for getArithmeticInstrCost [nfc] Most of the code for this was taken from https://reviews.llvm.org/D133552, with one bug fix by me. I'm landing the plumbing so that we can focus on the cost model pieces in the review.	2022-11-22 18:53:23 -08:00
Craig Topper	a391b49ce8	[RISCV] Prevent constant hoisting for (and (shl X, C), mask<<C) If the immediate is a shifted mask, we will use a pair of shifts and never materialize the immediate. Consider the immediate free. Reviewed By: reames, luismarques Differential Revision: https://reviews.llvm.org/D138260	2022-11-21 19:16:40 -08:00
Yeting Kuo	ed9638c44b	[VP][RISCV] Add vp.nearbyint and RISC-V support. nearbyint has the property to execute without exception. For not modifying fflags, the patch added new machine opcode PseudoVFROUND_NOEXCEPT_V that expands vfcvt.x.f.v and vfcvt.f.x.v between a pair of frflags and fsflags. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D137685	2022-11-16 14:05:35 +08:00
Yeting Kuo	5c3ca10b09	[VP][RISCV] Add vp.bswap and RISC-V support. The patch also added function expandVPBSWAP to expand ISD::VP_BSWAP nodes. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D137928	2022-11-16 11:36:38 +08:00
Craig Topper	b6ad7ab89e	[RISCV] Prevent autovectorization using vscale with Zvl32b. RVVBitsPerBlock is 64. If VLen==32, VLen/RVVBitsPerBlock is 0. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D137280	2022-11-02 13:55:21 -07:00
Philip Reames	73482b457e	[RISCV] Fix cost of legal fixed length masked load and stores We can cost them the same way as a scalable masked load/store. By hitting the default path, we were costing them as if they were being scalarized. This is a significant over estimate. Differential Revision: https://reviews.llvm.org/D137218	2022-11-02 07:24:38 -07:00
Yeting Kuo	71e4e35581	[VP][RISCV] Add vp.rint and RISC-V support. FRINT uses dynamic rounding mode instead of static rounding mode. The patch rename VFCVT_X_F_VL to VFCVT_RM_X_F_VL for static rounding mode uses and added new ISDNode VFCVT_X_F_VL directly selected to PseudoVFCVT_X_F_V. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D136662	2022-11-01 14:52:47 +08:00
Craig Topper	e94dc58dff	[RISCV] Inline scalar ceil/floor/trunc/rint/round/roundeven. This avoids the call overhead as well as the the save/restore of fflags and the snan handling in the libm function. The save/restore of fflags and snan handling are needed to be correct for -ftrapping-math. I think we can ignore them in the default environment. The inline sequence will generate an invalid exception for nan and an inexact exception if fractional bits are discarded. I've used a custom inserter to explicitly create the control flow around the float->int->float conversion. We can probably avoid the final fsgnj after the conversion for no signed zeros FMF, but I'll leave that for future work. Note the comparison constant is slightly different than glibc uses. They use 1<<53 for double, I'm using 1<<52. I believe either are valid. Numbers >= 1<<52 can't have any fractional bits. It's ok to do the float->int->float conversion on numbers between 1<<53 and 1<<52 since they will all fit in 64. We only have a problem if the double can't fit in i64 Reviewed By: reames Differential Revision: https://reviews.llvm.org/D136508	2022-10-26 14:36:49 -07:00
Craig Topper	d4dc036e70	[RISCV] Move vector cost table lookup out of the switch in getIntrinsicInstrCost. NFC This allows vectors to be looked up if the switch is used for the scalar version of an intrinsic. Extracted from D136508.	2022-10-24 20:32:22 -07:00
Craig Topper	020450211b	[RISCV] Add missing vscale x 1 cost model entries and tests. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D136411	2022-10-21 09:05:59 -07:00
Craig Topper	44f0b13494	[RISCV] Correct RISCVTTIImpl::getRegUsageForType for vectors of pointers. getPrimitiveSizeInBits returns 0 for pointers, we need to query the size via DataLayout instead. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D135976	2022-10-14 11:34:12 -07:00
Craig Topper	de0de294eb	[RISCV] Update cost of vector roundeven to match round which uses the same sequence but a different FRM value. Reviewed By: reames, eopXD Differential Revision: https://reviews.llvm.org/D134978	2022-09-30 20:01:35 -07:00
Philip Reames	02bfe2de7c	[RISCV] Adjust vector immediate store materialization cost This change updates the costs to make constant pool loads match their actual cost, and adds the broadcast special case to avoid too many regressions. We really need more information about the constants being rematerialized, but this is an incremental improvement. Differential Revision: https://reviews.llvm.org/D134746	2022-09-29 07:37:13 -07:00
Philip Reames	77b202f974	[RISCV] Rename getVectorImmCost to getStoreImmCost [nfc] My original intent had been to reuse this for arithmetic instructions as well, but due to the availability of a immediate splat encoding there, we will need different heuristics. So specialize the existing code for the store case.	2022-09-27 08:22:13 -07:00
jacquesguan	ecf327f154	[RISCV] Add cost model for vector insert/extract element. This patch adds cost model for vector insert/extract element instructions. In RVV, we could use vector scalar move instruction to insert or extract the first element, and use vslide to move it. But for mask vector or i64 vector in i32 target, we need special instructions to make it. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D133007	2022-09-14 11:10:18 +08:00
Haojian Wu	7ed68182d7	Fix a -Wswitch warning.	2022-09-13 08:57:43 +02:00
jacquesguan	b98b4fae75	[RISCV] Add cost model for compare and select instructions. This patch adds cost model for vector compare and select instructions. For vector FP compare instruction, it only add the comparisions supported natively. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D132296	2022-09-13 14:44:46 +08:00
liqinweng	9b4e75ee76	[RISCV][COST] Add cost model for mask vector select instruction when its condition is a scalar type Reviewed By: jacquesguan Differential Revision: https://reviews.llvm.org/D132992	2022-09-08 18:55:49 +08:00
Craig Topper	5d30565d80	[RISCV] Improve vector fround lowering by changing FRM. This is a follow up to D133238 which did this for ceil/floor. Reviewed By: arcbbb, frasercrmck Differential Revision: https://reviews.llvm.org/D133335	2022-09-06 09:33:13 -07:00
Craig Topper	f0332d12ae	[RISCV] Improve vector fceil/ffloor lowering by changing FRM. This adds new VFCVT pseudoinstructions that take a rounding mode operand. A custom inserter is used to insert additional instructions to change FRM around the VFCVT. Some of this is borrowed from D122860, but takes a somewhat different direction. We may migrate to that patch, but for now I was trying to keep this as independent from RVV intrinsics as I could. A followup patch will use this approach for FROUND too. Still need to fix the cost model. Reviewed By: arcbbb Differential Revision: https://reviews.llvm.org/D133238	2022-09-05 19:03:44 -07:00
jacquesguan	45c1ce321d	[RISCV] Add cost model for select and integer compare instructions. This patch adds cost model for vector select and integer compare instructions.	2022-08-31 11:32:58 +08:00
liqinweng	72c9f811d8	[RISCV][COST] Refactor for costs of integer saturing add/sub Reviewed By: reames Differential Revision: https://reviews.llvm.org/D132822	2022-08-30 11:39:55 +08:00
liqinweng	a42e21deb8	[RISCV] Refactor for costs of integer min/max Reviewed By: reames Differential Revision: https://reviews.llvm.org/D132724	2022-08-29 10:13:50 +08:00
Philip Reames	a310637132	[RISCV] Disable SLP vectorization by default due to unresolved profitability issues This change implements a TTI query with the goal of disabling slp vectorization on RISCV. The current default configuration disables SLP already, but its current tied to the ability to lower fixed length vectors. Over in D131508, I want to enable fixed length vectors for purposes of LoopVectorizer, but preliminary analysis has revealed a couple of SLP specific issues we need to resolve before enabling it by default. This change exists to allow us to enable LV without SLP. Differential Revision: https://reviews.llvm.org/D132680	2022-08-26 14:11:22 -07:00
Philip Reames	53f738ce7e	[RISCV] Add empirical costs for integer min/max and saturing add/sub All of these are lowered to a single instruction for all legal vector types.	2022-08-25 09:27:17 -07:00
Philip Reames	03798f268b	{RISCV] Backout cttz/ctlz instruction costs Craig points out correctly in post-commit review that these depend on the availability of floating point extensions.	2022-08-24 15:40:48 -07:00
Philip Reames	d4d6e71ea2	[RISCV] Add empirical costs for bswap/bitreverse/ctpop/ctlz/cttz If anyone is looking for a source of ideas on vector codegen improvements, the lowerings for several of these seem to include pretty obvious fixits.	2022-08-24 15:09:21 -07:00
Philip Reames	42af1a776a	[RISCV] Add empirically measured vector sqrt intrinsic costs	2022-08-24 14:27:57 -07:00
Philip Reames	4d3134866f	[RISCV] Add vector fabs intrinsic costs We have a fabs vector instruction, and are using it for current lowering.	2022-08-24 14:09:51 -07:00
Philip Reames	c9608d57b8	[TTI] Plumb through OperandValueInfo in getMemoryOpCost [NFC] This has the effect of exposing the power-of-two property for use in memory op costing, but no target actually uses it yet. The main point of this change is simple consistency with the recently changes getArithmeticInstrCost, and to remove the last (interface) use of OperandValueKind.	2022-08-23 07:55:42 -07:00
Philip Reames	478cf94378	[X86][AArch64][WebAsm][RISCV] Query operand properties instead of using enums directly [nfc] This is part of an ongoing transition to use OperandValueInfo which combines OperandValueKind and OperandValueProperties. This change adds some accessor methods and uses them to simplify backend code. The primary motivation of doing so is removing uses of the parameters so that an upcoming api change is less error prone.	2022-08-22 13:37:59 -07:00

1 2 3

117 Commits