clang

History

Adam Nemet e280490a85 Add loop pragma for Loop Distribution Summary: This is similar to other loop pragmas like 'vectorize'. Currently it only has state values: distribute(enable) and distribute(disable). When one of these is specified the corresponding loop metadata is generated: !{!"llvm.loop.distribute.enable", i1 true/false} As a result, loop distribution will be attempted on the loop even if Loop Distribution in not enabled globally. Analogously, with 'disable' distribution can be turned off for an individual loop even when the pass is otherwise enabled. There are some slight differences compared to the existing loop pragmas. 1. There is no 'assume_safety' variant which makes its handling slightly different from 'vectorize'/'interleave'. 2. Unlike the existing loop pragmas, it does not have a corresponding numeric pragma like 'vectorize' -> 'vectorize_width'. So for the consistency checks in CheckForIncompatibleAttributes we don't need to check it against other pragmas. We just need to check for duplicates of the same pragma. Reviewers: rsmith, dexonsmith, aaron.ballman Subscribers: bob.wilson, cfe-commits, hfinkel Differential Revision: http://reviews.llvm.org/D19403 git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@272656 91177308-0d34-0410-b5e6-96231b3b80d8		2016-06-14 12:04:26 +00:00
..
ABIInfo.h	IRGen-level lowering for the Swift calling convention.	2016-04-04 18:33:08 +00:00
Address.h	Work around build failure due to GCC 4.8.1 bug. We don't completely understand	2016-02-02 23:11:49 +00:00
BackendUtil.cpp	[asan] Added -fsanitize-address-use-after-scope flag	2016-06-02 00:24:20 +00:00
CGAtomic.cpp	[MS Volatile] Don't make volatile loads/stores to underaligned objects atomic	2016-05-24 16:09:25 +00:00
CGBlocks.cpp	CodeGen: correct assertion	2016-06-03 23:26:30 +00:00
CGBlocks.h	Move BlockByrefHelpers back to CodeGenModule.h to placate MSVC.	2015-09-08 08:21:11 +00:00
CGBuilder.h	Remove compile time PreserveName in favor of a runtime cc1 -discard-value-names option	2016-03-13 21:05:23 +00:00
CGBuiltin.cpp	Fix unused variable warning	2016-06-13 10:05:19 +00:00
CGCUDABuiltin.cpp	[CUDA] Don't crash when trying to printf a non-scalar object.	2016-02-11 02:00:52 +00:00
CGCUDANV.cpp	[CUDA] Do not generate unnecessary runtime init code.	2016-03-02 18:28:53 +00:00
CGCUDARuntime.cpp	Roll-back r250822.	2015-10-20 13:23:58 +00:00
CGCUDARuntime.h	[CUDA] Emit host-side 'shadows' for device-side global variables	2016-03-02 18:28:50 +00:00
CGCXX.cpp	revert SVN r265702, r265640	2016-04-08 16:52:00 +00:00
CGCXXABI.cpp	Add the `pass_object_size` attribute to clang.	2015-12-02 21:58:08 +00:00
CGCXXABI.h	Introduce CGCXXABI::canCallMismatchedFunctionType	2016-05-10 17:44:55 +00:00
CGCall.cpp	Remove nonsense and simplify. To forward a reference, we always just load the	2016-06-14 01:13:21 +00:00
CGCall.h	Don't emit exceptional stackrestore cleanups around inalloca functions	2015-10-08 00:17:45 +00:00
CGClass.cpp	Implementation of VlA of GNU C++ extension, by Vladimir Yakovlev.	2016-04-29 09:39:50 +00:00
CGCleanup.cpp	[CodeGen] Emit lifetime.end intrinsic after objects are destructed in	2016-04-01 22:58:55 +00:00
CGCleanup.h	Update for LLVM function name change.	2016-01-14 21:00:27 +00:00
CGDebugInfo.cpp	[DebugInfo] Add calling conventions to DISubroutineType	2016-06-08 20:41:54 +00:00
CGDebugInfo.h	Reverting 268055 as it caused PR27579.	2016-04-30 01:44:38 +00:00
CGDecl.cpp	[asan] Added -fsanitize-address-use-after-scope flag	2016-06-02 00:24:20 +00:00
CGDeclCXX.cpp	Introduce CGCXXABI::canCallMismatchedFunctionType	2016-05-10 17:44:55 +00:00
CGException.cpp	[SEH] Remove nounwind/noinline from outlined finally funclets	2016-03-11 17:36:16 +00:00
CGExpr.cpp	[MS Volatile] Don't make volatile loads/stores to underaligned objects atomic	2016-05-24 16:09:25 +00:00
CGExprAgg.cpp	Fix -Werror build.	2016-03-08 23:16:16 +00:00
CGExprCXX.cpp	[MS ABI] Don't crash when zero-initializing a vbase which contains a vbase	2016-05-12 03:51:52 +00:00
CGExprComplex.cpp	[Bugfix] Fix ICE on constexpr vector splat.	2016-01-13 01:52:39 +00:00
CGExprConstant.cpp	Revert "[Temporary] Add an ExprWithCleanups for each C++ MaterializeTemporaryExpr."	2016-06-09 21:13:39 +00:00
CGExprScalar.cpp	[OpenCL] Fix __builtin_astype for vec3 types.	2016-06-08 15:11:21 +00:00
CGLoopInfo.cpp	Add loop pragma for Loop Distribution	2016-06-14 12:04:26 +00:00
CGLoopInfo.h	Add loop pragma for Loop Distribution	2016-06-14 12:04:26 +00:00
CGObjC.cpp	Remove CXXConstructExpr::getFoundDecl(); it turned out to not be useful.	2016-06-10 00:58:19 +00:00
CGObjCGNU.cpp	Reduce the number of implicit StringRef->std::string conversions by threading StringRef through more APIs.	2016-02-13 13:42:54 +00:00
CGObjCMac.cpp	CodeGen: convert some const char * to StringRef	2016-05-16 05:06:49 +00:00
CGObjCRuntime.cpp	Preserve ExtParameterInfos into CGFunctionInfo.	2016-03-11 04:30:31 +00:00
CGObjCRuntime.h	Reduce the number of implicit StringRef->std::string conversions by threading StringRef through more APIs.	2016-02-13 13:42:54 +00:00
CGOpenCLRuntime.cpp	[OpenCL] Move OpenCLImageTypes.def from clangAST to clangBasic library.	2016-04-13 08:33:41 +00:00
CGOpenCLRuntime.h	[OpenCL] Pipe type support	2016-01-09 12:53:17 +00:00
CGOpenMPRuntime.cpp	Remove a few gendered pronouns.	2016-06-10 18:53:04 +00:00
CGOpenMPRuntime.h	[OpenMP] Codegen for target update directive.	2016-05-26 18:30:22 +00:00
CGOpenMPRuntimeNVPTX.cpp	[OPENMP] Codegen for teams directive for NVPTX	2016-04-04 15:55:02 +00:00
CGOpenMPRuntimeNVPTX.h	[OPENMP] Codegen for teams directive for NVPTX	2016-04-04 15:55:02 +00:00
CGRecordLayout.h	Make CodeGen headers self-contained.	2016-02-02 16:05:18 +00:00
CGRecordLayoutBuilder.cpp	revert SVN r265702, r265640	2016-04-08 16:52:00 +00:00
CGStmt.cpp	[CUDA] Conservatively mark inline asm as convergent.	2016-05-31 21:27:13 +00:00
CGStmtOpenMP.cpp	[OPENMP 4.5] Fixed codegen for 'priority' and destructors in task-based	2016-05-30 09:06:50 +00:00
CGVTT.cpp	CodeGen: Use 32-bit gep offsets to address vtable address points.	2016-03-14 19:07:10 +00:00
CGVTables.cpp	Update clang for LLVM API change.	2016-05-10 20:23:29 +00:00
CGVTables.h	[CodeGen] Remove dead code. NFC.	2015-10-15 15:29:40 +00:00
CGValue.h	[Sema] PR26444 fix crash when alignment value is >= 2**16	2016-03-02 06:48:47 +00:00
CMakeLists.txt	Use the new path for coverage related headers and update CMakeLists.txt	2016-04-29 18:53:16 +00:00
CodeGenABITypes.cpp	Various improvements to the public IRGen interface.	2016-05-18 05:21:18 +00:00
CodeGenAction.cpp	Embed bitcode in object file (clang cc1 part)	2016-05-11 16:26:03 +00:00
CodeGenFunction.cpp	[DebugInfo] Add calling conventions to DISubroutineType	2016-06-08 20:41:54 +00:00
CodeGenFunction.h	[OpenMP] Parsing and sema support for target update directive	2016-05-26 17:30:50 +00:00
CodeGenModule.cpp	CodeGen: tweak CFString emission for COFF targets	2016-06-01 04:22:24 +00:00
CodeGenModule.h	Re-apply r267784, r267824 and r267830.	2016-04-28 17:09:37 +00:00
CodeGenPGO.cpp	Reapply^3 "[ProfileData] (clang) Use Error in InstrProf and Coverage, NFC"	2016-05-19 03:54:54 +00:00
CodeGenPGO.h	revert SVN r265702, r265640	2016-04-08 16:52:00 +00:00
CodeGenTBAA.cpp	revert SVN r265702, r265640	2016-04-08 16:52:00 +00:00
CodeGenTBAA.h	Make the remaining headers self-contained.	2016-02-02 14:24:21 +00:00
CodeGenTypeCache.h	Compute and preserve alignment more faithfully in IR-generation.	2015-09-08 08:05:57 +00:00
CodeGenTypes.cpp	Enable support for __float128 in Clang and enable it on pertinent platforms	2016-05-09 08:52:33 +00:00
CodeGenTypes.h	IRGen-level lowering for the Swift calling convention.	2016-04-04 18:33:08 +00:00
CoverageMappingGen.cpp	Reapply [Coverage] Fix an assertion failure if the definition of an unused function spans multiple files.	2016-06-07 10:07:51 +00:00
CoverageMappingGen.h	revert SVN r265702, r265640	2016-04-08 16:52:00 +00:00
EHScopeStack.h	[CodeGen] Emit lifetime.end intrinsic after objects are destructed in	2016-04-01 22:58:55 +00:00
ItaniumCXXABI.cpp	Introduce CGCXXABI::canCallMismatchedFunctionType	2016-05-10 17:44:55 +00:00
MicrosoftCXXABI.cpp	[MS ABI] Delegating constructors should not assume they are most derived	2016-05-13 20:05:09 +00:00
ModuleBuilder.cpp	Various improvements to the public IRGen interface.	2016-05-18 05:21:18 +00:00
ObjectFilePCHContainerOperations.cpp	Apply clang-tidy's misc-move-constructor-init throughout Clang.	2016-05-27 14:27:13 +00:00
README.txt	These IRgen improvements have been done.	2009-07-23 03:03:07 +00:00
SanitizerMetadata.cpp	[ASan] Initial support for Kernel AddressSanitizer	2015-06-19 12:19:07 +00:00
SanitizerMetadata.h	Removing LLVM_DELETED_FUNCTION, as MSVC 2012 was the last reason for requiring the macro. NFC; Clang edition.	2015-02-15 22:54:08 +00:00
SwiftCallingConv.cpp	Silencing a 32-bit shift implicit conversion warning from MSVC; NFC.	2016-04-08 12:21:58 +00:00
TargetInfo.cpp	[Sparc] Complex return value ABI compliance.	2016-06-08 14:47:25 +00:00
TargetInfo.h	IRGen-level lowering for the Swift calling convention.	2016-04-04 18:33:08 +00:00

README.txt

IRgen optimization opportunities.

//===---------------------------------------------------------------------===//

The common pattern of
--
short x; // or char, etc
(x == 10)
--
generates an zext/sext of x which can easily be avoided.

//===---------------------------------------------------------------------===//

Bitfields accesses can be shifted to simplify masking and sign
extension. For example, if the bitfield width is 8 and it is
appropriately aligned then is is a lot shorter to just load the char
directly.

//===---------------------------------------------------------------------===//

It may be worth avoiding creation of alloca's for formal arguments
for the common situation where the argument is never written to or has
its address taken. The idea would be to begin generating code by using
the argument directly and if its address is taken or it is stored to
then generate the alloca and patch up the existing code.

In theory, the same optimization could be a win for block local
variables as long as the declaration dominates all statements in the
block.

NOTE: The main case we care about this for is for -O0 -g compile time
performance, and in that scenario we will need to emit the alloca
anyway currently to emit proper debug info. So this is blocked by
being able to emit debug information which refers to an LLVM
temporary, not an alloca.

//===---------------------------------------------------------------------===//

We should try and avoid generating basic blocks which only contain
jumps. At -O0, this penalizes us all the way from IRgen (malloc &
instruction overhead), all the way down through code generation and
assembly time.

On 176.gcc:expr.ll, it looks like over 12% of basic blocks are just
direct branches!

//===---------------------------------------------------------------------===//