llvm-project

Commit Graph

Author	SHA1	Message	Date
Fangrui Song	ea47ccc78f	[BOLT] Fix after DebugInfoMetadata change `0ca43d4488`	2022-12-04 18:57:52 +00:00
Kazu Hirata	e324a80fab	[BOLT] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-02 23:12:38 -08:00
Nico Weber	e8ce5f1ec9	[bolt] Use llvm::sys::RWMutex instead of std::shared_timed_mutex This has the following advantages: - std::shared_timed_mutex is macOS 10.12+ only. llvm::sys::RWMutex automatically switches to a different implementation internally when targeting older macOS versions. - bolt only needs std::shared_mutex, not std::shared_timed_mutex. llvm::sys::RWMutex automatically uses std::shared_mutex internally where available. std::shared_mutex and RWMutex have the same API, so no code changes other than types and includes are needed. Differential Revision: https://reviews.llvm.org/D138423	2022-11-21 19:24:32 -05:00
Kazu Hirata	1fa870b1bd	Use None consistently (NFC) This patch replaces NoneType() and NoneType::None with None in preparation for migration from llvm::Optional to std::optional. In the std::optional world, we are not guranteed to be able to default-construct std::nullopt_t or peek what's inside it, so neither NoneType() nor NoneType::None has a corresponding expression in the std::optional world. Once we consistently use None, we should even be able to replace the contents of llvm/include/llvm/ADT/None.h with something like: using NoneType = std::nullopt_t; inline constexpr std::nullopt_t None = std::nullopt; to ease the migration from llvm::Optional to std::optional. Differential Revision: https://reviews.llvm.org/D138376	2022-11-20 00:24:40 -08:00
Nico Weber	f65e8c3c51	[bolt] Fix std::prev()-past-begin in veneer handling code matchLinkerVeneer() returns 3 if `Instruction` and the last two instructions in `[Instructions.begin, Instructions.end())` match the pattern ADRP x16, imm ADD x16, x16, imm BR x16 BinaryContext.cpp used to use --Count; for (auto It = std::prev(Instructions.end()); Count != 0; It = std::prev(It), --Count) { ...use It... } to walk these instructions. The first `--Count` skips the instruction that's in `Instruction` instead of in `Instructions`. The loop then walks over `Instructions`. However, on the last iteration, this calls `std::prev()` on an iterator that points at the container's begin(), which can blow up. Instead, use rbegin(), which sidesteps this issue. Fixes test/AArch64/veneer-gold.s on a macOS host. With this, check-bolt passes on macOS. Differential Revision: https://reviews.llvm.org/D138313	2022-11-18 14:42:08 -05:00
revunov.denis@huawei.com	c92ff2a3c4	[BOLT][NFC] Fix possible use-after-free If NewName twine has reference to the old name, then after Section.Name = NewName.str(); this reference is invalidated, so we cannot use NewName.str() anymore. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D137616	2022-11-14 13:30:22 +00:00
Maksim Panchenko	bcc4c90954	[BOLT] Fix instruction encoding validation Always use non-symbolizing disassembler for instruction encoding validation as symbols will be treated as undefined/zeros be the encoder and causing byte sequence mismatches. Reviewed By: Amir Differential Revision: https://reviews.llvm.org/D136118	2022-10-18 13:50:00 -07:00
Maksim Panchenko	4d3a0cade2	[BOLT] Section-handling refactoring/overhaul Simplify the logic of handling sections in BOLT. This change brings more direct and predictable mapping of BinarySection instances to sections in the input and output files. * Only sections from the input binary will have a non-null SectionRef. When a new section is created as a copy of the input section, its SectionRef is reset to null. * RewriteInstance::getOutputSectionName() is removed as the section name in the output file is now defined by BinarySection::getOutputName(). * Querying BinaryContext for sections by name uses their original name. E.g., getUniqueSectionByName(".rodata") will return the original section even if the new .rodata section was created. * Input file sections (with relocations applied) are emitted via MC with ".bolt.org" prefix. However, their name in the output binary is unchanged unless a new section with the same name is created. * New sections are emitted internally with ".bolt.new" prefix if there's a name conflict with an input file section. Their original name is preserved in the output file. * Section header string table is properly populated with section names that are actually used. Previously we used to include discarded section names as well. * Fix the problem when dynamic relocations were propagated to a new section with a name that matched a section in the input binary. E.g., the new .rodata with jump tables had dynamic relocations from the original .rodata. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D135494	2022-10-13 23:10:39 -07:00
Rafael Auler	8d1fc45dc3	[BOLT][NFC] Refactor creation of symbol+addend references Put code that creates references to symbol+addend behind MCPlusBuilder. Will use this later in validate memory references pass. Reviewed By: #bolt, maksfb, yota9 Differential Revision: https://reviews.llvm.org/D134097	2022-10-12 18:39:26 -07:00
revunov.denis@huawei.com	553c238952	[BOLT] Preserve original LSDA type encoding In non-pie binaries BOLT unconditionally converted type encoding from indirect to absptr, which broke std exceptions since pointers to their typeinfo were only assigned at runtime in .data section. In this patch we preserve original encoding so that indirect remains indirect and can be resolved at runtime, and absolute remains absolute. Reviewed By: rafauler, maksfb Differential Revision: https://reviews.llvm.org/D132484	2022-09-14 16:33:47 +00:00
Denis Revunov	6040415ef9	[BOLT][AArch64] Handle references to the middle of Constant Islands Fix BinaryContext::handleAddressRef to properly detect references to other function's Constant islands. Revieved By: rafauler, yota9 Differential Revision: https://reviews.llvm.org/D132376	2022-08-25 04:32:35 -04:00
Fabian Parzefall	07f63b0ac5	[BOLT] Allocate FunctionFragment on heap This changes `FunctionFragment` from being used as a temporary proxy object to access basic block ranges to a heap-allocated object that can store fragment-specific information. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D132050	2022-08-24 18:06:08 -07:00
Fabian Parzefall	5065134aa0	Revert "[BOLT] Allocate FunctionFragment on heap" This reverts commit `101344af1a`.	2022-08-24 10:51:36 -07:00
Fabian Parzefall	101344af1a	[BOLT] Allocate FunctionFragment on heap This changes `FunctionFragment` from being used as a temporary proxy object to access basic block ranges to a heap-allocated object that can store fragment-specific information. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D132050	2022-08-24 10:17:17 -07:00
Kazu Hirata	258531b7ac	Remove redundant initialization of Optional (NFC)	2022-08-20 21:18:28 -07:00
Fabian Parzefall	0f74d191d1	[BOLT] Generate sections for multiple fragments This patch adds support to generate any number of sections that are assigned to fragments of functions that are split more than two-way. With this, a function's nth split fragment goes into section `.text.cold.n`. This also changes `FunctionLayout::erase` to make sure, that there are no empty fragments at the end of the function. This sometimes happens when blocks are erased from the function. To avoid creating symbols pointing to these fragments, they need to be removed. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D130521	2022-08-18 21:55:06 -07:00
Fabian Parzefall	275e075cbe	[BOLT] Support passing fragments to code emission This changes code emission such that it can emit specific function fragments instead of scanning all basic blocks of a function and just emitting those that are hot or cold. To implement this, `FunctionLayout` explicitly distinguishes the "main" fragment (i.e. the one that contains the entry block and is associated with the original symbol) from "split" fragments. Additionally, `BinaryFunction` receives support for multiple cold symbols - one for each split fragment. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D130052	2022-08-18 21:55:06 -07:00
Amir Ayupov	055f9f6d08	[BOLT][NFC] Simplify debug logging in case of JT heuristic failure Move logging into LLVM_DEBUG scope. Remove redundant printing of jump table parents: Old logging: ``` failed to analyze jump table in function _ZN12_GLOBAL__N_116InitHeaderSearch23Ad dDefaultCIncludePathsERKN4llvm6TripleERKN5clang19HeaderSearchOptionsE/1(2) PIC Jump table JUMP_TABLE/_ZN12_GLOBAL__N_116InitHeaderSearch23AddDefaultCInclud ePathsERKN4llvm6TripleERKN5clang19HeaderSearchOptionsE/1.1 for function _ZN12_GL OBAL__N_116InitHeaderSearch23AddDefaultCIncludePathsERKN4llvm6TripleERKN5clang19 HeaderSearchOptionsE/1(2) at 0x65996e0 with a total count of 0: 0x9dc next jump table at 0x659a810 belongs to function _ZN5clang5Lexer40LexDependencyD irectiveTokenWhileSkippingERNS_5TokenE PIC Jump table JUMP_TABLE/_ZN5clang5Lexer40LexDependencyDirectiveTokenWhileSkipp ingERNS_5TokenE.0 for function _ZN5clang5Lexer40LexDependencyDirectiveTokenWhile SkippingERNS_5TokenE at 0x659a810 with a total count of 0: jump table heuristic failure ``` New logging: ``` failed to analyze PIC Jump table JUMP_TABLE/_ZN12_GLOBAL__N_116InitHeaderSearch2 3AddDefaultCIncludePathsERKN4llvm6TripleERKN5clang19HeaderSearchOptionsE/1.1 for function _ZN12_GLOBAL__N_116InitHeaderSearch23AddDefaultCIncludePathsERKN4llvm6T ripleERKN5clang19HeaderSearchOptionsE/1(*2) at 0x65996e0 with a total count of 0: absolute offset: 0x52ac58c next PIC Jump table JUMP_TABLE/_ZN5clang5Lexer40LexDependencyDirectiveTokenWhile SkippingERNS_5TokenE.0 for function _ZN5clang5Lexer40LexDependencyDirectiveToken WhileSkippingERNS_5TokenE at 0x659a810 with a total count of 0: jump table heuristic failure ``` Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D131243	2022-08-17 17:35:16 -07:00
Amir Ayupov	556efdba85	[BOLT][NFC] Extend debug logging in analyzeJumpTable Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D131918	2022-08-15 20:34:40 -07:00
Thorsten Schütt	0c9258612b	[bolt] silence unused variables warnings	2022-08-06 20:52:45 +02:00
David Blaikie	7651522b78	Fold assert-used variable into assert Fixes #56724	2022-08-01 21:57:11 +00:00
Amir Ayupov	468d4f6d18	Revert "[BOLT] Ignore functions accessing false positive jump tables" This diff uncovers an ASAN leak in getOrCreateJumpTable: ``` Indirect leak of 264 byte(s) in 1 object(s) allocated from: #1 0x4f6e48c in llvm::bolt::BinaryContext::getOrCreateJumpTable ... ``` The removal of an assertion needs to be accompanied by proper deallocation of a `JumpTable` object for which `analyzeJumpTable` was unsuccessful. This reverts commit `52cd00cabf`.	2022-07-30 10:39:46 -07:00
Huan Nguyen	52cd00cabf	[BOLT] Ignore functions accessing false positive jump tables Disassembly and branch target analysis are not decoupled, so any analysis that depends on disassembly may not operate properly. In specific, analyzeJumpTable uses instruction bounds check property. A jump table was analyzed twice: (a) during disassembly, and (b) after disassembly, so there are potentially some mismatched results. In this update, functions that access JTs which fail the second check will be marked as ignored. Test Plan: ``` ninja check-bolt ``` Reviewed By: Amir Differential Revision: https://reviews.llvm.org/D130431	2022-07-28 23:22:17 -07:00
Huan Nguyen	05523dc32d	[BOLT] Support multiple parents for split jump table There are two assumptions regarding jump table: (a) It is accessed by only one fragment, say, Parent (b) All entries target instructions in Parent For (a), BOLT stores jump table entries as relative offset to Parent. For (b), BOLT treats jump table entries target somewhere out of Parent as INVALID_OFFSET, including fragment of same split function. In this update, we extend (a) and (b) to include fragment of same split functinon. For (a), we store jump table entries in absolute offset instead. In addition, jump table will store all fragments that access it. A fragment uses this information to only create label for jump table entries that target to that fragment. For (b), using absolute offset allows jump table entries to target fragments of same split function, i.e., extend support for split jump table. This can be done using relocation (fragment start/size) and fragment detection heuristics (e.g., using symbol name pattern for non-stripped binaries). For jump table targets that can only be reached by one fragment, we mark them as local label; otherwise, they would be the secondary function entry to the target fragment. Test Plan ``` ninja check-bolt ``` Reviewed By: Amir Differential Revision: https://reviews.llvm.org/D128474	2022-07-13 23:37:31 -07:00
Vladislav Khmelevsky	35efe1d806	[BOLT][AArch64] Handle gold linker veneers The gold linker veneers are written between functions without symbols, so we to handle it specially in BOLT. Vladislav Khmelevsky, Advanced Software Technology Lab, Huawei Differential Revision: https://reviews.llvm.org/D129260	2022-07-13 14:47:22 +03:00
Denis Revunov	7564167885	[BOLT][AArch64] Use all supported CPU features on AArch64 Since we now have +all feature for AArch64 disassembler, we can use it in BOLT and allow it to disassemble all ARM instructions supported by LLVM. Reviewed by: rafauler Differential Revision: https://reviews.llvm.org/D129139	2022-07-12 03:56:04 -04:00
Alexander Yermolovich	e159abdb04	[BOLT][DWARF] Support mix mode DWARF Added support for mixing monolithic DWARF5 with legacy DWARF, and monolithic legacy and DWARF5 split dwarf. Reviewed By: maksfb Differential Revision: https://reviews.llvm.org/D128232	2022-06-30 16:53:15 -07:00
Rafael Auler	fc2d96c334	Revert "[BOLT][AArch64] Handle gold linker veneers" This reverts commit `425dda76e9`. This commit is currently causing BOLT to crash in one of our binaries and needs a bit more checking to make sure it is safe to land.	2022-06-28 19:23:28 -07:00
Vladislav Khmelevsky	425dda76e9	[BOLT][AArch64] Handle gold linker veneers The gold linker veneers are written between functions without symbols, so we to handle it specially in BOLT. Vladislav Khmelevsky, Advanced Software Technology Lab, Huawei Differential Revision: https://reviews.llvm.org/D128082	2022-06-28 16:14:05 +03:00
Amir Ayupov	d2c8769936	[BOLT][NFC] Use range-based STL wrappers Replace `std::` algorithms taking begin/end iterators with `llvm::` counterparts accepting ranges. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D128154	2022-06-23 22:16:27 -07:00
Huan Nguyen	28b1dcb122	[BOLT] Allow function fragments to point to one jump table Resolve a crash related to split functions Due to split function optimization, a function can be divided to two  fragments, and both fragments can access same jump table. This violates  the assumption that a jump table can only have one parent function,  which causes a crash during instrumentation. We want to support the case: different functions cannot access same jump tables, but different fragments of same function can! As all fragments are from same function, we point JT::Parent to one specific fragment. Right now it is the first disassembled fragment, but we can point it to the function's main fragment later. Functions are disassembled sequentially. Previously, at the end of processing a function, JT::OffsetEntries is cleared, so other fragment can no longer reuse JT::OffsetEntries. To extend the support for split function, we only clear JT::OffsetEntries after all functions are disassembled. Let say A.hot and A.cold access JT of three targets {X, Y, Z}, where X and Y are in A.hot, and Z is in A.cold. Suppose that A.hot is disassembled first, JT::OffsetEntries = {X',Y',INVALID_OFFSET}. When A.cold is disassembled, it cannot reuse JT::OffsetEntries above due to different fragment start. A simple solution: A.hot = {X',Y',INVALID_OFFSET} A.cold = {INVALID_OFFSET, INVALID_OFFSET, INVALID_OFFSET} We update the assertion to allow different fragments of same function to get the same JumpTable object. Potential improvements: A.hot = {X',Y',INVALID_OFFSET} A.cold = {INVALID_OFFSET, INVALID_OFFSET, Z'} The main issue is A.hot and A.cold have separate CFGs, thus jump table targets are still constrained within fragment bounds. Future improvements: A.hot = {X, Y, Z} A.cold = {X, Y, Z} Reviewed By: Amir Differential Revision: https://reviews.llvm.org/D127924	2022-06-17 16:22:30 -07:00
Amir Ayupov	7dee646b28	[BOLT][NFC] Move printDebugInfo out of BC::printInstruction Simplify `BinaryContext::printInstruction`. Reviewed By: ayermolo Differential Revision: https://reviews.llvm.org/D127561	2022-06-11 11:58:36 -07:00
Fangrui Song	adf4142f76	[MC] De-capitalize SwitchSection. NFC Add SwitchSection to return switchSection. The API will be removed soon.	2022-06-10 22:50:55 -07:00
Huan Nguyen	82095bd5ed	[BOLT] Mark fragments related to split jump table as non-simple Mark fragments related to split jump table as non-simple. A function could be splitted into hot and cold fragments. A split jump table is challenging for correctly reconstructing control flow graphs, so it was marked as ignored. This update marks those fragments as non-simple, allowing them to be printed and partial control flow graph construction. Test Plan: ``` llvm-lit -a tools/bolt/test/X86/split-func-icf.s ``` This test has two functions (main, main2), each has a jump table target to the same cold portion main2.cold.1(*2). We try to print out only this cold portion. If it is ignored, it cannot be printed. If it is non-simple, it can be printed. We verify that it can be printed. Reviewed By: Amir Differential Revision: https://reviews.llvm.org/D127464	2022-06-10 15:49:32 -07:00
Fangrui Song	b92436efcb	[bolt] Remove unneeded cl::ZeroOrMore for cl::opt options	2022-06-05 13:29:49 -07:00
Maksim Panchenko	e290133c76	[BOLT] Add new class for symbolizing X86 instructions Summary: While disassembling instructions, we need to replace certain immediate operands with symbols. This symbolizing process relies on reading relocations against instructions. However, some X86 instructions can have multiple immediate operands and up to two relocations against them. Thus, correctly matching a relocation to an operand is not always possible without knowing the operand offset within the instruction. Luckily, LLVM provides an interface for passing the required info from the disassembler via a virtual MCSymbolizer class. Creating a target-specific version allows a precise matching of relocations to operands. This diff adds X86MCSymbolizer class that performs X86-specific symbolizing (currently limited to non-branch instructions). Reviewers: yota9, Amir, ayermolo, rafauler, zr33 Differential Revision: https://reviews.llvm.org/D120928	2022-05-31 17:48:19 -07:00
Denis Revunov	8579db96e8	[BOLT] [AArch64] Handle constant islands spanning multiple functions Fix BOLT's constant island mapping when a constant island marked by $d spans multiple functions. Currently, because BOLT only marks the constant island in the first function where $d is located, if the next function contains data at its start, BOLT will miss the data and try to disassemble it. This patch adds code to explicitly go through all symbols between $d and $x markers and mark their respective offsets as data, which stops BOLT from trying to disassemble data. It also adds MarkerType enum and refactors related functions. Reviewed By: yota9, rafauler Differential Revision: https://reviews.llvm.org/D126177	2022-05-31 13:51:35 -07:00
Amir Ayupov	69f87b6c29	[BOLT][NFC] Customize endline character for printInstruction(s) This would be used in `BF::dumpGraph` to dump left-justified text. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D126232	2022-05-24 18:26:12 -07:00
Amir Ayupov	c907d6e0e9	[BOLT][NFC] Suppress unused variable warnings Addresses the warnings emitted by Apple Clang 13.1.6 (Xcode 13.3.1). Tip @tschuett issue #55404. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D125733	2022-05-17 14:30:23 -07:00
Alexander Yermolovich	ba1ac98c62	[BOLT][DWARF] Add version 5 split dwarf support Added support for DWARF5 Split Dwarf. Reviewed By: maksfb Differential Revision: https://reviews.llvm.org/D122988	2022-05-05 14:59:05 -07:00
Alexander Yermolovich	014cd37f51	[BOLT][DWARF] Implement monolithic DWARF5 Added implementation to support DWARF5 in monolithic mode. Next step DWARF5 split dwarf support. Reviewed By: maksfb Differential Revision: https://reviews.llvm.org/D121876	2022-04-21 16:02:23 -07:00
Vladislav Khmelevsky	63686af1e1	[BOLT] Fix build with GCC 7.3.0 The gcc 7.3.0 version raises "could not covert" error without std::move used explicitly. Differential Revision: https://reviews.llvm.org/D124009	2022-04-21 13:47:58 +03:00
Maksim Panchenko	77b75ca53f	[BOLT][perf2bolt] Fix base address calculation for shared objects When processing profile data for shared object or PIE, perf2bolt needs to calculate base address of the binary based on the map info reported by the perf tool. When the mapping data provided is for the second (or any other than the first) segment and the segment's file offset does not match its memory offset, perf2bolt uses wrong assumption about the binary base address. Add a function to calculate binary base address using the reported memory mapping and use the returned base for further address adjustments. Reviewed By: yota9 Differential Revision: https://reviews.llvm.org/D123755	2022-04-14 10:29:53 -07:00
Vladislav Khmelevsky	4c14519ecb	[BOLT] LongJmp: Check for shouldEmit Check that the function will be emitted in the final binary. Preserving old function address is needed in case it is PLT trampiline, that is currently not moved by the BOLT. Differential Revision: https://reviews.llvm.org/D122098	2022-03-31 22:33:09 +03:00
Amir Ayupov	c31af7cfe3	[MC][BOLT] Add setter for AllowAtInName Use the setter in BOLT to allow printing names with variant kind in the name (e.g. "func@PLT"). Fixes BOLT buildbot tests that broke after D122516: https://lab.llvm.org/buildbot/#/builders/215/builds/3595 Reviewed By: maksfb Differential Revision: https://reviews.llvm.org/D122694	2022-03-30 13:04:28 -07:00
Elvina Yakubova	db65429db5	[BOLT] Divide RegularPageSize for X86 and AArch64 cases For AArch64 in some cases/some distributions ld uses 64K alignment of LOAD segments by default. Reviewed By: yota9, maksfb Differential Revision: https://reviews.llvm.org/D119267	2022-03-10 23:09:50 +03:00
Amir Ayupov	32d2473a5d	[BOLT][NFC] Report errors from createBinaryContext and RewriteInstance ctor Refactor createBinaryContext and RewriteInstance/MachORewriteInstance constructors to report an error in a library and fuzzer-friendly way instead of returning a nullptr or exiting. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D119658	2022-02-17 00:50:52 -08:00
Shao-Ce SUN	2aed07e96c	[NFC][MC] remove unused argument `MCRegisterInfo` in `MCCodeEmitter` Reviewed By: skan Differential Revision: https://reviews.llvm.org/D119846	2022-02-16 13:10:09 +08:00
Shao-Ce SUN	9cc49c1951	Revert "[NFC][MC] remove unused argument `MCRegisterInfo` in `MCCodeEmitter`" This reverts commit `fe25c06cc5`.	2022-02-16 11:57:49 +08:00
Shao-Ce SUN	fe25c06cc5	[NFC][MC] remove unused argument `MCRegisterInfo` in `MCCodeEmitter` For ten years, it seems that `MCRegisterInfo` is not used by any target. Reviewed By: skan Differential Revision: https://reviews.llvm.org/D119846	2022-02-16 11:47:17 +08:00

1 2

60 Commits