llvm-project

Commit Graph

Author	SHA1	Message	Date
Kazu Hirata	9f252e5567	[llvm] Use std::nullopt instead of None in comments (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-04 17:31:17 -08:00
Kazu Hirata	3c09ed006a	[llvm] Use std::nullopt instead of None in comments (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-04 17:12:44 -08:00
Fangrui Song	f4c16c4473	[MC] llvm::Optional => std::optional https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-04 21:36:08 +00:00
Fangrui Song	c302fb5cc3	[Object] llvm::Optional => std::optional	2022-12-04 09:11:11 +00:00
Kazu Hirata	b4482f7ca0	[tools] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-02 21:11:40 -08:00
Kazu Hirata	0428cf9ef1	[llvm-objdump] Use std::optional in llvm-objdump.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-26 18:58:05 -08:00
Brad Smith	d06b8b4ae3	[llvm-objdump] Left justify display of OpenBSD headers This looks the same as OpenBSD's GNU objdump.	2022-11-19 04:41:15 -05:00
Brad Smith	96c037ef9c	[llvm] - Recognizing 'PT_OPENBSD_MUTABLE' segment type. Recognizing 'PT_OPENBSD_MUTABLE' segment type. `bd249b5664` Reviewed By: jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D137903	2022-11-18 20:42:10 -05:00
Joseph Huber	8298f0b7b9	[Binary] Support extracting offloading files from COFF This patch adds initial support for extracting offloading binaries from `COFF` objects. This is a first step to allow building offloading files on Windows targets with the new driver. Depends on D136796 Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D136855	2022-11-03 16:19:14 -05:00
Joseph Huber	3384f05a2c	[llvm-objdump][Offload] Use common offload extraction method A previous patch introduced a common function used to extract offloading binaries from an image. Therefore we no longer need to duplicate the functionality in the `llvm-objdump` implementation. Functionally, this removes the old warning behaviour when given malformed input. This has been changed to a hard error, which is effectively the same. This required a slight tweak in the linker wrapper to filter out the user passing shared objects directly. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D136796	2022-11-03 16:19:13 -05:00
Daniel Thornburgh	cc2457ca1b	[llvm-objdump] Set --print-imm-hex by default. This was previously attempted in 2016 by colinl's D18770, but LLD tests were missed, which caused the change to be reverted. Setting --print-imm-hex by default brings llvm-objdump's behavior closer in line with objdump, and it makes it easier to read addresses and alignment from the disassembly. It may make non-address immediates harder to interpret, but it still seems the better default, barring more context-sensitive base selection logic. Differential Revision: https://reviews.llvm.org/D136972	2022-10-30 13:36:18 -07:00
Keith Smiley	bc99fd95e0	[llvm-objdump/mac] Add new function starts print mode This updates the `--function-starts` argument to now accept 3 different modes, `addrs` for just printing the addresses of the function starts (previous behavior), `names` for just printing the names of the function starts, and `both` to print them both side by side. In general if you're debugging function starts issues it's useful to see the symbol name alongside the address. This also mirrors Apple's `dyldinfo -function_starts` command which prints both. Differential Revision: https://reviews.llvm.org/D119050	2022-10-14 15:42:18 -07:00
Fangrui Song	71bad24b0d	[llvm-objdump] Add HelpText for --no-addresses after D135040	2022-10-13 12:50:15 -07:00
Pierre van Houtryve	02b5d3bc3b	[llvm-objdump] Support nonzero section addresses in addSymbolizer The previous calculations seem to have assumed that the section address would be zero. This is true for relocatable object files, but certainly not for linked files like shared libraries. Fixed the calculations to make them identical to the "real" `getInstruction` call below & added a regression test. Reviewed By: scott.linder, simon_tatham Differential Revision: https://reviews.llvm.org/D135430	2022-10-12 10:44:03 +00:00
Shubham Sandeep Rastogi	f491b898c5	Revert "Remove the dependency between lib/DebugInfoDWARF and MC." This reverts commit `d96ade00c3`.	2022-10-06 14:58:34 -07:00
Shubham Sandeep Rastogi	d96ade00c3	Remove the dependency between lib/DebugInfoDWARF and MC. This patch had to be reverted because on gcc 7.5.0 we see an error converting from std::unique_ptr<MCRegisterInfo> to Expected<std::unique_ptr<MCRegisterInfo>> as the return type for the function createRegInfo. This has now been fixed.	2022-10-06 14:46:01 -07:00
Shubham Sandeep Rastogi	870b74d590	Revert "Remove the dependency between lib/DebugInfoDWARF and MC." This reverts commit `0008990479`.	2022-10-06 09:30:46 -07:00
Shubham Sandeep Rastogi	0008990479	Remove the dependency between lib/DebugInfoDWARF and MC. Differential Revision: https://reviews.llvm.org/D134817	2022-10-06 09:25:57 -07:00
Daniel Thornburgh	2e91a5f546	[llvm-objdump] Add --build-id flag for debuginfod lookups without binary. Adding a --build-id flag allows handling binaries that are referenced in logs from remote systems, but that aren't necessarily present on the local machine. These are fetched via debuginfod and handled as if they were input filenames. Reviewed By: jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D133992	2022-10-04 13:44:25 -07:00
Daniel Thornburgh	410c6ca9a4	[llvm-objdump] [debuginfod] Fetch for very-stripped binaries. When a binary is missing section headers or symbols, objdump can't provide as good of a disassembly. This change makes objdump try to fetch a better verion of the binary by its build ID. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D132887	2022-10-04 11:40:36 -07:00
Jim Radford	ffecb643ee	[objdump] Support finding --source via --dsym files Add support for auto-detecting or specifying dSYM files/directories to allow interleaving source with disassembly. Differential Revision: https://reviews.llvm.org/D135117 Patch by Jim Radford.	2022-10-04 11:15:51 -07:00
Fangrui Song	5c7566cd05	[llvm-objdump] Add --no-addresses as an alias for --no-leading-addr The output is similar to objdump --no-addresses since binutils 2.35. Depends on D135039 Close #58088 Differential Revision: https://reviews.llvm.org/D135040	2022-10-04 10:01:30 -07:00
Fangrui Song	ad92a3db2e	[llvm-objdump] --no-leading-addr: hide inline relocation offsets It seems to make sense to omit offsets when --no-leading-addr is specified. The output is now closer to objdump -dr --no-addresses (non-wide output). Reviewed By: nickdesaulniers Differential Revision: https://reviews.llvm.org/D135039	2022-10-04 10:00:21 -07:00
Daniel Thornburgh	d033ece0c9	[llvm-objdump] Find debug information with Build ID/debuginfod. Uses the library introduced in https://reviews.llvm.org/D132504 to add build ID fetching to llvm-objdump. This allows viewing source when disassembling stripped objects. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D131224	2022-10-03 16:17:45 -07:00
Fangrui Song	25394c9d10	[llvm-objdump] Change printSymbolVersionDependency to use ELFFile API When .gnu.version_r is empty (allowed by readelf but warned by objdump), llvm-objdump -p may decode the next section as .gnu.version_r and may crash due to out-of-bounds C string reference. ELFFile<ELFT>::getVersionDependencies handles 0-entry .gnu.version_r gracefully. Just use it. Fix https://github.com/llvm/llvm-project/issues/57707 Differential Revision: https://reviews.llvm.org/D133751	2022-09-14 12:30:34 -07:00
Namhyung Kim	43efb5e445	[llvm-objdump] Create name for fake sections It doesn't have a section header string table so add a vector to have the strings and create name based on the program header type and the index. Differential Revision: https://reviews.llvm.org/D131290	2022-09-09 12:27:07 +01:00
Joe Loser	5e96cea1db	[llvm] Use std::size instead of llvm::array_lengthof LLVM contains a helpful function for getting the size of a C-style array: `llvm::array_lengthof`. This is useful prior to C++17, but not as helpful for C++17 or later: `std::size` already has support for C-style arrays. Change call sites to use `std::size` instead. Differential Revision: https://reviews.llvm.org/D133429	2022-09-08 09:01:53 -06:00
Arpad Borsos	de3633e746	[llvm-objdump][COFF] Correctly decode `UOP_Epilog` opcodes At least `ntdll` is using the undocumented version 2 unwind info, and opcode 6, which is already defined as `UOP_Epilog`. Using `llvm-objdump --unwind` with `ntdll` would previously result in unreachable assertions because this code was missing from `getNumUsedSlots` and `getUnwindCodeTypeName`. The slots of these codes comes from `57bfe47451/src/coreclr/inc/win64unwind.h (L51-L52)` which I would assume is a good authoritative source. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D107655	2022-09-01 14:05:14 -07:00
Daniel Bertalan	f7b752d277	[lld-macho] Set the SG_READ_ONLY flag on __DATA_CONST This flag instructs dyld to make the segment read-only after fixups have been performed. I'm not sure why this flag is needed, as on macOS 13 beta at least, __DATA_CONST is read-only even without this flag; but ld64 sets it as well. Differential Revision: https://reviews.llvm.org/D133010	2022-08-31 17:04:20 +02:00
Daniel Bertalan	389e0a81a1	[lld-macho] Support synthesizing __TEXT,__init_offsets This section stores 32-bit `__TEXT` segment offsets of initializer functions, and is used instead of `__mod_init_func` when chained fixups are enabled. Storing the offsets lets us avoid emitting fixups for the initializers. Differential Revision: https://reviews.llvm.org/D132947	2022-08-31 10:13:45 +02:00
Nico Weber	b9d63e10b9	[llvm-otool] Print dyld_info output before chained_fixup output This matches otool. Differential Revision: https://reviews.llvm.org/D132865	2022-08-30 11:03:30 -04:00
Daniel Bertalan	47e4663c4e	[llvm-objdump] Add -dyld_info to llvm-otool This option outputs the location, encoded value and target of chained fixups, using the same format as `otool -dyld_info`. This initial implementation only supports the DYLD_CHAINED_PTR_64 and DYLD_CHAINED_PTR_64_OFFSET pointer encodings, which are used in x86_64 and arm64 userspace binaries. When Apple's effort to upstream their chained fixups code continues, we'll replace this code with the then-upstreamed code. But we need something in the meantime for testing ld64.lld's chained fixups code. Differential Revision: https://reviews.llvm.org/D132036	2022-08-28 09:22:41 +02:00
Peter Cooper	6113998069	Add MachO MH_FILESET support to objdump https://reviews.llvm.org/D131909	2022-08-24 13:34:43 -07:00
Daniel Bertalan	686d8ce1ab	[llvm-objdump] Complete -chained_fixups support This commit adds definitions for the `dyld_chained_import*` structs. The imports array is now printed with `llvm-otool -chained_fixups`. This completes this option's implementation. A slight difference from cctools otool is that we don't yet dump the raw bytes of the imports entries. When Apple's effort to upstream their chained fixups code continues, we'll replace this code with the then-upstreamed code. But we need something in the meantime for testing ld64.lld's chained fixups code. Differential Revision: https://reviews.llvm.org/D131982	2022-08-24 19:29:11 +02:00
Simon Tatham	8e29f3f1c3	[llvm-objdump] Handle multiple syms at same addr in disassembly. The main disassembly loop in llvm-objdump works by iterating through the symbols in a code section, and for each one, dumping the range of the section from that symbol to the next. If there's another symbol defined at the same location, then that range will have length 0, and llvm-objdump will skip over the symbol entirely. As a result, llvm-objdump will only show the last of the symbols defined at that address. Not only that, but the other symbols won't even be checked against the `--disassemble-symbol` list. So if you have two symbols `foo` and `bar` defined in the same place, then one of `--disassemble-symbol=foo` and `--disassemble-symbol=bar` will generate an error message and no disassembly. I think a better approach in that situation is to prioritise display of the symbol the user actually asked for. Also, if the user specifically asks for disassembly of //both// of two symbols defined at the same address, the best response I can think of is to disassemble the code once, preceded by both symbol names. This involves teaching llvm-objdump to be able to display more than one symbol name at the head of a disassembled section, which also makes it possible to implement a `--show-all-symbols` option to display //every// symbol defined in the code, not just the most preferred one at each address. This change also turns out to fix a bug in which `--disassemble-all` on a mixed Arm/Thumb ELF file would fail to switch disassembly states between Arm and Thumb functions, because the mapping symbols were accidentally ignored. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D131589	2022-08-24 15:08:12 +01:00
Kazu Hirata	258531b7ac	Remove redundant initialization of Optional (NFC)	2022-08-20 21:18:28 -07:00
Daniel Bertalan	11443ef85d	[llvm-objdump] Support dumping segment information with -chained_fixups This commit adds the definitions for `dyld_chained_starts_in_image`, `dyld_chained_starts_in_segment`, and related enums. Dumping their contents is possible with the -chained_fixups flag of llvm-otool. The chained-fixups.yaml test was changed to cover bindings/rebases, as well as weak imports, weak symbols and flat namespace symbols. Now that we have actual fixup entries, the __DATA segment contains data that would need to be hexdumped in YAML. We also test empty pages (to look for the "DYLD_CHAINED_PTR_START_NONE" annotation), so the YAML would end up quite large. So instead, this commit includes a binary file. When Apple's effort to upstream their chained fixups code continues, we'll replace this code with the then-upstreamed code. But we need something in the meantime for testing ld64.lld's chained fixups code. Differential Revision: https://reviews.llvm.org/D131961	2022-08-18 09:29:27 +02:00
Nico Weber	1642667392	[llvm-objdump --macho] Rename --dyld_info to --dyld-info llvm-objdump takes foo-bar style flags, while llvm-otool takes foo_bar style flags. dyld_info was the only exception to that. Add a -dyld_info flag to llvm-otool instead. (Both in llvm-objdump and llvm-otool, the flag doesn't really do anything yet.) Differential Revision: https://reviews.llvm.org/D131897	2022-08-17 12:58:29 -04:00
Nico Weber	940e178c00	[llvm-objdump] Start on -chained_fixups for llvm-otool And --chained-fixups for llvm-objdump. For now, this only prints the dyld_chained_fixups_header and adds plumbing for the flag. This will be expanded in future commits. When Apple's effort to upstream their chained fixups code continues, we'll replace this code with the then-upstreamed code. But we need something in the meantime for testing ld64.lld's chained fixups code. Update chained-fixups.yaml with a file that actually contains the chained fixup data (`LinkEditData` doesn't encode it yet, so use `__LINKEDIT` via `--raw-segment=data`). Differential Revision: https://reviews.llvm.org/D131890	2022-08-15 10:58:52 -04:00
Simon Tatham	72017e9b16	[llvm-objdump,ARM] Fix big-endian AArch32 disassembly. The ABI for big-endian AArch32, as specified by AAELF32, is above- averagely complicated. Relocatable object files are expected to store instruction encodings in byte order matching the ELF file's endianness (so, big-endian for a BE ELF file). But executable images can //either// do that //or// store instructions little-endian regardless of data and ELF endianness (to support BE32 and BE8 platforms respectively). They signal the latter by setting the EF_ARM_BE8 flag in the ELF header. (In the case of the Thumb instruction set, this all means that each 16-bit halfword of a Thumb instruction is stored in one or other endianness. The two halfwords of a 32-bit Thumb instruction must appear in the same order no matter what, because the first halfword is the one that must avoid overlapping the encoding of any 16-bit Thumb instruction.) llvm-objdump was unconditionally expecting Arm instructions to be stored little-endian. So it would correctly disassemble a BE8 image, but if you gave it a BE32 image or a BE object file, it would retrieve every instruction in byte-swapped form and disassemble it to nonsense. (Even an object file output by LLVM itself, because ARMMCCodeEmitter outputs instructions big-endian in big-endian mode, which is correct for writing an object file.) This patch allows llvm-objdump to correctly disassemble all three of those classes of Arm ELF file. It does it by introducing a new SubtargetFeature for big-endian instructions, setting it from the ELF image type and flags during llvm-objdump setup, and teaching both ARMDisassembler and llvm-objdump itself to pay attention to it when retrieving instruction data from a section being disassembled. Differential Revision: https://reviews.llvm.org/D130902	2022-08-08 10:49:51 +01:00
Simon Tatham	1c3d0a2e87	[llvm-objdump] Fix type mismatch in std::min. I broke the build just now by trying to do std::min between a size_t and a uint64_t, which of course worked fine on my 64-bit test platform.	2022-07-26 10:02:37 +01:00
Simon Tatham	1bc7b06ffd	[llvm-objdump,ARM] Make dumpARMELFData line up with instructions. The whitespace in output lines containing disassembled instructions was extremely mismatched against that in `.word` lines produced from dumping literal pools and other data in Arm ELF files. This patch adjusts `dumpARMELFData` so that it uses the same alignment system as in the instruction pretty-printers. Now the two classes of line are aligned sensibly alongside each other. Reviewed By: DavidSpickett Differential Revision: https://reviews.llvm.org/D130359	2022-07-26 09:35:31 +01:00
Simon Tatham	2b38f58930	[llvm-objdump,ARM] Add PrettyPrinters for Arm and AArch64. Most Arm disassemblers, including GNU objdump and Arm's own `fromelf`, emit an instruction's raw encoding as a 32-bit words or (for Thumb) one or two 16-bit halfwords, in logical order rather than according to their storage endianness. This is generally easier to read: it matches the encoding diagrams in the architecture spec, it matches the value you'd write in a `.inst` directive, and it means that fields within the instruction encoding that span more than one byte (such as branch offsets or `SVC` immediates) can be read directly in the encoding without having to mentally reverse the bytes. llvm-objdump already has a system of PrettyPrinter subclasses which makes it easy for a target to drop in its own preferred formatting. This patch adds pretty-printers for all the Arm targets, so that llvm-objdump will display Arm instruction encodings in their preferred layout instead of little-endian and bytewise. Reviewed By: DavidSpickett Differential Revision: https://reviews.llvm.org/D130358	2022-07-26 09:35:30 +01:00
Simon Tatham	55f1fbf005	[MC,llvm-objdump,ARM] Target-dependent disassembly resync policy. Currently, when llvm-objdump is disassembling a code section and encounters a point where no instruction can be decoded, it uses the same policy on all targets: consume one byte of the section, emit it as "<unknown>", and try disassembling from the next byte position. On an architecture where instructions are always 4 bytes long and 4-byte aligned, this makes no sense at all. If a 4-byte word cannot be decoded as an instruction, then the next place that a valid instruction could //possibly// be found is 4 bytes further on. Disassembling from a misaligned address can't possibly produce anything that the code generator intended, or that the CPU would even attempt to execute. This patch introduces a new MCDisassembler virtual method called `suggestBytesToSkip`, which allows each target to choose its own resynchronization policy. For Arm (as opposed to Thumb) and AArch64, I've filled in the new method to return a fixed width of 4. Thumb is a more interesting case, because the criterion for identifying 2-byte and 4-byte instruction encodings is very simple, and doesn't require the particular instruction to be recognized. So `suggestBytesToSkip` is also passed an ArrayRef of the bytes in question, so that it can take that into account. The new test case shows Thumb disassembly skipping over two unrecognized instructions, and identifying one as 2-byte and one as 4-byte. For targets other than Arm and AArch64, this is NFC: the base class implementation of `suggestBytesToSkip` still returns 1, so that the existing behavior is unchanged. Other targets can fill in their own implementations as they see fit; I haven't attempted to choose a new behavior for each one myself. I've updated all the call sites of `MCDisassembler::getInstruction` in llvm-objdump, and also one in sancov, which was the only other place I spotted the same idiom of `if (Size == 0) Size = 1` after a call to `getInstruction`. Reviewed By: DavidSpickett Differential Revision: https://reviews.llvm.org/D130357	2022-07-26 09:35:30 +01:00
Simon Tatham	e35fec2c02	[llvm-objdump,ARM] Fix .byte directives dumping the wrong byte. The clause in `dumpARMELFData` that dumps a single byte as a `.byte` directive was printing the operand of that directive as `Bytes[0]`, not `Bytes[Index]`. In particular, this led to the `dumpBytes` output to its left not matching it! Reviewed By: DavidSpickett Differential Revision: https://reviews.llvm.org/D130360	2022-07-25 14:55:33 +01:00
Rahman Lavaee	ed93d157de	[llvm-objdump] Support --symbolize-operands when there is a single SHT_LLVM_BB_ADDR_MAP section for all text sections When linking, using `-Wl,-z,keep-text-section-prefix` results in multiple text sections while all `SHT_LLVM_BB_ADDR_MAP` sections are linked into a single one. In such case, we should not read the corresponding section for each text section, and instead read all `SHT_LLVM_BB_ADDR_MAP` sections before disassembly. Reviewed By: jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D129924	2022-07-18 16:51:22 -07:00
Namhyung Kim	69b312cde4	[llvm-objdump] Create fake sections for a ELF core file The linux perf tools use /proc/kcore for disassembly kernel functions. Actually it copies the relevant parts to a temp file and then pass it to objdump. But it doesn't have section headers so llvm-objdump cannot handle it. Let's create fake section headers for the program headers. It'd have a single section for each segment to cover the entire range. And for this purpose we can consider only executable code segments. With this change, I can see the following command shows proper outputs. perf annotate --stdio --objdump=/path/to/llvm-objdump Differential Revision: https://reviews.llvm.org/D128705	2022-07-14 13:39:59 +01:00
Kazu Hirata	611ffcf4e4	[llvm] Use value instead of getValue (NFC)	2022-07-13 23:11:56 -07:00
Joseph Huber	e0fb9f55b6	[llvm-objdump] Fix alignment issues when dumping offloading sections Summary: The `.llvm.offloading` section should always be aligned by `8`. However, we may want to show the offloading data stored in a static library. In this case, even though the section's alignment is correct, the offset inside the archive will result in the memory buffer being misaligned. TO combat this we simply check if the buffer does not have the proper alignment and copies it to a new buffer if not. This copy should have the proper alignment.	2022-07-08 14:30:06 -04:00
Fangrui Song	d3712b0852	[llvm-objdump] Change some nonnull pointers to references. NFC	2022-07-07 12:14:59 -07:00

1 2 3 4 5 ...

1075 Commits