llvm-project

Commit Graph

Author	SHA1	Message	Date
Sam McCall	882a05afa1	[Format] Fix crash when hitting eof while lexing JS template string Different loop termination conditions resulted in confusion of whether Offset was intended to be inside or outside the token. This ultimately led to constructing an out-of-range SourceLocation. Fix by making Offset consistently point *after* the token. Differential Revision: https://reviews.llvm.org/D135356	2022-10-06 17:00:41 +02:00
owenca	b60e7a7f1a	[clang-format] Handle C# interpolated verbatim string prefix @$ Fixes #58062. Differential Revision: https://reviews.llvm.org/D135026	2022-10-04 18:27:36 -07:00
Fangrui Song	3f18f7c007	[clang] LLVM_FALLTHROUGH => [[fallthrough]]. NFC With C++17 there is no Clang pedantic warning or MSVC C5051. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D131346	2022-08-08 09:12:46 -07:00
Kazu Hirata	71336d03f1	Use llvm::any_of (NFC)	2022-07-31 15:17:08 -07:00
sstwcw	f93182a887	[clang-format] Handle Verilog numbers and operators Reviewed By: HazardyKnusperkeks Differential Revision: https://reviews.llvm.org/D126845	2022-07-29 00:38:29 +00:00
owenca	0ffb3dd33e	[clang-format] Fix a hang when formatting C# $@ string literals Fixes #56624. Differential Revision: https://reviews.llvm.org/D130411	2022-07-25 23:17:54 -07:00
Kevin Cadieux	a9bef0707d	[clang-format] Fix incorrect isspace input (NFC) This change fixes a clang-format unit test failure introduced by [D124748](https://reviews.llvm.org/D124748). The `countLeadingWhitespace` function was calling `isspace` with values that could fall outside the valid input range. The valid input range for `isspace` is unsigned 0-255. Values outside this range produce undefined behavior, which on Windows manifests as an assertion being raised in the debug runtime libraries. `countLeadingWhitespace` was calling `isspace` with a signed char that could produce a negative value if the underlying byte's value was 128 or above, which can happen for non-ASCII encodings. The fix is to use `StringRef`'s `bytes_begin` and `bytes_end` iterators to read the values as unsigned chars instead. This bug can be reproduced by building the `check-clang-unit` target with a DEBUG configuration under Windows. This change is already covered by existing unit tests. Reviewed By: MyDeveloperDay Differential Revision: https://reviews.llvm.org/D128786	2022-06-29 10:20:46 -07:00
sstwcw	141ad3ba05	[clang-format] Fix uninitialized memory problem The setLength function checks for the token kind which could be uninitialized in the previous version. The problem was introduced in `2e32ff106e`. Reviewed By: MyDeveloperDay, owenpan Differential Revision: https://reviews.llvm.org/D128607	2022-06-26 22:23:50 +00:00
sstwcw	2e32ff106e	[clang-format] Handle Verilog preprocessor directives Verilog uses the backtick instead of the hash. In this revision backticks are lexed manually and then get labeled as hashes so the logic for handling C preprocessor stuff doesn't have to change. Hashes get labeled as identifiers for Verilog-specific stuff like delays. Reviewed By: HazardyKnusperkeks Differential Revision: https://reviews.llvm.org/D124749	2022-06-26 02:02:29 +00:00
sstwcw	370bee4801	[clang-format] Fix whitespace counting stuff The current way of counting whitespace would count backticks as whitespace. For Verilog stuff we need backticks to be handled correctly. For JavaScript the current way is to compare the entire token text to see if it's a backtick. However, when the backtick is the first token following an escaped newline, the escaped newline will be part of the tok::unknown token. Verilog has macros and escaped newlines unlike JavaScript. So we can't regard an entire tok::unknown token as whitespace. Previously, the start of every token would be matched for newlines. Now, it is all whitespace instead of just newlines. The column counting problem has already been fixed for JavaScript in `e71b4cbdd1` by counting columns elsewhere. Reviewed By: HazardyKnusperkeks Differential Revision: https://reviews.llvm.org/D124748	2022-06-26 01:27:27 +00:00
owenca	bebf7bdf9a	[clang-format][NFC] Insert/remove braces in clang/lib/Format/ Differential Revision: https://reviews.llvm.org/D126157	2022-05-24 19:06:04 -07:00
Marek Kurdej	573a5b5800	Revert "[clang-format] Fix WhitespaceSensitiveMacros not being honoured when macro closing parenthesis is followed by a newline." This reverts commit `50cd52d935`. It provoked regressions in C++ and ObjectiveC as described in https://reviews.llvm.org/D123676#3515949. Reproducers: ``` MACRO_BEGIN #if A int f(); #else int f(); #endif ``` ``` NS_SWIFT_NAME(A) @interface B : C @property(readonly) D value; @end ```	2022-05-18 07:27:45 +02:00
Marek Kurdej	50cd52d935	[clang-format] Fix WhitespaceSensitiveMacros not being honoured when macro closing parenthesis is followed by a newline. Fixes https://github.com/llvm/llvm-project/issues/54522. This fixes regression introduced in `5e5efd8a91`. Before the culprit commit, macros in WhitespaceSensitiveMacros were correctly formatted even if their closing parenthesis weren't followed by semicolon (or, to be precise, when they were followed by a newline). That commit changed the type of the macro token type from TT_UntouchableMacroFunc to TT_FunctionLikeOrFreestandingMacro. Correct formatting (with `WhitespaceSensitiveMacros = ['FOO']`): ``` FOO(1+2) FOO(1+2); ``` Regressed formatting: ``` FOO(1 + 2) FOO(1+2); ``` Reviewed By: HazardyKnusperkeks, owenpan, ksyx Differential Revision: https://reviews.llvm.org/D123676	2022-05-09 10:59:33 +02:00
Dawid Jurczak	a64d3c602f	[NFC][Lexer] Make Lexer::LangOpts const reference This change can be seen as code cleanup but motivation is more performance related. While browsing perf reports captured during Linux build we can notice unusual portion of instructions executed in std::vector<std::string> copy constructor like: 0.59% 0.58% clang-14 clang-14 [.] std::vector<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >, std::allocator<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > > >::vector or even: 1.42% 0.26% clang clang-14 [.] clang::LangOptions::LangOptions \| --1.16%--clang::LangOptions::LangOptions \| --0.74%--std::vector<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >, std::allocator<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > > >::vector After more digging we can see that relevant LangOptions std::vector members (Files, ModuleFeatures and NoBuiltinFuncs) are constructed when Lexer::LangOpts field is initialized on list: Lexer::Lexer(..., const LangOptions &langOpts, ...) : ..., LangOpts(langOpts), Since LangOptions copy constructor is called by Lexer(..., const LangOptions &LangOpts,...) and local Lexer objects are created thousands times (in Lexer::getRawToken, Preprocessor::EnterSourceFile and more) during single module processing in frontend it makes std::vector copy constructors surprisingly hot. Unfortunately even though in current Lexer implementation mentioned std::vector members are unused and most of time empty, no compiler is smart enough to optimize their std::vector copy constructors out (take a look at test assembly): https://godbolt.org/z/hdoxPfMYY even with LTO enabled. However there is simple way to fix this. Since Lexer doesn't access Files, ModuleFeatures, NoBuiltinFuncs and any other LangOptions fields (but only LangOptionsBase) we can simply get rid of redundant copy constructor assembly by changing LangOpts type to more appropriate const LangOptions reference: https://godbolt.org/z/fP7de9176 Additionally we need to store LineComment outside LangOpts because it's written in SkipLineComment function. Also FormatTokenLexer need to be adjusted a bit to avoid lifetime issues related to passing local LangOpts reference to Lexer. After this change I can see more than 1% speedup in some of my microbenchmarks when using Clang release binary built with LTO. For Linux build gains are not so significant but still nice at the level of -0.4%/-0.5% instructions drop. Differential Revision: https://reviews.llvm.org/D120334	2022-02-28 15:42:19 +01:00
Marek Kurdej	fee4a9712f	[clang-format] Use FormatToken::is* functions without passing through `Tok`. NFC.	2022-02-22 16:41:15 +01:00
Marek Kurdej	7d5062c6ac	[clang-format] Remove unnecessary parentheses in return statements. NFC.	2022-02-12 21:25:52 +01:00
Marek Kurdej	d079995dd0	[clang-format] Elide unnecessary braces. NFC.	2022-02-02 15:28:53 +01:00
Marek Kurdej	10243d0dfd	[clang-format] Simplify use of StringRef::substr(). NFC.	2022-02-02 14:36:00 +01:00
owenca	c95afac89e	[clang-format][NFC] Clean up tryMergeLessLess() Differential Revision: https://reviews.llvm.org/D117759	2022-01-20 14:35:07 -08:00
Marek Kurdej	82452be5cb	[clang-format] Refactor: add FormatToken::hasWhitespaceBefore(). NFC. This factors out a pattern that comes up from time to time. Reviewed By: MyDeveloperDay, HazardyKnusperkeks, owenpan Differential Revision: https://reviews.llvm.org/D117769	2022-01-20 21:16:17 +01:00
Jino Park	560eb2277b	[clang-format] Fix bug in parsing `operator<` with template Fixes https://github.com/llvm/llvm-project/issues/44601. This patch handles a bug when parsing a below example code : ``` template <class> class S; template <class T> bool operator<(S<T> const &x, S<T> const &y) { return x.i < y.i; } template <class T> class S { int i = 42; friend bool operator< <>(S const &, S const &); }; int main() { return S<int>{} < S<int>{}; } ``` which parse `< <>` as `<< >`, not `< <>` in terms of tokens as discussed in discord. 1. Add a condition in `tryMergeLessLess()` considering `operator` keyword and `>` 2. Force to leave a whitespace between `tok::less` and a template opener 3. Add unit test Reviewed By: MyDeveloperDay, curdeius Differential Revision: https://reviews.llvm.org/D117398	2022-01-20 08:59:04 +01:00
mydeveloperday	142e79b868	[clang-format] NFC use recently added Style.isJavaScript() Improve the readability of these if(Style==FormatStyle::LK_JavaScript) clauses	2021-12-21 14:24:12 +00:00
Manuel Klimek	d688b31628	Fix segfault in clang-format. Fix bug where we'd read past the end of the tokens after merging _T macro strings.	2021-12-01 11:57:41 +01:00
mydeveloperday	1cb3cfd932	[clang-format] [NFC] build clang-format with -Wall When building clang-format with -Wall on Visual Studio 2019 we see the following; prevent this, the only -Wall error ``` ..FormatTokenLexer.cpp(45) : warning C4868: compiler may not enforce left-to-right evaluation order in braced initializer list ``` Reviewed By: HazardyKnusperkeks Differential Revision: https://reviews.llvm.org/D113844	2021-11-23 10:43:27 +00:00
Kazu Hirata	f1c159cc90	[Format, Sema] Use range-based for loops with llvm::reverse (NFC)	2021-11-17 08:52:35 -08:00
Vitali Lovich	be9a87fe9b	[clang-format] Add IfMacros option https://bugs.llvm.org/show_bug.cgi?id=49354 Differential Revision: https://reviews.llvm.org/D102730	2021-06-23 08:51:53 -07:00
Eliza Velasquez	cdf33962d9	[clang-format] Rename common types between C#/JS Reviewed By: curdeius Differential Revision: https://reviews.llvm.org/D101862	2021-05-06 12:12:58 +02:00
Eliza Velasquez	a437befa8f	[clang-format] Add more support for C# 8 nullables This adds support for the null-coalescing assignment and null-forgiving operators. https://docs.microsoft.com/en-us/dotnet/csharp/language-reference/operators/null-coalescing-operator https://docs.microsoft.com/en-us/dotnet/csharp/language-reference/operators/null-forgiving Reviewed By: krasimir, curdeius Differential Revision: https://reviews.llvm.org/D101702	2021-05-06 11:58:38 +02:00
Björn Schäpers	bcc1dee600	[clang-format] Add StatementAttributeLikeMacros option This allows to ignore for example Qt's emit when AlignConsecutiveDeclarations is set, otherwise it is parsed as a type and it results in some misformatting: unsigned char MyChar = 'x'; emit signal(MyChar); Differential Revision: https://reviews.llvm.org/D93776	2021-01-18 06:54:31 +01:00
Martin Probst	16212b8b3e	clang-format: [JS] support new assignment operators. Before: a && = b; After: a &&= b; These operators are new additions in ES2021. Differential Revision: https://reviews.llvm.org/D91132	2020-11-10 09:26:46 +01:00
Duncan P. N. Exon Smith	b3eff6b7bb	Lexer: Update the Lexer to use MemoryBufferRef, NFC Update `Lexer` / `Lexer::Lexer` to use `MemoryBufferRef` instead of `MemoryBuffer`. Callers that were acquiring a `MemoryBuffer` via `SourceManager::getBuffer` were updated, such that if they checked `Invalid` they use `getBufferOrNone` and otherwise `getBufferOrFake`. Differential Revision: https://reviews.llvm.org/D89398	2020-10-19 19:10:21 -04:00
Duncan P. N. Exon Smith	0ddf4bd47c	clang/{Format,Rewrite}: Stop using SourceManager::getBuffer, NFC Update clang/lib/Format and clang/lib/Rewrite to use a `MemoryBufferRef` from `getBufferOrFake` instead of `MemoryBuffer*` from `getBuffer`. No functionality change here, since the call sites weren't checking if the buffer was valid. Differential Revision: https://reviews.llvm.org/D89406	2020-10-19 17:02:59 -04:00
mydeveloperday	40e771c1c0	[clang-format][regression][PR47461] ifdef causes catch to be seen as a function https://bugs.llvm.org/show_bug.cgi?id=47461 The following change {D80940} caused a regression in code which ifdef's around the try and catch block cause incorrect brace placement around the catch ``` try { } catch (...) { // This is not a small function bar = 1; } } ``` The brace after the catch will be placed on a newline Reviewed By: curdeius Differential Revision: https://reviews.llvm.org/D87291	2020-09-17 13:23:06 +01:00
Alex Richardson	e7bd058c7e	[clang-format] Allow configuring list of macros that map to attributes This adds a `AttributeMacros` configuration option that causes certain identifiers to be parsed like a __attribute__((foo)) annotation. This is motivated by our CHERI C/C++ fork which adds a __capability qualifier for pointer/reference. Without this change clang-format parses many type declarations as multiplications/bitwise-and instead. I initially considered adding "__capability" as a new clang-format keyword, but having a list of macros that should be treated as attributes is more flexible since it can be used e.g. for static analyzer annotations or other language extensions. Example: std::vector<foo * __capability> -> std::vector<foo *__capability> Depends on D86775 (to apply cleanly) Reviewed By: MyDeveloperDay, jrtc27 Differential Revision: https://reviews.llvm.org/D86782	2020-09-07 10:09:17 +01:00
Manuel Klimek	8c2a613976	Hand Allocator and IdentifierTable into FormatTokenLexer. This allows us to share the allocator in the future so we can create tokens while parsing. Differential Revision: https://reviews.llvm.org/D83218	2020-07-07 11:56:34 +02:00
Jake Merdich	0c332a7784	[clang-format] Preserve whitespace in selected macros Summary: https://bugs.llvm.org/show_bug.cgi?id=46383 When the c preprocessor stringizes tokens, the generated string literals are affected by the whitespace. This means clang-format can affect codegen silently, adding spaces and newlines to strings. Practically speaking, the vast majority of cases will be harmless, only affecting single identifiers or debug macros. In the interest of doing no harm in other cases though, this introduces a blacklist option 'WhitespaceSensitiveMacros', which contains a list of names of function-like macros whose contents should not be touched by clang-format, period. Clang-format can't automatically detect these without a real compile context, so users will have to specify it explicitly (it still beats clang-format off'ing at every invocation). Defaults include "STRINGIZE", "PP_STRINGIZE", and "BOOST_PP_STRINGIZE". Subscribers: kristof.beyls, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D82620	2020-06-29 09:57:47 -04:00
mydeveloperday	6a0484f04b	[clang-format] [PR46159] Linux kernel 'C' code uses 'try' as a variable name, allow clang-format to handle such cases Reviewed By: curdeius Differential Revision: https://reviews.llvm.org/D80940	2020-06-03 20:44:45 +01:00
mydeveloperday	cc918e90c0	[clang-format] [PR33890] Add support for Microsoft C++/CLI non standard for each looping extension Summary: https://bugs.llvm.org/show_bug.cgi?id=33890 This revision allow the microsoft `for each(.... in ...` nonstandard C++ extension which can be used in C++/CLI to be handled as a ForEach macro. This prevents the breaking between the for and each onto a new line Reviewed By: JakeMerdichAMD Subscribers: cfe-commits Tags: #clang, #clang-format Differential Revision: https://reviews.llvm.org/D80228	2020-05-20 07:44:36 +01:00
Manuel Klimek	53cc90f789	Make FormatToken::Type private. This enables us to intercept changes to the token type via setType(), which is a precondition for being able to use multi-pass formatting for macro arguments. Differential Revision: https://reviews.llvm.org/D67405	2020-05-13 16:37:58 +02:00
Jonathan Coe	900dee8c8e	[clang-format] Do not merge target-name and : for C# attributes Summary: Re-use token type `TT_AttributeColon` for C# attribute target colons. Reviewers: krasimir Reviewed By: krasimir Subscribers: MyDeveloperDay, cfe-commits Tags: #clang-format, #clang Differential Revision: https://reviews.llvm.org/D75465	2020-03-03 22:17:25 +00:00
Jonathan Coe	9c4afce702	[clang-format] Rename CSharpNullConditionalSq and add missing test Summary: Rename CSharpNullConditionalSq to CSharpNullConditionalLSquare. Add test for spaces inside [] with C# Null conditionals. Address comments missed from https://reviews.llvm.org/D75368. Reviewers: krasimir Reviewed By: krasimir Subscribers: cfe-commits Tags: #clang-format, #clang Differential Revision: https://reviews.llvm.org/D75456	2020-03-02 15:46:33 +00:00
Jonathan Coe	c3af063c2b	[clang-format] Handle NullCoalescing and NullConditional operators in C# Summary: Disable merging of Type? into a single token. Merge ?? ?. and ?[ into a single token. Reviewers: krasimir, MyDeveloperDay Reviewed By: krasimir Subscribers: cfe-commits Tags: #clang-format, #clang Differential Revision: https://reviews.llvm.org/D75368	2020-03-02 13:55:54 +00:00
Jonathan Coe	7dfe0cc7f5	[clang-format] Recognize C# named argument colons as a token type Summary: No longer merge 'name' and ':' into a single token. Ensure that line breaks cannot be placed before or after a named-argument colon. Ensure that no space is inserted before a named-argument colon. Reviewers: krasimir Reviewed By: krasimir Subscribers: cfe-commits, MyDeveloperDay Tags: #clang-format, #clang Differential Revision: https://reviews.llvm.org/D75244	2020-02-27 13:47:29 +00:00
Jonathan Coe	a11ff39ba2	[clang-format] Merge name and colon into a single token for C# named arguments Summary: Merge 'argumentName' and ':' into a single token in foo(argumentName: bar). Add C# named argument as a token type. Reviewers: krasimir, MyDeveloperDay Reviewed By: krasimir Tags: #clang-format Differential Revision: https://reviews.llvm.org/D74894	2020-02-20 19:23:38 +00:00
Jonathan Coe	ca1fd460f1	[clang-format] Do not treat C# attribute targets as labels Summary: Merge '[', 'target' , ':' into a single token for C# attributes to prevent the target from being seen as a label. Reviewers: MyDeveloperDay, krasimir Reviewed By: krasimir Tags: #clang-format Differential Revision: https://reviews.llvm.org/D74043	2020-02-05 17:30:24 +00:00
Jonathan Coe	f9f0919db7	[clang-format] Improve support for multiline C# strings Reviewers: krasimir Reviewed By: krasimir Tags: #clang-format Differential Revision: https://reviews.llvm.org/D73622	2020-01-30 13:45:48 +00:00
Jonathan Coe	9d212e83e9	[clang-format] Handle quotes and escaped braces in C# interpolated strings Summary: This addresses issues raised in https://bugs.llvm.org/show_bug.cgi?id=44454. There are outstanding issues with multi-line verbatim strings in C# that will be addressed in a follow-up PR. Reviewers: krasimir, MyDeveloperDay Reviewed By: krasimir, MyDeveloperDay Subscribers: MyDeveloperDay Tags: #clang-format Differential Revision: https://reviews.llvm.org/D73492	2020-01-28 14:46:27 +00:00
Krasimir Georgiev	36a8f7f6d8	[clang-format] Handle escaped " in C# string-literals Reviewers: krasimir Reviewed By: krasimir Subscribers: klimek, MyDeveloperDay Tags: #clang-format Differential Revision: https://reviews.llvm.org/D73353	2020-01-27 12:57:20 +01:00
Martin Probst	4c9d691445	clang-format: [JS] fix `??` operator wrapping. Summary: clang-format currently treats the nullish coalescing operator `??` like the ternary operator. That causes multiple nullish terms to be each indented relative to the last `??`, as they would in a ternary. The `??` operator is often used in chains though, and as such more similar to other binary operators, such as `\|\|`. So to fix the indent, set its token type to `\|\|`, so it inherits the same treatment. This opens up the question of operator precedence. However, `??` is required to be parenthesized when mixed with `\|\|` and `&&`, so this is not a problem that can come up in syntactically legal code. Reviewers: krasimir Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D73026	2020-01-20 17:07:14 +01:00
Martin Probst	a7638d3849	clang-format: [JS] support null operators. Summary: JavaScript / TypeScript is adding two new operators: the null propagating operator `?.` and the nullish coalescing operator `??`. const x = foo ?? 'default'; const z = foo?.bar?.baz; This change adds support to lex and format both. Reviewers: krasimir Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D69971	2019-11-11 16:35:35 +01:00
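
Note on a9bef0707d: the `isspace` problem described in that commit message can be illustrated with a minimal sketch. The function name `countLeadingWhitespaceSketch` and the use of `std::string` are assumptions made for illustration; this is not the clang-format implementation, which per the commit message reads bytes through `StringRef`'s `bytes_begin` and `bytes_end` iterators. The sketch only shows the general idea of converting each byte to `unsigned char` before calling `isspace`.

```cpp
#include <cctype>
#include <cstddef>
#include <string>

// Hypothetical helper for illustration only; not the clang-format code.
// Counts leading whitespace bytes without passing negative values to isspace.
std::size_t countLeadingWhitespaceSketch(const std::string &Text) {
  std::size_t Count = 0;
  for (char C : Text) {
    // Where plain char is signed, a byte of 128 or above is negative.
    // Converting to unsigned char keeps the argument in the 0-255 range
    // that isspace accepts.
    if (!std::isspace(static_cast<unsigned char>(C)))
      break;
    ++Count;
  }
  return Count;
}
```

Calling the helper on text that starts with a non-ASCII byte (for example a UTF-8 encoded character) then stays within defined behavior, which is the property the commit restores.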


82 Commits
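
Note on 0c332a7784: the motivation for `WhitespaceSensitiveMacros` (reformatting a macro argument can change the string the preprocessor produces) can be seen in a small sketch. The macro below is defined locally for the example, and the function names `compact` and `spaced` are hypothetical; only the name `STRINGIZE` is taken from the defaults listed in the commit message.

```cpp
#include <string>

// Locally defined stringizing macro, for illustration only.
#define STRINGIZE(x) #x

// The argument as the author wrote it, with no spaces around '+'.
std::string compact() { return STRINGIZE(1+2); }

// The same argument after a formatter has inserted spaces around '+'.
std::string spaced() { return STRINGIZE(1 + 2); }
```

Here `compact()` yields `"1+2"` and `spaced()` yields `"1 + 2"`, so a whitespace-only reformat of the macro argument changes the generated string literal. Listing the macro under `WhitespaceSensitiveMacros` asks clang-format to leave such arguments untouched.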