llvm-project

Commit Graph

Author	SHA1	Message	Date
Dominic Chen	5470b1fcb5	[libc] Resolve NaN/implementation-defined behavior of floating-point tests Differential Revision: https://reviews.llvm.org/D134917	2022-10-05 16:23:19 -07:00
Siva Chandra Reddy	3e1986682f	[libc][Obvious] Add "__" prefix to sched_getcpucount in the spec and elsewhere. Without this fix, the declaration in sched.h will not have the "__" prefix and will cause a compile failure. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D135286	2022-10-05 18:01:13 +00:00
Siva Chandra Reddy	995105de1b	[libc] Add the POSIX waitpid function and the BSD wait4 function. Reviewed By: lntue, michaelrj Differential Revision: https://reviews.llvm.org/D135225	2022-10-05 07:38:55 +00:00
Michael Jones	a9c7d0e1ae	[libc] fix fullbuild handling for macro functions Now they're only included at all if the mode is fullbuild. Differential Revision: https://reviews.llvm.org/D135221	2022-10-04 15:37:31 -07:00
Michael Jones	0b790afb06	[libc] add CPU_COUNT macro and backing function Add the macro CPU_COUNT as well as a backing function to implement the functionality. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D135179	2022-10-04 15:22:55 -07:00
Michael Jones	38b6f58e33	[libc] implement basic rand and srand This provides the reference implementation of rand and srand. In future this will likely be upgraded to something that supports full ints. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D135187	2022-10-04 13:31:26 -07:00
Siva Chandra Reddy	e3638e83db	[libc] Add a minimal implementation of the POSIX fork function. A very simple and minimal implementation of fork is added. Future changes will add more functionality to satisfy POSIX and Linux requirements. An implementation of wait and a few support macros in sys/wait.h have also been added to help with testing the fork function. Reviewed By: lntue, michaelrj Differential Revision: https://reviews.llvm.org/D135131	2022-10-04 19:12:08 +00:00
Michael Jones	dbca7b4b2e	[libc] disable syscall test without fullbuild Our syscall implementation depends on a specific macro that's only defined in our headers. If we're not using our headers, then the test doesn't work. I've disabled the test in this case because there's no point in testing the system libc's syscall implementation. Differential Revision: https://reviews.llvm.org/D134994	2022-09-30 15:57:10 -07:00
Michael Jones	1801c356f6	[libc] add syscall function Add the syscall wrapper function and tests. It's implemented using a macro to guarantee the minimum number of arguments. Reviewed By: sivachandra, lntue Differential Revision: https://reviews.llvm.org/D134919	2022-09-30 15:46:28 -07:00
Siva Chandra Reddy	215c9fa4de	[libc] Re-enable functions from signal.h and re-enable abort. They were disabled because we were including linux/signal.h from our signal.h. Linux's signal.h is not designed to be included from user programs as it causes a lot of non-standard name pollution. Also, it is not self-contained. This change defines types and macros relevant for signal related syscalls within libc's headers and removes inclusion of Linux headers. This patch enables the funtions only for x86_64. They will be enabled for aarch64 also in a follow up patch after testing. Reviewed By: abrachet, lntue Differential Revision: https://reviews.llvm.org/D134567	2022-09-30 07:31:50 +00:00
Michael Jones	073534cb6f	[libc][windows] rename fenv internals for windows On windows, including math.h causes macros for "OVERFLOW" and "UNDERFLOW" to be defined. This patch renames some variables internal to FEnvImpl.h to avoid colliding with those. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D134775	2022-09-29 16:55:02 -07:00
Siva Chandra	5e56e294ae	[libc][Obvious] Enable some of the recently added functions on aarch64.	2022-09-29 15:06:44 -07:00
Siva Chandra Reddy	545b954251	[libc] Add GNU extension functions sched_getaffinity and sched_setaffinity. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D134858	2022-09-29 20:31:46 +00:00
Michael Jones	b49d626cb4	[libc] add clock_gettime Add the clock_gettime syscall wrapper and tests. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D134773	2022-09-29 10:23:21 -07:00
Guillaume Chatelet	060a43ced2	[libc][NFC] Move alignment utils to utils.h	2022-09-29 13:51:35 +00:00
Guillaume Chatelet	e095c3ed7c	[libc][test] Better reporting for MemoryMatcher	2022-09-29 12:14:05 +00:00
Siva Chandra Reddy	3367539010	[libc] Add implementation of pthread_once. The existing thrd_once function has been refactored so that the implementation can be shared between thrd_once and pthread_once functions. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D134716	2022-09-28 06:54:48 +00:00
Michael Jones	943dcf87e3	[libc][windows] fix small build issues. The windows build has fallen behind a little, this patch fixes some issues that were preventing it from building. Specifically: Some subfolders weren't being included, leading to missing targets in the cmake. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D134676	2022-09-27 15:59:01 -07:00
Tue Ly	e15b2da42f	[libc][math] Simplify tanf implementation and improve its performance. Simplify `tanf` implementation and improve its performance. Completely reuse the implementation of `sinf`, `cosf`, `sincosf` and use the definition `tan(x) = sin(x)/cos(x)`. Performance benchmark using perf tool from the CORE-MATH project on Ryzen 1700: ``` $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh tanf GNU libc version: 2.35 GNU libc release: stable CORE-MATH reciprocal throughput : 18.558 System LIBC reciprocal throughput : 49.919 BEFORE: LIBC reciprocal throughput : 36.480 LIBC reciprocal throughput : 27.217 (with `-msse4.2` flag) LIBC reciprocal throughput : 20.205 (with `-mfma` flag) AFTER: LIBC reciprocal throughput : 30.337 LIBC reciprocal throughput : 21.072 (with `-msse4.2` flag) LIBC reciprocal throughput : 15.804 (with `-mfma` flag) $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh tanf --latency GNU libc version: 2.35 GNU libc release: stable CORE-MATH latency : 56.702 System LIBC latency : 107.206 BEFORE LIBC latency : 97.598 LIBC latency : 91.119 (with `-msse4.2` flag) LIBC latency : 82.655 (with `-mfma` flag) AFTER LIBC latency : 74.560 LIBC latency : 66.575 (with `-msse4.2` flag) LIBC latency : 61.636 (with `-mfma` flag) ``` Reviewed By: zimmermann6 Differential Revision: https://reviews.llvm.org/D134575	2022-09-26 21:36:12 -04:00
Guillaume Chatelet	aec908f9b2	[libc][NFC] Move bzero_inline to separate file This allows for easier discovery.	2022-09-26 12:57:51 +00:00
Guillaume Chatelet	2188cf9fa4	[libc][NFC] Remove new framework, a simpler one is coming	2022-09-26 12:42:38 +00:00
Guillaume Chatelet	8a55dafdd0	[libc][NFC] introduce inline_bzero	2022-09-26 12:34:10 +00:00
Raman Tenneti	8f1e362ee9	Implement nanosleep per https://pubs.opengroup.org/onlinepubs/009695399/basedefs/time.h.html Tested: Limited unit test: This makes a call and checks that no error was returned, but we currently don't have the ability to ensure that time has elapsed as expected. Co-authored-by: Jeff Bailey <jeffbailey@google.com> Reviewed By: sivachandra, jeffbailey Differential Revision: https://reviews.llvm.org/D134095	2022-09-24 00:13:58 +00:00
Michael Jones	736e215ca7	[libc][obvious] disable mprotect test under sanitizers fixes a build failure in my previous patch Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D134552	2022-09-23 10:05:09 -07:00
Michael Jones	85c70da732	[libc] add madvise and posix_madvise Add the madvise and posix_madvise syscall wrappers and tests. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D134498	2022-09-23 09:36:40 -07:00
Michael Jones	47b724048b	[libc] add mprotect Add the mprotect syscall wrapper and tests. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D134497	2022-09-23 09:36:37 -07:00
Michael Jones	8d615a5e8a	[libc] move sys/mman macros to /include Previously the mman macros were in api.td, but platform differences are easier to handle with preprocessor macros so they have been moved to include. Also I completed the list of macros (at least for what I need soon) and fixed some previously incorrect values. Reviewed By: sivachandra, lntue Differential Revision: https://reviews.llvm.org/D134491	2022-09-23 09:36:34 -07:00
Siva Chandra Reddy	4f1474daec	[libc] Add implementations of POSIX getpid, getppid, getuid, geteuid functions. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D134338	2022-09-21 18:41:20 +00:00
Siva Chandra Reddy	e310f8bddf	[libc] Add implementation of functions stat, fstat and lstat. All supporting type and macro definitions have also been added. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D134262	2022-09-21 18:35:02 +00:00
Michael Jones	a9e0dbefdd	[libc] add fputs and puts add fputs, puts, and the EOF macro that they use. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D134328	2022-09-21 11:10:20 -07:00
Michael Jones	42bcb35c0f	[libc] add strerror Strerror maps error numbers to strings. Additionally, a utility for mapping errors to strings was added so that it could be reused for perror and similar. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D134074	2022-09-20 16:23:36 -07:00
Jeff Bailey	faeb237bac	[libc] Fix TWS issues in .td files Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D134256	2022-09-20 14:25:53 +00:00
Jeff Bailey	6007a4a619	[libc] Remove unneeded extra include Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D134255	2022-09-20 14:24:46 +00:00
Tue Ly	354ee3814c	[libc][Obvious] Fix exp10f spec.	2022-09-19 11:21:01 -04:00
Tue Ly	47c4a87641	[libc][Obvious] Remove constexpr qualifier from Exp10Base::powb_lo.	2022-09-19 10:13:29 -04:00
Tue Ly	a752460d73	[libc][math] Implement exp10f function correctly rounded to all rounding modes. Implement exp10f function correctly rounded to all rounding modes. Algorithm: perform range reduction to reduce ``` 10^x = 2^(hi + mid) * 10^lo ``` where: ``` hi is an integer, 0 <= mid * 2^5 < 2^5 -log10(2) / 2^6 <= lo <= log10(2) / 2^6 ``` Then `2^mid` is stored in a table of 32 entries and the product `2^hi * 2^mid` is performed by adding `hi` into the exponent field of `2^mid`. `10^lo` is then approximated by a degree-5 minimax polynomials generated by Sollya with: ``` > P = fpminimax((10^x - 1)/x, 4, [\|D...\|], [-log10(2)/64. log10(2)/64]); ``` Performance benchmark using perf tool from the CORE-MATH project on Ryzen 1700: ``` $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh exp10f GNU libc version: 2.35 GNU libc release: stable CORE-MATH reciprocal throughput : 10.215 System LIBC reciprocal throughput : 7.944 LIBC reciprocal throughput : 38.538 LIBC reciprocal throughput : 12.175 (with `-msse4.2` flag) LIBC reciprocal throughput : 9.862 (with `-mfma` flag) $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh exp10f --latency GNU libc version: 2.35 GNU libc release: stable CORE-MATH latency : 40.744 System LIBC latency : 37.546 BEFORE LIBC latency : 48.989 LIBC latency : 44.486 (with `-msse4.2` flag) LIBC latency : 40.221 (with `-mfma` flag) ``` This patch relies on https://reviews.llvm.org/D134002 Reviewed By: orex, zimmermann6 Differential Revision: https://reviews.llvm.org/D134104	2022-09-19 10:01:40 -04:00
Tue Ly	cd1d71c5f1	[libc][Obvious] Remove constexpr qualifier from ExpBase::powb_lo.	2022-09-19 09:29:37 -04:00
Tue Ly	4973eee122	[libc][math] Improve tanhf performance. Optimize the core part of `tanhf` implementation that is to compute `e^x` similar to https://reviews.llvm.org/D133870. Factor the constants and polynomial approximation out so that it can be used for `exp10f` Performance benchmark using perf tool from the CORE-MATH project on Ryzen 1700: ``` $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh tanhf GNU libc version: 2.35 GNU libc release: stable CORE-MATH reciprocal throughput : 13.377 System LIBC reciprocal throughput : 55.046 BEFORE: LIBC reciprocal throughput : 75.674 LIBC reciprocal throughput : 33.242 (with `-msse4.2` flag) LIBC reciprocal throughput : 25.927 (with `-mfma` flag) AFTER: LIBC reciprocal throughput : 26.359 LIBC reciprocal throughput : 18.888 (with `-msse4.2` flag) LIBC reciprocal throughput : 14.243 (with `-mfma` flag) $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh tanhf --latency GNU libc version: 2.35 GNU libc release: stable CORE-MATH latency : 43.365 System LIBC latency : 123.499 BEFORE LIBC latency : 112.968 LIBC latency : 104.908 (with `-msse4.2` flag) LIBC latency : 92.310 (with `-mfma` flag) AFTER LIBC latency : 69.828 LIBC latency : 63.874 (with `-msse4.2` flag) LIBC latency : 57.427 (with `-mfma` flag) ``` Reviewed By: orex, zimmermann6 Differential Revision: https://reviews.llvm.org/D134002	2022-09-19 08:43:03 -04:00
Michael Jones	70f1f302ca	[libc][cmake] separate installing headers Now libc headers can be installed separately from installing the rest of the libc. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D133960	2022-09-16 15:50:28 -07:00
Siva Chandra Reddy	7fb96fb5d3	[libc] Add implementation of POSIX "uname" function. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D134065	2022-09-16 21:21:29 +00:00
Siva Chandra Reddy	9050a59c66	[libc][Obvious] Fix typo in struct rlimit name - remove the "_t" suffix.	2022-09-16 21:07:17 +00:00
Siva Chandra Reddy	f5cbbb9988	[libc] Add implementation of POSIX setrlimit and getrlimit functions. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D134016	2022-09-16 20:50:28 +00:00
Siva Chandra Reddy	d23d858d04	[libc] Add the implementation of the "remove" function. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D133922	2022-09-15 17:32:02 +00:00
Tue Ly	1c89ae71ea	[libc][math] Improve sinhf and coshf performance. Optimize `sinhf` and `coshf` by computing exp(x) and exp(-x) simultaneously. Currently `sinhf` and `coshf` are implemented using the following formulas: ``` sinh(x) = 0.5 (exp(x) - 1) - 0.5(exp(-x) - 1) cosh(x) = 0.5exp(x) + 0.5exp(-x) ``` where `exp(x)` and `exp(-x)` are calculated separately using the formula: ``` exp(x) ~ 2^hi * 2^mid * exp(dx) ~ 2^hi * 2^mid * P(dx) ``` By expanding the polynomial `P(dx)` into even and odd parts ``` P(dx) = P_even(dx) + dx * P_odd(dx) ``` we can see that the computations of `exp(x)` and `exp(-x)` have many things in common, namely: ``` exp(x) ~ 2^(hi + mid) * (P_even(dx) + dx * P_odd(dx)) exp(-x) ~ 2^(-(hi + mid)) * (P_even(dx) - dx * P_odd(dx)) ``` Expanding `sinh(x)` and `cosh(x)` with respect to the above formulas, we can compute these two functions as follow in order to maximize the sharing parts: ``` sinh(x) = (e^x - e^(-x)) / 2 ~ 0.5 * (P_even * (2^(hi + mid) - 2^(-(hi + mid))) + dx * P_odd * (2^(hi + mid) + 2^(-(hi + mid)))) cosh(x) = (e^x + e^(-x)) / 2 ~ 0.5 * (P_even * (2^(hi + mid) + 2^(-(hi + mid))) + dx * P_odd * (2^(hi + mid) - 2^(-(hi + mid)))) ``` So in this patch, we perform the following optimizations for `sinhf` and `coshf`: # Use the above formulas to maximize sharing intermediate results, # Apply similar optimizations from https://reviews.llvm.org/D133870 Performance benchmark using `perf` tool from the CORE-MATH project on Ryzen 1700: For `sinhf`: ``` $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh sinhf GNU libc version: 2.35 GNU libc release: stable CORE-MATH reciprocal throughput : 16.718 System LIBC reciprocal throughput : 63.151 BEFORE: LIBC reciprocal throughput : 90.116 LIBC reciprocal throughput : 28.554 (with `-msse4.2` flag) LIBC reciprocal throughput : 22.577 (with `-mfma` flag) AFTER: LIBC reciprocal throughput : 36.482 LIBC reciprocal throughput : 16.955 (with `-msse4.2` flag) LIBC reciprocal throughput : 13.943 (with `-mfma` flag) $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh sinhf --latency GNU libc version: 2.35 GNU libc release: stable CORE-MATH latency : 48.821 System LIBC latency : 137.019 BEFORE LIBC latency : 97.122 LIBC latency : 84.214 (with `-msse4.2` flag) LIBC latency : 71.611 (with `-mfma` flag) AFTER LIBC latency : 54.555 LIBC latency : 50.865 (with `-msse4.2` flag) LIBC latency : 48.700 (with `-mfma` flag) ``` For `coshf`: ``` $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh coshf GNU libc version: 2.35 GNU libc release: stable CORE-MATH reciprocal throughput : 16.939 System LIBC reciprocal throughput : 19.695 BEFORE: LIBC reciprocal throughput : 52.845 LIBC reciprocal throughput : 29.174 (with `-msse4.2` flag) LIBC reciprocal throughput : 22.553 (with `-mfma` flag) AFTER: LIBC reciprocal throughput : 37.169 LIBC reciprocal throughput : 17.805 (with `-msse4.2` flag) LIBC reciprocal throughput : 14.691 (with `-mfma` flag) $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh coshf --latency GNU libc version: 2.35 GNU libc release: stable CORE-MATH latency : 48.478 System LIBC latency : 48.044 BEFORE LIBC latency : 99.123 LIBC latency : 85.595 (with `-msse4.2` flag) LIBC latency : 72.776 (with `-mfma` flag) AFTER LIBC latency : 57.760 LIBC latency : 53.967 (with `-msse4.2` flag) LIBC latency : 50.987 (with `-mfma` flag) ``` Reviewed By: orex, zimmermann6 Differential Revision: https://reviews.llvm.org/D133913	2022-09-15 09:20:39 -04:00
Siva Chandra Reddy	6e675fba3a	[libc] Add POSIX functions pread and pwrite. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D133888	2022-09-14 20:52:20 +00:00
Tue Ly	e6226e6b72	[libc][math] Improve exp2f performance. Reduce the number of subintervals that need lookup table and optimize the evaluation steps. Currently, `exp2f` is computed by reducing to `2^hi * 2^mid * 2^lo` where `-16/32 <= mid <= 15/32` and `-1/64 <= lo <= 1/64`, and `2^lo` is then approximated by a degree 6 polynomial. Experiment with Sollya showed that by using a degree 6 polynomial, we can approximate `2^lo` for a bigger range with reasonable errors: ``` > P = fpminimax((2^x - 1)/x, 5, [\|D...\|], [-1/64, 1/64]); > dirtyinfnorm(2^x - 1 - xP, [-1/64, 1/64]); 0x1.e18a1bc09114def49eb851655e2e5c4dd08075ac2p-63 > P = fpminimax((2^x - 1)/x, 5, [\|D...\|], [-1/32, 1/32]); > dirtyinfnorm(2^x - 1 - xP, [-1/32, 1/32]); 0x1.05627b6ed48ca417fe53e3495f7df4baf84a05e2ap-56 ``` So we can optimize the implementation a bit with: # Reduce the range to `mid = i/16` for `i = 0..15` and `-1/32 <= lo <= 1/32` # Store the table `2^mid` in bits, and add `hi` directly to its exponent field to compute `2^hi * 2^mid` # Rearrange the order of evaluating the polynomial approximating `2^lo`. Performance benchmark using perf tool from the CORE-MATH project on Ryzen 1700: ``` $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh exp2f GNU libc version: 2.35 GNU libc release: stable CORE-MATH reciprocal throughput : 9.534 System LIBC reciprocal throughput : 6.229 BEFORE: LIBC reciprocal throughput : 21.405 LIBC reciprocal throughput : 15.241 (with `-msse4.2` flag) LIBC reciprocal throughput : 11.111 (with `-mfma` flag) AFTER: LIBC reciprocal throughput : 18.617 LIBC reciprocal throughput : 12.852 (with `-msse4.2` flag) LIBC reciprocal throughput : 9.253 (with `-mfma` flag) $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh exp2f --latency GNU libc version: 2.35 GNU libc release: stable CORE-MATH latency : 40.869 System LIBC latency : 30.580 BEFORE LIBC latency : 64.888 LIBC latency : 61.027 (with `-msse4.2` flag) LIBC latency : 48.778 (with `-mfma` flag) AFTER LIBC latency : 48.803 LIBC latency : 45.047 (with `-msse4.2` flag) LIBC latency : 37.487 (with `-mfma` flag) ``` Reviewed By: sivachandra, orex Differential Revision: https://reviews.llvm.org/D133870	2022-09-14 14:44:25 -04:00
Siva Chandra	cae9c64f08	[libc][Obvious] Fix typo in the alternate path of the POSIX "access" function.	2022-09-14 01:03:07 -07:00
Siva Chandra Reddy	419580c699	[libc] Add implementation of POSIX function "access". Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D133814	2022-09-14 07:44:47 +00:00
Siva Chandra Reddy	ef3e80b6bd	[libc][Obvious] Use unique test file names in dup, dup2 and dup3 tests.	2022-09-13 18:20:04 +00:00
Siva Chandra Reddy	8989aa003f	[libc] Add POSIX functions dup, dup2, and GNU extension function dup3. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D133748	2022-09-13 18:06:30 +00:00
Tue Ly	463dcc8749	[libc][math] Implement acosf function correctly rounded for all rounding modes. Implement acosf function correctly rounded for all rounding modes. We perform range reduction as follows: - When `\|x\| < 2^(-10)`, we use cubic Taylor polynomial: ``` acos(x) = pi/2 - asin(x) ~ pi/2 - x - x^3 / 6. ``` - When `2^(-10) <= \|x\| <= 0.5`, we use the same approximation that is used for `asinf(x)` when `\|x\| <= 0.5`: ``` acos(x) = pi/2 - asin(x) ~ pi/2 - x - x^3 * P(x^2). ``` - When `0.5 < x <= 1`, we use the double angle formula: `cos(2y) = 1 - 2 * sin^2 (y)` to reduce to: ``` acos(x) = 2 * asin( sqrt( (1 - x)/2 ) ) ``` - When `-1 <= x < -0.5`, we reduce to the positive case above using the formula: ``` acos(x) = pi - acos(-x) ``` Performance benchmark using perf tool from the CORE-MATH project on Ryzen 1700: ``` $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh acosf GNU libc version: 2.35 GNU libc release: stable CORE-MATH reciprocal throughput : 28.613 System LIBC reciprocal throughput : 29.204 LIBC reciprocal throughput : 24.271 $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh asinf --latency GNU libc version: 2.35 GNU libc release: stable CORE-MATH latency : 55.554 System LIBC latency : 76.879 LIBC latency : 62.118 ``` Reviewed By: orex, zimmermann6 Differential Revision: https://reviews.llvm.org/D133550	2022-09-09 09:55:30 -04:00
Tue Ly	e2f065c2a3	[libc][math] Implement asinf function correctly rounded for all rounding modes. Implement asinf function correctly rounded for all rounding modes. For `\|x\| <= 0.5`, we approximate `asin(x)` by ``` asin(x) = x * P(x^2) ``` where `P(X^2) = Q(X)` is a degree-20 minimax even polynomial approximating `asin(x)/x` on `[0, 0.5]` generated by Sollya with: ``` > Q = fpminimax(asin(x)/x, [\|0, 2, 4, 6, 8, 10, 12, 14, 16, 18, 20\|], [\|1, D...\|], [0, 0.5]); ``` When `\|x\| > 0.5`, we perform range reduction as follow: Assume further that `0.5 < x <= 1`, and let: ``` y = asin(x) ``` We will use the double angle formula: ``` cos(2X) = 1 - 2 sin^2(X) ``` and the complement angle identity: ``` x = sin(y) = cos(pi/2 - y) = 1 - 2 sin^2 (pi/4 - y/2) ``` So: ``` sin(pi/4 - y/2) = sqrt( (1 - x)/2 ) ``` And hence: ``` pi/4 - y/2 = asin( sqrt( (1 - x)/2 ) ) ``` Equivalently: ``` asin(x) = y = pi/2 - 2 * asin( sqrt( (1 - x)/2 ) ) ``` Let `u = (1 - x)/2`, then ``` asin(x) = pi/2 - 2 * asin(u) ``` Moreover, since `0.5 < x <= 1`, ``` 0 <= u < 1/4, and 0 <= sqrt(u) < 0.5. ``` And hence we can reuse the same polynomial approximation of `asin(x)` when `\|x\| <= 0.5`: ``` asin(x) = pi/2 - 2 * u * P(u^2). ``` Performance benchmark using `perf` tool from the CORE-MATH project on Ryzen 1700: ``` $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh asinf CORE-MATH reciprocal throughput : 23.418 System LIBC reciprocal throughput : 27.310 LIBC reciprocal throughput : 22.741 $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh asinf --latency GNU libc version: 2.35 GNU libc release: stable CORE-MATH latency : 58.884 System LIBC latency : 62.055 LIBC latency : 62.037 ``` Reviewed By: orex, zimmermann6 Differential Revision: https://reviews.llvm.org/D133400	2022-09-07 19:27:47 -04:00
Tue Ly	bb6966aa53	[libc] Return correct values for hypot when overflowed. Hypot incorrectly returns +Inf when overflowed with FE_DOWNWARD and FE_TOWARDZERO rounding modes. Reviewed By: sivachandra, zimmermann6 Differential Revision: https://reviews.llvm.org/D133370	2022-09-07 19:23:11 -04:00
Alex Brachet	5c78c154df	[libc][NFC] clang-format	2022-09-02 21:17:34 +00:00
Alex Brachet	e66a1a5a39	[libc][NFC] Use no_sanitize("all") This function cannot have any instrumentation because it's assembly must match exactly what the debugger is expecting. Previously it was just a list of what sanitizers we expect libc would be sanitized with but this is untenable.	2022-09-02 19:07:39 +00:00
Jeff Bailey	0dcbe0e1df	[libc] Add Buildbot to External Links Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D133186	2022-09-02 14:11:09 +00:00
Tue Ly	a4d48e3b0b	[libc][NFC] Use cpp::optional for checking exceptional values of math functions. Update the utility functions for checking exceptional values of math functions to use cpp::optional return values. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D133134	2022-09-01 17:39:12 -04:00
Michael Jones	cb84721c3c	[libc] add division, modulo, and power to UInt This adds division and power implementations to UInt. Modulo and division are handled by the same function. These are necessary for some higher order mathematics, often involving large floating point numbers. Reviewed By: sivachandra, lntue Differential Revision: https://reviews.llvm.org/D132184	2022-09-01 11:22:26 -07:00
Michael Jones	fe41529755	[libc] move builtin_wrappers out of fputil builtin_wrappers contains the wrappers for the clz builtins, which do not depend on anything in fputil. This patch moves the file out of FPUtil. The location is updated as appropriate. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D133035	2022-09-01 11:17:00 -07:00
Siva Chandra Reddy	4c810ecb68	[libc] Add arm-32 syscall implementation. The implementation currently supports only non-thumb mode. As a test for the implementation, mmap and munmap functions have been enabled. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D132825	2022-08-31 18:10:40 +00:00
Dong-hee Na	e75cce76ab	[libc] Fix typo in lseek syscall number macro. Differential Revision: https://reviews.llvm.org/D133022	2022-08-31 17:50:24 +00:00
Michael Jones	9ac66f0650	[libc][cmake] split fputil into individual targets The libc.src.__support.FPUtil.fputil target encompassed many unrelated files, and provided a lot of hidden dependencies. This patch splits out all of these files into component parts and cleans up the cmake files that used them. It does not touch any source files for simplicity, but there may be changes made to them in future patches. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D132980	2022-08-31 10:44:52 -07:00
Tue Ly	647b190a5c	[libc][doc] Update implementation status of atanf and atanhf.	2022-08-31 01:27:23 -04:00
Kirill Okhotnikov	8c9d685fa8	[libc][math] Fix broken atan function.	2022-08-30 23:04:00 +02:00
Kirill Okhotnikov	6c75240da0	[libc][math] Fix broken tests.	2022-08-30 22:59:47 +02:00
Kirill Okhotnikov	77e1d9beed	[libc][math] Added atanf function. Performance by core-math (core-math/glibc 2.31/current llvm-14): 28.879/20.843/20.15 Differential Revision: https://reviews.llvm.org/D132842	2022-08-30 22:39:54 +02:00
Kirill Okhotnikov	6c1fc7e430	[libc][math] Added atanhf function. Performance by core-math (core-math/glibc 2.31/current llvm-14): 10.845/43.174/13.467 The review is done on top of D132809. Differential Revision: https://reviews.llvm.org/D132811	2022-08-30 22:39:54 +02:00
Kirill Okhotnikov	89ed5b7c50	[libc][math] Added auxiliary function log2_eval for asinhf/acoshf/atanhf. 1) `double log2_eval(double)` function added with better than float precision is added. 2) Some refactoring done to put all auxiliary functions and corresponding data to one place to reuse the code. 3) Added tests for new functions. 4) Performance and precision tests of the function shows, that it more precise than exiting log2, (no exceptional cases), but timing is ~5% higer that on current one. Differential Revision: https://reviews.llvm.org/D132809	2022-08-30 22:39:54 +02:00
Siva Chandra Reddy	3a5ec60ae3	[libc][Obvious] Re-enable math unit tests.	2022-08-29 04:29:22 +00:00
Siva Chandra Reddy	f6506ec443	[libc] Implement POSIX truncate and ftruncate functions for Linux. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D132705	2022-08-26 19:27:24 +00:00
Siva Chandra Reddy	b8be3dabde	[libc] Add Linux implementation of GNU extension function sendfile. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D132721	2022-08-26 19:13:40 +00:00
Michael Jones	ecca895295	[libc] add compile option for printf arg type array This patch allows for adjusting the size of the array that printf uses to track the types of arguments in index mode. This is useful for optimizing the tradeoff between memory usage and performance. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D131993	2022-08-26 11:38:04 -07:00
Siva Chandra Reddy	00e51f04e8	[libc] Implement linux link, linkat, symlink, symlinkat, readlink, readlinkat. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D132619	2022-08-25 18:50:39 +00:00
Guillaume Chatelet	9d239b37f7	[NFC][libc] Move Uint implementation to parent directory Differential Revision: https://reviews.llvm.org/D132638	2022-08-25 12:42:06 +00:00
Guillaume Chatelet	6ad8c1f076	[NFC][libc] Fix unused variable in string_writer_test test	2022-08-25 10:19:59 +02:00
Guillaume Chatelet	5a5f02f0e9	[NFC][libc] Remove double semicolon	2022-08-25 10:17:12 +02:00
Siva Chandra Reddy	85dff76416	[libc] Add linux implementation of POSIX fchmodat function. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D132533	2022-08-24 18:46:29 +00:00
Michael Jones	096463d08e	[libc] move printf to use StringViews The FormatSection and the writer functions both previously took a char* and a length to represent a string. Now they use the StringView class to represent that more succinctly. This change also required fixing everywhere these were used, so it touches a lot of files. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D131994	2022-08-24 10:28:31 -07:00
Siva Chandra	d00e97df0f	[libc][Obvious] Fix typo is chmod implementation. This now allows enabling the chmod function on aarch64.	2022-08-23 15:01:21 -07:00
Siva Chandra	8856137ce7	[libc] Enable a few entrypoints on aarch64 which are now available on x86_64.	2022-08-23 12:31:42 -07:00
Siva Chandra Reddy	055322891c	[libc] Add Linux implementations of POSIX chdir, fchdir, chmod and fchmod. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D132445	2022-08-23 17:16:49 +00:00
Guillaume Chatelet	1e5b3ce707	[reland][NFC][libc] standardize string_view	2022-08-23 12:40:49 +00:00
Guillaume Chatelet	eebe9b2964	Revert "[reland][NFC][libc] standardize string_view" This reverts commit `df99774ef7`.	2022-08-23 12:40:48 +00:00
Guillaume Chatelet	df99774ef7	[reland][NFC][libc] standardize string_view	2022-08-23 12:10:28 +00:00
Guillaume Chatelet	54cfe5f778	Revert "[reland][NFC][libc] standardize string_view" This reverts commit `522d29a6a7`.	2022-08-23 11:50:56 +00:00
Guillaume Chatelet	522d29a6a7	[reland][NFC][libc] standardize string_view	2022-08-23 11:48:53 +00:00
Guillaume Chatelet	0df7e1b0e5	Revert "[reland][NFC][libc] standardize string_view" This reverts commit `187099da1c`.	2022-08-23 11:00:22 +00:00
Guillaume Chatelet	187099da1c	[reland][NFC][libc] standardize string_view	2022-08-23 10:55:57 +00:00
Guillaume Chatelet	abed700019	Revert "[NFC][libc] standardize string_view" This reverts commit `abe0fa6b17`.	2022-08-23 10:43:22 +00:00
Guillaume Chatelet	abe0fa6b17	[NFC][libc] standardize string_view	2022-08-23 09:57:20 +00:00
Siva Chandra Reddy	3496dd3bb6	[libc][Obvious] Move some macros definitions to sys/stat.h. They were previously incorrectly listed in fcntl.h	2022-08-22 23:19:18 +00:00
Guillaume Chatelet	41d4067d0b	[NFC][libc] clarify CPP folder documentation	2022-08-22 15:41:41 +00:00
Guillaume Chatelet	aa59c9810a	[libc][NFC] Use STL case for string_view	2022-08-22 15:25:14 +00:00
Guillaume Chatelet	c860fd3f25	[NFC][libc] Remove ArrayRef which is replaced with span	2022-08-22 10:52:28 +00:00
Guillaume Chatelet	4d2319123a	[NFC][libc] Switch File impl to span instead of ArrayRef	2022-08-22 10:27:07 +00:00
Guillaume Chatelet	22bc0fde55	[NFC][libc] Switch mem* tests from ArrayRef to span	2022-08-22 09:58:35 +00:00
Guillaume Chatelet	f890f80d67	[libc] Allow span views over mutable data	2022-08-22 09:43:08 +00:00
Guillaume Chatelet	e5d5146323	[libc] Allow construction of const span from mutable span	2022-08-22 08:54:31 +00:00
Guillaume Chatelet	a1c42cb803	[reland][NFC][libc] Use span instead of ArrayRef	2022-08-22 08:22:38 +00:00
Guillaume Chatelet	70ca9a6600	Revert "[NFC][libc] Use span instead of ArrayRef" This reverts commit `e7b250b8a6`.	2022-08-22 08:14:22 +00:00
Guillaume Chatelet	e7b250b8a6	[NFC][libc] Use span instead of ArrayRef	2022-08-22 08:09:57 +00:00
Guillaume Chatelet	5541e6bc05	[libc] Add a span facility This is intended to replace ArrayRef. Differential Revision: https://reviews.llvm.org/D132254	2022-08-22 07:27:00 +00:00
Guillaume Chatelet	f00567eceb	[libc][NFC] Move blockstore out of the CPP folder	2022-08-19 21:51:53 +00:00
Guillaume Chatelet	e2d7975843	[libc][NFC] Use STL case for bitset and simplify implementation	2022-08-19 21:39:14 +00:00
Guillaume Chatelet	d769cd8cdc	[reland][libc][NFC] Use STL case for bit	2022-08-19 21:26:26 +00:00
Guillaume Chatelet	175560fc21	Revert "[libc][NFC] Use STL case for bit" This reverts commit `7ba14b8611`.	2022-08-19 21:17:48 +00:00
Guillaume Chatelet	7ba14b8611	[libc][NFC] Use STL case for bit	2022-08-19 21:10:27 +00:00
Siva Chandra Reddy	4d97766316	[libc] Install startup files with the install-libc target. Differential Revision: https://reviews.llvm.org/D132212	2022-08-19 20:15:16 +00:00
Guillaume Chatelet	7799b5c0e6	[reland][NFC][libc] rearrange aarch64 memset code to better match new implementation Differential Revision: https://reviews.llvm.org/D132121	2022-08-19 15:31:45 +00:00
Guillaume Chatelet	31fbcccb31	Revert "[NFC][libc] rearrange aarch64 memset code to better match new implementation" This reverts commit `4d931b6e1e`.	2022-08-19 15:02:16 +00:00
Guillaume Chatelet	4d931b6e1e	[NFC][libc] rearrange aarch64 memset code to better match new implementation Differential Revision: https://reviews.llvm.org/D132121	2022-08-19 14:48:17 +00:00
Siva Chandra Reddy	84517b6fb1	[libc] Add more headers to the linux x86_64 and aarch64 configs.	2022-08-19 07:15:08 +00:00
Siva Chandra Reddy	857286f757	[libc][NFC] Change the libc fullbuild target name to "libc".	2022-08-19 07:07:30 +00:00
John Ericson	e941b031d3	Revert "[cmake] Use `CMAKE_INSTALL_LIBDIR` too" This reverts commit `f7a33090a9`. Unfortunately this causes a number of failures that didn't show up in my local build.	2022-08-18 22:46:32 -04:00
John Ericson	f7a33090a9	[cmake] Use `CMAKE_INSTALL_LIBDIR` too We held off on this before as `LLVM_LIBDIR_SUFFIX` conflicted with it. Now we return this. `LLVM_LIBDIR_SUFFIX` is kept as a deprecated way to set `CMAKE_INSTALL_LIBDIR`. The other `*_LIBDIR_SUFFIX` are just removed entirely. I imagine this is too potentially-breaking to make LLVM 15. That's fine. I have a more minimal version of this in the disto (NixOS) patches for LLVM 15 (like previous versions). This more expansive version I will test harder after the release is cut. Reviewed By: sebastian-ne, ldionne, #libc, #libc_abi Differential Revision: https://reviews.llvm.org/D130586	2022-08-18 15:33:35 -04:00
Vitaly Buka	4e5d64e83e	[NFC][libc] Detect HWASAN support macro	2022-08-18 10:19:18 -07:00
Siva Chandra Reddy	d20e632853	[libc][Obvious] Update the static archive target name for libc-api-test.	2022-08-17 23:18:33 +00:00
Siva Chandra Reddy	e5edd74bb7	[libc] Add a target to install libc in the full build mode. * In the full build mode, `ninja install-libc` will install the headers as well the static archive named libc.a. * In the default mode, `ninja install-llvmlibc` will only install the static archive libllvmlibc.a. Reviewed By: jeffbailey Differential Revision: https://reviews.llvm.org/D132015	2022-08-17 22:49:02 +00:00
Siva Chandra Reddy	eb0715e3c7	[libc][NFC] Make IntegerToString simpler to use at call-sites. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D131943	2022-08-17 07:57:06 +00:00
Siva Chandra Reddy	8e99ad6546	[libc][Obvious] Rearrange few header targets to satisfy dependency order.	2022-08-16 22:33:45 +00:00
Siva Chandra Reddy	e17ff7dd2a	[libc][Obvious] Convert an add_header target to add_header_library target.	2022-08-16 20:33:24 +00:00
Michael Jones	438d1f18a5	[libc] add guard for file pieces of printf In the printf_core CMake, the file pieces are defined as object libraries that depend on the File data structure. If these are added unconditionally they'll try to evaluate that dependancy even when there is no File available. This patch adds a guard to prevent that error. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D131921	2022-08-15 15:43:24 -07:00
Michael Jones	e0e7fa36d3	[libc] enable s(n)printf without fullbuild To use the FILE data structure, LLVM-libc must be in fullbuild mode since it expects its own implementation. This means that (f)printf can't be used without fullbuild, but s(n)printf only uses strings. This patch adjusts the CMake to allow for this. Reviewed By: sivachandra, lntue Differential Revision: https://reviews.llvm.org/D131913	2022-08-15 13:45:34 -07:00
Michael Jones	0e27dfd560	[libc] add sprintf size comparison To accurately measure the size of sprintf in a finished binary, the easiest method is to simply build a binary with and without sprintf. This patch adds an integration test that can be built with and without sprintf, as well as targets to build it. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D131735	2022-08-12 11:46:55 -07:00
Siva Chandra Reddy	e4efe2cfc6	[libc] Pack memory functions in the integration test libc.a files. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D131752	2022-08-12 17:27:26 +00:00
Tue Ly	af59bac4ca	[libc] Add missing header and Windows entrypoints for tanf.	2022-08-12 09:34:33 -04:00
Tue Ly	82d6e77048	[libc] Implement tanf function correctly rounded for all rounding modes. Implement tanf function correctly rounded for all rounding modes. We use the range reduction that is shared with `sinf`, `cosf`, and `sincosf`: ``` k = round(x * 32/pi) and y = x * (32/pi) - k. ``` Then we use the tangent of sum formula: ``` tan(x) = tan((k + y)* pi/32) = tan((k mod 32) * pi / 32 + y * pi/32) = (tan((k mod 32) * pi/32) + tan(y * pi/32)) / (1 - tan((k mod 32) * pi/32) * tan(y * pi/32)) ``` We need to make a further reduction when `k mod 32 >= 16` due to the pole at `pi/2` of `tan(x)` function: ``` if (k mod 32 >= 16): k = k - 31, y = y - 1.0 ``` And to compute the final result, we store `tan(k * pi/32)` for `k = -15..15` in a table of 32 double values, and evaluate `tan(y * pi/32)` with a degree-11 minimax odd polynomial generated by Sollya with: ``` > P = fpminimax(tan(y * pi/32)/y, [\|0, 2, 4, 6, 8, 10\|], [\|D...\|], [0, 1.5]); ``` Performance benchmark using `perf` tool from the CORE-MATH project on Ryzen 1700: ``` $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh tanf CORE-MATH reciprocal throughput : 18.586 System LIBC reciprocal throughput : 50.068 LIBC reciprocal throughput : 33.823 LIBC reciprocal throughput : 25.161 (with `-msse4.2` flag) LIBC reciprocal throughput : 19.157 (with `-mfma` flag) $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh tanf --latency GNU libc version: 2.31 GNU libc release: stable CORE-MATH latency : 55.630 System LIBC latency : 106.264 LIBC latency : 96.060 LIBC latency : 90.727 (with `-msse4.2` flag) LIBC latency : 82.361 (with `-mfma` flag) ``` Reviewed By: orex Differential Revision: https://reviews.llvm.org/D131715	2022-08-12 09:21:05 -04:00
Siva Chandra Reddy	4a738ee822	[libc] Add implemementations of thread specific data related API. Specifically, POSIX functions pthread_key_create, pthread_key_delete, pthread_setspecific and pthread_getspecific have been added. The C standard equivalents tss_create, tss_delete, tss_set and tss_get have also been added. Reviewed By: lntue, michaelrj Differential Revision: https://reviews.llvm.org/D131647	2022-08-12 05:28:40 +00:00
Michael Jones	5e22ef3198	[libc] move int conversion out of base template The convert_alpha_numeric function is intentionally non-templated so that its code can be reused for different bases in code-size sensitive cases. Previously it was inside the IntegerToString class which created a different version for each base. Reviewed By: sivachandra, lntue Differential Revision: https://reviews.llvm.org/D131725	2022-08-11 16:56:27 -07:00
Siva Chandra Reddy	57afb48057	[libc][Obvious] Fix thrd_join's first arg. First argument to thrd_join was incorrectly listed as a pointer to a thrd_t value. It should instead be a thrd_t value argument.	2022-08-11 19:18:11 +00:00
Tue Ly	42f183792c	[libc] Change sinf/cosf range reduction to mod pi/32 to be shared with tanf. Change sinf/cosf range reduction to mod pi/32 to be shared with tanf, since polynomial approximations for tanf on subintervals of length pi/16 do not provide enough accuracy. Reviewed By: orex Differential Revision: https://reviews.llvm.org/D131652	2022-08-11 09:41:45 -04:00
Michael Jones	3510082a1c	[libc] move to combined integer converter The functions converting integers into decimal, hexadecimal, and octal, are all very similar. This patch moves to a combined converter to save code size. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D131302	2022-08-10 14:48:30 -07:00
Michael Jones	42e2946f51	[libc][NFC] add arrow operator to optional Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D131610	2022-08-10 14:47:04 -07:00
Michael Jones	276e108bf9	[libc][NFC] clean int to string static templates Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D131611	2022-08-10 13:11:29 -07:00
Michael Jones	23ace05e0a	[libc] add int to string for extended bases The default IntegerToString class only supports base 10, this patch adds a version which supports any base between 2 and 36 inclusive. This will be used in an upcoming patch. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D131301	2022-08-10 11:28:14 -07:00
Michael Jones	5dd8553423	[libc] Support StringView equality tests Previously, the integer_to_string tests used EXPECT_TRUE(.equals) which doesn't have useful error messages. Now they properly check equality with the EXPECT_EQ macro, which allows for comparing the strings more naturally. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D131300	2022-08-10 11:23:53 -07:00
Siva Chandra Reddy	0071a79532	[libc] Add implementation of pthread_exit and thrd_exit. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D131451	2022-08-10 06:28:47 +00:00
Siva Chandra Reddy	80652d270a	[libc][NFC] Fix a few compiler warnings.	2022-08-09 22:30:40 +00:00
Siva Chandra Reddy	c6dcc359ac	[libc] Add __cxa_atexit support to the atexit function. Reviewed By: abrachet Differential Revision: https://reviews.llvm.org/D131219	2022-08-09 21:42:41 +00:00
Siva Chandra Reddy	daa44a2309	[libc] Add a utility data structure named FixedVector. This data structure uses a backing cpp::array object and supports a vector like push_back API. In comparison with a traditional vector data structure, it is of a fixed capacity and cannot be resized. Differential Revision: https://reviews.llvm.org/D131377	2022-08-09 21:29:54 +00:00
Jeff Bailey	7889c41938	[libc] Website fixes (sidebar and mobile) Add "using" and "status" sections to the sidebar to make getting these easier. Fixed mobile formatting not overflow left and right. Tested: Chrome on Desktop, using mobile restrictions in devtools. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D131369	2022-08-08 18:38:01 +00:00
Jeff Bailey	f493b21e16	[libc] Update look and feel of libc.llvm.org This design is borrowed from the lldb folks (thank you!) to declutter the page. * The version number at the top is removed. * Links are pushed over to a sidebar * The sidebar has headings There are other minor changes: * The warning about this project not being ready is now an RST "warning" * Links to the Bug Reports and the Source Code are Added * Refer to this project as either "The LLVM C LIbrary" or "The libc" Tested: Built locally Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D131242	2022-08-05 18:18:40 +00:00
Tue Ly	131dda9acc	[libc] Implement sincosf function correctly rounded to all rounding modes. Refactor common range reductions and evaluations for sinf, cosf, and sincosf. Added exhaustive tests for sincosf. Performance before the patch: ``` System LIBC reciprocal throughput : 30.205 LIBC reciprocal throughput : 30.533 System LIBC latency : 67.961 LIBC latency : 61.564 ``` Performance after the patch: ``` System LIBC reciprocal throughput : 30.409 LIBC reciprocal throughput : 20.273 System LIBC latency : 67.527 LIBC latency : 61.959 ``` Reviewed By: orex Differential Revision: https://reviews.llvm.org/D130901	2022-08-05 09:58:01 -04:00
Jeff Bailey	3b631e47fe	[libc] Trivial implementation of std::optional This class has only the minimum functionality in it to provide what the TZ variable parsing needs. In particular, the standard makes guarantees about how trivial the destructors are, throws an expception if it's used incorrectly, etc. There are also missing features. Tested: Trivial testsuite added, and use in development. Reviewed By: gchatelet Differential Revision: https://reviews.llvm.org/D129920	2022-08-05 02:51:44 +00:00
Tue Ly	c308a88716	[libc] Add subtraction for UInt<N> class. Add subtraction operators (-, -=) for UInt<N> class. Reviewed By: michaelrj, orex Differential Revision: https://reviews.llvm.org/D131196	2022-08-04 20:37:43 -04:00
Siva Chandra Reddy	2e4ef9b6ef	[libc][NFC] Add a few compiler warning flags. A bunch of cleanup to supress the new warnings is also done. Reviewed By: abrachet Differential Revision: https://reviews.llvm.org/D130723	2022-08-04 23:46:38 +00:00
Tue Ly	d1a9ba7b67	[libc] Prevent overflow from intermediate results when adding UInt<N> values. Prevent overflow from intermediate results when adding UInt<N> values. Reviewed By: orex Differential Revision: https://reviews.llvm.org/D131095	2022-08-04 15:01:16 -04:00
Siva Chandra	c9e1ecd46d	[libc] Adjust aarch64 startup code to reliably sniff out the start args. Since this now allows, the init/fini array iteration has been added in a similar fashion to x86_64 and the corresponding test enabled. Reviewed By: jeffbailey Differential Revision: https://reviews.llvm.org/D131133	2022-08-03 23:12:02 -07:00
Siva Chandra	f0bb276794	[libc] Disable the loader init_fini_array_test for non-x86_64 targets. Loader for aarch64 will be extended in a future pass and this test will be enabled back again.	2022-08-03 12:06:13 -07:00
Siva Chandra Reddy	f23076f6c5	[libc] Add init and fini array iteration to the loader. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D130966	2022-08-03 17:53:23 +00:00
Siva Chandra Reddy	658c84e415	[libc] Add GNU extension functions pthread_setname_np and pthread_getname_np. Reviewed By: michaelrj, lntue Differential Revision: https://reviews.llvm.org/D130872	2022-08-02 01:57:03 +00:00
Tue Ly	69cc240534	[libc][doc] Update implementation status of tanhf.	2022-08-01 17:45:40 -04:00
Kirill Okhotnikov	2a558b6bff	[libc][math] Build fail fix: obsolete header.	2022-08-01 22:55:15 +02:00
Kirill Okhotnikov	5ef987c985	[libc][math] Added tanhf function. Correct rounding function. Performance ~2x faster than glibc analog. Performance (llvm 12 intel): ``` CORE_MATH_PERF_MODE=rdtsc PERF_ARGS='' ./perf.sh tanhf GNU libc version: 2.31 GNU libc release: stable 13.279 37.492 18.145 CORE_MATH_PERF_MODE=rdtsc PERF_ARGS='--latency' ./perf.sh tanhf GNU libc version: 2.31 GNU libc release: stable 40.658 109.582 66.568 ``` Differential Revision: https://reviews.llvm.org/D130780	2022-08-01 22:43:00 +02:00
Guillaume Chatelet	49eb58063f	[libc][NFC] Use STL case for utility Migrating all private STL code to the standard STL case but keeping it under the CPP namespace to avoid confusion. Differential Revision: https://reviews.llvm.org/D130771	2022-08-01 09:27:37 +00:00
Guillaume Chatelet	91eb0b6584	[libc][NFC] Use STL case for limits Migrating all private STL code to the standard STL case but keeping it under the CPP namespace to avoid confusion. Differential Revision: https://reviews.llvm.org/D130762	2022-08-01 09:18:25 +00:00
Guillaume Chatelet	3f3bbd7370	[libc][NFC] Use STL case for functional Migrating all private STL code to the standard STL case but keeping it under the CPP namespace to avoid confusion. Differential Revision: https://reviews.llvm.org/D130760	2022-08-01 09:10:59 +00:00
Guillaume Chatelet	d3d498fbf6	Reland [libc][NFC] Use STL case for array This is a reland of https://reviews.llvm.org/D130773	2022-08-01 08:47:27 +00:00
Guillaume Chatelet	de00bd573e	Revert "[libc][NFC] Use STL case for array" This reverts commit `7add0e5fdc`.	2022-08-01 08:44:52 +00:00
Guillaume Chatelet	7add0e5fdc	[libc][NFC] Use STL case for array Migrating all private STL code to the standard STL case but keeping it under the CPP namespace to avoid confusion. Differential Revision: https://reviews.llvm.org/D130773	2022-08-01 08:43:05 +00:00
Siva Chandra Reddy	89e5c20ca9	[libc][NFC] Mark a constructor of StringView to be constexpr.	2022-08-01 07:13:20 +00:00
Siva Chandra Reddy	f1d274b5c6	[libc][NFC] Make the buffer size of the integer to string converter public. This allows users of the IntegerToString class to size their buffers appropriately at compile time.	2022-08-01 06:54:17 +00:00
Siva Chandra Reddy	d52e8708ae	[libc][NFC] Mark a few methods of StringView to constexpr.	2022-08-01 06:06:49 +00:00
Tue Ly	17df74214c	[libc][doc] Update implementation status of exp2f, sinhf, and coshf.	2022-07-31 16:32:21 -04:00
Siva Chandra Reddy	12df3080fe	[libc] Compile integration tests with -ffreestanding to avoid mixup with system libc.	2022-07-30 03:06:08 +00:00
Tue Ly	2ff187fbc9	[libc] Implement cosf function that is correctly rounded to all rounding modes. Implement cosf function that is correctly rounded to all rounding modes. Performance benchmark using perf tool from CORE-MATH project (https://gitlab.inria.fr/core-math/core-math/-/tree/master) on Ryzen 1700: Before this patch (not correctly rounded): ``` $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh cosf CORE-MATH reciprocal throughput : 19.043 System LIBC reciprocal throughput : 26.328 LIBC reciprocal throughput : 30.955 $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh cosf --latency GNU libc version: 2.31 GNU libc release: stable CORE-MATH latency : 49.995 System LIBC latency : 59.286 LIBC latency : 60.174 ``` After this patch (correctly rounded): ``` $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh cosf GNU libc version: 2.31 GNU libc release: stable CORE-MATH reciprocal throughput : 19.072 System LIBC reciprocal throughput : 26.286 LIBC reciprocal throughput : 13.631 $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh cosf --latency GNU libc version: 2.31 GNU libc release: stable CORE-MATH latency : 49.872 System LIBC latency : 59.468 LIBC latency : 56.119 ``` Reviewed By: orex, zimmermann6 Differential Revision: https://reviews.llvm.org/D130644	2022-07-29 21:08:31 -04:00
Kirill Okhotnikov	a7f55f0805	[libc][math] Added sinhf function. Differential Revision: https://reviews.llvm.org/D129278	2022-07-29 17:20:53 +02:00
Kirill Okhotnikov	fcb9d7e2cf	[libc][math] Added coshf function. Differential Revision: https://reviews.llvm.org/D129275	2022-07-29 16:57:28 +02:00
Guillaume Chatelet	0c3037dfc5	[libc] Fix prototype_test_gen	2022-07-29 10:18:54 +00:00
Guillaume Chatelet	e5e0e7963c	[libc] Fix prototype_test_gen	2022-07-29 10:07:34 +00:00
Guillaume Chatelet	039fb3e5a1	Fix typo in FPUtil/aarch64/FMA.h	2022-07-29 10:04:11 +00:00
Guillaume Chatelet	f72261508a	[libc][NFC] Use STL case for type_traits Migrating all private STL code to the standard STL case but keeping it under the CPP namespace to avoid confusion. Starting with the type_traits header. Differential Revision: https://reviews.llvm.org/D130727	2022-07-29 09:57:03 +00:00
Kirill Okhotnikov	4b41e7b436	[libc][math] Universal exp function for cosh/sinh calculation. Added a function and test, which can be used later for cosh/sinh and possibly for expf/expm1f. Differential Revision: https://reviews.llvm.org/D129215	2022-07-28 11:24:09 +02:00
Kirill Okhotnikov	c78144e1c7	[libc][math] Improved performance of exp2f function. New exp2 function algorithm: 1) Improved performance: 8.176 vs 15.270 by core-math perf tool. 2) Improved accuracy. Only two special values left. 3) Lookup table size reduced twice. Differential Revision: https://reviews.llvm.org/D129005	2022-07-28 10:57:16 +02:00
Tue Ly	15b9380dfd	[libc] Change sinf range reduction to mod pi/16 to be shared with cosf. Change `sinf` range reduction to mod pi/16 to be shared with `cosf`. Previously, `sinf` used range reduction `mod pi`, but this cannot be used to implement `cosf` since the minimax algorithm for `cosf` does not converge due to critical points at `pi/2`. In order to be able to share the same range reduction functions for both `sinf` and `cosf`, we change the range reduction to `mod pi/16` for the following reasons: - The table size is sufficiently small: 32 entries for `sin(k * pi/16)` with `k = 0..31`. It could be reduced to 16 entries if we treat the final sign separately, with an extra multiplication at the end. - The polynomials' degrees are reduced to 7/8 from 15, with extra computations to combine `sin` and `cos` with trig sum equality. - The number of exceptional cases reduced to 2 (with FMA) and 3 (without FMA). - The latency is reduced while maintaining similar throughput as before. Reviewed By: zimmermann6 Differential Revision: https://reviews.llvm.org/D130629	2022-07-27 12:23:36 -04:00
Tue Ly	628fbbef81	[libc] Use nearest_integer instructions to improve expm1f performance. Use nearest_integer instructions to improve expf performance. Performance tests with CORE-MATH's perf tool: Before the patch: ``` $ ./perf.sh expm1f LIBC-location: /home/lnt/experiment/llvm/llvm-project/build/projects/libc/lib/libllvmlibc.a GNU libc version: 2.31 GNU libc release: stable CORE-MATH reciprocal throughput : 10.096 System LIBC reciprocal throughput : 44.036 LIBC reciprocal throughput : 11.575 $ ./perf.sh expm1f --latency LIBC-location: /home/lnt/experiment/llvm/llvm-project/build/projects/libc/lib/libllvmlibc.a GNU libc version: 2.31 GNU libc release: stable CORE-MATH latency : 42.239 System LIBC latency : 122.815 LIBC latency : 50.122 ``` After the patch: ``` $ ./perf.sh expm1f LIBC-location: /home/lnt/experiment/llvm/llvm-project/build/projects/libc/lib/libllvmlibc.a GNU libc version: 2.31 GNU libc release: stable CORE-MATH reciprocal throughput : 10.046 System LIBC reciprocal throughput : 43.899 LIBC reciprocal throughput : 9.179 $ ./perf.sh expm1f --latency LIBC-location: /home/lnt/experiment/llvm/llvm-project/build/projects/libc/lib/libllvmlibc.a GNU libc version: 2.31 GNU libc release: stable CORE-MATH latency : 42.078 System LIBC latency : 120.488 LIBC latency : 41.528 ``` Reviewed By: zimmermann6 Differential Revision: https://reviews.llvm.org/D130502	2022-07-26 09:12:37 -04:00
Tue Ly	91ee672062	[libc] Use nearest_integer instructions to improve expf performance. Use nearest_integer instructions to improve expf performance. Performance tests with CORE-MATH's perf tool: Before the patch: ``` $ ./perf.sh expf LIBC-location: /home/lnt/experiment/llvm-project/build/projects/libc/lib/libllvmlibc.a GNU libc version: 2.31 GNU libc release: stable CORE-MATH reciprocal throughput : 9.860 System LIBC reciprocal throughput : 7.728 LIBC reciprocal throughput : 12.363 $ ./perf.sh expf --latency LIBC-location: /home/lnt/experiment/llvm-project/build/projects/libc/lib/libllvmlibc.a GNU libc version: 2.31 GNU libc release: stable CORE-MATH latency : 42.802 System LIBC latency : 35.941 LIBC latency : 49.808 ``` After the patch: ``` $ ./perf.sh expf LIBC-location: /home/lnt/experiment/llvm/llvm-project/build/projects/libc/lib/libllvmlibc.a GNU libc version: 2.31 GNU libc release: stable CORE-MATH reciprocal throughput : 9.441 System LIBC reciprocal throughput : 7.382 LIBC reciprocal throughput : 8.843 $ ./perf.sh expf --latency LIBC-location: /home/lnt/experiment/llvm/llvm-project/build/projects/libc/lib/libllvmlibc.a GNU libc version: 2.31 GNU libc release: stable CORE-MATH latency : 44.192 System LIBC latency : 37.693 LIBC latency : 44.145 ``` Reviewed By: zimmermann6 Differential Revision: https://reviews.llvm.org/D130498	2022-07-26 09:11:27 -04:00
Siva Chandra Reddy	8fbc3c1179	[libc][Obvious] Use the correct StringView constructor in dirent_test.	2022-07-25 20:47:17 +00:00
Siva Chandra Reddy	35ea84ad6a	[libc] Add dirent.h functions opendir, readdir, closedir and dirfd. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D130459	2022-07-25 20:23:25 +00:00
Siva Chandra Reddy	5edc7ce235	[libc] Add a simple StringStream class. This class will be used in future changes to construct simple strings. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D130334	2022-07-25 18:27:29 +00:00
Tue Ly	83193a5e72	[libc] Raise FE_INVALID for sinf with +- inf inputs.	2022-07-25 10:20:32 -04:00
Michael Jones	3b0c78fe3b	[libc][nfc] move printf inf/nan to separate function The floating point functions all use the same inf and nan formatting. By separating this functionality out of the %a conversion I make it available for reuse by %f/e/g. Reviewed By: lntue, sivachandra Differential Revision: https://reviews.llvm.org/D129665	2022-07-22 10:29:35 -07:00
Alex Brachet	5e2d5071ff	[libc] Don't call user comparator function for equal pointers The standard says two equal pointers must compare equal so there is no need to call the user comparator function in this case. Differential Revision: https://reviews.llvm.org/D130310	2022-07-22 17:03:16 +00:00
Tue Ly	600172a72b	[libc] Temporarily disable arm32's sinf, cosf, sincosf entrypoints. With correctly rounded implementations, these functions will be tested for all rounding modes. Since fegetround and fesetround are not implemented for arm32, these tests will fail in all non-default rounding modes. We will re-enable these entrypoints and tests once fegetround and fesetround are implemented for arm32.	2022-07-22 10:46:23 -04:00
Tue Ly	d883a4ad02	[libc] Implement sinf function that is correctly rounded to all rounding modes. Implement sinf function that is correctly rounded to all rounding modes. - We use a simple range reduction for `pi/16 < \|x\|` : Let `k = round(x / pi)` and `y = (x/pi) - k`. So `k` is an integer and `-0.5 <= y <= 0.5`. Then ``` sin(x) = sin(ypi + kpi) = (-1)^(k & 1) * sin(ypi) ~ (-1)^(k & 1) y * P(y^2) ``` where `yP(y^2)` is a degree-15 minimax polynomial generated by Sollya with: ``` > P = fpminimax(sin(xpi)/x, [\|0, 2, 4, 6, 8, 10, 12, 14\|], [\|D...\|], [0, 0.5]); ``` - Performance benchmark using perf tool from CORE-MATH project (https://gitlab.inria.fr/core-math/core-math/-/tree/master) on Ryzen 1700: Before this patch (not correctly rounded): ``` $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh sinf CORE-MATH reciprocal throughput : 17.892 System LIBC reciprocal throughput : 25.559 LIBC reciprocal throughput : 29.381 ``` After this patch (correctly rounded): ``` $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh sinf CORE-MATH reciprocal throughput : 17.896 System LIBC reciprocal throughput : 25.740 LIBC reciprocal throughput : 27.872 LIBC reciprocal throughput : 20.012 (with `-msse4.2` flag) LIBC reciprocal throughput : 14.244 (with `-mfma` flag) ``` Reviewed By: zimmermann6 Differential Revision: https://reviews.llvm.org/D123154	2022-07-22 10:07:31 -04:00
Tue Ly	ed261e7106	[libc] Add float type and flag for nearest_integer to enable SSE4.2. Add float type and flag for nearest integer to automatically test with and without SSE4.2 flag. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D129916	2022-07-22 09:29:41 -04:00
Siva Chandra Reddy	7c666c14f8	[libc] Add a convenience class and function for integer to string conversion. Printf's integer converter has been modified to use the new converter. In future, it will be used to implement other parts of the libc. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D130227	2022-07-21 20:18:59 +00:00
Siva Chandra Reddy	f8322d1351	[libc] Add a method `find_last_of` to StringView. Reviewed By: jeffbailey Differential Revision: https://reviews.llvm.org/D130112	2022-07-19 22:43:43 +00:00
Michael Jones	bf7f01d857	[libc] fix strtofloatingpoint on rare edge case Currently, there are two string parsers that can be used in a call to strtofloatingpoint. There is the main parser used by Clinger's fast path and Eisel-Lemire, and the backup parser used by Simple Decimal Conversion. There was a bug in the backup parser where if the number had more than 800 digits (the size of the SDC buffer) before the decimal point, it would just ignore the digits after the 800th and not count them into the exponent. This patch fixes that issue and adds regression tests. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D130032	2022-07-18 14:23:33 -07:00
Jeff Bailey	8aad330eeb	[libc] Fix API for remove_{prefix, suffix} The API in StringView.h for remove_prefix was incorrect and was returning a new StringView rather than just altering the view. As part of this, also removed some of the safety features. The comment correctly noted that the behaviour is undefined in some cases, but the code and test cases checked for that. One caller was relying on the old behaviour, so fixed it and added some comments. Tested: check-libc llvmlibc libc-loader-tests Reviewed By: gchatelet Differential Revision: https://reviews.llvm.org/D129950	2022-07-18 14:40:09 +00:00
Kazu Hirata	8dfdb80f72	Ensure newlines at the end of files (NFC)	2022-07-17 15:37:45 -07:00
Michael Jones	3b3b816f29	[libc] add rounding modes to printf float conv This adds functionality for rounding towards negative inf, positive inf, and zero to the float hex conversion (%a). Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D129702	2022-07-15 11:11:32 -07:00
Michael Jones	9fa6a88a16	[libc][arm32] add string stdlib & math entrypoints This patch adds all the string and stdlib entrypoints, as well as a few math entrypoints to the arm32 build. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D129821	2022-07-15 10:10:58 -07:00
Siva Chandra	98fdabecf5	[libc] Enable a few stdlib and time functions on aarch64.	2022-07-14 14:37:50 -07:00
Siva Chandra	75a628925e	[libc] Enable few stdio functions on aarch64.	2022-07-14 13:40:48 -07:00
Siva Chandra	edee61b55c	[libc] Enable few pthread and threads functions on aarch64.	2022-07-14 13:25:21 -07:00
Siva Chandra Reddy	8dc42802f7	[libc] Add implementations of pthread_equal and pthread_self. Reviewed By: michaelrj, lntue Differential Revision: https://reviews.llvm.org/D129729	2022-07-14 20:12:35 +00:00
Tue Ly	0f782b84cb	[libc] Add nearest integer instructions to fputil. Add round to nearest integer instructions to fputil. This will be used in sinf implementation https://reviews.llvm.org/D123154 Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D129776	2022-07-14 13:20:35 -04:00
Jeff Bailey	897b80147d	Add support for three more string_view functions Add support for three more string_view functions 1) starts_with(char) 2) ends_with(char) 3) find_first_of(char, size_t) Reimplemented trim in terms of the new starts_with and ends_with. Tested: New unit tests. Reviewed By: gchatelet Differential Revision: https://reviews.llvm.org/D129618	2022-07-14 14:20:20 +00:00
Siva Chandra Reddy	5e61b9c556	[libc][NFC] Make all integration tests depend on the threads implementation. The integration tests use the loader which sets up the main thread's self object. So, all integration tests have to depend on the threads implementation.	2022-07-13 20:51:12 +00:00

... 2 3 4 5 6 ...

1266 Commits