Commits · 6858ae5f9203b27451c1e6b81a474122da8bebff · Libraries / SLEEF

Jan 26, 2021
- [Quad] Add 128bit dispatcher (#398) · 6858ae5f
  Naoki Shibata authored Jan 26, 2021
```
This patch adds a dispatcher for 128-bit wide vector functions in the quad library.
```
  6858ae5f
Jan 25, 2021
- [Quad] Add scalar dispatcher (#397) · 1c0d5b73
  Naoki Shibata authored Jan 25, 2021
```
This patch adds a dispatcher for the scalar functions in the quad library.
```
  1c0d5b73
Jan 19, 2021

Naoki Shibata authored Jan 19, 2021

With this patch, the inline headers can be generated with MinGW.
This patch also enables CI testing on MinGW build.

75b62aaf

Jan 18, 2021

[Quad] Define constants (#395) · 49703959

Naoki Shibata authored Jan 18, 2021

This is a combined patch including the following items.

* Define quad-precision constants in the header files
* Add macros for libquadmath compatibility
* Remove unions from helperpurec_scalar.h. Unions are removed from sleefquadinline_cuda.h, as a result.

49703959

Jan 17, 2021

[CUDA] Include purec header from CUDA (#394) · bf50d612

Naoki Shibata authored Jan 17, 2021

With this patch, the pure C inline header can be included from CUDA programs along with the CUDA inline header.

bf50d612

Jan 14, 2021
- [Cleanup] Eliminate warnings (#393) · 295a8b6c
  Naoki Shibata authored Jan 14, 2021
```
This is a combined patch for eliminating most of the warning messages.
```
  295a8b6c
Jan 13, 2021

[Cleanup] Conform strict aliasing rules (#392) · 1d66bbda

Naoki Shibata authored Jan 13, 2021

This is a combined patch for removing potential problems with the strict aliasing rule.
It also drops long double support for DFT.

1d66bbda

Jan 12, 2021
- [SVE] Workaround for ICE with GCC-10 & SVE (#391) · ea29e62f
  Naoki Shibata authored Jan 12, 2021
```
With this patch, `-fno-tree-vrp` compiler option is added if the SVE code is compiled with GCC.
```
  ea29e62f
Jan 08, 2021

Add preliminary support for iOS (#389) · e7b4784b

Naoki Shibata authored Jan 08, 2021

This patch adds support and build-only testing for iOS.
ios.toolchain.cmake is required for building, which can be downloaded from at https://github.com/leetal/ios-cmake

.

Co-authored-by: shibatch <shibatch.sf.net@gmail.com>

e7b4784b

Preliminary android support (#388) · f0b48888

Naoki Shibata authored Jan 08, 2021



This patch adds preliminary support and build-only testing for android OS.

Co-authored-by: shibatch <shibatch.sf.net@gmail.com>

f0b48888

Jan 07, 2021

[Quad] Add ldexp, ilogb, fma and hypot (#387) · 480b2827

Naoki Shibata authored Jan 07, 2021



This patch adds quad-precision ldexp, ilogb, fma and hypot.

Co-authored-by: shibatch <shibatch.sf.net@gmail.com>

480b2827

Jan 03, 2021

[Quad] Add frexp and modf · 497769eb

Naoki Shibata authored Jan 03, 2021



This patch adds quad-precision frexp and modf.

Co-authored-by: shibatch <shibatch.sf.net@gmail.com>

497769eb

Jan 02, 2021

[Quad] Add cbrt (#385) · f9d5b33d

Naoki Shibata authored Jan 02, 2021



This patch adds quad-precision cbrt function.

Co-authored-by: shibatch <shibatch.sf.net@gmail.com>

f9d5b33d

Jan 01, 2021

[Quad] another cleanup (#384) · 900b9e29

Naoki Shibata authored Jan 02, 2021



Another cleanup of the quad library

Co-authored-by: shibatch <shibatch.sf.net@gmail.com>

900b9e29

Dec 31, 2020
- [Quad] Add fmod and remainder (#383) · 40461463
  Naoki Shibata authored Jan 01, 2021
```
This patch adds quad-precision fmod and remainder.
```
  40461463
Dec 28, 2020

[Quad] Add nearest integer functions (#382) · 2fb91ef1

Naoki Shibata authored Dec 28, 2020



This patch adds quad-precision trunc, floor, ceil, round and rint.

Co-authored-by: shibatch <shibatch.sf.net@gmail.com>

2fb91ef1

Dec 27, 2020
- [Quad] Add pow and atan2 (#381) · a3d78aaf
  Naoki Shibata authored Dec 28, 2020
```
Co-authored-by: shibatch <shibatch.sf.net@gmail.com>
```
  a3d78aaf
Dec 25, 2020

[Quad] Add quad-precision inverse hyperbolic functions (#380) · 088e362e

Naoki Shibata authored Dec 25, 2020



This patch adds quad-precision asinh, acosh and atanh.

Co-authored-by: shibatch <shibatch.sf.net@gmail.com>

088e362e

Dec 23, 2020
- Introduce vcast_vm_i64 and vcast_vm_u64 · afd668f1
  Naoki Shibata authored Dec 23, 2020
```
Co-authored-by: shibatch <shibatch.sf.net@gmail.com>
```
  afd668f1
- [Arm Mac] Add Arm Mac build-only testing (#378) · 005e74ba
  Naoki Shibata authored Dec 23, 2020
```
This patch adds build-only testing to the Jenkins configuration.

Co-authored-by: shibatch <shibatch.sf.net@gmail.com>
```
  005e74ba
Dec 22, 2020

[Quad] Add hyperbolic functions (#377) · ba4a3b4c
Naoki Shibata authored Dec 22, 2020
```
* no message

* no message

Co-authored-by: shibatch <shibatch.sf.net@gmail.com>
```
ba4a3b4c

Make sleef compilable if CMAKE_OSX_ARCHITECTURES is passed (#376) · e0a003ee

Nikita Shulga authored Dec 21, 2020

* Simplify x86 arch check

* Fix sleef compilation when CMAKE_OSX_ARCHITECTURES is passed

Test plan:
  Run `cmake .. -DCMAKE_OSX_ARCHITECTURES=x86_64 -G Ninja; ninja` on M1 Mac
  Run `cmake .. -DCMAKE_OSX_ARCHITECTURES=arm64 -G Ninja; ninja` on x86 Mac

* Compile host executable as universable binaries on OS X
If multiarch option is passed

e0a003ee

Dec 21, 2020
- [Quad] Add fabsq (#375) · 80b994fc
  Naoki Shibata authored Dec 21, 2020
```
Co-authored-by: shibatch <shibatch.sf.net@gmail.com>
```
  80b994fc
Dec 20, 2020
- Cleanups (#374) · cb21c85a
  Naoki Shibata authored Dec 20, 2020
```
Co-authored-by: shibatch <shibatch.sf.net@gmail.com>
```
  cb21c85a
Dec 19, 2020

Small CUDA fixes · c6a21432

Naoki Shibata authored Dec 19, 2020

With this patch, double2 and float2 data types can be used instead of Sleef_double2 and Sleef_float2 for CUDA.
It also eliminates a need for including float.h when using the CUDA header file.

Co-authored-by: shibatch <shibatch.sf.net@gmail.com>

c6a21432

Dec 18, 2020

Fix vfma_pn functions for aarch64 · dfe0bd6b

Naoki Shibata authored Dec 18, 2020

This pull request is made following issue https://github.com/shibatch/sleef/issues/371

.

Co-authored-by: shibatch <shibatch.sf.net@gmail.com>

dfe0bd6b

Dec 16, 2020

Faster erf (#370) · c831a39a

Naoki Shibata authored Dec 16, 2020



This patch revises the algorithm for computing the error function.

Co-authored-by: shibatch <shibatch.sf.net@gmail.com>

c831a39a

Dec 09, 2020
- This patch adds s390x and ppc64le support to the benchmarking utility. · 43232ea3
  Naoki Shibata authored Dec 09, 2020
```
Co-authored-by: shibatch <shibatch.sf.net@gmail.com>
```
  43232ea3
- Move aarch32 to jenkins (#368) · 30547c46
  Naoki Shibata authored Dec 09, 2020
```
This patch moves AArch32 CI testing from travis to jenkins.

Co-authored-by: shibatch <shibatch.sf.net@gmail.com>
```
  30547c46
Dec 08, 2020

Add power9 support (#360) · be8783d9

Naoki Shibata authored Dec 09, 2020

This patch adds POWER9 support as mentioned in issue https://github.com/shibatch/sleef/issues/313

.

Co-authored-by: shibatch <shibatch.sf.net@gmail.com>

be8783d9

Quad inline header (#366) · b8544819

Naoki Shibata authored Dec 08, 2020

With this patch, the quad functions can be used with header files in which all the functions are included
It also adds support for CUDA quad functions.

Co-authored-by: shibatch <shibatch.sf.net@gmail.com>

b8544819

Dec 04, 2020

Revive i386 support (#365) · a38c0492

Naoki Shibata authored Dec 04, 2020



This patch revives the broken i386 support.

Co-authored-by: shibatch <shibatch.sf.net@gmail.com>

a38c0492

Nov 28, 2020
- Define cost of tests (#363) · ab02634e
  Naoki Shibata authored Nov 29, 2020
```
With this patch, cost for each test is defined to speed-up testing.

Co-authored-by: shibatch <shibatch.sf.net@gmail.com>
```
  ab02634e
- no message (#362) · f66b143a
  Naoki Shibata authored Nov 28, 2020
```
Co-authored-by: shibatch <shibatch.sf.net@gmail.com>
```
  f66b143a
- Fix sleef.h generation on rebuilds (#361) · d7f7e84a
  peterbell10 authored Nov 28, 2020
  
  d7f7e84a
Nov 25, 2020

Address issue #354 (#359) · 03e18099

Naoki Shibata authored Nov 25, 2020



This patch fixes the problem pointed out in issue #354.
It also changes the CI setting for mac.

Co-authored-by: shibatch <shibatch.sf.net@gmail.com>

03e18099

Fix undefined behaviour in xnextafterf (#358) · 1cad9ca7
elfringham authored Nov 25, 2020

1cad9ca7

Fix undefined behavior in xnextafterf (#357) · ef21e3ad

Nikita Shulga authored Nov 24, 2020

C standard does not define how sign bit should be handled during left shift, which triggers UBSAN runtime error:
```
src/libm/sleefsimdsp.c:3031:101: runtime error: left shift of 1 by 31 places cannot be represented in type 'int'
```
Discovered while working on https://github.com/pytorch/pytorch/pull/48275

ef21e3ad

Nov 24, 2020

Fix undefined behaviour in vilogbk_vi_vd (#355) · 75153c66

Nikita Shulga authored Nov 23, 2020

C standard do not define have left shift should affect the sign bit, which results in the following runtime error if `vilogbk_vi_vd` is compiled by clang with sanitizer checks enabled:
```
sleef/src/libm/sleefsimddp.c:329:49: runtime error: left shift of 4095 by 20 places cannot be represented in type 'int'
```
Can be fixed by explicitly specifying type of shift 1st operand as unsigned: i.e. replacing `1` with `1U`

75153c66

Nov 18, 2020

Update build rules for Apple Silicon (#353) · 7ce51c44

Nikita Shulga authored Nov 17, 2020

CMAKE_SYSTEM_PROCESSOR is set to "arm64" on Apple M1 machines

Discovered while working on https://github.com/pytorch/pytorch/issues/48145

7ce51c44