Commits · 3.4.0 · Libraries / SLEEF

Apr 28, 2019
- no message (#256) · 8df2bce4
  Naoki Shibata authored Apr 28, 2019
```
Change version number to 3.4.0
```
  3.4.0
  
  8df2bce4
Apr 27, 2019
- no message (#255) · 79b28f4e
  Naoki Shibata authored Apr 27, 2019
  
  79b28f4e
- Update htmldocs (#254) · 569e1da0
  Naoki Shibata authored Apr 27, 2019
```
This patch updates the html documents.
This patch also re-enables testing with Intel Compiler.
```
  569e1da0
Mar 12, 2019

Change copyright year (#248) · 5d84bbf1
Naoki Shibata authored Mar 12, 2019
```
This patch updates the copyright year of files.
```
5d84bbf1

Add low accuracy sin, cos, log2, exp2 and pow (#229) · a77ba9e1

Naoki Shibata authored Mar 12, 2019

This patch adds the following functions.

* fastsinf_u3500
* fastcosf_u3500
* fastpowf_u3500
* log2_u35, log2f_u35
* exp2_u35, exp2f_u35

The error bound of fastsin and fastcos are max(2e-6, 350ULPs).
Each function has corresponding deterministic functions.

a77ba9e1

Mar 07, 2019

Add (no-op) VECTOR_CC to sse/avx dispatch (#243) · 747565b0

Yichao Yu authored Mar 06, 2019

Currently a no-op but this makes it easier to support vector call attribute on x86,
which might be necessary on windows.

747565b0

Mar 06, 2019
- Merge pull request #241 from yuyichao/missing-vec · 8f191984
  Francesco Petrogalli authored Mar 06, 2019
```
Add missing VECTOR_CC for sincos
```
  8f191984
Mar 05, 2019

[LIBM] Fix sin, cos and exp (#246) · a2f7b0bb

Naoki Shibata authored Mar 05, 2019

On rare occasions, the error with sin_u10 and cos_u10 functions exceeds 1.0 ULP.
This patch fixes this problem.

a2f7b0bb

Mar 04, 2019

[Quad] Add trigonometric functions (#240) · 7784b689

Naoki Shibata authored Mar 04, 2019

This patch adds sin, cos, tan, exp, exp2, exp10, expm1, log, log2, log10, log1p, asin, acos, atan, comparison functions and cast functions between quad and double to libsleefquad.

7784b689

Feb 25, 2019
- Merge pull request #242 from yuyichao/rename-vectorcc · 12308174
  Francesco Petrogalli authored Feb 25, 2019
```
Remove vectorcc from getInt and getPtr declarations
```
  12308174
- Remove vectorcc from getInt and getPtr declarations · f6cad734
  Yichao Yu authored Feb 24, 2019
```
These functions are not defined with vectorcc so they shouldn't be declared with it either.
```
  f6cad734
- Add missing VECTOR_CC for sincos · 95e3ba1b
  Yichao Yu authored Feb 24, 2019
  
  95e3ba1b
Feb 12, 2019

[LIBM] Introduce faster method for evaluating polynomials (#239) · ca4fd109

Naoki Shibata authored Feb 13, 2019

This patch replaces Horner method which was used to evaluate polynomials with Estrin's method( https://en.wikipedia.org/wiki/Estrin%27s_scheme ) that allows more parallel computations with out-of-order execution.
This patch also introducing a new reduction method to tan.
With this patch, mainly computation for double-precision functions becomes faster, and the effect is like a few percent to 20 percent. For example, the ratio between execution time of the following functions before and after applying this patch is shown below.

Sleef_atan2d4_u35 : 1.21
Sleef_powd4_u10 : 1.17
Sleef_sind4_u35 : 1.10
Sleef_tand4_u10 : 1.04
Sleef_tand4_u35 : 1.17

ca4fd109

Jan 29, 2019

[Quad] Add functions for conversion between quad and string (#237) · 9a3aecfe

Naoki Shibata authored Jan 29, 2019

This patch adds Sleef_strtoq and Sleef_qtostr which can be used to convert between a quad value and a string. These functions are not vectorized. The corresponding testers are also added.

This patch also adds functions for subtraction.

Intel compiler testing is temporarily disabled because of license expiration( https://github.com/shibatch/sleef/issues/238 ).

9a3aecfe

Jan 24, 2019

Add quadprecision math library (#235) · a0537162

Naoki Shibata authored Jan 24, 2019

This is a part of implementation of issue #233 ( https://github.com/shibatch/sleef/issues/233 ).
At this point, add, mul, div and sqrt with testers are implemented. Remaining functions will be committed in the succeeding PRs.
As for vector extensions, SSE2, AVX, FMA4, AVX2, AV2_128, AVX512F, AdvSIMD and SVE are supported.

This quad-precision math library is built only if -DBUILD_QUAD option is given to cmake. For some time(1 year?), this sub-project is positioned at alpha development stage.

a0537162

Jan 23, 2019

[CI] Fix configuration · 8e6e52f2

Francesco Petrogalli authored Jan 24, 2019

1. `-march=armv8-a+simd` is removed as it is not necessary (#232)
2. Delete output that is never generated (#231)

It also includes changes of CI setting for removing GCC/OSX testing on travis. This is because updating gcc with brew takes too much time now. Instead of this, build with gcc is now tested on Jenkins.

8e6e52f2

no message · 79df29e3
Naoki Shibata authored Jan 23, 2019

79df29e3

Oct 23, 2018
- Merge pull request #228 from shibatch/merging-3.3.1-with-aavpcs · ae55d715
  Francesco Petrogalli authored Oct 22, 2018
```
Merging 3.3.1 with aavpcs
```
  ae55d715
Oct 22, 2018
- [changelog] Update AAVPCS support for next release. · 1bd7c49a
  Francesco Petrogalli authored Oct 22, 2018
  
  1bd7c49a
- Correct definition of vrint_vi2_vf · f4bcfed9
  Naoki Shibata authored Oct 22, 2018
  
  f4bcfed9
Oct 15, 2018
- [build] Fix x86 build. · d1daea9b
  Francesco Petrogalli authored Oct 15, 2018
```
The x86 dispatcher was build for SP when targeting DP.
```
  d1daea9b
- [AArch64] Add missing attribute AAVPCS to definition. · d13f97bd
  Francesco Petrogalli authored Oct 15, 2018
  
  d13f97bd
- [aarch64] Add AAVPCS calling conventions to tester3. · 18d7ddef
  Francesco Petrogalli authored Oct 15, 2018
  
  18d7ddef
- [Jenkins] Add AArch64 Vector PCS testing, with libsleefgnuabi. · fb5e7063
  Francesco Petrogalli authored Oct 15, 2018
  
  fb5e7063
- Fix build failures after merge. · 85d2cd58
  Francesco Petrogalli authored Oct 15, 2018
  
  85d2cd58
- Merge branch 'merging-arm-contributions' into merging-3.3.1-with-aavpcs · b6749485
  Francesco Petrogalli authored Oct 15, 2018
  
  b6749485
- [aarch64] Enable `aarch64_vector_pcs` attribute. · 605a7d9d
  Kerry McLaughlin authored Oct 15, 2018
```
This commit enables building `libsleef` and `libsleefgnuabi` with the
`aarch64_vector_pcs` attribute defined in the _Vector Function ABI
specification for AArch64_ [1].

The build must be configured with `-DFORCE_AAVPCS=On`. By default this
configure variable is set to `Off`.

[1] https://developer.arm.com/products/software-development-tools/hpc/arm-compiler-for-hpc/vector-function-abi
```
  605a7d9d
Oct 11, 2018
- Merge branch 'Add_hash_based_testing2' · 05dce05f
  Naoki Shibata authored Oct 11, 2018
  
  05dce05f
- [Tester] Faster tester and iut (#223) · ffd58fea
  Naoki Shibata authored Oct 11, 2018
```
This patch reduces testing time to 50%.
```
  ffd58fea
Oct 08, 2018

Fix tester (#226) · a4fb670f

Naoki Shibata authored Oct 08, 2018

I found a bug of tester in denormal/nonnumber handling of functions with two arguments.
This patch fixes that bug.
There is no change in the library itself.

a4fb670f

Sep 10, 2018
- no message · 9b4ae0cf
  Naoki Shibata authored Sep 10, 2018
  
  9b4ae0cf
- no message · 8be300cd
  Naoki Shibata authored Sep 10, 2018
  
  8be300cd
Sep 01, 2018
- no message · 018683bc
  Naoki Shibata authored Sep 01, 2018
  
  018683bc
- no message · f8d915c0
  Naoki Shibata authored Sep 01, 2018
  
  f8d915c0
- no message · bae81d56
  Naoki Shibata authored Sep 01, 2018
  
  bae81d56
Aug 31, 2018
- no message · 32b7f2f7
  Naoki Shibata authored Sep 01, 2018
  
  32b7f2f7
- no message · 7d362504
  Naoki Shibata authored Sep 01, 2018
  
  7d362504
- no message · f91bca5f
  Naoki Shibata authored Aug 31, 2018
  
  f91bca5f
- no message · 64cd76f9
  Naoki Shibata authored Aug 31, 2018
  
  64cd76f9
- no message · 3af0ce0a
  Naoki Shibata authored Aug 31, 2018
  
  3af0ce0a