Skip to content
  1. Sep 15, 2020
  2. Sep 14, 2020
  3. Sep 13, 2020
  4. Sep 01, 2020
  5. Aug 31, 2020
  6. Aug 28, 2020
  7. Aug 27, 2020
  8. Aug 26, 2020
  9. Aug 25, 2020
  10. Aug 21, 2020
  11. Jul 23, 2020
  12. Jul 10, 2020
    • Yichao Yu's avatar
      Correctly preserve default CFLAGS (#312) · e0c74aaf
      Yichao Yu authored
      1. `string(CONCAT)` does not append content to the string but replaces the content of the output variable. Appending should use `string(APPEND)` instead.
      2. The contents of the two arguments should be concatenated with a space so
         `string(APPEND)`/`string(CONCAT)` aren't the right/most convinient functions to use either.
         Since there're only two strings input, simply use a string interpolation instead.
      e0c74aaf
  13. Jun 24, 2020
  14. May 01, 2020
  15. Apr 11, 2020
  16. Apr 08, 2020
  17. Apr 07, 2020
  18. Apr 02, 2020
  19. Mar 25, 2020
  20. Mar 21, 2020
    • Naoki Shibata's avatar
      [AArch64] Add alternative division and sqrt methods · 1ba18367
      Naoki Shibata authored
      
      
      With this patch, alternative division and sqrt can be chosen by specifying -DENABLE_ALTDIV=TRUE -DENABLE_ALTSQRT=TRUE as cmake options.
      The alternative methods use combinations of FMA operations to compute division and sqrt.
      These methods could possibly be beneficial for micro-architectures on which the corresponding instructions are non-pipelined and have long latencies.
      
      Co-authored-by: default avatarshibatch <shibatch.sf.net@gmail.com>
      1ba18367
  21. Mar 19, 2020
    • Sayed Adel's avatar
      PPC64LE: Fix build issue on gcc (#288) · 374fa975
      Sayed Adel authored
      * ppc64le: Several improvements for VSX
      
        - fix the build on gcc
        - fix accuracy tests
        - fix the build with C++ when 'vector' compiler token is combined,
          NOTE: the generated header is using '__vector' instead of 'vector'
        - fix unaligned memory load
        - set rounding to the nearest even, make it equivalent to X86
        - improve the performance of conversions
        - improve the performance of shuffles and reverses
        - cleanup the whole VSX implementation
      
      * Travis CI: Add ppc64le support
      374fa975
  22. Feb 28, 2020
  23. Feb 25, 2020
    • Naoki Shibata's avatar
      no message (#286) · b30a1276
      Naoki Shibata authored
      This patch allows rounding operations to use _mm512_roundscale_pd or _mm512_roundscale_ps with AVX-512. This should make the whole computation faster.
      b30a1276
  24. Feb 23, 2020
  25. Jan 21, 2020
Loading