AVX Extensions - Ongoing development?

Orioneti

Joined: 22 Oct 07
Posts: 21
Credit: 23,642,634
RAC: 0
Finland
Message 1108599 - Posted: 22 May 2011, 7:09:32 UTC - in response to Message 1108596.  

2600K @ 4488 MHz, Win7 SP1 ...


Ftst_v7_J45 started.

Optimal function choices:
--------------------------------------------------------
                            name   timing   error
--------------------------------------------------------
                v_BaseLineSmooth (no other)

              v_GetPowerSpectrum 0.000089 0.00000  test
             v_vGetPowerSpectrum 0.000044 0.00000  test
            v_vGetPowerSpectrum2 0.000054 0.00000  test
     v_vGetPowerSpectrumUnrolled 0.000041 0.00000  test
    v_vGetPowerSpectrumUnrolled2 0.000056 0.00000  test
           v_avxGetPowerSpectrum 0.000037 0.00000  test
           v_avxGetPowerSpectrum 0.000037 0.00000  choice

                     v_ChirpData 0.003751 0.00000  test
                   fpu_ChirpData 0.008935 0.00000  test
               fpu_opt_ChirpData 0.003708 0.00000  test
             v_vChirpData_x86_64 0.043466 0.00000  test
               sse1_ChirpData_ak 0.005292 0.00000  test
             sse1_ChirpData_ak8e 0.004292 0.00000  test
             sse1_ChirpData_ak8h 0.004518 0.00000  test
               sse2_ChirpData_ak 0.004998 0.00000  test
              sse2_ChirpData_ak8 0.003289 0.00000  test
               sse3_ChirpData_ak 0.004819 0.00000  test
              sse3_ChirpData_ak8 0.003247 0.00000  test
                 avx_ChirpData_a 0.001538 0.00000  test
                 avx_ChirpData_b 0.001720 0.00000  test
                 avx_ChirpData_c 0.001547 0.00000  test
                 avx_ChirpData_d 0.001439 0.00000  test
                 avx_ChirpData_d 0.001439 0.00000  choice

                     v_Transpose 0.002295 0.00000  test
                    v_Transpose2 0.002469 0.00000  test
                    v_Transpose4 0.001244 0.00000  test
                    v_Transpose8 0.002225 0.00000  test
                  v_pfTranspose2 0.001383 0.00000  test
                  v_pfTranspose4 0.001181 0.00000  test
                  v_pfTranspose8 0.002386 0.00000  test
                   v_vTranspose4 0.000728 0.00000  test
                 v_vTranspose4np 0.000984 0.00000  test
                v_vTranspose4ntw 0.006246 0.00000  test
              v_vTranspose4x8ntw 0.002533 0.00000  test
             v_vTranspose4x16ntw 0.000708 0.00000  test
            v_vpfTranspose8x4ntw 0.006067 0.00000  test
            v_avxTranspose4x8ntw 0.002625 0.00000  test
           v_avxTranspose4x16ntw 0.000612 0.00000  test
            v_avxTranspose8x4ntw 0.006264 0.00000  test
          v_avxTranspose8x8ntw_a 0.001964 0.00000  test
          v_avxTranspose8x8ntw_b 0.002406 0.00000  test
           v_avxTranspose4x16ntw 0.000612 0.00000  choice

                 FPU opt folding 0.001707 0.00000  test
                  AK SSE folding 0.000374 0.00000  test
                  BH SSE folding 0.000357 0.00000  test
                JS AVX_a folding 0.000314 0.00000  test
                JS AVX_c folding 0.000319 0.00000  test
                JS AVX_a folding 0.000314 0.00000  choice

                   Test duration     3.01 seconds



2600K @ stock, Win7 SP1 ...


Ftst_v7_J45 started.

Optimal function choices:
--------------------------------------------------------
                            name   timing   error
--------------------------------------------------------
                v_BaseLineSmooth (no other)

              v_GetPowerSpectrum 0.000114 0.00000  test
             v_vGetPowerSpectrum 0.000057 0.00000  test
            v_vGetPowerSpectrum2 0.000069 0.00000  test
     v_vGetPowerSpectrumUnrolled 0.000052 0.00000  test
    v_vGetPowerSpectrumUnrolled2 0.000072 0.00000  test
           v_avxGetPowerSpectrum 0.000047 0.00000  test
           v_avxGetPowerSpectrum 0.000047 0.00000  choice

                     v_ChirpData 0.004174 0.00000  test
                   fpu_ChirpData 0.011399 0.00000  test
               fpu_opt_ChirpData 0.004132 0.00000  test
             v_vChirpData_x86_64 0.055563 0.00000  test
               sse1_ChirpData_ak 0.006768 0.00000  test
             sse1_ChirpData_ak8e 0.005486 0.00000  test
             sse1_ChirpData_ak8h 0.005763 0.00000  test
               sse2_ChirpData_ak 0.006365 0.00000  test
              sse2_ChirpData_ak8 0.004205 0.00000  test
               sse3_ChirpData_ak 0.006157 0.00000  test
              sse3_ChirpData_ak8 0.004132 0.00000  test
                 avx_ChirpData_a 0.001956 0.00000  test
                 avx_ChirpData_b 0.002194 0.00000  test
                 avx_ChirpData_c 0.001969 0.00000  test
                 avx_ChirpData_d 0.001815 0.00000  test
                 avx_ChirpData_d 0.001815 0.00000  choice

                     v_Transpose 0.002897 0.00000  test
                    v_Transpose2 0.003133 0.00000  test
                    v_Transpose4 0.001580 0.00000  test
                    v_Transpose8 0.002816 0.00000  test
                  v_pfTranspose2 0.001655 0.00000  test
                  v_pfTranspose4 0.001482 0.00000  test
                  v_pfTranspose8 0.002997 0.00000  test
                   v_vTranspose4 0.000878 0.00000  test
                 v_vTranspose4np 0.001245 0.00000  test
                v_vTranspose4ntw 0.007768 0.00000  test
              v_vTranspose4x8ntw 0.003076 0.00000  test
             v_vTranspose4x16ntw 0.000856 0.00000  test
            v_vpfTranspose8x4ntw 0.007466 0.00000  test
            v_avxTranspose4x8ntw 0.003220 0.00000  test
           v_avxTranspose4x16ntw 0.000735 0.00000  test
            v_avxTranspose8x4ntw 0.007752 0.00000  test
          v_avxTranspose8x8ntw_a 0.002395 0.00000  test
          v_avxTranspose8x8ntw_b 0.002921 0.00000  test
           v_avxTranspose4x16ntw 0.000735 0.00000  choice

                 FPU opt folding 0.002181 0.00000  test
                  AK SSE folding 0.000479 0.00000  test
                  BH SSE folding 0.000458 0.00000  test
                JS AVX_a folding 0.000402 0.00000  test
                JS AVX_c folding 0.000408 0.00000  test
                JS AVX_a folding 0.000402 0.00000  choice

                   Test duration     3.77 seconds

Ftst_v7 completed successfully.
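
For anyone comparing the two runs, the AVX gain over the best SSE chirp routine can be worked out directly from the posted timings. A minimal Python sketch, with the numbers copied from the logs above; the `runs` table and the `speedup` helper are illustrative names only, not part of the Ftst tool:

```python
# Timings in seconds, copied from the two 2600K logs above.
# sse3_ChirpData_ak8 is the fastest SSE chirp in both runs,
# avx_ChirpData_d is the routine marked "choice".
runs = {
    "2600K @ 4488 MHz": {"sse3_ChirpData_ak8": 0.003247, "avx_ChirpData_d": 0.001439},
    "2600K @ stock": {"sse3_ChirpData_ak8": 0.004132, "avx_ChirpData_d": 0.001815},
}


def speedup(baseline, candidate):
    """How many times faster the candidate is than the baseline."""
    return baseline / candidate


for label, timings in runs.items():
    ratio = speedup(timings["sse3_ChirpData_ak8"], timings["avx_ChirpData_d"])
    print(f"{label}: AVX chirp is {ratio:.2f}x faster than the best SSE chirp")
```

Both runs come out at a bit under 2.3x, so the relative AVX advantage on this chip looks about the same overclocked or not.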
Stewart

Joined: 28 Aug 07
Posts: 4
Credit: 829,029
RAC: 0
United States
Message 1109215 - Posted: 23 May 2011, 23:03:22 UTC

i5-2500K at 4.4GHz, Win7 SP1
=========================================================
Ftst_v7_J45 started.

Optimal function choices:
--------------------------------------------------------
                            name   timing   error
--------------------------------------------------------
                v_BaseLineSmooth (no other)

              v_GetPowerSpectrum 0.000090 0.00000  test
             v_vGetPowerSpectrum 0.000045 0.00000  test
            v_vGetPowerSpectrum2 0.000054 0.00000  test
     v_vGetPowerSpectrumUnrolled 0.000042 0.00000  test
    v_vGetPowerSpectrumUnrolled2 0.000057 0.00000  test
           v_avxGetPowerSpectrum 0.000037 0.00000  test
           v_avxGetPowerSpectrum 0.000037 0.00000  choice

                     v_ChirpData 0.003710 0.00000  test
                   fpu_ChirpData 0.009046 0.00000  test
               fpu_opt_ChirpData 0.003696 0.00000  test
             v_vChirpData_x86_64 0.044385 0.00000  test
               sse1_ChirpData_ak 0.005062 0.00000  test
             sse1_ChirpData_ak8e 0.004202 0.00000  test
             sse1_ChirpData_ak8h 0.004348 0.00000  test
               sse2_ChirpData_ak 0.004942 0.00000  test
              sse2_ChirpData_ak8 0.003118 0.00000  test
               sse3_ChirpData_ak 0.004789 0.00000  test
              sse3_ChirpData_ak8 0.003079 0.00000  test
                 avx_ChirpData_a 0.001561 0.00000  test
                 avx_ChirpData_b 0.001532 0.00000  test
                 avx_ChirpData_c 0.001575 0.00000  test
                 avx_ChirpData_d 0.001451 0.00000  test
                 avx_ChirpData_d 0.001451 0.00000  choice

                     v_Transpose 0.002687 0.00000  test
                    v_Transpose2 0.002578 0.00000  test
                    v_Transpose4 0.001341 0.00000  test
                    v_Transpose8 0.002378 0.00000  test
                  v_pfTranspose2 0.001668 0.00000  test
                  v_pfTranspose4 0.001475 0.00000  test
                  v_pfTranspose8 0.002782 0.00000  test
                   v_vTranspose4 0.000911 0.00000  test
                 v_vTranspose4np 0.001079 0.00000  test
                v_vTranspose4ntw 0.006266 0.00000  test
              v_vTranspose4x8ntw 0.002541 0.00000  test
             v_vTranspose4x16ntw 0.000748 0.00000  test
            v_vpfTranspose8x4ntw 0.006083 0.00000  test
            v_avxTranspose4x8ntw 0.002636 0.00000  test
           v_avxTranspose4x16ntw 0.000676 0.00000  test
            v_avxTranspose8x4ntw 0.006290 0.00000  test
          v_avxTranspose8x8ntw_a 0.002008 0.00000  test
          v_avxTranspose8x8ntw_b 0.002421 0.00000  test
           v_avxTranspose4x16ntw 0.000676 0.00000  choice

                 FPU opt folding 0.001742 0.00000  test
                  AK SSE folding 0.000382 0.00000  test
                  BH SSE folding 0.000364 0.00000  test
                JS AVX_a folding 0.000319 0.00000  test
                JS AVX_c folding 0.000325 0.00000  test
                JS AVX_a folding 0.000319 0.00000  choice

                   Test duration     3.07 seconds

Ftst_v7 completed successfully.
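
In every group of these logs the row marked "choice" is the "test" row with the lowest timing. How Ftst actually makes the selection internally is not shown here, so the following is only a sketch of that rule under that assumption; `pick_fastest` is a hypothetical helper, and it ignores the error column:

```python
def pick_fastest(log_text):
    """Return (name, timing) of the lowest-timing 'test' row in one group."""
    best = None
    for line in log_text.splitlines():
        # Split from the right: routine names such as "JS AVX_a folding"
        # contain spaces, but the last three fields never do.
        parts = line.rsplit(None, 3)
        if len(parts) != 4 or parts[3] != "test":
            continue  # skip blank lines, headers and "choice" rows
        name, timing = parts[0].strip(), float(parts[1])
        if best is None or timing < best[1]:
            best = (name, timing)
    return best


# Folding group copied from the i5-2500K log above.
folding = """
                 FPU opt folding 0.001742 0.00000  test
                  AK SSE folding 0.000382 0.00000  test
                  BH SSE folding 0.000364 0.00000  test
                JS AVX_a folding 0.000319 0.00000  test
                JS AVX_c folding 0.000325 0.00000  test
"""

print(pick_fastest(folding))
```

On the folding group this picks JS AVX_a folding at 0.000319, which matches the "choice" row in the log.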
Ivailo Bonev
Volunteer tester

Joined: 26 Jun 00
Posts: 247
Credit: 35,864,461
RAC: 2
Bulgaria
Message 1124242 - Posted: 3 Jul 2011, 12:13:17 UTC

I don't know whether there is any development effort going on right now, but I found an interesting document about the AVX extensions on Intel's web site: Intel Advanced Vector Extensions Programming Reference (June 2011) (pdf). Hope it helps :)
Fred J. Verster
Volunteer tester

Joined: 21 Apr 04
Posts: 3252
Credit: 31,903,643
RAC: 0
Netherlands
Message 1124244 - Posted: 3 Jul 2011, 12:27:10 UTC - in response to Message 1124242.  

Nice info on AVX. At 595 pages, this can take a while ;-) but it is certainly worth reading.




 