#!/bin/csh # foreach i (*.f) sxf90 -C hopt -o $i.exe $i end intgl4.f: f90: vec(1): intgl4.f, line 27: Vectorized loop. f90: intgl4.f, intg4a: There is 1 diagnose. intosc.f: f90: vec(3): intosc.f, line 14: Unvectorized loop. f90: intosc.f, intos1: There is 1 diagnose. intpi3.f: f90: vec(3): intpi3.f, line 25: Unvectorized loop. f90: intpi3.f, intpi: There is 1 diagnose. jacobi11.f: f90: vec(3): jacobi11.f, line 17: Unvectorized loop. f90: vec(1): jacobi11.f, line 27: Vectorized loop. f90: vec(2): jacobi11.f, line 36: Partially vectorized loop. f90: vec(1): jacobi11.f, line 38: Vectorized loop. f90: jacobi11.f, jacob2: There are 4 diagnoses. f90: vec(1): jacobi11.f, line 70: Vectorized loop. f90: vec(1): jacobi11.f, line 70: Vectorized loop. f90: vec(1): jacobi11.f, line 78: Vectorized loop. f90: vec(1): jacobi11.f, line 98: Vectorized loop. f90: vec(1): jacobi11.f, line 104: Vectorized loop. f90: vec(1): jacobi11.f, line 110: Vectorized loop. f90: vec(1): jacobi11.f, line 122: Vectorized loop. f90: vec(1): jacobi11.f, line 135: Vectorized loop. f90: vec(1): jacobi11.f, line 138: Vectorized loop. f90: vec(1): jacobi11.f, line 143: Vectorized loop. f90: vec(1): jacobi11.f, line 159: Vectorized loop. f90: vec(1): jacobi11.f, line 162: Vectorized loop. f90: jacobi11.f, jacobi: There are 12 diagnoses. jacobi1h.f: f90: vec(3): jacobi1h.f, line 17: Unvectorized loop. f90: vec(1): jacobi1h.f, line 27: Vectorized loop. f90: vec(2): jacobi1h.f, line 36: Partially vectorized loop. f90: vec(1): jacobi1h.f, line 38: Vectorized loop. f90: jacobi1h.f, jacob2: There are 4 diagnoses. f90: vec(1): jacobi1h.f, line 70: Vectorized loop. f90: vec(1): jacobi1h.f, line 70: Vectorized loop. f90: vec(1): jacobi1h.f, line 78: Vectorized loop. f90: vec(1): jacobi1h.f, line 98: Vectorized loop. f90: vec(1): jacobi1h.f, line 104: Vectorized loop. f90: vec(1): jacobi1h.f, line 110: Vectorized loop. f90: vec(1): jacobi1h.f, line 122: Vectorized loop. f90: vec(1): jacobi1h.f, line 135: Vectorized loop. f90: vec(1): jacobi1h.f, line 138: Vectorized loop. f90: vec(1): jacobi1h.f, line 143: Vectorized loop. f90: vec(1): jacobi1h.f, line 159: Vectorized loop. f90: vec(1): jacobi1h.f, line 162: Vectorized loop. f90: jacobi1h.f, jacobi: There are 12 diagnoses. leqs1k.f: f90: vec(1): leqs1k.f, line 19: Vectorized loop. f90: vec(3): leqs1k.f, line 25: Unvectorized loop. f90: vec(1): leqs1k.f, line 36: Vectorized loop. f90: vec(1): leqs1k.f, line 42: Vectorized loop. f90: leqs1k.f, leqs2: There are 4 diagnoses. f90: vec(1): leqs1k.f, line 59: Vectorized loop. f90: vec(1): leqs1k.f, line 64: Vectorized loop. f90: vec(1): leqs1k.f, line 82: Vectorized loop. f90: vec(1): leqs1k.f, line 88: Vectorized loop. f90: vec(1): leqs1k.f, line 96: Vectorized loop. f90: vec(1): leqs1k.f, line 100: Vectorized loop. f90: leqs1k.f, sleqs: There are 6 diagnoses. leqs4h.f: f90: vec(1): leqs4h.f, line 19: Vectorized loop. f90: vec(3): leqs4h.f, line 25: Unvectorized loop. f90: vec(1): leqs4h.f, line 36: Vectorized loop. f90: vec(1): leqs4h.f, line 42: Vectorized loop. f90: leqs4h.f, leqs2: There are 4 diagnoses. f90: vec(1): leqs4h.f, line 59: Vectorized loop. f90: vec(1): leqs4h.f, line 64: Vectorized loop. f90: vec(1): leqs4h.f, line 82: Vectorized loop. f90: vec(1): leqs4h.f, line 88: Vectorized loop. f90: vec(1): leqs4h.f, line 96: Vectorized loop. f90: vec(1): leqs4h.f, line 100: Vectorized loop. f90: leqs4h.f, sleqs: There are 6 diagnoses. mathfnc1.f: f90: vec(1): mathfnc1.f, line 17: Vectorized loop. f90: mathfnc1.f, mthf1b: There is 1 diagnose. mathfnc2.f: f90: vec(1): mathfnc2.f, line 17: Vectorized loop. f90: mathfnc2.f, mthf2b: There is 1 diagnose. matvec.f: f90: vec(1): matvec.f, line 24: Vectorized loop. f90: vec(1): matvec.f, line 29: Vectorized loop. f90: vec(1): matvec.f, line 36: Vectorized loop. f90: vec(1): matvec.f, line 46: Vectorized loop. f90: vec(1): matvec.f, line 51: Vectorized loop. f90: vec(1): matvec.f, line 51: Vectorized loop. f90: matvec.f, matvc1: There are 6 diagnoses. matvecz.f: f90: vec(1): matvecz.f, line 32: Vectorized loop. f90: vec(1): matvecz.f, line 37: Vectorized loop. f90: vec(2): matvecz.f, line 42: Partially vectorized loop. f90: vec(1): matvecz.f, line 44: Vectorized loop. f90: vec(1): matvecz.f, line 54: Vectorized loop. f90: vec(1): matvecz.f, line 59: Vectorized loop. f90: vec(1): matvecz.f, line 59: Vectorized loop. f90: matvecz.f, mtvcz1: There are 7 diagnoses. osc.f: f90: vec(3): osc.f, line 16: Unvectorized loop. f90: osc.f, osc1: There is 1 diagnose. permute1.f: f90: vec(1): permute1.f, line 25: Vectorized loop. f90: vec(3): permute1.f, line 39: Unvectorized loop. f90: vec(1): permute1.f, line 48: Vectorized loop. f90: permute1.f, perm: There are 3 diagnoses. permute2.f: f90: vec(1): permute2.f, line 25: Vectorized loop. f90: vec(2): permute2.f, line 32: Partially vectorized loop. f90: vec(3): permute2.f, line 39: Unvectorized loop. f90: vec(1): permute2.f, line 48: Vectorized loop. f90: permute2.f, perm: There are 4 diagnoses. permute3.f: f90: vec(1): permute3.f, line 25: Vectorized loop. f90: vec(2): permute3.f, line 32: Partially vectorized loop. f90: vec(3): permute3.f, line 39: Unvectorized loop. f90: vec(1): permute3.f, line 48: Vectorized loop. f90: permute3.f, perm: There are 4 diagnoses. pi3.f: f90: vec(3): pi3.f, line 21: Unvectorized loop. f90: pi3.f, pi3a: There is 1 diagnose. runge.f: f90: vec(3): runge.f, line 31: Unvectorized loop. f90: runge.f, runge1: There is 1 diagnose. # set machine = "sx4.yukawa.kyoto-u.ac.jp" set compile = "sxf90 -C hopt" cd sx/bench/ver2 setenv F_PROGINF DETAIL echo "Fortran Bench Mark Tests" echo "machine=" $machine echo "compile with [" $compile "]" echo "Date = " `date` @ n = 0 foreach i (matvec matvecz jacobi11 jacobi1h leqs4h leqs1k intosc osc intpi3 pi3 runge intgl4 mathfnc1 mathfnc2 permute1 permute2 permute3) echo "------------------------------------------------------------------------------" @ n = $n + 1 echo "($n) program =" $i time $i.f.exe end Warning: no access to tty (Bad file number). Thus no job control in this shell. Fortran Bench Mark Tests machine= sx4.yukawa.kyoto-u.ac.jp compile with [ sxf90 -C hopt ] Date = Fri Jul 3 04:13:47 JST 1998 ------------------------------------------------------------------------------ (1) program = matvec MULTIPLY 1023*1023 REAL MATRIX TO VECTOR 3* 2000 TIMES. LARGEST EIGENVALUES= 1023.93053 1022.12165 1021.00563 14.44u 0.27s 0:21.61 68.0% ------------------------------------------------------------------------------ (2) program = matvecz MULTIPLY 723* 723 COMPLEX MATRIX TO VECTOR 3* 1000 TIMES LARGEST EIGENVALUES= 723.98791 722.12717 720.97077 14.02u 0.29s 0:21.96 65.1% ------------------------------------------------------------------------------ (3) program = jacobi11 DIMENSION= 11 #JACOBI ROT CYC= 5* 200000 EMIN= -0.663815 EMAX= 10.309197 259.95u 1.05s 5:47.69 75.0% ------------------------------------------------------------------------------ (4) program = jacobi1h DIMENSION=101 #JACOBI ROT CYC= 5* 200 EMIN= -3.859769 EMAX= 104.276991 24.90u 0.46s 0:33.87 74.8% ------------------------------------------------------------------------------ (5) program = leqs4h DIM= 401 #INV= 78 RESULTS: 0.9989295956 0.9989933805 0.9982736386 6.28u 0.28s 0:10.71 61.2% ------------------------------------------------------------------------------ (6) program = leqs1k DIM= 1001 #INV= 5 RESULTS: 0.9983222082 0.9999235137 1.0001292085 3.57u 0.27s 0:07.16 53.6% ------------------------------------------------------------------------------ (7) program = intosc I( 1000000000)= -2267 I( 1000000001)= -1233 228.01u 0.84s 5:07.17 74.5% ------------------------------------------------------------------------------ (8) program = osc I( 1000000000)= 9861.8410 I( 1000000001)= 9426.5602 44.01u 0.39s 1:01.62 72.0% ------------------------------------------------------------------------------ (9) program = intpi3 PI= 50893811591411064*120/(8*N**5+15*N**3), N= 3000 258.16u 1.24s 5:47.75 74.5% ------------------------------------------------------------------------------ (10) program = pi3 N= 1500 PI= 3.141594622 354.07u 1.42s 7:51.71 75.3% ------------------------------------------------------------------------------ (11) program = runge 4TH-ORDER RUNGE-KUTTA FOR POTENTIAL=LOG(R). INITIAL R= 1.0 0.0 V= 0.0 1.5 TIME= 200000000* 0.00050 R= 1.95711 -1.69674 V= 0.29953 0.50675 150.41u 0.66s 3:23.09 74.3% ------------------------------------------------------------------------------ (12) program = intgl4 #PTS= 200**4 A= 0.1000 S= 3.244281965 25.87u 0.55s 0:34.84 75.8% ------------------------------------------------------------------------------ (13) program = mathfnc1 INTEGRAL[ 0.1 0.9: 100000000 STEPS](SIN(X)+SQRT(COS(X)))DX= 1.111225554 5.33u 0.33s 0:08.94 63.3% ------------------------------------------------------------------------------ (14) program = mathfnc2 INTEGRAL[ 0.1 0.9: 100000000 STEPS]ATAN2(EXP(X),LOG(X+1))DX= 1.081785145 9.00u 0.30s 0:14.32 64.9% ------------------------------------------------------------------------------ (15) program = permute1 INT*4 ARRAY_SIZE= 1024 #PERMUTATIONS/ARRAY_SIZE= 262144 A= 149 293 506 728 569 228 413 551 883 676 45.26u 0.36s 1:01.95 73.6% ------------------------------------------------------------------------------ (16) program = permute2 INT*4 ARRAY_SIZE= 32768 #PERMUTATIONS/ARRAY_SIZE= 8192 A= 149 293 16365 22125 2429 5407 28746 12182 14590 11825 80.63u 0.45s 1:51.00 73.0% ------------------------------------------------------------------------------ (17) program = permute3 INT*4 ARRAY_SIZE= 1048576 #PERMUTATIONS/ARRAY_SIZE= 256 A= 149 293 524435 178798 836951 263828 662616 351174 26552 147396 99.12u 0.54s 2:14.59 74.0% ****** Program Information ****** Real Time (sec) : 18.910741 User Time (sec) : 14.438427 Sys Time (sec) : 0.012795 Vector Time (sec) : 13.690772 Inst. Count : 468111771. V. Inst. Count : 122910349. V. Element Count : 31439547603. FLOP Count : 14155757819. MOPS : 2201.399732 MFLOPS : 980.422449 VLEN : 255.792518 V. Op. Ratio (%) : 98.913940 Memory Size (MB) : 12.031250 MIPS : 32.421245 I-Cache (sec) : 0.000996 O-Cache (sec) : 0.000376 Bank (sec) : 0.000152 ****** Program Information ****** Real Time (sec) : 19.807900 User Time (sec) : 14.016934 Sys Time (sec) : 0.026327 Vector Time (sec) : 13.486056 Inst. Count : 283686679. V. Inst. Count : 128228270. V. Element Count : 31455615312. FLOP Count : 13683192448. MOPS : 2255.205992 MFLOPS : 976.190112 VLEN : 245.309520 V. Op. Ratio (%) : 99.508215 Memory Size (MB) : 12.031250 MIPS : 20.238854 I-Cache (sec) : 0.001817 O-Cache (sec) : 0.000602 Bank (sec) : 2.445277 ****** Program Information ****** Real Time (sec) : 345.473393 User Time (sec) : 259.948069 Sys Time (sec) : 0.786900 Vector Time (sec) : 143.248652 Inst. Count : 18478691957. V. Inst. Count : 3040399112. V. Element Count : 30682390905. FLOP Count : 12579218747. MOPS : 177.422683 MFLOPS : 48.391276 VLEN : 10.091567 V. Op. Ratio (%) : 66.526314 Memory Size (MB) : 4.031250 MIPS : 71.086091 I-Cache (sec) : 0.130338 O-Cache (sec) : 22.424891 Bank (sec) : 0.000000 ****** Program Information ****** Real Time (sec) : 31.657800 User Time (sec) : 24.897683 Sys Time (sec) : 0.196969 Vector Time (sec) : 15.135286 Inst. Count : 1709910423. V. Inst. Count : 302051870. V. Element Count : 14547362520. FLOP Count : 8067165371. MOPS : 640.831570 MFLOPS : 324.012699 VLEN : 48.161803 V. Op. Ratio (%) : 91.176189 Memory Size (MB) : 4.031250 MIPS : 68.677493 I-Cache (sec) : 0.019387 O-Cache (sec) : 2.026995 Bank (sec) : 0.000000 ****** Program Information ****** Real Time (sec) : 8.401306 User Time (sec) : 6.277064 Sys Time (sec) : 0.015128 Vector Time (sec) : 4.629534 Inst. Count : 680370027. V. Inst. Count : 50962337. V. Element Count : 8588946787. FLOP Count : 3433641328. MOPS : 1468.577356 MFLOPS : 547.013886 VLEN : 168.535183 V. Op. Ratio (%) : 93.172234 Memory Size (MB) : 5.031250 MIPS : 108.389845 I-Cache (sec) : 0.001685 O-Cache (sec) : 0.004657 Bank (sec) : 0.000030 ****** Program Information ****** Real Time (sec) : 5.212877 User Time (sec) : 3.568548 Sys Time (sec) : 0.008629 Vector Time (sec) : 2.913519 Inst. Count : 323194029. V. Inst. Count : 38859324. V. Element Count : 8410874882. FLOP Count : 3369771873. MOPS : 2436.624186 MFLOPS : 944.297842 VLEN : 216.444189 V. Op. Ratio (%) : 96.729984 Memory Size (MB) : 12.031250 MIPS : 90.567384 I-Cache (sec) : 0.000855 O-Cache (sec) : 0.000759 Bank (sec) : 0.000084 ****** Program Information ****** Real Time (sec) : 305.448018 User Time (sec) : 228.010488 Sys Time (sec) : 0.582176 Vector Time (sec) : 0.000001 Inst. Count : 7000876291. V. Inst. Count : 4. V. Element Count : 78. FLOP Count : 1000000068. MOPS : 30.704186 MFLOPS : 4.385763 VLEN : 19.500000 V. Op. Ratio (%) : 0.000001 Memory Size (MB) : 4.031250 MIPS : 30.704185 I-Cache (sec) : 0.020944 O-Cache (sec) : 0.012722 Bank (sec) : 0.000000 ****** Program Information ****** Real Time (sec) : 59.634616 User Time (sec) : 44.009128 Sys Time (sec) : 0.125336 Vector Time (sec) : 0.000001 Inst. Count : 4000877305. V. Inst. Count : 4. V. Element Count : 78. FLOP Count : 2000000110. MOPS : 90.910171 MFLOPS : 45.445120 VLEN : 19.500000 V. Op. Ratio (%) : 0.000002 Memory Size (MB) : 4.031250 MIPS : 90.910170 I-Cache (sec) : 0.004333 O-Cache (sec) : 0.002592 Bank (sec) : 0.000000 ****** Program Information ****** Real Time (sec) : 345.679157 User Time (sec) : 258.161443 Sys Time (sec) : 0.986468 Vector Time (sec) : 0.000001 Inst. Count : 21691178858. V. Inst. Count : 8. V. Element Count : 86. FLOP Count : 52929956. MOPS : 84.021761 MFLOPS : 0.205027 VLEN : 10.750000 V. Op. Ratio (%) : 0.000000 Memory Size (MB) : 4.031250 MIPS : 84.021760 I-Cache (sec) : 0.041617 O-Cache (sec) : 0.021429 Bank (sec) : 0.000000 ****** Program Information ****** Real Time (sec) : 469.564469 User Time (sec) : 354.064860 Sys Time (sec) : 1.163552 Vector Time (sec) : 0.000001 Inst. Count : 23814434077. V. Inst. Count : 4. V. Element Count : 78. FLOP Count : 10136256083. MOPS : 67.260090 MFLOPS : 28.628247 VLEN : 19.500000 V. Op. Ratio (%) : 0.000000 Memory Size (MB) : 4.031250 MIPS : 67.260089 I-Cache (sec) : 0.048537 O-Cache (sec) : 0.025781 Bank (sec) : 0.000000 ****** Program Information ****** Real Time (sec) : 201.115957 User Time (sec) : 150.413702 Sys Time (sec) : 0.401858 Vector Time (sec) : 0.000001 Inst. Count : 17400884137. V. Inst. Count : 8. V. Element Count : 102. FLOP Count : 16800000338. MOPS : 115.686829 MFLOPS : 111.691955 VLEN : 12.750000 V. Op. Ratio (%) : 0.000001 Memory Size (MB) : 4.031250 MIPS : 115.686829 I-Cache (sec) : 0.017226 O-Cache (sec) : 0.008582 Bank (sec) : 0.000000 ****** Program Information ****** Real Time (sec) : 33.193921 User Time (sec) : 25.867065 Sys Time (sec) : 0.289676 Vector Time (sec) : 25.857040 Inst. Count : 513358221. V. Inst. Count : 200000004. V. Element Count : 44928000078. FLOP Count : 18088200718. MOPS : 1748.994629 MFLOPS : 699.275333 VLEN : 224.639996 V. Op. Ratio (%) : 99.307363 Memory Size (MB) : 4.031250 MIPS : 19.846017 I-Cache (sec) : 0.013104 O-Cache (sec) : 0.006427 Bank (sec) : 0.000000 ****** Program Information ****** Real Time (sec) : 6.720210 User Time (sec) : 5.329043 Sys Time (sec) : 0.068846 Vector Time (sec) : 5.319446 Inst. Count : 97361887. V. Inst. Count : 45312520. V. Element Count : 11600002138. FLOP Count : 5800000409. MOPS : 2186.518598 MFLOPS : 1088.375618 VLEN : 255.999934 V. Op. Ratio (%) : 99.553303 Memory Size (MB) : 4.031250 MIPS : 18.270051 I-Cache (sec) : 0.006268 O-Cache (sec) : 0.001937 Bank (sec) : 0.000000 ****** Program Information ****** Real Time (sec) : 12.036291 User Time (sec) : 9.000172 Sys Time (sec) : 0.036833 Vector Time (sec) : 8.990949 Inst. Count : 148924427. V. Inst. Count : 69140645. V. Element Count : 17700002138. FLOP Count : 9600000409. MOPS : 1975.494120 MFLOPS : 1066.646384 VLEN : 255.999957 V. Op. Ratio (%) : 99.551267 Memory Size (MB) : 4.031250 MIPS : 16.546843 I-Cache (sec) : 0.004483 O-Cache (sec) : 0.001034 Bank (sec) : 0.000000 ****** Program Information ****** Real Time (sec) : 60.102290 User Time (sec) : 45.260146 Sys Time (sec) : 0.110516 Vector Time (sec) : 0.000006 Inst. Count : 7265414627. V. Inst. Count : 37. V. Element Count : 6998. FLOP Count : 524366. MOPS : 160.525808 MFLOPS : 0.011586 VLEN : 189.135135 V. Op. Ratio (%) : 0.000096 Memory Size (MB) : 4.031250 MIPS : 160.525654 I-Cache (sec) : 0.005760 O-Cache (sec) : 0.006371 Bank (sec) : 0.000000 ****** Program Information ****** Real Time (sec) : 108.546951 User Time (sec) : 80.628500 Sys Time (sec) : 0.194507 Vector Time (sec) : 0.000290 Inst. Count : 8054170225. V. Inst. Count : 2065. V. Element Count : 526166. FLOP Count : 41051. MOPS : 99.898849 MFLOPS : 0.000509 VLEN : 254.801937 V. Op. Ratio (%) : 0.006532 Memory Size (MB) : 4.031250 MIPS : 99.892349 I-Cache (sec) : 0.008596 O-Cache (sec) : 33.975163 Bank (sec) : 0.000000 ****** Program Information ****** Real Time (sec) : 132.537565 User Time (sec) : 99.118718 Sys Time (sec) : 0.282603 Vector Time (sec) : 0.002828 Inst. Count : 8054057547. V. Inst. Count : 24633. V. Element Count : 6303574. FLOP Count : 1381. MOPS : 81.320024 MFLOPS : 0.000014 VLEN : 255.899566 V. Op. Ratio (%) : 0.078205 Memory Size (MB) : 8.031250 MIPS : 81.256676 I-Cache (sec) : 0.010574 O-Cache (sec) : 51.500487 Bank (sec) : 0.000000