Commit
[cmsdy] in dsample.f of pp_dy3j.mad P0_gux_taptamggux, cache xbin_min for xmin=0 and xbin_max for xmax=1 (part2 of madgraph5#969)

There is indeed another clear and not too small improvement

CUDACPP_RUNTIME_DISABLEFPE=1 ./build.cuda_d_inl0_hrd0/madevent_cuda < /tmp/avalassi/input_dy3j_x1_cudacpp
[COUNTERS] PROGRAM TOTAL : 4.2184s
[COUNTERS] Fortran Other ( 0 ) : 0.1695s
[COUNTERS] Fortran Initialise(I/O) ( 1 ) : 0.0672s
[COUNTERS] Fortran Random2Momenta ( 3 ) : 2.9293s for 1170103 events => throughput is 2.50E-06 events/s
[COUNTERS] Fortran PDFs ( 4 ) : 0.1094s for 49152 events => throughput is 2.23E-06 events/s
[COUNTERS] Fortran UpdateScaleCouplings ( 5 ) : 0.1379s for 16384 events => throughput is 8.42E-06 events/s
[COUNTERS] Fortran Reweight ( 6 ) : 0.0560s for 16384 events => throughput is 3.42E-06 events/s
[COUNTERS] Fortran Unweight(LHE-I/O) ( 7 ) : 0.0707s for 16384 events => throughput is 4.31E-06 events/s
[COUNTERS] Fortran SamplePutPoint ( 8 ) : 0.1447s for 1170103 events => throughput is 1.24E-07 events/s
[COUNTERS] CudaCpp Initialise ( 11 ) : 0.4719s
[COUNTERS] CudaCpp Finalise ( 12 ) : 0.0267s
[COUNTERS] CudaCpp MEs ( 19 ) : 0.0350s for 16384 events => throughput is 2.13E-06 events/s
[COUNTERS] OVERALL NON-MEs ( 21 ) : 4.1834s
[COUNTERS] OVERALL MEs ( 22 ) : 0.0350s for 16384 events => throughput is 2.13E-06 events/s
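
The commit message names the optimisation but does not show the code, so the following is only a rough sketch of the general idea, written in Python rather than Fortran. It assumes that xbin is a lookup that converts a value in [0, 1] into a fractional bin position on a sampling grid, and that the bounds xmin=0 and xmax=1 recur for most sampled points. The class GridDimension, its methods, and the calls counter are hypothetical names introduced for illustration and are not taken from dsample.f.

```python
from bisect import bisect_left


class GridDimension:
    """Hypothetical 1-D sampling grid; edges[i] is the upper edge of bin i."""

    def __init__(self, edges):
        # Assumption: edges are strictly increasing, above 0, and end at 1.0.
        self.edges = list(edges)
        self.calls = 0  # counts full lookups, to make the saving visible
        # Compute the two recurring lookups once instead of once per point.
        self.xbin_min_cached = self.xbin(0.0)
        self.xbin_max_cached = self.xbin(1.0)

    def xbin(self, y):
        """Fractional bin position of y: binary search plus interpolation."""
        self.calls += 1
        if y <= 0.0:
            return 0.0
        if y >= 1.0:
            return float(len(self.edges))
        i = bisect_left(self.edges, y)  # index of the first edge >= y
        lo = self.edges[i - 1] if i > 0 else 0.0
        hi = self.edges[i]
        return i + (y - lo) / (hi - lo)

    def bin_range(self, xmin, xmax):
        """Return (xbin_min, xbin_max), reusing cached values at the ends."""
        lo = self.xbin_min_cached if xmin == 0.0 else self.xbin(xmin)
        hi = self.xbin_max_cached if xmax == 1.0 else self.xbin(xmax)
        return lo, hi
```

In this sketch, calling bin_range(0.0, 1.0) repeatedly performs no further lookups after construction, while any other bound still goes through the full search. A cache of this kind is only valid while the grid is unchanged, so it would have to be recomputed whenever the bin edges are updated.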