Rodinia simulation config #318
Comments
We don't have a config for the TITAN RTX. You can adapt the RTX 2060 config: change the number of SMs and the clock frequency to match your card, and change the L2 size as well. The SM-related settings should be similar.
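The edits suggested above can be scripted. This is a minimal sketch, assuming the config file holds one `-option value` pair per line and that `-gpgpu_n_clusters` is the option that sets the SM count, as in the GPGPU-Sim tested configs. `set_option` is a hypothetical helper, not part of Accel-Sim; the sample text is illustrative, and 72 is the SM count from NVIDIA's public TITAN RTX specification.

```python
import re


def set_option(config_text: str, name: str, value: str) -> str:
    """Return config_text with the value of the `name value` line replaced.

    The option name must be followed by a space or tab, so a name that is
    only a prefix of a longer option is not matched by accident.
    """
    pattern = re.compile(
        rf"^([ \t]*{re.escape(name)})[ \t]+.*$", re.MULTILINE
    )
    # A function is used as the replacement so `value` is inserted literally.
    new_text, count = pattern.subn(lambda m: f"{m.group(1)} {value}", config_text)
    if count == 0:
        raise KeyError(f"option {name} not found in config")
    return new_text


# Illustrative two-line excerpt, not a complete gpgpusim.config.
sample = "-gpgpu_n_clusters 30\n-gpgpu_n_cores_per_cluster 1\n"
patched = set_option(sample, "-gpgpu_n_clusters", "72")
print(patched)
```

The same helper could be applied to the clock and L2 options once you have confirmed their names and value formats in the RTX 2060 config you start from.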
Thanks. I tried to generate a config file with the Accel-Sim tuner; here is my turing_TITANRTX_hw_def.h file. Starting from the turing_rtx2060 header file, I changed the frequency, the L1 cache size, and WARP_SCHEDS_PER_SM.
I used tuner.py to generate the config file and moved it to the required folders. Then, when I used the run_simulations.py script to launch all 16 possible combinations, the following error occurred:
accel-sim-framework: release
Only one config, TITANRTX-SASS, ran the simulation successfully; all the others failed. How can I fix these errors? It would be great if you could give me some hints.
I finally found the mistakes in my repo. However, the simulations of all IPOLY-related trials still fail. When I run /home/data/userhome/liqiang/lab/gpu/accel-sim-framework/sim_run_11.0/l1_bw_32f/NO_ARGS/TITANRTX-SASS-IPOLY-GTO-32B-FRFCFS/justrun.sh, it shows:
which indicates that my bank number setting is wrong. Which option in my config file should I revise?
First, don't mix release and dev: if you are using dev for gpgpu-sim, use dev for accel-sim as well. The last line tells you exactly what the problem is.
Why is
The gpgpu_n_mem parameter is generated by tuner.py, and it is wrong for the TITAN RTX. According to the Turing architecture white paper, TU102 (TITAN RTX) has 12 memory controllers, the same number as in the RTX2060 config. So why does tuner.py, using GPU_Microbenchmark, generate a gpgpu_n_mem of 3? Here is the stats.txt obtained by running
Why does the test bench give a wrong number? Here is my newest hw_def file:
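A mismatch like the gpgpu_n_mem one described above can be caught before launching any simulation. This is a rough sketch, assuming the generated config uses `-option value` lines with `#` comments. `read_options` and `find_mismatches` are hypothetical helpers, the sample text is illustrative rather than real tuner output, and the expected value 12 is the one the poster cites from the Turing white paper.

```python
def read_options(config_text: str) -> dict:
    """Collect `-option value` pairs, skipping blank lines and `#` comments."""
    options = {}
    for raw_line in config_text.splitlines():
        line = raw_line.split("#", 1)[0].strip()
        if not line.startswith("-"):
            continue
        parts = line.split(None, 1)
        options[parts[0]] = parts[1].strip() if len(parts) > 1 else ""
    return options


def find_mismatches(config_text: str, expected: dict) -> dict:
    """Map each option whose value differs from `expected` to (found, wanted)."""
    actual = read_options(config_text)
    return {
        name: (actual.get(name), wanted)
        for name, wanted in expected.items()
        if actual.get(name) != wanted
    }


# Illustrative excerpt standing in for a tuner-generated config.
generated = (
    "# generated config (illustrative)\n"
    "-gpgpu_n_mem 3\n"
    "-gpgpu_n_sub_partition_per_mchannel 2\n"
)
mismatches = find_mismatches(generated, {"-gpgpu_n_mem": "12"})
print(mismatches)
```

An empty result means every checked option already has the expected value. An option that is absent from the config is reported with `None` as the found value.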
Hi, Accel-Sim developers:
I am confused about the config. I collected my traces on a TITAN RTX (Turing architecture, shown below) on my server. Which config should I use in /home/data/userhome/liqiang/lab/gpu/accel-sim-framework/util/job_launching/configs/define-standard-cfgs.yml?
$ nvidia-smi
Tue Jul 9 16:45:54 2024
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 450.80.02    Driver Version: 450.80.02    CUDA Version: 11.0     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  TITAN RTX           Off  | 00000000:3B:00.0 Off |                  N/A |
| 44%   41C    P0    58W / 280W |      0MiB / 24220MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
|   1  TITAN RTX           Off  | 00000000:5E:00.0 Off |                  N/A |
| 49%   39C    P0    57W / 280W |      0MiB / 24220MiB |      1%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
|   2  TITAN RTX           Off  | 00000000:B1:00.0 Off |                  N/A |
| 38%   36C    P0    63W / 280W |      0MiB / 24220MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
|   3  TITAN RTX           Off  | 00000000:D9:00.0 Off |                  N/A |
| 22%   36C    P0    39W / 280W |      0MiB / 24220MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+
/home/data/userhome/liqiang/lab/gpu/accel-sim-framework/util/job_launching/configs/define-standard-cfgs.yml
#Basefile Configs
#Pascal
TITANX:
    base_file: "$GPGPUSIM_ROOT/configs/tested-cfgs/SM6_TITANX/gpgpusim.config"
TITANXX:
    base_file: "$GPGPUSIM_ROOT/configs/tested-cfgs/TITANX-pascal/gpgpusim.config"
#Kepler
TITANK:
    base_file: "$GPGPUSIM_ROOT/configs/tested-cfgs/SM3_KEPLER_TITAN/gpgpusim.config"
#Ampere RTX 3070
RTX3070:
    base_file: "$GPGPUSIM_ROOT/configs/tested-cfgs/SM86_RTX3070/gpgpusim.config"
#Turing
RTX2060:
    base_file: "$GPGPUSIM_ROOT/configs/tested-cfgs/SM75_RTX2060/gpgpusim.config"
#Turing
RTX2060_S:
    base_file: "$GPGPUSIM_ROOT/configs/tested-cfgs/SM75_RTX2060_S/gpgpusim.config"
#Volta
TITANV:
    base_file: "$GPGPUSIM_ROOT/configs/tested-cfgs/SM7_TITANV/gpgpusim.config"
#Volta
TITANV_OLD:
    base_file: "$GPGPUSIM_ROOT/configs/tested-cfgs/SM7_TITANV_OLD/gpgpusim.config"
QV100:
    base_file: "$GPGPUSIM_ROOT/configs/tested-cfgs/SM7_QV100/gpgpusim.config"
QV100_64SM:
    base_file: "$GPGPUSIM_ROOT/configs/tested-cfgs/SM7_QV100_SMs/gpgpusim.config"
QV100_SASS:
    base_file: "$GPGPUSIM_ROOT/configs/tested-cfgs/SM7_QV100_SASS/gpgpusim.config"
QV100_old:
    base_file: "$GPGPUSIM_ROOT/configs/tested-cfgs/SM7_QV100_old/gpgpusim.config"
#Fermi
GTX480:
    base_file: "$GPGPUSIM_ROOT/configs/tested-cfgs/SM2_GTX480/gpgpusim.config"
#To keep your configurations straight - we recommend specifying
#If you are using SASS or PTX in the config:
#For example: QV100-SASS or QV100-PTX.
SASS:
    extra_params: "#SASS-Driven Accel-Sim"
PTX:
    extra_params: "#PTX-Driven GPGPU-Sim"
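To make the naming scheme in the file above concrete, here is a simplified model of how a name such as TITANRTX-SASS could be resolved. It assumes the job launcher splits the name on `-`, takes `base_file` from the first token, and appends `extra_params` from each later token; that is an assumption about the launcher, not a copy of its code. The `TITANRTX` entry and its `SM75_TITANRTX` path are hypothetical and do not exist in the file shown.

```python
def resolve(config_name: str, definitions: dict) -> tuple:
    """Split a hyphenated config name into (base_file, list of extra_params)."""
    tokens = config_name.split("-")
    base_entry = definitions.get(tokens[0], {})
    if "base_file" not in base_entry:
        raise ValueError(f"{tokens[0]} does not define a base_file")
    extras = []
    for token in tokens[1:]:
        entry = definitions.get(token, {})
        if "extra_params" not in entry:
            raise ValueError(f"{token} does not define extra_params")
        extras.append(entry["extra_params"])
    return base_entry["base_file"], extras


definitions = {
    # Hypothetical entry: this name and path are not in the original file.
    "TITANRTX": {
        "base_file": "$GPGPUSIM_ROOT/configs/tested-cfgs/SM75_TITANRTX/gpgpusim.config"
    },
    # Copied from the file shown above.
    "SASS": {"extra_params": "#SASS-Driven Accel-Sim"},
}

base_file, extra_params = resolve("TITANRTX-SASS", definitions)
print(base_file)
print(extra_params)
```

Under this model, a name whose first token has no `base_file` entry cannot be resolved, so a TITAN RTX run would need its own base entry in the YAML file pointing at a config adapted for that card.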