![]() |
Project
|
This is a quick summary how to run the full system test (FST) as stress test on the EPN. (For the full FST documentation, see https://github.com/AliceO2Group/AliceO2/blob/dev/prodtests/full-system-test/documentation/full-system-test-setup.md and https://github.com/AliceO2Group/AliceO2/blob/dev/prodtests/full-system-test/documentation/full-system-test.md)
/home/drohr/alitest/tmp-fst*
. The folder with the highest number is the latest dataset. However, data formats are still evolving, and it requires rerunning the simulation regularly. I.e. please try my latest data set, if it doesn't work, please generate a new one as described below.o2-dataflow
defaults and cannot run the simulation, and also they lack readout. Thus you need to build O2PDPSuite
and Readout
(the version matching the O2PDPSuite RPM you want to use for running the test) yourself with alibuild
on an EPN: aliBuild --defaults o2 build O2PDPSuite Readout --jobs 32 --debug
. The flag --jobs
configures the number of parallel jobs and can be changed.alienv enter O2PDPSuite/latest Readout/latest
.NEvents=650 NEventsQED=10000 SHMSIZE=128000000000 TPCTRACKERSCRATCHMEMORY=40000000000 SPLITTRDDIGI=0 GENERATE_ITSMFT_DICTIONARIES=1 $O2_ROOT/prodtests/full_system_test.sh
$O2_ROOT/prodtests/full-system-test/convert-raw-to-tf-file.sh
.mv raw/timeframe raw/timeframe-org; mkdir raw/timeframe-tmpfs; ln -s timeframe-tmpfs raw/timeframe
module load O2PDPSuite
(this will load the latest O2 software installed on that EPN).sudo mount -t tmpfs tmpfs raw/timeframe-tmpfs; sudo cp raw/timeframe-org/* raw/timeframe
mount | grep tmpfs
), and don't mount multiple of them since memory is critical!)TFDELAY=5
.echo 1 > /proc/sys/vm/drop_caches
start-tmux.sh
, the env variables are set automatically to access the EPN-internal CCDB server.TFDELAY=2.5 NTIMEFRAMES=1000000 $O2_ROOT/prodtests/full-system-test/start_tmux.sh dd
This will start a tmux session with 3 shells, the upper 2 shells are the 2 DPL workflows, one per NUMA domain, for the processing. The lower shell is the input with DataDistribution's StfBuilder. Leave it running and check that the StfBuilder doesn't complain that its buffer is full. Then the EPN can sustain the rate.
reconstruction_t0
to reconstruction_t3
)