7-9-18
- Stephen Willson
- Jelon Anderson
Owned by Stephen Willson
Test Configuration
Tip of master, commit 9acabe9a46f61d6726d8e056253c75f515004726
All tests were run with ofi+psm2 over ib0.
daos_test: Run with 8 servers (boro-[4-11]) and 4 clients (boro-[12-14,27]). Servers were killed and /mnt/daos was cleaned between the runs listed below.
Tests requiring a pool to be created via dmg used a 4GB pool, with boro-12 as the client.
mpich tests used boro-4 as the server and boro-12 as the client, with a 1GB pool.
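The bracketed host ranges above can be expanded to confirm the node counts. The following is a minimal sketch, assuming only the single-bracket form used on this page; `expand_hostlist` is a hypothetical helper, not part of DAOS or of any hostlist tool.

```python
import re

def expand_hostlist(spec):
    """Expand a single-bracket host range such as 'boro-[12-14,27]'."""
    match = re.fullmatch(r"([^\[\]]*)\[([^\[\]]+)\]([^\[\]]*)", spec)
    if match is None:
        return [spec]  # no bracket group: treat as a plain host name
    prefix, body, suffix = match.groups()
    hosts = []
    for part in body.split(","):
        if "-" in part:
            # inclusive numeric range, e.g. "12-14"
            low, high = part.split("-", 1)
            hosts.extend(f"{prefix}{n}{suffix}" for n in range(int(low), int(high) + 1))
        else:
            hosts.append(f"{prefix}{part}{suffix}")
    return hosts

servers = expand_hostlist("boro-[4-11]")
clients = expand_hostlist("boro-[12-14,27]")
print(len(servers), servers)
print(len(clients), clients)
```

This yields 8 server names and 4 client names, matching the counts stated in the configuration.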
Test Results
daos_test
Separate runs with cleanup in between:
- -mpcCiAeoRd - PASS
- -r - FAIL
- DAOS-1113
- -O - PASS
daos_perf
1K Records
CREDITS=1
[sdwillso@boro-4 ~]$ CREDITS=1 ./daos_m/src/tests/daos_perf.sh daos 200 1000 1K
+ /home/sdwillso/daos/daos_m/opt/ompi/bin/orterun -quiet --hostfile /home/sdwillso/scripts/host.cli.1 --ompi-server file:/home/sdwillso/scripts/uri.txt -x DD_SUBSYS= -x DD_MASK= -x D_LOG_FILE=/tmp/daos_perf.log /home/sdwillso/daos/daos_m/install/bin/daos_perf -T daos -P 2G -d 1 -a 200 -r 1000 -s 1K -C 1 -t -z
Test :
    DAOS (full stack)
Parameters :
    pool size : 2048 MB
    credits : 1 (sync I/O for -ve)
    obj_per_cont : 1 x 2 (procs)
    dkey_per_obj : 1
    akey_per_dkey : 200
    recx_per_akey : 1000
    value type : single
    value size : 1024
    zero copy : yes
    overwrite : yes
    verify fetch : no
    VOS file : <NULL>
5d7c6644: rank 1 became pool service leader 0
Started...
update successfully completed:
    duration : 20.598233 sec
    bandwith : 18.964 MB/sec
    rate : 19419.14 IO/sec
    latency : 51.496 us (nonsense if credits > 1)
Duration across processes:
    MAX duration : 20.598233 sec
    MIN duration : 4.910205 sec
    Average duration : 12.754219 sec
5d7c6644: rank 1 no longer pool service leader 0
CREDITS=8
[sdwillso@boro-4 ~]$ CREDITS=8 ./daos_m/src/tests/daos_perf.sh daos 200 1000 1K
+ /home/sdwillso/daos/daos_m/opt/ompi/bin/orterun -quiet --hostfile /home/sdwillso/scripts/host.cli.1 --ompi-server file:/home/sdwillso/scripts/uri.txt -x DD_SUBSYS= -x DD_MASK= -x D_LOG_FILE=/tmp/daos_perf.log /home/sdwillso/daos/daos_m/install/bin/daos_perf -T daos -P 2G -d 1 -a 200 -r 1000 -s 1K -C 8 -t -z
Test :
    DAOS (full stack)
Parameters :
    pool size : 2048 MB
    credits : 8 (sync I/O for -ve)
    obj_per_cont : 1 x 2 (procs)
    dkey_per_obj : 1
    akey_per_dkey : 200
    recx_per_akey : 1000
    value type : single
    value size : 1024
    zero copy : yes
    overwrite : yes
    verify fetch : no
    VOS file : <NULL>
6e34a4b5: rank 1 became pool service leader 0
Started...
Authentication failed.
update successfully completed:
    duration : 19.963876 sec
    bandwith : 19.567 MB/sec
    rate : 20036.19 IO/sec
    latency : 49.910 us (nonsense if credits > 1)
Duration across processes:
    MAX duration : 19.963876 sec
    MIN duration : 3.151514 sec
    Average duration : 11.557695 sec
6e34a4b5: rank 1 no longer pool service leader 0
4K Records
CREDITS=1
[sdwillso@boro-4 ~]$ CREDITS=1 ./daos_m/src/tests/daos_perf.sh daos 200 1000 4K
+ /home/sdwillso/daos/daos_m/opt/ompi/bin/orterun -quiet --hostfile /home/sdwillso/scripts/host.cli.1 --ompi-server file:/home/sdwillso/scripts/uri.txt -x DD_SUBSYS= -x DD_MASK= -x D_LOG_FILE=/tmp/daos_perf.log /home/sdwillso/daos/daos_m/install/bin/daos_perf -T daos -P 2G -d 1 -a 200 -r 1000 -s 4K -C 1 -t -z
Test :
    DAOS (full stack)
Parameters :
    pool size : 2048 MB
    credits : 1 (sync I/O for -ve)
    obj_per_cont : 1 x 2 (procs)
    dkey_per_obj : 1
    akey_per_dkey : 200
    recx_per_akey : 1000
    value type : single
    value size : 4096
    zero copy : yes
    overwrite : yes
    verify fetch : no
    VOS file : <NULL>
b1c019d3: rank 1 became pool service leader 0
Started...
update successfully completed:
    duration : 22.928364 sec
    bandwith : 68.147 MB/sec
    rate : 17445.64 IO/sec
    latency : 57.321 us (nonsense if credits > 1)
Duration across processes:
    MAX duration : 22.928364 sec
    MIN duration : 6.717804 sec
    Average duration : 14.823084 sec
b1c019d3: rank 1 no longer pool service leader 0
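The rate, bandwidth, and latency figures in the three daos_perf runs can be cross-checked from the printed parameters. The sketch below assumes that each of the 2 processes issues one update per record (akey_per_dkey x recx_per_akey), that the rate is computed against the MAX duration, and that "MB" in the output means 2**20 bytes. These are inferences from the numbers, not taken from the daos_perf source.

```python
# (value size in bytes, credits, reported MAX duration in seconds) from the runs above
PERF_RUNS = [
    (1024, 1, 20.598233),
    (1024, 8, 19.963876),
    (4096, 1, 22.928364),
]

PROCS = 2      # "obj_per_cont : 1 x 2 (procs)"
AKEYS = 200    # akey_per_dkey
RECX = 1000    # recx_per_akey

def derive(value_size, duration):
    total_ios = PROCS * AKEYS * RECX        # assumed: one update per record per process
    rate = total_ios / duration             # IO/sec
    bandwidth = rate * value_size / 2**20   # MB/sec, assuming 1 MB = 2**20 bytes
    latency_us = 1e6 / rate                 # only meaningful when credits == 1
    return rate, bandwidth, latency_us

results = [derive(size, duration) for size, _credits, duration in PERF_RUNS]
for (size, credits, _duration), (rate, bw, lat) in zip(PERF_RUNS, results):
    print(f"size={size} credits={credits}: {rate:.2f} IO/sec, {bw:.3f} MB/sec, {lat:.3f} us")
```

Under those assumptions the derived values agree with the reported ones for all three runs, which suggests the reported latency is simply the inverse of the rate. That would explain why the tool flags it as meaningless when credits > 1.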
IOR, 10GB pool, data verification enabled
[sdwillso@boro-4 ~]$ orterun -np 1 --hostfile ~/hostlists/daos_client_hostlist --mca mtl ^psm2,ofi --ompi-server file:~/scripts/uri.txt ior -v -W -i 5 -a DAOS -w -o `uuidgen` -b 5g -t 1m -O daospool=287c595a-e291-4e1f-bd12-203add1ce6c7,daosrecordsize=1m,daosstripesize=1m,daosstripecount=1024,daosaios=16,daosobjectclass=LARGE,daosPoolSvc=1,daosepoch=1
IOR-3.0.1: MPI Coordinated Test of Parallel I/O
Began: Mon Jul 9 18:48:34 2018
Command line used: ior -v -W -i 5 -a DAOS -w -o b08692c6-8b97-4bc5-8650-5fb3ea631a11 -b 5g -t 1m -O daospool=287c595a-e291-4e1f-bd12-203add1ce6c7,daosrecordsize=1m,daosstripesize=1m,daosstripecount=1024,daosaios=16,daosobjectclass=LARGE,daosPoolSvc=1,daosepoch=1
Machine: Linux boro-12.boro.hpdd.intel.com
Start time skew across all tasks: 0.00 sec
Test 0 started: Mon Jul 9 18:48:34 2018
Path: /home/sdwillso
FS: 3.8 TiB Used FS: 11.3% Inodes: 250.0 Mi Used Inodes: 2.3%
Participating tasks: 1
[0] WARNING: USING daosStripeMax CAUSES READS TO RETURN INVALID DATA
Summary:
    api = DAOS
    test filename = b08692c6-8b97-4bc5-8650-5fb3ea631a11
    access = single-shared-file, independent
    pattern = segmented (1 segment)
    ordering in a file = sequential offsets
    ordering inter file= no tasks offsets
    clients = 1 (1 per node)
    repetitions = 5
    xfersize = 1 MiB
    blocksize = 5 GiB
    aggregate filesize = 5 GiB
access  bw(MiB/s)  block(KiB)  xfer(KiB)  open(s)   wr/rd(s)  close(s)  total(s)  iter
------  ---------  ----------  ---------  --------  --------  --------  --------  ----
Commencing write performance test: Mon Jul 9 18:48:34 2018
write   5159       5242880     1024.00    0.001189  0.989069  0.002148  0.992428  0
Verifying contents of the file(s) just written.
Mon Jul 9 18:48:35 2018
remove  -          -           -          -         -         -         0.002572  0
Commencing write performance test: Mon Jul 9 18:48:38 2018
write   5597       5242880     1024.00    0.000863  0.910895  0.002950  0.914720  1
Verifying contents of the file(s) just written.
Mon Jul 9 18:48:39 2018
remove  -          -           -          -         -         -         0.002403  1
Commencing write performance test: Mon Jul 9 18:48:42 2018
write   5572       5242880     1024.00    0.000843  0.916124  0.001854  0.918837  2
Verifying contents of the file(s) just written.
Mon Jul 9 18:48:43 2018
remove  -          -           -          -         -         -         0.002374  2
Commencing write performance test: Mon Jul 9 18:48:47 2018
write   5623       5242880     1024.00    0.000807  0.907463  0.002305  0.910585  3
Verifying contents of the file(s) just written.
Mon Jul 9 18:48:48 2018
remove  -          -           -          -         -         -         0.002329  3
Commencing write performance test: Mon Jul 9 18:48:51 2018
write   5633       5242880     1024.00    0.000825  0.905196  0.002821  0.908855  4
Verifying contents of the file(s) just written.
Mon Jul 9 18:48:52 2018
remove  -          -           -          -         -         -         0.002288  4
Max Write: 5633.46 MiB/sec (5907.12 MB/sec)
Summary of all tests:
Operation Max(MiB) Min(MiB) Mean(MiB) StdDev Mean(s) Test# #Tasks tPN reps fPP reord reordoff reordrand seed segcnt blksiz xsize aggsize API RefNum
write 5633.46 5159.06 5516.98 180.21 0.92908 0 1 1 5 0 0 1 0 0 1 5368709120 1048576 5368709120 DAOS 0
Finished: Mon Jul 9 18:48:57 2018
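The IOR summary row can be reproduced from the per-iteration total(s) column. This is a sketch under the assumption that each iteration's bandwidth is the aggregate file size divided by total(s), and that the StdDev column is a population standard deviation. Both are inferred from the numbers above rather than from the IOR source.

```python
from statistics import mean, pstdev

AGGREGATE_MIB = 5 * 1024   # "aggregate filesize = 5 GiB"
# total(s) column of the five "write" rows above
TOTAL_SECONDS = [0.992428, 0.914720, 0.918837, 0.910585, 0.908855]

# Assumed: per-iteration bandwidth in MiB/s = aggregate size / total time
bandwidths = [AGGREGATE_MIB / t for t in TOTAL_SECONDS]

print(f"max   {max(bandwidths):.2f} MiB/s")
print(f"min   {min(bandwidths):.2f} MiB/s")
print(f"mean  {mean(bandwidths):.2f} MiB/s")
print(f"stdev {pstdev(bandwidths):.2f} (population standard deviation)")
print(f"mean total time {mean(TOTAL_SECONDS):.5f} s")
```

The derived maximum, minimum, mean, and standard deviation line up with the Max(MiB), Min(MiB), Mean(MiB), and StdDev columns to within the rounding of the printed times.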
daosbench
kv-idx-update
Time: 20.167734 seconds (49584.152068 ops per second)
[sdwillso@boro-4 ~]$ orterun -np 1 --mca mtl ^psm2,ofi --hostfile ~/hostlists/daos_client_hostlist --ompi-server file:~/scripts/uri.txt daosbench --test=kv-idx-update --testid=1 --svc=1 --dpool=7e2e7c3e-4f0d-46f9-add7-76216ffaaf4d --container=`uuidgen` --object-class=tiny --aios=32 --indexes=1000000
================================
DAOSBENCH (KV)
Started at
Mon Jul 9 18:57:00 2018
=================================
===============================
Test Setup
---------------
Test: kv-idx-update
DAOS pool :7e2e7c3e-4f0d-46f9-add7-76216ffaaf4d
DAOS container :d21bec33-6161-4f2a-9bff-9e39fb2ddeb1
Value buffer size: 64
Number of processes: 1
Number of indexes/process: 1000000
Number of asynchronous I/O: 32
===============================
kv-idx-update
Time: 20.167734 seconds (49584.152068 ops per second)
kv-dkey-update
Time: 0.003268 seconds (30598.336293 ops per second)
[sdwillso@boro-4 ~]$ orterun -np 1 --mca mtl ^psm2,ofi --hostfile ~/hostlists/daos_client_hostlist --ompi-server file:~/scripts/uri.txt daosbench --test=kv-dkey-update --testid=1 --svc=1 --dpool=8bc543b2-6bdb-4eb7-b36a-596e00822341 --container=`uuidgen` --object-class=tiny --aios=32 --indexes=1000000
================================
DAOSBENCH (KV)
Started at
Mon Jul 9 18:59:19 2018
=================================
===============================
Test Setup
---------------
Test: kv-dkey-update
DAOS pool :8bc543b2-6bdb-4eb7-b36a-596e00822341
DAOS container :8ff173da-dd42-403f-8a62-c12c09da8b83
Value buffer size: 64
Number of processes: 1
Number of keys/process: 100
Number of asynchronous I/O: 32
===============================
kv-dkey-update
Time: 0.003268 seconds (30598.336293 ops per second)
kv-akey-update
Time: 0.003016 seconds (33155.355491 ops per second)
[sdwillso@boro-4 ~]$ orterun -np 1 --mca mtl ^psm2,ofi --hostfile ~/hostlists/daos_client_hostlist --ompi-server file:~/scripts/uri.txt daosbench --test=kv-akey-update --testid=1 --svc=1 --dpool=671044ba-bbec-4040-ae45-28d075592c18 --container=`uuidgen` --object-class=tiny --aios=32 --indexes=1000000
================================
DAOSBENCH (KV)
Started at
Mon Jul 9 19:00:53 2018
=================================
===============================
Test Setup
---------------
Test: kv-akey-update
DAOS pool :671044ba-bbec-4040-ae45-28d075592c18
DAOS container :6a150423-9330-4104-8179-8d5d54506c94
Value buffer size: 64
Number of processes: 1
Number of keys/process: 100
Number of asynchronous I/O: 32
===============================
kv-akey-update
Time: 0.003016 seconds (33155.355491 ops per second)
kv-dkey-fetch
Time: 0.001232 seconds (81155.207275 ops per second)
[sdwillso@boro-4 ~]$ orterun -np 1 --mca mtl ^psm2,ofi --hostfile ~/hostlists/daos_client_hostlist --ompi-server file:~/scripts/uri.txt daosbench --test=kv-dkey-fetch --testid=1 --svc=1 --dpool=85858409-ae40-45e4-b32a-5b90b009a1d9 --container=`uuidgen` --object-class=tiny --aios=32 --indexes=1000000
================================
DAOSBENCH (KV)
Started at
Mon Jul 9 19:02:22 2018
=================================
===============================
Test Setup
---------------
Test: kv-dkey-fetch
DAOS pool :85858409-ae40-45e4-b32a-5b90b009a1d9
DAOS container :0425eb16-2714-4ac4-9b0e-048d9be38873
Value buffer size: 64
Number of processes: 1
Number of keys/process: 100
Number of asynchronous I/O: 32
===============================
kv-dkey-fetch
Time: 0.001232 seconds (81155.207275 ops per second)
kv-akey-fetch
Time: 0.001274 seconds (78512.856391 ops per second)
[sdwillso@boro-4 ~]$ orterun -np 1 --mca mtl ^psm2,ofi --hostfile ~/hostlists/daos_client_hostlist --ompi-server file:~/scripts/uri.txt daosbench --test=kv-akey-fetch --testid=1 --svc=1 --dpool=088d22fd-5389-4849-a94f-4877b1c43366 --container=`uuidgen` --object-class=tiny --aios=32 --indexes=1000000
================================
DAOSBENCH (KV)
Started at
Mon Jul 9 19:03:44 2018
=================================
===============================
Test Setup
---------------
Test: kv-akey-fetch
DAOS pool :088d22fd-5389-4849-a94f-4877b1c43366
DAOS container :d2c93780-ba42-4aff-a690-0863c4f229f8
Value buffer size: 64
Number of processes: 1
Number of keys/process: 100
Number of asynchronous I/O: 32
===============================
kv-akey-fetch
Time: 0.001274 seconds (78512.856391 ops per second)
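The daosbench ops-per-second figures can be sanity-checked as operation count divided by elapsed time. The sketch below assumes the operation count is the "Number of indexes/process" or "Number of keys/process" value from each Test Setup block multiplied by the single process. That reading is inferred from the output, not confirmed from the daosbench source.

```python
import math

# (test name, assumed operation count, reported seconds, reported ops/sec)
BENCH_RUNS = [
    ("kv-idx-update", 1_000_000, 20.167734, 49584.152068),
    ("kv-dkey-update", 100, 0.003268, 30598.336293),
    ("kv-akey-update", 100, 0.003016, 33155.355491),
    ("kv-dkey-fetch", 100, 0.001232, 81155.207275),
    ("kv-akey-fetch", 100, 0.001274, 78512.856391),
]

def ops_per_second(count, seconds):
    return count / seconds

derived = {name: ops_per_second(count, secs) for name, count, secs, _ in BENCH_RUNS}
for name, _count, _secs, reported in BENCH_RUNS:
    # The printed time is rounded to microseconds, so allow a small relative error
    ok = math.isclose(derived[name], reported, rel_tol=1e-3)
    print(f"{name}: derived {derived[name]:.1f}, reported {reported:.1f}, consistent={ok}")
```

Note that the four dkey/akey tests report 100 keys per process even though --indexes=1000000 was passed, so each of those measurements covers only a few milliseconds of work.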
mpich tests
Results: Over psm2, the tests failed immediately with what I believe is CART-470.