10-9-18
- Stephen Willson (Unlicensed)
- Jelon Anderson (Deactivated)
Owned by Stephen Willson (Unlicensed)
Tip of master, commit 4c52159f3b7d13f98bbea892a61424501336c127
All tests run with ofi+psm2, ib0.
daos_test: Run with 8 server (boro-[4-11]), 2 client (boro-12,16). Killed servers, cleaned /mnt/daos in between runs listed below.
Tests requiring pool to be created via dmg used 4GB pool. These used boro-12 as client.
mpich tests used boro-4 as server, boro-12 as client, with a 1GB pool.
Test Results
daos_test
Separate runs with cleanup in between:
- -mpcCAeoRdiO - PASS
- -r - FAIL
- DAOS-1556 - Getting issue details... STATUS
- PASS with sockets
daosperf
1K Records
CREDITS=1
Expand source
[sdwillso@boro-4 ior]$ orterun -x DAOS_IMPLICIT_PURGE=1 -x FI_PSM2_DISCONNECT=1 --mca mtl ^psm2,ofi -np 1 -quiet --hostfile ~/scripts/host.cli.1 --ompi-server file:~/scripts/uri.txt -x DD_SUBSYS= -x DD_MASK= -x D_LOG_FILE=/tmp/daos_perf.log daos_perf -T daos -P 2G -d 1 -a 200 -r 1000 -s 1K -C 1 -t -z ModuleCmd_Load.c(213):ERROR:105: Unable to locate a modulefile for 'openmpi-x86_64' Test : DAOS (full stack) Parameters : pool size : 2048 MB credits : 1 (sync I/O for -ve) obj_per_cont : 1 x 1 (procs) dkey_per_obj : 1 akey_per_dkey : 200 recx_per_akey : 1000 value type : single value size : 1024 zero copy : yes overwrite : yes verify fetch : no VOS file : <NULL> 773cee88: rank 1 became pool service leader 0 Started... update successfully completed: duration : 93.750784 sec bandwith : 2.083 MB/sec rate : 2133.32 IO/sec latency : 468.754 us (nonsense if credits > 1) Duration across processes: MAX duration : 93.750784 sec MIN duration : 93.750784 sec Average duration : 93.750784 sec 773cee88: rank 1 no longer pool service leader 0
CREDITS=8
4K Records
CREDITS=1
IOR, 40GB pool, data verification enabled
Expand source
[sdwillso@boro-4 ior]$ orterun -x FI_PSM2_DISCONNECT=1 -x CRT_CTX_NUM=1 -N 1 --hostfile ~/hostlists/daos_client_hostlist --mca mtl ^psm2,ofi --ompi-server file:~/scripts/uri.txt ior -v -W -i 5 -a DAOS -w -o `uuidgen` -b 5g -t 1m -- -p 32c77913-63e2-4ac4-9a72-8471a074b00a -v 1 -r 1m -s 1m -c 1024 -a 16 -o LARGE ior WARNING: assuming POSIX-based backend for DAOS statfs call. ior WARNING: assuming POSIX-based backend for DAOS mkdir call. ior WARNING: assuming POSIX-based backend for DAOS rmdir call. ior WARNING: assuming POSIX-based backend for DAOS access call. ior WARNING: assuming POSIX-based backend for DAOS stat call. ior WARNING: assuming POSIX-based backend for DAOS statfs call. ior WARNING: assuming POSIX-based backend for DAOS mkdir call. ior WARNING: assuming POSIX-based backend for DAOS rmdir call. ior WARNING: assuming POSIX-based backend for DAOS access call. ior WARNING: assuming POSIX-based backend for DAOS stat call. IOR-3.1.0: MPI Coordinated Test of Parallel I/O Began : Wed Oct 10 19:21:43 2018 Command line : ior -v -W -i 5 -a DAOS -w -o 38685a73-cb22-4ad1-9f6e-86761c3cd327 -b 5g -t 1m -- -p 32c77913-63e2-4ac4-9a72-8471a074b00a -v 1 -r 1m -s 1m -c 1024 -a 16 -o LARGE Machine : Linux boro-12.boro.hpdd.intel.com Start time skew across all tasks: 14690266.15 sec TestID : 0 StartTime : Wed Oct 10 19:21:43 2018 Path : /home/sdwillso/ior FS : 3.8 TiB Used FS: 15.5% Inodes: 250.0 Mi Used Inodes: 2.9% Participating tasks: 2 [0] WARNING: USING daosStripeMax CAUSES READS TO RETURN INVALID DATA Options: api : DAOS apiVersion : DAOS test filename : 38685a73-cb22-4ad1-9f6e-86761c3cd327 access : single-shared-file type : independent segments : 1 ordering in a file : sequential ordering inter file : no tasks offsets tasks : 2 clients per node : 1 repetitions : 5 xfersize : 1 MiB blocksize : 5 GiB aggregate filesize : 10 GiB Results: access bw(MiB/s) block(KiB) xfer(KiB) open(s) wr/rd(s) close(s) total(s) iter ------ --------- ---------- --------- -------- -------- -------- -------- ---- Commencing write performance test: Wed Oct 10 19:21:43 2018 write 3543.51 5242880 1024.00 0.025051 2.84 0.022931 2.89 0 Verifying contents of the file(s) just written. Wed Oct 10 19:21:46 2018 remove - - - - - - 0.048828 0 Commencing write performance test: Wed Oct 10 19:21:54 2018 write 3898 5242880 1024.00 0.022545 2.58 0.021514 2.63 1 Verifying contents of the file(s) just written. Wed Oct 10 19:21:57 2018 remove - - - - - - 0.048216 1 Commencing write performance test: Wed Oct 10 19:22:04 2018 write 4385 5242880 1024.00 0.023126 2.29 0.020592 2.34 2 Verifying contents of the file(s) just written. Wed Oct 10 19:22:07 2018 remove - - - - - - 0.048469 2 Commencing write performance test: Wed Oct 10 19:22:14 2018 write 4428 5242880 1024.00 0.023201 2.27 0.022094 2.31 3 Verifying contents of the file(s) just written. Wed Oct 10 19:22:17 2018 remove - - - - - - 0.048720 3 Commencing write performance test: Wed Oct 10 19:22:24 2018 write 5082 5242880 1024.00 0.023438 1.97 0.020868 2.01 4 Verifying contents of the file(s) just written. Wed Oct 10 19:22:26 2018 remove - - - - - - 0.048331 4 Max Write: 5082.07 MiB/sec (5328.94 MB/sec) Summary of all tests: Operation Max(MiB) Min(MiB) Mean(MiB) StdDev Max(OPs) Min(OPs) Mean(OPs) StdDev Mean(s) Test# #Tasks tPN reps fPP reord reordoff reordrand seed segcnt blksiz xsize aggs(MiB) API RefNum write 5082.07 3543.51 4267.34 522.20 5082.07 3543.51 4267.34 522.20 2.43587 0 2 1 5 0 0 1 0 0 1 5368709120 1048576 10240.0 DAOS 0 Finished : Wed Oct 10 19:22:35 2018
daos_bench
kv-idx-update
Time: 621.290206 seconds (1609.553781 ops per second)
Expand source
[sdwillso@boro-4 daos_m]$ orterun -np 1 --mca mtl ^psm2,ofi --hostfile ~/hostlists/daos_client_hostlist --ompi-server file:~/scripts/uri.txt daosbench --test=kv-idx-update --testid=1 --svc=1 --dpool=a0b139a9-5eca-4163-8350-5a1bd8e2512e --container=`uuidgen` --object-class=tiny --aios=32 --indexes=1000000 ================================ DAOSBENCH (KV) Started at Tue Oct 9 19:42:52 2018 ================================= =============================== Test Setup --------------- Test: kv-idx-update DAOS pool :a0b139a9-5eca-4163-8350-5a1bd8e2512e DAOS container :697096ef-3916-43ba-8bda-b337ddedd25f Value buffer size: 64 Number of processes: 1 Number of indexes/process: 1000000 Number of asynchronous I/O: 32 =============================== kv-idx-update Time: 621.290206 seconds (1609.553781 ops per second)
kv-dkey-update
Time: 0.128483 seconds (778.311407 ops per second)
Expand source
[sdwillso@boro-4 daos_m]$ orterun -np 1 --mca mtl ^psm2,ofi --hostfile ~/hostlists/daos_client_hostlist --ompi-server file:~/scripts/uri.txt daosbench --test=kv-dkey-update --testid=1 --svc=1 --dpool=5def48df-05dd-46a9-85c7-8d48db61fda9 --container=`uuidgen` --object-class=tiny --aios=32 --indexes=1000000 ================================ DAOSBENCH (KV) Started at Tue Oct 9 22:38:13 2018 ================================= =============================== Test Setup --------------- Test: kv-dkey-update DAOS pool :5def48df-05dd-46a9-85c7-8d48db61fda9 DAOS container :d7ef1f97-0fea-46d2-bc69-cbb406c35a47 Value buffer size: 64 Number of processes: 1 Number of keys/process: 100 Number of asynchronous I/O: 32 =============================== kv-dkey-update Time: 0.128483 seconds (778.311407 ops per second) Ended at Tue Oct 9 22:38:14 2018
kv-akey-update
Time: 0.068052 seconds (1469.464258 ops per second)
Expand source
[sdwillso@boro-4 daos_m]$ orterun -np 1 --mca mtl ^psm2,ofi --hostfile ~/hostlists/daos_client_hostlist --ompi-server file:~/scripts/uri.txt daosbench --test=kv-akey-update --testid=1 --svc=1 --dpool=b649d72a-cf92-4d0f-8b7d-dd5560f9dad5 --container=`uuidgen` --object-class=tiny --aios=32 --indexes=1000000 ================================ DAOSBENCH (KV) Started at Tue Oct 9 22:40:16 2018 ================================= =============================== Test Setup --------------- Test: kv-akey-update DAOS pool :b649d72a-cf92-4d0f-8b7d-dd5560f9dad5 DAOS container :4974cbd5-161f-42ee-8725-4653ae0d311d Value buffer size: 64 Number of processes: 1 Number of keys/process: 100 Number of asynchronous I/O: 32 =============================== kv-akey-update Time: 0.068052 seconds (1469.464258 ops per second) Ended at Tue Oct 9 22:40:18 2018
kv-dkey-fetch
Time: 0.064442 seconds (1551.788574 ops per second)
Expand source
[sdwillso@boro-4 daos_m]$ orterun -np 1 --mca mtl ^psm2,ofi --hostfile ~/hostlists/daos_client_hostlist --ompi-server file:~/scripts/uri.txt daosbench --test=kv-dkey-fetch --testid=1 --svc=1 --dpool=fec3e0e6-1f9a-4d77-a70c-84650b3cec4d --container=`uuidgen` --object-class=tiny --aios=32 --indexes=1000000 ================================ DAOSBENCH (KV) Started at Tue Oct 9 22:41:52 2018 ================================= =============================== Test Setup --------------- Test: kv-dkey-fetch DAOS pool :fec3e0e6-1f9a-4d77-a70c-84650b3cec4d DAOS container :fb63293b-c694-4dc2-a115-064067d7a0a8 Value buffer size: 64 Number of processes: 1 Number of keys/process: 100 Number of asynchronous I/O: 32 =============================== kv-dkey-fetch Time: 0.064442 seconds (1551.788574 ops per second) Ended at Tue Oct 9 22:41:54 2018
kv-akey-fetch
Time: 0.041619 seconds (2402.744065 ops per second)
Expand source
[sdwillso@boro-4 daos_m]$ orterun -np 1 --mca mtl ^psm2,ofi --hostfile ~/hostlists/daos_client_hostlist --ompi-server file:~/scripts/uri.txt daosbench --test=kv-akey-fetch --testid=1 --svc=1 --dpool=28efff5c-7580-4213-baee-f20c1f8e78ff --container=`uuidgen` --object-class=tiny --aios=32 --indexes=1000000 ================================ DAOSBENCH (KV) Started at Tue Oct 9 22:43:36 2018 ================================= =============================== Test Setup --------------- Test: kv-akey-fetch DAOS pool :28efff5c-7580-4213-baee-f20c1f8e78ff DAOS container :b640feff-a78b-4309-8e7e-bf2dfb51ce16 Value buffer size: 64 Number of processes: 1 Number of keys/process: 100 Number of asynchronous I/O: 32 =============================== kv-akey-fetch Time: 0.041619 seconds (2402.744065 ops per second) Ended at Tue Oct 9 22:43:37 2018
CaRT Self-Test
Small IO
Expand source
[sdwillso@boro-4 ior]$ orterun -np 1 -ompi-server file:~/scripts/uri.txt self_test --group-name daos_server --endpoint 0:0 --message-sizes 0 --max-inflight-rpcs 16 --repetitions 100000 Adding endpoints: ranks: 0 (# ranks = 1) tags: 0 (# tags = 1) Warning: No --master-endpoint specified; using this command line application as the master endpoint Self Test Parameters: Group name to test against: daos_server # endpoints: 1 Message sizes: [(0-EMPTY 0-EMPTY)] Buffer addresses end with: <Default> Repetitions per size: 100000 Max inflight RPCs: 16 host boro-4.boro.hpdd.intel.com finished self_test duration 20.481752 S. ################################################## Results for message size (0-EMPTY 0-EMPTY) (max_inflight_rpcs = 16): Master Endpoint 0:0 ------------------- RPC Bandwidth (MB/sec): 0.00 RPC Throughput (RPCs/sec): 4882 RPC Latencies (us): Min : 1221 25th %: 3220 Median : 3241 75th %: 3262 Max : 13199 Average: 3245 Std Dev: 92.53 RPC Failures: 0 Endpoint results (rank:tag - Median Latency (us)): 0:0 - 3241
Large IO Bulk PUT
Expand source
[sdwillso@boro-4 ior]$ orterun -np 1 -ompi-server file:~/scripts/uri.txt self_test --group-name daos_server --endpoint 0:0 --message-sizes "0 b1048576" --max-inflight-rpcs 16 --repetitions 1000 Adding endpoints: ranks: 0 (# ranks = 1) tags: 0 (# tags = 1) Warning: No --master-endpoint specified; using this command line application as the master endpoint Self Test Parameters: Group name to test against: daos_server # endpoints: 1 Message sizes: [(0-EMPTY 1048576-BULK_PUT)] Buffer addresses end with: <Default> Repetitions per size: 1000 Max inflight RPCs: 16 host boro-4.boro.hpdd.intel.com finished self_test duration 0.338974 S. ################################################## Results for message size (0-EMPTY 1048576-BULK_PUT) (max_inflight_rpcs = 16): Master Endpoint 0:0 ------------------- RPC Bandwidth (MB/sec): 2950.08 RPC Throughput (RPCs/sec): 2950 RPC Latencies (us): Min : 2338 25th %: 5352 Median : 5374 75th %: 5398 Max : 6314 Average: 5358 Std Dev: 250.75 RPC Failures: 0 Endpoint results (rank:tag - Median Latency (us)): 0:0 - 5374
Large IO Bulk GET
Expand source
[sdwillso@boro-4 ior]$ orterun -np 1 -ompi-server file:~/scripts/uri.txt self_test --group-name daos_server --endpoint 0:0 --message-sizes "b1048576 0" --max-inflight-rpcs 16 --repetitions 1000 Adding endpoints: ranks: 0 (# ranks = 1) tags: 0 (# tags = 1) Warning: No --master-endpoint specified; using this command line application as the master endpoint Self Test Parameters: Group name to test against: daos_server # endpoints: 1 Message sizes: [(1048576-BULK_GET 0-EMPTY)] Buffer addresses end with: <Default> Repetitions per size: 1000 Max inflight RPCs: 16 host boro-4.boro.hpdd.intel.com finished self_test duration 0.335589 S. ################################################## Results for message size (1048576-BULK_GET 0-EMPTY) (max_inflight_rpcs = 16): Master Endpoint 0:0 ------------------- RPC Bandwidth (MB/sec): 2979.84 RPC Throughput (RPCs/sec): 2980 RPC Latencies (us): Min : 2298 25th %: 5292 Median : 5314 75th %: 5354 Max : 6064 Average: 5303 Std Dev: 244.05 RPC Failures: 0 Endpoint results (rank:tag - Median Latency (us)): 0:0 - 5314