...
All tests were run with ofi+psm2 over ib0, with the exception of IOR, which was run with sockets. See NOTE below.
daos_test: Run with 8 servers (boro-[3-10]) and 4 clients (boro-[11-14]). Servers were killed and /mnt/daos was cleaned between each of the runs listed below.
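The bracketed node ranges above can be expanded into individual host names, for example to build or check a hostfile. This is a minimal sketch only: `expand_hostlist` is a hypothetical helper, not part of DAOS or Open MPI, and it handles just the simple `prefix-[lo-hi]` form used on this page.

```python
import re


def expand_hostlist(spec: str) -> list[str]:
    """Expand a simple 'prefix[lo-hi]suffix' range into individual host names."""
    match = re.fullmatch(r"(.*)\[(\d+)-(\d+)\](.*)", spec)
    if match is None:
        return [spec]  # no bracketed range: treat the spec as a single host
    prefix, lo, hi, suffix = match.groups()
    return [f"{prefix}{n}{suffix}" for n in range(int(lo), int(hi) + 1)]


servers = expand_hostlist("boro-[3-10]")
clients = expand_hostlist("boro-[11-14]")

print(f"{len(servers)} servers, {len(clients)} clients")
print("\n".join(clients))
```

Writing the expanded client list one name per line gives the usual plain-text hostfile layout; the actual contents of ~/hostlists/daos_client_hostlist are not shown on this page.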
...
daosbench and daos_perf were both run with DAOS_IMPLICIT_PURGE=1.
NOTE: IOR was run with sockets due to a consistent error at the start of the test:
Code Block |
---|
[sdwillso@boro-11 ~]$ orterun -np 1 --hostfile ~/hostlists/daos_client_hostlist --mca mtl ^psm2,ofi --ompi-server file:~/scripts/uri.txt ior -v -W -i 5 -a DAOS -w -o `uuidgen` -b 5g -t 1m -O daospool=00c026da-1231-4ba8-b676-95a71fa1fea5,daosrecordsize=1m,daosstripesize=1m,daosstripecount=1024,daosaios=16,daosobjectclass=LARGE,daosPoolSvc=1,daosepoch=1
IOR-3.0.1: MPI Coordinated Test of Parallel I/O
Began: Tue May 29 22:11:38 2018
Command line used: ior -v -W -i 5 -a DAOS -w -o 657f6932-7df3-40e6-8e65-76c5ca18bf07 -b 5g -t 1m -O daospool=00c026da-1231-4ba8-b676-95a71fa1fea5,daosrecordsize=1m,daosstripesize=1m,daosstripecount=1024,daosaios=16,daosobjectclass=LARGE,daosPoolSvc=1,daosepoch=1
Machine: Linux boro-11.boro.hpdd.intel.com
Start time skew across all tasks: 0.00 sec
Test 0 started: Tue May 29 22:11:38 2018
Path: /home/sdwillso
FS: 3.8 TiB Used FS: 9.3% Inodes: 250.0 Mi Used Inodes: 1.9%
Participating tasks: 1
[0] WARNING: USING daosStripeMax CAUSES READS TO RETURN INVALID DATA
boro-11.boro.hpdd.intel.com.19512Received eager message(s) ptype=0x1 opcode=0xc9 from an unknown process (err=49)
boro-11.boro.hpdd.intel.com.19512Received eager message(s) ptype=0x1 opcode=0xc9 from an unknown process (err=49)
boro-11.boro.hpdd.intel.com.19512Received eager message(s) ptype=0x1 opcode=0xc9 from an unknown process (err=49)
boro-11.boro.hpdd.intel.com.19512Received eager message(s) ptype=0x1 opcode=0xc9 from an unknown process (err=49)
...
(message repeats until interrupted with Ctrl+C) |
Suggested remedies involved ensuring the nodes were clean. All processes were cleaned up, /mnt/daos/ was cleaned, and the nodes were rebooted, but the same issue still occurs.
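Because the message repeats indefinitely, it may help to detect it in captured output and stop the run early. The sketch below counts occurrences of the error line shown above. The regular expression is derived only from that sample line, and reading the number after the hostname as a process ID is an assumption; `count_eager_errors` is a hypothetical helper.

```python
import re
from collections import Counter

# Pattern built from the sample log line; the numeric field after the
# hostname is assumed (not confirmed) to be a process ID.
EAGER_ERR = re.compile(
    r"^(?P<host>\S+?)\.(?P<pid>\d+)Received eager message\(s\) "
    r"ptype=(?P<ptype>0x[0-9a-fA-F]+) opcode=(?P<opcode>0x[0-9a-fA-F]+) "
    r"from an unknown process \(err=(?P<err>\d+)\)$"
)


def count_eager_errors(lines):
    """Count matching error lines, keyed by (host, opcode, err)."""
    counts = Counter()
    for line in lines:
        m = EAGER_ERR.match(line.strip())
        if m:
            counts[(m["host"], m["opcode"], int(m["err"]))] += 1
    return counts


sample = [
    "[0] WARNING: USING daosStripeMax CAUSES READS TO RETURN INVALID DATA",
] + [
    "boro-11.boro.hpdd.intel.com.19512Received eager message(s) "
    "ptype=0x1 opcode=0xc9 from an unknown process (err=49)"
] * 4

errors = count_eager_errors(sample)
for (host, opcode, err), n in errors.items():
    print(f"{host}: opcode={opcode} err={err} seen {n} times")
```

A wrapper script could feed the job's output through this and abort once the count passes some threshold, instead of waiting for a manual Ctrl+C.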
Test Results
daos_test
Separate runs with cleanup in between:
...
CREDITS=8
4K Records
CREDITS=1
IOR w/sockets, 2 clients, 10GB pool, data verification enabled
Code Block | ||||
---|---|---|---|---|
| ||||
[sdwillso@boro-11 ~]$ orterun -np 1 --hostfile ~/hostlists/daos_client_hostlist --ompi-server file:~/scripts/uri.txt ior -v -W -i 5 -a DAOS -w -o `uuidgen` -b 5g -t 1m -O daospool=0c35d24c-df37-43e3-9283-0150272956df,daosrecordsize=1m,daosstripesize=1m,daosstripecount=1024,daosaios=16,daosobjectclass=LARGE,daosPoolSvc=1,daosepoch=1
IOR-3.0.1: MPI Coordinated Test of Parallel I/O
Began: Tue May 29 22:14:58 2018
Command line used: ior -v -W -i 5 -a DAOS -w -o 882d642f-892c-4c13-9f6e-2cd625bb5981 -b 5g -t 1m -O daospool=0c35d24c-df37-43e3-9283-0150272956df,daosrecordsize=1m,daosstripesize=1m,daosstripecount=1024,daosaios=16,daosobjectclass=LARGE,daosPoolSvc=1,daosepoch=1
Machine: Linux boro-11.boro.hpdd.intel.com
Start time skew across all tasks: 0.00 sec
Test 0 started: Tue May 29 22:14:58 2018
Path: /home/sdwillso
FS: 3.8 TiB Used FS: 9.3% Inodes: 250.0 Mi Used Inodes: 1.9%
Participating tasks: 1
[0] WARNING: USING daosStripeMax CAUSES READS TO RETURN INVALID DATA
Summary:
api = DAOS
test filename = 882d642f-892c-4c13-9f6e-2cd625bb5981
access = single-shared-file, independent
pattern = segmented (1 segment)
ordering in a file = sequential offsets
ordering inter file= no tasks offsets
clients = 1 (1 per node)
repetitions = 5
xfersize = 1 MiB
blocksize = 5 GiB
aggregate filesize = 5 GiB
access bw(MiB/s) block(KiB) xfer(KiB) open(s) wr/rd(s) close(s) total(s) iter
------ --------- ---------- --------- -------- -------- -------- -------- ----
Commencing write performance test: Tue May 29 22:14:58 2018
write 1888.81 5242880 1024.00 0.002534 2.70 0.007510 2.71 0
Verifying contents of the file(s) just written.
Tue May 29 22:15:00 2018
remove - - - - - - 0.003222 0
Commencing write performance test: Tue May 29 22:15:07 2018
write 1958.07 5242880 1024.00 0.001571 2.61 0.005053 2.61 1
Verifying contents of the file(s) just written.
Tue May 29 22:15:10 2018
remove - - - - - - 0.003029 1
Commencing write performance test: Tue May 29 22:15:16 2018
write 1971.30 5242880 1024.00 0.001469 2.59 0.010100 2.60 2
Verifying contents of the file(s) just written.
Tue May 29 22:15:19 2018
remove - - - - - - 0.003084 2
Commencing write performance test: Tue May 29 22:15:25 2018
write 1988.23 5242880 1024.00 0.001451 2.57 0.004563 2.58 3
Verifying contents of the file(s) just written.
Tue May 29 22:15:28 2018
remove - - - - - - 0.003083 3
Commencing write performance test: Tue May 29 22:15:34 2018
write 1986.93 5242880 1024.00 0.001568 2.57 0.007999 2.58 4
Verifying contents of the file(s) just written.
Tue May 29 22:15:37 2018
remove - - - - - - 0.003055 4
Max Write: 1988.23 MiB/sec (2084.81 MB/sec)
Summary of all tests:
Operation Max(MiB) Min(MiB) Mean(MiB) StdDev Mean(s) Test# #Tasks tPN reps fPP reord reordoff reordrand seed segcnt blksiz xsize aggsize API RefNum
write 1988.23 1888.81 1958.67 36.64 2.61496 0 1 1 5 0 0 1 0 0 1 5368709120 1048576 5368709120 DAOS 0
Finished: Tue May 29 22:15:43 2018 |
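As a cross-check, the summary row can be recomputed from the five per-iteration write bandwidths in the table above. This is a sketch using only values copied from the IOR output. Using the population standard deviation is an assumption, chosen because it appears to match the reported StdDev.

```python
import statistics

# Per-iteration write bandwidth in MiB/s, copied from the IOR output above.
write_bw_mib = [1888.81, 1958.07, 1971.30, 1988.23, 1986.93]

max_bw = max(write_bw_mib)
min_bw = min(write_bw_mib)
mean_bw = statistics.fmean(write_bw_mib)
# Population standard deviation (divide by N, not N - 1); assumed to be
# what the IOR summary reports.
stddev_bw = statistics.pstdev(write_bw_mib)

# 1 MiB = 1048576 bytes, 1 MB = 1000000 bytes.
max_bw_mb = max_bw * 1048576 / 1_000_000

print(f"Max={max_bw:.2f} Min={min_bw:.2f} Mean={mean_bw:.2f} StdDev={stddev_bw:.2f}")
print(f"Max Write: {max_bw:.2f} MiB/sec ({max_bw_mb:.2f} MB/sec)")
```

The spread is small relative to the mean, and the lowest value is the first iteration (1888.81 MiB/s).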
daosbench
kv-idx-update
kv-dkey-update
...