Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Repair Jira Macros

...

Code Block
linenumberstrue
collapsetrue
[sdwillso@boro-4 ~]$ orterun --mca mtl ^psm2,ofi -N 1 -quiet --hostfile ~/scripts/host.cli.1 --ompi-server file:~/scripts/uri.txt -x DD_SUBSYS= -x DD_MASK= -x D_LOG_FILE=/tmp/daos_perf.log daos_perf -T daos -P 2G -d 1 -a 200 -r 1000 -s 1K -C 1 -t -z
Test :
	DAOS (full stack)
Parameters :
	pool size     : 2048 MB
	credits       : 1 (sync I/O for -ve)
	obj_per_cont  : 1 x 2 (procs)
	dkey_per_obj  : 1
	akey_per_dkey : 200
	recx_per_akey : 1000
	value type    : single
	value size    : 1024
	zero copy     : yes
	overwrite     : yes
	verify fetch  : no
	VOS file      : <NULL>
151265ce: rank 1 became pool service leader 0
Started...
update successfully completed:
	duration : 91.945923  sec
	bandwith : 4.248      MB/sec
	rate     : 4350.38    IO/sec
	latency  : 229.865    us (nonsense if credits > 1)
Duration across processes:
	MAX duration : 91.945923  sec
	MIN duration : 88.257693  sec
	Average duration : 90.101808  sec
151265ce: rank 1 no longer pool service leader 0

CREDITS=8

4K Records

CREDITS=1

IOR, 10GB pool, data verification enabled

daos_bench

kv-idx-update

  • At end of this test with multiple servers, container destroy failshitting segfault
    • Jira Legacy
      serverHPDD Community JiraSystem JIRA
      columnskey,summary,type,created,updated,due,assignee,reporter,priority,status,resolution
      serverId8bba2dd1f325724b-4333f7c9-300634db-bfcdbd1c-f35d4ebbd2ad69d12ec98a69
      keyDAOSCART-1243
Time: 580.999753 seconds (1721.171127 ops per second)

...

linenumberstrue
collapsetrue

...

    • 496

4K Records

CREDITS=1

  • hitting segfault
    • Jira Legacy
      serverSystem JIRA
      columnskey,summary,type,created,updated,due,assignee,reporter,priority,status,resolution
      serverIdf325724b-f7c9-34db-bd1c-69d12ec98a69
      keyCART-496

IOR, 10GB pool, data verification enabled

daos_bench

kv-idx-update

  • At end of this test with multiple servers, container destroy fails
    • Jira Legacy
      serverSystem JIRA
      columnskey,summary,type,created,updated,due,assignee,reporter,priority,status,resolution
      serverIdf325724b-f7c9-34db-bd1c-69d12ec98a69
      keyDAOS-1243
Time: 580.999753 seconds (1721.171127 ops per second)
Code Block
linenumberstrue
collapsetrue
[sdwillso@boro-4 ~]$ orterun -np 1 --mca mtl ^psm2,ofi  --hostfile ~/hostlists/daos_client_hostlist --ompi-server file:~/scripts/uri.txt daosbench --test=kv-idx-update --testid=1 --svc=1 --dpool=c8ba8257-0b50-47c5-8d4f-bc08ebc1e907 --container=`uuidgen` --object-class=tiny --aios=32 --indexes=1000000
================================
DAOSBENCH (KV)
Started at
Wed Sep  5 21:50:45 2018
=================================
===============================
Test Setup
---------------
Test: kv-idx-update
DAOS pool :c8ba8257-0b50-47c5-8d4f-bc08ebc1e907
DAOS container :dd26496c-2bd3-4583-81eb-a0b826441022
Value buffer size: 64
Number of processes: 1
Number of indexes/process: 1000000
Number of asynchronous I/O: 32
===============================
kv-idx-update
Time: 580.999753 seconds (1721.171127 ops per second)

...

Code Block
linenumberstrue
collapsetrue
[sdwillso@boro-4 ~]$ orterun -N 1 --mca mtl ^psm2,ofi  --hostfile ~/hostlists/daos_client_hostlist --ompi-server file:~/scripts/uri.txt daosbench --test=kv-akey-fetch --testid=1 --svc=1 --dpool=af6b05aa-ae57-43ee-8552-f89dd7f6f271 --container=`uuidgen` --object-class=tiny --aios=32 --indexes=1000000
================================
DAOSBENCH (KV)
Started at
Wed Sep  5 23:08:02 2018
=================================
===============================
Test Setup
======
Test Setup
---------------
Test: kv-akey-fetch
DAOS pool :af6b05aa-ae57-43ee-8552-f89dd7f6f271
DAOS container :c684d271-6870-4f68-8ba5-ae3920a528e6
Value buffer size: 64
Number of processes: 2
Number of keys/process: 100
Number of asynchronous I/O: 32
===============================
kv-akey-fetch
Time: 0.146468 seconds (1365.487761 ops per second)

CaRT Self-Test

Small IO

Code Block
linenumberstrue
collapsetrue
[sdwillso@boro-4 ~]$ orterun -np 1 -ompi-server file:~/scripts/uri.txt self_test --group-name daos_server --endpoint 0:0 --message-sizes 0 --max-inflight-rpcs 16 --repetitions 100000
Adding endpoints:
  ranks: 0 (# ranks = 1)
  tags: 0 (# tags = 1)
Warning: No --master-endpoint specified; using this command line application as the master endpoint
Self Test Parameters:
  Group name to test against: daos_server
  # endpoints:                1
  Message sizes:              [(0-EMPTY 0-EMPTY)]
  Buffer addresses end with:  <Default>
  Repetitions per size:       100000
  Max inflight RPCs:          16

host boro-4.boro.hpdd.intel.com finished self_test duration 18.610635 S.
##################################################
Results for message size (0-EMPTY 0-EMPTY) (max_inflight_rpcs = 16):

Master Endpoint 0:0
-------------------
	RPC Bandwidth (MB/sec): 0.00
	RPC Throughput (RPCs/sec): 5373
	RPC Latencies (us):
		Min    : 1054
		25th  %: 2901
		Median : 2931
		75th  %: 2958
		Max    : 13237
		Average: 2947
		Std Dev: 326.50
	RPC Failures: 0

	Endpoint results (rank:tag - Median Latency (us)):
		0:0 - 2931

Large IO Bulk PUT

Code Block
linenumberstrue
collapsetrue
[sdwillso@boro-4 ~]$ orterun -np 1 -ompi-server file:~/scripts/uri.txt self_test --group-name daos_server --endpoint 0:0 --message-sizes "0 b1048576" --max-inflight-rpcs 16 --repetitions 1000
Adding endpoints:
  ranks: 0 (# ranks = 1)
  tags: 0 (# tags = 1)
Warning: No --master-endpoint specified; using this command line application as the master endpoint
Self Test Parameters:
  Group name to test against: daos_server
  # endpoints:                1
  Message sizes:              [(0-EMPTY 1048576-BULK_PUT)]
  Buffer addresses end with:  <Default>
  Repetitions per size:       1000
  Max inflight RPCs:          16

host boro-4.boro.hpdd.intel.com finished self_test duration 0.337048 S.
##################################################
Results for message size (0-EMPTY 1048576-BULK_PUT) (max_inflight_rpcs = 16):

Master Endpoint 0:0
-------------------
	RPC Bandwidth (MB/sec): 2966.94
	RPC Throughput (RPCs/sec): 2967
	RPC Latencies (us):
		Min    : 2272
		25th  %: 5311
		Median : 5340
		75th  %: 5365
		Max    : 6216
		Average: 5325
		Std Dev: 249.22
	RPC Failures: 0

	Endpoint results (rank:tag - Median Latency (us)):
		0:0 - 5340

Large IO Bulk GET

Code Block
linenumberstrue
collapsetrue
etitions 1000
Adding endpoints:
  ranks: 0 (# ranks = 1)
  tags: 0 (# tags = 1)
Warning: No --master-endpoint specified; using this command line application as the master endpoint
Self Test Parameters:
  Group name to test against: daos_server
  # endpoints:                1
  Message sizes:              [(1048576-BULK_GET 1048576-BULK_PUT)]
  Buffer addresses end with:  <Default>
  Repetitions per size:       1000
  Max inflight RPCs:          16

host boro-4.boro.hpdd.intel.com finished self_test duration 0.585722 S.
##################################################
Results for message size (1048576-BULK_GET 1048576-BULK_PUT) (max_inflight_rpcs = 16):

Master Endpoint 0:0
----------------
Test: kv-akey-fetch
DAOS pool :af6b05aa-ae57-43ee-8552-f89dd7f6f271
DAOS container :c684d271-6870-4f68-8ba5-ae3920a528e6
Value buffer size: 64
Number of processes: 2
Number of keys/process: 100
Number of asynchronous I/O: 32
===============================
kv-akey-fetch
Time: 0.146468 seconds (1365.487761 ops per second)

CaRT Self-Test

Small IO

Large IO Bulk PUT

Large IO Bulk GET

mpich tests

...

------
	RPC Bandwidth (MB/sec): 3414.59
	RPC Throughput (RPCs/sec): 1707
	RPC Latencies (us):
		Min    : 2926
		25th  %: 8731
		Median : 8787
		75th  %: 9713
		Max    : 14597
		Average: 9290
		Std Dev: 1592.07
	RPC Failures: 0

	Endpoint results (rank:tag - Median Latency (us)):
		0:0 - 8787

mpich tests

Results: Hangs at first test, this is known issue, 

Jira Legacy
serverSystem JIRA
columnskey,summary,type,created,updated,due,assignee,reporter,priority,status,resolution
serverIdf325724b-f7c9-34db-bd1c-69d12ec98a69
keyCART-496