Submitter: Stephen Pu from H3C
Reviewer: Johann Lombardi (Deactivated) Liang Zhen and other team members from Daos
Status: Need more specification and under review
Expected result:
Request item scope is defined for Daos and H3C collaboration before Q4.2022.
Scope may divide into 2~3 sub iterations.
1. Request priority defined and aligned.
2. Request specified for Product definition.(NOT design stage)
3. General feasibility and estimation could be given.
NAME | Name | Scenario | Description | H3C Proposal | Daos feedback | Priority | If contributed to community | Feasibility and effort estimation | Owner(i.e. who own design who own dev and testing) | Delivery plan (Q1, Q2, Q3 in 2022) | Risk /Comments |
---|---|---|---|---|---|---|---|---|---|---|---|
volume | create |
| User can create a block storage volume by specific size, name and others attributs |
| |||||||
delete |
| User can delete a block storage volume by specific volume name or uuid |
| ||||||||
expand | online, offline | The volume size can be expanded in the IO operation service of online or offline cases. |
| ||||||||
search |
| The specific volume could be searched by their volume name or uuid |
| ||||||||
thin provisioning vol |
| The volume presented appears to be the full provisioned capacity to the application servers, but nothing has been allocated until write operations occur. |
| ||||||||
thick provisioning vol |
| With thick provisioning, the complete amount of virtual disk storage capacity is pre-allocated on the physical storage when the virtual disk is created. A thick-provisioned virtual disk consumes all the space allocated to it in the datastore |
| ||||||||
modify |
| The volume's attributes can by modified, like: name |
| ||||||||
recycle bin |
| Recycle Bin is a snapshot recovery feature that enables you to restore accidentally deleted |
| ||||||||
snapshoot |
| You can back up the data on your volumes by taking point-in-time snapshots. |
| ||||||||
clone |
| A clone of a Block Storage volume is a copy made of an existing volume at a specific moment in time. |
| ||||||||
QoS |
| You can use quality of service (QoS) to guarantee that performance of critical workloads is not degraded |
| ||||||||
block | NVMe-oF |
| Use NVMe over Fabric protocol to setup a NVMe block storage. |
| |||||||
system | deployment | Automatic cluster deployment |
| YES | |||||||
installation | Automatic cluster installation |
| YES | ||||||||
rollback |
| YES | |||||||||
web potal |
|
| YES | ||||||||
upgrade | online upgrade |
| YES | ||||||||
offline upgrade |
| YES | |||||||||
rollback |
| YES | |||||||||
system | online monitoring |
| YES | ||||||||
offline monitoring |
| YES | |||||||||
alarm |
|
| YES | ||||||||
Log |
|
| YES | ||||||||
web page |
|
| YES | ||||||||
pool | Pool | block storage pool | The user can create a specific block storage pool with size or name. |
| |||||||
Pool expand | online | The specific block storage pool can be expand its size by no interrupt with their operation (online case) |
| ||||||||
cluster | Node server | Add new node (online) | The cluster can add a new node server to existing cluster system without interrupt cluster's operation (online case) |
| |||||||
delete a node (online) | The cluster can delete a new node server to existing cluster system without interrupt cluster's operation (online case) |
| |||||||||
SSD Disk | Add a new SSD (online) | The node server can add a new NVMe SSD without interrupt cluster's operation (Online case) |
| Big Effort | |||||||
Replace a SSD (online) | The node server can replace a new NVMe SSD with original one. (online case) |
| |||||||||
remove exist SSD (online) | The node server can remove a running or failed NVMe SSD in system. (Online case) |
| |||||||||
| PMEM Disk | Add a new PMEM (online?) |
Pmem capacity expanding. i.e. First config is 256GB * 12 config but due to adding new nvme disk or improve performance by caching more data into pmem, then expand to 512GB * 12. |
| |||||||
| Replace a PMEM (online) low priority as PMEM long lifecycel | The node server can replace a newPMEM with original one. (online case) |
| Low priority duet to long-life assurance of pmem | |||||||
|
|
|
| Invalid | |||||||
cluster | Network | Network exception robust | The cluster could keep providing IO operation service in the random network failiure case. |
| |||||||
NIC exception robust | The cluster could keep providing IO operation service in the random node's NIC failiure case. |
| |||||||||
SSD Disk Exception |
| SSD Disk Exception is the test driven case |
| ||||||||
Node Exception |
| Node Exception is the test driven case |
| ||||||||
Disk Usage Optimization | PMEM space optimization | Currently, the PMEM size is divided with the amount of NVMe SSD, it caused a big PMEM space waste. |
| ||||||||
|
|
|
| ||||||||
Data Rebuild | 4TB/H | The data rebuild speed above 4TB/H, in the cluster of 3 nodes each node has 8 NVMe SSD. |
| ||||||||
Node reboot | IO recovery time | When a node rebooted, how many seconds the IO service could be recovery back to the 100% before it's reboot. |
|