Home Blog Page 225

Storage facing trillion-row database apocalypse

HDD storage
HDD storage

A new era of large-scale analytics and storage is opening up ahead of us. Ocient, Imply, VAST Data, and WEKA are four startups positioned to store and access data among hundreds of petabytes or trillions of database rows in seconds. They all employ massively parallel access techniques in one way or another and fundamentally use software rather than hardware to achieve their performance levels.

Update. SingleStore’s database can chew through a trillion rows/second using 448 Xeon Platinum 8180 Skylake cores, 28 per server, as its blog describes. The data was stored on SSDs, though that wasn’t really significant since the queries were run warm-start, so the data was already cached in memory. The network was 10-gigabit Ethernet.

Having said that, it needs pointing out that NVMe SSDs are a fundamental part of the VAST Data and Ocient products, and have played a role in WEKA deployments as well.

The need for such high-speed access to structured and unstructured data is not general yet. It’s concentrated in a few markets – such as financial trading (where VAST and WEKA are doing well), online media advertising display technology (an Ocient focus), high-performance computing (WEKA again), and AI/ML model training. 

The AI/ML driver

VAST co-founder Jeff Denworth thinks the use of AI/ML technology is going to spread into the general business market. Most businesses will have a need to trawl through their internal production logs and external customer interaction data to find patterns, analyze causes, and make decisions to optimize internal and external operations. This may save or make pennies per operation but cause significant amounts of cash to be earned or saved at a larger scale.

ML models are being used to aid health device scan diagnoses, investment trading decisions, factory production operations, logistics delivery paths, product recommendations, process improvements and staff efficiency. The complexity of ML models is roughly doubling year-on-year, according to Denworth. The general rule is that the larger the model, the better the training and subsequent inferences.

Pure is moving into the larger dataset market, and high-end array supplier Infinidat might say it is already there.

All these companies are intent on reacting to this move from petabytes to exabytes. They see it affecting on-premises environments as well as public cloud ones. VAST is an on-premises company but will be cloud-connected – if not cloud-present – in some way in the future. Ocient is both on-premises and in the cloud, as is WEKA. Imply is pure software so can run in the cloud, whereas Infinidat is an on-premises business.

Their acceptance and embrace of exabyte-level scale sets them apart from mainstream storage providers who will, Denworth says, have to overcome significant architectural disadvantages if they want to compete. 

Ocient Hyperscale Data Warehouse

Ocient has just launched its Hyperscale Data Warehouse product. It is a v19.0 product – earlier versions having been used, successfully it claims, for hyperscale deployments over the past year with a select group of enterprise customers. It says the product is engineered to deliver unmatched price-performance for rapid complex and continuous analysis of massive structured and semi-structured datasets. Customers can execute previously infeasible workloads in interactive time, returning results in seconds or minutes versus hours or days.

The software has a Compute Adjacent Storage Architecture (CASA), Ocient says, which places storage adjacent to compute on industry-standard NVMe solid-state drives. This delivers hundreds of millions of random read IOPS and enables massively parallelized processing across simultaneous loading, transformation, storage, and data analysis of complex data types. The whole data path has been optimized for such performance. 

For example, it has a high-throughput custom interface to NVMe SSDs with highly parallel reads with high queue depths to saturate drive hardware. There is a lock-free, massively parallel SQL cost optimizer that ensures each query plan is executed to the best of its ability within its service class and without impacting performance of other workloads or users.  

The Ocient Hyperscale Data Warehouse is generally available as a fully managed service hosted in OcientCloud, on-premises in the customer’s datacenter, and is in the Google Cloud Marketplace.

Incumbent state

VAST Data has a significant software launch coming up. Denworth says that what VAST did for its hardware array, stateless controllers, and single-tier QLC flash storage, it will now do for software. 

Incumbents will need to respond to match what the newcomers have. Going all-flash and single-tier is not enough – they have to change their software. This could mean software technology which will take years to develop from the ground up. We might see incumbents buying this technology rather than developing it. We might see processor chip developers, like Nvidia, buying their way in to keep their GPUs fed with the data they need to crunch AI/ML training models.

Unless Dell EMC, IBM, HPE, NetApp, Qumulo, and the object storage suppliers can demonstrate that they can operate at the same scale, performance, resilience, and cost as these up-and-comers, they may have to fight harder for the multi-hundred petabyte-level, trillion-row structured/unstructured dataset area – at least if what Imply, Ocient, VAST and WEKA see coming is correct. 

China lockdowns hit Seagate revenue growth

Seagate HAMR technology
Seagate HAMR technology

Seagate reported slower sequential sales growth for the third quarter of its fiscal 2022 ended 1 April amid component shortages and with government imposed lockdowns in China blamed for disrupting production.

Revenues came in at $2.8 billion, up 2.6 percent year-on-year but down on the $3.2bn booked in the prior quarter. The drive maker generated a $346 million net profit, up 5.2 percent annually. Seagate originally estimated better Q3 income growth, forecasting 6.2 percent, although it revised that guidance downwards as the quarter progressed.

CEO Dave Mosley cited outside pressures: “Seagate’s March quarter financial results were consistent with our revised outlook, with record nearline product revenue driven by cloud customers partially offsetting multiple macro-related headwinds that impacted other end markets, particularly video and image applications, and pressured profitability.”

He added: “Our focus is on mitigating these external challenges through ongoing expense discipline, new pricing strategies and operational efficiencies.” He talked of taking aggressive actions in these areas.

The company said video and image apps (VIA) demand was affected by non-HDD component shortages and COVID-19 lockdown measures, particularly in China. PC-related demand sank as OEMs bought fewer drives. The bright spot was mass-capacity nearline drive revenues, up 24 percent annually, with capacity shipped up 23 percent year-on-year, thanks to cloud customers bulking up their disk drive estates.

Financial summary

  • Gross margin – 28.8 percent, up 3 percent annually, down 10 percent quarter-on-quarter
  • Free cash flow – $363 million, up 32 percent year-over-year
  • Diluted EPS – $1.56, up 12 percent year-on-year, down 30 percent quarter-on-quarter
  • Operating expenses – $345 million, 5 percent higher year-on-year

Overall disk drive exabytes shipped were lower than the previous two quarters, as the chart shows:

Seagate capacity shipped

This was unexpected as capacity shipped was on a seemingly inexorable rise. The nearline increase did not compensate for the lower VIA and legacy drive revenues.

In the earnings call, Mosley revealed that Seagate is “shipping 20-plus terabyte drives in high volume” and expects unit shipments “to more than triple quarter-over-quarter in Q4 to well over 1 million units. This puts us on a pace for the company to achieve crossover with 18-terabyte drives early in the new fiscal year.”

Mosley repeated the point, saying: “The 20-plus terabyte products… represents the highest capacity drives commercially available today.” But Seagate has not announced any 20-plus TB drives, only reaching 20TB with the Exos X20. We expect an announcement soon. He also said Seagate had “30-plus terabyte HAMR products under development” and almost ready.

The VIA market, affected by delays to new security surveillance and smart city infrastructure projects due in large part to physical installations being hindered, should pick up in the second half of the year, Mosley hopes.

Seagate’s outlook for the next quarter was affected by industry-wide supply challenges from the ongoing conflict in Ukraine and COVID restrictions, which are constraining growth over the near term. It is expecting revenues of $2.8 billion plus or minus $150 million, which would represent a 6.7 percent decline year-on-year at the mid-point. 

Mosley did say Seagate was convinced that in general there would be a very strong second half of the calendar year. As for legacy drives, he said: “the legacy market is not going down very much anymore. So flat is probably the way I’d call it for a couple of quarters.” On that basis we should see a return to exabytes shipped growth.

Disk industry analysts will now be looking at Western Digital to see if its forthcoming results are similarly affected. They are scheduled to come out later today.

Storage news ticker – April 27

Storage news
Storage news

Arcion has announced a partnership to bring its cloud-native, CDC-based data replication platform to Databricks. Arcion’s product can integrate transactional systems with the Databricks Lakehouse in real time, at scale, and with guaranteed transactional integrity. It says it is the only fully managed, distributed data replication as a service on the market today, offering zero-code, zero-maintenance change data capture (CDC) pipelines that can be deployed in minutes. Data teams can move high-volume data from transactional databases like Oracle and MySQL, without a single line of code. Arcion’s Partner Connect automatically configures the resources necessary to begin using streaming data pipelines and enable real-time data ingestion with pipelines between Oracle, MySQL, and Snowflake (additional sources coming soon) to the Databricks Lakehouse. 

Canonical’s Ubuntu 22.04 LTS is now generally available, featuring significant leaps forward in cloud confidential computing, real-time kernel for industrial applications, and enterprise Active Directory, PCI-DSS, HIPAA, FIPS, and FedRAMP compliance. Ubuntu is the only Linux distribution supporting Azure Confidential VMs, which deliver confidentiality not only between different cloud customers but also between customers and the cloud itself. It features hardware-level encrypted guest isolation, combined with measured boot and TPM-backed full-disk encryption implemented in Ubuntu and Azure Managed HSM. Customer code and data are encrypted in use, in transit, and at rest using encryption keys that are protected and can be controlled by the customer.

GRAID Technology announced its SupremeRAID SR-1010 – the world’s fastest NVMe/NVMeoF RAID card. It supports PCIe 4 and delivers 19.4 million random read IOPS, 1.5 million random write IOPS. The sequential read bandwidth is 110GB/sec and it provides 22GB/sec when sequentially writing. The card will be generally available on May 1, 2022, for immediate shipment through GRAID’s global authorized reseller network and through GRAID’s OEM partners. 

Graid storage slide

Index Engines announced an improved dashboard for its CyberSense security analytics product, which provides post-attack forensic reports. CyberSense detects signs of attack vectors by scanning backup and snapshot data utilizing over 200 content-based analytics and machine learning to identify corruption and the last good version of files and databases. It has hundreds of users worldwide and detects signs of ransomware with 99.5 per cent accuracy, based on testing of over 20 million clean and infected backup sets. While the accuracy was unparalleled, the previous interface could be too complex for users in crisis mode. The new CyberSense interface simplifies the user experience, providing detailed insight into the who, what, where, and when of an attack.

iXsystems announced its TruePartner channel program and says its several hundred global partners generated a 152 per cent increase in year-over-year sales in 2021. It didn’t provide the actual sales figure. This was driven by the combination of business-optimized Open Storage, TrueNAS Community-orchestrated deployment opportunities, and the RevMatch reciprocal revenue multiplier which allows partners to achieve the highest revenue gains possible with up to a 3x revenue match by iXsystems.

Memory Guy Jim Handy writes: “There’s something really odd about Nimbus Data’s colossal 100 terabyte ExaDrive DC SSDs, and it’s not their sheer capacity (although that’s pretty remarkable by itself!). The strange thing is that they can’t be worn out. It’s physically impossible.” Why is that? His blog “An SSD You Can’t Wear Out” explains: “When such a large capacity is accessed through an I/O channel with a more modest bandwidth, it’s impossible to overwrite the flash enough times to wear it out.”

Platform9, which provides an open distributed cloud service, announced that GigaOm named it a Leader and Outperformer in its 2022 Radar for Evaluating Managed Kubernetes Solutions report. GigaOm’s Radar reports evaluate vendors on innovation, maturity, vision, the capacity of execution, and other criteria that impact overall IT strategy.

Scale-out filesystem supplier Qumulo has announced a Cloud Now program with a free petabyte of storage. It aims to provide a no-cost, low risk, and rapid solution for customers looking to move their workloads to the cloud and avoid supply chain constraints. The program has free cloud software and underlying storage, and a white glove onboarding experience. This offer is available for Qumulo Cloud Q across the three major supported public clouds for customers to build proofs of concept at scales up to one petabyte. Cloud Now allows customers to find the right workloads for the cloud and test their workloads faster without paying for software licensing and cloud infrastructure costs on AWS, Azure and the Google Cloud Platform.

Object storage software supplier Scality announced the company’s revenue increased by 50 per cent year-on-year in the first quarter of 2022. ARTESCA – Scality’s lightweight, cloud-native object storage offering – now has customers in seven countries, and scored its first multimillion-dollar customer. It will serve as the cloud-native content management solution for a leading enterprise information archive provider in North America. ARTESCA is supported on Dell 740 XD2, HPE Apollo and Proliant, Lenovo SR650 and Supermicro A+ storage server platforms.

Seagate is sponsoring the Alfa Romeo F1 Team ORLEN, which is using Seagate Lyve Cloud to store, access, and move its unstructured data at scale. Seagate has become the team’s official Object Storage Service Partner. Its Seagate Lyve Cloud branding is visible on the team’s C42 cars’ headrests as the team’s drivers – Valtteri Bottas and Zhou Guanyu – race in the 2022 Formula One championship.

DRAM and NAND fabber SK hynix has launched a research website to share its recent semiconductor research to academia and industry. The Revolutionary Technology Center (RTC) at SK hynix was established in 2021 to prepare for the future of the company’s semiconductor technologies. The mission of RTC is to propose mid- to long-term semiconductor research solutions in response to the challenges which current computing has faced in recent years. RTC started with three major research directions in the category of “Research Area”: Revolutionary Memory, Beyond Memory, and Next Generation Computing. Check it out – there’s interesting stuff there.

Researchers of the Max Planck Institute for Chemical Physics of Solids and the Fraunhofer Institute for Photonic Microsystems IPMS have launched a joint project to investigate novel materials for spintronics. The project is being funded by Sächsische Aufbaubank. Chiral crystals are a promising class of materials that have remained largely unexplored for their potential use in spin-based electronics. In chiral materials, the atoms, of which the crystal is composed, can exist in two inequivalent arrangements, which look like the mirrored image of each other. The project aims to bridge the gap between understanding how chirality and spin currents are related and assessing the potential of chiral materials in electronic applications.

StreamNative, a cloud-native messaging and streaming platform founded by the original developers of Apache Pulsar, announced StreamNative Cloud for Kafka – a managed solution that supports Kafka protocol but operates Apache Pulsar under the hood. It says for use cases where data needs to be segmented into tens of thousands of granular topics, Kafka often struggles with poor performance. Or, for teams with a large number of microservices, the lack of native multi-tenancy often leads to a large number of Kafka clusters, typically with low utilization. StreamNative Cloud for Kafka helps to remove current Kafka limitations without having to change your application code. StreamNative Cloud for Kafka is currently in private beta.

Swissbit is expanding its range of industrial grade SD and microSD memory cards. The S-55 and S-58 series combine industrial grade 3D TLC NAND technology with a controller and firmware optimized for reliability. The SD and microSD cards are available in capacities ranging from 16 to 512GB and support an extended temperature range of -40 to +85°C. The new S-55 and S-58 series are based on Micron flash chips and complement Swissbit’s existing S-50 and S-56 series, which use Kioxia 3D NAND.

ThinkParQ announced the release of BeeGFS v7.3.0. It includes support for Nvidia’s Magnum IO GPUDirect Storage (GDS) and for client-side multirail RDMA networking, Arm architectures, along with Linux 5.10. Multirail network configurations will allow the BeeGFS client to be configured to use multiple RDMA capable network interfaces on the same network, while automatically balancing the traffic across all those interfaces.

SK hynix celebrates its best first quarter

SK Hynix Icheon campus

Korean DRAM and NAND maker SK hynix had a record first 2022 quarter with income boosted by Solidigm SDD earnings and a slower decline in DRAM prices than it expected.

The company reported ₩12.16 trillion ($9.6 billion) in revenues for 2022’s Q1, a seasonally low quarter, up 43.2 per cent on the year, with a profit of ₩1.98 trillion ($1.56 billion), a rise of 99.4 per cent. Solidigm, the acquired Intel SSD business, was incorporated as SK hynix’s subsidiary at the end of 2021.

Kevin (Jongwon) Noh, SK hynix CMO, said: “I think we’ve accomplished meaningful performance despite the seasonality issue. As demand for server chips is on the rise, the memory business will improve into the second half.”

DRAM demand should rise in the second half of the year as new server CPUs, such as Intel’s Sapphire Rapids and AMD’s Genoa EPYC, trigger buying activity.

The stellar results were still affected by customer supply chain issues and a company statement said: “SK hynix responded to customer demand in a flexible way, while focusing on profitability management, resulting in handsome earnings.” There was also a ₩380 billion ($3 million) provision for compensation, including product exchange, to a customer after it found a performance weakness in some DRAM products supplied by SK hynix.

SK Hynix quarterly revenues by fiscal year
SK hynix quarterly revenues by fiscal year. The Q1 2022 jumps evident

SK hynix said it had improved the yield rate of its advanced 1anm DRAM products (the fourth generation of the 10nm node) and 176-layer 4D NAND. The company said it is on track to develop the next-generation products, which include 238-layer 3D NAND.

This is the highest number of layers yet revealed by a 3D NAND manufacturer. We understand Solidigm is developing 196 layers and Samsung 200. Micron is producing a 176-layer product and hasn’t said what’s next, and YMTC is at the 128-layer level, some way behind.

SK hynix’s NAND revenue was helped by Solidigm’s contribution. There was weak demand in China for mobile device NAND. NAND shipments in the second quarter should rise due to enterprise SSD demand. Coincidentally, Solidigm recently announced two new enterprise SSDs.

Equipment shipment delays may well affect SK hynix’s mass-production timeline, according to Aaron Rakers, managing director and technology analyst at Wells Fargo Se.

Infinidat accelerates high-end array

High-end enterprise array supplier Infinidat has added a faster top-end system with boosted hardware and software, increased cyber-resilience, and enhanced AIOps and DevOps capabilities. 

Infinidat supplies InfiniBox arrays with DRAM caching delivering 90 percent or more of read requests from memory, with disk-based and all-flash SSA systems both faster than typical all-flash arrays thanks to its “Neural Cache.” The company provides 100 percent data availability and claims a lower TCO than comparable systems. The SSA systems contributed roughly 15 percent of its overall revenue in the first 2022 quarter. 

Phil Bullinger, Infinidat
Phil Bullinger

Infindat CEO Phil Bullinger said in a statement: “Infinidat is accelerating its business with the InfiniBox solid-state array platform, expanding our opportunities and capturing all-flash wins in the enterprise storage market where the highest levels of performance and availability, extensive cyber-resilience, and cost-effectiveness are required for the most demanding applications and workloads.”

There are multiple parts to Infindat’s news; first is the updated InfiniBox SSA II array, adding cyber-resilience to Infinidat all-flash systems, enhancing InfiniBox’s OS for increased parallelism, and adding AIOps to Infinidat’s arrays.

The SSA II is an enhanced SSA I with more capacity, faster controller CPUs delivering more IOPS and throughput, increased InfiniBand paths, and file access protocols. A table comparing the two existing SSA “S” models with the two new “T” models is below:

Infinidat comparisons

Delivered IOPS have increased by 53.3 percent and throughputs have risen by half as well. Infinidat says its InfiniBox SSA II has 35 microseconds of latency, lower than any other comparable enterprise storage platform in the industry.

The InfiniBox OS was optimized for additional interconnects (InfiniBand), gained more parallelism for additional cores, and has new algorithms for workload optimization. NFS and SMB file access were also added. The OS automatically adapts to changes, such as adding new servers and applications, without human intervention.

Cyber-resilience is an important aspect of Infinidat’s announcement and its InfiniGuard secondary storage system is being, in effect, extended to cover its entire InfiniBox array portfolio, including the SSA II, with an InfiniSafe Reference Architecture. This provides immutable data snapshots, logical air-gapping, a fenced forensic environment, and virtually instantaneous data recovery from the snapshots.

InfiniOps is Infinidat’s automated operations brand. In July, Infinidat unveiled its AIOps offering – Neural Cache – InfiniVerse cloud-based, AI-driven monitoring, predictive analytics and support spanning Infinidat’s systems and ecosystem of AI technology partnerships. This has developed into InfiniOps, which has tighter integration with the AIOps datacenter ecosystem and also the DevOps world.

Infinidat now supports CSI 2.1.0 for Kubernetes, VMware Tanzu, and RedHat’s OpenShift environment. It also has Ansible integration to simplify storage system management and integration.

The InfiniBox SSA II is available for purchase or with Infinidat’s flexible consumption options, like all Infinidat products, including Storage-as-a-Service with Infinidat’s FLX program and Capacity-on-Demand with Infinidat’s Elastic Pricing model.

Solidigm upgrades datacenter SSDs for speed

Solidigm has upgraded its PCIe 4 datacenter SSDs to faster models using denser 3D NAND chips.

Update: Solidigm performance numbers updated. 29 April 2022.

Intel’s storage unit first produced D7-P5500 (1 drive write/day) and D7-P5600 (3 drive writes/day) SSDs using 96-layer 3D NAND organised into TLC (3bits/cell) format, with a PCIe 4 x 4 lane interface. When SK hynix bought the unit, it was renamed Solidigm and continued with its Intel-era product branding.

The two new SSDs in the D7 datacenter line are the P5520 and the P5620. Both use 144-layer TLC NAND.

Solidigm’s GM for datacenter products, David Dixon, said in the announcement: “The D7-P5520 and the D7-P5620 are the culmination of these insights and of multiple 3D NAND and PCIe 4.0 generations, which enable us to deliver a best-in-class offering and a new paradigm in solid-state storage.” 

The existing PCIe 4-based P5510 is a read-intensive drive supporting up to 1 DWPD and the mixed-use P5610 also up to 1DWPD. The new drives are different with the P5520 being a 1 DWPD product, designed for read-intensive and light mixed workloads, and the P5620, intended for mixed workloads, supporting up to 3 DWPD.

The P5520 capacities and formats are:

  • U.2 – 1.92TB, 3.84TB, 7.68TB, 15.36TB
  • E1.S 9.5mm, 15mm – 1.92TB, 3.84TB, 7.68TB
  • E1.L  9mm – 15.36TB

The P5620 comes in only the U.2 format and has a 1.6TB, 3.2TB, 6.4TB, and 12.8TB capacity range, reflecting its greater over-provisioning need with a mixed-use workload.

Performance picture

A new paradigm? Let’s see. Here are Solidigm-supplied performance charts, with the P5520 compared to the older PCIe 3-based P4510:

Solidigm comparison

The P5620 is compared to the P4610, again PCIe 3-based:

Solidigm comparison

The size of some of these improvements is not surprising as we are comparing P4510 PCIe 3 interface SSDs with PCIe 4, which is twice as fast. How do the new Solidigm SSDs compare with the prior P5500 and P5600, which also use the PCIe 4 bus and were built with 96-layer 3D NAND? 

Blocks & Files table.

The new products’ maximum read IOPS is ten per cent higher, and the P5520 outputs 70 per cent more random write IOPS than the older P5500.

The sequential bandwidth picture is mixed as well. Sequential reads are a tad faster with the new products: 7.1GB/sec vs 7GB/sec, but sequential writes are a tad slower at 4.2GB/sec vs 4.3GB/sec. 

The new products’ latency is worse. The P5500/P5600 delivers latency as low as 9µs read and 12µs write but the newer P5520/P5620 only achieves 75µs read and 15µs write – a substantially slower read latency and slightly worse write latency.

We know that a new Inspur PCIe 4.0 NVMe SSD has 1.5 million random read IOPS and 400K random write IOPS, along with 7GB/sec sequential reads and 5GB/sec sequential writes. Solidigm’s new kit doesn’t really stack up on this basis when compared to Inspur’s product.

Consistency

Solidigm declares that its new drives are designed and tested with zero tolerance for data errors, consistently durable IOPS and QoS, and with near-zero lifetime performance degradation. It wants us to know that the P5520 delivers up to 40 per cent better read responsiveness in the presence of write pressure as compared to the competition – a key consideration, it says, for real-world workloads. It also provides up to 90 per cent IOPS consistency and <0.3 per cent variability over the drive’s life.

One other point: Solidigm says that OEMs designing for a 10 million IOPS 2U server can, by using these drives, reduce their storage footprint up to 27 per cent while lowering power consumption up to 28 per cent compared to the P5510 product.

Solidigm says it already has a customer for the drives: China’s ByteDance, which owns TikTok. It will use the D7-P5520 in its Volcano Engine business public cloud service. 

Trench 3D NAND: The solid-state future?

Like any solid-state technology, 3D NAND will reach a point when it can’t meet demand for increased density, and replacement technology will come along. Semiconductor and nanotechnology research powerhouse IMEC reckons the answer will lie with Trench 3D NAND.

IMEC, the Interuniversity Microelectronics Centre, is a non-profit R&D institute with 4,000 research staff headquartered in Leuven, Belgium, and a track record for shrinking electronics to increase compute and memory performance.

An article from the institute, titled “The role of 3D-NAND-Flash and FeFET in the data storage roadmap” discusses how semiconductor elements like NAND cells are shrunk in the x, y, and z (length, width, height) dimensions to make the cells smaller so more can be built on a wafer to make denser chips and thus lower-cost or higher-capacity devices.

3D NAND

3D NAND was a response to scaling down limitations of planar NAND technology, which was based on floating gate transistors – floating, because its threshold voltage changes as the cell is written with a binary value and electrons flow across it. Planar, or 2D NAND, cell dimensions were progressively shrunk from 120nm, but from the 20–15nm level access speed decreases, the read error rate goes up, and program/erase (write) cycle number goes down – all because there basically aren’t enough electrons available to keep the cell’s state stable.

With 3D NAND, the planar cells were flipped from a horizontal to vertical orientation and strings of them built to form 3D layers. At the same time the cell dimensions were increased to to 30–50nm, with a 140nm x–y pitch, thus increasing the number of electrons per bit and making cells easier and faster to read and have a longer life.

IMEC 3D NAND structure. BL = bitline. WP = word plate. BSP = bottom select plate. SP = source plate. TSL = top select line

NAND chip density (capacity) can be increased by adding more layers, and the layer count has gone up from 12 through 24, 32, 468, 64, 96, 112, and is now at the 144 and 176-layer levels – depending on the manufacturer – with 200+, 500+, and even 1,000 layers in prospect.

3D NAND also increased density by adding bits to a cell, starting from single-layer cells (SLC) to 2-bit multi-layer cells (MLC), 3-bit triple layer cells (TLC), 4-bit quad-layer cells (QLC), and 5-bit penta-level cell (PLC) technology in development. 

The counter to development in this fashion is that each step upwards in layer count prolongs production of the wafer. A hole etched through 64 layers is easier to do than 112 and 144 layers – the deeper the hole, the harder it is to etch consistently.

The foundry tools needed to deposit the layers of chemicals and etch out holes have to become more capable and complex as layer counts increase, adding to cost. There can be stress-induced errors in cells as layer counts increase as well, so finer, more precise production control is needed.

The difficulty at 200+ layers will be commensurately greater and larger again at 500 or more layers. String-stacking – placing two blocks of 3D NAND chips one above the other to bypass this problem – has already been done by some manufacturers, with, for example, a 128-layer device built from two 64-layer components. 

But string-stacking delays the inevitable. Some time in the future the extra production difficulty will make it cost-prohibitive to go to the next layer count. At that point, the IMEC authors suggest, the NAND cells will have to get smaller again to enable denser chips to be built and so increase drive capacity and/or lower electricity consumption and ease cooling. 

The authors suggest we could see “stacks of 1,000 layers being unavailable until 2030” unless new technology is explored.

Trenching

Generally speaking, 3D NAND has a cylindrical gate-all-around (GAA) design, and shrinking the cell size would mean component layer thicknesses and height would be reduced, threatening the ability of the cell to meet performance, stability, and longevity requirements.

The IMEC authors suggest a trench-like architecture in which “the memory cells are no longer circular. They are implemented at the sidewall of a trench, with two transistors at opposite ends of the trench, which significantly increases bit density.”

They write: “From an operation point of view, this trench cell resembles a planar unit cell (being put upright) compared to the circular GAA NAND-Flash cell.”  

In their view, “Although it comes with a slight penalty in electrical characteristics (such as the program/erase window), the unit cell area in a trench-like configuration can be reduced in the x–y direction compared to a ‘GAA’ cell. As such, the trench cell is put forward as a next generation NAND-Flash cell architecture – expected to reduce the x–y pitch from today’s 140nm (effective) to about 30nm.”

Ze height is ze problem

Reduction in the z-dimension (height) may need material changes. “A z-shrink of the NAND-Flash layer stack involves squeezing the materials that are used for creating the word-line layers, including the word-line metal.”

They write: “Reducing the word-line metal thickness comes along with an unwanted resistivity increase, which enhances resistance-capacitance (RC) delay and slows down access times. IMEC, therefore, is looking at alternative metals such as Ru (Ruthenium) and (barrierless) Mo (Molybdenum) with potentially lower resistivity at small dimensions.”

IMEC researchers are “exploring alternative materials for the charge trap layers, the tunnel dielectrics and the metal gate stack, and … investigating their impact on memory performance.”

It’s a whole lot of work-in-progress and the NAND foundry folks and their in-house research engineers have to be involved as well –; they get to make the NAND wafers, after all. 

If you are interested further then read the paper – there’s lots more in it. 

Storage news ticker – April 26

Kioxia XG8 storage
Kioxia XG8 storage

Ethernet adapter supplier Chelsio Communications has introduced its seventh-generation T7 Unified Wire DPU (SmartNIC). It is built with up to eight Arm Cortex A72 cores for embedded applications’ control path processing, gen 7 Very Large Instruction Word (VLIW) processor for storage, networking, and security protocol processing at up to 400Gbit/s, using 56Gbit PAM4 SerDes, and PCIe 5.0 x16.

The T7 can be a Root Complex and/or End Point, a PCIe switch, and an Ethernet switch. It supports prior T5 and T6 functionality plus hardware-based acceleration for RoCEv2, compression, dedupe, erasure coding, root of trust, and other features.

Chelsio T7 storage infrastructure
Chelsio T7 graphic

T7 is expected to provide a solid fabric replacement for InfiniBand for GPU clusters or HPC installations. Chelsio said the T7 has several confirmed OEM wins already in the cloud, storage, and server space. T7 DPU ASICs will be sampling in 3Q22 and is expected to be in production on first silicon within one quarter.   

Hitachi Vantara, which has had a 10-year relationship with customer BMW Group, has signed a further six-year strategic partnership agreement with the car maker to sell its high-end all-NVMe flash VSP 5600 storage array. This will be provided via the consumption-based EverFlex as-a-Service offering. It says this hybrid cloud deployment will take up less datacenter floorspace and need less electrical power than the systems it replaces. Hitachi Vantara says it will support the BMW Group in 10 countries, including Germany, the USA, Japan, China, and the UK.

China’s Inspur has a new Enterprise NVMe SSD which it says is among the fastest PCIe 4 SSDs available. It comes in the U.2 (2.5-inch) format and is built from an unnamed supplier’s 3D NAND in  0.96, 1.6, 1.92, 3.2, 3.84, 6.4, and 7.68TB capacities. The maximum random read/write IOPS numbers are 1.5 million/400,000, the sequential read bandwidth is 7GB/sec, and sequential write is 5GB/sec. It has a four-lane PCIe 4 interface with Inspur’s AIPR (Asynchronous Independent Plane Read) technology to widen the data access channel between the main controller and NAND, which it says greatly improves the back-end performance. The drive has Zoned Namespace (ZNS) support but customers will need to independently develop the upper-layer software for this.

The drive is rated at 2.6 million hours MTBF, and supports up to 10,000 Phase/Erase cycles. A dynamic eight-level power consumption adjustment mechanism can improve energy efficiency by over 70 percent compared to previous generations. A key-value format will be supported in the future along with EDFSFF and PCIe 5.

Kioxia is sampling a new XG8 PCIe 4.0 gumstick SSD in single-sided M./2 (2280) format. It is a successor to the XG7 and has the same 512GB, 1TB, 2TB and 4TB capacity range. Both are high-end client drives for use by OEMs in notebooks, gaming systems, and desktops. Like the XG7, the XG8 has optional TCG Pyrite 2.01 and TCG Opal SSC 2.01 support. It is built from BiCS gen 5 flash with 112 layers using TLC format.

The XG7 used BiCS gen 4 at 96 layers in TLC. Kioxia told us the XG8 has 2x the performance of the (PCIe 3) XG6, which would mean, taken literally, around 710,000/730,000 random read/write IOPS, 6GB/sec sequential reads, and 4.4GB/sec sequential writes.

Media and video storage workflow systems supplier Quantum has a new H4000 Essential system which integrates  its CatDV media asset management and  StorNext 7 shared storage software on the H4000 hybrid flash/disk storage hardware platform. This H4000 Essential bundle provides highly available shared storage and automatic content indexing, discovery, and workflow collaboration for small creative teams. It doesn’t need specialised IT admin staff or skills, can be up and running in minutes, and integrates media management and storage so creative teams can spend less time searching for content.

Quantum CatDV is  pre-installed and pre-configured for use by up to 10 named users, including use of Adobe Premiere Pro CatDV Panel.The system has integration with Quantum Cloud-Based Analytics (CBA) software for remote, proactive system monitoring. There are available archiving plugins for Backblaze B2 Cloud Storage, or choice of S3-compatible storage.

TeamGroup has announced the upgrade of its ELITE U-DIMM DDR5 Standard Memory with an increased frequency from 4,800MHz to 5,600MHz with the same 1.1V voltage. It’s available in different capacities, including 2X8GB, 2X16GB, and 2X32GB, and has been sent to major motherboard manufacturers for verification.

Exbibyte

Exbibyte – 1,000 pebibytes. See Decimal and Binary Prefix entry.

Exabyte

Exabyte – 1,000 petabytes. See Decimal and Binary Prefix entry.

ETL

ETL – Extract, Transform and Load – the processing steps needed to extract data from some repository, change its characteristics and format, and then load it into a data warehouse for analysis.

Ethernet

Ethernet – a network scheme for connecting IT devices, such as server computer, network and storage systems, inside a datacenter. Such a network is called a local area network (LAN) in contrast to a longer distance wide-area network (WAN).