Category:

Storage

HPE Discover 2018 – Nimble storage finally gets sync replication, more

by Philip Sellers November 27, 2018

written by Philip Sellers

After months of teasing that synchronous replication was coming, Nimble has finally delivered the capabilities to customers. HPE announced the new capability this week during HPE Discover in Madrid, but it went beyond simple replication in the same announcement. HPE also enabled the ability to do metro clustering between Nimble storage arrays in the same release.

On launch, Peer Persistence on Nimble will allow for Microsoft SQL workloads on Windows server and VMware vSphere workloads to do seamless failover between two arrays in synchronous distances. Future releases will enhance support and an Oracle certification expected soon after release.

Peer Persistence is useful for critical workloads where customers want to eliminate the fault domain of a single storage array. This type of single point of failure is sometimes viewed as unavoidable and so arrays are architected to have lots of resiliency to mitigate the risk. Environmental issues (power, cooling, etc.) and even a bad code release (yes, they do happen) could put a critical system at risk for downtime and a costly outage, however metro storage clusters allow a pair of arrays to work together to protect workloads.

What is Peer Persistence?

Peer Persistence is a branding for metro-storage clustering technology within the HPE line of storage. First delivered on 3PAR in 2013, Peer Persistence allows synchronous replication to seamlessly switch between two arrays – enabling failover between arrays for site protection or for fault domain protection within a datacenter.

Within the SCSI stack, ALUA signals active and standby paths to the compute nodes to allow it to know which are the preferred paths and which are unavailable. In a 3PAR configuration, the standby paths represent the paths to the secondary storage array that is in read-only mode during synchronous replication. When a failure occurs on a primary array, an arbitration/quorum node allows the secondary node to come online as the primary and continue to accept IO without interruption. A SCSI signal is sent over the bus to the compute node and lets it know that the active paths have now moved to standby and the standby paths are now active. Internal OS queues hold any IO in transit at the time of the failover.

After a failure occurs, the quorum operation will dictate which array is in control. If the failure is repaired, the array comes back into the replication relationship and the data is synchronized between the arrays. The repaired array becomes the secondary and its paths are re-added to the host through a SCSI rescan.

See more?

If you want to see more about Peer Persistence on HPE Nimble storage, take a look at Calvin Zito’s ChalkTalk about the topic on YouTube.

November 27, 2018 1 comment

HPE introduces Optane in 3PAR and Nimble, skips over NVMe drives

by Philip Sellers November 27, 2018

written by Philip Sellers

This week I’m onsite at HPE Discover and I have had some interesting discussions with folks from HPE about NVMe and their choices around array architecture in 3PAR and Nimble. HPE announced Memory-Driven Flash that is available for 3PAR arrays now and coming to Nimble arrays in 2019. Memory-Driven Flash is storage class memory as a caching layer in its existing array architectures and is based on the Intel Optane storage class memory in an add-in card (AIC). 3PAR and Nimble are not utilizing NVDIMM forms of Intel Optane in the release, though that difference should not be a detriment in the current designs.

For the management at HPE, their talking points are around latency analysis and pinpointing the bottlenecks within the IO pipeline. Both product management teams (3PAR and Nimble) insist that their current bottlenecks are not around the backend storage, but rather the front-end controllers in today’s arrays. So simply enabling NVMe based drives in the array does little to address customers real concern, which is lower latency storage IO. With controller bottlenecks in play, a lot of this ceiling is created because of the extra processing of data services on the controllers.

In testing, the teams discovered that using storage class memory as a caching layer – which the company showed as a technical preview in 3PAR last year at Discover – greatly accelerated workloads and reduced latency. The released products are based around the same design as the technical preview.

So why not also enable NVMe drives at this point? HPE says it is waiting out the emergence of standards and protocol maturity around NMVe before moving to NVMe drives – and because the backend is not where they see the latency introduced in the storage stack. They reiterate the point that the backend drives are NOT where arrays are constrained – and that is an industry-wide assertion, as I understood it – but rather at the controllers.

NVMe as a protocol lacks support for multipathing and failover and other essential resiliency features taken for granted in the SCSI protocol. The short answer is simply that SCSI is robust and NVMe is new with lots of growth. From personal testing of an NVMe over Fabric array, driver support is spotty at best with just a few Linux distributions offering support for NVMe over Fabric. Microsoft has taken the stance that it is not even offering driver support at this time for NVMe over Fabric. The lack of multipathing means that NVMe over Fabric LUNs cannot be used for things like VMware VMFS datastores or clustering use cases – many of the high-profile, high-importance use cases typically associated with super-fast, low latency storage arrays.

That led me to ask them about their data services adding to latency that might otherwise be avoided in future iterations of their arrays. Product managers agreed and said that was the thinking behind the introduction of adaptive data services in the Nimble platform. By making the data services selective, you can achieve the full potential of the hardware, but you also have the choice to enable those data services where desired to improve the cost-effectiveness of the array.

November 27, 2018 1 comment

Datacenter News Storage

Repost: E8 Storage and Mellanox BlueField SmartNIC deliver high performance with hardware acceleration

by Phil Marsden November 13, 2018

written by Phil Marsden

E8 Storage and Mellanox BlueField SmartNIC deliver high performance with hardware acceleration

Showcasing a live demo at SuperComputing 2018, the integration optimizes computing resources while accelerating data intensive workloads

SANTA CLARA, Calif., November 13, 2018 – E8 Storage today announced it is expanding its technology partnership with Mellanox with an integrated solution built on Mellanox BlueField SmartNICs, which will enable customers to maximize computing resources by pairing the E8 agent with the SmartNIC. The new integration will allow users to utilize the benefits of CPU offload to overcome any performance bottlenecks to application performance. E8 storage will be joining Mellanox at SuperComputing 2018 (SC18) in Texas, at booth #3207, to showcase a live demo of the solution.

By offloading the E8 Agent functionality to the Mellanox BlueField SmartNIC, the joint solution ensures that the compute resources remain dedicated to the customer’s applications. All the storage data path operations are managed by the dedicated computing resources of the offload engine, not by valuable server compute cores. The ability to accelerate performance and optimize compute resources is ideal for intensive workloads that need to operate in real time, such as genomics, analytics and AI.

The integrated solution delivers the proven high performance and low latency of E8 Storage systems, while adding dedicated computing power in a low- profile PCI form factor. E8 Storage delivers 10x the performance of existing flash solutions, with consistent sub-millisecond latency, and has set benchmark records in multiple industries.

Please visit booth #3207 at SC18 in Dallas where E8 Storage and Mellanox will be on hand to provide a live demo.

Event: SuperComputing 2018

Date: November 12-15 2018

Booth: #3207

Location: Kay Bailey Hutchison Convention Center, 650 S. Griffin Street, Dallas, TX 75202

Link: https://sc18.supercomputing.org

“The combination of the E8 agent with our BlueField SmartNIC is ideal for organizations that need fast and scalable performance,” said Gilad Shainer, vice president of marketing at Mellanox. “The integration is complimentary and allows customers to take advantage of full acceleration, while adding dedicated computing power, networking and processing subsystems. The result is high performance and acceleration, while significantly reducing total cost of ownership.”

Zivan Ori, co-founder and CEO of E8 Storage explained, “The SmartNIC is uniquely designed to offer speed, reliability and high availability, many of the concerns customers face when it comes to optimizing data workloads in data intensive verticals. By offloading the E8 software onto Mellanox BlueField we are able to leverage the benefits to provide a combined solution that meets the demands of high performance compute clusters – enabling fast access to data.”

For more information on purchasing E8 Storage appliances, please contact [email protected].

About E8 Storage

E8 Storage is a pioneer in shared NVMe storage for data-intensive, high-performance applications that drive business revenue. E8 Storage’s affordable, reliable and scalable solution is ideally suited for the most demanding low-latency workloads, including real-time analytics, financial and trading applications, genomics and large-scale file systems. Driven by the company’s patented architecture, E8 Storage’s high-performance NVMe over Fabrics certified storage delivers record breaking performance at half the cost of existing storage products. When performance matters, enterprise data centers turn to E8 Storage for unprecedented storage performance, density and scale, without compromising on reliability and availability. Privately held, E8 Storage is based in Santa Clara with R&D in Tel Aviv, and channel partners throughout the US and Europe. For more information, please visitwww.e8storage.com, and follow us on Twitter @E8Storage and LinkedIn.

November 13, 2018 0 comments

Datacenter News Storage

Spectra Logic Announces the World’s Largest Data Storage Machine

by Phil Marsden November 13, 2018

written by Phil Marsden

Spectra Logic Announces the World’s Largest Data Storage Machine Capable of Protecting More than 2,000 Petabytes (2EB) of Digital Assets.

Wow, That’s mind-bending ammounts of storage 20TB per tape at 400MB/s !

Congratulations to Spectra Logic on pushing the boundaries and giving us a glimpse of the future!

Read more at the official press release here, reposted below

Integrated with IBM® TS1160 tape technology, Spectra Expands Tape Offerings with Industry’s Most Advanced Tape Drive and Media

BOULDER, COLO. (PRWEB) NOVEMBER 07, 2018

Spectra Logic, a leader in delivering innovative and economical data storage solutions for the modern data center, announced the industry’s most advanced tape technology, the IBM® TS1160, for use in Spectra’s enterprise-class tape libraries. Spectra’s TFinity ExaScale, T950, T950v and T380 Tape Libraries will support the new tape drive and media. When fully populated with the TS1160 tape drives and media, a TFinity ExaScale will store and allow access to more than 2 exabytes (2,000 petabytes) of data, making Spectra’s TFinity ExaScale Tape Library the largest single data storage machine in the world.

Providing a number of technological breakthroughs over the TS1155 drive, the remarkable TS1160 tape drive and media expands storage capacity to an unprecedented 20TB per cartridge (uncompressed), increases data throughput to 400 MB per second (uncompressed), and delivers twice the Fibre Channel interface speed at 16 Gigabit per second or up to 25 Gigabit per second Ethernet.

“As we go into our 40th year, Spectra once again is delivering another breakthrough in tape innovation for users with immense data storage needs,” said Spectra CEO Nathan Thompson. “To achieve this level of technological advancement, a new custom ASIC chip was developed by IBM; one that will be used in future generations of TS and LTO enterprise tape technologies, including LTO-9. Ongoing engineering feats such as these underscore the value and longevity of tape technology and validate Spectra’s commitment to offering best-in-class tape solutions that align with tape’s extensive roadmap.”

Benefiting Specta’s customers in government, scientific research, high performance computing, media and entertainment, and public cloud environments, the upcoming JE tape cartridge, which works with the TS1160 tape drive, is engineered with aligned Barium Ferrite (BaFe) particles that provide better signal-to-noise ratios, enabling an unmatched native storage capacity of 20TB per cartridge. In addition, the same advanced TMR (Tunneling Magneto-Resistive) head used to achieve higher capacities and speeds in the TS1155 and LTO-8 drives are being used for the TS1160 drive. Spectra’s new tape offerings are designed to read and write media formatted by the TS1155 and TS1150 drives and read tapes formatted by drives older than two generations.

“Tape is the most reliable and economical storage technology available to protect massive amounts of data,” said Fred Moore, president of Horison Information Strategies. “The tape industry continues to keep pace with published roadmaps and surpass other media types by leveraging technology innovations from the prior generation and designing new components for each subsequent generation.”

Spectra’s offering of the TS1160 tape drive for its TFinity library will be available in December 2018. *

*Spectra’s T950, T950v and T380 Tape Libraries will support the TS1160 tape drives and media in 2019.

Resources:
TS1160 Data Sheet from Spectra Logic
White Paper on Tape Performance
Spectra HPT video
TFinity ExaScale brochure

About Spectra Logic Corporation
Spectra Logic develops data storage solutions that solve the problem of short- and long-term digital preservation for business and technology professionals dealing with exponential data growth. Dedicated solely to storage innovation for nearly 40 years, Spectra Logic’s uncompromising product and customer focus is proven by the adoption of its solutions by industry leaders in multiple vertical markets globally. Spectra enables affordable, multi-decade data storage and access by creating new methods of managing information in all forms of storage—including archive, backup, cold storage, private cloud and public cloud. To learn more, visit http://www.SpectraLogic.com.

Follow Spectra Logic on social media:
Twitter: @spectralogic
Facebook: https://www.facebook.com/spectralogic
LinkedIn: https://www.linkedin.com/company/spectra-logic
Instagram: @spectralogic

# # #

Media Contacts:
Matter Communications
Tim Hamilton
978-518-4503
spectralogic(at)matternow.com

Spectra Logic
Susan Merriman
303-449-6444,1378
susanm(at)spectralogic.com

Spectra and Spectra Logic are registered trademarks of Spectra Logic Corporation. All other trademarks and registered trademarks are the property of their respective owners.

November 13, 2018 0 comments

Datacenter News Storage

E8 STORAGE ACCELERATES PERFORMANCE FOR GENOMIC WORKLOADS AT QUEEN MARY UNIVERSITY OF LONDON

by Phil Marsden November 13, 2018

written by Phil Marsden

E8 STORAGE ACCELERATES PERFORMANCE FOR GENOMIC WORKLOADS AT QUEEN MARY UNIVERSITY OF LONDON

Having worked both as a researcher in computational chemistry and as IT support for the Department of Chemistry at a major University I understand the demards and inportance of storage in such an environment. I congratulate Queen Mary University of London for choosing E8, who are cementing there place as a top storage provider running some of the most important institutions.

Read more in E8`s official press release bellow :

Delivering record breaking performance speeds, E8 Storage underpins leading university scientific research department

SANTA CLARA, Calif., October 30, 2018 – E8 Storage today announced that Queen Mary University of London (QMUL) has deployed its E8-D24 array, to underpin its genomic workflow and significantly increase performance. With the newly updated high-performance cluster, supporting both researchers and students, the university will be able to leverage the benefits of the industry’s first shared NVMe storage solution to optimize its data intensive workloads.

QMUL is part of the Russell Group and is ranked fifth in the UK in terms of research quality. Integral to this ranking is its study of genomics, which requires high computing and processing power. IT services has a fundamental role to play in supporting this research, and with the need to cater to students and influential researchers, the institution decided to research the market and with the support of its channel partner, OCF, looked for the latest cutting-edge technology with performance and optimization.

With a shared NVMe platform in place, QMUL will be able to accelerate the performance of the fast-tier scratch space behind a distributed file system to support genomic sequencing. The high performance and capacity of E8 Storage NVMe-over-Fabrics system will be used for both data and metadata, enabling QMUL to push more jobs through the cluster at a faster rate. The previous high-performance system that the university used to support its genomics research was no longer fast enough to support its needs. Upon seeing the new performance benchmarks published by E8 Storage, the institution decided to upgrade, opting for the E8-D24 array to support genomics and moving the existing storage to a bulk storage tier. With E8 Storage, QMUL now has equipment that can scale for the future and deliver the fastest performance speed in the industry.

“Performance and data optimization are essential to the work behind genomic processing, and E8 Storage overcomes the traditional storage constraints with its first-to-market shared NVMe storage platform,” commented Tom King, Assistant Director for Research, IT Services at Queen Mary University of London. “The team have been extremely responsive right from the start, and we were able to discuss our main use case and pain points to see how the technology would enable us to overcome this. In addition, we were extremely pleased with the latest benchmark performance tests which showed E8 Storage as a leader in NVMe.”

Mahesh Pancholi, business development manager at OCF noted: “OCF are pleased to be working with E8 Storage to provide the industry leading NVMe-over-Fabric solution to QMUL. The combination of great performance figures and integration with IBM Spectrum Scale make the E8 Storage appliance a great way for QMUL to provide a high-performance scratch tier to their cutting-edge researchers. OCF are proud to have been working with QMUL to expand their cluster and provide cutting edge technologies and services to their world-class researchers.”

“The combination of Mellanox InfiniBand and E8 Storage provides the high computing and storage performance needed for QMUL’s research activities,” said Gilad Shainer, Vice President of Marketing at Mellanox. “Connecting the shared NVMe storage with high speed InfiniBand network is an ideal design for genomic applications.”

About E8 Storage

E8 Storage is a pioneer in shared NVMe storage for data-intensive, high-performance applications that drive business revenue. E8 Storage’s affordable, reliable and scalable solution is ideally suited for the most demanding low-latency workloads, including real-time analytics, financial and trading applications, genomics and large-scale file systems. Driven by the company’s patented architecture, E8 Storage’s high-performance NVMe over Fabrics certified storage delivers record breaking performance at half the cost of existing storage products. When performance matters, enterprise data centers turn to E8 Storage for unprecedented storage performance, density and scale, without compromising on reliability and availability. Privately held, E8 Storage is based in Santa Clara with R&D in Tel Aviv, and channel partners throughout the US and Europe. For more information, please visit www.e8storage.com, and follow us on Twitter @E8Storage and LinkedIn.

November 13, 2018 0 comments

Datacenter Storage

How NetApp MetroCluster Works

by Roger Lund October 31, 2018

written by Roger Lund

This post covers NetApp MetroCluster, a business continuity solution which provides synchronous data replication between two sites within 300km. Before getting into the MetroCluster details, let’s have a quick review of the NetApp High Availability feature first.

When a Controller Fails

The controller is the brains of the storage system and provides clients access to their data. If a controller went down then clients would lose access to data if redundancy was not in place. Controllers are arranged in pairs to provide this redundancy. Each controller in the pair controls its own data, and can additionally take over for its peer if that peer goes down. Multiple HA pairs can be deployed in the same cluster to provide additional capacity and performance.

Based on the diagram above, we have a standard high availability pair with “Controller 1” and “Controller 2”. The disk shelves are shown under the controllers. In NetApp ONTAP, disks are always owned by one and only one controller and are grouped into aggregates. Per the diagram above, Aggregate 1 is owned by Controller 1 while Aggregate 2 is owned by Controller 2.

Controller 1 and Controller 2 act as High Availability partners for each other. If Controller 1 fails, Controller 2 will detect it.

A controller can detect its peer has failed through the physical High Availability connection, or through hardware assisted failover with the help of Service Processors.

Controller 2 will take temporary ownership of the disks when Controller 1 fails.

Controller 2 now owns Aggregate 1 and Aggregate 2 and clients can still reach all of their data.

When a Whole Building Fails

High Availability helps when we have a controller failure, but what if it’s not just a controller but the whole building which goes down?

The above example shows Controller 1 and its disk shelves in one building while Controller 2 and its disk shelves are in a different building.

We have a power outage that causes us to lose the controller and the disk shelves. Through the High Availability connection, Controller 2 will detect the failure as it’s no longer receiving a keepalive signal.

However, Controller 2 won’t be able to take control of Aggregate 1 because the disk shelves are down. Aggregate 1 is no longer available due to the power failure.

So High Availability gives us redundancy for our controllers, but it doesn’t give us redundancy in case we lose the entire building. This is where NetApp MetroCluster comes in.

MetroCluster

In the event of a building failure, NetApp MetroCluster gives us redundancy by combining High Availability and SyncMirror. SyncMirror is used to mirror the aggregates to a disk shelf in each building. We can selectively choose which aggregates to mirror. We can choose to mirror all of them or we can cut down on disk hardware costs by only mirroring our mission critical aggregates.

When a Building Fails – with MetroCluster

Let’s see how NetApp MetroCluster works.

We have Controller 1 in one building and Controller 2 in another. Both have their own disk shelves. Aggregate 1 is still owned by Controller 1, but it is SyncMirrored across the two buildings. Plex 0 is in one building and Plex 1 is in the other building. Whenever data is written to the aggregate it is written to both plexes at the same time, so both buildings end up with the same copy of the data. The same applies to Aggregate 2 as well.

Building 1 fails so we lose controller 1 and its disk shelves, but because Aggregate 1 and Aggregate 2 are still available in the other building, clients won’t lose access to their data.

Switchover to the other building can either be done manually or automatically. This will be discussed in more detail later.

Recovery Point Objective (RPO)

SyncMirror and MetroCluster use synchronous replication. Data is written to both buildings before the acknowledgment is sent back to the client. NetApp MetroCluster has a Recovery Point Objective of zero due to synchronous replication. No data is lost in the case of an outage.

MetroCluster in 8.3.0

MetroCluster has gone through a few different implementations in NetApp ONTAP. It first became available in ONTAP version 8.3.0. MetroCluster had been available in 7-mode for a long time, and was practically the last major feature to be ported over to Clustered Data ONTAP.

NetApp MetroCluster only supported a four node setup in ONTAP 8.3.0. This is also still available in later versions of ONTAP. With the four node MetroCluster setup, both sites host an independent two node cluster. We can’t have three or more as MetroCluster only runs across two sites.

The two nodes in each site are a High Availability pair for each other. SyncMirror mirrors the aggregates across both sites – not across both nodes in the same site but across both actual sites. It’s an active-active configuration where each site serves read requests to local clients from the local Plex.

The remaining controller in the same site will take over its aggregates if a single controller fails as shown in the standard High Availability scenario. If both controllers in a site go down, meaning the entire site is down, you can failover to the other site. The sites can be up to 200 kilometres apart.

The Distance From Controller to Shelf

SAS only supports short cable lengths so if SAS is being used for the controller to disk shelf connections, there will be a controller to shelf distance problem. So how can we cable controllers to a disk shelf in another building?

Fabric-Attached MetroCluster

Fibre Channel supports long length cables. A pair of Fibre Channel switches can be installed in both sites. The next problem that we have is the current models of disk shelves have SAS, not Fibre Channel ports. How are we going to actually get the cable into the disk shelf, if we’re using these long distance Fibre Channel cables to go from the controller to the shelves in the other site?

ATTO FibreBridge

This is where the ATTO Fibre Bridge comes in, a Fibre Channel to SAS Gateway. It has both a Fibre Channel and a SAS port, and it can convert between the two. The controller connects to the Fibre Bridge with Fibre Channel cables via the Fibre Channel switches while the Fibre Bridge connects to the disk shelves with SAS cables.

Cabling in Fabric-Attached MetroCluster

Let’s take a look at the NetApp MetroCluster Cabling for Fabric-Attached MetroCluster. It is called Fabric-Attached MetroCluster since it uses Fibre Channel switches.

In the below example, we have an HA pair for four node MetroCluster in Building 1, Site 1 Controller 1 (S1C1) and Site 1 Controller 2 (S1C2).

We also have an HA pair in Building 2, Site 2 Controller 1 (S2C1) and Site 2 Controller 2 (S2C2).

Next, let’s take a look at the disk shelves. In Building 1 we have Aggregate 1 Plex 0 owned by Site 1 Controller 1.

Then we have a SyncMirror Plex (Plex 1) for Aggregate 1 located at the other site, owned by Site 1 Controller 1.

Then we have a second aggregate, Aggregate 2 Plex 0, owned by Site 1 Controller 2.

Also SyncMirror Plex (Plex 1) for that aggregate in Site 2.

Next is the third aggregate, Aggregate 3 Plex 0 in site 2 owned by Site 2 Controller 1.

We also have a Plex 1 for that in Site 1. Both Plexes are owned by Site 2 Controller 1.

Lastly, we have Aggregate 4 owned by Site 2 Controller 2.

Building 2 has Plex 0 while Plex 1 is in Building 1.

In the example shown, we use four aggregates and both sites have single stack of disk shelves to make the diagram clear and easy to understand. You can have as many aggregates as you want and you can also have multiple stacks in the different sites if needed.

Now let’s take a look at ATTO Fibre Bridges.

We cable these up to the disk shelves using SAS cables. In Site 1, Fibre Bridge 1 will be connected to the top shelf in the stack.

Then going down, it will be daisy chained from there.

Then the Fibre Bridge 2 will be connected to the bottom shelf in the stack.

And once we are done with Site 1, we will do the same on Site 2.

The next thing that we need is the Fibre Channel switches.

The switches are from either Cisco or from Brocade. Fibre Channel Switch 2 will be connected to Fibre Bridge 2 while Fibre Channel Switch 1 will be connected to Fibre Bridge 1. Both will be using Fibre Channel cables.

The Fibre Channel switches in Site 1 are shown above. The same setup will be implemented in Site 2.

We connect the controller to the Fibre Channel switches using Fibre Channel cables. Site 1 Controller 1 will be connected to the first Fibre Channel switch and to the second Fibre Channel switch as well.

We do the same for Site 1 Controller 2, it is also connected to both switches.

The above diagram only shows one connection to make it tidier but there’s actually two connections from each node to each switch.

The above diagram shows the Node Initiator connections, which give the controllers connectivity over Fibre Channel to the disk shelves on both sides.

Site 2 is also configured with similar connections.

The controllers have connections to the disk shelves in the same site but because we are using SyncMirror to write to both locations they need to have connections to the disk shelves in both sites. That’s the reason we connect both Fibre Channel switches together. Fibre Channel Switch 1 in both sites are connected to each other, and Fibre Channel Switch 2 in both sites are connected to each other.

The above diagram only shows one connection, but you can actually have up to four connections between each pair of switches bundled into a port channel. As you can see, there’s no single points of failure. There are two controllers in each site, which are configured as an HA pair for each other. We have two Fibre Channel switches in each site, two Fibre Bridges in both sites, and the aggregates are SyncMirrored across both sites as well.

FC Virtual Interfaces

In the previous diagram, cabling showed the Node Initiator connections from the controllers to the disk shelves in both sites for reading and writing to disk during Consistency Points. Writes still work in the same way as usual though, where they are written to NVRAM before being written to disk. NVRAM mirroring also takes place over the Fibre Channel network between both sites. It uses Separate 16-Gbps Fibre Channel Virtual Interface (FCVI) connections from the controllers.

We now have a Fibre Channel Virtual Interface (FCVI) connection from each controller going to both Fibre Channel switches in the same site. Those are the connections for Site 1 Controller 1.

Then we do the same for Site 1 Controller 2 as well as for the two controllers in Site 2.

For the Node Initiator connections, we’ve got two Node Initiator connections from each node to each switch. These are for our reads from disk and for our writes during Consistency Points. For the FCVI connections which are being used for the initial NVRAM mirroring between the two sites, we’ve got a single connection from each controller going to both switches in the same site. The FCVI connection has to use a dedicated 16 G connection while the Node Initiator connections can use a standard Fibre channel port on the controller.

Configuration Replication

SVM, LIF, volume, and LUN configuration information are replicated between the two sites using the Configuration Replication Service (CRS). Just like SnapMirror traffic, it replicates over a standard IP network, using cluster peering and inter-cluster Logical Interfaces (LIFs).

There are three different types of connections on our controllers:

Node Initiators – for connectivity to our disks going over Fibre Channel
FCVI – for NVRAM mirroring that also goes over the same Fibre Channel network.
CRS connectivity – uses Inter-cluster LIFs going over an IP network.

Connecting From Clients

Cluster identity is preserved during a switchover from one site to another. Clients connect to the same IP addresses or WWPN’s they were using before at the original site when it failed. The client data network must therefore span both sites. The same layer 3 subnet has to be available on both sites since the clients connect to the same IP address. You can use dark Fibre, an MPLS layer 2 VPN service, or a proprietary solution such as Cisco OTV (Overlay Transport Virtualization) for client protocols running over IP, like NAS or iSCSI, or a SAN fabric that spans both sites for Fibre Channel.

MetroCluster in 8.3.1

As previously mentioned, NetApp MetroCluster for Clustered Data ONTAP was released in version 8.3.0 which originally supported four node clusters. Support for two node MetroCluster was added when ONTAP 8.3.1 was released. Both sites host an independent single node cluster and the sites can switch over in case of a failure. There are three different supported two node configurations:

Stretch MetroCluster
Stretch MetroCluster with SAS Bridges
Fabric MetroCluster

Stretch MetroCluster

The controllers are cabled directly to the disk shelves with NetApp proprietary long reach SAS cables in two node Stretch MetroCluster. We are not required to use Fibre Channel switches and ATTO Fibre Bridges like the ones we used in Fabric MetroCluster. The maximum distance is only up to 500 metres, it is not as long as Fibre Channel since we are using SAS cables.

As you can see from the above diagram, setup is very similar to a standard High Availability.

Using SAS cables, the controllers are connected to the disk shelves. The Fibre Channel VI connection where is cabled directly between Fibre Channel ports on the controllers.

Stretch MetroCluster with SAS Bridges

The next type available is Stretch MetroCluster with SAS Bridges, where the controllers are not cabled directly to the dish shelves but via ATTO Fibre Bridges. It uses Fibre Channel but Fibre Channel switches are not being used here. The maximum distance is 500 metres.

The above diagram shows the controllers have a single Fibre Channel connection to the ATTO Fibre Bridge in both sites and the ATTO Fibre Bridge then has a SAS connection going to the disk shelves. Then SAS connections are daisy-chained down through the stack. Again, we have the FCVI connection for the NVRAM mirroring between the two controllers.

Two Node Fabric MetroCluster

The last type is two node Fabric MetroCluster. The system is cabled the same way as the four node Fabric MetroCluster which was supported on 8.3.0. But the maximum distance is increased from 200km to 300km.

The above diagram shows that it is almost the same as four node Fabric MetroCluster except that we only have one node in each site.

MetroCluster in Version 9

When ONTAP 9 was released, we had another improvement in that ONTAP 9 supports eight node Fabric MetroCluster. With this, we have two HA pairs located in both sites where each HA pair is replicated to its secondary HA pair at the other site. The maximum distance is 200 km over Fibre Channel, or you can use the new option of Fibre Channel over IP (FCIP) which can go up to 300 km.

Switchover

Let’s see what we would do if we lost a site.

An automatic switchover will occur once a node undergoes a panic in a two node NetApp MetroCluster.

In order to prevent a split brain scenario, where both sites lose connectivity with each other and assume the primary role for all aggregates, switchover occurs manually or through the use of MetroCluster Tiebreaker software.

A split brain scenario has to be avoided at all cost because it can lead to different data being written to the Plexes for the same aggregate in both sites. Clients in Site 1 would be writing to Aggregate 1 in their site while clients in Site 2 would be writing to their Plex for Aggregate 1 in Site 2. As a result, we would have two different, inconsistent copies of the data in the same aggregate. We need to ensure this doesn’t happen.

Split Brain

The situation that could lead to a split brain is when both sites are up but lose connectivity to each other.

This is the reason why switchover doesn’t happen automatically by default. Normally, each site won’t know if the loss of connectivity was due to the site going down or just the network connection going down.

Manual Switchover

The first way to initiate a switchover is by doing it manually. The administrator verifies if a site really has gone down and if it needs to switchover to the other site. The command to use is “metrocluster switchover”. It will perform a graceful, negotiated switchover if we do this while both sites are still available. We can do this if we want to take a site down for maintenance.

The other command that we can use is “metrocluster switchover -forced-on-disaster true”. This is for us to force a switchover when a site has actually failed. The only issue that we have on manual switchover is that the whole process takes time, from learning that the site has failed, up to manually entering this command. We may want to speed things up.

MetroCluster Tiebreaker

Using MetroCluster Tiebreaker (MTCB), we can automate the switchover. MTCB is a Red Hat Java application that runs in a third site which has connectivity to both clusters. It independently monitors both sites, and if issues are detected, it will send SNMP alerts. In the event of a site failure, it can also be configured to automatically switchover. When using MCTB, the Recovery Time Objective (RTO) is 120 seconds for automatic switchover.

It works by establishing SSH Secure Shell sessions with each node’s node management IP address to verify that they’re up. If the SSH session to a node goes down, MCTB will first check the HA status within the site to verify if it’s just that one node that has gone down and that it’s failed over to the HA peer in the same site. It will be declared unreachable if both nodes of an HA pair is unresponsive.

To ensure that it is not just the network from the third (MCTB) site to the first MetroCluster site that’s gone down, MCTB will ask the second MetroCluster site via SSH if it has connectivity over the FCIV connection or the inter-cluster IP network to the first site. This verifies a site failure because connectivity has been lost through two separate paths. At this point you can configure MCTB to only send you an alert, or you can also configure a rule which will cause an automatic switchover.

MetroCluster Interoperability

MetroCluster can be used in conjunction with SnapMirror and SnapVault. For instance, MetroCluster could be used to provide synchronous replication between two sites within 300 km, while SnapMirror could also be used to provide additional redundancy by asynchronously replicating the same data to a third site with no distance limitation.

For example, say you’ve got a site in New York and one in Philadelphia and they are within a couple of hundred kilometres of each other. You can use MetroCluster to get synchronous replication between those two sites with an RPO of 0. And to guard against a regional disaster, like flooding affecting the entire East Coast of the U.S, you can setup SnapMirror to replicate the data asynchronously to London.

The data could also be backed up off-site with no distance limitation using SnapVault Backup.

You can practice on NetApp storage by downloading the free NetApp simulator ‘How to Build a NetApp Lab for Free’ PDF.

October 31, 2018 0 comments

Datacenter News Storage

NETAPP INSIGHT 2018 – TFDX- Data Fabric 2.0

by Roger Lund October 24, 2018

written by Roger Lund

It’s Wednesday. We are here at the Netapp Insight event. And Netapp again is presenting on NetApp Data Fabric 2.0

@Arjantim @ChanEk81 @DeepStorageNet @settlersoman @darkkavenger @RayLucchesi @sfoskett @MBLeib @BenTGage https://tfd.bz/2pXpD82

also we have @davidchapa @ChrisMaki @mcbride_ruairi at the table and @PeytStefanova and @timwaldron in the room!

Yes it’s the hashtag #NetAppInsight and #TFDx !

Here we go!

Eiki Hrafnsson presenting.

Data Fabric 1.0

Colocation
Public Cloud
Data Center
Snap mirror connected.

NetApp delivers a data fabric built for the data-driven world.

Data Viability
Data Protection
Data Integration and Orchestration
Data & Cloud Optimization
Data Security & compliance

Data Fabric Services

Management & Monitoring
Orchestrations, integrations, controls
Data services – backup , protection, security
Data Storage

( Cloud Central Demo , will put video here )

The Data Fabric Services Catalog – Netapp & 3rd Party

Unified single endpoint Services API ( the data fabric API )

Data Fabric / Cloud Orchestration’s scope

Extensible data fabric api for multi cloud discovery, data movement, and services
extensible cloud orchestrate user interface
multiple cloud identity management and RBAC
Global Metadata APU labels and other metadata
Core Services
Cloud volumes and cloud backup
Netapp kubernetes Service and Fabric apps
Fabric Flows

Data Orchestration & Integration

Setup Data Sync Relations that sync data to and from your cloud volumes and any source / target auto optimized.

Build & run data fabric enabled applications on fully managed kubernetes cluster running within your own public cloud accounts or on-prem with Netapp HCI.

Visually design & run workflows that mashup your data fabric and data events with external services, events.

Data Sync

Sync/ Mirror your files between any source

Multi Cloud Workloads

Container based applications

netapp kubertnetes service

automated cluster config, deployment, autoscaling, ha

service mesh, ( istio) and cluster federation built-in

NKS – Product ready kubernetes Multi-cloud clusters.

Fabric Flows – Demo ( insert video here )

Cloud Insights

October 24, 2018 0 comments

Datacenter News Storage

NETAPP INSIGHT 2018 – TFDX- MAX Data

by Roger Lund October 24, 2018

written by Roger Lund

It’s Wednesday. We are here at the Netapp Insight event. And Netapp again is presenting on MAX Data.

A Few of the delegates.

@Arjantim @ChanEk81 @DeepStorageNet @settlersoman @darkkavenger @RayLucchesi @sfoskett @MBLeib @BenTGage https://tfd.bz/2pXpD82

also we have @davidchapa @ChrisMaki @mcbride_ruairi at the table and @PeytStefanova and @timwaldron in the room!

Yes it’s the hashtag #NetAppInsight and #TFDx !

Here we go!

Andrew Grimes is presenting

See his blog post on this technology prior to the release.

An Update on the Plexistor Acquisition: Introducing NetApp Memory Accelerated Data

Netapp Max Data : In-Memory Application Data Services

Product Page for Max Data a Acquisition of plexi store.

https://www.netapp.com/us/products/data-management-software/max-data.aspx

MAX Data Datasheet

Podcast: Tech ONTAP Podcast Episode 154: NetApp MAX Data

Agenda

Netapp Flash Memory Fabric
Max Data Overview
Use cases
Resources

Netapp Data Fabric, this product fits well.

Real-Time Applications : the next phase of digital transformation.

AI
Deep Learning
Real-Time Analytics

Started with Disk
Focus on the future of Solid-State Media.
Then NVMe and now persistent memory.

Layer 4-Layer 0

Lower latency, more IOPS as you move to layer 0 but costs rise as you move to Layer 0.
- ( note i’ll insert a picture here )
Persistent Memory Technology.
NVDIMM , based on 3D Xpoint.
Costly – intel optane dimm’s should lower costs.

Comprehensive NVMe : End-to-End & non-disruptive.

Add FAS, AFF, and AFF with NVMe non disruptively.
Storage-Class Memory SCM as cache.
Persistent Memory PMEM in Server.
NVMe Over fabrics.

What is Max Data Design Goals

Faster application response time – focus on low latency

leverage server-side memory data locality ( close to application )
software-only approach + PMEM
Acceleration for read and write workloads
- not a cache ( page caching not needed )
current software stack written to disk. no re architecture of the software stack for memory
Application plug and play
enterprise data protection services.
Put the LUN on the bus Enterprise Data Protection Tools
Linux IO/ Stack vs Linux Stack leveraging Netapp MAX Data File System

MAX Data on the memory bus PMEM Data Acceleration

Primary Tier

Memory. DRAM, NVDIMM, D3XP

Secondary Tier

Flash ( must be block)
Local flash SSD, or LUN.

Memory Optimized File System

Single Mount Point
Acceleration any workload
Focus on acceleration Writes.
Read after read acceleration.

Minimal Change to the application

Accelerate PM deployments.

Server and SAN can be supported.

1:25 ratio T1 should be backup-ed to T2

Sub 10 Micro Sec Latency.

T1 replication possible. adds 4 MS of latency.

Sync mirroring.

Max Data Data Services for PMEM

Protecting writes at memory speeds.
- Max Snapsync
- T1 to T2
- Integrated with Ontap snapshot
- Integrated with ontap Snapmiorror
Max Recovery
- T1 to T1 mirroring of writes
- Private RDMA Ethernet 100GB
- RPO 0 Data Protection for in memory.

Brings Enterprise Data Services to PMEM

MAX Data Cloud Integrations

Integrated Cloud Services

ontap fabric pool archive/offiste
ontap NDAS Backup
NPS
Cloud Volume Services ontap

Native Cloud Data Services

Future Releases
Resident Cloud instance
AWS, EC2, With DRAM and EBS
Azure, and GCP have been tested.

MAX Data Software Defined flash

Server
SAN
Cloud

Netapp MAX Data

use cases

MAX Data – Target Workloads

Real-time Analytics trading platforms
in-memory databases
AI
Data Wearhousing

Mongo DB

supports very large work sets
3x better OPS/SEC
4-5x lower latency

Planned on a PER Server license.

October 24, 2018 0 comments

Datacenter News Storage

Netapp Insight 2018 – Keynote Day 2

by Roger Lund October 24, 2018

written by Roger Lund

It’s day three here at Netapp Insight. And we are gearing up for our second keynote shortly.

Below I will do a live blog as I can of the keynote.

Today’s focus is netapp’s data fabric.

On the stage Henri Richard.

He talks about leveraging you data. He likes the concept of speed is the new scale. Stresses the importance of not just a business strategy but a data driven strategy.

France is not for sale… haha

Henri discussing customers and approach.

Dreamworks will come talk from a technical standpoint.

We will here from our business leaders.

Dave hits will give a small preso.

Scott dawkinds and Jeff wike take the stage.

Jeff talks about the need to burst out past the data center, into the public cloud.

Scott talks about the customer needs and trying to align the business offerings to meet the needs with data fabric.

Jeff talks about his public cloud journey. What he really wants is to start with a cloud native way, ( they use micro services ) he wants to be able to control where the process goes based on latency and cost. The artist shouldn’t care where the data is.

Scott talks about how netapp envisions one way to orchestrate the work, and set how scheduling is done with your own resources and public cloud. The next phase is around analytics.

Jeff stresses the importance of not having the budget to move the entire data center into the cloud, but just the working set. Cost is important, but so is performance.

Scott talks about the impressive workload is, Jeff jokes about him wanting his job. Then goes into flash and tiering.

Wrapping up this segment.

Anthony Lye.

He talks about how netapp has been a pioneer in the field.

INSPIRE

Innovation with cloud

Netapp Cloud Data services, deploy, run , monitor, maintain and manage services on the biggest public clouds so costumers don’t have to.

Cloud Analytics
Orchestrations , integrations, controls
Data services
Cloud Storage

A Year Of Innovation.

Tad Brockway, GM Azure storage.

Tad talks about the deep engineering project, and the two teams and how they have worked together. Leading to a real solution for real piratical workloads.

They discuss how they are delivering storage as a service, and on demand. Allowing a Application developer to have access to ontap on the public cloud.

Azure NetApp files can deliver.

100k IOPS @ sub-millisecound latency,

Unprecedented Cloud Performance.

Brad Clark. Director, McKesson.

Brad talks about the impressive latency, and predictable throughput.

He talks about how he was able to provision storage in under 60 minutes.

He talks about how he can shorten the time to delivery on the public cloud with SAP.

Netapp talks about the data management challenge, and new certifications.

next segment

Netapp Kubernetes, ISTIO, and Trident.

Kubernetes is the clear winner in container orchestration.

However, Containers need world class storage solutions.

Netapp Kubernetes Services ( NKS )

The Control Plane.

You can deploy clusters of containers on all clouds, and private cloud. ( HCI)

The Industry’s first complete multi-cloud kubernetes control plane.

James Holden

Cloud Insights

Giving a demo of the product. It allows you to monitoring both public and on-prem.

next segment

Brad Anderson.

He talks about the bar, and expectations, and how the bar is only going to get higher.

BUILD

Clouds to Accelerate New Services.

We promised and we’re delivering.

After last conference we launched HCI. He talks about the new services and business models and how that is his mission. to power a hybrid multi-cloud experience.

HCI – Hybrid Cloud Infrastructure.

Simplify , Automate, Deliver..

Benjamin Molloy,

Rachael Hedges.

Case Study : Consultel Cloud.

Adam Carter

Netapp delivers a data fabric built for the data driven world.

He is going to show us a example of a service. a Demo.

segment end

Joel Reich

Modernize.

Digital transformation requires IT Transformation.

He discuses the Value of flash. Cost savings, and how you can simplify your IT.

Thank you to our customers, and partners.

Kim

Talking about ontop, the journey, MQTT-Broker.

IBM Cloud

Discussing moving to the cloud. The journey.

Talking both the on-prem and cloud partnership. HCI / And Flexpod.

netapp ontap AI. with Nvidia and Netapp.

Accelerate your AI data pipeline for deep learning.

Introducing Netapp Max Data.

Max Data gives you SPEED!

Intel collaboration, with OPTANE DC Persistent memory.

11X in mongodb.

on-prem and in the future in the cloud.

Dave Hitz

Cloud first customers.

Cloud it? Should it?

He believes it’s the golden age of AI.

He shows how a actual cloud developer would run a .rg file.

He shows a example of the future of the data center, multi-cloud.

With the cloud , the goal is to inspire innovation.

In close, please us work with you to build your data fabric.

October 24, 2018 0 comments

Datacenter News Storage

Netapp Insight 2018 – TFDx- HCI Revealed! Hyper Converged or Hybrid Cloud?

by Roger Lund October 23, 2018

written by Roger Lund

I am sitting in Las Vegas , at a round table with some great colleagues today. No not at a poker table, but at yet another Tech Field Day.

A Few of the delegates.

@Arjantim @ChanEk81 @DeepStorageNet @settlersoman @darkkavenger @RayLucchesi @sfoskett @MBLeib @BenTGage https://tfd.bz/2pXpD82

also we have @davidchapa @ChrisMaki @mcbride_ruairi at the table and @PeytStefanova and @timwaldron in the room!

Yes it’s #NetAppInsight!

Time to get started.

Kicking it off is Gabriel Chapman @Bacon_Is_King

What is HCI.

Easy to buy

Simple to setup correctly

consolidates management & mait

Make private clouds easy

one throat to choke

easy to plan and scale

pay as you go expansion & economics

Rapidly respond to business demands

What is the differences between converged and HCI? Parts? self deployed vs built for the customer. Scale up vs Scale out.

Will we see a Mega Deployed in the future? Focus on a single platform and hardware Independence? Gabe thinks so.

Hypervisor and storage combined on Bare Metal. This gives us consolidated management. perhaps 1.0

Next we have Independent Hypervisor and Storage. Lets call this HCI 2.0

HCI Architectural options.

Deployed as a VM.

Time to market
Low entry point
Multiple hypervisors
Easy to buy
Simple to install
Consolidated management

Hypervisor based.

Feature Integration
Less overhead

Independent Compute and Storage

Flexibility & Scale
Enterprise Storage
Bare Metal workload is a option

Lets not get bogged down with definitions.

Netapp has the following focuses:

Inspire

Innovation in the cloud

Build

Clouds to accelerate new services.

Modernize

IT Architecture with Cloud-Connected flash.

Digital Transformation is driving new IT Imperatives.

Gabe claims it’s the battle of the harts and minds, as the public cloud comes into the data center. A example is snowball.

Netapp HCI focused on Customer Challenges.

Deliver Applications Faster
Operate Like Public Cloud
Provide Predictable Performance

Will the future of Netapp’s HCI be all cloud services on a local on-prep HCI offering?

Gabe and our delegates think it’s a future possibility.

Netapp HCI uses the solidfire technology, you can mix the storage nodes with a HCI storage cluster. It’s the same solution.

Netapp HCI Product Vision

Integrate Cloud Services.

Tackle new challanges

Enterprise scale HCI

Netapp Data Fabric

Integration at each step.

Extend Netapp Cloud Services

Expanded compute configurations.

New Trition nodes, GPU Accelerated Nodes.

Streamlined Infrastructure mgmt

Advanced Data Protection

Layered Security

Bundled Switch

Data Fabric

Netapp Open Shift Validated Design almost live

Looking to get to a 2 x 2 node configuration. 2x Computer, 2x Storage, and 2x witness nodes. Exploring options.

No Fiber Channel over token ring support. tehehe….

Thanks Gabe!

October 23, 2018 0 comments