How to Research and Buy Shared Storage for Video

When seeking storage for video production and post applications, do you know how to differentiate media optimized for video from general purpose media for general types of data? If not, ProMax a media solutions provider based in Santa Ana, California, has published some useful information on the topic in a new white paper.

Written by Nate Cooper, ProMax’s director of sales and marketing and a veteran in video workflow issues, the paper provides basic guidance and understanding of shared storage for video to help end users make an informed choice.

While there are thousands of IT vendors that can provide RAID arrays, network storage and networking services, most know very little about video production, post-production and streaming. Many an installation has failed by making the wrong choice.

The vast majority of servers are designed for a large number of users (from dozens to 100’s to 1000’s) with a high number of IOPS (Inputs/Outputs per Second) and generally low bandwidth to each connected user. Video production is nearly the opposite.

Video production requires fewer users (usually 100 or fewer) all with high bandwidth (often 50MB/s to 300MB/s+ to each user) connectivity to every creative user and a much lower number of IOPS.

Because of these unique requirements, which are opposed to the majority of what the rest of the market needs, there are niche products that serve the video production market.

Nate Cooper, author of the ProMax white paper

All vendors familiar with video will have a list of basic questions they will ask. To save time, and make sure the proper information is covered, it is best to prep this information prior to contacting a potential vendor.

Some of these questions include:

How much storage space will you need? (typically measured in Terabytes)
How many Macs will connect?
How many PCs will connect?
What applications will be used on the workstations?
What video formats and codecs will you be working with?
How many streams of video does each workstation require?
How will you ingest footage into the system and what format will it be?
How will you export footage out of the system and what format will it be?
Do you have backspace for the server?
Do you have cables in place?

There are several different approaches available in the market. One is SAN vs. NAS.

The history of SAN vs. NAS in the video work goes back to the early days of NLEs. In the early 2000’s, when Avid Unity and Apple Xsan were first becoming available, SAN was the go to topology for two reasons: it had faster protocol availability (1GbE vs. 2Gb Fibre) and block level connectivity had less issues with video streaming and dropping frames.

Since the mid to late 2000’s, many vendors have switched to a NAS architecture altogether or have added NAS options for their customers. This is due to the introduction and now reasonable pricing on 10GbE, as well as many vendors who have “tuned” NAS for video performance and achieve performance equal to a block level protocol.

In practical operating usage there are no major differences between SAN and NAS. Both must be properly tuned for video workflows, both can work equally well with proper tuning.

Volume locking vs. file locking is a valid topic in shared storage deployment, but is becoming less common as volume locking shared storage is becoming less common. The major difference is the ability to allow multiple users to write to the same volume at the same time (file locking) vs. having multiple users reading from the same volume, but only a single user writing (volume locking).

In its most basic definition, RAID is a group of drives working together. There are different types of RAIDs and in general, each type of RAID provides varying levels of three basic considerations: data protection (how many drives can fail without losing data), speed (how fast can the drives work together) and usable space (how much data storage can I actually use).

There are two predominant RAID types used in shared storage systems and two others that are good to be aware of. Start with the two most commonly used RAID types:

RAID5 was almost exclusively used prior to RAID6. Any RAID5 must contain at least three drives, however, RAID5 is usually set up with 4, 8, 12 or 16 drives. Many professionals recommend not using more than eight drives in a single RAID5 because it will increase the chances of losing data due to more drives in a RAID set that can fail.

Usable space in a RAID5 can be calculated by usable space = total space in RAID – total space of one drive. If one drive fails in a RAID5 set, the data is fine. If two drives fail in a RAID5 set, all data is lost. Setting up a RAID5 will take 30 minutes to 12 hours and rebuild times in the case of a drive failure are similar.

RAID6 has become more popular in the last few years. When first introduced, RAID6 was not commonly used because of performance issues and long rebuild times. Most companies have improved RAID6 technology to the point that this is a non-issue.

RAID6 provides similar read/write performance to RAID5 but with additional data protection because a RAID can lose two drives without any data loss. Usable space in a RAID5 can be calculated by usable space = total space in RAID – total space of two drives.

If one or two drives fail in a RAID6 set, data is fine. If three drives fail in a RAID6 set, all data is lost. Setting up a RAID6 will take 30 minutes to 12 hours and rebuild times in the case of a drive failure are similar.

RAID0 is almost never used for shared storage. It provides no data protection. RAID0 is only used in rare circumstances where performance is critical and data protection is not important. This was used more commonly in the early 2000’s when RAID5 and RAID6 performance was not as good as it is today. RAID0 is not recommended.

RAID1 is commonly used for small locally attached RAIDs or for system drives. Often servers will use RAID1 (also known as “mirroring”) on their system drives to ensure the OS is protected. RAID1 takes even numbers of disks and mirrors data across two drives at the same time. RAID1 provides fast reads, but writing performance is not as good.

Usable space in a RAID1 is equal to half of the total space of all drives in the RAID.

The number of drives that can fail without data loss is equal to half of the number of drives in the RAID, however (this is a big gotcha), although a RAID1 with more than two drives can allow for multiple drive failures without data loss. This is not guaranteed.

If a RAID1 has multiple drives fail you are subject to data loss depending on what drives fail. If two drives that are mirrored both fail you will lose all data. Generally speaking, RAID1 is only used two mirror two drives.

Disk drives are available in multiple protocols (sata vs. SAS) and at many different capacities. SAS drives typically spin at a faster speed than sata drives and are primarily designed for higher IOPS (input-output/second) requirements.

These are used in database servers and high-end transactional environments but generally are not necessary in video production. Fast bulk transfer of data is the requirement for video and sata drives are the best choice for this type of transfer with large capacities.

In sata configurations, enterprise drives provide little additional advantage for the additional cost (which can be significant). Enterprise drives have the exact same technical specifications as standard drives (RPM speed, system cache, transfer rate), however, they are selected off the production line because they have slightly less rotational vibration.

Drive companies offer them with longer warranties yet add a significant premium (sometimes two to three times the cost) of a standard drive.

There are a few more considerations to keep in mind. All shared storage servers should have power connected to a UPS (uninterrupted power supply). A UPS will ensure that if power is lost at your facility the server will not immediately lose power.

This gives the server enough time to flush data out of the controller cache and an admin or IT department time to login to the server and shut it down properly while power is restored.

This is critical because sudden loss of power can lead to data corruption on a file system or volume. Although a UPS does not guarantee protection from these issues, it will greatly reduce the chances of these issues.

If you are connecting to your workstations by switch or server with fiber cabling, chances are you will being using SFP’s, SFP+’s or XFP’s. These are all used for connecting cabling to HBA’s, switches and servers. Make sure your vendor is providing the correct number of each item to provide all connectivity needed.

These components range from $50-$200 (or more for long range needs) and if not spec’d properly, can be a “whoops” add-on cost of several hundred dollars.

Servers are large and deep. Be sure to measure the needed rack space to accommodate them. With very few exceptions, servers are also loud. Very loud. Don’t- edit-near- them-loud. When editing audio and video, it is critical to be able to clearly hear audio.

Most servers range from 75 to 85 decibels of output. Make sure that your server will be removed from the areas where people are working. This is often a server room or closet.

If you do not have a server room or closet there are some options for “quiet racks” which can dampen the sound enough to be in the same room. However plan on spending $2,000-$3,000 depending on your needs for a quiet rack.

There are two types of cables used in networking: copper and fiber. For all practical purposes copper cabling will use a connection known as RJ45 and fiber cabling will use a connection known as LC.

Most users are familiar with the copper RJ45 connection, it is the standard connector for Ethernet cabling. Most users are not familiar with the Fiber LC connector (unless you have networking experience).

In general, copper cables are more readily available, less expensive and more durable. Fiber cables are lighter and can go much longer distances. Within each type of cable there are different types or ratings of cable, each with different limitations for distance and speed. Do your research and choose the right cable for your installation.

Some final considerations that will assist you in your decisions:

How many concurrent users will be using the shared storage? More than just the number of connected users, concurrent users indicates the number of users that will be accessing the storage system at the same time. This is the true measurement of how capable your storage system must be.
How big are your storage needs now and in the future? Storage requirements for video can change rapidly and space is always used fast. Make sure you consider a system that can expand in the future without having to replace the base unit.
How will you backup and archive your data? Although shared storage systems can have built in redundancy and failure protection, you always need to back up your data to another device (tape, disk, etc.). Archiving allows you to move data to a secondary, typically less expensive storage device, so that your primary system remains freed up and fast enough for your primary tasks.
How much will it cost? Shared storage technology is rapidly changing. Make sure to do your technical and economic due diligence. It will pay off.

To download the entire white paper, click here.

You might also like...

IP Monitoring & Diagnostics With Command Line Tools: Part 6 - Advanced Command Line Tools

We continue our series with some small code examples that will make your monitoring and diagnostic scripts more robust and reliable

Building Software Defined Infrastructure: Monitoring Microservices

Breaking production systems into individual microservice based processors, requires monitoring over IP via RESTful APIs and a database system to capture the results.

Monitoring & Compliance In Broadcast: Monitoring QoS & QoE To Power Monetization

Measuring Quality of Experience (QoE) as perceived by viewers has become critical for monetization both from targeted advertising and direct content consumption.

IP Monitoring & Diagnostics With Command Line Tools: Part 5 - Using Shell Scripts

Shell scripts enable you to edit your diagnostic and monitoring commands into a script file so they can be repeated without needing to type them manually every time. Shell scripts also offer some unique and powerful features that help to…

Building Software Defined Infrastructure: Observability In Microservice Architecture

Building dynamic microservices based infrastructure introduces the potential for variable latency which brings new monitoring challenges that require an understanding of observability.