Managing data storage is ever more complex. IT teams have to wrestle with local, direct-attached storage, storage area networks, network attached storage and cloud storage volumes.
They might be deploying hyper-converged systems, or using on-premise implementations of cloud storage technology. And they are likely to have several, incompatible storage protocols at work, especially for unstructured data.
And all of this is happening as the business is demanding more from its data.
This is driving growing interest in global file systems, sometimes also known as distributed file systems.
Global file systems are not new. Back in the 1980s, Carnegie Mellon University developed its Andrew File System, or AFS, which is still in use today. But since then, commercial suppliers have taken the concept further and applied it across cloud and on-premise storage.
Not just object storage
Bridging the gap between on-premise and cloud storage promises to simplify IT management and cut costs.
Cloud storage is, by its nature, distributed. The end-user does not know, or need to know, where their data is stored (apart from any compliance-based limitations). Cloud suppliers use object storage technology to split data across multiple servers and even multiple datacentres, to achieve economies of scale.
But most operating systems and applications cannot read and write to object storage directly. They expect to see network protocols such as NFS or SMB, or access storage directly or via a SAN. Although suppliers have created storage gateways, and more applications are compatible with object storage (such as Oracle and Openstack), there are still plenty of applications that do not.
Global file systems could be the answer. They offer the flexibility, resilience and capacity of the cloud, but retain the simplicity – at least for applications and operating systems – of NAS.
Read more on file system storage
- Five on-premise and cloud options for network-attached storage. We look at five options for file access storage, from ‘traditional’ NAS in a standalone appliance to distributed hybrid cloud file storage and NAS cloud gateways.
- Five things you need to know about cloud file services. We look at cloud file services, which provide a file system-like architecture that gives users access to file data wherever they are, just as if it were a local NAS filer.
“Most organisations of any size will have data stored in a variety of places and file formats, which can make it very challenging to find and use,” says Bryan Betts, principal analyst at Freeform Dynamics. “Putting a global file system over the top means all your data is equally visible to everyone with access rights, in a single standard format – a virtualised ‘super-filesystem’, if you like.”
Benefits include the economies of scale of the cloud, better redundancy than can usually be achieved from on-premise systems, the ability to add (or remove) storage capacity quickly, and a move from capital spending to operating expenditure.
Other pluses of enterprise file sharing services include easier collaboration and, potentially, better security.
Further down the line, though, firms could even move to a single file system that spans on-premise hardware – where latency and performance are critical – and cloud-based applications.
Common features of global file systems
Global file systems work by combining a central file service – typically on public or private clouds – with local network hardware for caching and to ensure application compatibility. They do this by placing all the storage in a single namespace. This will be the single, “gold” copy of all data.
Caching and synching is needed to ensure performance. According to CTERA, one of the suppliers in the space, a large enterprise could be moving more than 30TB of data per site.
Secondly, the system needs broad compatibility. The global file system needs to support migration from legacy, on-premise, NAS hardware. Operating systems and applications need to be able to access the global file system as easily as they did previously with NFS or SMB.
The system also needs to be easy to use, ideally transparent to end-users, and able to scale. Few firms will be able to move everything to a new file system at once, so a global file system that can grow as applications move to it, is vital.
Global file systems or file and sync?
As a cloud-based service, global file systems appeal to organisations that need to share information between sites – or with users outside the business perimeter in use cases that were often bolstered during the pandemic.
This, however, leads to overlaps between the capabilities of the global file system, and conventional file-and-sync services. These include the more consumer-oriented services such as Dropbox and OneDrive, often pressed into service to support remote working during Covid-19, as well as SharePoint, Google Drive and enterprise-grade sharing services.
Some global file system suppliers stress that they, too, can provide these services. Certainly, being able to share files externally, or extend desktop search out to cloud-based files, is useful. For most enterprises, however, basic performance, compatibility and ease of migration are likely to rank higher.
“The challenges are, of course, that this can get very big, and if your data is globally distributed, you, or your global file system developer, need to decide how you will deal with things like file locking – to prevent two people or systems updating the same data at the same time – and replication,” says Freeform Dynamics’ Betts.
Vendor global file system offers
CTERA works through a combination of Edge, a caching filer, CTERA Drive, an agent for endpoint devices, and VDI for virtual workspaces. The global file system then links customer files to Amazon S3, Azure, or cloud provision from IBM, HPE, Dell EMC and Scality.
CTERA says its caching technology allows for infinite capacity. Also, it provides file sharing services and device-level access from Windows, Mac, Linux, Android and iOS.
Nasuni bills itself as a “cloud native” storage business, with its global file system UniFS as the building block. It supports NAS and file server consolidation, backup and recovery, DR and collaboration tools.
Nasuni supports global file locking, so two users cannot write to a file at the same time, and provides edge appliances for NFS and SMB services, as well as deduplication. Or users can install VMs on their virtual or hyperconverged infrastructure in place of NAS hardware. Nasuni claims that firms that deploy its global file system no longer need separate backup or DR.
Panzura positions itself as a data management supplier, and a single platform for unstructured data. Its core global file system is CloudFS, which creates a single, optimised dataset. The firm describes it as a global, hybrid cloud file system.
Panzura claims this not only replaces NAS hardware, but provides data management and resilient storage. The service also has direct support for mobile devices, which sync directly to the cloud. The company also offers cloud block storage.
Hammerspace focuses on providing customers with a single view of their metadata. This, it says, allows data to be stored and accessed efficiently. It even claims to be able to predict which data needs to move, ahead of time.
Hammerspace’s strapline is “Local access to global data”, through a global data environment. This brings together data from the cloud, datacentres and the edge into one dataset. The global data environment then provides access, along with ancillary services including data tiering, file versioning, deduplication, snapshots and anti-virus. It can access cloud, datacentre and legacy DAS, SAN and NAS filestores, with one licence covering all forms of storage.
Peer provides a global file service with hybrid and multicloud support, and support for the edge and datacentres. PeerGFS is software only, with active-active sync, a global namespace using Microsoft DFSN, and object storage integration. This supports S3 and Azure Blob. The Peer Agent runs on Windows File Servers or Windows VMs, but users retain direct access to their data. PeerGFS also supports cloud backup and replication.