sgursozlu - Fotolia
The project – which cost £1.5m overall – has massively reduced the management overhead incurred by supporting the NAS infrastructure and provided scalable storage for future growth, with built-in disaster recovery (DR) provision.
The Milton Keynes-based TSC holds numerous transport information datasets and also allows clients to use its processing resources to help develop innovative transport systems. Data is often unstructured and the volume of data it stores can increase and decrease according to the projects being run at any given time.
It had had NetApp NAS in place, but this was nearing end of life and presented challenges, said Alex Farr, IT director with TSC.
“The challenge was that the storage was beyond support from NetApp and was becoming expensive to add shelves,” he said. “It wasn’t scalable and although in theory it was scale-out, it didn’t lend itself well to this.”
TSC spoke to HPE via a recommendation and eventually deployed 10 HPE Apollo 4200 Gen 9 server nodes with three Scality object storage nodes on HPE hardware across two sites at a cost of about £800,000.
It uses these to run VMware virtual machines – about 46 at the time of writing – and a Hadoop analytics cluster. It is in the process of building an SQL database farm.
Read more on object storage
- We recap the key attributes of file and block storage access and the pros and cons of object storage, a method that offers key benefits but also drawbacks compared with SAN and NAS.
- NAS and object storage offer highly scalable file storage for large volumes of unstructured data, but which is right for your environment?
Key benefits are that TSC now has an infrastructure that can easily support its constantly-changing datasets and can scale up easily to multiple petabytes if needed. It has also saved a lot of time administering storage and has simplified backup and disaster recovery.
The latter is down to Scality’s uses of erasure coding and replication, which ensure constant data protection is in operation between nodes at different sites.
Farr said: “We can, in theory, lose the head office and part of our disaster recovery site and the service can still resume. The way the data moves around the boxes, we can lose 75% of it and keep going. It’s DR by design.”
Object storage does not provide the highest performance with regard to access times compared with high-end block and file storage methods, but is well suited to large volumes of unstructured data.
In place of the traditional tree-like file system of NAS, object storage uses a flat structure, with files given unique identifiers.
So why did TSC go for object storage? “We are very much in the dark about what types and formats of data we will have to deal with, but it is often unstructured and we knew we needed to pump it around our system and that it needed to be secure in disaster recovery,” said Farr.