2,655 results for: hadoop

Definition: Hadoop

Hadoop is an open source distributed processing framework that manages data processing and storage for ...

Swim fast with a Hadoop data lake architecture -- or sink

By Craig Stedman 31 Aug 2015

The Hadoop data lake concept presents plenty of challenges for organizations. But the experiences of early adopters point the way toward successful data lake architecture deployments.

Don't throw out design principles when jumping in Hadoop data lake

By Jack Vaughan 13 Aug 2015

In a Q&A, data warehousing expert Joe Caserta explains why a new generation of developers building Hadoop clusters and other big data systems may need an introduction to some fundamental rules of ETL.

Take measured steps to build a Hadoop cluster architecture

By Jack Vaughan 07 Aug 2015

RelayHealth's Raheem Daya described the path he took to deploy and expand a Hadoop cluster for distributed data processing during a presentation at the 2015 TDWI conference in Boston.

Panera meets big data challenge of lunchtime operations

By Jack Vaughan 28 Nov 2017

In the era of more and more digital orders, Panera Bread encountered big data challenges that led the restaurant chain to deploy a new cluster architecture with Hadoop, Spark and other technologies.

What are ‘mature’ stateful applications?

By Adrian Bridgwater 18 Jul 2018

BlueK8s is a new open source Kubernetes initiative from ‘big data workloads’ company BlueData -- the project’s direction leads us to learn a little about which direction containerised cloud-centric ...

Hadoop: the Third Wave Breaks

By Barry Devlin 26 Jun 2014

Although the yellow elephant continues to trample all over the world of Information Management, it is becoming increasingly difficult to say where more traditional technologies end and Hadoop ...

Top 10 information management stories of 2017

By Brian McKenna 29 Dec 2017

In 2017, there has been some convergence of old and new schools of data management and BI/analytics – as with financial services firms’ data strategies that combine data stores and blend types of analytics.

OpenStack storage update 2018: After Rocky

By Chris Evans 26 Sep 2018

Snapshots and container storage support are among recently added functionality in the open source private cloud platform, but vendor driver support for OpenStack is not universal.

Hadoop 2 query: Less talk, more action with new Hadoop version?

By Craig Stedman 13 Mar 2014

For all the hype about Hadoop, adoption remains relatively low. But the Hadoop 2 release could give prospective users more reasons to move forward.

Dell links with Syncsort to tune Cloudera Hadoop for offloaded ETL

By Jack Vaughan 27 Oct 2015

Dell and others have a new ETL reference architecture. Its purpose is to ease migrations to Cloudera Hadoop. Also: Dell buys EMC; Syncsort is acquired.