2,656 Results for:hadoop

  • Sort by: 

DefinitionHadoop

Hadoop is an open source distributed processing framework that manages data processing and storage for ...Read More

Hadoop cluster capacity planning best practices

By Brien Posey 14 May 2018

Trying to calculate Hadoop cluster capacities isn't always straightforward. It's important for organizations to include IOPS and compression rates in their predictions. Read More

Cloudera and Hortonworks sink differences to merge

By Brian McKenna 04 Oct 2018

Hadoop distributors Hortonworks and Cloudera have sunk their differences and announced their marriage Read More

Apache Impala, a native analytics database for Hadoop

By Adrian Bridgwater 29 Nov 2017

The Apache Software Foundation (ASF) has graduated Apache Impala to become a Top-Level Project (TLP). Apache Impala itself is an analytic database for Apache Hadoop, the open source software ... Read More

Hadoop cluster configuration best practices streamline workflows

By Brien Posey 22 May 2018

Organizations that deal with a variety of Hadoop configurations can streamline workflows through baseline configuration, tests and site-specific configuration files. Read More

Hadoop data lake architecture tests IT on data integration

By Jack Vaughan 25 Jun 2018

Hortonworks users talk about building Hadoop data lakes to support new applications -- and the challenges their teams face on ingesting and refining data for end users. Read More

GlaxoSmithKline R&D creates data platform using Hadoop for the internal sharing of scientific data

By Karl Flinders 17 May 2018

GlaxoSmithKline is making better use of the data it has about the development and trials of medicines through a Hadoop-based platform Read More

Hortonworks Cloudera merger proposal stirs market pot

By Brian McKenna 05 Oct 2018

The proposed merger between Hadoop distributors Cloudera and Hortonworks stings MapR, and elicits analyst comment pointing up the threat from cloud players AWS, Google, and Microsoft Read More

Google TPUs open up on cloud; LinkedIn intros Hadoop Dynamometer

By Jack Vaughan 22 Feb 2018

In big data news, we find Google TPUs, or Tensor Processing Units, offered as a cloud service, while LinkedIn is open sourcing a Hadoop test simulator called Dynamometer. Read More

Big data tooling rolls with the changing seas of analytics

By Jack Vaughan 07 Sep 2018

Hadoop data tooling is expanding. A view holds that Hadoop is moving from alternate data warehousing to a full-fledged big data analytics offering. Read More

Cloud data warehouse makes inroads as users spurn admin tasks

By Jack Vaughan 01 Feb 2019

Overlooked in the run-up to Hadoop, data warehouses have found new life off premises. Cloud-based data warehouses find favor with teams that want to reduce warehouse administration. Read More