A group of IT suppliers in the US has launched an “open data platform” association to boost big data technology.
This comes at a time when big data analytics projects seem to be gaining ground in corporate IT. As an instance, big data analytics and management emerged as a stronger area of focus in the annual TechTarget/Computer Weekly IT Spending Priorities survey for 2015 than in 2014. 30% of respondents globally said they were undertaking big data initiatives in 2015, while the figure for Europe was 26%, and for the UK 21%. The global figure for the previous year was 17%.
Shaun Connolly, vice president of corporate strategy, at Hadoop distributor Hortonworks said: “The Open Data Platform initiative will rally both enterprise end users and vendors around a well-defined common core platform against which big data solutions can be qualified.”
Hortonworks joins GE Software, IBM, Infosys, Pivotal, SAS, AltiScale, Capgemini, CenturyLink, EMC, Splunk, Verizon Enterprise Solutions, Teradata, and VMware in declaring an intent to create an industry association, dubbed the Open Data Platform.
The ODP will promote the Apache Hadoop “ecosystem”. It will provide access to a tested reference core of Apache Hadoop, Apache Ambari and related Apache source “artefacts”.
Members will collaborate on Apache projects as well as other open source-licensed big data projects with “a goal toward meeting enterprise class requirements.”
Beth Smith, General Manager, Analytic Platform, IBM said: “As a founding member of the Open Data Platform Initiative, we're excited to be part of the expanding community collaborating to shape and promote open source standard technologies for Hadoop”.
Coming at big data from the analytics side, Craig Rubendall, vice president of platform R&D, SAS said that his company is committed to innovation in big data analytics and to "providing high-quality software that our customers can count on”. SAS founder and CEO Jim Goodnight said, in October last year, that the company will expand its portfolio that supports the Hadoop Distributed File System, which he characterised as “resilient” and well suited to analytic exploration by SAS technology.
“It’s a great combination. We are working with Hadoop distributors Cloudera, HortonWorks, and so on,” he said.
On the services side, Navin Budhiraja, head architecture and technology, Infosys said that the Open Data Platform would create “an ecosystem” that would preserve “the rapid innovation cycles of open source software, while still providing the benefits of broad vendor support and interoperability.”
And, from the world of data warehousing, sometimes seen as a technology to be superseded by Hadoop-based “data lakes”, Scott Gnau, president, Teradata Labs said: “The Open Data Platform movement resonates well at a time when businesses are challenged by the pace of change and the exploding number of options for data management. We are pleased to join the Open Data Platform with organizations that are engaged as Teradata partners – working with us to provide integrated, unified data ecosystems”.
Read more about Hadoop
- What is Hadoop?
- It's been nearly 10 years since Google published MapReduce: Simplified Data Processing on Large Clusters. The seminal paper, which describes first "mapping" or sorting the data and then "reducing" or summarising it, altered the data processing landscape, helping pave the way for Apache Hadoop, the popular open-source, distributed computing system.
- The changes in architecture brought on by Hadoop 2 can be expected to usher in a new round of innovative software. A prime contender appears to be Spark, which, in February 2014, went from ''Apache Incubator'' to become a top-level Apache Foundation project.