Pentaho Docker utilities: massage & pedicure for software footprints

Pentaho has tabled a set of Docker open source utilities intended to help simplify big data analytics. Emanating from its Pentaho Labs division, this containerised open source platform is available through the Pentaho Server. So what is actually happening here?

Cloud complexity loves automated & scriptable

As the complexity of application deployments (many via cloud, many being big data analytics focused) increases, developers must work rapidly to add new software components.

We could almost say that the ‘software footprint’ of the application itself has to spread and so imbibe, consume, configure, integrate, absorb and reflect all the new componentry it is being fed. What this means is that many developers are shifting away from traditional installation methods in favour of more automated and scriptable approaches.

Jules Winnfield: "Now don't you be tellin' me about no application massages, I'm the big data engineering master." Image Source: YouTube

In software application architecture terms, what Pentaho wants to help achieve is the power of analytics within application containers.

“The open source foundation of Pentaho allows quick innovation with emerging technologies, so that we may remain autonomous and provide flexibility to developers when it comes to adapting Pentaho solutions into their development and test environments,” said Ken Wood, vice president of Pentaho Labs, Pentaho, a Hitachi Group Company. “Developers can leverage a scalable testing and development environment to push boundaries in the areas such as IoT, machine learning and predictive analytics.”

Matt Aslett, research director for data platforms and analytics at 451 Research, explains that while Docker containers by themselves are not designed to be persistent, the interest in containerisation is such that we are already seeing enterprises investigating the potential to run data-intensive workloads in container environments.

“Containers are perfect for enterprise applications that require flexibility and portability. As organizations continue to expand on their development and deployment environments, vendors that are quick to adapt and support data-intensive workloads on containers will get a head start on the competition,” concludes 451 Research’s Aslett.

Pentaho’s Docker utilities are currently available on GitHub.