Backstage Blog

RSS logo

You're browsing posts of the category Big Data

Data pipelines with Apache Crunch and Java 8

June 1st, 2016 by David Whiting

With Java 8 now in the mainstream, Scala and Clojure are no longer the only choices to develop readable, functional code for big data technology on the JVM. In this post we see how SoundCloud is leveraging Apache Crunch and the new Crunch Lambda module to do the high-volume data processing tasks which are essential at early stages in our batch data pipeline efficiently, robustly and simply in Java 8.