Podcast: Play in new window | Download
http://traffic.libsyn.com/sedaily/Apache_Beam__Edited.mp3Podcast: Play in new window | Download Unbounded data streams create difficult challenges for our application architectures. The data never stops coming, and we are forced to assume that we will never know if or when we have seen all of our data. Some streaming systems give us the tools to deal partially with unbounded data streams, but we have to complement those streaming systems with batch processing, in a
“Benchmarks are all crap, but there are some benchmarks that are better than others.”
“We still need to see in the long run how much of community and industry adoption is there. Because at the end of the day, these are the single two most important things which define and determine the success of any platform.”
“My bet is that there is going to be a big shift towards streaming technologies in the future.”
Apache Flink is an open-source framework for distributed stream and batch data processing.