
John Furrier27















Q5: What about the impact of #Spark on this mega trend of Systems of Intelligence? the pre chat poll was: a) it will turbo charge hadoop; 2) disrupt hadoop 3) neutral 4) no impact?

George Gilbert
#ApacheSpark makes a lot of analytics easier and faster by running different workloads on the same engine. Still needs more performance improvements. #IBM contributions could really change things

David Floyer
Horses for courses; Hadoop is batch and most efficient/greatest throughout. Spark is microbatch, gets an answer quicker, but less efficient/slower throughout.

Dave Vellante
@dfloyer Is Hadoop the new tape :-)

David Wild
Means you can develop for scale but still work on tab files

George Gilbert
@dfloyer Project Tungsten for #ApacheSpark should get them to pure streaming but that will never completely replace need for batch and #Hadoop

Rodrigo Gazzaneo
@dvellante HDFS is the new long term storage media Mapreduce and Spark can read from

Jen(Cohen)Cheplick
@ggilbert41 Completely agree. There is a role for both

David Floyer
Infostreams is real-time, can be supported in development by Hadoop and Spark

Rodrigo Gazzaneo
@KirkDBorne loved the Twin Towers analogy! #LOTR

George Gilbert
@KirkDBorne the debate about running all analytics on fast, in-memory (i.e. Spark) is likely misleading

George Gilbert
@KirkDBorne the analysis around throughput / volume is likely to have different logic than analysis around per event updates

Rodrigo Gazzaneo
@ggilbert41 Memory x Flash x Disk is a matter of cost per capacity and potential revenue from insight

Kevin Petrie
@ggilbert41 Great SoI deck. Can you elaborate on why Spark is at "slow" end of innovation axis on Slide 24? Would think it is high innovation level

Jeff Frick
@dvellante > Hadoop = Tape - Love the analogy. What is "spinning rust?" in this model?

George Gilbert
@KevinPetrieTech good question: it was a tough call - but having the wild west ecosystem of databases or even the Hadoop ecosystem means each component can evolve independently at own pace. Spark libraries must evolve to integrate with each other

Kevin Petrie
@ggilbert41 Got it. Spark has arisen quickly, but nature of Spark libraries throttles future innovation vs. other platforms