SparkSummitEast

Spark Summit East Conference
#theCUBE's live coverage of Spark Summit East 2017 from Boston 8-9 February 2017
Bert Latamore
#IBM #Spark Tech Center Principal Engineer Nick Pentreath live now on #theCUBE from Spark Summit East @DVellante @GGilbert41 http://bit.ly/2kOdAJ...
Bert Latamore
The #SPC was formed by #IBM a yr ago to focus on Open Source. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
Overarching goal is to drive adoption in enterprise customers & make #Spark enterprise-ready. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
#IBM invests in Open Source technologies that it sees as transformational. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
#IBM is backing #Spark as a next generation analytics platform. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
Machine learning is a key part of the mission. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
Before, we just dumped data into the data lakes & silos. Now what are we going to do with it? @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
To unlock the data you need intelligent systems, AI. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
We see machine learning as part of this. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
What are the early use cases? What needs to mature? @GGilbert41 #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
Lots of use cases for machine learning inc. recommendation engines. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
Making recommendations to online customers as to what they should buy is a classic. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
Fraud detection in financial services & enterprises is another classic use case. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
Now we have good models for these & other use cases in the Spark library. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
What is missing? A huge complex workflow in the end-to-end story. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
Feed multiple data streams into your database, then do the data science & then deploy the machine learning algorithm. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
Then U have to model the algorithm in the real world. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
What is missing is that end-to-end model. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
Fraud detection still has a lot of false positives. @DVellante #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
It is not magic that you throw machine learning at a database & everybody's happy. U have to fulfill the user's needs. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
Better data & fulfilling customer expectations are vital. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
The models are getting better with more & better data. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
It's always difficult to make decisions about timeframes. There is a long way to go. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
Time gap for predictions is down to real-time. Need better feedback, monitoring, end-user experience. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
There's a lot of work still to be done. Areas of active research in the academic field on improving these systems. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
Improving the quality of fraud detection -- we have a long way to go still. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
The enterprise-applied machine-learning problem has moved from the academic. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
#IBM has announced Watson Machine Learning to productionize the end-to-end machine learning platform. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
We're very focused on the open source side. My work is in the #Apache #Spark platform, not the #Watson side. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
When you operate at the scale of #IBM, Ur customers will find all the bugs. We try to make it better for everyone. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
We take all the feedback and centralize the fixes in the #Apache #Spark platform. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
We as a community need to decide whether we develop the functionality, adapt open standards or adopt other Open Source projects. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
#SystemML started as an #IBM research project. As a #SQL optimizer it decides how to optimize queries. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
Underlying execution engine is on #Spark. Can also run on #Hadoop. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
Millions of lines of code, very powerful engine. Lot of work still to be done for it to be usable in production systems. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
Most of the team is in San Francisco. I'm the only member in Cape Town, working remote. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
Some of the key things I picked up are related to deep learning on #Spark. Interesting work coming from #Intel. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
Every #SparkSummit there are now projects from the community. @MLNick #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
#Wikibon’s @DVellante & @GGilbert41 will do part 2 of their Big Data report coverage next on #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
#MapR Head of Partner Marketing Bill Peterson live now on #theCUBE from Spark Summit East @DVellante @GGilbert41 http://bit.ly/2kOdAJ...
Bert Latamore
The show's been great, we are getting a lot of deep technical questions. @thebillp #MapR #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
#MapR today is an enterprise software company that delivers a converged data platform.
Bert Latamore
A year ago we got out of the business of leading 100% with #Hadoop & went to the platform play. @thebillp #MapR #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
Then underneath we have been hardening all of it so it works out of the box. @thebillp #MapR #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
When I say that the lights go on in people's eyes. File system, NoSQL database, #Spark, streaming tool. @thebillp #MapR #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
Open Source is still a big part of our stuff. We help our customers keep up to date on their open source products, which is a pain for customers. @thebillp #MapR #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
For new business it gets us out of the #Hadoop only mode. We keep adding solutions to the unified data management platform. @thebillp #MapR #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
#Spark, #Hadoop, as a way to bring data into the Converged Data Platform. @thebillp #MapR #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
We talk now about converged applications. Putting historical & streaming data together with a converged application in the middle. @thebillp #MapR #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
The beauty of our file system along the bottom layer. Middle layer is Open Source tools. Above that is the data delivery system -- #Hadoop, #Spark, microservices. @thebillp #MapR #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
U can put 0-2 yrs data for instance in #SAP #HANA or #Spark, 2-5 yr data in Business Warehouse & 5+ yr data in Hadoop & query across all 3 levels. @thebillp #MapR #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
We're maturing in our messaging, in the level of people who we're joining, & in the number & volume of deals. @thebillp #MapR #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
Converged Partner Program has 3 levels. Elites are by invitation only. @thebillp #MapR #theCUBE http://bit.ly/2kOdAJ...
Bert Latamore
The middle layer includes a lot of medium sized vendors that we do a lot with. The Affiliates are partners we might pull into campaigns. Not a full go to market. @thebillp #MapR #theCUBE http://bit.ly/2kOdAJ...