StrataHadoop

StrataHadoop BigData Chat
Buncha Cool People Talking Big Data at Strata
   9 years ago
#StrataHadoopStrataHadoop BigData ChatBuncha Cool People Talking Big Data at Strata
   8 years ago
#StrataHadoop#BigDataNYC WeekBuncha Cool People Talking Big Data at Strata BigDataNYC
John Furrier
Q2: Why did Datawarehousing fail?
Dean of Big Data
great topic. Just getting ready to publish a blog on this very topic and the learnings for a Data Lake world
John Furrier
Data Warehouses that don't embrace the future will die
John Furrier
@schmarzo a warehouse can't float on a lake or ocean
Dean of Big Data
data warehouse concept will live, but the underlying infrastructure could change dramatically!
Robert Novak
I think data warehousing didn't fail, it just didn't scale. Lots of applications for DW and the ones that integrate with the modern data ecosystem can survive and transform.
John Furrier
adding new data sources is hard; lots of incomplete data makes for shit results
Robert Novak
And lost/missing/siloed data makes for incomplete results and lost efficiencies of scale. People are still learning how to find and use data.
John Furrier
data warehouses lacked the compute power both on prem and in the cloud to analyze petabytes of customer data
Robert Novak
I was talking to a @ciscodc customer who'd acquired a high-data-volume company and knew they wanted to use that data but had no idea how. Probably not uncommon among M&A situations.
John Furrier
@gallifreyan the elephant in the room you won't hear about at Orielly strata show is the nuances of Integration; Integration is the new "bar" that has been raised
Robert Novak
I wonder if the smaller conferences (from @USENIX and such) will pick up more of the hands-on integration technology discussions.
Dean of Big Data
I think data warehousing failed in at least one area - data proliferation with data silos.
Muddu Sudhakar
Datawarerhouse did not scale to large environments; hard to extract data to run algorithms on data.
Ash Parikh
great topic - it's not about data warehousing vs. big data - you need both - one answers what did I sell to my customers in the past - the other answers what I will be able to sell in the future - it's about making the best of both worlds IMHO
Ash Parikh
stats say that 70% of big data and data lake projects will remain at experimentation or fail - the issue is that data management for big data is an after thought or simply overlooked - data management is foundational whether data is big or small
Ash Parikh
here is an article I wrote recently that covers the big data journey - would love your thoughts - http://www.computerw...
Ash Parikh
it's about using the right tool for the right job - we see customers investing separately in data warehouses and in big data infrastructures at the same time - again, to answer different questions - successful ones think data management first
Muddu Sudhakar
Most of the Hadoop ecosystem products are features for Platform product or part of technology for Applications leveraging big data
Ash Parikh
@smuddu let me know if i interpreted your response correctly - stand-alone data preparation is not enough - it is a key capability in a comprehensive platform that provides end-to-end data management for big data and data lakes - thoughts?
Muddu Sudhakar
@parikhash @furrier data prep is feature in end-to-end platform product. Most hadoop vendors functionality seems like set of features and it will be customer responsibility to integrate these to solve their problems.
Ash Parikh
@smuddu thanks - i think you will find this article fun to read - http://www.computerw...
Rishi Yadav
what I see at clients is that they used datawarehosing because they did not have alrernative. Who wants to deal with mess of creating cubes when you can do everything in memory
Dave Vellante
EDW & BI = too rigid. they became insights for a few and those insights weren't operationalized at scale
Dave Vellante
certainly EDW failed to live up to it's vision and promise of a 360 degree view of customers in near real time
Muddu Sudhakar
@dvellante John Furrier Hadoop & BI = too rigid + too complex. Need packaged Solutions/Apps which can hide complexity of Hadoop/Spark and take away need consulting/PS services
Rishi Yadav
I am biased but in reality the most work is happening in integrating data sources. Hadoop/Spark etc just work. The challenge is in connectors. Lets take case of deployments on AWS. Consulting is not needed to get started but to reduce latency
Dmitry Golubev
EDW are failing due to complexity mainly. Too many interfaces between systems, and business logic is hidden in Apps. The result is difficult impact assessments and painful changes. Big Data is even worse in this sense.
Bharath Aleti
..we just have to look at the past .. databases were ubiquitous, bcz users had a whole slew of apps that could leverage the underlying data infra. Big Data requires a similar vertical ecosystem, to avoid the pain cited by @parikhash
Annika Jimenez
Data Warehousing is too BI-centric, too rigid for new data ingest, too ETL-dependent, too difficult to enable access, too expensive. The Big Data arena has shifted to agility in discovery, rapid access enablement, data/insights-enable apps.
Bert Latamore
@Furrier, @PLBurris & @GGilbert41 live now on #theCUBE http://bit.ly/1xw34F...
Bert Latamore
Join the conversation on CrowdChat @ http://bit.ly/1Rpmqp...
Bert Latamore
This is where the value is created. @Furrier #theCUBE
Bert Latamore
Big Data is where the value gets created - or not. That's the other side. @PLBurris #theCUBE
Bert Latamore
Companies that suffered the initial failures are now driving business change. @PLBurris #theCUBE
John Furrier
analyst segment on #theCUBE right now #bigdataweek #bigdataSV with Peter Burris & George Gillbert
Bert Latamore
As folks pivot from worrying about the technology to worrying about the problems the technology needs to solve. @PLBurris #theCUBE
Bert Latamore
Leaders are identifying patterns of usage enabling us to go after new classes of business problems. @PLBurris #theCUBE
Bert Latamore
Data warehousing was a relatively successful way to go after new classes of questions. @PLBurris #theCUBE
Bert Latamore
We found the limits of how that would work. @PLBurris #theCUBE
Bert Latamore
A new class of technology is created to take on questions that we might not have known in advance. That's what we have done for the last decade. @PLBurris #theCUBE
Bert Latamore
Hadoop is moving into its teen years. How do we allow them to go their own way without leaving them to go the wrong way? @PLBurris #theCUBE
Bert Latamore
The data is where the action is. This is where the battlegdround will be in software. @Furrier #theCUBE
Bert Latamore
For 50 yrs had systems of record automating internal processes. @GGilbert41 #theCUBE
Bert Latamore
The new class of apps relates to external-facing applciations. @GGilbert41 #theCUBE
Bert Latamore
We don't know how to codify exactly how that should work. We use daeta to anticipate what's likely to happen. @GGilbert41 #theCUBE
Bert Latamore
We have data that we accumulate in the enterprise that guides how it might engage & customize each customer interaction. @GGilbert41 #theCUBE
Bert Latamore
We don't know out unknowns yet, so we can't codify them. @GGilbert41 #theCUBE
Bert Latamore
if data is coming in so fast, you'll never be fully complete. @Furrier #theCUBE
Bert Latamore
essential to recognize the relationship between data coming in, analysis & learning, & then going out to seek more because we now have new questions. @PLBurris #theCUBE
Bert Latamore
As I find what I don't know we're constantly seeking new data sources. @PLBurris #theCUBE
Bert Latamore
hadoop took shape at Yahoo! when they ahd huge dadta warehouses & questions they couldn't answer. They had to unravel the pipelines and then rewrite the data warehouse. @GGilbert41 #theCUBE
Bert Latamore
That's what people are experimenting with in data lakes today. @GGilbert41 #theCUBE
Bert Latamore
Now I have real time data & impact on my business. Operationalizing it is the big challenge. @Furrier #theCUBE
Bert Latamore
If you cant operationalize it you can't use it in the business. @PLBurris #theCUBE
John Furrier
Data is Digital Capital says Peter Burris on #theCUBE
Bert Latamore
Data is your capital in the realm of digital business. @PLBurris #theCUBE
Bert Latamore
The tension will be how to bring in new disciplines we learned from data warehousing & Hadoop while facilitates more individuals creating business value from the results. @PLBurris #theCUBE
Bert Latamore
Tomorrow we will introduce a collection of reports on the forecast of the Big Data marketplace. @GGilbert41 authored most of those.
Bert Latamore
The two big questions is how does data and analytics change to take on new big questions. @PLBurris #theCUBE
Bert Latamore
An ad hoc process for capturing and cleaning the data, creating the models & embedding them in operational apps. @GGilbert41 #theCUBE
Bert Latamore
As we get further down the experience curve3 there's a wonderful study by McKinsey that we will be short 2 million data scientists in 5 yrs. @GGilbert41 #theCUBE
Bert Latamore
They are missing the point that the tools are evolving to run these data pipelines. @GGilbert41 #theCUBE
Bert Latamore
Once the tools improve we can worry about building better apps. @GGilbert41 #theCUBE
Bert Latamore
Go from managing processes to collaborating with people upstream & downstream. @GGilbert41 #theCUBE
Bert Latamore
What is the link in the past to data value? @Furrier #theCUBE
Bert Latamore
It's up in the air. We will spend time to answer that question with some clarity. @PLBurris #theCUBE
Bert Latamore
How will the vendors respond & what will the toolsets look like? @PLBurris #theCUBE
Bert Latamore
The role that developers play in creating new developments, methods, tooling to create new value will be a crucial issue. @PLBurris #theCUBE
Bert Latamore
Do the new insights & data sources change the value side? @Furrier #theCUBE
Bert Latamore
Ge, P&W no longer sell jet engines, they sell flying time on those engines. @PLBurris #theCUBE
Bert Latamore
If you have data, transparency, open source you change how people are judged. @Furrier #theCUBE
Bert Latamore
The stock market is one of the most mature Big Data systems out there. @PLBurris #theCUBE