SparkFree

ENTERPRISE DATA AND THE CLOUD
CrowdChat on bridging enterprise data to the cloud. Hosted by @IBMBigData
IBM Analytics
Question #7 : How can you measure the return on investment (ROI) from bridging your private data to public cloud data services?
IBM Analytics
Last 3 minutes left .... hurry up with your final thoughts on this question
Zeydy Ortiz, PhD
ROI: can we do more with less?
jameskobielus
Do you control IT spending by putting your more resource-intensive (storage, processing, etc) data apps in a public cloud vs. private cloud?
David Newberger
Data turn around time vs cost of crunching the data. Reliability of the data. Accuracy of analytics.
IBM Analytics
last minute.... thanking you all for your participation and comments .....
jameskobielus
Can you develop and deploy mission-critical data apps faster in a public cloud, leveraging pre-existing templates, than in a private cloud, and thereby deliver business results sooner?
IBM Analytics
What is a hybrid cloud data services architecture?
IBM Analytics
Please post your replies here
IBM Analytics
Waiting for your replies .... let us get this chat rolling!
Bob E. Hayes
hybrid cloud is when an entity has some data services hosted on premises and others hosted outside their company. The key is to have them work seamlessly together (e.g., one data view) in your analytics efforts.
jameskobielus
A hybrid cloud data services architecture involves deploying fit-to-purpose data repositories and analytics platforms in various roles in a unified architecture.
Zeydy Ortiz, PhD
Hybrid cloud data services leverage the resources of the private and public cloud to deliver services to the enterprise
David Newberger
A tiered data environment which bridges 2 services like #Hadoop and #Spark and can bridge public/private clouds too
jameskobielus
A hybrid cloud data services architecture may use a Hadoop platform for info refinery, a Spark platform for exploration, NoSQL for mobile/IoT data analytics, streaming analytics platform for low-latency, etc.
Tim Crawford
Hybrid #cloud data services architecture encompasses a number of delivery options that match the needs of many today.
jameskobielus
A hybrid cloud data services architecture may also combine public and private clouds into a unified environment.
Tim Crawford
Put another way: HCDSA is more inclusive vs exclusive.
Zeydy Ortiz, PhD
Nowadays, 'hybrid' is the way to go with technology - from delivery to software stack
jameskobielus
Essentially, a cloud data services architecture is hybrid when it has two or more zones, each with distinct roles within a common services model, but each zone is a different platform or is under different admin.
IBM Analytics
Time for question #2 - at the top of your screen!
Goran S. Milovanovic
Q1 Essentially, a mix of in-house, private cloud and a third party cloud provider...
IBM Analytics
Question #2 : What are the chief apps and benefits of a hybrid cloud data services architecture?
IBM Analytics
Please post your replies here
IBM Analytics
Looking forward to your views on question #2 please @bobehayes @jameskobielus @jf
Bob E. Hayes
Benefits are inclusiveness, openness, more data, perhaps even better models (because you're using more data/metrics).
David Newberger
Theability to mix and match per use case, persist data, and support scaling are what come to mind for me.
jameskobielus
The chief apps of a hybrid cloud data services architecture include processing/analyzing structured & unstructured data; supporting both batch and real-time data analytics in parallel; supporting both production and dev ops in parallel.
IBM Analytics
Great replies, requesting the participants to chime in with their views please
Zeydy Ortiz, PhD
Since performance is always on my mind, the big benefit is right-sizing your app: match the app resource needs to the right service that can provide it
Bob E. Hayes
an important application would be one that allows you to integrate disparate data sources.
jameskobielus
Benefits of hybrid cloud data services architecture include the flexibility to encompass diverse apps/data and leverage existing data/analytics investments while evolving to support new requirements.
David Newberger
Multi-tier data warehousing, Data Lakes and a Data hubs also come to mind as I think about it more.
IBM Analytics
@bobehayes Hey Bob, do you mind logging in from twitter? That way I can RT your replies through our handle?
Goran S. Milovanovic
The opportunity to provide data security and privacy by a delineation of what is stored and processed in the 3rd party cloud and what is kept in house
Bob E. Hayes
I did log in using Twitter. Let me try again.
Zeydy Ortiz, PhD
One of the use cases is IoT where you may need batch processing to build models on historical data, on-premise service to capture and process the streaming data, and cloud services to store the data for reporting
Bob E. Hayes
I hope it works this time.
IBM Analytics
Time for question #3 a the top of your screen
Marcus A. Noel
To me, it would be more flexibility while the kinks are being worked out
IBM Analytics
Question #4 : What are the best practices for bridging your advanced analytics and data lakes to the cloud?
IBM Analytics
Please post your replies here for question #4
IBM Analytics
We are half way through... please post your replies here :)
jameskobielus
Deploy your data lake and data-science exploration platform in a fully managed and secured cloud data services environment.
Tim Crawford
First: Don't inhibit your thinking w/ traditional architectures. New methods and architectures are needed.
Tim Crawford
Second: There is not a prescriptive, one-size-fits-all approach. Understand the requirements well.
David Newberger
Realize a lot of data when you start out will be full of junk.
John Furrier
the questions is how do you fill the data lake at scale
David Newberger
Realize that moving from traditional DW to a Data Lake will require a lot of rethinking and breaking of old habits.
jameskobielus
Use Spark for your adv analytics/data science dev tool and Hadoop/NoSQL as your data lakes platform(s). Make sure they share cloud-based bidirectional source connectors and data integration services to support flexible integration patterns.
David Newberger
@furrier but you don't want to fill the lake with bad data. You need to come up with a plan to ensure the right data is streaming into said data lake
Tim Crawford
@furrier Right! How you fill the lake, with what, how...and which lake.
Tim Crawford
@davidtc Old habits die hard. :)
John Furrier
@davidtc excellent point how to you scale meta data to manage the ability to pull data out of the lake to be used in value creation activities
David Newberger
@jameskobielus While I love #ApacheSpark it's not the right fit for some use cases and we all need to keep that in mind. They are making massive strides but one tool will not fit all
IBM Analytics
On that note - time for question #5 at the top of your screen
jameskobielus
Make sure that the advanced analytics development team--i..e, data scientists--and data lake team--i.e, data engineers--share a common development environment (e.g, DSX), collab/workflow, and learning/community resources.
Zeydy Ortiz, PhD
.@jameskobielus A common dev env for advanced analytics is very useful for enabling the collaboration needed