EntArch_VS_BigData

Architecture kills Big Data
If architecture is built properly, system behaves predictably. So it cuts noise by design.
Dmitry Golubev
Enterprises are looking for reducing costs, exploring new opportunities and getting stronger in competitive environment. Enterprise Architecture addresses all aspects through building capabilities, optimizing processes, using key company competencies.
Dmitry Golubev
Big Data promises to address the same areas: cost reduction, new opportunities, building unique capabilities which are difficult to repeat for competitors. So, there are should be some natural intersection of applying EA and Big Data
Dmitry Golubev
And from current interviews of #Hadoop and #Spark practitioners, we can clearly see that they talk about development to production time reduction, finding right messages to individuals (not market segments!), building processes involving #ML and #Streaming
Dmitry Golubev
Another popular item is simplifying and democratizing access to data, which is core element of Enterprise Architecture. Architecture Development Method (#ADM) is based on Business, Information and Technology architectural roadmaps.
Dmitry Golubev
#BigData as truly interruptive technology is promising to change all three architecture layers. Business is going to generate value new ways, available info will explode, technology will accommodate new platforms and ways to connect elements together.
Dmitry Golubev
So, why to put #EA vs #BigData instead #EA and #BigData? Problem is current vision of #BigData solutions and maturity of #EA in companies. Some way they live in parallel worlds and compete for the same resources.
Dmitry Golubev
Normal #EA development cycle takes around 5 years, which means that every company project follow roadmap leading to cost reduction, getting new opportunities or building unique capabilities.
Dmitry Golubev
#EA approach requires to follow company and industry standards, reuse architectural building blocks in Business, Information and Technology Architecture. And it requires to follow vendors reference architectures as well as common architectural patterns.
Dmitry Golubev
Now, we can clearly see that #BigData is not yet ready to provide value in this structured architectural approach. #Hadoop doers' presentations show one remarkable thing: success if happened is impossible to repeat without the same team and environment
Dmitry Golubev
It means that companies are just waiting for industry standard #BigData solutions from IT leaders. And of course, a lot of us are playing in #R&D trying to get new flavor, build custom solutions, overcoming limits of current technology stack.
Dmitry Golubev
Here I mean than different team will build totally different solution using different set of tools #Spark instead #MapReduce, #Kafka instead #Flink. And even integration with existing company tech is going to be different.
Dmitry Golubev
Is this flexibility helpful when we plan #TechArchitecture? Only after we have made extensive POC, have team in place, funds for internal development. In other cases, risk assessment is going to exclude #BigData from #EA building block for next iteration.
Dmitry Golubev
Let's consider a use case with credit card fraud. Is Big Data usage analysis a real solution or just right security architecture is an answer? Can using an advanced chip with pin instead magnetic strip kill most of fraud detection Big Data use case?
Dmitry Golubev
Specifically Big Data use case is based on user profile and transaction history. @ggilbert41 loves to use the example to talk about big data/smart data. In essence, it's calculation of one number (0-10) where 0 is clear, 10 is fraud, rest is in gray zone.
Dmitry Golubev
In accordance with @ggilbert41, time of calculation should be in sub-second to make sense for user experience. In general, it excludes using big datasets and limit calculation to rules' set, pre-calculated from Stat, ML, or some known cases.
Dmitry Golubev
This approach promises that we move from fraud detection after transaction done to prevention to not allow the transaction at all. For that point we are almost in field of authentication rather than authorization.
Dmitry Golubev
Authentication based on history and profile raises a lot of questions, tech and privacy based. Basically you are sort of punished for changing consumer behaviour or geography. It can be annoying SMS, phone calls and even blocked credit cards.
Dmitry Golubev
From the other side if we tighten the security, it can have some inconvenience to customer as well. It's much easier to tap card that use pin or sign the receipt. But security measured can be less intrusive and use more factors than credit card itself.
Dmitry Golubev
For example, face recognition module can add extra layer to authentication. Again, it only calculates number, but the same number can be stored in credit card for comparison. If it doesn't match, ask to enter pin. If pin doesn't match, block the card.
Dmitry Golubev
Image recognition can be done on client side; its cost including hardware goes down constantly (and economy on volume can be really huge). And prevention effect is going to be times more significant than Big Data fuzzy rules engine application.
Dmitry Golubev
This theoretical experiment is easy to estimate and it confirms that there are no substitutes for right Architecture. Right solution on design phase eliminates need of statistical analysis of deviations.
Dmitry Golubev
From that point of view, it means that Big Data Value should be discovered earlier during design phase. So, Big Data solutions should help with range uncertainty (rather than full uncertainty, "I don't know, what I don't know" @Crowdfather )
Dmitry Golubev
Building efficient Enterprise Architecture is based on companies know-hows, previous experience and statistics. It implies specific behaviour of elements including external ones. So, let's discuss why collect everything vs deviations from expected.
CrowdFather
If you don't know what you don't know then the architecture will be as good as your knowledge
Dmitry Golubev
@Crowdfather It is not only company knowledge, but industry knowledge and vendor's reference architectures and best practices. Not much degrees of freedom after that to discover with Big Data.
CrowdFather
this is a great conversation to have with the @wikibon community and analysts
John Furrier
getting the data gives you a baseline to work a new methodology like Agile vs waterfall development
John Furrier
Gartners BiModal IT is BS why would someone want to go slower
Dmitry Golubev
@furrier Can give you example from field: core telecom services should be reliable 99.999. So HA, DR, Fault Management should rely on architecture, not on "unpredicted" discovery in data
John Furrier
devops use cases use sandboxing to get agile into production its ops who need to enable speed
Dmitry Golubev
App development and Enterprise Architecture (it's company structure and processes, not only IT itself) have different life cycles, both can have benefits from Big Data. Healthcare provider with Data Lake is perfect example of changing EA with Hadoop.
Dmitry Golubev
Smart Algorithms (#DeepLearning for example) and #BigData are not the same. So including #deeplearning into #EA doesn't require having #BigData at all. But probably, some types of algorithm is impossible to get without #BigData.