Session: Decreasing Cost and Increasing Stability with 100% Streaming Data
With the proliferation of microservices and exploding usage of edge devices, data streams have quickly gone from an emerging technology to a staple in organizational data solutions. The big players (Google, Microsoft, Amazon and more) have developed out of the box platforms for comprehensive data streaming. Some are open source while other are proprietary, some managed and others self-hosted but all are ramping up technologies quickly to serve this growing need.
Most often, data streams are discussed as a piece of a larger data collection strategy. Data streams focus on high volume, event driven services that need to be available for analysis as close as possible to the moment the event happened. Modern data warehousing instead is used for slowly changing dimensions and less urgent events that can be loaded in batch on a schedule or from a trigger. For the past few years these two strategies have been used as a hybrid approach to a more traditional data warehouse in an attempt to serve both these use cases.
What if instead, the singular approach to a new way of working with data was 100% streaming? In lieu of strictly defined batch files which can result in concurrency issues, long run times and other problems resulting from the larger grouping of data, all information would be event-based and streaming.
During this session we look at the current state of streaming, it’s relationship with modern data warehousing, and whether a future of 100% streaming is the direction our industry should be moving.
- Understand the basics of streaming and a variety of tools used for streaming
- Discuss current widespread hybrid use cases (data warehousing with streaming infrastructures)
- Have a clear view of what 100% streaming could look like
- Discuss potential use cases for 100% streaming infrastructure
- Compare modern data warehousing to a 100% streaming infrastructure
Amanda Darcangelo is a Senior Data & Analytics Consultant for CTI, a boutique data firm focused on strategy, building roadmaps, and implementing technology to support organizations as they build out their data capabilities.
Amanda has been in the data industry for over 8 years and has worked across a number of sectors including eCommerce, public sector, finance, healthcare, and marketing analytics. She has experience in analytics, data science, data engineering, and data communication.
Amanda is an advisory board member for the Ithaca College Cybersecurity Certificate and a co-founder of Salt City Data Community. She also has a BA in Political Science from SUNY Buffalo State and uses that when developing data policies within both private and public organizations, including the City of Syracuse Policy in Regards to the Use of Surveillance Technology.