Episode
Armchair Architects: Is Big Data Turning into Dark Data?
with David Blank-Edelman, Eric Charran, Uli Homann
In this episode of the Azure Enablement Show, Uli and Eric, our Armchair Architects, talk with David about the concept of Dark Data: What is it? What patterns can lead to it? And how can you come back to good data? Uli shares a couple of examples of how business goals dictate how big data is used.
Chapters
- 00:00 - Introduction
- 01:07 - What do we mean by “dark data”?
- 02:31 - A data lake becomes a data swamp when the data has no value
- 03:35 - Uli uses Bing to illustrate a point about how the business objective drives how you use big data
- 05:23 - Uli cites another example (thyssenkrupp elevators) of the business objective driving the way that big data was used
- 07:52 - What are some of the patterns, like ETL or ELT, that can lead to dark data?
- 09:52 - Is schema needed if there is AI pattern detection, but no human interaction?
- 11:54 - ETL and ELT defined
- 14:50 - How do you go from good data to bad data and back to good data again?
Recommended resources
- What is Azure Data Lake Analytics?
- What is Extract, Transform, and Load (ETL)?
- What is Extract, Load, and Transform (ELT)?
- What does it mean to build a single source of truth?
- Microsoft Blog: thyssenkrupp embraces digital transformation for a better future
Related episodes
- Armchair Architects: How architecture is changing - Machine Learning
- To watch more episodes in the Well-Architected series, check out this playlist
- To watch more episodes in the Armchair Architects series, check out this playlist
Connect
- David Blank-Edelman | Twitter: @otterbook
- Eric Charran | Twitter: @mougue
In this episode of the Azure Enablement Show, Uli and Eric, our Armchair Architects, talk with David about the concept of Dark Data: What is it? What patterns can lead to it? And how can you come back to good data? Uli shares a couple of examples of how business goals dictate how big data is used.
Chapters
- 00:00 - Introduction
- 01:07 - What do we mean by “dark data”?
- 02:31 - A data lake becomes a data swamp when the data has no value
- 03:35 - Uli uses Bing to illustrate a point about how the business objective drives how you use big data
- 05:23 - Uli cites another example (thyssenkrupp elevators) of the business objective driving the way that big data was used
- 07:52 - What are some of the patterns, like ETL or ELT, that can lead to dark data?
- 09:52 - Is schema needed if there is AI pattern detection, but no human interaction?
- 11:54 - ETL and ELT defined
- 14:50 - How do you go from good data to bad data and back to good data again?
Recommended resources
- What is Azure Data Lake Analytics?
- What is Extract, Transform, and Load (ETL)?
- What is Extract, Load, and Transform (ELT)?
- What does it mean to build a single source of truth?
- Microsoft Blog: thyssenkrupp embraces digital transformation for a better future
Related episodes
- Armchair Architects: How architecture is changing - Machine Learning
- To watch more episodes in the Well-Architected series, check out this playlist
- To watch more episodes in the Armchair Architects series, check out this playlist
Connect
- David Blank-Edelman | Twitter: @otterbook
- Eric Charran | Twitter: @mougue
Have feedback? Submit an issue here.