Today's episode is all about data engineering, particularly the tools and techniques that data scientists should know. Our guests are Matthew Housley and Joe Reis, co-authors of the book "Fundamentals of Data Engineering".
Matt and Joe:
• Co-authored the brand-new "Fundamentals of Data Engineering" book that was published by O'Reilly Media and is already a bestseller.
• Co-founded the data architecture and data engineering consultancy Ternary Data. Joe is CEO of the firm while Matt is CTO.
In addition, Joe:
• Is an adjunct professor at the University of Utah.
• Previously founded several tech companies and has held both software engineering and data science roles.
• Holds a math degree from the University of Utah.
Matt:
• Holds a PhD in math from the University of Utah.
• Worked as a professor before becoming a data scientist in industry.
Today’s episode will appeal primarily to technical experts like data scientists and data engineers, but will also be of interest to anyone who manages technology projects that involve data flows.
In this episode, Matt and Joe detail:
• Why they identify as “recovering data scientists”.
• What kinds of people tend to become data scientists versus what kinds tend to become data engineers.
• Key components of their book such as latency trade-offs and the six data engineering undercurrents.
• Their favorite data engineering tools and techniques.
• What the Live Data Stack is and how it’s putting various data professional titles on a collision course.
• The biggest data engineering problems firms face and how to fix them.
The SuperDataScience show is available on all major podcasting platforms, on YouTube, and at SuperDataScience.com.