Prepare to have your brain tickled by Dr. Julia Silge. In today's episode, Julia details the IDE she's been developing for data scientists, "Tidy" NLP, and open-source libraries that make MLOps a breeze.
More on Julia:
• Engineering Manager at Posit PBC (makers of RStudio... and the company formerly known as RStudio).
• Authored the bestselling O'Reilly books “Text Mining with R” and “Tidy Modeling with R".
• Previously worked as a Data Scientist at Stack Overflow and Datassist.
• Prior to joining industry, was an academic researcher and professor at Yale University.
• Holds a PhD in Astronomy from The University of Texas at Austin.
Today’s episode will probably appeal most to hands-on practitioners like data scientists, software developers and ML engineers. In it, Julia details:
• The brand-new IDE Positron (free to use and source-available) that she’s been developing.
• Her favorite LLMs for code generation.
• The open-source software libraries that make MLOps easy.
• Her top tips for effective Natural Language Processing, including when more traditional NLP techniques should be used instead of an LLM.
The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.