Today's superhuman guest is Dr. Sebastian Raschka,, author of the bestselling "ML with PyTorch and sklearn" book, iconic technical blogger (>350k followers) and Staff Research Engineer at Lightning AI. Hear him detail open-source libraries for LLMs.
More on Sebastian:
• Is Staff Research Engineer at Lightning AI, the company behind the popular PyTorch Lightning open-source library for training and deploying PyTorch models, including Large Language Models (LLMs), with ease.
• Iconic technical blogger (50k subscribers) and social-media contributor (>350k combined followers across LinkedIn and Twitter)
• Was previously Assistant Professor of Statistics at University of Wisconsin-Madison.
• Holds a PhD in statistical data mining from Michigan State University.
Today’s episode is technical and will primarily be of interest to hands-on practitioners like data scientists, software developers and machine learning engineers.
In it, Sebastian details:
• The many super-helpful open-source libraries that PyTorch Lightning leads development of.
• Dora parameter-efficient fine-tuning.
• Google’s “open-source” Gemma models.
• Multi-query attention.
• The leading alternatives to RLHF.
• Where he sees the next big opportunities in LLM development.
The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.