Dan started Practical AI with Chris Benson to create a podcast that focused on practical, hands-on AI applications, as opposed to overly hyped or theoretical discussions. They wanted to provide actionable insights that listeners could use in their daily work.
PredictionGuard addresses the challenges enterprises face when implementing generative AI technologies, such as data privacy, compliance, and the need for structured, consistent outputs. It provides tools for running AI models in a compliant manner and offers layers of control for structuring and validating model outputs.
Dan has observed the shift from traditional MLOps to LLMOps, the growing importance of multilingual and low-resource language models, and the increasing use of models to evaluate and generate data for training other models. He also notes the rise of AI engineering as a distinct skill set.
Dan's favorite episodes include those that focus on fully connected discussions between him and Chris Benson, such as episodes on ChatGPT, Stable Diffusion, and AlphaFold. He also highlights episodes on AI in Africa and the use of AI in low-resource scenarios.
The most popular episode is the one featuring Ville Tuulos discussing Metaflow, a Python package for full-stack data science developed at Netflix. The episode resonates with listeners because it addresses the challenges of moving from notebooks to production, which is a common struggle for data scientists.
Dan believes that 'prompt engineering' as a term is overhyped, but the engineering and operations around large language models are very real. He emphasizes the importance of understanding how to structure prompts, chain processes, and fine-tune models to achieve practical results.
Engineers transitioning into AI engineering face challenges with non-deterministic systems and the lack of control over model drift, as well as the need to explore the latent capabilities of models. They also need to adapt to the new workflows required for working with large language models.
Dan believes that AI UX is crucial and can make or break the adoption of AI technologies. He gives the example of ChatGPT, where the UX innovation played a significant role in its success. He also mentions GitHub Copilot as an example of how UX can enhance the integration of AI into software development.
Dan has observed trends towards using augmented tooling for fine-tuning models with human feedback and the increasing use of models to generate data for training other models. He also notes the challenges of data quality and the need to filter and curate datasets to improve model performance.
Dan is surprised by the generalizability of large language models beyond traditional NLP tasks. He found that these models could be applied to tasks like fraud detection without needing traditional statistical models, which he thought would take much longer to achieve.
Part 2 of our podcast feed swap weekend! Check out Cognitive Revolution) as well.
"Data" Dan Whitenack has been co-host of the Practical AI podcast for the past 5 years, covering full journey of the modern AI wave post Transformers.
He joined us in studio to talk about their origin story and highlight key learnings from past episodes, riff on the AI trends we are all seeing as AI practitioner-podcasters, and his passion for low-resource-everything!
Subscribe on the Changelog), RSS), Apple Podcasts), Twitter), Mastodon), and wherever fine podcasts are sold!
Show notes
Daniel Whitenack – Twitter), GitHub), Website)
Featured Latent Space episodes:
Benchmarks)
Reza Shabani)
MosaicML and MPT)
Segment Anything)
Mike Conover)
Featured Practical AI episodes:
From notebooks to Netflix scale with Metaflow)
Capabilities of LLMs 🤯)
ML at small organizations)
Prediction Guard)
Data Dan)
Timestamps
00:00) Welcome to Practical AI
01:16) Latent Space Podcast
04:00) Practical AI Podcast
06:20) Prediction Guard
08:05) Daniel's favorite episodes
10:21) Alessio's favorite episode
10:54) Swyx's favorite episode
12:44) Listener favorites
15:14) LLMOps
17:06) Reza Shabani
19:06) Benchmarks 101
20:06) Roboflow
21:38) Mode collapse
26:21) Rajiv Shah
28:01) Staying on top of things
33:11) Kirsten Lum
34:31) datadan.io
38:48) Prompt engineering
40:38) Unique challenges engineers face
42:51) AI-UX
45:31) NLP data sets
50:49) Unlabeled data sets
55:07) Lightning round!
55:20) What's already happened in AI?
56:27) Unsolved questions in AI
58:01) Get hands on
58:53) Outro
Transcript
Full transcript is over at the Changelog site)! Get full access to Latent Space at www.latent.space/subscribe)