We're sunsetting PodQuest on 2025-07-28. Thank you for your support!

852: In Case You Missed It in December 2024

2025/1/10

Super Data Science: ML & AI Podcast with Jon Krohn

AI Deep Dive AI Insights AI Chapters Transcript

People

Andrew Ng

Ed Donner

Eiman Ebrahimi

Greg Epstein

Sadie St. Lawrence

Topics

Andrew Ng：在AI开发初期，不必过于担心LLM的成本，应优先构建有效的模型。可以使用最好的模型，构建有价值的产品，只有在成功之后，如果成本过高，再进行优化。可以尝试低成本方案，例如使用GPT-40 mini或改进Agent工作流程，找到两者之间的最佳平衡点。监督微调等技术可以帮助优化成本，但应在模型有效后才进行。 Ed Donner：选择合适的LLM需要综合考虑数据质量、评估标准和非功能性需求（如预算、时间等）。建议先建立基线模型（例如逻辑回归模型），再选择LLM。通常情况下，应优先考虑闭源模型，例如GPT-4，因为在原型开发阶段，成本通常较低。只有在需要处理大量私有数据或对API成本敏感的情况下，才考虑开源模型。 Eiman Ebrahimi：随着AI向Agent AI发展，数据安全需要考虑更复杂的因素，包括不同组件的运行位置、数据暴露参数以及程序化管理等。同态加密技术将发挥重要作用，可以帮助在完全加密的情况下进行数据处理。需要在不同组件之间找到平衡，并考虑如何以编程方式管理数据暴露。与其他技术相结合，才能实现更大的价值。 Sadie St Lawrence：OpenAI的Notebook LM令人印象深刻，其人性化的表达方式令人惊艳。自动驾驶技术（如Waymo和Tesla FSD）也展现了AI的巨大潜力，Waymo的单一最佳驾驶员理念体现了AI大规模应用的可能性。 Greg Epstein：对技术的盲目崇拜可能带来风险，提倡技术不可知论，即对技术发展持谨慎态度，不盲目乐观或悲观。 AGI的到来可能像一个宗教事件，难以预测其结果。我们应该关注人与人之间的关系，构建更具同情心的社会，而不是盲目追求技术进步。

Deep Dive

Key Insights

Why should AI developers not worry too much about the costs of using LLMs initially?

Andrew Ng advises that the primary focus should be on building a valuable and functional AI model first. Cost optimization should only be considered after the model is proven to work and is in use. Initial costs are often trivial, and tools like supervised fine-tuning can later reduce expenses if needed.

What are the key responsibilities of an AI or LLM engineer?

AI and LLM engineers must select the appropriate model for a task, understand business requirements, evaluate data quality and quantity, and consider non-functional factors like budget and time to market. They often start with a baseline model before moving to more complex LLMs, ensuring the model aligns with business outcomes.

What are the trade-offs between using closed-source and open-source LLMs?

Closed-source models like GPT-4 are often recommended for initial prototyping due to their advanced capabilities, while open-source models are preferred for proprietary or sensitive data, cost optimization, or on-device applications. The choice depends on data privacy, budget, and specific use-case requirements.

How does the rise of agentic AI impact data security?

Agentic AI systems interact with multiple data sources, some on-premise and others in the cloud, complicating data security. Innovations like homomorphic encryption are emerging to address these challenges, ensuring secure data processing across distributed systems.

What was the biggest wow moment in AI in 2024 according to the podcast?

The biggest wow moment was OpenAI's Notebook LM, which impressed users with its human-like conversational abilities. It was highlighted for its ability to make even mundane topics engaging, earning widespread praise and adoption.

What is the significance of Waymo's approach to autonomous driving?

Waymo treats its fleet of autonomous vehicles as a single, unified driver, emphasizing the scalability of machine intelligence. This approach demonstrates how focused, domain-specific AI models can revolutionize entire industries, such as transportation.

What is the concept of tech agnosticism as discussed by Greg Epstein?

Tech agnosticism advocates for a balanced, skeptical approach to emerging technologies, avoiding blind faith in their potential. It emphasizes the importance of uncertainty and critical thinking, especially in the face of grandiose claims about AI and the singularity.

Why does Greg Epstein caution against rapid AI development?

Greg Epstein warns that rushing AI development, driven by narratives of technological rapture, may lead to unintended consequences. He suggests that slowing down and focusing on human values and compassion could yield more sustainable and beneficial outcomes.

Chapters

This chapter explores the concerns of AI developers regarding the costs of using LLMs. Andrew Ng advises against prioritizing cost optimization before building a functional model, suggesting that focusing on building a valuable product is paramount. The discussion highlights the balance between cost-effective options and efficient workflows.

Prioritize building a working model over cost optimization.
Use the best model initially, then optimize costs if necessary.
Supervised fine-tuning can help lower costs after a successful model is built.

Shownotes Transcript

AI security, LLM engineering, how to choose the best LLM, and tech agnosticism: In our first “In Case You Missed It” of 2025, Jon Krohn starts the year with a round-up of our favorite recent interview moments. He selects from interviews with Andrew Ng, Ed Donner, Eiman Ebrahimi, Sadie St Lawrence, and Greg Epstein, covering the latest in AI development, touching on agentic workflows, promising new roles in AI, and what blew our minds last year.

Additional materials: www.superdatascience.com/852)

Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.

852: In Case You Missed It in December 2024 40:15 Share