We're sunsetting PodQuest on 2025-07-28. Thank you for your support!

AI's Future is Here Now

2024/12/22

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis

AI Deep Dive AI Insights AI Chapters Transcript

People

主

主持人

专注于电动车和能源领域的播客主持人和内容创作者。

Topics

主持人：即使当前生成式AI的性能有所瓶颈，现有AI工具和能力的冲击力依然会深刻地改变工作模式，使其面目全非。这并非对未来超级智能AI的预测，而是基于目前AI技术已具备的强大功能。即使AI研发就此停滞，现有技术也足以在未来数年内彻底改变我们的世界。目前的AI模型已经具备多模态能力，能够处理和生成多种媒体形式，虽然存在缺陷，但在许多领域已具备实用价值。例如，在无需完美精度、需要第二意见、缺乏人力或AI表现优于人类的领域，AI已经展现出其效用。 AI能够分析视频并提出改进建议，例如通过Claude分析建筑工地视频，识别安全隐患。AI还能生成总结报告，效率远超人工。然而，AI生成的报告可能存在错误，因此不应将其用于惩罚或奖励个人，但可在缺乏监管或指导的情况下提供帮助。 AI监控系统未来可能被广泛应用，但其应用方式需要谨慎考虑，避免滥用。AI还能自动化许多数字世界中的知识工作，例如使用电脑、浏览网站、填写表格和完成交易等任务。AI能够模拟用户行为，例如模拟用户在沃尔玛和亚马逊网站上购物，并生成报告。 AI已经能够胜任一些助理工作，未来可能会被广泛应用于分析和重复性任务。AI虚拟化身技术已经能够实现逼真的虚拟会议互动，未来可能广泛应用。即使现有AI系统并不完美，也已经开始重塑工作的基本方面。企业需要关注AI部署对人的影响，避免AI被滥用。AI对工作和社会的影响将是深远而广泛的。我们需要确保AI增强而非削弱人的潜力。大型企业明年将开始尝试使用AI代理。现有AI系统已经开始重塑工作的基本方面，企业需要关注AI部署对人的影响。即使AI发展停止，我们仍然需要应对长达数十年的转型工作。我们仍在探索AI工具的用例，会议记录工具的最佳用途并非个人使用，而是共享信息给未参会者。会议记录工具的真正价值在于提高团队协作效率。我们才刚刚开始探索AI工具最有价值的用例。用户体验正成为AI产品竞争的关键因素。AI工具的创新将不仅限于对现有活动的替代，还将带来新的可能性，例如营销人员可以使用AI工具创建软件应用程序。AI工具的创新将部分来自增强功能，部分来自人们更好地使用现有工具。AI代理将从未来能力的扩展中受益匪浅。在AI转型中成功的组织将具备持续学习、了解核心价值观和不等待的心态。即使AI发展停止，我们仍然处于一个长达数十年的转型和变革的开端。

Deep Dive

Key Insights

Why is the current impact of AI transformative even without superintelligence?

Even if AI development plateaued today, the current capabilities of Gen 2 and GPT-4 systems are already highly transformative. These systems can process and generate various media, write code, operate computers, and access the internet, leading to significant changes in work processes.

What are some examples of AI's current capabilities in monitoring and analysis?

AI can monitor construction sites for safety issues, analyze work patterns, and generate detailed reports on observations. For instance, Claude 3.5 can analyze a construction site video, identify safety concerns, and create a prioritized punch list of issues to address.

How can AI be used in digital knowledge work?

AI can automate tasks like navigating websites, filling forms, and completing transactions. For example, an AI can test e-commerce websites, simulate user behavior, and generate detailed reports on the user experience.

What are the potential risks of AI-powered monitoring?

AI monitoring could evolve from a mentorship and safety tool into a panopticon where everyone is constantly watched and judged by AI. The ethical implications and government regulations will determine whether AI is used to help or control individuals.

What role do virtual avatars play in AI's impact on work?

Virtual avatars, powered by AI, can conduct meetings and interact with users in a way that mirrors human behavior. While still imperfect, these avatars could soon fool many people, highlighting the need for policy and practice adjustments.

Why is user experience becoming a key focus in AI development?

User experience (UX) is emerging as a critical factor in AI product competition. Improved UX can unlock deeper and more frequent usage of AI tools, making them more accessible and familiar to users.

What innovative uses of AI are on the horizon?

Innovative uses of AI could extend beyond simple task replacements. For example, tools like Cursor and Devon could enable marketers to create custom software or games, transforming software into a new type of content.

What mindset should organizations adopt for AI transformation?

Organizations should adopt a mindset of continual learning, core value understanding, and proactive implementation. Waiting for others to figure out AI solutions will leave organizations far behind in the AI-driven world.

Chapters

Even without significant advancements in AI capabilities, current-generation tools like GPT-4 are poised to revolutionize the workplace. Their multimodal capabilities, while imperfect, are already proving useful in various fields, from construction site monitoring to digital task automation. The ethical implications of widespread AI deployment are also discussed.

Current AI models are multimodal and can process various media types.
AI is already useful in areas where perfect accuracy isn't required or human performance is limited.
AI-powered monitoring raises ethical concerns regarding surveillance and algorithmic control.

Shownotes Transcript

Translations:

中文

Today on the AI Daily Brief, why AI's future is right here in the present. The AI Daily Brief is a daily podcast and video about the most important news and discussions in AI. To join the conversation, follow the Discord link in our show notes. Hello, friends. Last long read episode here before the holiday. And today, once again, we are basing our discussion on a piece by Professor Ethan Malek. This piece is called The Present Future, AI's Impact Long Before Superintelligence.

The TLDR of this piece is something we've discussed quite a bit on this show, that even if we are facing currently a major plateau in the performance of generative AI, the impact of the current crop of tools and capabilities is still going to be absolutely transformative for work in ways that leave it nigh unrecognizable from where it is today. Let's read Ethan's take on this, or rather turn it over to an 11Labs version of me to read Ethan's take on this, and then I'll come back for a bit of a discussion.

The present future, AI's impact long before superintelligence. The AI labs are absolutely confident that larger, more powerful AI models are coming soon, ones that will enable autonomous agents and systems smarter than human PhDs. You can see this confidence in two separate essays by the CEOs of two of the leading AI labs, Sam Altman of OpenAI and Dario Amadei of Anthropic.

that discuss the coming age of superintelligent machines. But these are not uncontroversial assertions, and we do not know if they are right. Yet, in many ways, we do not need super-powerful AIs for the transformation of work. We already have more capabilities inherent in today's Gen 2-slash-GPT-4 class systems than we have fully absorbed. Even if AI development stopped today, we would have years of change ahead of us integrating these systems into our world.

Today's AI models are already multimodal, able to process and generate various types of media like text, images, and sound. They can write code, operate computers, access the internet, and more. The pieces are all there, and we are starting to see them come together. They do not do any of this flawlessly and remain inconsistent and prone to hallucination. But there are many fields where AI abilities, flawed as they are, are already useful.

areas where perfect accuracy is not expected, or where having a second opinion is helpful, or where there would otherwise be no one to help, or where the best available human performs worse than the best available AI. Consider, for example, the combination of the ability to AI to both process images and reason over them.

It means that you can add intelligence to any video feed by just giving it to an AI, doing what was previously impossible. For example, I gave Claude a YouTube video of a construction site and prompted, you can see a video of a construction site, please monitor the site and look for issues with safety, things that could be improved, and opportunities for coaching. There is no special training here, just the native ability of Claude 3.5 Sonnet with computer use, taking screenshots every few seconds and studying them. You can see the sped up video of the system at work below.

In the video, Claude analyzes various aspects of the construction site: workers' protective equipment usage, placement of materials, work patterns, and potential hazards. These observations are interesting, but the system can go further. I then asked, "What did you conclude? Write up your observations as a punch list." The AI created a spreadsheet summarizing what it observed in a few seconds. Something that would have taken humans far longer. Note how it took all of the many issues it spotted across the video and applied reasoning to them.

breaking them down by priority order, making logical inferences about how to address them, and more. Then Claude asked me a question: "Would you like to create a tracking system for completion verification?" That seemed like a good idea. So I agreed and it made one, purposefully including fake names as an example of the data I had to fill in. The results seemed good from reviewing the video, but I am not an expert and I would be surprised if there were not serious hallucinations mixed in.

For this and many other reasons, I would never want this system to be used to punish or reward people. Yet consider a case where there would otherwise be no one monitoring a potentially dangerous environment, or where mentorship or advice is lacking. Then an AI who could flag a human to dig into a potential issue or opportunity could be a useful asset. I improvised this system with a couple of prompts. With more work, the error rates and costs of AI monitoring will drop, even if no new models are released.

These systems will get better. Organizations will be tempted to deploy AI observers everywhere. Governments may follow suit. What could be a mentor and safety check could become a panopticon where everyone is watched and judged by AI. The choices companies make and the rules put in place by governments will determine whether AI is used to help or to monitor us. One of many complex adjustments we will need to make to an AI-filled world. But observation is only one area where AI is already showing high levels of capability.

The digital world in which most knowledge work is done involves using a computer, navigating websites, filling forms, and completing transactions. Modern AI systems can now perform these same tasks, effectively automating what was previously human-only work. This capability extends beyond simple automation to include qualitative assessment and problem identification.

Here I asked Claude, go to the Walmart webpage and test it like a naive user trying to buy something, then go to Amazon and do the same thing. Write up your findings in a report in a document. Again, you can see in the sped up video that the AI goes to each website and role plays a user searching for and buying products. It then wrote up two reports, a narrative and a testing report. There were no hallucinations I spotted, and while they are not the most insightful reports I have ever seen, they were quite solid.

The AI is already a reasonable intern that, when given an assignment, executes it quickly and well, using judgment to solve problems along the way. As models get better and these systems get less complicated to use, it is easy to imagine managers using teams of AI agents to do analysis and repetitive tasks in the near future. We saw how multimodal inputs and tool use transform how AIs interact with the world, but it gets stranger still when we add multimodal outputs. Here I invited an AI avatar made by Heijen into a Zoom call.

The avatar is completely AI-powered from the voice to the image to the behavior. In fact, I prompted the avatar to act in the most stereotypical and corporate possible way for a Zoom meeting. While the uncanny valley, that unsettling feeling we get from almost but not quite human representations, is obvious in the slightly unnatural voice and visual glitches like the changing shirt, the interaction fundamentally mirrors a typical Zoom call. This is a first-generation tool and it actually works.

I would not be surprised if many people are fooled by virtual avatars in the very near future. These capabilities demand immediate attention to both policy and practice. Even as imperfect as they are, current AI systems are already reshaping fundamental aspects of work, from how we monitor safety to how we conduct meetings.

The choices organizations make today about AI deployment will set precedents that could echo for a long time. Will AI-powered monitoring be used to mentor and protect workers or to impose algorithmic control? Will AI assistance augment human capability or gradually replace human judgment? Organizations need to move beyond viewing AI deployment as purely a technical challenge. Instead, they must consider the human impact of these technologies.

Long before AIs achieve human-level performance, their impact on work and society will be profound and far-reaching. The examples I showed, from construction site monitoring to virtual avatars, are just the beginning.

The urgent task before us is ensuring these transformations enhance rather than diminish human potential, creating workplaces where technology serves to elevate human capability rather than replace it. The decisions we make now in these early days of AI integration will shape not just the future of work, but the future of human agency in an AI-augmented world. Today's episode is brought to you by Vanta. Whether you're starting or scaling your company's security program, demonstrating top-notch security practices and establishing trust is more important than ever.

Vanta automates compliance for ISO 27001, SOC 2, GDPR, and leading AI frameworks like ISO 42001 and NIST AI risk management framework, saving you time and money while helping you build customer trust. Plus, you can streamline security reviews by automating questionnaires and demonstrating your security posture with a customer-facing trust center all powered by Vanta AI. Over 8,000 global companies like Langchain, Leela AI, and Factory AI use Vanta to demonstrate AI trust and prove security in real time.

Learn more at vanta.com slash nlw. That's vanta.com slash nlw. If there is one thing that's clear about AI in 2025, it's that the agents are coming. Vertical agents by industry, horizontal agent platforms, agents per function. If you are running a large enterprise, you will be experimenting with agents next year. And given how new this is, all of us are going to be back in pilot mode.

That's why Superintelligent is offering a new product for the beginning of this year. It's an agent readiness and opportunity audit. Over the course of a couple quick weeks, we dig in with your team to understand what type of agents make sense for you to test, what type of infrastructure support you need to be ready, and to ultimately come away with a set of actionable recommendations that get you prepared to figure out how agents can transform your business. If

If you are interested in the agent readiness and opportunity audit, reach out directly to me, nlw at bsuper.ai. Put the word agent in the subject line so I know what you're talking about. And let's have you be a leader in the most dynamic part of the AI market. All right, back to real NLW here. Ethan really sums it up strongly in the last couple of paragraphs. Even as imperfect as they are, he writes, current AI systems are already reshaping fundamental aspects of work. And...

organizations need to move beyond viewing AI deployment as purely a technical challenge. Instead, they must consider the human impact of these technologies. Five quick things I want to talk about following up from this. In terms of ways that I have conversations with big enterprises about this particular issue, this idea that even if things stop now, there would still be an enormous amount of catch-up and transformation work to be done.

The first thing I want to flag is use cases. We are still in the process of discovering use cases for these tools. Even things that seem like obvious one-to-one replacements for work that happens right now are in practice in many cases not that obvious.

A great example of this, I think, is with meeting note takers. There are, of course, infinite versions of these tools. If you are anything like me, probably a third to a half of the participants at any given Zoom or Google Meet are these note takers. But what are they actually useful for?

I'm sure that for some folks, they do go back and refer to their notes, using them as a personal summarizer, something akin effectively to Dumbledore's Pensieve. But I actually think it is the other use case of the Pensieve, which is sharing memories with people, where these tools really thrive. At Super Intelligent, what we do is we have certain key meetings set up to automatically push those notes into Slack channels for the people who weren't in the meeting.

The real value of the summarization becomes not for the participants, but in bringing people who weren't in that meeting up to speed without relying on those participants rehashing the whole thing. It increases collective knowledge, it increases our ability to be in sync, and it does so at a lower time cost.

The point is, we spent months using these tools before we really landed on that being the best use case for them for us. And we're a company who exclusively does AI and focuses on helping enterprises figure out how to use AI. Point being that I think we are still just barely scratching the surface of really figuring out the use cases that are most valuable, even with today's current tools.

Second theme that I want to discuss is user experience. It has really only been in the last half of this year that we've started to really see focus on user experience as a key vector of the AI product competition. Up to that point, it was just all about model innovation and who had greater capabilities. But then you started to see things like Claude's artifacts, and all of a sudden everyone is thinking about the UX of these tools.

As that continues, it is going to open up tons and tons of new usage as people unlock either more familiar or simply better user experiences for interacting with generative AI that get them using it more deeply. Next theme, innovation. In our discussion of use cases, I was just saying that we mostly think about one-to-one replacements for today's activity with an AI-enabled version of today's activity. In marketing, I'm producing more content and I'm producing it faster and cheaper, but I'm still producing content.

I think, however, that the really interesting stuff is going to happen when we're not just one-to-one replacing current activities, but actually fundamentally thinking differently about what we can do because of Gen AI tools. To use that marketing example, if tools like Cursor and Devon become usable enough so that non-coders can actually build software applications, all of a sudden, software becomes a new type of quote-unquote content that marketers could create.

When you think about what marketers care about, eyeballs and impressions, engagement and time spent creating custom software or games, especially things that can be spun up quickly, deployed via social media and other channels, perhaps paired with cultural moments, and you have this entirely new category of things that marketers can do. And this is, of course, just one example. Some of this innovation will come from enhanced capabilities, but a lot of it's going to come from just people figuring out how to use the current crop of tools better.

For theme agents, agents are one of the things that is most likely to benefit significantly from expanded future capabilities. However, what we're seeing with vertical agents right now is that there are some applications, some functions, that even right now agents are really good at. I think figuring out how to integrate agents alongside AI-enabled workers is going to be a totally new and difficult discipline, once again, even if nothing progressed from where we are right this moment.

Lastly, fifth theme following up from this, mindset. The organizations that are and will be most successful when it comes to AI transformation are those that are going to embrace a particular mindset. They're going to embrace a mindset of continual learning, where there is never a done point, where they are never quote unquote fully transitioned, but are always looking out for what's next.

they are going to have a mindset of understanding their core values. They're going to know what their business stands for and how they operate, regardless of what technology they're using. And lastly, the mindset of not waiting is going to be so essential. Organizations that think they can just wait till someone else figures it out are going to be so woefully behind that they just don't stand a chance.

And so again, ultimately my point here is that I agree wholeheartedly with Ethan. Even if things stopped now, we would still be at the beginning of a decades-long transition and transformation. There is no time to wait, and there is no substitute for experience. Dive in, and of course, if you need help, reach out. For now, that's going to do it for today's AI Daily Brief. Until next time, peace. ♪

AI's Future is Here Now 15:09 Share