Anthropic Has (Maybe) Solved a Holy Grail of Business AI

2024/11/28

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis

Host

Host and founder of the well-known true crime podcast Crime Junkie.

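The host covers applications of AI in Thanksgiving turkey cooking and in business writing. Butterball uses AI to analyze customer data and improve its turkey cooking guidance and customer service; Perplexity is considering building AI hardware; and Anthropic has launched a custom writing styles feature for Claude that can imitate the tone of a person or brand, which matters for content creation at scale. David Singleton introduces the idea of an operating system for AI agents, arguing that agents need a common technical framework to connect to services and communicate with each other, just as apps need an operating system. He also stresses that building AI agents requires new user interface patterns, a redesigned privacy model, and a simplified development platform, so that developers can build useful agents that help people save time and focus on what matters. Matt gives Anthropic's new custom writing styles feature a mixed review: he uploaded 90 minutes of transcripts, but the generated text did not read like his own writing. He also notes that the real value of the feature lies in creating and using preset writing styles rather than perfectly copying one person's style, which matters for content creation at scale.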

Deep Dive

Today on the AI Daily Brief, a new Anthropic feature that promises to be able to copy your own writing style. Before that, in the headlines, how AI is helping fix one of the most painful parts of Thanksgiving. The AI Daily Brief is a daily podcast and video about the most important news and discussions in AI. To join the conversation, follow the Discord link in our show notes.

Welcome back to the AI Daily Brief Headlines Edition, all the daily AI news you need in around five minutes. We are coming right up on Thanksgiving.

In fact, by the time you're listening to this, it may be Thanksgiving, and it turns out that there is a very specific experience that Americans really, really hate on this day. It's not getting stuck in conversation with a weird drunk relative; it has to do with the cooking process. For over fifty years, Butterball has maintained a turkey hotline: think 911 for turkey-related emergencies.

This was very famously featured in an episode of The West Wing, where President Jed Bartlet mentioned offhandedly that there should be a special service for helping people with their Thanksgiving turkeys, only to discover that one existed. He then, of course, tried to economist-splain to them how to make a turkey, but that's neither here nor there. The point is that Butterball has decades of audio data from customers explaining the snags they run into on Thanksgiving Day.

By piping this data through an AI summarization tool, Butterball came up with a startling finding: people hate dealing with frozen turkey. Every year, Thanksgiving chefs call in in a state of panic when they realize they've forgotten to thaw the bird. While this isn't a surprise, the key insight was that complaints about thawing have exploded in recent years, something that Butterball might not have noticed without the help of AI to summarize millions of customer service interactions.

The firm has now undertaken a three-year process of revising their data structure and analyzing customer input, and is now quite possibly the most AI-forward turkey processor in the world. This is all they do, and each Thanksgiving Day has eighteen months of preparation put into it, from procurement to logistics to customer service. AI is also helping with those tasks, allowing the firm to better optimize their inventory and deliveries.

The crown jewel of their AI overhaul, however, is the thawless turkey making its debut this Thanksgiving. That product involved hours and hours of testing, with turkey scientists collecting hundreds of data points. With the help of machine learning, Butterball believes they have engineered the perfect thawless turkey. They've paired it with precise cooking instructions to ensure it comes out right every time.

So if you are cooking a thawless turkey this year, you could be eating your first AI-enhanced Thanksgiving meal. Next up, Perplexity is about to fall into one of the single greatest traps that any startup can fall into, which is trying to make a cool next-generation hardware device. Earlier this week, CEO Aravind Srinivas wrote, "Considering making a simple under-fifty-dollar hardware device that will reliably answer your questions. Voice to voice. Just do this, but do it very well. If this post gets more than five thousand likes, will definitely make it."

When the post did indeed get more than five thousand likes, he followed up with the comment, "Alright, LFG." AI hardware, though, has been a fairly fraught category over the past year. Arguably the most successful product, the Rabbit R1, launched to very lukewarm reviews.

The company claims to have shipped one hundred and thirty thousand units, but they are readily available at a steep discount on the secondary market. Others, like the Humane AI Pin, had awful reviews, weak sales, and a recall that has forced the company to look for an acquirer. Still, if the last year has taught us anything, it's don't fade Perplexity.

The company is reportedly in the middle of raising half a billion dollars and has been shipping relentlessly for months. AI hardware is also a theme that could rise in popularity next year. Midjourney recently formed a hardware team, OpenAI is looking to develop a device designed by Jony Ive, and Rabbit's agentic upgrade is showing some promise.

Still, shipping an AI wearable at a fifty-dollar price point seems pretty tough, and some are trying to convince Srinivas to do literally anything else.

Solopreneur Pieter Levels wrote, "In my opinion, don't do it. It's already done and never works. We don't need another AI device.

We already have a smartphone. Just double down on making the mobile app the best ever. Add a great competitor to Google Images, because I use that a lot. But it should be a similar layout, as an LLM-style layout doesn't work for it." And I think this is the sentiment from lots of people. Taking on Google and finally being the first competitor to actually have a chance to disrupt search in twenty years seems like a big enough task.

Others, though, are egging him on, with Bojan Tunguz saying, "I already made one for two hundred dollars using an off-the-shelf PC and ChatGPT advanced mode. I'm about to make one with Raspberry Pi for around one hundred dollars."

So will it actually happen? We'll just have to wait and see. Over in funding news, a group of former Google and Stripe executives have raised fifty-six million dollars to build an operating system for AI agents. The startup, called /dev/agents, is led by a group of founders who helped build the Android platform.

They're now applying the same playbook to AI agents. The key insight is that agents will need a common technical framework to connect to services and communicate with each other, much like different apps on an operating system. Co-founder and CEO David Singleton said, "We need an Android-like moment for AI. We can see the promise of AI agents, but as a developer, it's just too hard to build anything good."

The goal is also to create a new user interface that allows more natural interactions with agents across different devices. One thing is for sure: the team is absolutely stacked with talent. Hugo Barra, the startup's chief product officer and formerly VP of product management for Android, said, "This is a team that built the last three generations of operating systems."

Investors are certainly excited. Nina Achadjian, a partner at Index Ventures, said, "If you think about the people at this company and founder-market fit, it couldn't be more relevant for what they've set out to go build." Jill Chase, a partner at CapitalG, said, "This is a once-in-a-generation opportunity they are attacking."

Announcing the company and laying out his vision, Singleton wrote, "Modern AI will fundamentally change how people use software in their daily lives. Agentic applications could, for the first time, enable computers to work with people in much the same way people work with people.

But it won't happen without removing a ton of blockers. We need new UI patterns, a reimagined privacy model, and a developer platform that makes it radically simpler to build useful agents. That's the challenge we're taking on.

We're building a cloud-based OS for trusted agents to work with users across all of their devices. We want to help people spend time on what matters to them." In another bit of funding news, Black Forest Labs is in talks to raise two hundred million dollars in their first major funding round.

The German startup is only a few months old but has garnered acclaim for their Flux text-to-image model. The model is driving image generation for xAI's chatbot and is generally considered to be right at the top of the state of the art. The company's founding team includes several computer scientists involved in creating Stable Diffusion.

The new funding round is rumored to value the company at a billion dollars and will be led by a16z. Their seed round was raised in August, gathering thirty-one million dollars from investors including Oculus co-founder Brendan Iribe and Y Combinator head Garry Tan. In product news, Google is connecting Spotify to Gemini. According to code spotted in the Gemini extensions, the AI assistant will soon be able to take control of Spotify.

Users will be able to use AI to search and play music using natural language requests. For the moment, Gemini won't be able to make playlists or interact with radio stations on the platform. This marks the second integration for Gemini outside of Google's apps, with WhatsApp compatibility added last month.

And the question is whether this reveals something of Google's upcoming strategy. Is their focus going to be on bringing an agent-type experience to a wide constellation of apps? It would certainly be in line with how they've done things in the past.

Lastly, a new business line for Uber: the company is launching a new AI data labeling service. The new division, called Scaled Solutions, has begun hiring contract workers to complete data labeling tasks. The initiative builds on an internal team that tackles large-scale annotation tasks for the ride-sharing company, but the division will now offer their service to external clients.

Data labeling is an unglamorous but rapidly growing part of the AI industry. Scale AI, a startup that offers similar services, is currently valued at fourteen billion dollars, putting it among the top venture-backed companies in the space. We're also seeing high-quality data labeling emerge as a superpower for some model builders.

Last month, the new video model from Chinese lab MiniMax blew the industry away with its unprecedented abilities, and some suspected the secret to building that performant video model was plentiful and accurately labeled training data. Regarding plans for this new division, an Uber spokesman said, "Having performed these tasks at scale over the past decade as part of our own growth, we deeply understand the needs of companies requiring these services." They added that hiring independent contractors aligns, quote, "with our expertise as one of the world's largest providers of flexible work opportunities." There may not be big news from the frontier labs, but there is still plenty going on as we head into this holiday week.

For now, that is going to do it for the AI Daily Brief Headlines Edition. Next up, the main episode.

Today's episode is brought to you by Plumb. Want to use AI to automate your work but don't know where to start? Plumb lets you create AI workflows by simply describing what you want. No coding or API keys required. Imagine typing out, "AI, analyze my Zoom meetings and send me your insights in Notion," and watching it come to life before your eyes. Whether you're an operations leader, marketer, or even a non-technical founder, Plumb gives you the power of AI without the technical hassle. Get instant access to top models like GPT-4o, AssemblyAI, and more. Don't let technology hold you back. Check out useplumb.com, that's Plumb with a b, for early access to the future of workflow automation.

Today's episode is brought to you by Vanta. Whether you're starting or scaling your company's security program, demonstrating top-notch security practices and establishing trust is more important than ever.

Vanta automates compliance for ISO 27001, SOC 2, GDPR, and leading AI frameworks like ISO 42001 and NIST AI risk management, saving you time and money while helping you build customer trust. Plus, you can streamline security reviews by automating questionnaires and demonstrating your security posture with a customer-facing Trust Center, all powered by Vanta AI. Over eight thousand global companies like LangChain, Leela AI, and Factory AI use Vanta to demonstrate AI trust and prove security in real time. Learn more at vanta.com/nlw. That's vanta.com/nlw.

Today's episode is brought to you by Superintelligent. Every single business workflow and function is being remade and reimagined with artificial intelligence. There is a huge challenge, however, of going from the potential of AI to actually capturing the value.

And that gap is what Superintelligent is dedicated to filling. Superintelligent accelerates AI adoption and engagement to help teams actually use AI to increase productivity and drive business value. An interactive AI use case registry gives your company full visibility into how people are using artificial intelligence right now. Pair that with capabilities-building content in the form of tutorials, learning paths, and a use case library, and Superintelligent helps people inside your company show how they're getting value out of AI while providing resources for people to put that inspiration into action.

The next three teams that sign up with one hundred or more seats are going to get free embedded consulting. That's a process by which our Superintelligent team sits with your organization, figures out the specific use cases that matter most to you, and helps actually ensure support for adoption of those use cases to drive real value. Go to besuper.ai to learn more about this AI enablement network.

And now back to the show. Welcome back to the AI Daily Brief. There is a very common pattern when people start interacting with generative AI, and basically that pattern goes something like this: at first, you are absolutely blown away by capabilities.

Whether it's an image generator like Midjourney, a voice synthesizer like ElevenLabs, or, of course, an LLM like ChatGPT or Claude, the things that AI can do make you feel like a wizard. However, inevitably, the deeper that you get, the more you find things that aren't quite right or are limiting in terms of just how far you can take it. A great example in the image generation space is consistent characters. It's wonderful if you're just trying to create one-off images: you can get incredible fidelity and specificity in terms of what you want it to look and feel like.

But if you're trying to do that across an entire range of images, if you're trying to create the basis for animation or a comic book, it gets a lot harder. And lots and lots of effort has been spent, both from the big image generation companies as well as third parties, on how to do that better. And as that capability comes online, the key is that it unlocks an entire set of use cases that were otherwise cut off.

In the LLM space, one of those holy grails, in other words, one of the types of updates that could unlock a huge number of new use cases or just improve what the LLM is already being used for in a fundamental and significant way, is the ability for an LLM to imitate a particular writing style. One of the things that people figured out with ChatGPT and Claude quite quickly was that there is a particular flavor and feel to LLM-generated writing.

There are certain words that get used way more by LLMs than they get used by real people, words like "delve." And just in general, there's a particular style that feels recognizable.

Now, there are always ways that you can use prompts to try to get around that. You can coach the LLM to write in a particular way, and you can give it references. But one of the things that people have most wanted is the ability to just upload a set of their own documents and have the LLM be able to natively copy them. In fact, I've tried extensively to do this.

I built a custom GPT, which, as you can see, has like fifteen reference documents, from short essays to long-form writing. And the short of it is that it's only okay. It definitely makes ChatGPT not sound like ChatGPT.

And it certainly has some patterns that mimic how I write, but it also doesn't really sound like me, certainly not enough that I'm going to outsource important writing to it in any sort of short order. Other companies recently have tried to productize versions of this. Spiral, for example, is a product from Every which does something in this space.

Their approach is really interesting. Basically, they're trying to make it easy for you to translate one type of content to another. So I can say I'm generating YouTube videos, podcasts, and LinkedIn posts, and then it shows you how it can take that source work and turn it into lots of other posts.

Now, part of the process is that it tries to retain style as it builds out the set of additional assets, and they let you play with historical examples as a way to demonstrate what it can do. And yet still, what people have really wanted is for this to be built natively into the LLMs themselves, and for it to be good. And so when Anthropic announced this new styles feature yesterday, people in the know really got excited.

Matt Wolfe, for example, wrote, "I feel like this is a feature that everyone's been wanting and waiting for. Unfortunately, this news has been overshadowed by the Sora leak, which has already been shut down. This is bigger, in my opinion."

Anthropic, for their part, announced, "With styles, you can now customize how Claude responds. Select from the new preset options: Concise, Explanatory, or Formal." And so there are actually two parts of this feature.

The first is that there is just a built-in style selector where, for any prompt that you have, you can say whether you want it to be Normal, the default responses from Claude; Concise, shorter responses and more messages; Explanatory, educational responses for learning; or Formal, clear and well-structured responses. That on its own is a really great feature that allows you to not have to prompt to be clear about the style that you want.

It's instead now built into the UI. However, the real juice, and what's got people excited, is the fact that you can create and edit styles. So what you do is you select the styles menu, press "Create & Edit Styles," and then "Create Custom Style."

From here, you can add a writing example, which can be a document, or you can copy-paste text, or you can simply describe the style you're going for. I added a recent post from LinkedIn that I had written that was a little bit longer and more substantial, more blog-post-like. And what it came back with was a style that it called Tech Translator.

The style summary was, "Deliver analytical insights through conversational and authoritative communication." So now when I go back: "Let's write a short blog post about AI agents in the enterprise. Emphasize that while they might not be production ready yet, twenty twenty-five is likely the year where people start integrating agents at their companies, and forward-thinking enterprises should get ahead." Copy this.

Let's do it Normal first. And you get this piece, "AI Agents in the Enterprise: Preparing for the 2025 Wave." This, because it's Claude, sounds nothing like me.

It has nothing to do with how I would write. Just by way of one example, the paragraphs are much longer than would make sense for the type of style that I'm going for. Now let's try it with the Tech Translator style, the style that I created.

For me, this is certainly not perfect. It's a little more tweet-like than I'd like to think that I am. The first paragraph is, "Let's get real about AI agents in the enterprise. While everyone's watching demo videos and proofs of concept, most companies are still sitting on the sidelines. And honestly, that's been the right move, until now."

That is so much closer to how I write: short sentences, punchy, and trying to get people to pay attention. Now, it's very hard, of course, to describe in the context of a podcast whether this works better.

But my point is that, at least in my initial trials, this gets a heck of a lot closer natively than the stuff I've done before. And especially if you imagine a use case where a company is trying to mass-produce content, this is such a huge unlock in terms of the average quality of a piece of writing that is going to come from Claude. Not everyone has been very impressed.

Matt Wolfe got a style summary that said, "Deliver technical AI news with enthusiastic, conversational expertise that makes complex topics engaging and accessible." But he said, "Unfortunately, I'm kind of disappointed. I uploaded ninety minutes' worth of transcripts, and I don't think this reads like me at all. Luckily, I can give feedback and try to get it even closer."

One thing I will point out is that Matt uploaded YouTube transcripts, which are him talking. That's different than writing, and I wouldn't be surprised if that's a tougher translation for this technology. But regardless of what the source is, the point is that it's important to not overstate how close this hews to an individual style.

I think the real value, and where a lot of people are going to get excited, is a little bit copying their own style, but a lot being able to create a bank of existing style presets that make it much faster to go and get the exact type of output you want for any particular given use case. This, to me, perfectly encapsulates where we are with the development of LLMs. It is not just about pushing the state of the art. It is also about user experience and making these things actually work in a personal and in a business context. With that in mind, I think writing styles is a huge upgrade and one that I'm excited to play with more. For now, that's going to do it for today's AI Daily Brief. Until next time, peace.
