We're sunsetting PodQuest on 2025-07-28. Thank you for your support!

Build Your Own Models with ChatGPT on OpenAI

2024/4/6

No Priors AI

AI Deep Dive AI Chapters Transcript

People

主

主播

以丰富的内容和互动方式帮助学习者提高中文能力的播客主播。

Topics

主播：OpenAI 扩展了其自定义模型训练计划，允许公司创建针对特定用例的生成式 AI 模型。这对于医疗保健、法律等行业具有重要意义，可以提高模型的准确性和适用性。最初，该计划需要与 OpenAI 研究人员合作且成本高昂，但现在已经扩展到更多客户，并提供了自助微调选项，降低了门槛。OpenAI 的这一举措不仅可以增加收入，还可以缓解其模型服务基础设施的压力。许多公司已经开始使用微调的模型，例如 SK Telecom 利用微调的 GPT-4 处理与电信相关的韩国语对话。Harvey AI 则利用微调的 GPT-4 帮助律师处理法律案件，有效避免了 ChatGPT 出现幻觉和虚构案例的问题，显著提高了准确性和实用性。主播预测未来会有数千个针对不同行业和特定任务的 AI 模型出现，这将改变 AI 行业的格局。为了方便用户使用这些模型，主播正在开发 AI Box 平台，整合各种 AI 模型，并允许用户混合匹配和链接提示。

Deep Dive

Shownotes Transcript

Translations:

中文

OpenAI is expanding their custom model training program. So today on the podcast, I want to dive into what this means, why this is important, and talk about some of the interesting use cases that we're seeing from this fine-tuned model. There's a bunch of companies that are actually using these fine-tuned models that you probably didn't know about. I personally didn't know about. There's some very interesting use cases. I'll talk about the features, what's going on. Let's get into the podcast.

So the first thing that I want to talk about here is just the fact that obviously this is a use case that a lot of people have been using. This is essentially the custom model program, and it's set up essentially to help companies get

create generative AI models that are tailored for specific use cases. So you can imagine if a law firm or a healthcare provider wanted to use ChatGPT, but they're like, hey, it doesn't know all this really specific stuff about the healthcare industry, they would be able to provide extra data and they would fine tune a model. It's essentially ChatGPT or GPT-4, but with all this extra industry specific

knowledge. And this is what, you know, they're kind of helping people to do. So custom models and all of this was actually introduced at OpenAI's first developer conference. If you remember, they had their big dev day. And this essentially allowed businesses to work with OpenAI researchers. So this wasn't like you could do it yourself. You pretty much had to go collaborate. And I actually saw some like, I saw some reports saying that like you had to spend like a million dollars or something crazy in order to do this. So it definitely wasn't for everyone, but there was a bunch of people doing it. So

Since they launched this, they did a big blog and they said, quote, dozens. So I'm assuming, I don't know what, like dozens, like a handful of dozens. I don't know. There's probably like somewhere between 12 and 100 customers or companies that have specifically done this with them and worked with them on this custom model program. Yeah.

But OpenAI said that they needed to expand the program to quote unquote maximize performance. I'm not 100% sure what maximize performance means in this context. But what I can assume is that this is probably a good way for them to make more money. So if performance is financial returns, then this is a great thing to do. And of course, for companies, I'm not saying they're selling something bad here. I'm just saying that's probably their main motivation. So

One thing that is interesting, there's a new feature here and it's called assisted fine tuning. So this is added to the custom model program. But essentially what assisted fine tuning is, is of course you can go and

They kind of have two options. The old option, which was like OpenAI was going to help you do this, and they have a new option where you can actually do this yourself. So it's kind of like self-led. You can go bring in your own data. You can go train this yourself without having to use OpenAI, and that is going to be cheaper, and I think it's going to be a really interesting option for a lot of companies. So

There's a bunch of different custom trained models that are being developed right now on top of OpenAI's kind of base model of GPT-4. And I think this is a great example. So they had a couple examples in their blog post that they talked about when OpenAI kind of made this big announcement of these new features. I mean, the biggest feature here really is that like this program already existed, but now you can go and do it yourself without needing to spend a million dollars to get access to one of the researchers. So in my opinion, that's huge news.

They talked about a couple different companies that have been using them. So one is SK Telecom. This is a big telecom company in Korea that I like. Like I've been hearing about them do a lot of investments and a lot of moves in AI. So this isn't a shocker to me, but they fine tuned GPT-4 for help with a bunch of like specific telecom related Korean conversations and some other things that they were doing.

Another famous one that I didn't realize had a partnership with OpenAI on this is Harvey AI. This is a company that raised millions of dollars earlier last year. And I remember when there's all the hype around Harvey AI because it helps lawyers with legal cases and stuff. And I remember a lot of the hype was like, I'm sure there's way more now, but there was like 15,000 law firms that were on the wait list to use Harvey AI.

So I think it's very interesting. I watched a demo of this actually in action and essentially using Harvey AI, they were able to feed a bunch of specific data in relation to like the law field. And I think they actually gave it like a bunch of cases. So if you know lawyers, essentially what they do is when there's a lawsuit, they're going to go look at a bunch of old cases, how they're resolved and try to find what the precedent was on an issue.

And this is something that Chachapati notoriously struggled with. There was like, you know, a lawyer that asked it for, you know, some sort of like legal, you

some sort of legal precedent on an issue and it completely invented a case that never existed on like some airline lawsuit. He submitted the case. He got in big trouble because he completely submitted a fabricated case that never happened. It was a hallucination. And this was kind of like a big example of some of the issues that could arise with these AI tools. So that being said, lawyers had a lot of skepticism. And that's, I think, where Harvey AI was born and why it's important is because to avoid problems like that, um,

Sorry, I was just thinking, I'm like, that would have been like a genius marketing move for Harvey AI to essentially hire the lawyer to like just, you know, do that. And then whatever his consequences are, pay for them so that it can just prove how why you need them. And you can't just use chat out of the box. But anyways, it's probably not what happened. This is good conspiracy theory. Anyways, it came up the really great.

uh, if you watch it side by side with Chai Chippity and with what actually they were able to fine tune Harvey on, it comes up with really great results. So I saw a demo where they were like asking it for precedent on a specific issue. Chai Chippity gave kind of a three paragraph blurb about what you'd expect, right? Like it just kind of pulling from whatever it had. And then when Harvey was asked, it was able to use the natural language processing, like the LLM kind of

of GPT-4, but with all the data and context, it was able to actually outline four real cases. It can add links to those cases and then it's able to, yeah, essentially outline like actual useful data and a bunch of different questions and things that were being asked in relation to kind of like case law. So very, very useful. And of course, not possible just out of the box with GPT-4. I

I think what's interesting here is OpenAI said that they think most organizations are going to create their own customized models for their specific industries. This is actually the same thesis I have as I'm building out AI Box, which is a no-code AI app builder marketplace. We believe that there's not going to be just the handful of big players we see today, but that there will eventually be

thousands of AI models in every industry that are specifically good at doing different things. And so that's why we're building out AI Box, so that we will have a platform that people can access all of these different models on one account, right? You don't need to make 20 different accounts to go use all of the different best-in-case, best-in-class AI models. You can go get them all on one account, and you can also mix and match them together, link them together, chain your prompts together. So

That's what I'm building. And I'm excited to see that OpenAI sees the same future that essentially I'm building for. What is interesting here to me is that OpenAI right now, they're getting close, apparently, to reaching around $2 billion in annualized revenue.

And they're planning a hundred billion dollar data center with Microsoft. So right now, I think that these kind of custom model training that they're working on right now is seen as kind of a way to sustain some of their revenue growth while they're also trying to reduce, like there's a huge strain on their model serving infrastructure. Like this is,

It takes a, it takes a big toll. So these fine tuned and custom models, I think are a lot more efficient. And I think because of that, that could help alleviate opening eyes. Like, obviously this is historical there, you know, the, this compute capacity challenge that they're facing right now. And I think this could actually kind of help to alleviate that.

So what's interesting is OpenAI actually introduced some new model fine-tuning features for GPT 3.5, right? Their free version that anyone can access. And this includes a dashboard for model quality comparison, third-party integration support, and they're kind of starting with weights and biases and some tooling enhancements.

What I think is interesting, though, is that details about fine-tuning GPT-4, which was actually made available in early access during Dev Day, they didn't actually say anything about that, so I'm not sure when that's going to come out, but I'll definitely keep you up to date. I think this is a fascinating new development we're seeing out of OpenAI. It solves some problems for them. I think it helps generate extra revenue, but overall, I think this is amazing for the community as companies are going to be able to make some incredible progress.

fine-tuned models on top of what OpenAI has. So I'm excited for that and I'll keep you up to date on whatever amazing innovations we see there. What I would love for you, if you learned anything in this episode, if this was interesting in any way to you, it would really, really make my day if you could leave us a review on the podcast. I would be super thrilled to hear your feedback. You can, if you're on Spotify, drop us some stars for an Apple podcast. Go ahead and leave a comment. Really, really appreciate it. But I hope that you have an amazing rest of your day.

Build Your Own Models with ChatGPT on OpenAI 09:13 Share

No Priors AI

Deep Dive

Shownotes Transcript

Build Your Own Models with ChatGPT on OpenAI