We're sunsetting PodQuest on 2025-07-28. Thank you for your support!
Export Podcast Subscriptions
cover of episode Beta Playground of AI Box Now Accepting Testers

Beta Playground of AI Box Now Accepting Testers

2025/6/3
logo of podcast Lex Fridman Podcast of AI

Lex Fridman Podcast of AI

AI Deep Dive AI Chapters Transcript
People
J
Jaeden Schafer
Topics
Jaeden Schafer: 我很高兴地宣布,经过两年半的努力,AI Box Playground 终于发布了 Beta 版。这是一个一站式平台,每月只需 20 美元,你就可以体验所有顶级的 AI 模型,无需订阅多个平台。最棒的是,你可以在同一个聊天窗口中与不同的模型互动,它们会记住你之前的对话内容。你还可以在对话中途切换模型,选择最适合当前任务的模型。此外,你还可以并排比较不同模型的效果,支持图像、文本和音频的混合使用。这绝对是一个非常棒的产品,我将向大家展示它的演示和实际操作。 Jaeden Schafer: AI Box Playground 提供了一个名为 AI Box Default 的模型,它可以根据你的提示,自动选择最佳的文本或图像模型来生成内容。你也可以创建自定义的默认模型,为不同类型的内容指定特定的模型。平台还允许你浏览各种 AI 模型,查看它们的功能和基准,比较它们的价格和质量。你可以使用不同的模型生成相同的提示,然后比较它们的结果。例如,你可以让不同的模型生成一张“日落时分的红猫”的图片,然后比较它们的效果。 Jaeden Schafer: AI Box Playground 还支持文本转语音和语音转文本功能。你可以上传音频文件进行转录,也可以使用 OpenAI 或 11 Labs 将文本转换为语音。更酷的是,你可以更改模型的声音设置,选择不同的声音进行语音合成。这对于测试不同的声音效果非常有用。此外,AI Box Playground 还具有媒体存储功能,可以保存你生成的所有媒体内容,包括图像和音频文件。你可以随时查看这些内容,并了解生成它们时使用的提示、模型和参数。你还可以删除这些内容,或者删除整个聊天线程。 Jaeden Schafer: 我们还为所有 AI 模型添加了功能说明和基准测试,方便你了解它们的能力。对于支持图像视觉功能的模型,你可以上传图像并提问。我们的目标是让你在一个平台上体验所有 AI 模型,节省订阅多个平台的费用。我们提供慷慨的 token 包,让你尽情使用各种 AI 工具。我们希望你能通过 AI Box Playground 探索更多你以前可能没有尝试过的 AI 模型和公司。目前 AI Box Playground 正在进行 Beta 测试,欢迎大家提供反馈。我们计划很快推出市场和构建器平台,敬请期待。

Deep Dive

Chapters
The AI Box Playground beta is launched, offering access to various top AI models for image, text, and audio processing within a single platform for $20 per month. Users can chat with different models simultaneously, switch models mid-conversation, compare outputs side-by-side, and create custom defaults. The platform includes models like Ideogram, Flux 1.1 Pro, and OpenAI's latest image model.
  • AI Box Playground beta launched after 2.5 years of development
  • Access to various top AI models for $20/month
  • Simultaneous chats with different AI models
  • Side-by-side model comparison
  • Image, text, and audio processing capabilities

Shownotes Transcript

Translations:
中文

Today on the podcast, I want to make an announcement that I have been waiting to make for over two and a half years. And that is that AI box, my software company I've been working on has officially launched its first product in beta. It's called the AI box playground and essentially allows you to try all of the top AI models in one place on one platform for $20 a month. So no longer do you have to have subscriptions to a ton of different platforms. And the cool thing is you can chat with different models in the same chat, they'll see the context of past

conversations you've had. You can switch the model mid conversation for different models that are better, different things. You can compare models side by side in the chats. You can do image, text and audio all in the same chat. It's a phenomenal product and I will be getting into it, showing you some demos and live viewing of the actual platform. So if you're listening to this, I'd recommend you check this out over on YouTube or on Spotify where we'll be

showing the actual video, but otherwise I'll explain what's going on for people that might just be listening to the podcast on Apple, which I know that's a big chunk of our

of our user base, of our listeners. Okay, so over on the platform, the AI Box Playground, what you'll notice is you have access to a ton of different models. We have one in particular that we've rolled ourself called AI Box Default. And this is a model that you essentially can give it a prompt, ask it a question, and it will pick the best model, whether that's text or image to kind of generate whatever you're going for. So

AI box default is a pretty cool concept. You can also go and create your own custom defaults. You know, if you're like, I just want this model for all of my images or this model for all my texts, you have the option to do that.

The other thing that you're able to do is scroll through all of the different models on the side. So if you go and check out an image model, for example, you can go look at Ideogram and you can go and see what the models are capable of and also kind of what the benchmarks are. So comparing this model to other models on their pricing, on their quality and everything else. So if you have a prompt that you are interested in, you can then go to the model and say, you know,

Let's just say a red cat at sunset for this ideogram one, getting it to create us an image here.

While it generates that, the thing that I love about this is that as soon as it's done its generation, whatever its image that it comes up with, you can actually go and even if you started by chatting with Ideogram, you can go and switch to a different AI model to talk to. So once it comes up, you're able to switch through different tabs and generate the same prompt

And by the way, it did generate the prompt and it's a pretty dang awesome. I don't know if you've ever tried an ideogram before, but it looks just like a photo. It doesn't look very AI generated. It's very, very impressive. So we have our red cat at sunset. But if you want to run that with a different model, you just click...

the little rerun button and, um, black forest labs, flux 1.1 pro, in my opinion, is also a really powerful tool. So we're going to run with that. Um, and you could also, you know, try the latest model out of chat GPT as well, because their new image model is pretty fantastic and it's a lot better than, um,

you know, the Dolly three or whatever. I think the new one is called, uh, open AI image one or GPT image one or something like that. So you can then go and actually see all of the different, um, outputs side by side. You do this with text as well. You could do it with audio. Um, we also have the option with, you know, some of our, some of our features with 11 labs and other tools like that, where you can do multiple voices. So you can do different voices and you can compare them all side by side. So we now have, um,

We have OpenAI, we have Flux, and we have Ideogram. And then we have a little button here where you can actually compare all of the different images side by side to get a better view of what all of these actually look like. And you can kind of tap through them to see what this actually looks like. So this is kind of one of our features we've been building. And then, of course, you have the ability to do

text and try some of the models you might not have tried before. Like let's say Quinn has an interesting model. We'll try Quinn 2.5. There's 72 billion parameter turbo model. We'll just tell it to tell us a joke.

And then you can do the same thing. You can get this to generate with not just like one or two, but you could get it to generate the same joke with like seven or eight different AI chats and just toggle between all of them. And then of course, the one that I love to do is we have text-to-speech and speech-to-text. So speech-to-text, if you upload an audio file or talk to it, then it's able to give you like a text transcript.

But if you want to do a text-to-speech, that's something that you're going to get from OpenAI or 11 Labs. I'm going to go ahead and do 11 Labs and just say, you are wonderful.

It's just going to use whatever its default voice is. Then you can actually go to the settings of the model. You are wonderful. You are wonderful. You can go to the settings of the model and go change the voice. So right now we have, you know, area voice selected, but we can see a ton of different voices in here. We can listen to examples. We got George. We got Charlie. We got Charlie.

So, you know, you can pick whatever one you want. You can change it. And then we're going to rerun this again. And we're going to do text to speech. And we're going to do 11 labs. And now it's going to do the new voice. So you actually have. You are wonderful.

I think that was George's voice. You are wonderful. And that was Aria's voice. So we have both the different voice options. So this is really cool. I love this because if you're doing a script or you want to test a voiceover and you want to test a bunch of different voices and listen to things side by side comparison, you can compare them side by side. You can get a ton of these things going. And for me, this is just...

So much fun. So I'm thrilled with this. Okay. The other thing that I'm really excited about with the playground, because of course the audio image text all in one place for $20 a month is great, but something that has plagued me honestly for ever really with these AI models is the

When I use something like OpenAI, for example, ChatGPT, and I get it to generate images for a lot of different projects, I use them for thumbnails on YouTube and for podcast covers and all sorts of stuff and projects I'm using ChatGPT for.

I forget where the image was. And so we've created something called media storage, which will show you all of the content, all of the media you've created, whether that's audio files, whether that's images. And if you go back and click on that image or file, it'll show you the exact prompt that was used to generate it. It will show you the model. It will show you how many tokens you use to generate that, what the date was. You could go download it or delete it or click view chat and it will take you back to the actual chat.

that was used to generate the image. And you can see, you know, what the context of your chat was before. So for me, this is pretty cool. And I am a huge fan. And again, if you want to delete it, if you go delete that image out of your media storage, it's going to disappear from this chat thread, but it won't necessarily delete the whole chat thread. So you can still keep some of your content by deleting that. However, if you do want to delete the whole chat thread, you can do that. And it will also delete all of the media files that are in the chat thread. So just a couple of cool things you can do there.

All right. One thing that we have added to all of the different AI models, if you're interested and you're kind of curious and you want to see like what they're all capable of, you can go to any company and click on any AI model and we have all of the models capabilities. So Lama 3.1 is only able to do text. We'll show you the benchmarks of how it compares to other models. If you're interested in Mistral AI and you want to try their CodeStroll developer, you can see what it's able to do. If you want to go and try something like

opening eyes 4.1 you can see that it also has image vision in addition so you know we'll show you all the different features and you can you can upload images by adding media here you can upload images and ask it questions about that so for any of the um for any of the models that have image vision it's able to it's able to you know understand what's going on there so some really cool stuff

that we've been packing into this. We're adding new features all the time, adding new AI models, really, really excited to see what people are able to do with this. And of course, my goal is just that you'll be able to save money, having all of your AI models in one place, not needing to have subscriptions on a dozen different platforms. $20 a month, you get access to everything. And if you need more usage, you can get more tokens, but we give you a pretty generous token package. So you get 20,000 tokens. And for a lot of people, this is going to be

This is going to be what you need to get you through the month using all of the AI tools you could ever imagine, which is a ton of fun. And we hope that all the models that we add will help you learn more and be able to explore a lot of AI models and companies that you probably didn't try before and that you maybe didn't want to pay for their subscription for their premium products. You get it all on here. So this is pretty exciting.

In any case, we are so excited. We finally launched. If you want to try this out, it's AIbox.ai. Two and a half years in, this is our first product. It's in beta. So if you're testing it and you think like, man, it would be really awesome if it could do X, Y, and Z, send me a message over on LinkedIn. Send me a DM on LinkedIn or on X, and we'll be adding new features very actively. Our roadmap is evolving. And in the meantime,

Play around with this. We'll be launching our marketplace and our builder platform very soon. And so if you have your account here, you'll be able to get access to that as soon as it rolls out as well. In any case, thank you so much for tuning into the podcast today for checking out what we're working on over here.

at AI Box. This is something I'm so excited about. I'm so excited to have you guys on the journey. It means the world to me. I appreciate all of the support we've received throughout the years from everybody. Hope you have a fantastic rest of your day and I will catch you next time.