Today, on the A, I daily brief, trump names the White house A, S, R. Before that, in headlines, OpenAI drops a new model and chat. B, T, pro on their first day of the daily brief is a daily podcast in video about the most important news and discussions in A I. To join the conversation, follow the discard ling at our shown outs.
I'll go back to the AI daily grief headlines edition, all the daily AI news you need in around five minutes today. One of those days where we kind of actually just have two main, the official main episode is about the appointment of a new White house ais r but most of these headlines are about the announcement from OpenAI first day of their twelve days of shipmen.
They kicked off their Christmas as event with the launch of the full of one model in a new protea ChatGPT subscription. So let's talk about the o one model first. The company tweet OpenAI o one is now out of preview in ChatGPT.
What's changed since the preview? A faster, more powerful reasoning model that Peter ate, coating, math and writing. A one now also supports image uploads, allowing IT to apply reasoning individuals for more detail and useful responses. When IT comes to this multi I modality opening eye demonstrated the future by showing the chat, by giving detailed instructions on how to build a birdhouse based on a single image.
In a more technically impressive example, the model analyzed the cooling requirements for a spacebar data center based on a cheap the updated version of one can deliver faster responses and has a thirty four percent reduction in major errors on difficult problems. An OpenAI spokesperson said that users can expect a quote faster, more powerful and accurate reasoning model that is even Better at coating in math, open a ee reasoning researcher and name Brown show that the model can not only pass the strawberry test, but can produce a three paragraph essay on strawberries without using the letter e. While performance is generally improved a great deal, the models performance is oddly reduced on some advanced chart.
Ks included mi bench, which measures how well A I agent perform machine learning engineering. There is some confusion right now around whether an earlier build was actually a benchmark, so should all become clear shortly. The updated model is now available for pleasant team subscribers and will roll out to enterprise and education users next week.
The second big announcement was the introduction of a ChatGPT protea subscription at a fairly pry, at least at first class, two hundred dollars per month. This teer offers unlimited usage of all of open A S models, including unlimited access to advanced ice mode. Jason way, a member of opening eyes technical staff, acknowledge that this tear is not for everyone, stating we think the audience for ChatGPT pro will be the power users of ChatGPT, those who are already pushing the models to the limit of their capabilities on tasks like math programing and writing.
To that end, the protection includes access to an even an beefier way of using the O N model referred to as promote. Open the eyes said that the promote quote uses more compute for the best answers to the hardest questions. Basically, the differences that promote allows a one to reason for a lot longer, with answers taking potentially minutes to return.
OpenAI said they intend to experiment with all one model that reason for hours, days or even weeks to further boost their reasoning capabilities. Evaluations from external expert testers. O one promote produces more reliably accurate and comprehensive responses, especially in areas like da science programing in case law analysis compared to both o one to know one preview.
O one promote performs Better on chAllenging machine learning benchMarks across math, science and coding. In particular, we saw seventy five percent reduction in errors for easier coding competition questions, more reflective of everyday programming queries. In their released notes for promote, open a eye highlighted that they had bumped up testing standards to verify the performance boost is reliable for other models.
The company gives a passing grade for each correct answer in a benchmark, but for promo, they required the model to get the answer right for at of four times. Now the discussion approach your subscriptions in promote is focused on a single question, who is this for? Alongside the release, OpenAI e announce the stead of grants to medical researchers at adding universities.
They said they planned to roll out grants to other disciplines in the future, and this seems to be part of the intended customer professionals and organizations that require research grade I tools. In other words, o one pro is probably not the right choice if you just want help with meal planning, but if you want to research gene therapy, IT might be the model for you. Professor y in molex spall day yesterday, experimenting with the new models and sharing what he had learned, he wrote in playing with one and no one pro for a bit, they'd very good.
And a little weird, they're not. For most people, most of the time, you really need to have particular hard problems to solve in order to get value out of. But if you have those problems, this is a very big deal.
The problems that can solve well tend to be very high value, think system design, complex problem solving, analysis for financer, other uses, the value will clearly be higher than the Price for the organizations in people who will need to use that. He found that oone pro performed well on a range of low value problems, like writing poetry or advising and investment strategy of buying etf. He also managed to get a to design a turn machine with logic gates made entirely of storms of crabs, inspired by a twenty twenty one scientific paper, more like some of his thoughts like this writing.
Here's my serious tweet on our one. IT can solve some P H D level problems in as clear applications in science, finance and high value fuel. Discovering uses will require real R N D efforts.
Few people have P H D level problems for most people, just use quarter chat, B, T R G, I IT beats on IT, but not everything. Instead, particular classes of hard problems that saw is still dominates in other areas. A one is not Better as a writer, but it's often capable of developing complex plott Better than saw IT, because I can plan a head Better.
I ve had access to o one for a bit, but I used on ted in GPT foro in geri a lot more. But when those fail on particularly chAllenging work, o one, and especially o one pro, can sometimes crack things that the other models cannot. I'm still figuring out a general pattern and use cases, and I think this is a key story for all of A I.
Even people who use A I all the time are still in a use case discovery modality right now, things that seem obvious or not an emergent use cases present themselves all the time. In this context, there is simply no substitute for getting in there and getting your hands dirty. Overall, I think there's a lot of enthusiasm.
Palm, rote o one proves incredible for research. Very, very good. Eric klin Cherry rights a one proves impressive. The responses don't feel like simple associations anymore. For the first time, I feel that really understands the nuances and think things through.
Read says, if no one pro can help quantity, M, L, engineers solve problems even five percent faster than it's a bargain at two hundred dollars per month. That's a miniscule fraction of what their salaries. Daniel, fun of lights of energy route, just hired a new internet.
Two hundred dollars ars a month. They're cracked and no doubt, but i'm suspiciously might be working many jobs and that kind of seems to be the point the protease seems seems squarely at people who have specific use cases. They want attack where the costs are justified.
And while some are worried that this is the start of much more expensive a eye products, adam silver erman thinks it's good for the industry making sure that the business model actually works for continued advancements, he writes. I hope OpenAI charging two hundred dollars a month for pro will be a catalyst for A I and agent companies to start charging more for their products overall. Pretty cool first day for shipments.
And like I said, well, that is mostly the main part of the headlines today. The one of the story I did quickly want to mention just for elon musk, X A I has closed their latest funding round, taking in six billion in fresh capital that brings their fun raising for the year to a very healthy eleven point four billion. That's a little over half the total raised by opening eyes since the launch of ChatGPT and judge I anthropic total on raising efforts.
According to A S C C filing, ninety seven investors took part in a series be round, with the lower stake being seventy seven thousand, five hundred and ninety three dollars. We don't have confirmation of the other details as there is no accompany in press release. But the point is X, A, I enters the year flush with cash and presumably ready to ship.
That's going to do a for today's headline es addition. Now next up, the main episode. Today's episode is brought to you by vantage, whether you're starting or scaling your company security program, demonstrating top noch security practices and establishing trust is more important than ever.
Penta automates compliance for I S O twenty seven, O O one soc two gdpr and leading A I frameworks like I S O forty two thousand one and N I S T A I risk management framework, saving you time and money while helping you build customer trust, plus you consume line security reviews by automating question and demonstrated your security posture with a customer facing trust center. All power by vent to A I over eight thousand global companies like LangChain lea A I in factory A I use vented to demonstrate A I trust, improve security in real time. Learn more eventide c com slash N L W that's ventadour com slash N L W.
Today's episode is brought to you, as always, by super intelligent. Have you ever wanted an A I daily brief, but totally focused on how A I relates to your company? Is your company struggling with A I adoption, either because you're getting installed, figuring out what use cases will drive value or because the AI transformation that is happening isolated individual teams, departments and employees and not able to change the company as a whole? Super intelligence has developed a new customer internal podcast product that inspires your teams by sharing the best A I use cases from inside and outside your company.
Think of IT is an A I daily brief. But just for your company's A I use cases, if you'd like to learn more, go to A B superdad I slash partner and fill out the information request form. I am really excited about this product, so I will personally get right back to you again.
That's be super da eyes slash partner. Welcome back to the A I daily brief, the rumors that Donald trump was planning to a point and A I zar to oversee A I policy have come to formation. They are true, and we now know who that person will be.
Now these reports started surfacing a couple of weeks ago. First, there was reports that there would be a crypto S. R. To oversee the crypto agenda but then he came out the trump was also thinking about a pointing an A I leader in A K White house position and that in fact, those two things might be bundled together. In the immediate aftermath of these reports, there was a lot of support for this idea.
The center for data innovation said appointing in A S R signals that the incoming administration is placing A I at the forefront of its agenda, and rightly so, as the lead for federal AI efforts, the zar should focus on two key priorities to help to fill the president alex economic goals, accelerating adoption and safeguarding U. S. competitiveness.
And indeed, this is what I felt like this role was going to be all about. In other words, regardless of who was appointed, IT seemed like the criteria was going to be someone who would be hard driving at U. S.
Leadership in these areas. Well, last night, president elect trump took to truth social to announce his selection. He wrote pleas to announce that David o.
Sax will be the White house. A, I, N, crp, dos are in this important role. David will guide policy for the administration and artificial intelligence script currency two areas critical to the future of american competitiveness.
David will focus on making amErica the clear global leader in both areas. He will safeguard free speech online and steers away from big tech bias and censorship. He will work on the legal framework so the crypto industry has the clarity and has been asking for and can try ve in the us.
Trump's announcement post also noted that sex is going to be the head of the president's council of advisors on science and technology, which serves a policy advisory group consisting of private sector or and academic experts across a wide range of to contact disciplines. Of course, when IT comes to the A I encysted s are role. Given that its brand knew, we don't know exactly what IT will enter or how much authority sex.
Actually, some reports have suggested that the main function would be to act as a liaison between congress, regulating agencies in the oval office, ensuring policies coordinated across government. But as of right now, that speculation, a lot of the discourse following the announcement is all about sax himself. And what we know about this takes on these particular issues, sax is actually quite familiar to lots and lots of people.
And that specifically because he hosts the all in podcast, all in A K A. One of the very few technology shows that is consistently ahead of the daily brief on the charts is a conversation between sax to moth, pola habita, Jason carcanet and David freedman. G on that show, because of the nature of the shows content, we've had a long term chance to see how sex and is cohoes think about a wide range of issues.
Philosophical speaking, sax is definitely aligned with the little tech that was articulated earlier this year by mark and dresses and bent horowitz, their original post they wrote, little tech is our term for tech startup s as contract to big tech combs. Little tech is run independent on politics for our entire careers. But as the old soviet joke goes, you may not be interested in politics, but politics is interested in you.
We believe bad government policies are now the number one threat to little tech. We believe american technological supremacy and the critical role that little tech starts playing, ensuring that the prema y is the first class political issue on par with any other sax, for his part, has long warned of the excessive power of big tech companies, is particularly in relation to free speech. And while his resume is likely already well known to this particular audience, to briefly recap, sax started his ilk on value journey as a founding COO of paypal.
You might remember this photo from a few years ago about the paypal m. ohia. And you can see sax over here just behind Peter till.
During the social media era, he created an enterprise communications networking tool called gamer, which result in microsoft over a billion dollars. He was extremely active as an investor during the sad wave through his firm craft ventures. And one of his big place for the A.
I era was a slight competitor called glue. Generally, sax strength has been as an Operator that is an adviser in the started world, rather that as a technologist, he was one of the first to suggest extreme belt tightening in only two. This ensured his portfolio companies could extend the runway to wait out the difficult funding environment that followed.
Essentially, he has a strong knowledge of the pitfalls and blockers that can limit the success of startups and that could make a little place to remove those types of blockers from his position in the White house. One day we don't really know a ton about is how to things about eye regulation. He's not a super prominent A I investor and hasn't expressed a lot of strong views as he has around other things.
Then again, that might not be an accident when he comes to disappointment. So far, the trumpet administration's major policy on A I has been a promise to repeal the by an executive order with no clear position on what will replace IT and IT may be that if saxes Mandate is to tear down the barriers to U. S.
Dominance of the A I sector, the role is less about A I, and more about cutting red tape. Sax was certainly one of the earliest allowed as trump orter since licking valley. He hosted three hundred thousand dollars.
I had fund raiser at his house over the summer, which broke the kind of silence around trump's ort among leaders and tech. Back in june, he wrote a long post on twitter, slash x called why on back in president trump, he went through his reasons, including the economy, foreign policy, the border and long fare. And ultimately, that fundraiser and the conversation that went with IT wasn't important moment in shifting the over to window when I came to tech.
And trump now in his role as White house adviser. Reports are that he won't be required to divest of his business interest as the role is only part time. However, under ethics rules, he will be required to practice himself from decisions.
That impact is holdings. Sax is not an uncontroversial person, and there are some who expect very negative things from this. Why commentor founder program over the summer said, I know he's an awful person from things he's done.
Crp, ta trader lot with weakened, seen, said the levels gripes in corruption were gonna see will stand even the most dirty of real business men. The response from many in silicon valley, however, was resultance ly positive. So coa partner Shawn require posted the announcement with the comment is time to build burdon Brooks of VC at ohio based overlooked ventures wrote, people told me I should hate sacks.
We should be diametrically opposed reality. David gave me the most respect out of many people in the world, adventure, engaged and intellectually onest congos and open my eyes. Congrats, David sacks earned A I entrepreneurs ready rights.
Congrats the David sacks. I hope he continues to be a huge fan of open source ai. I'm just massad the CEO rep. Lett said. This is fantastic. Sax in his firm craft have been one of the most useful and thoughtful investors we've had, so i'm excited to see him bring the same spirit to the rest of the country. Even say men squeeze out to make endorsement tweet congrats to ZARA David sax, with elon mok responding with the crying, laughing a moggy.
For what it's worth now, when IT comes to this question of whether IT really makes sense to have A I encrypt to bundle together criteria, jacor vin ski had, I think, the best take here, he wrote, for those questioning if crypto N A, I belong in the same policy portfolio, realized that the prime director for government to follow on both is this get out of the way? Yes, the time will come for new laws and regulations. But first, David's act can free builders to build.
Then again, maybe the most important point is the one for mister ino, who says David's x appointment proves you can do anything you want. If you podcast part enough, can think of a Better way to rap this episode. Appreciate you listening or watching, as always, till next time peace.