The Agent Exchange: Practitioner Insights

2025/3/3

MLOps.community

People

Chiara Caratelli

Dmitri Jarnikov

Steven Vester

Topics

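Dmitri Jarnikov: I think building AI agents requires a balance between generality and specialization. Specialized agents are more precise at specific tasks but do not scale; generic agents scale but are less precise. The ideal future agent should be reasonably specialized in its reasoning and should combine generic and specialized execution, falling back to generic execution when needed. This means looking at the degree of specialization along two dimensions: reasoning and execution. For example, a food ordering agent needs to understand the user's preferences, build the order, and place it on different food service platforms. On the execution side, everything from generic agents that can use a computer or a browser to specialized agents that only work on specific websites has its use cases. OpenAI's Operator is an example that combines generic reasoning with mixed execution.

Chiara Caratelli: I started out working on generic web agents but quickly moved to more specialized ones. In the end, I found that a balance between the two works best. The strength of agents is that they can adjust their workflow dynamically based on the situation, just like a person would. Agents can call other agents as tools, so very specialized agents can still be managed by a generic agent, and vice versa. For example, a specialized shopping agent can browse the large platforms very well, but on smaller websites it can still act as a generic web browser. Compared with earlier models, agents make this mixed approach much easier. Recent tools such as OpenAI Operator show the same trend: they can integrate quickly through APIs and also browse the web in general. From the point of view of customer interaction, the biggest gains of the mixed approach are in speed and success rate. Specialized agents have a higher success rate in their own scenarios, but for tasks that allow a long running time and many calls, generic agents can still do well.

Steven Vester: I think the adoption of AI agents will be iterative, similar to how we work with a human assistant. At first we hand the agent simple tasks, such as buying a coffee. As trust grows, we gradually give it more complex tasks, such as writing a report or doing market research. Today, generic agents are not good enough for complex tasks, so specialized agents will succeed first. Users will try and come to trust specialized agents first, and then move to generic agents as those improve. This is similar to how online payments spread: at first people only used them in specific situations, and now they are an everyday way to pay.

Demetrios Brinkmann: I think e-commerce platforms will first take defensive measures against AI agents but will end up cooperating with them. Platforms will use anti-scraping measures such as IP address restrictions and honeypots, but these are unlikely to work in the long run. As AI agents spread, platforms will change strategy and work with them, for example by opening APIs or giving access to vector databases, which changes their business model. That requires platforms to move away from monetizing traffic alone. Platforms can also use their own recommender systems to give agents pre-ranked results, and can even include advertising content in what the agent returns. In the end, user experience will decide how platforms and AI agents work together.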

Buckle up and strap in. I grabbed three friends to talk about two topics related to AI agents in the e-commerce space for the great practitioners discussion. Number one, are we heading towards a world of generic agents or specialized agents? And number two, what is the future of companies making shopping agents?

Let me introduce the three awesome guests I brought in. First, Steven from OLX, who is the head of product there, working on Gen AI efforts. He's focused on building AI-enabled solutions and doing a great job, some of which you may have heard of from our last AI Agents in Production virtual conference, like OLX Magic. I've also brought in a friend of the pod, Chiara, who's back for a second appearance.

She's a data scientist on the Prosus AI team. And if you haven't listened to the episode where we dove into web agents, we talk about all the work she's been doing, testing out different frameworks and eventually building out their own web agent framework for the Prosus AI efforts. And last but not least, the most opinionated man in AI, Mr. Dmitri, the Senior Director of Prosus AI Research.

This man's a deep thinker. For the past few years, I've had the pleasure of being able to converse with him about all topics AI. And he's been working on Gen AI products, including them good old agents. It's always nice when I get to learn from him.

As a reminder, we are using a simple definition of AI agents, which is AI that can reason, execute tasks on a user's behalf, and use tools. So a very simple TLDR way of putting it is AI that can break out of the chat box. Let's get into our first topic of the day. I want to bring in our guests and hear their hot takes. So, generic versus specialized agents? Dmitri, you were introduced as the most opinionated man in show business. I'll let you kick it off with what you think the future holds when it comes to generic or specialized.

Well, Demetrios, thank you very much for the introduction, and also for giving me the pleasure of being the first person to give an opinion.

Well, I think when we talk about agents (and I very much like the way that you framed the discussion), first of all, we need to actually look at what types of agents are being built, right? Because in the past couple of months, we have seen a lot of companies, groups, and research teams putting out either commercial products or even open source frameworks that provide functionality for people to build agents.

And roughly, at least in my opinion, we can group them into a number of categories. So the first category is what I call generic agents. In the extreme case, these are agents that are not specialized in terms of reasoning about the task and also not specialized in terms of execution of the task.

So think about computer use, for example, an agent that can take over your computer and just do whatever you ask it to do. Think about a web agent that can go online and use your browser to shop for a flight, shop for an item, or just do some kind of web research. So these are agents that most of the time are not specialized. And then on the other side of the spectrum, you've got agents that can do only one task, and only on one website or on one service. Sometimes they are so specialized that they require a specific API to be able to execute the task.

So when I look at this, I think that, well, specialized agents in general are the most precise in doing what they are required to do. But business-wise, this is not a scalable approach. Imagine that you are building a solution that needs to shop online. You cannot really build an agent, or a family of agents, that can be integrated with every shopping site in the world, right? So it just doesn't scale.

On the other side of the spectrum, if you've got a very generic agent, what we see is that they are a little bit good at everything, which means that they are good at nothing in particular. So they can do a task which requires a couple of steps of execution. The most common figure that we've seen or heard of is that they can do up to six steps in sequence before they hit the wall. And if your task falls within that short execution pattern, you're fine. But if you've got a complex task that requires 10 or 20 steps, these generic agents just cannot achieve a result.

So when I look at an agent, I'm looking at two dimensions of implementation. I'm looking at reasoning capabilities and I'm looking at execution capabilities. And in each domain, I'm looking for specialization.

So in reasoning capabilities, is it something that is not specific at all? Is it something that's somewhat specific? So I'm thinking about a food ordering agent, right? It's an agent which understands that to order food, you need to understand the preferences of people. You actually need to understand people. You need to build what is called a food ticket. And then you need to go online, for example, and visit a couple of services providing food to make an order.

And then on the execution part, I'm looking, again, at a range from agents that can just use your computer or your browser and are absolutely not specialized, to agents that do well on specific websites.

So if you look at the latest product that we saw, we saw Operator from OpenAI. In my opinion, this is an example of an agent that is fairly generic in the reasoning part and has a combination of generic execution with specialized execution. And my opinion is that the future is for agents that are reasonably specialized in reasoning and have specialization in execution, with the possibility to fall back on a generic execution pattern.

A bit of a hot take there at the end when it came to the specialization in reasoning, but got to be honest with you, man, I was hoping for some stronger opinions. That's okay, though. It's just a warm-up session. I get it. We've also got Chiara to take us through some of this, because, Chiara, you've been working with the generic form of these agents for a while. What are your thoughts?

Yeah, so actually, I started working on a generic form of web agent, but then quickly deviated towards some more specialized agents. And in the end, a good balance was something in between, like Dmitri just said, because the powerful thing about agents is that there is no fixed workflow: they can adapt based on what's in front of them, right? Kind of like a person would. And agents can call tools that can be agents themselves. So you can have very specialized agents that are still managed by a generic agent and vice versa.

So one option could be having a specialized agent for shopping that knows how to browse the bigger platforms very well, but can still act as a generic web browser when it comes to smaller websites that it has never seen.
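Editor's sketch: the mixed approach described here (a generic manager that treats other agents as tools, using a specialized agent where one exists and falling back to generic browsing otherwise) could look roughly like the Python below. All names (`ManagerAgent`, `big_platform_agent`, `generic_browser_agent`, the `.example` hosts) are hypothetical and only illustrate the routing idea; no real agent framework or API is implied.

```python
from typing import Callable, Dict
from urllib.parse import urlparse

# A "tool" is any callable that takes a task and a URL and returns a result.
# In this sketch the tools are themselves (stub) agents.
Tool = Callable[[str, str], str]


def big_platform_agent(task: str, url: str) -> str:
    """Hypothetical specialized agent tuned for one well-known platform."""
    return f"specialized: {task} via tuned flow on {urlparse(url).netloc}"


def generic_browser_agent(task: str, url: str) -> str:
    """Hypothetical generic web agent: slower, but works on unseen sites."""
    return f"generic: {task} by browsing {urlparse(url).netloc}"


class ManagerAgent:
    """Generic manager that picks a specialist per host, else falls back."""

    def __init__(self, specialists: Dict[str, Tool], fallback: Tool) -> None:
        self.specialists = specialists
        self.fallback = fallback

    def run(self, task: str, url: str) -> str:
        host = urlparse(url).netloc
        tool = self.specialists.get(host, self.fallback)
        try:
            return tool(task, url)
        except Exception:
            # If the specialized path breaks, retry with generic browsing.
            return self.fallback(task, url)


manager = ManagerAgent(
    specialists={"big-shop.example": big_platform_agent},
    fallback=generic_browser_agent,
)
print(manager.run("buy a snowboard", "https://big-shop.example/snowboards"))
print(manager.run("buy a snowboard", "https://tiny-store.example/"))
```

The same shape works in the other direction: a specialist can hold a generic agent as one of its own tools.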

So with agents, it's much easier to have this kind of mixed approach compared to previously. I think this discussion, generic versus specialist, predates agents. It was there with smaller machine learning models versus bigger models. It's always been around, but I think agents broke this pattern because it's much, much easier to dynamically choose what's best.

And we see it also with recent tools that came out, like OpenAI Operator. They made a call to action for websites to integrate with them. So they prefer to use APIs to integrate and do things fast, while they can also browse the web in general. And I think from the point of view of customer interaction, the biggest change is in speed. Of course, there is also success rate: it's much better for specialized agents if they have their specialized scenario in front of them. That's a fact. But if you have a long enough time and a large enough number of calls you can make, like with reasoning models, you could still get there. The problem is that if you're interacting with a person in real time, this is not possible.

So this is why specialized agents are still very relevant. If you have something like deep research (we saw it with Gemini, and OpenAI also recently released this), then you can have a generalist web browsing tool that can be slow. They don't care how long it takes, but it gets to the answer and the user is happy at the end.

Speaking of users, Steven, let's bring in your product mindset. I know you like to think about how users are going to react to these new tools they get. Where do you see the world of generalized versus specialized going?

That's right. So, yeah, let me bring in the user angle, because I think I wouldn't be doing my product background justice if I didn't.

Look, the bottom line is I don't believe in sort of zero to one full adoption straight away. I think it would be very iterative. And I think a good way to think through this, or at least the way I think through this, is with an analogy of a human assistant. So imagine that you get an assistant today and imagine they're, you know, just finished uni and they're going to help you with whatever.

Probably at the start, you will just ask them, okay, maybe get me a coffee. Hey, maybe, you know, here's some raw notes I wrote, maybe prettify them, maybe use ChatGPT to do so. But essentially, you'll give them very small chunks of tasks. And then, without knowing their capabilities, I will trust them to do that small thing well, like getting me that coffee.

Then over time, as my trust builds, I might dare to give them a bit more complex tasks. So I might ask them to write a certain report or research a certain topic or create a plan for a certain area.

I will change very much how much context I give to that person. And I will also do it only when I have a proven track record of their capabilities and my trust in them grows. And I expect this analogy of how we would use a human assistant in our lives to actually work very well for how we're going to be adopting agents and assistants in our shopping and throughout our entire life anyway.

So that means, you know, we'll start off with a very small specific purchase and we will see how it does there. For example, buy me movie tickets to movie X on Saturday night, right? In this cinema. So there's not much to mess up there. Essentially, my trust grows and capabilities grow. I might do more complex stuff. Hey, buy me a new snowboard, right?

Now, that framework of actual abilities versus perceived abilities: I think those are the two dimensions that are going to be needed for people to trust agents more and more, for agents to cover more and more parts of customer journeys, and for users to accept that. So the truth today, and this is what Chiara and also Dmitri already said, is that generic agents are just not good enough to really do stuff. There are errors; they will make slightly dumb mistakes here and there. So today you need a specialized agent, one that is catered to one use case or one particular platform. Those are actually going to be able to do certain things. And so I suspect, if we follow that framework of, you know, actual capability development and then my perception of that capability development, we're going to see the first real success cases with those specialized agents. So customers are going to be trying those and adopting those and building some confidence with those. And so you're going to start to build some trust with that agent in that domain. And just because the capabilities aren't there, I think specialized is the only way today.

Now thinking long term, I don't think there's a moat here. I think that capabilities are a short-term problem, and our perception of capabilities is also a short-term problem. We have o3 now, and some great stuff coming out with DeepSeek. There's obviously no sign of stopping here. And so where are we several years from now? Well, I think several years from now, generic agents like computer use, like Operator, are going to be fantastic. And in that world, I don't know if I still need this specialized agent that I've come to appreciate and trust.

I could imagine, just like you use certain Google tools already now, it's just common, right? Of course you're going to use it. And I'm sure certainly Google is going to launch one and more labs will as well. So I would expect that over the longer term, customers are going to be moving to generic ones as their capabilities improve. And maybe one more analogy here.

I don't know if you remember how you started using online payments and mobile payments, but it feels very similar to me. You probably did like a very small payment somewhere and then you started doing it in more places and you did it first on those very specific checkout flows. There was no generic one.

But now you don't care, right? It's all just common, and you have maybe your PayPal or your Google Pay or whatever you use. I suspect a similar type of curve there. So, bringing it back to the discussion, you know, generic versus specialized: I think today it needs to be specialized because generic is not good enough. And I think the only way users are going to be adopting this technology is through these good experiences, small and then bigger, bigger, bigger, with specialized ones. And as the generic ones get better, there will be a move there, and then it's just going to be as common as online payments are today, with some big names that you use.

Hmm, okay. I like the idea of perceived capabilities and real capabilities and that delta between the two. I think that's one of the biggest disconnects that we have right now in the AI world. We see so many different demos around the internet and it makes us say, whoa, look at that. What can AI do? It's incredible. And then we go and try and play with it and it falls flat on its face.

I actually think that Steve made a very important point, right? And what I especially like about his argument is this historical perspective. I'm coming from a slightly different technical background. Many, many years ago, I was working in the IoT space, right? And we also had this discussion: okay, what will the future be in IoT in terms of network protocols? You remember those. Would it be IP, like for the rest of the internet, or would it be specialized protocols?

And everyone in the field was saying, well, you cannot do a full IP stack on those small devices. You need to be really specialized. You need to, you know, be real time. You need to be light. You need to be this and this and this, because otherwise it doesn't work.

What people forget about is what Steven actually suggests we think about: the temporal dimension, right? So yes, we cannot do many things now, but probably the most widely adopted, most generic solution will win, because yes, LLMs will get better. They will get better in reasoning, so yes, an agent will be able to say, okay, I can cover food ordering, I can cover buying items, I can cover buying trips or whatever.

So it will be better in reasoning over time. And I think it will be better in instructing the execution part of an agent to go online and do something.

So there is, I agree, a very high likelihood that in the future we'll just go for generic. But for some time, it will be indeed very much specialized agents.

Another aspect that I want to mention is that part of this discussion is actually not technical. You know, we're technical people. Steven, sorry, I'm bringing you into this group of technical people as well. But as technical people, we have a bit of a myopic view of technology, because we say, okay, this is what is possible, this is what is reasonable. What I'm also looking at, when talking about agents in e-commerce, is the dynamic between companies making agents and companies making websites or platforms, right? So you can say, okay, a generic agent will be able to do all the things on the platform, but most likely it will not be able to do them effectively.

So think about an agent which mimics how we behave on the platform. I don't know when you last did some shopping, but I'm absolutely sure I did not use an optimal path on Amazon or on Booking or any other website. So I clicked around like a monkey. I looked at things that probably were not relevant to the end result.

And now imagine agents doing exactly that. So specialization will still be valuable for platforms.

Let's take something into account, though. Companies make a lot of money on those random clicks around the website, because I myself have gone to a website countless times with the intention of buying only one thing and been enticed into buying a few extra items because of those damn recommender systems that know too much about me and my vices. So how do you cope with that in a world where agents just go and get you the thing, and in theory are not susceptible to recommender systems?

I did think about that. And the answer is that it's hard. Look, the ideal agent (let's say a buying agent, right, a shopping agent) can help you outline the right criteria for what you're looking to buy and then find every possible match, right? So recommenders are relevant. Ideally, this agent can go through the entire inventory on a certain platform and just find that perfect one. And actually not just one platform; it could be multiple, right, if we talk about generic ones. So...

This basically means that the better access this agent has to the entire inventory, the better job it can do finding and retrieving those things. It's probably not possible for that browsing agent to just go click page by page by page through all these millions of listings in some cases. So there will need to be some kind of other, like a middle layer here. If it's an external player, or even, let's say, a bit of a competitor (you could see it in a threatening way), that could mean they would have to scrape all your inventory all the time, keep that in a database, and then make that database searchable. So basically: scrape the inventory, create some kind of Qdrant database, and have the agent look through there.
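Editor's sketch: the "scrape the inventory, store it, make it searchable" middle layer described here can be illustrated with a tiny in-memory index. This is a toy under stated assumptions: the embeddings are hand-made 3-dimensional vectors rather than model outputs, and `ListingIndex` is a hypothetical stand-in for a real vector database, whose client API is not shown.

```python
import math
from typing import Dict, List, Tuple

Vector = List[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class ListingIndex:
    """Hypothetical in-memory stand-in for a vector database of listings."""

    def __init__(self) -> None:
        self._items: Dict[str, Vector] = {}

    def upsert(self, listing_id: str, embedding: Vector) -> None:
        # Re-scraping a listing simply overwrites its stored embedding.
        self._items[listing_id] = embedding

    def search(self, query: Vector, top_k: int = 3) -> List[Tuple[str, float]]:
        scored = [(lid, cosine(query, vec)) for lid, vec in self._items.items()]
        scored.sort(key=lambda pair: pair[1], reverse=True)
        return scored[:top_k]


index = ListingIndex()
# Toy embeddings; a real system would compute these with an embedding model.
index.upsert("snowboard-155cm", [0.9, 0.1, 0.0])
index.upsert("road-bike", [0.1, 0.9, 0.0])
index.upsert("ski-boots", [0.7, 0.2, 0.1])

# The agent searches the index instead of clicking through listing pages.
for listing_id, score in index.search([1.0, 0.0, 0.0], top_k=2):
    print(listing_id, round(score, 3))
```

The point of the design is that the agent issues one query against the index rather than paging through millions of listings in a browser.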

But another way could be that a platform actually opens that up. So you open your doors to agents and you make it easy for agents to find the right thing. So recommender systems become different. They're not showing you what is recommended, but rather giving you access to all the embeddings that have been calculated and stored in some vector database. And you just say, okay, here it is, look up what you need. So, yeah, I don't have a strict answer on where things will go, but just some thoughts.

If I may add to that, I think platforms could even use their own recommender model for that. So they could decide to expose an API where the results that they propose are already ranked by their own engine. And I believe that could even be a solution for ads in that regard, because they could add content that is boosted in that actual response, which the agent could use and then present to the customer. So I think there are a lot of ways here, and it will be really important to see how the user interacts with all these new modalities.

And to add to the previous point, there is this kind of conflict between generalist agents and specialized agents. But if specialized agents open new use cases and we see adoption through those use cases, it might be that these specialized agents become the generalists. And on the other hand, generalist agents can open the door for specialized agents by adding new modalities, new tools, and so on.
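Editor's sketch: the idea raised a moment earlier, that a platform could expose an API whose results are already ranked by its own recommender and could include boosted content, might look like the function below. The `Listing` fields, the `boost` value, and the response shape are all assumptions made for illustration; they do not describe any real platform's API.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Listing:
    listing_id: str
    relevance: float          # score from the platform's own recommender (assumed 0..1)
    sponsored: bool = False   # whether the seller paid for a boost


def ranked_for_agent(
    listings: List[Listing], boost: float = 0.1, limit: int = 10
) -> List[dict]:
    """Return listings pre-ranked by the platform, with sponsored items boosted."""

    def score(item: Listing) -> float:
        return item.relevance + (boost if item.sponsored else 0.0)

    ordered = sorted(listings, key=score, reverse=True)[:limit]
    # The "sponsored" flag is kept so the agent can disclose it to the user.
    return [
        {"id": item.listing_id, "score": round(score(item), 3), "sponsored": item.sponsored}
        for item in ordered
    ]


response = ranked_for_agent([
    Listing("bike-a", 0.80),
    Listing("bike-b", 0.75, sponsored=True),
    Listing("bike-c", 0.50),
])
for row in response:
    print(row)
```

Keeping the sponsored flag in the response is one possible design choice: the agent receives the platform's ranking but can still tell the customer which results were boosted.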

So yeah, I think customers are at the center of all this, and users will be really important.

Yeah. Okay. I want to highlight two things you said, which I have been thinking about at length. One is the recommendations that come back to you and the answers that are fed to the user. I could see that being something that you get when your task is completed. And then there's another part that says, do you want me to do any of these other tasks? That is becoming a fairly common UX design pattern we're seeing with agents these days. So it's not that big of a jump to think that that's going to be next. Then the second part that you said, Chiara, which I had not thought about, is the generalized agent being general at the beginning of the flow, and then kicking it off to a specialized agent to accomplish the task. And it doesn't feel like that far of a stretch, because there are some design patterns with agents where that's happening. So once we maybe build out a strong network of specialized agents, the generalized agent is just there to do air traffic control, in a way.

I think we'll enter a world where you want to build tools that agents can use, right? So you want to... And in that way, you could think of a specialized agent, internally built on a certain platform, just offering its capabilities in a sort of API to another agent, a generic one, like, let's say, Google's one or whoever will build it. And then you get a bit of a best-of-both-worlds thing. I think the problem there is not so much capabilities; it's rather a business strategy kind of thing. Do you want that? Right. And I think that's a good bridge to the next topic as well.

To add to what Steven said, we've been experimenting a lot now with LLMs being used as recommender systems. So an LLM that knows the user, knows what they have done, and can put a rationale behind all this behavior can actually provide input to another LLM during a recommendation. So...

There are already agents that work together with other agents to provide a better experience for both user and platform.

Since we are already kind of drifting into this space, we might as well talk about the second part of this discussion, which is around the question of what the future is for companies making shopping agents. And Steven, you brought in a few strong points, so I'm going to let you kick it off, because really, there are so many different parts to this. Where do you want to start first?

I'm probably not the best person to understand or explain what the companies building those agents will do. But I think I am well positioned to answer what existing e-commerce platforms will do in response. And this will then, of course, guide what those first companies will be doing.

So I suspect that e-commerce platforms (and I come from the classifieds marketplace world, so that's used goods, cars, also real estate and jobs) will start out being very defensive in the beginning, and then will move to embrace it. So what do I mean?

The defensiveness would come from the fact that these agents, especially if it's a third party shopping agent browsing your platform, it removes your customer connection, right? So it damages your brand loyalty. And these brands have many years of investment behind them, hundreds of millions in some cases.

It also makes it hard for these marketplaces to get clear customer signals of what's happening on their platform. As agents start browsing, you need to find some way to identify those and kill that noise. Otherwise, it will be very hard to do any customer research or build for real customers.

And I think a small analogy here is that of price comparison sites. I think we've seen that typically classified websites don't like to cooperate with those. So with those aggregator, integrator type players, because again, you disintermediate, right? You lose the connection to that customer.

This is why I expect platforms to start being defensive. And I think they will take defensive actions to make the platform harder to use for agents, not easier. So what are those? Well, think of the type of things we're doing against scrapers, right? So you have bot detection, you will look, of course, at the IP addresses where things come from, only allowing, for example, traffic from residential IPs, not data centers.
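Editor's sketch: the IP-based filtering mentioned here (allowing residential traffic, treating data-center traffic with suspicion) can be illustrated with Python's standard `ipaddress` module. The CIDR ranges below are placeholders taken from the reserved documentation blocks, not real data-center ranges, and the function names and the "challenge"/"allow" outcomes are hypothetical. A production system would rely on a maintained IP-reputation feed and on more signals than the address alone.

```python
import ipaddress
from typing import Iterable, List, Union

Network = Union[ipaddress.IPv4Network, ipaddress.IPv6Network]

# Placeholder ranges (reserved documentation blocks), NOT real data-center ranges.
HYPOTHETICAL_DATACENTER_RANGES = ["198.51.100.0/24", "203.0.113.0/24"]


def build_blocklist(cidrs: Iterable[str]) -> List[Network]:
    """Parse CIDR strings into network objects once, up front."""
    return [ipaddress.ip_network(cidr) for cidr in cidrs]


def is_datacenter(ip: str, blocklist: List[Network]) -> bool:
    """True if the address falls inside any listed range."""
    addr = ipaddress.ip_address(ip)
    return any(addr in net for net in blocklist)


def classify_request(ip: str, blocklist: List[Network]) -> str:
    # "challenge" could mean a CAPTCHA or stricter rate limits; "allow" passes through.
    return "challenge" if is_datacenter(ip, blocklist) else "allow"


blocklist = build_blocklist(HYPOTHETICAL_DATACENTER_RANGES)
print(classify_request("203.0.113.7", blocklist))
print(classify_request("192.0.2.10", blocklist))
```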

In anti-scraping practices, you have things like honeypots, which are essentially, you know, once you identify a scraper, you don't block them, but you in fact send them to a piece of your database or a piece of your site that a normal user would never see. You just start generating random data to overload them and essentially poison the data that they're scraping. They wouldn't know anymore what's good and what's not. These kinds of tactics, and I'm sure there are many more, will probably start to be deployed by platforms who don't want agents on their terrain.

And I don't think that will work so well. So now my story changes. I think that they will try, and I think maybe for a while that's how it will be. But then at some point we will see that agents are here to stay. They will permeate society, with lots and lots and lots of users. So then it becomes a kind of, if you can't beat them, join them. And then I actually expect that, yeah, we'll try to make it easier for other agents to be on our platform. So those are the kinds of things I mentioned earlier: having some APIs open, or allowing them to look through our vector store or catalog or whatever. There are multiple ways we could help agents and give them tools to be more effective on our platform.

You can already see that this is what OpenAI intends to do with their Operator, right? They want to have partnerships where they can tune towards those sites. And I think trying to become a partner and trying to help a tool like Operator be more efficient is something that platforms will start doing. And this will really mean a different monetization model for classifieds or marketplaces. It means that the traditional way of just selling eyeballs and views is gone. So we need to have other ways. And there are ways, I'm certain, but it will really change the business model quite fundamentally over time. So yeah, I think we'll start off defensive, and I think that we'll have some success early but then fail, and then I think we're going to change. And when I say we, I mean every single marketplace in the world is going to open up for this, because if they don't, they will lose to someone who will. As an end state, you could think about it like this: a marketplace brings together supply and demand. So if you don't open up, you know, I could just imagine some third party, even a decentralized one, right, some blockchain database where all the offers sit and where all the demand can just look for them. That's possible. So you'd better open up and make it easy, because that would give you a right to play.

Well, Dmitri, I know that we have talked at length about whether you expose your data to the internet or keep it for yourself to create a very specialized agent for your platform. What are your opinions here?

Indeed, I think it's a very interesting question. Essentially, if you look at what is happening with agents nowadays, this is also somewhat related to our discussion about generic versus specialized, right? Because a generic agent, in a way, is a stick in the hands of the company making it. And the carrot is: we can make a more specialized agent, and we can have an agreement with a platform, right? So the platform has a choice at the moment, which is either to collaborate with agents, or to acknowledge that there will be these generic agents that are indistinguishable from normal users, coming to your platform and shopping on it anyway. And I agree with Steven. In my experience as well, when I talk to different platforms, the immediate reaction is: we're going to block it.

But this reaction lasts for about 30 seconds, because it's a cat-and-mouse game. And people very quickly understand that, well, if you start blocking it, you make life for actual users so much more difficult that it becomes impractical.

Now, one of the analogies I use when I think about this: you remember the story of Spotify and the studios, the music studios, right? So you basically have artists, right? And then you've got labels. I call them studios; I probably should have called them labels. And labels' main focus is usually to find talent and basically sign it, so the artists become part of a label. So if they produce anything of value, it's to be sold by the label. And now Spotify comes and Spotify says, look, we're going to let people listen to what they want. And we're going to completely ignore the way that you package stuff. So we're basically not going to care about your interests.

And to me, it sounds very similar to what is happening with shopping agents and the platforms at the moment, because the risks are all the same, right? It's what you mentioned. The view of users disappears. So labels do not know what people like to listen to, and as a result, they cannot do their job well.

And the way that the risk profile is presented is also very similar, because with Spotify it was: okay, we're going to go around you, labels, and source musicians. And now with AI agents, shopping agents, it's very similar, because the risk that we see is that you've got this agent which goes around the platform and just sources goods and services from suppliers directly. But what happened eventually is that labels and Spotify struck a deal where Spotify says, look, we're going to focus on serving user needs. And you're going to focus on making sure that the pipeline of artists does not dry up, because this is what you're good at.

And to actually have a mutually beneficial relationship, we're going to share usage data with you so you can make your decisions better. And because we're now going to use all your catalog, we'll also give you a share in our company. So if we grow, you grow. So they created this codependent relationship, which also allows each of them to focus on their strengths.

So if I look at what is happening now, especially with OpenAI Operator, I see exactly the same dynamic, because you've got exactly the same carrot and stick. OpenAI says, look, we can use a generic web agent on your platform. So we're still going to play; well, we're going to allow users to buy something from you.

But we'd better focus on user experience. So let's work together. If you allow our agent on the platform, if you do not block us, if you present an optimized version of your website or even give us API access, we give you user data so you do not lose it, to your point, right? So you keep knowing who your users are, and then you can start focusing more of your energy on getting suppliers in, on building this domain intelligence, and on allowing us to actually just focus on customizing and personalizing the experience of users.

So in a way, for platforms, it could be a blessing, right? Because on the platform side, you actually now do two things. You basically need to deal with your sellers (or, for iFood, with restaurants, or, for Takealot, with suppliers). And at the same time, you put a lot of effort into fostering this user relationship and into building user interfaces. So the AI agent says, okay, we're going to take this part from you, but in exchange, you're still going to get user data.

So this was my analogy. I think I could have presented it somewhat better, to be honest. But this is the thing. And actually, looking at our companies (and I think we've discussed this back and forth a couple of times already), the first reaction is: yes, we're going to prevent agents from going to our platform. You know the second most typical reaction?

They say, we're going to build our own shopping agent. And as a technical person, I'm like, yes, let's do a Prosus e-commerce agent, because this would be a fantastic opportunity for us to apply our technical expertise, to have fun and so on. But from a business perspective, you just need to ask, okay, so you're a platform, which game do you want to play?

And if you build your own e-commerce agent, for example, how do you maintain one of the core value propositions to users, which is multi-platform?

Right, the idea is, if I go to an agent and say I want to buy this bicycle in Poland, I need to be able to go to OLX Poland, I need to be able to go to Allegro, who is a competitor, I need to be able to go to, what, Amazon, in case they've got a well-priced bicycle, and so on. So if it's my agent, to have a value proposition I basically have to open it to competitor businesses, right?

I like this idea of the distinction between sourcing and shopping agents in B2C versus B2B. One thing that you make very clear here is that you're basically creating a new platform that is almost an abstraction of many different shopping platforms. And if you're not careful, you can quickly lose sight of how difficult that is by just thinking, oh, here, we're going to create an agent that can go and do everything that 10 different platforms can do. But let's keep this rolling. Chiara, I know you have some strong thoughts, so I want to hear what you have to say about this.

Yeah, so actually my opinions are very similar to Steven's and Dmitri's on this, that now we're in a stage where platforms are trying to block them because it's a big change. Because I feel like platforms' interests and user interests don't always align. So as a user, I want to find the best deal possible. A platform may want to present certain items that maybe are not the best deal for the user, but are the best deal for them. So, yeah, I think this is a bit scary in general for platforms. But users will expect more personalization, will expect an agent to help them discover products. And I think they will kind of expect that the agent will give them the best deal possible in order to trust this agent.

We're not yet there. Users don't trust agents. We're still in the first era of online payments, to give the analogy that Steven gave before. But as more trust will be put into the agent, more expectations will come from customers. And I think the platform will start to give in on this.

There are also more use cases that could be enabled. We've been thinking a lot about this, because there is a bit of a doom scenario. Like, it's very easy to list all the potential drawbacks that agents could have for platforms. They're clear. But there are also new opportunities. For instance, they could automatically integrate with smaller vendors and present the products directly and have an automated pipeline system to integrate with their own payment system. So a user could search for items on the platform itself using an agent and then purchase the product from another platform. So it will be much easier to work as aggregators here.

The other is sharing customer data. I think in future recommender systems it will be more important to give a rationale, to really understand the user, not just having a fine-tuned deep learning model on user and item embeddings, but actually having something that can give an explainable answer.

So we recommend you this product because you've been browsing this and that, because you like this type of items, because next month you're going to run a marathon and I need to give you the best shoes. So all these will enable a lot more interactions with customers and customers are going to expect them.

So there is one thing that you mentioned that I think we did not cover at all. And I also, to be honest, do not see it being covered in general, which is selling agents, right? So we basically say, okay, shopping agents. So I, as a user, would like to go and purchase something.

But there is so much need also for selling agents. It could be that I, as a user, have something I don't need and want to sell, or I, as a manufacturer, would like to offer this product, or I, as a service provider, have this to offer.

And my guess, and it's pure speculation, is that we're not talking about selling agents at the moment because there is no infrastructure, if you will, to connect a selling agent with a shopping agent, right? So this is a part which is missing. But the moment somebody comes up with this infrastructure, there will be a gate opening to building selling agents. And then the relationship between selling, shopping and the platform in the middle will change once again, and we'll have another discussion about trying to predict the future.

Yes. Trust is so important. And I want to second that. I have always thought about trust from the perspective of, if I'm using an agent and I give it my credit card number, am I going to have money in my bank account tomorrow? Or is it just going to drain it because it doesn't understand the requirements well enough? But here you're talking about how quickly we can lose faith in agents if they're not giving us the best deals. Yeah.

You can see a world where, if buyers are being shown answers that someone paid to get in front of them, they will quickly revert to doing their own research because they don't get the best deal. So it's a very fragile situation. And how you play that out in the future is going to determine a lot of things.

So in our industry, a big part of our customers, and where we earn most of our money, are professional car dealers and professional real estate agents. And it's the same as what you're saying, Dmitri, but some thoughts on that. Yes, you could have an agent that would help them, you know, put all their inventory everywhere. But those tools already exist today. They have tools already and just standard API connections. You know, there's software that allows you to, I think they're called multiple listing systems in our industry. So they have an inventory management system and they can just select like, oh, I want this car on these five platforms, right? And then the API connection just handles it.

Sure, you could automate more pieces, right? For example, there would be customers calling or sending a message and you could automatically reply to them to make your selling process easier. But in the end, they typically want to stay very close to that potential customer because that's how they can sell, that's how they can use their skill.

It's a very interesting domain. I've been thinking about whether seller agents make sense in our context, so for example, for car dealers and real estate agents. And I'm not certain, because I'm not certain what use they will have beyond tools that already exist today.

Yeah, maybe it could scale more. You could be on more platforms, but typically there aren't that many classified type platforms in a market, right? You will maybe have like three, max four or so relevant ones. So it's a very interesting one. It might be different indeed for, let's say, a producer of a good who just wants to sell to every market in the world. They don't care. But when it comes to these kinds of selling agents, I'm not certain how the future will look.

Definitely interesting. Maybe it's a topic for another debate. Well, folks, I think we are coming to the end of the great practitioners discussion. I want to thank Dmitri, Steven and Chiara for being here with us today to give their opinions on what the future may look like around these two topics and really being intellectually honest with what is real right now versus what is a little bit more hypey.

This was a very special production of the Prosus and MLOps Community AI Agents in Production collaboration. It is the fifth episode of the miniseries that we kicked off. I was able to spend a week at the Prosus AI offices and embed myself in their teams, learn about all the agent use cases that they're working on, and then bring the different experts into the studio with me to explain what they've learned while building out these agents.

Since it is something new, I would love it if you gave me your feedback by either leaving a comment on Spotify or YouTube or dropping in a review on Apple Podcasts or wherever you listen to your podcasts, because it helps me so much to hear the feedback from you all. If it's too much to leave a review, just DM me on Twitter or LinkedIn or Bluesky even. Hearing your feedback is a great way for me to convince people like the Prosus AI team that it is worth their time to take the day off and come and chat with me in the studio. Lastly, these awesome people that I've been hanging out with at Prosus are hiring, so we'll leave a link to all their job openings in the show notes. Check it out, and peace out, everybody.
