Marketplace lessons from Uber, Airbnb, Bumble, and more | Ramesh Johari (Stanford professor)
Ramesh Johari is a professor at Stanford University focusing on data science methods and practice, as well as the design and operation of online markets and platforms. Beyond academia, Ramesh has advised some incredible startups, including Airbnb, Uber, Bumble, and Stitch Fix. Today we discuss:
- Published
- Published Jun 14, 2024
- Uploaded
- Uploaded Jun 14, 2026
- File type
- YouTube
- Queried
- 00
- Source
- youtube.com
Full transcript
Showing the full transcript for this video.
AI-generated transcript with timestamped sections.
[00:00] Marketplaces are a little bit like a game of whack-a-mole. One example that I came across with one of the companies I worked with that I love is our new supply side was having a pretty bad experience. So what we decided to do is build some custom bespoke features that were really going to direct them to more experienced folks on the other side of the market. [00:19] Good. [00:19] And then, yeah, lo and behold, you know, pretty soon those metrics start to look better. But then we're looking at we're like, wait a second. Now, you know, the existing folks on the other side are having a worse experience. So you kind of whiplash around. You're like, oh, wait a second. We better do something about that. So we take them. We try to match them up with the more experienced folks. And now suddenly, a month after that, you're like, you know, wait a second. And your metrics just keep moving around. And that's because the whack-a-mole game here is ultimately a lot of marketplace management is moving attention and inventory around. [00:49] And rolling with those changes is about recognizing whether the winners you've created are more important to your business than the losers you've created in the process. [01:00] Today, my guest is Ramesh Johari. Ramesh is a professor at Stanford University. [01:05] where he does research on and teaches data science methods and practices [01:09] with a specific focus on the design and operation of online marketplaces. [01:14] He's advised and worked with some of the biggest marketplaces in the world, including Airbnb, [01:19] uber stripe bumble stitch fix upwork and many others and in our conversation we get super nerdy on how to build a thriving marketplace including where to focus your resources to fuel the marketplace flywheel of growth why data and data science is so central to building a successful marketplace how to design a better review system why as a founder you shouldn't think of yourself as a marketplace founder but instead simply as a founder also how ai is going to impact
[01:49] marketplaces and experimentation and so much more. If you're building a marketplace business or thinking about building a marketplace or just curious, this episode is for you. With that, I bring you Ramesh Johari after a short word from our sponsors. [02:03] This episode is brought to you by Sanity. Your website is the heart of your growth engine. For that engine to drive big results, you need to be able to move super fast, ship new content, experiment, learn, and iterate. [02:33] constraints of outdated tech. Forward-thinking companies like Figma, Amplitude, Loom, Riot Games, Linear, and more. Use Sanity to build content growth engines that scale, drive innovation, and accelerate customer acquisition. With Sanity, your team can dream bigger and move faster. As the most powerful headless CMS on the market, you can tailor editorial workflows to match your business. Reuse content seamlessly across any page or channel and bring your ideas to market without developer friction. Sanity makes life better for your whole [03:03] team. It's fast for developers to build with, intuitive for content managers, and it integrates seamlessly with the rest of your tech stack. Get started with Sanity's generous free plan, and as a Lenny's podcast listener, you can get a boosted plan with double the monthly usage. Head over to sanity.io slash Lenny to get started for free. That's sanity.io slash Lenny.
[03:26] This episode is brought to you by Hex. If you're a data person, you probably have to jump between [03:33] visualizations, write Python, and send around a lot of screenshots and CSV files. Hex brings everything together. Its powerful notebook UI lets you analyze data in SQL, Python, or no code, in any combination, and work together with live multiplayer and version control. And now, Hex's AI tools can generate queries and code, create visualizations, and even kickstart a whole analysis for you, all from natural language prompts. It's like having an analytics copilot [04:03] Then, when you're ready to share, you can use Hex's drag-and-drop app builder to configure beautiful reports or dashboards that anyone can use. Join the hundreds of data teams like Notion, AllTrails, Loom, Mixpanel, and Algolia using Hex every day to make their work more impactful. Sign up today at hex.tech.lenny to get a 60-day free trial of the Hex team plan. That's hex.tech.lenny. [04:28] Ramesh, thank you so much for being here. Welcome to the podcast. [04:35] Thanks so much for having me, Lenny. It's great to be on. [04:38] It's great to have you on. [04:39] A big thank you to Riley Newman for connecting us. Riley was the first data scientist at Airbnb and head of data science at Airbnb. [04:47] And that role is actually a really good microcosm of what we're going to be focusing on in our chat today. [04:52] we're going to get super nerdy about marketplaces and experimentation and data.
[04:57] I know that's your jam. [04:59] Are you ready to dive in? [05:00] I really am. Yeah. And I actually want to thank Riley, too. You know, I got to know Riley when I was at Odesk, you know, first as a research scientist and then I directed their data science team. This is like way back in 2012. And, you know, I was looking around for people who are experts on how we think about data and marketplaces. And Riley Newman came up. And so I invited him to come talk to us at Odesk. And we've stayed in touch since then. [05:30] had a kind of lengthy career now thinking about those kinds of problems. So I'm pretty excited to talk about it with you. [05:37] Let's start broad and set a little foundation. [05:40] You have a really interesting way to describe what a marketplace business even is. [05:45] So, [05:45] Ramesh, what is a marketplace business and also why is data so important and such an integral part of building a successful marketplace business? [05:53] It's interesting when people sit down and think about, say, Airbnb. What does Airbnb sell? An average person is like, oh, that's pretty obvious. Airbnb sells rooms. I go there to book a room I want to stay at. Other people say, oh, what does Uber sell? Uber sells me rides. I use Uber when I need to get a ride from somewhere to somewhere else. [06:23] selling you listings. The drivers on Uber are selling you rides.
[06:28] Bye. [06:29] Uber and Airbnb are selling you something, the taking away of something, which is a weird thing to think about. What they're taking away is the friction. [06:37] of finding a place to stay. They're taking away the friction of finding a driver. In economics, we call those things transaction costs. When you take econ one, you learn about markets and how supply meets demand and we get prices out of that. [06:51] What you don't learn until Econ 201 is that markets don't always work. One of the reasons markets don't always work is because we have what are called market failures due to the presence of these kinds of frictions. What's a market failure? It's that Lenny wants to get from Palo Alto to Burlingame and he can't do it. Why can't he do it? Because he doesn't have anyone to drive in. [07:12] Why doesn't he just call someone to drive him? Well, who's Lenny supposed to call? Who are those people? Are they out there? Are they willing to drive him right now, right at like 10 a.m. on a Friday? Are they willing to take him somewhere? [07:22] When I want to stay somewhere when I'm traveling, a friction is who's willing to give me their room? I mean, in principle, there's people who are willing to let me stay in their living room, but I don't know who they are. Those are frictions. [07:33] What the marketplaces are selling you is taking the friction away. That's what you're paying them for. And it's an important observation because what that means is the marketplace is customers. [07:43] aren't just the people buying the rides or buying the listings. Actually, the hosts are Airbnb's customers. And the drivers are also Uber's customers, right? So both sides of the marketplace are the customers of the platform. Both sides depend on the platform to help the platform take that friction away. Because just like you want a place to stay or you want to ride, the driver is at Uber because he wants to earn money by taking people places. And the host is on Airbnb because they want to earn money by selling their listing. I think this concept that we're making
[08:13] transaction costs away is such a fundamental idea that's misunderstood around marketplaces, that when you're an entrepreneur starting a marketplace or [08:22] Thinking about your business model, I think you can be wildly off if you forget that that's the thing that's fundamentally your value proposition. Then you asked about the role of data and more broadly data science in marketplaces. [08:37] It's an interesting thing, right? [08:39] The example I always love to give are the ancient agoras in Greece or Trajan's Market in Rome. [08:46] You know, when you look at pictures of these things, what really stands out to me is the rock. I mean, these things are made of stone. It's not like you were going to move a booth from one place to another place without moving a lot of rock from one place to another place. [09:00] So you flash forward to 2023, and here we are with technology undergirding pretty much every kind of commerce now. And it means we can architect and re-architect the marketplace kind of on the fly, and we really are doing it all the time. And these frictions that are getting taken away, they're getting taken away because of data and data science. So I really want to like highlight kind of three pieces of this for people. [09:23] I want you to think of them as a cycle, but to start with, let's just lay them out one at a time. [09:29] One of them is finding people to match with. [09:33] So that's the problem of I want to stay somewhere. [09:36] who is out there who's willing to let me stay with them on a given time frame. And then if I'm a host, I have a listing who is out there who's willing to stay at my place when I have it available. So that's finding matches. Then there's making the match. And so here, going back to my time at Odesk, a big problem that we dealt with there was if I've got multiple applicants to my job,
[10:00] Who should I hire? Who should I interview? It's a common problem we face in the real world, but now it's all remote. I don't meet these people in person. All I've got is this application they submitted to me. [10:09] I need help triaging that. So that's helping make a match out of possible partners you can match with. And then finally, we make matches. [10:18] Well, what did Matt just tell us, right? I mean, if you stay somewhere in Airbnb, you learn something about the host. You learn something about the listing. The host learns about you. [10:26] too. That's all information that the marketplace should feed back into. This is where we get to rating systems and feedback systems, even passive data collection. Did you leave your booking before you were supposed to leave? Well, maybe that's a sign that something can quite work out the way you want it to work out. That's passive data collection. Did you leave five stars? That's active data collection. Get all this back in and what does that do? Well, that lets us do a better job finding potential matches and make potential matches in the future. Every single thing I just said [10:56] Finding potential matches, making matches, and then learning about those matches, and then cycling back again. That is the data science of marketplaces. [11:04] And I feel like every marketplace that you could think of [11:08] in any vertical, [11:10] has those three problems to deal with and relies on algorithms and data science to help them solve it. And in turn, that is the, I think, really the underpinning of taking those frictions away. Many founders try to start a marketplace business, think about marketplace opportunities where they don't exist. [11:28] And there's often these recurring failures of types of marketplaces that just don't work in an area. I was just writing a couple ideas down while you're chatting. Like cleaners, getting cleaners as a marketplace doesn't seem to work ever.
[11:39] Car wash, there's a classic failure to. [11:42] like getting tasks. [11:43] Done for you on demand as a marketplace seems to not often work. [11:46] So this might be too big of a question, but I'm just curious if anything comes up of [11:51] When someone is starting a marketplace or thinking about starting a marketplace business, [11:55] What do you find are the most common flaws in like, this is probably not... [11:58] going to work as a marketplace. [12:01] That is such a fantastic question. I want to preface what I say with a couple of comments. One of them is that I've worked with a lot of different marketplace companies, but anything I say is pertaining [12:12] to something more sensitive. I may not name the company just over the course of the podcast. But the other more important thing I want to say is that I'm a professor at Stanford, and there's a reason I'm not a successful scaled entrepreneur of marketplaces, and that's because I probably haven't unlocked the key to exactly the question you asked, but nevertheless, I have some thoughts on it. The most important one is this. What I found talking to people who want to start [12:37] what they think is a marketplace. [12:39] is that they think too much about a marketplace before they're a marketplace. [12:44] That, in my view, is the biggest failure mode. You mentioned specific things, cleaners. I wonder about that. Is it something about the cleaning industry? It possibly is. I don't claim to be an expert on the microeconomics of the cleaning industry. But often it's not that. It's that I thought I was building a marketplace from the beginning. And that's not the way the world works. So I'm going to give you one vignette of this that I really like. And that's Urban Sitter. [13:08] So an urban sitter first, urban sitter is a babysitting marketplace. Okay, we can talk about kind of their whole life story. But I think what's most interesting is really the early days. And in the early days, what I found interesting, the way I found out about them actually is that we were stuck.
[13:21] looking for some help and I found out about this new platform and [13:24] where the clever thing was... [13:28] When you used to hire a babysitter, this is like pre-Venmo days, you need a cash on hand. [13:33] Because when the babysitter is done at the end of the day, they're usually like high school students or something. They want to get paid. They're not going to take your IOU that you'll send them some check in the mail the next day. [13:43] Unfortunately, you often don't have cash. They don't take credit card. They're high school students. That was an incredible friction to address, which is literally just we accept credit card payments for babysitting. [13:53] That's it, right? Now, from there, what happened is they took advantage of Facebook networks between parents and babysitters to build trusted introductions. So, like, let's say my sitter wasn't available. I get to know sitters in the Facebook network of that sitter, right? [14:08] Once they overcame that first thing to get some liquidity onto their platform, they could move towards asking, [14:16] How do I solve for these frictions that I talked about earlier? How do I solve for helping people find potential matches? How do I solve for people making those matches? You can't do that when you don't have liquidity on your platform. It's silly to tell someone, hey, I'm really going to help you find all those drivers out there, even though I only have three drivers on my platform. That's not a friction you're solving for.
[14:46] credit cards instead to now billing for how you were interviewing and contacting sitters. They had kind of a two-part plan for that, one with a pay-as-you-go menu, one with a more of a subscription option. But the key thing was either way, what you were paying for now was finding potential babysitters, not paying them with a credit card. That wasn't the key thing anymore. So what's the [15:08] A marketplace business never starts as a marketplace business. [15:12] Because what we think of as a marketplace business is something which at scale is removing the friction of the two sides finding each other. [15:19] But when you start, you don't have that scale. [15:22] So when you start, you had better be thinking, what's my value proposition in a world in which I don't have that scale liquidity on both sides? And, you know, that's bespoke. It means different things. And in the case of Odesk, where I started, [15:36] That initial thing was that remote work is a weird thing because basically you've somehow got to know that this person who you're not next to is doing what you're asking them to do. And so the initial value proposition of Odesk was to provide tools for workers to verify they were working the hours and doing the things that they said they were doing, you know, screenshots and various kinds of tracking. [16:01] And then, [16:02] in return for that to be able to provide guarantees on both sides. Now the workers could say, "Hey, I worked what I said, so I should get paid." The employers could say, "Oh, I actually see that you worked what you said, and so I feel comfortable that I got what I paid for." [16:15] That was the initial value proposition, it was resolving a trust issue at a remote scale.
[16:21] At that point, liquidity isn't the game. It's asking what's a problem that people in this space are facing that I can deal with. [16:30] When... [16:31] I'm not a scale marketplace. Again, with the cleaning industry, I can comment on that from personal experience, but otherwise, I think that's the way I would think about it. It's almost never about building a marketplace when you're building a marketplace. [16:43] That's very similar to the advice I always give marketplace founders is, [16:47] 90% of your problems are going to be non-marketplace specific problems. They're going to be the same problems. [16:52] any startup is going to have. Like, how do I grow? It's going to be like the same things you need to do. [16:58] So, you know, one thing you said was that's what you tell marketplace founders. I mean, something I've actually pressed hard on in my own way of thinking about this is that maybe we shouldn't talk about the concept of a marketplace founder. [17:10] Really, there's founders. And I think every entrepreneur, I mean, one way to think about it, right, it's very hard to think about a human business endeavor that has not been disrupted. [17:19] by the potential for transactions to take place online. [17:24] And if that's the case, it means literally any founder is a marketplace founder. It'll be a choice they make after they grow as to whether they want to build a platform. I mean, to take a very, you know, a very hot recent example. [17:36] No one in their right mind would have thought of open AI as a marketplace. [17:41] Right. But open as a marketplace now, they may not want to call themselves a marketplace, but they have plugins. Plugins are flooding that that platform. I don't know, you know, if people have played with it. I mean, it's not it's not an easy thing to find the plugin you need for what you want to do. And that really is a two sided thing. Now there's the plugin creators and there's the users and they may believe it. They may not believe it, but they are a marketplace.
[18:11] That's, I think, one. And two is, [18:14] Because that's the case, I think one of the other challenges I find founders struggle with [18:18] is you don't want to overcommit your future. [18:22] And what I mean by that is, [18:24] is that you're building up trust and you're building up a sense of what kind of business you are in your early days. [18:31] If you believe that this kind of platform future awaits you or market platform future awaits you, [18:38] there may be choices you're making early on that are tying your hands later. A great example of this is [18:43] When ODAS started, it was because the tools they were providing were for ongoing monitoring of work. It's a very natural thing to say... [18:52] We'll just take a constant fraction of the dollars that cross the platform. That all works well and good until after you become mature, some of these relationships between worker and employer last a long time. [19:04] Most of the value is generated now not so much because they're able to track each other because the trust is now there, [19:10] but because they found each other, because they're able to build that relationship through ODesk. And [19:16] That meant that the longer that goes on, the less value the platform is adding into that relationship. But you're still pulling 10% of all the dollars. What does that lead to? A word that most marketplace CEOs know well is disintermediation, which is where you were intermediating between the two parties and now they do. [19:34] Disintermediation means that essentially they're like, "Hey, we don't need you anymore." My favorite example is we had some stuff delivered from IKEA by a Thumbtack worker once and my wife is like, "Thanks a lot. You're so reliable." He's like, "Hey, great. Here's my business card. Ever need me again, just call the number on the back." That was it. Thumbtack got their one lead gen and then we didn't need the platform anymore.
[19:59] This issue for Odesk meant that after they merged with Elance and became Upwork, they had to think a little bit about, okay, what's the monetization strategy we want to use? How do we address... [20:09] this issue that longer term relationships may disintermediate. Does that mean you need a pricing plan that actually takes that into account? Right. So, [20:18] Early commitments, in this case, to a particular pricing scheme, particular monetization, can really tie your hands as you then realize later you actually are a platform. [20:26] I really like this message. It makes me think about Substack, actually. [20:29] which started as a [20:31] It's just a platform for newsletter writers. [20:33] And then they're like, how do we make this more valuable? Because they take a cut of everyone's revenue. [20:38] And they've actually invested heavily on [20:40] helping drive demand to writers, for example me, [20:43] And at this point, over 80% of my subscribers come from Substack's network. [20:47] And so they've built this marketplace element. [20:50] Exactly as you're describing what they just found. Here's a pain point. [20:52] Writers need more subscribers. How do we help them drive subscribers? [20:55] So they figured out all these ways to create demand. [20:58] That's like a really positive story, right? Where they managed to actually expand the frontier of their business by enabling that network. For every one of those, there's unfortunately a lot of negative stories. [21:28] treatises at this point written on eBay and their history and how they got to the point that they're at.
[21:33] But I think one kind of simple thing I do want people to think about there is, [21:38] that the sellers on eBay who had matured with the platform, who had grown with it, had come to develop certain expectations about what their lives on that platform would look like. It's understandable because a lot of these businesses, they had built their livelihood on that platform. That was their entire business. When you now reach in and you say, "I'm going to completely change the business." [22:00] the rules of the game in which your business model operates, from the perspective of those sellers, that's a breaking of a social contract that's been developed over a very long time. And so, you know, I love the Substack example because that's like, hey, let me amplify our social contract. Right. But I think for every one of those, there's an eBay warning sign that you can also you can trap yourself a little bit. [22:24] Just to close the loop on this really, I think, [22:26] Important point. A lot of people listening to this are probably, I'm a marketplace founder. I'm building a marketplace. [22:32] are going to hear this and be like, oh, shit. [22:34] Maybe I need to rethink how I think about what I'm doing. [22:36] What would be your piece of advice to people like that? Is it [22:41] focus on the friction point and it may be a marketplace solution, it may be a managed marketplace, it may be you own the supply. [22:47] Is that the advice or how would you what would your advice be to someone that's like, I'm building a marketplace? How should they reframe their thinking? [22:54] Let's go back to kind of thinking about this concept of a marketplace as reducing friction, right? [23:00] So, [23:01] The litmus test I like to give to someone who claims to me that they're building a marketplace business or their marketplace founder is, you know, do you have what I would call scale liquidity on both sides of your platform? What does scale liquidity mean?
[23:13] What it means in lay terms, and that's, by the way, I am a data scientist, and I love to think about these quantitatively, but fundamentally, like, if it doesn't pass the smell test, then you don't have to keep going with the data science. The smell test is... [23:24] Scale Liquidity asks: [23:27] Do I have a lot of buyers and a lot of sellers on my platform? [23:31] Or do I only have one of these two or do I have neither? Okay, if you don't have both, [23:36] You can call yourself whatever you want to call yourself, but at this moment in time, you're not a marketplace. [23:40] If you have won, congratulations, you've won the game on one side of the market. Now, if you won, you have a choice point. You can lean into growth on the side that you're doing well with. You've got a ton of users, a ton of buyers, great. Lean into it, get more buyers. That's one option. There's no shame in not being a marketplace. [24:01] Scaling a business is scaling a business. If that's the way to do it, do it. If you decide you want to be a marketplace, then at that moment when you've got a lot of buyers but not a lot of sellers or a lot of sellers but not a lot of buyers, the choice you're facing is how do I take advantage of having that one side scale to attract the other side? We can talk more about that, but there's a lot of ways to kind of hack that, to think about how.
[24:31] when really Uber Black was the only service is, [24:34] They just hand out coupons for free rides at kind of, you know, events, parties, things like that to take people home. [24:40] That was a way of saying, "Hey, we're subsidizing the drivers in the city. That's our scaled side. Now we're going to use that subsidized driver base to attract riders." That's how you get that flywheel going. Again, many people have written about how to take scale liquidity on one side and use it to attract the other side. [24:59] if you don't have either side [25:01] Don't worry about it. [25:03] Don't worry about being a marketplace. Worry about scaling one side. And in that world, it's opened up, it opens your visibility up completely into the advice of many, many, you know, startup advisors, right? People who have advice, not so much about scaling a marketplace, but about scaling a startup. And I really, I want to say you got to let the ego go at that point, right? Like it's fine to, [25:28] articulate to people that your vision of the future is to be a platform or marketplace. As I said, virtually every business is going to have that option at some point in the modern tech-enabled economy anyway. So you're not saying something people don't already know when you tell an advisor or an investor that. But I do think you need to be humble enough at the starting point to recognize that there's no sense in talking about a marketplace if you don't have scaling on either side yet. [25:52] And then it becomes a question of a business model unit economics of can I build, say, a DoorDash not as a marketplace? Can I just hire a bunch of people delivering? Right. It's like, is this even possible in a different route?
[26:06] Yeah, that's a great point. One of the things I think that's useful for people to think about here that you're raising is, [26:12] At some level, it's kind of tied up, I think, with that question of whether I should have employees or kind of contract or freelance work on one side of the marketplace. And that's actually a pretty old question in economics. The way we talk about it often is a distinction between a market or a firm. [26:31] One of the interesting puzzles in economics, Ronald Coase is a famous economist who thought about this, is well, if markets are so efficient, why do we need firms? [26:40] Right. Because if markets are efficient and matching labor up with things that need to get done, why would you ever need a firm? And that's like one of the earliest recognitions of transaction costs are real thing. And that's kind of one of the things that firms are solving for. And I love what you're saying, because what it's recognizing is, hey, for your frictions. [26:58] the best resolution to that might not be to have a marketplace. It might actually be to have very tightly controlled labor. A good example of this actually, Stitch Fix, I think one of the things that's cool about Stitch Fix is the experience that people had early on with stylists at Stitch Fix. I'm a happy customer, by the way. I think I fixed stuff on. Yeah, I think one of the great things about that experience is it felt magical to have someone who
[27:28] like a freelance relationship every single time you're going back. Another example that I would pull out is pretty much any healthcare platform. So, you know, for example, for physical therapy, you know, it'd be weird if every time you went to a physical therapy platform, you just got randomly matched to whoever happened to be available then. So I think there's some curation that needs to happen of that relationship. Does that mean full employee? Maybe not, but it does mean you have to think a little bit about exactly as you brought up, you know, what's the [27:57] What's the nature of curation of your labor pool? [28:00] Awesome. Okay, so let's come back to a point you made early on. [28:05] around the importance of data and the power of data in actually making your marketplace a lot more efficient and work more effectively. [28:12] So say that [28:13] you have a data scientist or data analyst or someone that is helping you optimize your marketplace. [28:18] Where do you often find the biggest leverage and opportunity for [28:22] a data person to help you make your marketplace more effective. [28:25] This is an incredible question, right? Because I think I could answer it a number of different ways. One question I think there that's kind of basic is just what should this person be doing? And I'm going to actually kind of evade that question a little bit. I'm going to give some examples of what they could do, but I feel like that's one where context matters a lot. So as an example, at the end of the day, I'm going to give you a little bit of a question. [28:47] ride sharing or grocery delivery marketplaces, pricing means actually what do you pay for that ride or what do you pay for that delivery? Right. So that's that's actually the price that's set at the moment you actually place there. Just to be clear, by the way, if you order from DoorDash, I don't mean the price of the restaurant. I mean, what do you pay to DoorDash? Right. What's that? What's that fee? Is there a surcharge because it's, you know, surge or whatever. Right. So.
[29:10] Okay, so that's a thing, right? But that's not really a thing in a marketplace where the platform's not setting the prices. So at Airbnb, really, hosts are the ones who are charged of setting prices for their listings. [29:21] So one answer to your question is, [29:23] If I'm in a place like, you know, Uber, Lyft, DoorDash, I want to have good data scientists thinking about pricing because that seems like something which should be heavily dependent on the instantaneous state of supply and demand in my marketplace. Right. So that's. [29:36] One type of answer is, well, do I need people, data scientists working on pricing? Do I need data scientists working on search? Why search? Because maybe in my marketplace, finding the needle in the haystack is really the biggest, highest friction problem. So maybe I need a lot more data scientists thinking about search. That's what I'm going to evade. [29:52] Okay. [29:53] I'm going to focus more on something completely different. [29:55] which is just a more philosophical point about what a data scientist does. [30:00] So in a lot of companies today, especially, a main thing that you ask data scientists to do is build what's called a machine learning model. You know, machine learning model even already can mean a lot of things to a lot of different people. I'm going to focus on something very concrete. You're asking them to predict something. [30:15] When I started at Odesk, this is in 2012, one of the funny things about me is I started at Odesk because... [30:23] I had an academic career up to that point, about 10 years. [30:27] you know, building mathematical models of things. I was not really very much of a data scientist at that point. What I expected would happen is I go to industry and I'd be told, hey, like, look how important data is, you know, and definitely my eyes were opened. And one of the first things I was asked to think about is, well, okay, someone comes to ODAS, posts a job.
[30:46] workers apply to that job. [30:48] Predict. [30:49] Which of these workers is most likely to be hired on that job? That was the narrow question. [30:54] Okay, and so why is that a good question? Because we have a whole awesome set of tools now. [31:00] to solve that kind of a problem exactly. How do we do it? Take a lot of past data, of past jobs, past applicants, past hires that were made, and then we ask, [31:09] you know, these crazy big black box algorithms, all right, do the best job you can predicting who's going to get hired on this job with these applicants. And we use that data to test how well these algorithms are doing. That's like machine learning in 30 seconds, basically. [31:21] Okay. [31:22] So, you know, we're working on this problem, right? [31:25] And then I kind of poked my head up a little bit. I go, why are we working? What is this going to do? [31:29] Well, it turns out the reason these kinds of things are important is they get used to make decisions. So what kind of decision do you make with that? Well, one thing you do is you say, oh, well, if I could predict who's most likely to be hired, [31:42] then I should just rank people. [31:44] based on that, and that would be a good matching algorithm. That would be a good way to sort and triage people, applicants for employers when they're screening, trying to figure out who to interview, who to hire. [31:55] Great. [31:55] Thank you. [31:56] Sounds pretty natural, right? And then, you know, you think about it a little bit. And this to me is really, it's such a... [32:04] such a passion project to get people to understand that this is why [32:07] The humans in the loo. [32:09] that help us in businesses and making sense of data are so critical is the following problem. [32:15] If you think about it a little bit, you realize, you know, what that algorithm is doing, it's really just picking up on patterns and past data, right? So, yeah.
[32:24] That's great, this person is likely to be hired. But what we really want is something different. We're trying to add value by ranking people. So, you know, to give another example that's similar to this, when you're a marketing manager, [32:37] and you've got a crack data science team that's built a long-term value, lifetime value model for you, you're not going to get in trouble with anyone if you send your highest value promotions to the highest LTV customers. Who's going to blame you for that? Because you're like, oh, yeah, this person's worth a lot. [32:53] and I sent them this promotion, [32:55] You say that in your monthly report, nobody's going to give you a hard time. [32:59] But the problem with that way of thinking is actually predicting what their lifetime value is, isn't really the question. The question is, how much more are they going to spend on my platform because I sent them that promotion? That's a very different thing. It's a differential rather than an absolute. I'm not interested in their absolute LTV. I'm interested in the difference in their LTV because I sent them this promotion. [33:21] And when you look at it that way, what you realize can happen is picking up on patterns because of good predictions, right? Finding the people that have high LTV because you predicted that is very different than making good decisions, which is about saying, [33:35] the difference in their LTV is going to be higher because I sent them this promotion. [33:39] I love this example because I taught a class here at Stanford. It was like an executive education class. We had all the executives for a company in the room and one of the people in the room was the chief marketing officer. I just asked this question like, "Hey, okay, let's see you got this great LTV model. Who would you send the promotions to?" It's like, "Definitely the highest LTV people." There's a CMO in the room and so it's a little bit of a delicate situation like pushing back a little bit.
[34:03] I do want to be clear, there's reputational reasons you might do that anyway. I mean, I'm not trying to get away from that, but just to make the narrow point that, [34:10] predicting [34:11] is about picking up patterns, but making decisions is about thinking about these differences. Now, why is that important? Because we learn in high school. [34:18] correlation is not causation. That's a phrase everybody has heard all over the place. What does that have to do with this? Well, when we teach people to build machine learning models, we're asking them to make predictions, we're asking them to find correlations. Prediction is inherently about correlation. [34:32] but when we ask people to make decisions we're asking him to think about causation if I make this decision then [34:40] Will I actually increase the net value of my business, right? Will I have, by sending the promotion, increase the likelihood that this person is going to spend more on my platform? [34:50] And so the first and most important thing that I feel very strongly about and what would I get a data scientist to do is no matter who they are. [34:58] Even if it was that person in the weeds thinking about building this prediction model for hiring, [35:04] get them to be thinking in the back of their mind always that their goal is to help the business make decisions. And that the distinction between causation and correlation matters a lot. You know, we could talk a lot more about how does that play out in terms of their day to day work. But at least at a starting point, you have to recognize that the first step is always recognition. That prediction isn't the same thing as making decisions. [35:27] So the takeaway here is help. [35:30] as a data team and as a data scientist on the team has helped the business make predictions.
[35:34] Are there a couple more examples you could share of just like what is an example of a decision that you think they often should be making and using data to help them with? [35:41] Maybe the right way, the right frame of reference for this and the word that an academic would use is, you know, causal inference. Right. So what we're changing from is machine learning to causal inference. So let's think that through in a couple of different ways. [35:55] use cases that are related to that marketplace data science flywheel I talked about earlier. Finding matches, making matches, and then learning about matches. So finding matches, like you said, a core part of that is search and recommendation, and each of those relies on rankings. So I want to be able to rank order, let's say I go do a search on Airbnb, I want to rank order the different listings in the marketplace, right? [36:16] At some level, it's true that what I'm trying to do there is I'm trying to just predict what are you going to like the most. [36:23] Right. But I think there's an important piece of that also, which is that I want to think a little bit about the distinction between two different ranking algorithms. That's the real decision that's being made. And when I think about the distinction between two different ranking algorithms, I don't want to be only comparing them. [36:42] in terms of how well they recreate the choices people made in the past. The way I'm really going to evaluate those is in my market, [36:49] Does one of those lead to better matches or more matches than the other one? Right. So Airbnb as a business, like what are the most obvious core metrics? It's bookings and revenues. [36:59] So you're going to want to ask a very basic question. If I use the ranking algorithm Lenny just developed last night versus the ranking algorithm Ramesh developed last week, does Lenny's ranking algorithm lead to more bookings than Ramesh's ranking algorithm? And it's so important to put it that way starkly because that's so different a question than does Lenny's ranking algorithm do a better job of predicting over the last two years what bookings people made than Ramesh's ranking algorithm.
[37:29] I think that's where this kind of hiring issue popped up, right? Because in the end, while we might have these predictive algorithms to rank who you're going to hire, that's not the important question. Interestingly, the important question is actually, [37:40] to evaluate the quality of the match that's made. And we would do that through the next step of that flywheel. We'd ask ourselves, you know, [37:48] What ratings did they give back to that freelancer? Do they hire that freelancer again? So you're comparing two different algorithms, not through their ability to recreate the past, but their ability to make matches in the future that can be objectively evaluated to say, hey, yeah, I increased the value of the business. I actually made better matches this way. [38:05] You know, and then rating systems, I think we could talk quite a bit about kind of a similar, a similar phenomenon there too. This episode is brought to you by Epo. Epo is a next generation A/B testing and feature management platform built by alums of Airbnb and Snowflake for modern growth teams. Companies like Twitch, Miro, ClickUp and DraftKings rely on Epo to power their experiments. Experimentation is increasingly essential for driving growth and [38:31] and for understanding the performance of new features. And EPPO helps you increase experimentation velocity while unlocking rigorous, deep analysis in a way that no other commercial tool does. When I was at Airbnb, one of the things that I loved most was our experimentation platform, where I could set up experiments easily, troubleshoot issues, and analyze performance all on my own. EPPO does all that and more with advanced statistical methods that can help you shave weeks off experiment time, an accessible UI for diving deeper into performance,
[39:01] that helps you avoid annoying, prolonged analytic cycles. [39:05] EPPO also makes it easy for you to share experiment insights with your team, sparking new ideas for the A/B testing flywheel. EPPO powers experimentation across every use case. [39:14] including product growth machine learning monetization and email marketing check out epo at getepo.com slash lenny and 10x your experiment velocity that's get eppo [39:26] dot com slash lenny yeah i'd actually love to talk about rating systems but [39:31] There's kind of an implication in everything you're describing of running an experiment. [39:34] Versus looking at what [39:36] would have happened in the previous world. It's [39:39] You made a change, run experiments, see if it actually makes an impact on bookings and revenue. [39:43] And that leads me to a question I wanted to ask, which is, [39:47] With experiments, there's kind of this classic challenge. And always elephant in the room of, if you just run a bunch of experiments, you're kind of going to, [39:55] micro-optimize, lead to these local maxima, [39:59] And you may miss big opportunities and big unlocks if you're just like, [40:02] extremely experiment driven. [40:04] You spent a lot of time thinking about experimentation. [40:07] what have you learned or what advice do you have for people to either be less worried about micro-optimizing and missing something big or just [40:14] finding a balance with [40:16] running experiments, but also creating opportunity to find a huge new opportunity. [40:20] Yeah, first of all, I'm really glad you broached the E-word. I was dancing around it and I'm really glad that we talked about experiments. Because yeah, one of the big lessons of this recent conversation we've been having is just how could you possibly know that difference without doing something like experimenting? Okay, so yeah, I am a big believer in experiments. I mean, I'll just lay those cards on the table. I don't know.
[40:43] I love working with businesses that think experiments are important to helping make good decisions. [40:48] Now, all that said, I am also someone who feels pretty strongly about this exact issue that you're raising, which is you can't experiment your way out of everything. And one way, you know, that one frame I like to give people is that. [41:03] although you might say you're an experiment driven business I you know some businesses will proclaim we literally test everything but that kind of leaves aside a little bit is there's a lot of degrees of freedom and what it means to test everything because ultimately [41:17] What's getting built and tested our choices that are made through the organizational structure the data scientists the PMs the engineers? Everybody's on the you know before we're running experiments were actually thinking about even what's worth experimenting right? Like what designs are we coming? So that's one and the other big one is how long do we run these experiments? Okay? That's a big choice and and [41:38] What I genuinely believe, and I think there's a paper we can link to later that I'll point your readers to as well, not my paper, from some folks at Microsoft, [41:49] What I generally believe is... [41:51] that [41:51] we're risk averse on both these two dimensions. [41:54] that what people decide to test in a world that has promoted experimentation for everything [42:00] tends to be more incremental by design. Okay, because, and we'll come back to by actually answer the because in a second. So that's one. And two is people tend to run experiments for a long time. [42:11] and probably longer than they should.
[42:14] Now what do I mean by these two things? So what's interesting to me about this dynamic is experiments don't live in a vacuum. [42:22] Companies have incentives. [42:24] In companies that really go all in on experimentation, one of the things that gets wrapped up in that is the incentives around experiments. Because if you go all in on experiments, the common thing you'll see is data scientists get judged based on how many wins they had that quarter. How do you get more wins? [42:44] It's easier to get wins when you're being incremental. And because it's important to have wins, you have to run them long enough to demonstrate that they're really wins. You're less willing to cut something off in exchange for trying something riskier. So the big lesson is Microsoft paper. It's called AB testing with what's called fat tails, which in lay terms just means you're running a business where there's potentially big opportunities out there if you look at kind of the effects of the experiments that you run. There's a couple of lessons there about both. [43:12] trying a lot more stuff that's not all risk averse, and not necessarily running everything for so long. So really getting velocity up. So you can see that there's a big incentive problem there, right? Because the culture that says it's okay to fail big, [43:27] actually requires changing the terminology of wins. This is one of the things I hate most in A/B testing, I have to say. I get where it comes from. [43:34] You know, experimentation was never historically in science about winners and losers. It'd be weird if, you know, Ronald Fisher, who's kind of the father of experimentation with his agriculture experiments, talked about winners. I don't think that's necessarily how he talked about things. Experimentation is always very hypothesis driven. It's about what are you learning?
[43:52] And that's really an important distinction because what it means is if I go with something big, risky, [43:59] and it "fails", meaning that doesn't win. [44:04] I nevertheless, if I was being rigorous about what hypotheses that's testing about my business, I'm potentially learning a lot. Right. So a great example of this kind of thing is, you know, that there's. [44:16] An important feature of Marketplaces is badging. Sometimes it's really important to have badges on your top-rated profiles when people are searching. [44:26] Without going too far into the details, a common finding with badges is that badges you think are going to be great actually turn out to be terrible. One reason they're terrible is they focus too much attention on the badged folks and pull too much attention away from the unbadged folks. [44:44] And if we judge that only in terms of winners and losers, you throw the baby out with the bathwater. You're like, oh, well, that badging idea was terrible. So ditch that. No badges. But that's not what it's telling you. It's teaching you something about how inventory is being reallocated, how attention is being redirected through the badges. And you really want to think not in terms of winning and losing, but learning. So learning is a win. [45:06] And I feel that that's a cultural thing. Fundamentally, it's very hard to somehow attach dollars and cents at the top to data scientists running experiments that that fail, but learn. And ultimately, I think getting into that space where you experiment more, meaning you don't run all your experiments for quite as long and you accept the willingness to try experiments that are into the tails where you might fail bigger.
[45:30] is a cultural thing. It's about saying that, you know, we're allowing that to be part of our social contract with our data scientists or, you know, actually our employee contract with our data scientists, that not everything is just about how many launches you had and, you know, how many wins there were. It's okay to say that's how I want to use experimentation, but if you're going to use it that way, then I would say don't be a we experiment everything business, because then I think you need some other way to deal with these big, [45:55] changes that teach the whole company a lot, but maybe can't. [46:00] fall into the incentives you've created for your data scientists. [46:04] This badging example is, I don't know if you're referring to the Airbnb example, but it actually led the launch of Superhost at Airbnb, which is like the ultimate... [46:12] badge on Airbnb. And there was a lot of concern from the data team that it would [46:17] destroy the marketplace because they built [46:19] as you described this very [46:21] well-crafted ranking algorithm. [46:23] with just like a prediction of, you know, exactly as you described, which listings that guest is most likely to book and be successful booking. [46:32] And then we're about to throw a badge on random listings in the, [46:35] results. And so this one data scientist in our team is like, "No, we can't do this. This is insane. We're going to destroy it all." [46:42] And we still went ahead with it. We ran an experiment showing the badge to some people and some not. Actually, it was no... [46:49] no impact at all. [46:50] which is like Superhost itself had no impact at all on the business as far as we could tell initially, which is also bittersweet because it felt like, why did we even work on this thing?
[47:01] There's like a slight benefit where hosts felt better. They felt more satisfied. [47:05] with being a host, but [47:07] I went exactly through what you described, so that's pretty funny. [47:10] Without necessarily going into the weeds on the data science of Superhost, I think there's a lot wrapped up in what you said. I guess another thing I'll say is that I'm a big believer that you don't throw your understanding of the business out the window when you process experiment results. And it's partly, I guess, what I mean by this is... [47:33] Data science is really about accumulation of evidence. It's never about one finding in isolation. And so another kind of trap, I think, is to sometimes say, well, I hit stat sig on my AB test. [47:45] you know, green light, it's all go. And, you know, I think, you know, you had Ronnie Cahavi on your show, and he made a similar point that, that, [47:53] There are different levels of evidence and just having an outlier A/B test that goes against everything you believe about your business doesn't mean that you somehow have controverted all your knowledge. [48:04] I think that's one side, but the other thing is you can't always measure. [48:07] everything that's important that's needed to really [48:10] develop like a full sense. So with Superhost, right, one of the things that's hard to measure is the long-term impact of Superhost. [48:17] Because, [48:18] In the short run, Superhost causes a rebalancing of inventory. There's going to be winners and losers. Part of Superhost is actually about retaining hosts that get the badge over a longer period of time. [48:30] recognizing that hypothesis actually says something about maybe how long the experiment needs to be run or what kinds of data analyses need to be done.
[48:38] And in the end, if you can't do that, you can't run it long enough, or you can't do that data analysis due to sparsity of data or lack of data to address the question, [48:47] it matters what you bring to the table, right? What are your beliefs about that? So what I like to tell people to do there is I like to push people to be what's called quantified rather than data driven, which is, okay, fine. Some things we can't measure, right? But [49:01] Maybe you've got a leadership team with different beliefs about what they think the retention value of Superhost is going to be. They might be all over the place. You can process your experiment results in the context of these competing beliefs. It's almost like a prediction market kind of a thing. And start asking, well, okay, if this is what we believe about our business, this is what the data is telling us out of the experiment, let's put those two together and ask, is this... [49:25] Is this enough for us to make the bet that we're still going to go with it, even though maybe that short-term test you ran was flat? [49:30] That's actually exactly how we think of Superhost looking back. [49:33] It was a great idea. I'm really happy. I can't even imagine Airbnb without that. [49:38] Even though... [49:39] There's no evidence, at least initially, that it made any impact. I'm guessing they looked at it again and maybe there's something... [49:45] that came out of it. [49:47] But even if it had no impact, [49:49] It just feels like it made the marketplace better. And that was a big learning for me. It doesn't need to always drive a metric that you can measure. [49:56] There's just like, this is the way it should work. So one of the reasons the thing you said happens is because marketplaces are a little bit like a game of whack-a-mole. [50:05] Okay, and what I mean by that is, like, so narrowly in the context of Superhost, it's
[50:10] Because you're redirecting attention to some hosts at the expense of others, it's not even obvious if bookings can really go up. Maybe you get lucky, maybe you get a bunch more bookings. One reason you probably wouldn't expect that in the first place is there's only a limited number of super hosts. How many more bookings are they going to be absorbing because of all this extra attention? And you're taking attention away from other people. Without doing any data analysis, my prior would have been that bookings should probably go down, right? [50:32] And one example that I came across with one of the companies I worked with that I love is we were... [50:40] working together over a period of time and [50:43] And in a month, we looked at some of the data and it suggested that our new [50:49] you know, supply side was having a pretty bad experience. So, oh, we got to do something about this. So what we decided to do is build some custom bespoke features that were really going to direct them to more experienced folks on the other side of the market. [51:02] Good. [51:03] And then, yeah, lo and behold, you know, pretty soon those metrics start to look better. But then we're looking at it. We're like, wait a second. Now, you know, the existing folks on the other side are having a worse experience. So you kind of whiplash around. You're like, oh, wait a second. We better do something about that. So we take them. We try to match them up with the more experienced folks. And now suddenly. [51:20] A month after that, you're like, you know, wait a second. And your metrics just keep moving around. And that's because the whack-a-mole game here is ultimately... [51:27] A lot of marketplace management is moving attention and inventory around. [51:33] Sometimes you get lucky and you really expand the pie for everybody. But I think Sir Bas Salen, who was CFO at Upwork that I got to know there and then went to Thumbtack later, he had this line when he came to visit our class that I love.
[51:46] which is [51:48] You have to recognize when you run marketplaces that many of the changes that are most consequential create winners and losers. [51:54] And rolling with those changes is about recognizing whether the winners you've created are more important to your business view than the losers you've created in the process. And it's a hard reality because nobody likes to articulate the idea that a feature change is hurting some of the people in your marketplace. [52:10] But because of this... [52:12] fundamental constraint baked into how marketplaces work, many of the things that we would choose to do and the reallocation they create can't necessarily create observed pie expanding wins in the short run. You're often making bets that that's where you're headed partly through the reallocation that you're doing right now. And so I think that's what's interesting about Superhost to me is it partly points to thinking about what's [52:35] the objective you would have defined, the metric you would have defined in the short run, that captures this idea of a tradeoff. [52:41] That's a great way to [52:43] Think about it. [52:44] I wanted to come back to... [52:46] This idea you're sharing of [52:48] maybe you should run experiments more quickly, not wait for stats sig. [52:53] have a culture of learning versus impact. [52:56] In practice, it's very difficult because [52:58] People are measured by impact. There's performance reviews, there's promotions, there's [53:02] How much impact did this team drive? We're going to look at their experiment results. [53:05] You've worked a lot of marketplace companies, a lot of different companies. Is there anything you've seen about just like how to. [53:10] something you could do to help the [53:12] company. [53:13] Shift and actually [53:15] It worked this way.
[53:16] while also recognizing success and who's doing great, who's not, which team's driving impact, who's not. [53:22] Interestingly, it's actually sort of an active area of research for me now. What I mean by active area of research is I care a lot about the incentives that we create for data science through how we set up reward mechanisms. So there's a couple things I think that could be helpful that are maybe, you know, there may be a little bit less about, like, maybe I'm not going to directly answer the question you asked, because I think that's a hard one, right? I think I recognize that measurement on impact is critical. [53:52] way first. I think there's a cultural issue here that's really critical. You know, one of the things I often find is that my PhD students, our PhD students here often go off and get, you know, great data scientist jobs. And in one sense, they're doing amazing stuff. They apply really technically sophisticated methods. But when I look at kind of the problems they're working on, they're often more at the margins of the business than they should be. And it's, it's, [54:14] a cultural thing. It's basically because if you're measured narrowly on impact and that's all anyone sees around you, then it's very hard to engage with the creative aspect of business change and the strategic aspects of business change. So the cultural aspect there is, I think it's partly incumbent on the leaders. [54:31] to expect something more of their data scientists. And what I mean by expect more is that you expect them to do more than deliver [54:40] narrowly defined statistically rigorous results to you in their reports. You're actually expecting them to talk also [54:46] about what they're learning about the business in the process so where that's headed is this this concept of being hypothesis driven which is like the technical phrase what does that mean again in a
[54:57] more lay sense. What it means is tests aren't [55:00] going to be defined only in terms of winners and losers, but each test should also say something about what will we learn about [55:07] a business flow, a funnel, preferences of the guests, preferences of the hosts, right? What will we learn about their demand elasticity if we're changing prices around, you know, these kinds of things. So it's possible to articulate in an experiment doc, a launch doc, you know, what are the hypotheses that are being tested? So that's one thing I would say, it's like just culturally setting the norms that learning is part of the discourse and it's expected actually, I think is important. But the other thing I would say that's, you know, maybe a little [55:37] programmatically, like what could a team, a data science platform team do? [55:43] You know, a funny thing about experiments is that we throw past learning away effectively. And this is just an artifact of how we analyze experiments that the methods use, the statistical methods use typically, you know, p-values, confidence intervals. These fall in a branch of statistics known as frequentist statistics. And the idea behind frequentist statistics, you know, without being overly technical, is just I let the data speak for itself. [56:05] There's no beliefs brought to the table about where that data came from. But if you think about this in a company, in A/B testing, it's a weird thing, right? Because I might have run a thousand A/B tests in the past on this exact same button or call to action or color, and now I'm going to completely ignore that and focus only on this. So there's ways. [56:25] to [56:26] take the past into account to build what's called a prior belief before I run an experiment, and now take the data from the experiment, connect it with the prior to come up with a conclusion of like, okay, in light of the past plus this experiment, what's it telling me about the future? And that falls broadly under the category of what's called Bayesian.
[56:44] A-B testing. [56:46] So that's one of the things I think can help culturally, weirdly, it's like a super technical thing. But I think it can help culturally because what it's doing is it's now rewarding people for contributing information to that prior. And I think it's then becomes possible to say, oh, like, [57:01] your experiment that failed [57:04] actually moved our prior. And that's an important thing because by doing so, you're now altering how we're going to think about this flow or this pricing plan. [57:12] in all future experiments, right? So there's like an information positive externality, positive network effect that's generated for the rest of your business. If I can somehow encode what you learned into the analysis of future experiments. So this is one thing. There's a strong connection between the culture and incentives of A-B testing and the ability to actually incorporate past learning into these prior beliefs. [57:35] I love that you're doing research in this area. We should bring you back when you've completed it and have the ultimate answer for everyone to change how they operate. Yeah, one of the great things about professors is we never complete anything and never have ultimate answers. Oh, boy. Yeah, I'll do my best, though. [57:50] This touches on a really interesting concept that you shared with me. [57:53] around how learning, just learning isn't free. People think that [57:56] they could just learn a bunch of stuff and there's not a cost to it. I'd love for you to just chat a bit about that. [58:01] Let me start with an anecdote that I just absolutely love. I use it every year in class. I was talking to a real estate platform and they [58:11] They had a marketing data science manager who's basically responsible, as many marketing managers are, for allocation of ad spend across different channels.
[58:22] and [58:23] What they discovered had happened at the end of the year is – [58:27] In one hand, the team had done great. [58:30] But the manager had held out. [58:33] some subset of arriving visitors, [58:36] not shown them any of the innovations they were making. Like a holdout group. [58:40] Yeah, exactly, what's called a holdout group in experimentation. The important thing about this holdout is it wasn't authorized. That's not the way things are supposed to work. They've got their ad spend, allocate out your ad spend, great. [58:52] So at the end of the year, you know, they looked at the whole lot. They're like, wow, that cost us like a couple of million dollars, something in that range. And. [58:58] It's like not a trivial amount of money. What's the deal? What were you thinking?" Basically, and of course, the answer was, "Well, I'm not going to do that." [59:05] I get that I cost you that much, but number one, [59:08] Now you know what my team's worth. [59:11] And number two, you would never have had that answer unless I'd done that on my own, right? Now, I'm not going to be able to do that. [59:16] Why is that so powerful? I think what I find so interesting about experiments [59:21] is that when you don't know something, [59:24] It seems... [59:26] It's not even a question that you would allocate some of your samples to all options, right? Treatment and control. I got two different ways of doing something. I don't know which one's better, so of course I'll give some samples to each of them. After the fact, you're like, oh, the treatment was better. What the heck were we thinking? Why did we give all those samples to control, right? That doesn't make any sense now. There's this great Seinfeld clip where, you know, they get the bill at the... He mentions getting a bill at the end of a, like, large, luxurious meal, and people stare at the bill. We're not hungry now. Why did we order all this food, right? So it's the same thing.
[59:55] I mean you know treatments better now why do you waste all those samples on control and I think that is such a powerful observation that [1:00:03] You have to put yourself in the frame of reference of when you didn't have the answer. [1:00:07] And at that moment, [1:00:09] What you're essentially saying to yourself is that it's worth paying to learn the answer. [1:00:14] I think it sounds obvious the way we're saying it now or this anecdote of the marketing manager in the holdout sounds obvious, right? What's culturally not baked in, I think, is [1:00:25] That idea, and the reason I say it's not culturally baked in, by the way, is because of the language of winners and losers. Because if we use that language, what we're implicitly saying is that we wasted time. [1:00:36] when we ran an A-B test on Loser. If I reward you for shipping winners, then what I'm really telling you is all the time that you spent testing out failures was wasted time. [1:00:46] And I think, you know, of course, yeah, like you don't want to keep data scientists around who regularly are just generating failures. That's not my point. But my point is there's a disconnect there. [1:00:56] On one hand, we can all look at the story of this marketing manager and chuckle at it, right? And yet, every day, we're instantiating language and processes that are reinforcing that same theme, which is essentially trying to say to you, if you're wasting samples, [1:01:10] on things that don't ultimately end up being a winner, then the act of doing so is a failure. So I really feel that that... [1:01:18] That idea that you have to pay to learn is, again, it's a cultural thing, but it's also an education issue for, you know, businesses are populated by people of all stripes. Not everybody comes from a data science or experimentation background. And this idea that that learning is costly is not natural, actually. It's not natural as a matter of human nature. It's certainly not natural as a matter of running a business.
[1:01:41] I love that example of the real estate. [1:01:43] platform where it's like very [1:01:45] viscerally clearly cost. They lost because they didn't roll out experiments to this group for a long time. [1:01:52] Such a neat example of this idea in action. [1:01:55] You mentioned... [1:01:56] Star ratings. I know you spend a lot of time on designing rating systems. Sorry, I didn't mean to imply star ratings. That's just one implementation. [1:02:04] rating systems in general. [1:02:05] So maybe just to keep it [1:02:06] focused, say a marketplace founder, [1:02:10] is trying to decide and design how they do ratings. [1:02:13] and reviews and things like that. [1:02:15] What's a couple pieces of advice you'd give them for how to... [1:02:18] do this correctly [1:02:20] And is there a model marketplace you'd point them to, like, these guys really do it? [1:02:24] really well and I know it's like super specific based on the marketplace [1:02:27] But is there one just like, oh, they really nailed it? [1:02:30] Oh, man, that's a tough one. I think I'll answer the second part first. I don't feel like anyone's really nailed this. I think there's a lot of innovation that's happened, but I think fundamentally we're still playing with the same kind of tools that we had when [1:02:43] eBay and Amazon first started thinking about how to do rating systems ages ago. And part of the reason we haven't nailed it is because there's a lot of dynamics in play that lead to what's called rating inflation, where if you look at ratings over time in a marketplace, one of my colleagues, John Horton, who was a professor at MIT and has worked very closely with Upwork.
[1:03:13] let's say on marketplaces like Odesk, like Uber, like any of these, right? [1:03:17] And there's a lot of reasons for this, but one of them is just that there's a reciprocity issue. [1:03:22] Right. Which is it's effectively, you know, from your perspective, it's kind of costless if someone says to you, hey, like, please leave me a nice rating. Right. And, you know, if you're seeing them or you're interacting with them, you know, most people don't want to be mean. So that happens. No. But there's another aspect of it, which is norming. [1:03:37] As the ratings in the marketplace go up, they get normed, right? So that now you're in a condition, you're like, oh, a four-star rating, I'm really screwing this person over. Whereas maybe when the marketplace started, you didn't think that. So definitely one thing that we worked on in our research was to think about re-norming. [1:03:53] the meaning of some of these labels. And renorming could mean something like, rather than the star ratings just being poor to excellent, [1:04:02] the top rating is actually exceeded expectations. You could go one step further and you could say, "How did this compare to this experience you had in the past that you rated really highly?" Airbnb had something like this in place where they would actually ask you to compare or ask you questions about expectations. I find that that's really valuable because [1:04:22] It's easier for people to say that was good but didn't exceed my expectations. [1:04:27] That was good, but definitely not better than this amazing stay I had like two months ago than it is to say, well, you know, I'm going to ding this person and give him four stars. So that's one issue. And I think another thing I want to point out for any marketplace founder is that something you want to be really careful about is the concept of averaging and what are the implications of averaging. And that's because a default for many marketplaces is to just average the ratings that people get.
[1:04:52] It feels very natural, right? Like, Lenny's got five ratings, let me average them. And that actually has some pretty important distributional consequences for the marketplace, distributional in the sense of who wins, who loses. [1:05:04] And that's because [1:05:05] If you're averaging and you're really established on a platform, think of a restaurant on Yelp with 10,000 reviews, [1:05:11] It's irrelevant what the next review is. It doesn't matter. Nothing's moving at that point. [1:05:16] If you're new, [1:05:18] and you break into that market and your first review is negative, you might be completely screwed. In fact, there are some early work on eBay that show that if your first rating is negative, that could actually immediately cause like an 8% hit on your immediate expected revenue, say nothing of long-term consequences. Subsequent work has found that that's a significant indicator of potential exit from the platform just because now it's very hard to find work. [1:05:42] And, you know, some platforms do things like maybe they won't show your ratings until you've accumulated a few. But in the end, this kind of. [1:05:49] distributional fairness aspect of averaging is pretty significant. And one of the recent papers that we've written is trying to get platforms to think a little bit about that. There's ways to address that, interestingly, through the same concept of a prior. And the prior basically says, hey, if someone comes into the marketplace, and instead of averaging them, I average them together with a prior belief. [1:06:10] Then maybe what that prior belief does, it says, yeah, you got one negative rating, but maybe you got a little bit unlucky. And maybe my prior belief is something which actually pulls your rating up a little bit and allows me to still have you alongside others in the marketplace and give you a chance at getting work, getting rides, etc.
[1:06:27] So I believe pretty strongly in this kind of like distributional fairness thing. [1:06:32] element of designing rating systems, I think it's been understudied. And I'll say in general, actually, I think rating systems are understudied, which to me is astonishing, because the biggest change from those agora's and Trajan's market elements of those kinds of markets, to me, the biggest change is that we get to see what happened with our matches. [1:06:51] So, you know, as a data scientist working on marketplaces, I feel like it's it's [1:06:56] incredible that more of us don't spend our time thinking about what we're learning from the matches and what these rating systems are telling us and what the impact of that is on who wins and who loses in these markets on you know kind of thinking about like the social implications of these things so that's something i'm pretty passionate about [1:07:14] I also led the review system flows for a while at Airbnb. [1:07:18] One of the things I'm most proud of is [1:07:20] launching what we call double-blind, [1:07:22] reviews where you don't see the other person's review until... [1:07:26] You will leave your review. [1:07:27] And yet we... [1:07:29] The intention was to create more honesty and more accurate reviews. It turned out the biggest impact was review rate went up. [1:07:35] because people get this email. Ramesh left your review. If you want to see it, you should leave a review. [1:07:39] And that really increased review rate, which gave us more data. [1:07:42] And it was a really fun experiment to work on. [1:07:44] There's a great concept in the literature on rating systems called the sound of silence, which is this idea that there's a lot of information in ratings that are not left. So Steve Tadellis, who's a professor at Berkeley, he had a really nice paper with some folks at eBay talking about what they called effective percent positive, where rather than normalizing just by the ratings, they normalized by including ratings that weren't left.
[1:08:14] downstream kind of performance of a seller. So there's a lot of information in that lack of a response. So it's cool that you're able to get more of that out. So much easier just to not leave a review than leave a bad review, right? Like the downside to you is just... [1:08:28] much better. [1:08:30] Oh, man. Marketplaces are so fascinating. I could see why a founder would want to be a marketplace founder because it's just like such an interesting space and... [1:08:38] hearing your feedback of like, "No, you're not a marketplace founder. Let's think about the problem you're solving." And it might be a marketplace. [1:08:44] might change people's minds. [1:08:46] Also, I feel like there's like a podcast episode in every topic we touched on. I know we just kind of scratched the surface a lot of things. [1:08:52] I know you got to run before we get to our lightning round. Is there anything else you wanted to highlight, touch on, leave people with? [1:08:58] that are maybe working on marketplaces, thinking about a marketplace. I think one of the high level points I would make, and like you said, there's an entire podcast in this topic, is that, [1:09:08] I think people... [1:09:09] want to imagine LLMs and AI-driven data science automating out large parts of what it means to do data science in industry. [1:09:19] And I think that's probably the wrong perspective. In some mundane sense, that's true. It's easier for me to code than it used to be before. Easier for me to develop visualizations than it used to be. I can make dashboards faster. So programmatically, I think it's true in some basic sense. [1:09:36] What I believe pretty strongly, and I teach data science here, and my students are asked to use LLMs and generative AI on a weekly basis on all their assignments. So I've got an up-close and personal read on this.
[1:09:47] But I believe very strongly actually is what AI has done for us is it's massively expanded the frontier of things we could think about our problem, hypotheses we could have, maybe things we could test. It's just an astronomical explosion of explanations and ideas and principle. [1:10:06] And... [1:10:06] I really think actually what that does is puts more pressure on the human, not less. I think it becomes more important for humans to be in the loop in interacting with these tools to drive the funneling down process of identifying what matters at all levels. That ranges from you're carrying out a data scientific analysis. And now because you've got these tools, you can hypothesize 10 explanations. [1:10:30] maybe 100 explanations. [1:10:32] Which of those are you going to focus attention on? What are you going to tell other people to focus their attention on? To your running experiments, you used to have 10 creatives you're testing for a marketing campaign. Now you've got 1,000 creatives you're testing for that marketing campaign. [1:10:43] Maybe that completely changes the game of what it means to run an experiment. What are you actually looking for now? How do you evaluate that you found something that was good enough? And I think these questions are not getting enough attention. I think people are looking for the automated tool that really... [1:10:59] cuts the human out but what i've seen so far and again you know who knows by 2024 i might have a totally different answer for you i don't think so [1:11:07] But at the moment, what I see is that humans have actually become far more important. [1:11:13] to the productive data science loop, not far less. Such an important point. I feel like we need to add AI corner to this podcast where we always think about how does AI impact what we're talking about on this? Yeah, I can see that. I totally see that. Okay, we might start doing that.
[1:11:27] Ramesh, with that, we've reached our very exciting lightning round. I've got six questions for you. Let's try to knock through them so you can go teach your class. [1:11:34] Are you ready? [1:11:35] I am ready. All right. What are two or three books you've recommended most to other people? When it comes to books, I have one I love that I start with always, which is How to Lie with Statistics. It's a tiny book by Daryl Huff from 1954, which is just for anyone... [1:11:50] that likes data at any level, it's such a fun read. It's a great book. The second thing I recommend to people, and actually this is true even for people who are not expert, is David Friedman was a statistician at Berkeley who passed away in the early 2000s. [1:12:08] And his writing was fantastic in getting us to think hard about process. He was especially fond of what he called shoe leather statistics, where you rolled your sleeves up, you got on the ground, boots on the ground, really getting in there, really trying to understand your data. [1:12:27] His writing is fantastic. His explanations are fantastic. He has a few different books at different levels. I think people love reading. Most importantly, what I like about it is he puts such emphasis on driving evidence and understanding of your processes that generate data. And, you know, I find often data scientists don't even look at examples. Right. So like at Odesk, it meant, are you looking at actual jobs and what's actually going on in your product before you're trying to do data science on it?
[1:12:57] writing is great. The last one I was going to mention has nothing to do with data science or anything. It's called 4,000 Weeks. [1:13:03] by Oliver Berkman. [1:13:05] I'm not like a huge like self-help type person, but I really like this book a lot. It's a little bit I think it's a little bit stoic in its approach, like so philosophy. But it's the basic point is you're only on Earth for somewhere in the neighborhood of 4000 weeks. And, you know, my wife and I have this term we call the infinite cue, which is like no matter what you think you get done on a given day, more stuff's going to just keep coming in. And he basically says that recognizing that is liberating, because once you recognize it doesn't matter what you do, you're always going to have too much to do. [1:13:33] There's no point in stressing out about having too much to do. And just that like small shift of mindset then puts a lot more attention on, you know, the usual thing people worry about, which is where do I want to prioritize my time? So he has a great way of writing about it, some concrete rules of thumb to help manage, you know, that way of thinking. And yeah, I think it's a great book. What is a favorite recent movie or TV show? I am a climber. And one movie that I really liked was The Alpinist. I know a lot of people have
[1:14:03] I recommend they watch the Alpinists. I think climbing is an interesting sport because... [1:14:09] It has a very much like a psychological aspect of it. And I think that movie is pretty good at this sort of meta level where you kind of reflect a little bit on what does it mean to make a movie about people who are obviously putting themselves into such risky situations. So I really enjoyed that. On TV, we've been watching Only Murders in the Building, but I'm like enough episodes behind right now that I probably won't say anything more because I'm trying to avoid any spoilers and I'm sure there's people out there trying to do the same. So great show though on Hulu. [1:14:39] What's a favorite interview question that you like to ask? [1:14:41] candidates that you're hiring. I interview people probably that are a little bit different than most of your podcast listeners. [1:14:47] But that said, there's one question I like to ask a lot, and that's, [1:14:52] If you imagine, so, you know, often in our interviews in academia, whether it's grad students or faculty, we'll ask people about their plans. [1:15:00] And what I like to ask people is, okay, now imagine... [1:15:03] Everything works out. [1:15:04] All the challenges you're facing work out. All your plans work out. Everything hits the top end of your vision for what this could be. [1:15:12] What do you imagine is the impact of having done that? [1:15:15] Like, who's being impacted by that? Why is that a big deal that that happened? And I find that's a really valuable question to ask, because, first of all, many people haven't thought about that. We're so short term folks, we don't even think like, boy, if everything worked out, what would be the big deal because of what I did? [1:15:32] Startup founders tend to be better at this than most people obviously but another reason I like it is because you are
[1:15:38] You'll find in that conversation that their vision expands a little bit of, [1:15:43] additional spheres that are touched or impacted by what they're thinking about doing so on both sides it's kind of a revealing question i think so i find it important for my line of work but you know my hunch is that might be useful for some of your listeners too [1:15:57] Yeah, such a unique perspective on interviewing. [1:16:00] versus most of the guests that I interview in tech company. Yeah, like normally there's like a coding question, right? I should say I would never ask a coding question like, you know, post November 2022 after after after we got. [1:16:11] AI to help us code. I think it's a superpower. [1:16:15] AI Corner. [1:16:16] Ha ha ha ha. [1:16:18] What is a favorite product you recently discovered that you really like? [1:16:22] I also really like cycling and I'm not ashamed to admit that I think that e-bikes are like the greatest thing for cycling. Admittedly, I'm like, you know, late 40s. So maybe I'm the right target demographic too. But yeah, I love my e-road bike. It's like great because it's not like one of those with a throttle, you have to work, but it kicks in just when you're on like your, you know, sixth hill and you don't want to go up the last hill anymore on the way home. So yeah, that's amazing. I think that just like, [1:16:47] transformative for people that like cycling but have busy lives and i think another one that my son who's 10 roped me into actually is we were like in santa cruz browsing at a [1:16:57] at a kitchenware shop of all places and he saw an outdoor pizza oven like a tiny portable one and he insisted he just did research for like two weeks and insisted we get one [1:17:07] So we got in over the summer and after we got it, he refused to eat pizza out anymore as a 10 year old. So, you know, that's like maybe that's the best best thing I could say about like the quality of pizza you can get from a home outdoor portable pizza oven.
[1:17:20] Oh my god, I'm hungry. I am going to go here. I have to get some pizza now. [1:17:24] What is a favorite life motto that you like to repeat to yourself, share with folks, find useful in your day-to-day? [1:17:31] A lot of my work involves talking to students of all stripes. And I guess these students go on to be data scientists, go on to be founders, and a lot of them go in the tech industry. So maybe in that sense, the advice is relevant. My main thing I tell people is slow down. [1:17:46] Because I think... [1:17:48] What I've found has been happening is just... [1:17:52] we're so convinced that speed is the way you're going to find the right answer that I just don't think we slow down to develop meaningful mental models of the things we're doing. That's certainly true in the research projects I work on. It's consistently true when I talk to people in business. I ask them about [1:18:10] By mental model, I just mean if you're running a marketplace, right? What is your model of what people care about? What makes people stay versus leave? What makes matches work versus not work? All those things shape a roadmap in your mind. And I think a lot of roadmapping, a lot of execution, paper writing in academia has all just become far more fast-paced at the expense of kind of deeper thinking about these kind of structural issues. [1:18:39] features of the thing you're building. And so, yeah, with my students, but also I think with people I interact with in industry. [1:18:45] I think slowing down is... [1:18:48] Actually, more of a virtue that it's given credit for. Very similar to a motto that a recent guest shared.
[1:18:53] Which I think was go slow to go fast or stay smooth to go fast. Yeah, I like that. Maybe I'll pilfer that when I go talk to my grad students. [1:19:04] Final question. You're a professor at Stanford University, which sounds incredibly cool. [1:19:08] What's something about being a professor that, [1:19:11] at Stanford. In particular or in general, that would surprise people. Either [1:19:15] Good or bad? [1:19:16] Yeah, I mean, we've had a rough ride, as everybody probably knows. Stanford's been in the news for [1:19:22] a lot of not so great reasons, I think, over the last five years especially. So I don't know if this is the right kind of surprise, but I think one thing that I find really interesting, [1:19:37] energizing at Stanford is people have never asked me for [1:19:43] credentialing here. And what I mean by that is that I came from kind of a bunch of other good schools and obviously I've spent time in industry with a lot of great companies. And [1:19:54] A kind of cultural dynamic that can often develop is, well, before I'm going to talk to you, [1:19:59] I want to know something about why you're worth talking to. Like, give me your credentials, right? Like, oh, you know, where are you a grad student or like, where are you a professor? Like, tell me about yourself first. [1:20:09] One of the things that I found very surprising when I came here is just how that never happened at any level. Grad students tell me this all the time. Go talk to someone across campus and just launch right into a conversation about how your X meets my Y and we have something we can do together. You know, as a faculty member, it happens all the time.
[1:20:39] things I do and yet seamless. Our conversation was about [1:20:43] the substance rather than the credentialing. [1:20:46] I really think part of the reason for that is that Stanford is sort of unique in that it doesn't have a weakness across the board. We have strong professional schools, law, business, medicine, strong engineering schools, strong humanities and social sciences. [1:20:58] And then that and the weather is what I usually tell people, honestly, which matters a lot. People are willing to walk anywhere. I think... [1:21:05] those things combine to create a culture and an environment where you don't credential everybody. And [1:21:10] I think that means a lot. I think that's something that I... [1:21:14] I haven't found elsewhere. And, you know, if people want to know something about what Stanford's like on the inside, I think that's one. [1:21:21] one aspect of it that probably isn't discussed very much. And I think that's part of what makes it really fun to be here. [1:21:27] It's also an incredibly dreamy campus that is in... [1:21:30] Very joyful to walk around. [1:21:32] That helps, I'm sure. [1:21:34] Ramesh, I feel like we got people's brains tingling. I think we've created new marketplace founders and also convinced people maybe they aren't marketplace founders. So maybe we netted out it. [1:21:42] Zero new marketplace founders. [1:21:44] Two final questions. Where can folks find you online if they want to reach out? And how can listeners be useful to you? [1:21:49] I think the easiest way, if someone's interested more on the industrial side, is probably LinkedIn. You send me a message or connect there. I'm also, because I'm an academic, I have my own Stanford webpage and it's pretty easy to figure out how to find me there as well. And how can listeners help me? [1:22:04] You know, I kind of feel the most important thing that someone listening to this could do is take forward some of the messages that came out in terms of what it means to be data literate.
[1:22:15] I think there's a lot you can do to educate yourself there. Maybe one final thought I'll share is that [1:22:21] In the same way that AI generates a lot of ideas, AI also generates a lot of pros. And in data science, that can actually be deadly because you're getting more explanations that sometimes maybe are extraneous. So taking that as a little vignette, I think what the world needs is data literacy on the part of people interacting with these tools and with each other. So that's the thing I care most about. The things I teach, the things I do research on, they're all connected to that thing. [1:22:51] I do work with companies regularly. And so if there's, you know, interesting opportunities that kind of fall in the sphere of stuff we've discussed on the podcast, always, always happy to listen. [1:23:00] Awesome. I think we've made a dent in helping people become a little more data literate. [1:23:05] Ramesh, thank you so much for being here. [1:23:07] All right. Thank you so much, Lenny. Bye, everyone.
Want to learn more?