COVID-19 Modeling and the Path to Herd Immunity

COVID-19 Modeling and the Path to Herd Immunity

; Youyang Gu, MA

Disclosures

February 08, 2021

21

This transcript has been edited for clarity.

Eric J. Topol, MD: Hello. This is Eric Topol, Medscape editor-in-chief. I'm delighted to have a chance to have a conversation, one on one, with Youyang Gu, previously from MIT and now doing incredible work on COVID-19. Welcome, Youyang.

Youyang Gu, MA: Thank you, Eric. It's an honor to be on Medscape today and to chat with you.

Topol: You are the first data scientist we've ever interviewed, which is amazing. I know you're only 27, which is also amazing, and your contributions have been vast. Before your work on COVID-19, you went to MIT and did a double major in electrical engineering, computer science, and math and then got a master's degree at MIT. You've also done work with natural language processing and at CSAIL. But let's go back with your upbringing and some of your prior work in data science. Where did this all get started for you?

Gu: I studied computer science at MIT, and it wasn't until probably my master's that I started getting into machine learning and data science. I had more of a systems background in my undergrad. For my master's, I worked on natural processing and using deep neural networks to do a lot of these difficult natural language processing tasks. That's how I got my first exposure to big data and building statistical models to make predictions on data.

After I graduated, I spent a couple of years working in finance. That's where I further honed my modeling background because, as you can probably imagine, it can get very quantitative, and the goal there is to just be as accurate as possible.

Topol: A lot of the great algorithmic talent goes to the finance industry, right?

Gu: Yes. I met some really brilliant and talented people during my time there. I feel like those couple of years helped me develop the skills that I've been using now for the past year while making sure that my COVID modeling is up to par and can be as accurate as possible.

Topol: But you also did work in sports analytics too, right?

Gu: Yes, I did some work in that, and I was working on that right before the pandemic hit. Actually, that's one of the reasons I got started in modeling COVID, because everything was shut down in March. Like many Americans, it impacted me. I was just curious to see where COVID was going, when it could potentially be improved.

The existing models at the time, back in March 2020, weren't doing as great of a job as I thought they could do. So I took a shot at building my own model to see what I could do. I guess it took off from there.

Topol: Well, you're not kidding that it took off. Before we get into covid19-projections.com, which you singlehandedly run — I mean, you don't have a team, right?

Gu: Yeah, it's just me.

Topol: It's Youyang Gu's COVID-19 projection.

Before we get into that, did you know when you were in the crib that you were going to become a computer data scientist guy? When did you figure this out?

Gu: I actually didn't, really... Growing up, I liked math and science, but I didn't really have much exposure to computer science until toward the end of high school. I had a really good computer science teacher. My dad was the one who was pushing me to study computer science because that's his expertise and that's what he is doing. I was always kind of like, "Oh, I don't want to do what you're doing. I want to take my own path."

But then once I got into MIT, I saw how much this technology and computer science is changing our world, and in many instances, for the better. That's what inspired me to dive into it and learn more about it.

Topol: That's really fascinating. Maybe there is a little genetic talent component to this, but you have definitely found a sweet spot of unmet need.

Projected Deaths: 60,000 or 2 Million?

Topol: Here it is, March 2020. The pandemic is starting to go into full force. You're looking at all of these models out there like the University of Washington IHME and others, and you see that they're really not performing very well. Basically, you said, "I'm going to set one up." When you made that decision, it was a big deal because in the vanguard, you became the number-one go-to modeling force out there. What was it that made you think I can do this really well?

Gu: I didn't expect going into it that I was going to have this model that everyone in the world was going to look at. I started because I was just curious to see where the trajectory was going because, at that time in March, things were getting worse and worse by the day.

The question that a lot of people had was, when would things peak? And so you had two ends of the spectrum where one model, say, like the IHME, was forecasting 60,000 deaths by the summer. And then on the other end of the spectrum, you had the Imperial College model that was estimating 2 million deaths by the summer. That's a pretty wide range, so I didn't really know what to make of it. That's why I thought it was better to write my own model and see who was right.

Topol: What were your inputs that made your model perform so well?

Gu: I think the answer to that would be better phrased as what inputs did I not use, because the only input that I did use was previous deaths. I feel like the other models out there were using too many inputs, too many data sources.

From my experience, when the signal in the data is so low, the data quality is low, so the more data you give it, the worse your outputs tend to be. That's why I didn't want to make it too complicated and I just decided that I was going to use deaths, and I wanted to make it as simple as possible. I just built the whole model in, I think, less than a week. From the beginning to when I had the website live, it was around a week.

Topol: And it became the go-to model for The Wall Street Journal, The Economist, The New York Times, The Washington Post, CNN, NPR, Nate Silver — I mean, everybody.

Gu: I'm curious — how did you first hear about it?

Topol: I learned about you through Twitter. Some people were writing to me early on in the pandemic saying, "Don't even look at those other models. There's only one to look at and it's Youyang's covid19-projections.com."

I started looking at yours every day. And it was so different from the others and so much more accurate. I just kept testing it and you were phenomenal. You were like the oracle or the soothsayer. It's amazing, actually.

No wonder when you delivered on accuracy, unlike the other models of these very prestigious academic centers with teams of people. Here you are, one person, and you're out with better forecasts. It's incredible. It really was amazing to see this, and it went on, obviously, many, many months.

You would interact with people on Twitter because some of them would challenge you or bring up controversies. How would that go?

Gu: That was actually the most surprising aspect of this whole thing. I felt like, at least to me, the building-the-model part was fairly straightforward in the sense that I've had prior experience at building statistical models. The hard part was starting on Twitter and trying to advertise my model, and getting people to first look at it.

I found that to be the most challenging part because, in the initial stages, I would just be tweeting at reporters, people in academia, and epidemiologists, saying, "Hey, can you look at my model?" Almost all of them went unanswered. Then it all happened in the span of like a week or two, where everything just blew up and suddenly I had all these people who previously weren't responding to me suddenly writing to me, saying, "Yeah, let's talk."

Since then, I've been pretty active on Twitter and I've seen a lot of different things happen on Twitter. It's been an interesting ride. I think you're right: There have been many of those people who are very supportive, but there are also people who are very critical.

I think I try to take in the criticism as much as I can. I think that's actually one of the reasons that my model has been so accurate, because I've been able to get that feedback from hundreds of people at once. Of course, some people put their criticisms more nicely than others.

Topol: Twitter brings out some of the worst, raw kind of communication. You always are very courteous and very respectful. That's another part about you that I think is very distinctive. I've never seen any of your responses that were at all disparaging. You come across through social media as a gentle, respectful, highly obviously intelligent person. You're a model. Not just a model, but a model for communicating.

Separating From the Pack

Topol: Now, one of the things that was curious to me is that you then separated from the pack. You're the most accurate model for COVID-19, and then you decided in the late fall, October or November, that you're going to just stop the projections. What made you do that?

  • 21
Recommendations

Comments

3090D553-9492-4563-8681-AD288FA52ACE
Comments on Medscape are moderated and should be professional in tone and on topic. You must declare any conflicts of interest related to your comments and responses. Please see our Commenting Guide for further information. We reserve the right to remove posts at our sole discretion.

processing....