1 hour 25 minutes 6 seconds
🇬🇧 English
Speaker 1
00:00
The following is a conversation with Greg Brockman. He's the co-founder and CTO of OpenAI, a world-class research organization developing ideas in AI with the goal of eventually creating a safe and friendly artificial general intelligence, one that benefits and empowers humanity. OpenAI is not only a source of publications, algorithms, tools, and datasets; their mission is a catalyst for an important public discourse about our future with both narrow and general intelligence systems.
Speaker 1
00:33
This conversation is part of the Artificial Intelligence Podcast at MIT and beyond. If you enjoy it, subscribe on YouTube, iTunes, or simply connect with me on Twitter at Lex Fridman, spelled F-R-I-D. And now, here's my conversation with Greg Brockman. So in high school and right after, you wrote a draft of a chemistry textbook. I saw that.
Speaker 1
00:57
It covers everything from the basic structure of the atom to quantum mechanics. So it's clear you have an intuition and a passion for both the physical world, with chemistry and now robotics, and the digital world, with AI, deep learning, reinforcement learning, and so on. Do you see the physical world and the digital world as different, and what do you think is the gap?
Speaker 2
01:20
A lot of it actually boils down to iteration speed. I think that a lot of what really motivates me is building things, right? You know, think about mathematics, for example, where you think really hard about a problem, you understand it, and you write it down in this very obscure form that we call a proof. But then it's in humanity's library, right?
Speaker 2
01:37
It's there forever. This is some truth that we've discovered. Maybe only 5 people in your field will ever read it, but somehow you've kind of moved humanity forward. And so I actually used to really think that I was going to be a mathematician and then I actually started writing this chemistry textbook.
Speaker 2
01:51
One of my friends told me, you'll never publish it because you don't have a PhD. So instead I decided to build a website and try to promote my ideas that way. And then I discovered programming, and realized that in programming you think hard about a problem, you understand it, you write it down in a very obscure form that we call a program. But then once again, it's in humanity's library, right?
Speaker 2
02:12
And anyone can get the benefit from it. And the scalability is massive. And so I think that the thing that really appeals to me about the digital world is that you can have this insane leverage, right? A single individual with an idea is able to affect the entire planet.
Speaker 2
02:26
And that's something I think is really hard to do if you're moving around physical atoms.
Speaker 1
02:30
But you said mathematics, so if you look at the wet thing over here, our mind, do you ultimately see it as just math, as just information processing? Or is there some other magic, as you've seen through biology and chemistry and so on?
Speaker 2
02:46
I think it's really interesting to think about humans as just information processing systems. It seems like it's actually a pretty good way of describing a lot of how the world works, or a lot of what we're capable of. Again, if you just look at technological innovations over time, in some ways the most transformative innovation that we've had has been the computer, right? In some ways the internet. You know, what has the internet done, right?
Speaker 2
03:10
The internet is not about these physical cables. It's about the fact that I am suddenly able to instantly communicate with any other human on the planet. I'm able to retrieve any piece of knowledge that in some ways the human race has ever had. And that those are these insane transformations.
Speaker 1
03:26
Do you see our society as a whole, the collective, as another extension of the intelligence of the human being? So if you look at the human being as an information processing system, and you mentioned the internet, the networking, do you see us all together as a civilization, as a kind of intelligence system?
Speaker 2
03:41
Yeah, I think this is actually a really interesting perspective to take and to think about that you sort of have this collective intelligence of all of society. The economy itself is this superhuman machine that is optimizing something, right? And it's almost, in some ways, a company has a will of its own, right?
Speaker 2
03:57
You have all these individuals who are all pursuing their own individual goals and thinking really hard and thinking about the right things to do, but somehow the company does something that is this emergent thing, and that is a really useful abstraction. I think that in some ways we think of ourselves as the most intelligent things on the planet and the most powerful things on the planet, but there are things that are bigger than us. There are these systems that we all contribute to. And so I think it's actually interesting to consider, if you've read Isaac Asimov's Foundation, that there's this concept of psychohistory in there, which is effectively this: if you have trillions or quadrillions of beings, then maybe you could actually predict what that huge macro being will do, almost independent of what the individuals want.
Speaker 2
04:42
I actually have a second angle on this that I think is interesting, which is thinking about technological determinism. One thing that I actually think a lot about with OpenAI is that we're kind of coming on to this insanely transformational technology of general intelligence that will happen at some point. And there's a question of how can you take actions that will actually steer it to go better rather than worse? And I think one question you need to ask is: as a scientist, as an inventor, as a creator, what impact can you have in general?
Speaker 2
05:11
You look at things like the telephone, invented by two people on the same day. What does that mean? What does that mean about the shape of innovation? And I think that what's going on is everyone's building on the shoulders of the same giants.
Speaker 2
05:21
And so you can't really hope to create something no one else ever would. You know, if Einstein wasn't born, someone else would have come up with relativity. He changed the timeline a bit, right? Maybe it would have taken another 20 years, but it's not that humanity would never have discovered these fundamental truths.
Speaker 1
05:37
So there's some kind of invisible momentum that some people like Einstein, or OpenAI, are plugging into, that anybody else can also plug into, and ultimately that wave takes us in a certain direction. That's what you mean by technological determinism.
Speaker 2
05:51
That's right. That's right. And you know, this kind of seems to play out in a bunch of different ways: there's some exponential that is being ridden, and the exponential itself, which one it is, changes.
Speaker 2
06:00
Think about Moore's law, an entire industry set its clock to it for 50 years. How can that be? How is that possible? And yet, somehow, it happened.
Speaker 2
06:09
And so I think you can't hope to ever invent something that no one else will. Maybe you can change the timeline a little bit. But if you really want to make a difference, I think the only real degree of freedom you have is to set the initial conditions under which a technology is born. And so you think about the internet, right?
Speaker 2
06:26
There were lots of other competitors trying to build similar things, and the internet won. And the initial conditions were that it was created by this group that really valued anyone being able to plug in, this very academic mindset of being open and connected. And I think that the internet for the next 40 years really played out that way. Maybe today things are starting to shift in a different direction, but I think that those initial conditions were really important to determine the next 40 years' worth of progress.
Speaker 1
06:54
That's really beautifully put. So another example that I think about: I recently looked at Wikipedia, the formation of Wikipedia.
Speaker 1
07:03
And I wonder what the internet would be like if Wikipedia had ads. You know, there's an interesting argument about why they chose not to put advertisements on Wikipedia. I think Wikipedia is one of the greatest resources we have on the internet. It's extremely surprising how well it works and how well it was able to aggregate all this kind of good information.
Speaker 1
07:25
And essentially the creator of Wikipedia, I don't know, there's probably some debates there, but set the initial conditions and how it carried itself forward. That's really interesting. So the way you're thinking about AGI or artificial intelligence is you're focused on setting the initial conditions for the progress. That's right.
Speaker 1
07:42
That's powerful. Okay, so look into the future. If you create an AGI system, like one that can ace the Turing test, natural language, what do you think would be the interactions you would have with it? What do you think are the questions you would ask?
Speaker 1
07:57
Like, what would be the first question you would ask it, her, him?
Speaker 2
08:00
That's right. I think that at that point, if you've really built a powerful system that is capable of shaping the future of humanity, the first question that you really should ask is: how do we make sure that this plays out well?
Speaker 2
08:11
And so that's actually the first question that I would ask a powerful AGI system.
Speaker 1
08:17
So you wouldn't ask your colleagues? You wouldn't ask, like, Ilya? You would ask the AGI system?
Speaker 2
08:22
Oh, we've already had the conversation with Ilya, right, and everyone here. And so you want as many perspectives and pieces of wisdom as you can for answering this question. So I don't think you necessarily defer to whatever your powerful system tells you, but you use it as one input to try to figure out what to do. I guess fundamentally what it really comes down to is, if you've built something really powerful, you think about, for example, shortly after the creation of nuclear weapons, the most important question in the world was: what's the world order going to be like? How do we set ourselves up in a place where we're going to be able to survive as a species?
Speaker 2
08:58
With AGI, I think the question is slightly different. There is a question of how do we make sure that we don't get the negative effects. But there's also the positive side, right? You imagine, you know, what will AGI be like?
Speaker 2
09:09
What will it be capable of? And I think that one of the core reasons that an AGI can be powerful and transformative is actually due to technological development. If you have something that's as capable as a human and it's much more scalable, then you absolutely want that thing to go read the whole scientific literature and think about how to create cures for all the diseases. You want it to think about how to go and build technologies to help us create material abundance, and to figure out societal problems that we have trouble with, like how we're supposed to clean up the environment.
Speaker 2
09:40
Maybe you want this to go and invent a bunch of little robots that will go out and be biodegradable and turn ocean debris into harmless molecules. And I think that that positive side is something that I think people miss sometimes when thinking about what an AGI will be like. And so I think that if you have a system that's capable of all of that, you absolutely want its advice about how do I make sure that we're using your capabilities in a positive way for humanity.
Speaker 1
10:09
So what do you think about that psychology that looks at all the different possible trajectories of an AGI system, many of which, perhaps the majority of which, are positive, and nevertheless focuses on the negative trajectories? I mean, you get to interact with folks, you get to think about this, maybe within yourself as well. You look at Sam Harris and so on.
Speaker 1
10:30
It seems to be, sorry to put it this way, but almost more fun to think about the negative possibilities. Whatever that is, deep in our psychology, what do you think about that? And how do we deal with it? Because we want AI to help us.
Speaker 2
10:44
So I think there are kind of two problems entailed in that question. The first is more the question of how can you even picture what a world with a new technology will be like? Now imagine we're in 1950 and I'm trying to describe Uber to someone.
Speaker 1
11:01
Ha ha ha. Apps and the internet. Yeah, I mean, that's going to be extremely complicated, but it's imaginable.
Speaker 2
11:10
It's imaginable, right? But, and now imagine being in 1950 and predicting Uber, right, and you need to describe the internet, you need to describe GPS, you need to describe the fact that everyone's going to have this phone in their pocket. And so I think that just the first truth is that it is hard to picture how a transformative technology will play out in the world.
Speaker 2
11:31
We've seen that before with technologies that are far less transformative than AGI will be. And so I think that one piece is that it's just hard to imagine, and to really put yourself in a world where you can predict what that positive vision would be like. And I think the second thing is that it is always easier to support the negative side than the positive side. It's always easier to destroy than create, less in a physical sense and more just in an intellectual sense, right?
Speaker 2
12:03
Because I think that with creating something, you need to get a bunch of things right, and to destroy, you just need to get one thing wrong. And so I think that what that means is that a lot of people's thinking dead-ends as soon as they see the negative story. But that being said, I actually have some hope.
Speaker 2
12:20
I think that the positive vision is something that we can talk about. I think that simply stating this fact, that yeah, there are positives, there are negatives, and everyone likes to dwell on the negative, people actually respond well to that message and say, huh, you're right, there's a part of this that we're not talking about, not thinking about. And that's actually something that's, I think, really been a key part of how we think about AGI at OpenAI.
Speaker 2
12:46
You can kind of look at it as like, okay, OpenAI talks about the fact that there are risks, and yet they're trying to build this system. How do you square those two facts?
Speaker 1
12:53
So do you share the intuition that some people have, I mean, from Sam Harris to even Elon Musk himself, that it's tricky as you develop AGI to keep it from slipping into the existential threats, into the negative? What's your intuition about how hard it is to keep AI development on the positive track?
Speaker 1
13:19
What's your intuition there?
Speaker 2
13:20
To answer that question, you can really look at how we structure OpenAI. So we really have three main arms. We have capabilities, which is actually doing the technical work and pushing forward what these systems can do.
Speaker 2
13:31
There's safety, which is working on technical mechanisms to ensure that the systems we build are aligned with human values. And then there's policy, which is making sure that we have governance mechanisms, answering the question of, well, whose values? And I think that technical safety is the one that people talk about the most, right? Think about all of the dystopic AI movies; a lot of that is about not having good technical safety in place. And what we've been finding is that a lot of people look at the technical safety problem and think it's just intractable.
Speaker 2
14:05
Right, this question of what do humans want, how am I supposed to write that down, can I even write down what I want? No way. And then they stop there. But the thing is, we've already built systems that are able to learn things that humans can't specify.
Speaker 2
14:20
You know, even the rules for how to recognize whether there's a cat or a dog in an image: it turns out it's intractable to write that down, and yet we're able to learn it. And what we're seeing with systems we build at OpenAI, still at an early proof-of-concept stage, is that you are able to learn human preferences. You're able to learn what humans want from data.
Speaker 2
14:38
And so that's kind of the core focus for our technical safety team. And I think that there we've actually had some pretty encouraging updates in terms of what we've been able to make work.
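[Editor's note: the "learning what humans want from data" idea Brockman describes can be illustrated with a minimal sketch: fitting a small reward model to pairwise human preference comparisons. Everything below, the feature representation, network size, and synthetic data, is an illustrative assumption, not OpenAI's actual implementation.]

import torch
import torch.nn as nn

class RewardModel(nn.Module):
    # Tiny network that scores how much a person would prefer an outcome,
    # represented here (as an assumption) by a fixed-size feature vector.
    def __init__(self, dim=16):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(dim, 64), nn.ReLU(), nn.Linear(64, 1))

    def forward(self, x):
        return self.net(x).squeeze(-1)

model = RewardModel()
opt = torch.optim.SGD(model.parameters(), lr=1e-2)

# Stand-in data for human judgments: in each pair, a person preferred the first item.
preferred = torch.randn(256, 16)
rejected = torch.randn(256, 16)

for step in range(200):
    # Bradley-Terry style loss: push the preferred item's score above the rejected one's.
    loss = -torch.log(torch.sigmoid(model(preferred) - model(rejected))).mean()
    opt.zero_grad()
    loss.backward()
    opt.step()

[The point is only that preferences enter as data, which of two things a person liked more, rather than as hand-written rules.]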
Speaker 1
15:03
So you have an intuition and a hope that, from data, looking at the value alignment problem, we can build systems that align with the collective better angels of our nature, so align with the ethics and the morals of human beings?
Speaker 2
15:04
To say this in a different way, think about how we align humans, right? A human baby can grow up to be an evil person or a great person, and a lot of that is from learning from data, right?
Speaker 2
15:15
You have some feedback as a child is growing up; they get to see positive examples. And since that's the only example we have of a general intelligence that is able to learn from data to align with human values, I think we shouldn't be surprised if the same sorts of techniques end up being how we solve value alignment for AGIs.
Speaker 1
15:40
So let's go even higher. I don't know if you've read the book Sapiens, but there's an idea that as a collective, as human beings, we kind of develop together ideas that we hold. There's no, in that context, objective truth.
Speaker 1
15:57
We just kind of all agree to certain ideas and hold them as a collective. In the world of good and evil, do you have a sense that, to a first approximation, there are some things that are good, and that you could teach systems to behave, to be good?
Speaker 2
16:05
So I think that this actually blends into our third team, which is the policy team. And this is the aspect I think people really talk about way less than they should.
Speaker 2
16:25
Because imagine that we build super powerful systems and we've managed to figure out all the mechanisms for these things to do whatever the operator wants. The most important question becomes: who's the operator, what do they want, and how is that going to affect everyone else? And I think that this question of what is good, what are those values, I mean, you don't even have to go to those very grand existential places to realize how hard this problem is. You just look at different countries and cultures across the world, and there's a very different conception of how the world works and what kinds of ways society wants to operate.
Speaker 2
17:03
And so I think that the really core question is actually very concrete, and it's not a question that we have ready answers to. It's: how do you have a world where all the different countries, the United States, China, Russia, and the hundreds of other countries out there, are able to continue not just to operate in the way that they see fit, but where the world that emerges, with these very powerful systems operating alongside humans, ends up being something that empowers humans more, that makes human existence a more meaningful thing, and where people are happier and wealthier and able to live more fulfilling lives? It's not an obvious thing how to design that world once you have that very powerful system.
Speaker 1
17:53
So if we take a little step back, we're having a fascinating conversation, and OpenAI is in many ways a tech leader in the world, and yet we're thinking about these big existential questions, which is fascinating and really important. I think you're a leader in that space, and that's a really important space, of just thinking about how AI affects society in a big-picture view.
Speaker 1
18:16
So Oscar Wilde said, we're all in the gutter, but some of us are looking at the stars, and I think OpenAI has a charter that looks to the stars, I would say: to create general intelligence and make it beneficial, safe, and collaborative. Can you tell me how that came about, how a mission like that, and the path to creating a mission like that at OpenAI, was founded?
Speaker 2
18:39
Yeah, so I think that in some ways it really boils down to taking a look at the landscape. So if you think about the history of AI, basically for the past 60 or 70 years, people have thought about this goal of what could happen if you could automate human intellectual labor. Imagine you could build a computer system that could do that.
Speaker 2
18:59
What becomes possible? We have a lot of sci-fi that tells stories of various dystopias, and increasingly you have movies like Her that tell a little bit more of a utopian vision. You think about the impacts that we've seen from being able to have bicycles for our minds in computers, and I think that the impact of computers and the internet has just far outstripped what anyone really could have predicted. And so I think that it's very clear that if you can build an AGI, it will be the most transformative technology that humans will ever create.
Speaker 2
19:34
And so what it boils down to then is the question of, well, is there a path? Is there hope? Is there a way to build such a system? And I think that for 60 or 70 years people got excited, and it ended up not being possible to deliver on the hopes that people had pinned on it.
Speaker 2
19:51
And I think that then, after two winters of AI development, people kind of almost stopped daring to dream; really talking about AGI or thinking about AGI became almost a taboo in the community. But I actually think that people took the wrong lesson from AI history. If you look back, 1959 is when the perceptron was released, and this is basically one of the earliest neural networks.
Speaker 2
20:17
It was released to what was perceived as massive overhype. In the New York Times in 1959, you have this article saying that the perceptron will one day recognize people, call out their names, instantly translate speech between languages. And people at the time looked at this and said, your system can't do any of that, and basically spent 10 years trying to discredit the whole perceptron direction, and succeeded. All the funding dried up and people kind of went in other directions.
Speaker 2
20:45
And in the 80s there was a resurgence. I'd always heard that the resurgence in the 80s was due to the invention of backpropagation and these algorithms that got people excited, but actually the causality was due to people building larger computers. You can find these articles from the 80s saying that the democratization of computing power suddenly meant that you could run these larger neural networks.
Speaker 2
21:04
And then people started to do all these amazing things. The backpropagation algorithm was invented. And, you know, the neural nets people were running were these tiny little 20-neuron neural nets, right? What are you supposed to learn with 20 neurons?
Speaker 2
21:15
And so of course they weren't able to get great results. And it really wasn't until 2012 that this approach, which is almost the most simple, natural approach that people had come up with in the 50s, in some ways even in the 40s before there were computers, with the McCulloch-Pitts neuron, suddenly became the best way of solving problems. I think there are three core properties that deep learning has that are very worth paying attention to.
Speaker 2
21:44
The first is generality. We have a very small number of deep learning tools: SGD, deep neural nets, maybe some RL. And it solves this huge variety of problems: speech recognition, machine translation, game playing, all of these problems with a small set of tools. So there's the generality.
Speaker 2
22:02
There's a second piece, which is the competence. If you want to solve any of those problems, throw out 40 years' worth of normal computer vision research, replace it with a deep neural net, and it's going to work better. And there's a third piece, which is the scalability. One thing that has been shown time and time again is that if you have a larger neural network and throw more compute and more data at it, it will work better.
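[Editor's note: to make the "small set of tools" point concrete, here is a minimal generic sketch of the recipe he describes, a deep neural network trained with SGD; the task (random 10-class classification), the layer sizes, and the hyperparameters are placeholder assumptions, and "scaling up" mostly means wider layers, more data, and more compute.]

import torch
import torch.nn as nn

torch.manual_seed(0)
X = torch.randn(1024, 32)          # stand-in inputs
y = torch.randint(0, 10, (1024,))  # stand-in labels for a 10-class problem

model = nn.Sequential(             # make this wider/deeper to "scale up"
    nn.Linear(32, 128), nn.ReLU(),
    nn.Linear(128, 128), nn.ReLU(),
    nn.Linear(128, 10),
)
opt = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = nn.CrossEntropyLoss()

for epoch in range(20):
    loss = loss_fn(model(X), y)    # same loop whether the features come from vision, speech, or text
    opt.zero_grad()
    loss.backward()
    opt.step()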
Speaker 2
22:25
Those three properties together feel like essential parts of building a general intelligence. Now it doesn't mean that if we just scale up what we have, we will have an AGI; there are clearly missing pieces, there are missing ideas. We need to have answers for reasoning.
Speaker 2
22:40
But I think that the core here is that for the first time, it feels like we have a paradigm that gives us hope that general intelligence can be achievable. And as soon as you believe that, everything else comes into focus. The timeline I think remains uncertain, but certainly within our lifetimes, and possibly within a much shorter period of time than people would expect, if you can really build the most transformative technology that will ever exist, you stop thinking about yourself so much. You start thinking about, how do you have a world where this goes well?
Speaker 2
23:16
And you need to think about the practicalities of how you build an organization and get together a bunch of people and resources, and make sure that people feel motivated and ready to do it. But then you start thinking about, well, what if we succeed? And how do we make sure that when we succeed, the world is actually the place that we want ourselves to exist in, almost in the Rawlsian veil sense of the word? And so that's kind of the broader landscape.
Speaker 2
23:43
And OpenAI was really formed in 2015 with that high-level picture: AGI might be possible sooner than people think, and we need to try to do our best to make sure it's going to go well. And then we spent the next couple of years really trying to figure out what that means and how we do it. And, you know, typically with a company you start out very small, you and a co-founder, and you build a product, you get some users, you get product-market fit, then at some point you raise some money, you hire people, you scale, and then down the road the big companies realize you exist and try to kill you. And for OpenAI, it was basically everything in exactly the opposite order.
Speaker 1
24:25
Let me just pause for a second. You said a lot of things, and let me just admire the jarring aspect of what OpenAI stands for, which is daring to dream. I mean, you said it's pretty powerful.
Speaker 1
24:36
It caught me off guard because I think that's very true. The step of just daring to dream about the possibilities of creating intelligence in a positive and a safe way. But just even creating intelligence is a much needed refreshing catalyst for the AI community. So that's the starting point.
Speaker 1
24:58
Okay, so then, the formation of OpenAI.
Speaker 2
25:02
I would just say that when we were starting OpenAI, that kind of the first question that we had is, is it too late to start a lab with a bunch of the best people? Right, is that even possible?
Speaker 1
25:13
That was an actual question.
Speaker 2
25:15
That was the core question. We had this dinner in July of 2015, and that was really what we spent the whole time talking about. You think about where AI was then: it had transitioned from being an academic pursuit to an industrial pursuit. And so a lot of the best people were in these big research labs, and we wanted to start our own one that, no matter how many resources we could accumulate, would pale in comparison to the big tech companies. We knew that, and there was a question of: are we actually going to be able to get this thing off the ground?
Speaker 2
25:48
You need a critical mass. You can't just do you and a co-founder building a product. You really need to have a group of five to ten people. And we kind of concluded it wasn't obviously impossible.
Speaker 2
25:59
So it seemed worth trying.
Speaker 1
26:02
Well, you're also a dreamer, so who knows, right? That's right. Okay, so speaking of that, competing with the big players, let's talk about some of the tricky things as you think through this process of growing, of seeing how you can develop these systems at a scale that competes.
Speaker 1
26:22
So you recently formed OpenAI LP, a new capped-profit company that now carries the name OpenAI. So OpenAI is now this official company. The original non-profit still exists and carries the OpenAI non-profit name. So can you explain what this company is, what the purpose of its creation is, and how you arrived at the decision to create it?
Speaker 2
26:48
OpenAI, the whole entity, and OpenAI LP as a vehicle, is trying to accomplish the mission of ensuring that artificial general intelligence benefits everyone. And the main way that we're trying to do that is by actually trying to build general intelligence ourselves and make sure the benefits are distributed to the world. That's the primary way.
Speaker 2
27:07
We're also fine if someone else does this, right? It doesn't have to be us. If someone else is going to build an AGI and make sure that the benefits don't get locked up in one company or with one set of people, we're actually fine with that. And so those ideas are baked into our charter, which is kind of the foundational document that describes our values and how we operate. It's also really baked into the structure of OpenAI LP.
Speaker 2
27:36
The way that we've set up OpenAI LP is that in the case where we succeed, if we actually build what we're trying to build, then investors are able to get a return, but that return is capped. And so if you think of AGI in terms of the value that you could really create, you're talking about the most transformative technology ever created; it's going to create orders of magnitude more value than any existing company, and all of that value will be owned by the world, legally titled to the nonprofit to fulfill that mission. And so that's the structure.
Speaker 1
28:12
So the mission is a powerful one, and it's one that I think most people would agree with. It's how we would hope AI progresses. And so how do you tie yourself to that mission?
Speaker 1
28:25
How do you make sure you do not deviate from that mission? That other incentives that are profit-driven don't interfere with the mission?
Speaker 2
28:36
So this was actually a really core question for us for the past couple of years, because I'd say that the way our history went was that for the first year, we were getting off the ground. We had this high-level picture, but we didn't know exactly how we wanted to accomplish it. And really two years ago was when we first started realizing that in order to build AGI, we're just going to need to raise way more money than we can as a nonprofit.
Speaker 2
29:00
And you know, we're talking many billions of dollars. And so the first question is, how are you supposed to do that and stay true to this mission? And we looked at every legal structure out there and concluded none of them were quite right for what we wanted to do. And I guess it shouldn't be too surprising that if you're going to do some crazy unprecedented technology, you're going to have to come up with some crazy unprecedented structure to do it in.
Speaker 2
29:20
And a lot of our conversation was with people at OpenAI, the people who really joined because they believe so much in this mission, and thinking about how do we actually raise the resources to do it and also stay true to what we stand for. And the place you've got to start is to really align on what is it that we stand for, right? What are those values? What's really important to us?
Speaker 2
29:41
And so I'd say that we spent about a year really compiling the OpenAI charter, and if you look at the first line item in there, it says that, look, we expect we're going to have to marshal huge amounts of resources, but we're going to make sure that we minimize conflict of interest with the mission. And aligning on all of those pieces was the most important step towards figuring out how to structure a company that can actually raise the resources to do what we need to do.
Speaker 1
30:16
I imagine the decision to create OpenAI LP was a really difficult one, and there were a lot of discussions, as you mentioned, for a year, and there were different ideas, perhaps detractors within OpenAI, sort of different paths that you could have taken. What were those concerns? What were the different paths considered?
Speaker 1
30:31
What was that process of making that decision like?
Speaker 2
30:34
Yep, so if you look at the OpenAI charter, there are almost two paths embedded within it. There is: we are primarily trying to build AGI ourselves, but we're also okay if someone else does it. And this is a weird thing for a company.
Speaker 1
30:48
It's really interesting actually. Yeah. There is an element of competition, that you do want to be the one that does it, but at the same time you're okay if somebody else does it.
Speaker 1
30:58
We'll talk about that a little bit, that trade off, that dance, that's really interesting.
Speaker 2
31:02
And I think this was the core tension as we were designing OpenAI LP, and really the OpenAI strategy: how do you make sure that you have a shot at being a primary actor, which really requires building an organization, raising massive resources, and really having the will to go and execute on some really, really hard vision? You need to really sign up for a long period to go and take on a lot of pain and a lot of risk. To do that, normally you just import the startup mindset.
Speaker 2
31:31
You think about, okay, how do we out-execute everyone? You have this very competitive angle. But you also have the second angle of saying that, well, the true mission isn't for OpenAI to build AGI. The true mission is for AGI to go well for humanity.
Speaker 2
31:45
And so how do you take all of those first actions and make sure you don't close the door on outcomes that would actually be positive and fulfill the mission? And so I think it's a very delicate balance, right? I think that going 100% in one direction or the other is clearly not the correct answer. And so even in terms of just how we talk about OpenAI and think about it, there's one thing that's always in the back of my mind: to make sure that we're not just saying OpenAI's goal is to build AGI.
Speaker 2
32:13
It's actually much broader than that. First of all, it's not just AGI, it's safe AGI, which is very important. But secondly, our goal isn't to be the ones to build it; our goal is to make sure it goes well for the world. And so figuring out how to balance all of those, and getting people to really come to the table and compile a single document that encompasses all of that, wasn't trivial.
Speaker 1
32:37
So part of the challenge here is your mission is, I would say, beautiful, empowering, and a beacon of hope for people in the research community and just people thinking about AI. So your decisions are scrutinized more than, I think, a regular profit-driven company. Do you feel the burden of this in the creation of the charter and just in the way you operate?
Speaker 1
33:00
Yes. So why do you lean into the burden by creating such a charter? Why not keep it quiet?
Speaker 2
33:10
I mean, it just boils down to the mission, right? Like, I'm here and everyone else is here because we think this is the most important mission. Dare to dream.
Speaker 1
33:18
All right, so do you think you can be good for the world or create an AGI system that's good when you're a for-profit company? From my perspective, I don't understand why profit interferes with positive impact on society. I don't understand why Google, that makes most of its money from ads, can't also do good for the world, or other companies, Facebook, anything.
Speaker 1
33:47
I don't understand why those have to interfere. Profit isn't the thing, in my view, that affects the impact of a company. What affects the impact of the company is the charter, is the culture, is the people inside, and profit is the thing that just fuels those people. So what are your views there?
Speaker 2
34:08
Yeah, so I think that's a really good question, and there's some real longstanding debates in human society that are wrapped up in it. The way that I think about it is just think about what are the most impactful non-profits in the world? What are the most impactful for-profits in the world?
Speaker 1
34:26
Right. It's much easier to list the for-profits.
Speaker 2
34:29
That's right. And I think that there's some real truth here: the system we've set up, the way today's world is organized, is one that really allows for huge impact. And part of that is that for-profits are self-sustaining and able to build on their own momentum. I think that's a really powerful thing.
Speaker 2
34:52
It's something that, when it turns out we haven't set the guardrails correctly, causes problems. Think about logging companies that go into the rainforest; that's really bad, we don't want that. And it's actually really interesting to me that this question of how you get positive benefits out of a for-profit company is actually very similar to how you get positive benefits out of an AGI, right?
Speaker 2
35:15
You have this very powerful system, it's more powerful than any human, it's kind of autonomous in some ways, superhuman on a lot of axes, and somehow you have to set the guardrails to get good things to happen. But when you do, the benefits are massive. And so when I think about nonprofit versus for-profit, I think just not enough happens in nonprofits. They're very pure, but it's just hard to do things there. In for-profits, in some ways, too much happens. But if shaped in the right way, it can actually be very positive.
Speaker 2
35:47
And so with OpenAI LP, we're picking a road in between. Now the thing that I think is really important to recognize is that the way we think about OpenAI LP is in the world where AGI actually happens, right? In a world where we are successful, where we build the most transformative technology ever, the amount of value we're going to create will be astronomical. In that case, the cap that we have will be a small fraction of the value we create.
Speaker 2
36:15
The amount of value that goes back to investors and employees looks pretty similar to what would happen in a pretty successful startup. And that's really the case that we're optimizing for: in the success case, making sure that the value we create doesn't get locked up. And I expect that in other for-profit companies it's possible to do something like that.
Speaker 2
36:37
I think it's not obvious how to do it, right? As a for-profit company, you have a fiduciary duty to your shareholders, and there are certain decisions that you just cannot make. In our structure, we've set it up so that we have a fiduciary duty to the charter. We always get to make the decision that is right for the charter, even if it comes at the expense of our own shareholders.
Speaker 2
37:01
And so when I think about what's really important, it's not really about nonprofit versus for-profit. It's really the question: if you build AGI and humanity is now in this new age, who benefits? Whose lives are better? And I think that what's really important is to have an answer that is: everyone.
Speaker 1
37:20
Yeah, which is one of the core aspects of the charter. So one concern people have, not just with OpenAI, but with Google, Facebook, Amazon, anybody really that's creating impact at scale, is how do we avoid, as your charter says, enabling the use of AI or AGI to unduly concentrate power? Why would a company like OpenAI not keep all the power of an AGI system to itself?
Speaker 1
37:48
The charter? The charter. So, how does the charter actualize itself in the day-to-day?
Speaker 2
37:57
So, I think that first to zoom out, the way that we structure the company is so that the power for sort of, you know, dictating the actions that OpenAI takes ultimately rests with the board, right? The board of the nonprofit. And the board is set up in certain ways, with certain restrictions that you can read about in the OpenAI LP blog post.
Speaker 2
38:16
But effectively the board is the governing body for OpenAI LP. The board has a duty to fulfill the mission of the non-profit. That's kind of how we tie, how we thread all these things together. Now there's a question of day-to-day, how do people, the individuals, who in some ways are the most empowered ones, right?
Speaker 2
38:36
Now, the board sort of gets to call the shots at the high level, but the people who are actually executing are the employees, right? The people here on a day-to-day basis who have the, you know, the keys to the technical kingdom. And there I think that the answer looks a lot like, well, how does any company's values get actualized, right? I think that a lot of that comes down to that you need people who are here because they really believe in that mission and they believe in the charter and that they are willing to take actions that maybe are worse for them but are better for the charter.
Speaker 2
39:08
And that's something that's really baked into the culture. And honestly, I think that's one of the things we really have to work to preserve as time goes on, and it's a really important part of how we think about hiring people and bringing people into OpenAI.
Speaker 1
39:22
So there are people here who could speak up and say, like, hold on a second, this is totally against what we stand for, culture-wise?
Speaker 2
39:34
Yeah, yeah, for sure. I mean, I think that's a pretty important part of how we operate, and even with designing the charter and designing OpenAI LP in the first place, there has been a lot of conversation with employees here, and a lot of times when employees said, wait a second, this seems like it's going in the wrong direction, let's talk about it. And here's actually one thing that I think is very unique about us as a small company: if you're at a massive tech giant, it's a little bit hard for someone who's a line employee to go and talk to the CEO and say, I think we're doing this wrong.
Speaker 2
40:10
And you look at companies like Google that have had some collective action from employees to make ethical change around things like Maven, so maybe there are mechanisms at other companies that work. But here, it's super easy for anyone to pull me aside, to pull Sam aside, to pull Ilya aside. And people do it all the time.
Speaker 1
40:27
One of the interesting things in the charter is this idea, it'd be great if you could try to describe or untangle it, of switching from competition to collaboration in late-stage AGI development. It's really interesting, this dance between competition and collaboration. How do you think about that?
Speaker 2
40:43
Yeah. Assuming that you can actually do the technical side of AGI development, I think there are going to be two key problems in figuring out how you actually deploy it and make it go well. The first one of these is the run-up to building the first AGI. You look at how self-driving cars are being developed, and it's a competitive race.
Speaker 2
41:00
And the thing that always happens in a competitive race is that you have huge amounts of pressure to get rid of safety. And so that's one thing we're very concerned about: multiple teams figuring out that we can actually get there, but thinking, if we take the slower path that is more guaranteed to be safe, we will lose, and so we're going to take the fast path. And so the more that we can avoid generating that competitive race ourselves, where we say, if the race is being run and someone else is further ahead than we are, we're not going to try to leapfrog.
Speaker 2
41:35
We're going to actually work with them. We will help them succeed. As long as what they're trying to do is to fulfill our mission, then we're good. We don't have to build AGI ourselves.
Speaker 2
41:44
I think that's a really important commitment from us, but it can't just be unilateral. I think that it's really important that other players who are serious about building AGI make similar commitments. I think that, again, to the extent that everyone believes that AGI should be something to benefit everyone, then it actually really shouldn't matter which company builds it. And we should all be concerned about the case where we just race so hard to get there that something goes wrong.
Speaker 1
42:07
So what role do you think government, our favorite entity, has in setting policy and rules about this domain? From research to the development to early stage to late stage AI and AGI development?
Speaker 2
42:22
So I think that, first of all, it is really important that government is in there, right? In some way, shape, or form. At the end of the day, we're talking about building technology that will shape how the world operates, and there needs to be government as part of that answer. And so that's why we've done a number of different congressional testimonies, we interact with a number of different lawmakers, and right now a lot of our message to them is that it's not the time for regulation, it is the time for measurement. Our main policy recommendation is that people, and the government does this all the time with bodies like NIST, spend time trying to figure out just where the technology is, how fast it's moving, and really become literate and up to speed with respect to what to expect.
Speaker 2
43:13
So I think that today the answer really is about measurement, and I think that there will be a time and place where that will change. And I think it's a little bit hard to predict exactly what that trajectory should look like.
Speaker 1
43:27
So there will be a point at which regulation, federal regulation in the United States, where the government steps in and helps be, I don't want to say the adult in the room, to make sure that there are strict rules, maybe conservative rules, that nobody can cross?
Speaker 2
43:45
Well, I think there are kind of maybe two angles to it. So today, with narrow AI applications, I think there are already existing bodies that are responsible and should be responsible for regulation. You think about, for example, with self-driving cars, that you want the National Highway… It's a… Yeah, exactly, to be regulated.
Speaker 2
44:02
That makes sense, right? That basically what we're saying is that we're going to have these technological systems that are going to be performing applications that humans already do. Great, we already have ways of thinking about standards and safety for those. So I think actually empowering those regulators today is also pretty important.
Speaker 2
44:19
And then I think for AGI, there's going to be a point where we'll have better answers, and maybe a similar approach of first measurement and then starting to think about what the rules should be. I think it's really important that we don't prematurely squash progress. I think it's very easy to smother a budding field.
Speaker 2
44:39
And I think that's something to really avoid. But I don't think the right way of doing it is to say, let's just try to blaze ahead and not involve all these other stakeholders.
Speaker 1
44:53
So you've recently released a paper on GPT-2, your language model, but did not release the full model because you had concerns about the possible negative effects of the availability of such a model. Outside of just that decision, it's super interesting because of the discussion it creates at a societal level, the discourse it creates. So it's fascinating in that aspect.
Speaker 1
45:19
But let's think through the specifics here first: what are some negative effects that you envisioned? And of course, what are some of the positive effects?
Speaker 2
45:28
Yeah, so again, to zoom out, the way that we thought about GPT-2 is that with language modeling we are clearly on a trajectory right now where we scale up our models and we get qualitatively better performance. GPT-2 itself was actually just a scale-up of a model that we released the previous June. We just ran it at a much larger scale, and we got these results where it was suddenly starting to write coherent prose, which was not something we'd seen previously.
Speaker 2
46:00
And what are we doing now? Well, we're going to scale up GPT-2 by 10x, by 100x, by 1,000x, and we don't know what we're going to get. And so it's very clear that the model that we released last June is kind of a good academic toy. It's not something that we think can really have negative applications, or to the extent that it can, the positive of people being able to play with it far outweighs the possible harms.
Speaker 2
46:28
You fast forward to not GPT-2, but GPT-20, and you think about what that's going to be like, and I think that the capabilities are going to be substantive. And so there needs to be a point in between the two where you say, this is something where we are drawing the line, and we need to start thinking about the safety aspects. I think for GPT-2 we could have gone either way. In fact, when we had conversations internally, we had a bunch of pros and cons, and it wasn't clear which one outweighed the other.
Speaker 2
46:58
I think that when we announced that, hey, we decided not to release this model, there was a bunch of conversation where various people said it's so obvious that you should have just released it. There were other people who said it's so obvious you should not have released it. And I think that almost definitionally means that holding it back was the correct decision. If it's not obvious whether something is beneficial or not, you should probably default to caution.
Speaker 2
47:19
And so I think that the overall landscape for how we think about it is that this decision could have gone either way. There are great arguments in both directions. But for future models down the road, and possibly sooner than you'd expect, because scaling these things up doesn't actually take that long. Those ones, you're definitely not going to want to release into the wild.
Speaker 2
47:39
And so I think we almost view this as a test case, to see: how do you have a society, how do you have a system, that goes from having no concept of responsible disclosure, where the mere idea of not releasing something for safety reasons is unfamiliar, to a world where you say, okay, we have a powerful model, let's at least think about it, let's go through some process? And you think about the security community: it took them a long time to design responsible disclosure, right? You think about this question of, well, I have a security exploit. I send it to the company.
Speaker 2
48:09
The company, like, tries to prosecute me, or just ignores it. What do I do, right? And the alternative of, oh, I just always publish my exploits, that doesn't seem good either, right?
Speaker 2
48:19
And so it really took a long time, and it was bigger than any individual, right? It's really about building a whole community that believes that, okay, we'll have this process where you send it to the company, and if they don't act in a certain time, then you can go public and you're not a bad person. You've done the right thing. And I think that in AI, part of the response to GPT-2 just proves that we don't have any concept of this.
Speaker 2
48:44
So that's the high-level picture. And so I think that this was a really important move to make. And we could have maybe delayed it for GPT-3, but I'm really glad we did it for GPT-2. And so now you look at GPT-2 itself and you think about the substance of, okay, what are potential negative applications?
Speaker 2
49:01
So you have this model that's been trained on the internet, which means there's a bunch of very biased data and a bunch of very offensive content in there, and you can ask it to generate content for you on basically any topic. You just give it a prompt and it'll just start writing. And it writes content like you see on the internet, even down to saying "Advertisement" in the middle of some of its generations. And you think about the possibilities for generating fake news or abusive content. And it's interesting seeing what people have done with the smaller version of GPT-2 that we released: people have done things like take my own Facebook message history and generate more Facebook messages like me, and people generating fake politician content. There's a bunch of things there where you at least have to think, is this going to be good for the world?
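[Editor's note: the "give it a prompt and it'll just start writing" workflow he describes looks roughly like the sketch below, shown with the publicly released small GPT-2 weights via the Hugging Face transformers library, a tool not mentioned in the conversation; the prompt and sampling settings are arbitrary choices, not OpenAI's setup.]

from transformers import GPT2LMHeadModel, GPT2Tokenizer

# Load the publicly released small GPT-2 model and its tokenizer.
tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")

prompt = "Recycling is bad for the world because"
input_ids = tokenizer.encode(prompt, return_tensors="pt")

# Sample a continuation; the model simply keeps writing from the prompt.
output = model.generate(input_ids, max_length=100, do_sample=True, top_k=40)
print(tokenizer.decode(output[0], skip_special_tokens=True))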
Speaker 2
49:54
There's the flip side, which is that I think there are a lot of awesome applications that we really want to see, like creative applications. If you have sci-fi authors who can work with this tool and come up with cool ideas, that seems awesome; it would be great if we can write better sci-fi through the use of these tools. And we've actually had a bunch of people write in to us asking, hey, can we use it for a variety of different creative applications? So the positives are actually pretty easy to imagine.
Speaker 1
50:24
The usual NLP applications are really interesting, but let's go there. It's kind of interesting to think about a world, look at Twitter, where not just fake news, but smarter and smarter bots are able to spread, in an interesting, complex, networked way, information that just drowns out us regular human beings and our original thoughts. So what are your views of this world with GPT-20?
Speaker 1
50:58
Right? How do we think about it? Again, it's like one of those things, like being in the '50s and trying to describe the internet or the smartphone. What do you think about that world, the nature of information?
Speaker 1
51:12
One possibility is that we'll always try to design systems that identify robot versus human, and we'll do so successfully, and so we'll authenticate that we're still human. And the other world is that we just accept the fact that we're swimming in a sea of fake news and just learn to swim there.
Speaker 2
51:32
Well, have you ever seen the popular meme of a robot with a physical arm and pen clicking the "I'm not a robot" button? Yeah. I think the truth is that really trying to distinguish between robot and human is a losing battle.
Speaker 2
51:52
Ultimately, you think it's a losing battle? I think it's a losing battle ultimately, in terms of the content, in terms of the actions that you can take. Think about how CAPTCHAs have gone.
Speaker 2
52:00
CAPTCHAs used to be very nice and simple: you just have this image, all of our OCR is terrible, you put a couple of artifacts in it, and humans are going to be able to tell what it is, but an AI system wouldn't be able to.
Speaker 2
52:13
Today, I can barely do CAPTCHAs. I think that this is just kind of where we're going. I think CAPTCHAs were a moment-in-time thing, and as AI systems become more powerful, the idea that there will be human capabilities that can be measured in a very easy, automated way that AIs will not be capable of, I think that's just an increasingly hard technical battle. But it's not that all hope is lost, right?
Speaker 2
52:36
You think about how we already authenticate ourselves, right? We have systems, we have social security numbers if you're in the US, or you have other ways of identifying individual people, and having real-world identity tied to digital identity seems like a step towards authenticating the source of content rather than the content itself. Now, there are problems with that: how can you have privacy and anonymity in a world where the only way you can trust content is by looking at where it comes from?
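One way to make "trust the source, not the content" concrete is to tie posts to a cryptographic identity: the author signs what they publish, and anyone can verify the signature against a public key registered to a known identity or reputation network. Below is a minimal sketch, assuming the Python cryptography package; the key handling and identity registration are hypothetical stand-ins, not a description of any existing system.

```python
# Illustrative sketch: authenticate the *source* of content rather than the content itself.
# Assumes the third-party "cryptography" package; identity registration is out of scope.
from cryptography.hazmat.primitives.asymmetric.ed25519 import Ed25519PrivateKey
from cryptography.exceptions import InvalidSignature

author_key = Ed25519PrivateKey.generate()   # held by a verified real-world identity
public_key = author_key.public_key()        # published, e.g. via a reputation network

post = b"I wrote this post myself."
signature = author_key.sign(post)           # distributed alongside the post

def came_from_author(content: bytes, sig: bytes) -> bool:
    """Return True only if the claimed author's key produced this signature."""
    try:
        public_key.verify(sig, content)
        return True
    except InvalidSignature:
        return False

print(came_from_author(post, signature))                      # True
print(came_from_author(b"Someone else's words.", signature))  # False
```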
Speaker 2
53:08
And so I think that building out good reputation networks may be 1 possible solution. But yeah, I think that this question is not an obvious 1. And I think that, maybe sooner than we think, we'll be in a world where this breaks down. Today I often will read a tweet and be like, do I feel like a real human wrote this? Do I feel like this is genuine?
Speaker 2
53:27
I feel like I can kind of judge the content a little bit. And I think in the future, it just won't be the case. You look at, for example, the FCC comments on net neutrality. It came out later that millions of those were auto-generated, and researchers were able to use various statistical techniques to figure that out.
Speaker 2
53:43
What do you do in a world where those statistical techniques don't work? It's just impossible to tell the difference between humans and AIs, and in fact, the most persuasive arguments are written by an AI. All that stuff, it's not sci-fi anymore. You look at GPT-2 making a great argument for why recycling is bad for the world.
Speaker 2
54:02
You gotta read that and be like, huh, you're right, we are addressing different symptoms.
Speaker 1
54:06
Yeah, that's quite interesting. I mean, ultimately it boils down to the physical world being the last frontier of proving it, so you said, basically, networks of people, humans vouching for humans in the physical world, and somehow the authentication ends there.
Speaker 1
54:22
I mean, if I had to ask you, I mean, you're way too eloquent for a human. So if I had to ask you to authenticate, like prove it: how do I know you're not a robot, and how do you know I'm not a robot? I think that, so far, in this space, in this conversation we just had, the physical movements we did are the biggest gap between us and AI systems, the physical manipulation. So maybe that's the last frontier.
Speaker 2
54:51
Well, here's another question: why is solving this problem important? Which aspects are really important to us? I think that probably where we'll end up is we'll hone in on what we really want out of knowing whether we're talking to a human.
Speaker 2
55:07
And I think that, again, this comes down to identity. And so the internet of the future, I expect, will be 1 that has lots of agents out there that will interact with you. But I think that the question of, is this a real flesh-and-blood human or is this an automated system, may actually just be less important.
Speaker 1
55:25
Let's actually go there. GPT-2 is impressive, and let's look at GPT-20. Why is it so bad that all my friends are GPT-20?
Speaker 1
55:40
Why is it so important on the internet, do you think, to interact with only human beings? Why can't we live in a world where ideas can come from models trained on human data?
Speaker 2
55:52
Yeah, I think this is actually a really interesting question. This comes back to how you even picture a world with some new technology. And I think that 1 thing that's important is, you know, let's say honesty.
Speaker 2
56:04
And I think that if you have, you know, almost in the Turing-test sense of the technology, AIs that are pretending to be humans and deceiving you, that feels like a bad thing. I think that it's really important that we feel like we're in control of our environment, that we understand who we're interacting with, and that whether it's an AI or a human is not something we're being deceived about.
Speaker 2
56:28
But I think the flip side is: can I have as meaningful an interaction with an AI as I can with a human? Well, I actually think here you can turn to sci-fi, and Her, I think, is a great example of asking this very question. 1 thing I really love about Her is that it really starts out almost by asking how meaningful human virtual relationships are. And then you have a human who has a relationship with an AI, and you really start to be drawn into that, right?
Speaker 2
56:54
All of your emotional buttons get triggered in the same way as if there were a real human on the other side of that phone. And so 1 way of thinking about it is that I think we can have meaningful interactions, and if there's a funny joke, in some sense it doesn't really matter if it was written by a human or an AI. But what you don't want, and where I think we should really draw hard lines, is deception. And I think that as long as we're in a world where, you know, why do we build AI systems at all, right?
Speaker 2
57:22
The reason we want to build them is to enhance human lives, to make humans be able to do more things, to have humans feel more fulfilled. And if we can build AI systems that do that, you know, sign me up.
Speaker 1
57:33
So the process of language modeling, how far do you think it takes us? Let's look at the movie Her. Do you think a dialogue, a natural language conversation as formulated by the Turing test, for example, do you think that process could be achieved through this kind of unsupervised language modeling?
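For reference, the "unsupervised language modeling" being asked about here is just next-token prediction; in standard notation (mine, not the speakers'), training maximizes

```latex
% Autoregressive language modeling objective: log-likelihood of each token
% given everything that came before it in the sequence.
\max_{\theta} \; \sum_{t=1}^{T} \log p_{\theta}\!\left(x_t \mid x_1, \ldots, x_{t-1}\right)
```

over text sequences x_1, ..., x_T scraped from the internet, with no task-specific labels.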
Speaker 2
57:52
So I think the Turing test in its real form isn't just about language, right? It's really about reasoning too, right? To really pass the Turing test, I should be able to teach calculus to whoever's on the other side and have it really understand calculus and be able to, you know, go and solve new calculus problems.
Speaker 2
58:11
And so I think that to really solve the Turing test we need more than what we're seeing with language models. We need some way of plugging in reasoning. Now, how different will that be from what we already do? That's an open question, right?
Speaker 2
58:23
It might be that we need some sequence of totally radical new ideas, or it might be that we just need to shape our existing systems in a slightly different way. But I think that in terms of how far language modeling will go, it's already gone way further than many people would have expected. And I think there are a lot of really interesting angles to poke at in terms of how much GPT-2 understands the physical world. Like, you read a little bit about fire underwater in GPT-2.
Speaker 2
58:52
So it's like, okay, maybe it doesn't quite understand what these things are. But at the same time, I think you also see various things, like smoke coming from flame, you know, a bunch of these things that GPT-2 gets right even though it has no body, it has no physical experience, it's just statically read data. And I think the answer is, we don't know yet. These questions, though, we're starting to be able to actually ask them of physical systems, of real systems that exist, and that's very exciting.
Speaker 1
59:20
What's your intuition? Do you think if you just scale language modeling, significantly scale, that reasoning can emerge from the same exact mechanisms?
Speaker 2
59:31
I think it's unlikely that if we just scale GPT-2 we'll have reasoning in a full-fledged way. And I think that the type signature is a little bit wrong, right? There's something we do that we call thinking, right?
Speaker 2
59:45
Where we spend a lot of compute, a variable amount of compute, to get to better answers, right? I think a little bit harder, I get a better answer. And that kind of type signature isn't quite encoded in a GPT, right? GPT will come in.
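To make the "type signature" point concrete: a plain autoregressive model spends roughly the same compute per generated token whether the question is easy or hard, whereas "thinking harder" implies a variable compute budget. Here is a toy sketch of that contrast; the model and scoring function are hypothetical stand-ins, not anything GPT actually does.

```python
# Toy contrast between fixed per-token compute and a variable "think harder" budget.
# `generate` and `score` are hypothetical placeholders for a model and a judge.
from typing import Callable, List

def lm_answer(prompt: str, generate: Callable[[str], str]) -> str:
    """Fixed compute: one pass of generation, regardless of how hard the question is."""
    return generate(prompt)

def thinking_answer(prompt: str,
                    generate: Callable[[str], str],
                    score: Callable[[str, str], float],
                    budget: int) -> str:
    """Variable compute: draw `budget` candidate answers and keep the best-scoring one."""
    candidates: List[str] = [generate(prompt) for _ in range(budget)]
    return max(candidates, key=lambda answer: score(prompt, answer))
```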