#371 – Max Tegmark: The Case for Halting AI Development

Lex Fridman Podcast

Full Transcription:

[0] The following is a conversation with Max Tegmark, his third time on the podcast.

[1] In fact, his first appearance was episode number one of this very podcast.

[2] He is a physicist and artificial intelligence researcher at MIT, co-founder of the Future of Life Institute, and author of Life 3.0: Being Human in the Age of Artificial Intelligence.

[3] Most recently, he's a key figure in spearheading the open letter calling for a six-month pause on giant AI

[4] experiments, like training GPT-4.

[5] The letter reads, we're calling for a pause on training of models larger than GPT-4 for six months.

[6] This does not imply a pause or ban on all AI research and development, or the use of systems that have already been placed on the market.

[7] Our call is specific and addresses a very small pool of actors who possess this capability.

[8] The letter has been signed by over 50,000 individuals, including 1,800 CEOs and over 1,500 professors.

[9] Signatories include Yoshua Bengio, Stuart Russell, Elon Musk, Steve Wozniak, Yuval Noah Harari, Andrew Yang, and many others.

[10] This is a defining moment in the history of human civilization, where the balance of power between human and AI begins to shift.

[11] And Max's mind and his voice are some of the most valuable and powerful in a time like this.

[12] His support, his wisdom, his friendship have been a gift I'm forever, deeply grateful for.

[13] And now a quick few-second mention of each sponsor.

[14] Check them out in the description.

[15] It's the best way to support this podcast.

[16] We've got Notion for project and team collaboration, Inside Tracker for biological data, and Indeed for hiring.

[17] Choose wisely, my friends. Also, speaking of hiring, if you want to work with our amazing team, we're always hiring, whether it's through Indeed or otherwise. Go to lexfridman.com/hiring. And now on to the full ad reads. As always, no ads in the middle. I try to make this interesting, but if you must skip them, please still check out our sponsors. I enjoy this stuff. Maybe you will too. This show is brought to you by Notion. I've spoken endlessly about how amazing Notion is, how everybody, all the cool kids, are recommending it for just basic note-taking, but there's so, so much more.

[18] It's the collaborative aspect of it, the project management aspect of it, the wikis, the document sharing, all of that, all in a simple, powerful, beautifully designed solution.

[19] What can I say?

[20] On top of this, there's the Notion AI tool.

[21] This is the best integration of large language models into a productivity note taking tool.

[22] There are so many amazing features.

[23] I mean, it's just endless.

[24] Go to the website.

[25] You can generate entire presentations and reports based on a to-do list.

[26] You can summarize stuff.

[27] You can shorten stuff.

[28] You can generate tables based on the description.

[29] You can write a summary.

[30] You can expand the text.

[31] You can change the style of the text.

[32] You can fix spelling and grammar.

[33] You can translate.

[34] You can use simpler language, more complicated language.

[35] Change the tone of the voice.

[36] Make it shorter, longer.

[37] Like I said, everything.

[38] It's just so easy to play around with and all of it.

[39] No matter what you're doing, it will challenge you to think about how you write.

[40] It will challenge you to expand the style of your writing.

[41] It will save you a lot of time, of course.

[42] But I just think it makes you a better thinker and productive being in this world.

[43] And I think that's such a great integration of AI into the productivity workflow.

[44] To me, it's not enough for a large language model to be effective at answering questions and having good dialogue.

[45] You have to really integrate it

[46] into the workflow.

[47] And Notion, better than anybody else I've seen, has done that.

[48] So if that's interesting to you, Notion AI helps you work faster, write better, and think bigger, doing tasks that normally take you hours in just minutes.

[49] Try Notion AI for free when you go to notion.com/lex.

[50] That's all lowercase, notion.com/lex, to try the power of Notion AI today.

[51] This show is also brought to you by Inside Tracker, a service I use to track biological data.

[52] It's really good

[53] to do that kind of thing regularly, to look at all the different markers in your body and to understand what could be made better through lifestyle, through diet changes.

[54] It's kind of obvious that decisions about your life should be made based on the data that comes from your own body.

[55] Not some kind of population study, although those are good.

[56] Not some spiritual guru, although those are good.

[57] Not some novel, whether it's Harry Potter or Dostoevsky, although those are sometimes good.

[58] not your relative who says, I heard a guy say that a guy does this thing that is very bro-sciencey sounding, although sometimes it turns out to be pretty effective.

[59] Overall, the best decisions about your life should be based on the things that come from your own body.

[60] Inside Tracker uses algorithms to analyze your blood data, DNA data, fitness tracker, all that kind of stuff to give you recommendations.

[61] You should be doing it.

[62] You should be doing it regularly.

[63] So it's not just a one-time thing, but regularly over time you see what changes led to improvements in the various markers that come from your body.

[64] Get special savings for a limited time when you go to insidetracker.com/lex.

[65] This show is also brought to you by Indeed, a hiring website.

[66] I think the most important thing in life, not to quote Conan the Barbarian, because that would be very inappropriate to quote at this moment.

[67] And it's not actually accurate at all as a reflection of what's important in life.

[68] It only has comedic value.

[69] What I really want to say about what's important in life is the people you surround yourself with.

[70] And we spent so much of our time in the workplace, seeking solutions to very difficult problems together, passionately pursuing ambitious goals, sometimes impossible goals.

[71] That is the source of meaning, a source of happiness for people.

[72] And I think part of that happiness comes from the collaboration with other human beings.

[73] the sort of professional depth of connection that you have with other human beings, being together through the grind and surviving and accomplishing the goal or failing in a big, epic way, knowing that you have tried together.

[74] And so doing that with the right team, I think, is one of the most important things in life, so you should surround yourself with the right team.

[75] If you're looking to join a team, you should be very selective about that, or if you're looking to hire a team, you should be very selective about that and use the best tools for the job.

[76] I've used Indeed many, many times throughout my life for the teams I've led.

[77] Don't overspend on hiring.

[78] Visit indeed.com/lex to start hiring now.

[79] That's indeed.com/lex.

[80] Terms and conditions apply.

[81] This is the Lex Fridman Podcast.

[82] To support it, please check out our sponsors in the description.

[83] And now, dear friends, here's Max Tegmark.

[84] You were the first ever guest on this podcast, episode number one.

[85] So first of all, Max, I just have to say, thank you for giving me a chance.

[86] Thank you for starting this journey.

[87] It's been an incredible journey.

[88] Just thank you for sitting down with me and just acting like I'm somebody who matters, that I'm somebody who's interesting to talk to.

[89] And thank you for doing it.

[90] That meant a lot.

[91] Thanks to you for putting your heart and soul into this.

[92] I know when you delve into controversial topics, it's inevitable to get hit by what Hamlet talks about, the slings and arrows and stuff.

[93] And I really admire this.

[94] It's in an era, you know, where YouTube videos are too long and now it has to be like a 20-minute TikTok, 20-second TikTok clip.

[95] It's just so refreshing to see you going exactly against all of the advice and doing these really long -form things and that people appreciate it.

[96] You know, reality is nuanced.

[97] And thanks for sharing it that way.

[98] So let me ask you again, the first question I've ever asked on this podcast.

[99] Episode number one, talking to you, do you think there's intelligent life out there in the universe?

[100] Let's revisit that question.

[101] Do you have any updates?

[102] What's your view when you look out to the stars?

[103] So when we look out to the stars, if you define our universe the way most astrophysicists do, not as all of space, but the spherical region of space that we can see with our telescopes, from which light has had time to reach us since our Big Bang.

[104] I'm in the minority.

[105] I estimate that we are the only life in this spherical volume that has invented the internet, radio, gotten to our level of tech.

[106] And if that's true, then it puts a lot of responsibility on us to not mess this one up.

[107] Because if it's true, it means that life is quite rare.

[108] and we are stewards of this one spark of advanced consciousness, which if we nurture it and help it grow, eventually life can spread from here out into much of our universe, and we can have this just amazing future.

[109] Whereas if we instead are reckless with the technology we build and just snuff it out due to stupidity or infighting, then maybe the rest of cosmic history in our universe was just going to be a play for empty benches.

[110] But I do think that we are actually very likely to get visited by aliens, alien intelligence, quite soon.

[111] But I think we are going to be building that alien intelligence.

[112] So we're going to give birth to an intelligent alien civilization.

[113] Unlike anything that evolution here on Earth was able to create in terms of the path, the biological path it took.

[114] Yeah, and it's going to be much more alien than a cat or even the most exotic animal on the planet right now, because it will not have been created through the usual Darwinian competition where it necessarily cares about self-preservation, afraid of death, any of those things.

[115] The space of alien minds that you can build is just so much vaster than what evolution will give you. And with that also comes a great responsibility for us to make sure that the kinds of minds we create are the kinds of minds that it's good to create, minds that will share our values and be good for humanity and life, and also minds that don't suffer. Do you try to visualize the full space of alien minds that AI could be?

[116] Do you try to consider all the different kinds of intelligences?

[117] So generalizing what humans are able to do to the full spectrum of what intelligent creatures, entities, could do?

[118] I try, but I would say I fail.

[119] I mean, it's very difficult for a human mind to really grapple with something so completely alien.

[120] Even for us, right?

[121] If we just try to imagine, how would it feel if we were completely indifferent towards death or individuality?

[122] Even if you just imagine that, for example, you could just copy my knowledge of how to speak Swedish.

[123] Boom, now you can speak Swedish.

[124] And you could copy any of my cool experiences and you could delete the ones you didn't like in your own life, just like that.

[125] it would already change quite a lot about how you feel as a human being, right?

[126] You probably spend less effort studying things if you just copy them and you might be less afraid of death because if the plane you're on starts to crash, you'd just be like, oh, shucks, I haven't backed my brain up for four hours.

[127] So I'm going to lose all these wonderful experiences of this flight.

[128] We might also start feeling more compassionate, maybe, with other people, if we can so readily share each other's experiences and our knowledge, and feel more like a hive mind. It's very hard, though. I really feel very humble about this, to grapple with how it might actually feel. The one thing which is so obvious, though, which I think is just really worth reflecting on, is that because the mind space of possible intelligences is so different from ours, it's very dangerous if we assume they're going to be like us, or anything like us.

[129] Well, the entirety of human written history, through poetry, through novels, through philosophy, has been trying to describe the human condition and what's entailed in it.

[130] Like, just like you said, fear of death and all those kinds of things, what is love, and all of that.

[131] That changes if you have a different kind of intelligence. All of it, the entirety of it. All those poems, they're trying to sneak up on what the hell it means to be human. All of that changes. How AI concerns and the existential crises that AI experiences, how that clashes with the human existential crisis, the human condition. Yeah, that's hard to fathom, hard to predict. It's hard, but it's fascinating to think about also.

[132] Even in the best-case scenario, where we don't lose control over the ever more powerful AI that we're building to other humans whose goals we think are horrible, and where we don't lose control to the machines, and AI provides the things we want, even then you get into the questions you touched on here.

[133] Maybe the struggle, the fact that it's actually hard to do things, is part of what gives us meaning as well, right?

[134] So, for example, I found it so shocking that this new Microsoft GPT-4 commercial that they put together has this woman talking about, showing this demo, how she's going to give a graduation speech to her beloved daughter, and she asks GPT-4 to write it.

[135] It was freaking 200 words or so.

[136] If I realized that my parents couldn't be bothered, struggling a little bit to write 200 words and outsource that to their computer, I would feel really offended, actually.

[137] And so I wonder if eliminating too much of this struggle from our existence, do you think that would also take away a little bit of what...

[138] It means to be human, yeah.

[139] We can't even predict.

[140] I had somebody mention to me that they started using ChatGPT, both 3.5 and then 4.0, to write what they really feel to a person. They have a temper issue, and they're basically trying to get ChatGPT to rewrite it in a nicer way, to get the point across but rewrite it in a nicer way.

[141] So we're even removing the inner asshole from our communication.

[142] So I don't, you know, there's some positive aspects of that, but mostly it's just the transformation of how humans communicate.

[143] And it's scary because so much of our society is based on this glue of communication.

[144] And if we're now using AI as the medium of communication, that does the language for us, so much of the emotion that's laden in human communication, and so much of the intent, is going to be handled by, outsourced to, the AI.

[145] How does that change everything?

[146] How does it change the internal state of how we feel about other human beings?

[147] What makes us lonely?

[148] What makes us excited?

[149] What makes us afraid?

[150] How we fall in love?

[151] All that kind of stuff.

[152] Yeah.

[153] For me personally, I have to confess, the challenge is one of the things that really makes my life feel meaningful.

[154] You know, if I go hike a mountain with my wife, Maya, I don't want to just press a button and be at the top.

[155] I want the struggle and to come up there sweaty and feel, wow, we did this. In the same way,

[156] I want to constantly work on myself to become a better person.

[157] If I say something in anger that I regret, I want to go back and really work on myself rather than just tell an AI from now on always filter what I write so I don't have to work on myself, because then I'm not growing.

[158] Yeah, but then again, it could be like with chess.

[159] And AI, once it significantly, obviously, surpasses the performance of humans, it will live in its own world and provide maybe a flourishing civilization for humans, but we humans will continue hiking mountains and playing our games, even though AI is so much smarter, so much stronger, so much superior in every single way, just like with chess.

[160] Yeah.

[161] I mean, that's one possible, hopeful trajectory here, is that humans will continue to human, and AI will just be a kind of a medium that enables the human experience to flourish.

[162] Yeah.

[163] I would phrase that as rebranding ourselves from Homo sapiens to Homo sentiens.

[164] You know, right now it's sapiens, the ability to be

[165] intelligent. We've even put it in our species name, so we're branding ourselves as the smartest information-processing entity on the planet.

[166] That's clearly gonna change if AI continues ahead.

[167] So maybe we should focus on the experience instead, the subjective experience that we have with Homo sentiens, and say that's what's really valuable, the love, the connection, the other things, and get off our high horses and get rid of this hubris that only we can do integrals. So consciousness, the subjective experience, is a fundamental value to what it means to be human. Make that the priority. That feels like a hopeful direction to me. But that also requires more compassion, not just towards other humans because they happen to be the smartest on the planet, but also towards all our other fellow creatures on this planet.

[168] And I personally feel right now we're treating a lot of farm animals horribly, for example, and the excuse we're using is, oh, they're not as smart as us.

[169] But if we can admit that we're not that smart in the grand scheme of things either, in the post-AI epoch, you know, then surely we should value the subjective experience of a cow also.

[170] Well, allow me to briefly look at the book, which at this point is becoming more and more visionary, that you've written, I guess, over five years ago, Life 3.0.

[171] So, first of all, 3.0, what's 1.0, what's 2.0, what's 3.0?

[172] And how has that vision sort of evolved,

[173] the vision in the book, evolved to today?

[174] Life 1.0 is really dumb, like bacteria, in that it can't actually learn anything at all during its lifetime.

[175] The learning just comes from this genetic process from one generation to the next.

[176] Life 2.0 is us and other animals which have brains, which can learn during their lifetime a great deal.

[177] And you were born without being able to speak English, and at some point you decided, hey, I want to upgrade my software.

[178] Let's install an English -speaking module.

[179] So you did?

[180] And Life 3.0 does not

[181] exist yet. It can replace not only its software, the way we can, but also its hardware. And that's where we're heading towards at high speed. We're already maybe Life 2.1, because we can, you know, put in an artificial knee, a pacemaker, et cetera, et cetera. And if Neuralink and other companies succeed, we'll be Life 2.2, et cetera. But what the companies trying to build AGI are trying to make is, of course, full 3.0. And you can put that intelligence in something that also has no biological basis whatsoever.

[182] So less constraints and more capabilities, just like the leap from 1.0 to 2.0.

[183] There is, nevertheless, you speaking so harshly about bacteria, so disrespectfully about bacteria.

[184] There is still the same kind of magic there that permeates Life 2.0 and 3.0.

[185] It seems like maybe the thing that's truly powerful about life, intelligence, and consciousness was already there in 1.0.

[186] Is it possible?

[187] I think we should be humble and not be so quick to make everything binary and say either it's there or it's not.

[188] Clearly, there's a great spectrum.

[189] And there is even controversy about whether some unicellular organisms like amoebas can maybe learn a little bit

[190] after all.

[191] So apologies if I offended any bacteria.

[192] It wasn't my intent.

[193] It was more that I wanted to talk up how cool it is to actually have a brain.

[194] Yeah.

[195] Where you can learn dramatically within your lifetime.

[196] Typical human.

[197] And the higher up you get from 1.0 to 2.0 to 3.0, the more you become the captain of your own ship, the master of your own destiny, and the less you become a slave to whatever evolution gave you, right?

[198] By upgrading our software, we can be so different from previous generations and even from our parents, much more so than even the bacterium, you know, no offense to them.

[199] And if you can also swap out your hardware and take any physical form you want, of course, really the sky is the limit.

[200] Yeah, so it accelerates the rate at which you can perform the computation that determines your destiny.

[201] Yeah, and I think it's worth commenting a bit on what "you" means in this context also.

[202] If you swap things out, a lot, right?

[203] This is controversial, but my current understanding is that life is best thought of not as a bag of meat, or even a bag of elementary particles, but rather as a system which can process information and retain its own complexity, even though nature is always trying to mess it up.

[204] So it's all about information processing. And that makes it a lot like a wave in the ocean, which is not its water molecules, right? The water molecules bob up and down, but the wave moves forward. It's an information pattern. In the same way, you, Lex, you're not the same atoms as during the first podcast you did with me. You've swapped out most of them, but still, the information pattern is still there. And if you could swap out your arms and whatever, you can still have this kind of continuity.

[205] It becomes a much more sophisticated sort of way forward in time where the information lives on.

[206] I lost both of my parents since our last podcast, and it actually gives me a lot of solace that with this way of thinking about them, they haven't entirely died. Because a lot of Mommy and Daddy's, sorry, I'm getting a little emotional here, but a lot of their values and ideas and even jokes and so on, they haven't gone away, right? Some of them live on. I can carry on some of them, and they also live on in a lot of other people. So in this sense, even with Life 2.0, we can to some extent already transcend our physical bodies and our death. And particularly if you can share your own information, your own ideas, with many others, like you do in your podcast, then that's the closest to immortality we can get with our bio-bodies.

[207] You carry a little bit of them in you, in some sense.

[208] Do you miss them?

[209] You miss your mom and dad?

[210] Of course.

[211] What did you learn about life from them, if we can take a bit of a tangent?

[212] Oh, so many things. For starters, my fascination for math and the physical mysteries of our universe, I got a lot of that from my dad. But I think my obsession with fairly big questions and consciousness and so on, that actually came mostly from my mom.

[213] And what I got from both of them, which is a very core part of really who I am, I think, is this just feeling comfortable with not buying into what everybody else is saying.

[214] Doing what I think is right.

[215] They both very much just, you know, did their own thing, and sometimes they got flack for it, and they did it anyway.

[216] That's why you've always been an inspiration to me, that you're at the top of your field and you're still willing to tackle the big questions in your own way.

[217] You're one of the people that represents MIT best to me. You've always been an inspiration in that.

[218] So it's good to hear that you got that from your mom and dad.

[219] Yeah, you're too kind.

[220] But yeah, I mean, the good reason to do science is because you're really curious, you want to figure out the truth.

[221] If you think this is how it is and everyone else says, no, no, that's bullshit and it's that way, you know, you stick with what you think is true.

[222] And even if everybody else keeps thinking it's bullshit, there's a certain, I always root for the underdog when I watch movies.

[223] And my dad, one time, for example, when I wrote one of my craziest papers ever, talking about our universe ultimately being mathematical, which we're not going to get into today, I got this email from a quite famous professor saying, this is not only bullshit, but it's going to ruin your career.

[224] You should stop doing this kind of stuff.

[225] I sent it to my dad.

[226] Do you know what he said?

[227] What did he say?

[228] He replied with a quote from Dante.

[229] Segui il tuo corso, e lascia dir le genti.

[230] Follow your own path and let the people talk.

[231] Go, dad.

[232] This is the kind of thing.

[233] He's dead, but that attitude is not.

[234] How did losing them as a man, as a human being, change you?

[235] How did it expand your thinking about the world?

[236] How did it expand

[237] your thinking about, you know, this thing we're talking about, which is humans creating another living, sentient, perhaps, being?

[238] I think it mainly did two things.

[239] One of them just going through all their stuff after they had passed away and so on just drove home to me how important it is to ask ourselves, why are we doing these things we do?

[240] Because it's inevitable that you look at some things they spent an enormous time on and you ask, in hindsight, would they really have spent so much time on this?

[241] Would they have done something that was more meaningful?

[242] So I've been looking more in my life now and asking, you know, why am I doing what I'm doing?

[243] And I feel it should either be something I really enjoy doing or it should be something that I find really, really meaningful because it helps humanity.

[244] And if it's in neither of those two categories, maybe I should spend less time on it.

[245] The other thing is dealing with death up close and personal like this.

[246] It's actually made me less afraid of, even less afraid of other people telling me that I'm an idiot, which happens regularly.

[247] And I just live my

[248] life, do my thing, you know. And it's made it a little bit easier for me to focus on what I feel is really important. What about fear of your own death? Has it made it more real, that this is something that happens? Yeah, it's made it extremely real. And I'm next in line in our family now, right? It's me and my younger brother. But they both handled it with such dignity.

[249] That was a true inspiration also.

[250] They never complained about things.

[251] And, you know, when you're old and your body starts falling apart, there's more and more to complain about.

[252] They looked at what they could still do that was meaningful.

[253] And they focused on that rather than wasting time talking about or even thinking much about things they were disappointed in.

[254] I think anyone can make themselves depressed if they start their morning by making a list of grievances.

[255] Whereas if you start your day with a little meditation on just the things you're grateful for, you basically choose to be a happy person.

[256] Because you only have a finite number of days, you should spend them well.

[257] Make it count.

[258] Being grateful.

[259] Well, you do happen to be working on a thing which seems to have potentially some of the greatest impact on human civilization of anything humans have ever created, which is artificial intelligence.

[260] This is on both the detailed technical level and the high philosophical level that you work on.

[261] So you've mentioned to me that there's an open letter that you're working on.

[262] It's actually going live in a few hours.

[263] I've been having late nights and early mornings.

[264] It's been very exciting, actually.

[265] In short, have you seen Don't Look Up, the film? Yes, yes. I don't want to be the movie spoiler for anyone watching this who hasn't seen it, but if you're watching this and you haven't seen it, watch it. Because we are actually acting it out. It's life imitating art. Humanity is doing exactly that right now, except it's an asteroid that we are building ourselves. Almost nobody is talking about it. People are squabbling across the planet about all sorts of things, which seem very minor compared to the asteroid that's about to hit us, right?

[266] Most politicians don't even have this on their radar. They think maybe in 100 years or whatever.

[267] Right now, we're at a fork in the road.

[268] This is the most important fork

[269] humanity has reached in its over 100,000 years on this planet.

[270] We're building effectively a new species.

[271] It's smarter than us.

[272] It doesn't look so much like a species yet because it's mostly not embodied in robots.

[273] But that's a technicality, which will soon be changed.

[274] And this arrival of artificial general intelligence that can do all our jobs as well as us, and probably shortly thereafter superintelligence, which greatly exceeds our cognitive abilities, it's going to either be the best thing ever to happen to humanity or the worst.

[275] I'm really quite confident that there is not that much middle ground there.

[276] But it would be fundamentally transformative to human civilization.

[277] Of course, utterly and totally.

[278] Again, we branded ourselves as Homo sapiens because it seemed like the basic thing.

[279] We're the king of the castle on this planet.

[280] We're the smart ones.

[281] And we can control everything else. This could very easily change.

[282] We're certainly not going to be the smartest on the planet for very long, unless AI progress just halts.

[283] And we can talk more about why I think that's true, because it's controversial.

[284] And then we can also talk about the reasons you might think it's going to be the best thing ever and the reasons you might think it's going to be the end of humanity, which is, of course, super controversial.

[285] But what I think we can, anyone who's working on advanced AI can agree on, is it's much like the film Don't Look Up, in that it's just really comical how little

[286] serious public debate there is about it, given how huge it is.

[287] So what we're talking about is the development of currently things like GPT-4, and the signs it's showing of rapid improvement that may in the near term lead to the development of superintelligent AGI, general AI systems, and what kind of impact that has on society.

[288] Exactly.

[289] When that thing achieves general human-level intelligence, and then beyond that, general superhuman-level intelligence.

[290] There's a lot of questions to explore here.

[291] So one, you mentioned halt.

[292] Is that the content of the letter?

[293] It's to suggest that maybe we should pause the development of these systems.

[294] Exactly.

[295] So this is very controversial.

[296] So when we talked the first time, we talked about how I was involved in starting the Future of Life Institute.

[297] What we worked very hard on in 2014, 2015, was to mainstream AI safety, the idea that there even could be risks and that you could do things about them.

[298] Before then, a lot of people thought it was just really kooky to even talk about it.

[299] And a lot of AI researchers felt worried that this was too flaky and could be bad for funding, and that the people who talked about it just didn't understand AI. I'm very, very happy with how that's gone, in that now, you know, it's completely mainstream. You go to any AI conference and people talk about AI safety, and it's a nerdy technical field full of equations and similar, and blah, blah. Yes, as it should be. But there's this other thing which has been quite taboo up until now, calling for slowdown. So what we've constantly been saying, including myself, I've been biting my tongue a lot, is that we don't need to slow down AI development.

[300] We just need to win this race, the wisdom race, between the growing power of the AI and the growing wisdom with which we manage it.

[301] And rather than trying to slow down AI, let's just try to accelerate the wisdom.

[302] Do all this technical work to figure out how you can actually ensure that your powerful AI is going to do what you want it to do, and have society adapt also, with incentives and regulations, so that these things get put to good use.

[303] Sadly, that didn't pan out.

[304] The progress on technical AI, on capabilities, has gone a lot faster than many people thought back when we started this in 2014.

[305] It turned out to be easier to build really advanced AI than we thought.

[306] And on the other side, it's gone much slower than we hoped with getting policy makers and others to actually put the incentives in place to steer this in the good directions.

[307] Maybe we should unpack it and talk a little bit about each.

[308] So why did it go faster than a lot of people thought, then?

[309] In hindsight, it's exactly

[310] like building flying machines.

[311] People spent a lot of time wondering about how do birds fly, you know, and that turned out to be really hard.

[312] Have you seen the TED Talk with a flying bird?

[313] Like a flying robotic bird?

[314] Yeah, it flies around the audience.

[315] But it took a hundred years longer to figure out how to do that than for the Wright brothers to build the first airplane because it turned out there was a much easier way to fly.

[316] And evolution picked the more complicated one because it had its hands tied.

[317] It could only build a machine that could assemble itself, which the Wright brothers didn't care about.

[318] It could only build a machine that uses only the most common atoms in the periodic table.

[319] The Wright brothers didn't care about that.

[320] They could use steel, iron atoms. And it had to be able to repair itself, and it also had to be incredibly fuel efficient.

[321] A lot of birds use less than half the fuel of a remote control plane flying the same distance.

[322] For humans, throw a little more, put a little more fuel in, vroom, there you go, 100 years earlier.

[323] That's exactly what's happening now with these large language models.

[324] The brain is incredibly complicated.

[325] Many people made the mistake

[326] of thinking that we had to figure out how the brain

[327] does human-level intelligence first, before we could build it in a machine.

[328] That was completely wrong.

[329] You can take an incredibly simple computational system called a transformer network and just train it to do something incredibly dumb: just read a gigantic amount of text and try to predict the next word.

[330] And it turns out, if you just throw a ton of compute at that and a ton of data, it gets to be frighteningly good, like GPT-4, which I've been playing with so much since it came out, right?

[331] And there's still some debate about whether that can get you all the way to full human level or not. But yeah, we can come back to the details of that, and how you might get to human-level AI even if large language models don't. Can you briefly, just as a small tangent, comment on your feelings about GPT-4? You suggest that you're impressed by this rate of progress, but where is it? Can GPT-4 reason? What are, like, the intuitions? What are human-interpretable words you can assign to the capabilities of GPT-4 that make you so damn impressed with it?

[332] I'm both very excited about it and terrified.

[333] It's an interesting mixture of emotions.

[334] All the best things in life include those two somehow.

[335] Yeah, I can absolutely reason.

[336] Anyone who hasn't played with it, I highly recommend doing that before dissing it.

[337] It can do quite remarkable reasoning.

[338] I've had it do a lot of things which I realized I couldn't do

[339] that well myself, even. And obviously it does it dramatically faster than we do too, when you watch it type. And it's doing that while servicing a massive number of other humans at the same time. At the same time, it cannot reason as well as a human can on some tasks, just because it's obviously a limitation of its architecture. You know, we have in our heads what in geek speak is called a recurrent neural network. There are loops. Information can go from this neuron to this neuron and then back to this one. You can, like, ruminate on something for a while. You can self-reflect a lot. These large language models, they cannot, like GPT-4. It's a so-called transformer, where it's just like a one-way street of information, basically. In geek speak, it's called a feed-forward neural network. And it's only so deep, so it can only do logic that's that many steps and that deep. And you can create problems which it will fail to solve for that reason.

[340] But the fact that it can do so amazing things with this incredibly simple architecture already is quite stunning.
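
(For readers who want the idea above in code: here is a minimal, hypothetical sketch in PyTorch of the "incredibly dumb" objective Max describes, a tiny transformer with a causal mask, so information only flows one way, trained to predict the next token of text. The model name, sizes, and training text are toy assumptions for illustration, not GPT-4's actual architecture or code.)

```python
# Toy sketch (assumed names and sizes, not any lab's real model): a tiny causal
# transformer trained to predict the next byte of text.
import torch
import torch.nn as nn

class TinyLM(nn.Module):
    def __init__(self, vocab_size=256, d_model=128, n_heads=4, n_layers=2, max_len=64):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)    # token embeddings
        self.pos = nn.Embedding(max_len, d_model)         # learned position embeddings
        layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        self.blocks = nn.TransformerEncoder(layer, n_layers)
        self.head = nn.Linear(d_model, vocab_size)        # next-token logits

    def forward(self, tokens):
        seq_len = tokens.size(1)
        pos = torch.arange(seq_len, device=tokens.device)
        x = self.embed(tokens) + self.pos(pos)
        # Causal mask: the "one-way street" of information, each position
        # may only attend to earlier positions.
        mask = torch.triu(torch.full((seq_len, seq_len), float("-inf")), diagonal=1)
        return self.head(self.blocks(x, mask=mask))

# "Read a gigantic amount of text and predict the next word": here, a tiny
# amount of text and the next byte, the same idea in miniature.
text = b"the eiffel tower is in paris. " * 200
data = torch.tensor(list(text), dtype=torch.long)
model = TinyLM()
opt = torch.optim.Adam(model.parameters(), lr=3e-4)
for step in range(200):
    starts = torch.randint(0, len(data) - 65, (16,)).tolist()
    batch = torch.stack([data[j:j + 64] for j in starts])        # input tokens
    target = torch.stack([data[j + 1:j + 65] for j in starts])   # same text shifted by one
    loss = nn.functional.cross_entropy(
        model(batch).reshape(-1, 256), target.reshape(-1))
    opt.zero_grad(); loss.backward(); opt.step()
```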

[341] And what we see in my lab at MIT, when we look inside large language models to try to figure out how they're doing it, that's the key core focus of our research.

[342] It's called mechanistic interpretability in geek

[343] speak. You know, you have this machine that does something smart, and you try to reverse engineer it, see how it does it. I think of it also as artificial neuroscience. That's exactly what neuroscientists do with actual brains. But here you have the advantage that you don't have to worry about measurement errors. You can see what every neuron is doing all the time. And a recurring thing we see again and again, there have been a number of beautiful papers quite recently by a lot of researchers, some of them here, I mean, in this area, is that when they figure out how something is done, you can say, oh man, that's such a dumb way of doing it.

[344] And you immediately see how it can be improved.

[345] Like, for example, there was a beautiful paper recently where they figured out how a large language model stores certain facts, like Eiffel Tower is in Paris, and they figured out exactly how it's stored.

[346] The proof that they understood it was they could edit it.

[347] They changed some of the synapses in it, and then they asked it, where is the Eiffel Tower? And it said, it's in Rome.

[348] And then they asked it, how do you get there?

[349] Oh, how do you get there from Germany?

[350] Oh, you take this train to Roma Termini train station and this and that, and what might you see if you're in front of it?

[351] Oh, you might see the Coliseum.

[352] So they had edited it.

[353] So they literally moved it to Rome.

[354] But the way it's storing this information, it's incredibly dumb. For any fellow nerds listening to this, there was a big matrix, and roughly speaking, there are certain row and column vectors which encode these things, and they correspond very hand-wavily to principal components. And it would be much more efficient for a sparse matrix to just store it in a database, you know. And everything, so far, that we've figured out about how these things do things, are ways you can see can easily be improved.
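
(A toy illustration of the kind of fact-editing experiment described above, in the spirit of rank-one model-editing work such as ROME, though this is not that paper's code: a single weight matrix stores a key-value association, the key for "Eiffel Tower" maps to a value that decodes to "Paris", and a rank-one update overwrites it so the same key maps to "Rome". All vectors here are invented for the sketch.)

```python
# Toy sketch of "editing a fact" stored in one weight matrix; invented vectors,
# purely to show how a rank-one key-value association can be stored and swapped.
import numpy as np

rng = np.random.default_rng(0)
d = 64
key_eiffel = rng.normal(size=d)   # hidden vector standing in for "Eiffel Tower"
val_paris = rng.normal(size=d)    # vector that would decode to "Paris"
val_rome = rng.normal(size=d)     # vector that would decode to "Rome"

# Store the association "Eiffel Tower -> Paris" as a rank-one outer product.
W = np.outer(val_paris, key_eiffel) / key_eiffel.dot(key_eiffel)
print(np.allclose(W @ key_eiffel, val_paris))   # True: the fact is recalled

# Edit the fact: a rank-one update swaps the stored value for this key.
W += np.outer(val_rome - val_paris, key_eiffel) / key_eiffel.dot(key_eiffel)
print(np.allclose(W @ key_eiffel, val_rome))    # True: Eiffel Tower is now "in Rome"
```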

[355] And the fact that this particular architecture has some roadblocks built into it is in no way going to prevent crafty researchers from quickly finding workarounds and making other kinds of architectures sort of go all the way. So, in short, it's turned out to be a lot easier to build close-to-human intelligence than we thought, and that means our runway as a species to get our shit together has shortened. And it seems like the scary thing about the effectiveness of large language models, so Sam Altman, I recently had a conversation with, and he really showed that the leap from GPT-3 to GPT-4 has to do with just a bunch of hacks, a bunch of little explorations, with smart researchers doing a few little fixes here and there.

[356] It's not some fundamental leap and transformation in the architecture.

[357] And more data and more compute.

[358] And more data and compute, but he said the big leaps have to do with not the data and the compute, but just learning this new discipline, just like you said.

[359] So researchers are going to look at these architectures, and there might be big leaps where you realize, wait, why are we doing this in this dumb way?

[360] And all of a sudden, this model is 10x smarter.

[361] And that can happen on any one day, on any one Tuesday or Wednesday afternoon.

[362] and then all of a sudden you have a system that's 10x smarter.

[363] It seems like it's such a new discipline.

[364] It's such a new...

[365] Like we understand so little about why this thing works so damn well that the linear improvement of compute or exponential, but the steady improvement of compute, steady improvement of the data may not be the thing that even leads to the next leap.

[366] It could be a surprise little hack that improves everything.

[367] Or a lot of little leaps here and there, because so much of this is out in the open also. So many smart people are looking at this and trying to figure out little leaps here and there, and it becomes this sort of collective race where a lot of people feel, if I don't take the leap, someone else will. And it's actually very crucial for the other part of it, why we want to slow this down. So again, what this open letter is calling for is just pausing all training of systems that are more powerful than GPT-4 for six months. Give a chance for the labs to coordinate a bit on safety, and for society to adapt. Give the right incentives to the labs.

[368] Because, you know, you've interviewed a lot of these people who lead these labs.

[369] And you know, just as well as I do, that they're good people.

[370] They're idealistic people.

[371] They're doing this first and foremost because they believe that AI has a huge potential to help humanity.

[372] But at the same time, they are trapped

[373] in this horrible race to the bottom.

[374] Have you read Meditations on Moloch by Scott Alexander?

[375] Yes.

[376] Yeah, it's a beautiful essay on this poem by Ginsberg, where he interprets it as being about this monster.

[377] It's this game theory monster that pits people against each other in this race to the bottom where everybody ultimately loses.

[378] Yes.

[379] And the evil thing about this monster is, even though everybody sees it and understands, they still can't get out of the race, right?

[380] A good fraction of all the bad things that we humans do are caused by Moloch.

[381] And I like Scott Alexander's naming of the monster so we can, we humans can think of it as a thing.

[382] If you look at why do we have overfishing, why do we have more generally the tragedy of the comments.

[383] Why is it that, Liv Boeree, I don't know if you've had her on your podcast?

[384] Yeah, she's become a friend, yeah.

[385] Great.

[386] She made this awesome point recently, that beauty filters that a lot of female influencers feel pressured to use are exactly Moloch in action again.

[387] First, nobody was using them, and people saw them just the way they were. And then some of them started using it, becoming ever more plastic fantastic. And then the other ones that weren't using it started to realize that if they want to just keep their market share, they have to start using it too. And then you're in the situation where they're all using it, and none of them has any more market share, or less, than before. So nobody gained anything, everybody lost, and they have to keep becoming ever more plastic fantastic also, right? But nobody can go back to the old way, because it's just too costly, right?

[388] Moloch is everywhere, and Moloch is not a new arrival on the scene either.

[389] We humans have developed a lot of collaboration mechanisms to help us fight back against Moloch through various kinds of constructive collaboration.

[390] The Soviet Union and the United States did sign a number of arms control treaties against Moloch, who was trying to stoke them into unnecessarily risky nuclear arms races, et cetera, et cetera.

[391] And this is exactly what's happening on the AI front.

[392] This time, it's a little bit geopolitics, but it's mostly money where there's just so much commercial pressure.

[393] You know, if you take any of these leaders of the top tech companies, if they just say, you know, this is too risky, I want to pause for six months, they're going to get a lot of pressure.

[394] from shareholders and others.

[395] Who are like, well, you know, if you pause, but those guys don't pause, we don't want to get our lunch eaten.

[396] And shareholders even have the power to replace the executives in the worst case, right?

[397] So we did this open letter because we want to help these idealistic tech executives to do what their heart tells them, by providing enough public pressure on the whole sector to just pause, so that they can all pause in a coordinated fashion. And I think without the public pressure, none of them can do it alone, push back against their shareholders, no matter how good-hearted they are, because Moloch is a really powerful foe. So the idea is for the major developers of AI systems like this, so we're talking about Microsoft, Google, Meta, and anyone else. OpenAI is very close with Microsoft now, of course, and there are plenty of smaller players. For example, Anthropic is very impressive. There's Conjecture. There's many, many players. I don't want to make a long list and leave anyone out. And for that reason, it's so important that some coordination happens, that there's external pressure on all of them saying, you all need to pause. Because then the researchers in these organizations, the leaders who want to slow down a little bit, they can say to their shareholders, you know, everybody's slowing down because of this pressure, and it's the right thing to do. Have you seen, in history, examples where it's possible to pause Moloch? Absolutely. Even like human cloning, for example. You could make so much money on human cloning. Why aren't we doing it? Because biologists thought hard about this and felt like, this is way too risky.

[398] They got together in the 70s at Asilomar and decided even to stop a lot more stuff also, just editing the human germline, gene editing that goes into our offspring, and decided, let's not do this, because it's too unpredictable what it's going to lead to.

[399] We could lose control over what happens to our species.

[400] So they paused.

[401] There was a ton of money to be made there.

[402] So it's very doable.

[403] But you just need a public awareness of what the risks are.

[404] And the broader community coming in and saying, hey, let's slow down.

[405] And, you know, another common pushback I get today is we can't stop in the West because China.

[406] And in China, undoubtedly, they

[407] also get told, we can't slow down because of the West, because both sides think they're the good guy.

[408] But look at human cloning, you know.

[409] Did China forge ahead with human cloning?

[410] There's been exactly one human cloning that's actually been done that I know of.

[411] It was done by a Chinese guy.

[412] Do you know where he is now?

[413] Right.

[414] In jail.

[415] And you know who put him there?

[416] Who?

[417] Chinese government.

[418] Not because Westerners said, China, look, this is the Chinese government.

[419] No, the Chinese government put him there, because they also like control, the Chinese government.

[420] If anything, maybe they are even more concerned about having control than Western governments. They have no incentive of just losing control over where everything is going.

[421] And you can also see the Ernie Bot that was released by, I believe, Baidu recently.

[422] They got a lot of pushback from the government and had to rein it in in a big way.

[423] I think once this basic message comes out, that this isn't an arms race, it's a suicide race, where everybody loses if anybody's AI goes out of control.

[424] It really changes the whole dynamic.

[425] And I'll say this again, because this is a very basic point

[426] I think a lot of people get wrong.

[427] Because a lot of people dismiss the whole idea that AI can really get very superhuman, because they think there's something really magical about intelligence, such that it can only exist in human minds, you know. Because they believe that, they think it's kind of going to get to just more or less GPT-4 plus plus, and then that's it.

[428] They don't see it as a suicide race.

[429] They think whoever gets that first, they're going to control the world, they're going to win.

[430] That's not how it's going to be.

[431] And we can talk again about the scientific arguments from why it's not going to stop there.

[432] But the way it's going to be is, if anybody completely loses control. And, you know, you don't care, if someone manages to take over the world who really doesn't share your goals, you probably don't really even care very much about what nationality they have. You're not going to like it. It's much worse than today. If you live in an Orwellian dystopia, what do you care who created it, right? And if it goes farther and we just lose control even to the machines, so that it's not us versus them, it's us versus it?

[433] What do you care who created this, this unaligned entity, which has goals different from humans ultimately, and we get marginalized, we get made obsolete, we get replaced.

[434] That's what I mean when I say it's a suicide race.

[435] It's kind of like we're rushing towards this cliff, but the closer to the cliff we get, the more scenic the views are, and the more money there is there.

[436] So we keep going.

[437] But we have to also stop at some point, right?

[438] Quit while we're ahead.

[439] And it's a suicide race, which cannot be won.

[440] But the way to really benefit from it is to continue developing awesome AI a little bit slower.

[441] So we make it safe, make sure it does the things that humans want, and create a condition where everybody wins.

[442] Technology has shown us that, you know, geopolitics and, you know, politics in general, it's not a zero-sum game at all.

[443] So there is some rate of development that will lead us as a human species to lose control of this thing.

[444] And the hope you have is that there's some lower level of development, which will not allow us to lose control.

[445] This is an interesting thought you have about losing control.

[446] So if you have somebody, if you are somebody like Sundar Pichai or Sam Altman at the head of a company like this, you're saying if they develop an AGI, they too will lose control of it.

[447] So no one person can maintain control, no group of individuals can maintain control.

[448] If it's created very, very soon, and is a big black box that we don't understand, like the large language models, yeah, then I'm very confident they're going to lose control.

[449] But this isn't just me saying it. You know, Sam Altman and Demis Hassabis have both said, themselves acknowledged, that, you know, there are really great risks with this, and they want to slow down once they feel it gets scary.

[450] But it's clear that they're stuck in this. Again, Moloch is forcing them to go a little faster than they're comfortable with, because of pressure from, just, commercial pressures, right? To get a bit optimistic here, of course this is a problem that can be ultimately solved. It's just, to win this wisdom race, it's clear that what we hoped was going to happen hasn't happened. The capability progress has gone faster than a lot of people thought, and the progress in the public sphere of policymaking and so on has gone slower than we thought.

[451] Even the technical AI safety has gone slower.

[452] A lot of the technical safety research was kind of banking on that large language models and other poorly understood systems couldn't get us all the way, that you had to build more of a kind of intelligence that you could understand.

[453] Maybe it could prove itself safe, things like this.

[454] And I'm quite confident that this can be done, so we can reap all

[455] the benefits. But we cannot do it as quickly as this out-of-control express train we are on now is going to get to AGI. That's why we need a little more time, I feel. Is there something to be said, like what Sam Altman talked about, which is, while we're in the pre-AGI stage, to release often and as transparently as possible, to learn a lot? So as opposed to being extremely cautious, release a lot. Don't invest in closed development where you focus on AI safety.

[456] While it's somewhat dumb, quote unquote, release as often as possible.

[457] And as you start to see signs of human-level intelligence or superhuman-level intelligence, then you put a halt on it.

[458] Well, what a lot of safety researchers have been saying for many years is that the most dangerous things you can do with an AI is, first of all, teach it to write code, because that's the first step towards recursive self-improvement, which can take it from AGI to much higher levels.

[459] Okay, oops, we've done that.

[460] And another thing, high risk, is connect it to the internet, let it go to websites, download stuff on its own, talk to people.

[461] Oops, we've done that already.

[462] You know, Eliezer Yudkowsky, you said you interviewed him recently, right?

[463] Yes, yes.

[464] So he had this tweet recently, which gave me one of the best laughs in a while, where he's like, hey, people used to make fun of me and say, you're so stupid, Eliezer, because you're saying, you're saying, you have to worry.

[465] Obviously, developers, once they get to, like, really strong AI, the first thing they're going to do is, like, never connect it to the internet. Keep it in a box where you can really study it.

[466] So he had written it in the, like, in the meme form.

[467] So it's like, then.

[468] Yeah.

[469] And then that.

[470] Now, LOL, let's make a chatbot.

[471] And the third thing, Stuart Russell, you know, amazing AI researcher, he has argued for a while that we should never teach AI anything about humans.

[472] Above all, we should never let it learn about human psychology and how you manipulate humans.

[473] That's the most dangerous kind of knowledge you can give it.

[474] Yeah, you can teach it all that needs to know about how to cure cancer and stuff like that, but don't let it read Daniel Kahneman's book about cognitive biases and all that.

[475] And then, oops, LOL, you know, let's invent social media recommender algorithms, which do exactly that.

[476] They get so good at knowing us and pressing our buttons that we're starting to create a world now where we just have ever more hatred. Because these algorithms figured out, not out of evil, but just to make money on advertising, that the best way to get more engagement, the euphemism, to get people glued to their little rectangles, is just to make them pissed off.

[477] That's really interesting that a large AI system that's doing the recommender system kind of task on social media that is basically just studying human beings because it's a bunch of us rats giving it signal, nonstop signal.

[478] It'll show a thing, and then we give a signal as to whether we spread that thing, we like that thing, whether that thing increases our engagement, gets us to return to the platform. And it has that at the scale of hundreds of millions of people, constantly.

[479] So it's just learning and learning and learning.

[480] And presumably, the larger the number of parameters in the neural network that's doing the learning, and the more end-to-end the learning is, the more it's able to just basically encode how to manipulate human behavior, how to control humans at scale.
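
(A minimal, hypothetical sketch of the dynamic Lex describes: treat the recommender as a simple bandit that learns, purely from engagement signals, which kind of content to show. The content categories and the assumption that outrage-style content engages most are made up for illustration, not data.)

```python
# Toy engagement-maximizing recommender as an epsilon-greedy bandit.
# Categories and engagement rates are invented assumptions; the point is only
# that the system, given click/share signals alone, converges on whatever engages most.
import random

categories = ["cute_animals", "news", "outrage_bait"]
true_engagement = {"cute_animals": 0.30, "news": 0.20, "outrage_bait": 0.55}  # assumed

estimates = {c: 0.0 for c in categories}   # learned engagement estimates
counts = {c: 0 for c in categories}

for step in range(50_000):
    # Epsilon-greedy: mostly show whatever currently looks most engaging.
    if random.random() < 0.1:
        shown = random.choice(categories)
    else:
        shown = max(categories, key=lambda c: estimates[c])
    engaged = random.random() < true_engagement[shown]   # the user's "signal"
    counts[shown] += 1
    estimates[shown] += (engaged - estimates[shown]) / counts[shown]

print(max(categories, key=lambda c: estimates[c]))  # tends to print "outrage_bait"
```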

[481] Exactly.

[482] And that is not something you think is in humanity's interest.

[483] Right now, it's mainly letting some humans manipulate other humans for profit and power, which has already caused a lot of damage, and eventually that's the sort of skill that can make AIs persuade humans to let them escape whatever safety precautions we had.

[484] But, you know, there was a really nice article in the New York Times recently by Yuval Noah Harari and two co -authors, including Tristan Harris from the Social Dilemma.

[485] And they have this phrase in there, I love, humanity's first contact with advanced AI was social media.

[486] And we lost that one.

[487] We now live in a country where there's much more hate, in a world where there's much more hate, in fact.

[488] And in our democracy, here where we're having this conversation, people can't even agree on who won the last election.

[489] And we humans often point fingers at other humans and say it's their fault.

[490] But it's really Moloch and these AI algorithms.

[491] We got the algorithms and then Moloch pitted the social media companies against each other.

[492] So nobody could have a less creepy algorithm because then they would lose out on revenue to the other company.

[493] Is there any way to win that battle back, just if we just linger on this one battle that we've lost in terms of social media?

[494] Is it possible to redesign social media, this very medium that we use as a civilization to communicate with each other, to have these kinds of conversations, to have discourse, to try to figure out how to solve the biggest problems in the world, whether that's nuclear war or the development of AGI?

[495] Is it possible to do social media correctly?

[496] I think it's not only possible, but it's necessary.

[497] Who are we kidding that we're going to be able to solve all these other challenges if we can't even have a conversation with each

[498] other,

[499] that's constructive?

[500] The whole idea, the key idea of democracy, is that you get a bunch of people together and they have a real conversation, the kind you try to foster on this podcast, where you respectfully listen to people you disagree with.

[501] And you realize, actually, you know, there are some things actually, some common ground we have and we both agree, let's not have a nuclear war, let's not do that, et cetera, et cetera.

[502] We're kidding ourselves, thinking we can face off the second contact with ever more powerful AI that's happening now with these large language models if we can't even have a functional conversation in the public space.

[503] That's why I started the Improve the News project, improvethenews.org.

[504] But I'm an optimist fundamentally in that there is a lot of intrinsic goodness in people.

[505] and that what makes the difference between someone doing good things for humanity and bad things is not some sort of fairy-tale thing, that this person was born with an evil gene and this one was born with a good gene. No, I think it's whether people find themselves in situations that bring out the best in them or that bring out the worst in them.

[506] and I feel we're building an internet and a society that brings out the worst in us.

[507] But it doesn't have to be that way.

[508] No, it does not.

[509] It's possible to create incentives, and also create incentives that make money.

[510] Incentives that both make money and bring out the best in people.

[511] I mean, in the long term, it's not a good investment for anyone, you know, to have a nuclear war, for example.

[512] And, you know, is it a good investment for humanity if we just ultimately replace all humans by machines and become so obsolete that eventually there are no humans left?

[513] Well, it depends on how you do the math.

[514] But I would say, by any reasonable economic standard, if you look at the future income of humans and there aren't any, that's not a good investment.

[515] Moreover, like, why can't we have a little bit of pride in our species?

[516] Damn it.

[517] You know, why should we just build another species that gets rid of us?

[518] If we were Neanderthals, Would we really consider it a smart move if we had really advanced biotech to build Homo sapiens?

[519] You might say, hey, Max, you know, yeah, let's build these Homo sapiens.

[520] They're going to be smarter than us.

[521] Maybe they can help us defend us better against predators and help fix up our caves, make them nicer.

[522] We'll control them undoubtedly, you know.

[523] So then they build a couple, a little baby girl, little baby boy.

[524] And then you have some wise old, the Neanderthal elder is like, hmm, I'm scared that we're opening a Pandora's box here and that we're going to get outsmarted by these super Neanderthal intelligences and there won't be any Neanderthals left.

[525] But then you have a bunch of others in the cave, right?

[526] Are you such a Luddite scaremonger?

[527] Of course they're going to want to keep us around, because we are their creators. And the smarter they get, I think, the nicer they're going to get. They're going to leave us be, they're going to want us around, and it's going to be fine.

[528] And besides, look at these babies.

[529] They're so cute.

[530] Clearly, they're totally harmless.

[531] Those babies are exactly GPT4.

[532] It's not, I want to be clear, it's not GPT4 that's terrifying.

[533] It's that GPT4 is a baby technology.

[534] You know, and Microsoft even had a paper recently out.

[535] with the title something like Sparks of AGI.

[536] Well, they were basically saying this is baby AI, like these little Neanderthal babies.

[537] And it's going to grow up.

[538] There's going to be other systems from the same company, from other companies.

[539] They'll be way more powerful, but they're going to take all the ideas from these babies.

[540] And before we know it, we're going to be like those last Neanderthals who are pretty disappointed.

[541] when they realized that they were getting replaced.

[542] Well, this interesting point you make, which is the programming, it's entirely possible that GPT4 is already the kind of system that can change everything by writing programs.

[543] Yeah, because it's Life 2.0. The systems I'm afraid of are going to look nothing like a large language model, and they're not going to be.

[544] But once it, or other people, figure out a way of using this tech to make much better tech, right?

[545] It's just constantly replacing its software.

[546] And from everything we've seen about how these work under the hood, they're like the minimum viable intelligence.

[547] They do everything in the dumbest way that still works, sort of.

[548] Yeah.

[549] And so they are Life 3.0, except when they replace their software, it's a lot faster than when you decide to learn Swedish.

[550] And moreover, they think a lot faster than us too. We don't, you know, have one logical step every nanosecond or so the way they do. And we can't just suddenly scale up our hardware massively in the cloud, we're so limited, right? So they can also soon become a little bit more like Life 3.0 in that, if they need more hardware, hey, just rent it in the cloud, you know.

[551] How do you pay for it?

[552] Well, with all the services you provide.

[553] And what we haven't seen yet, which could change a lot, is an entire software system.

[554] So right now, programming is done sort of in bits and pieces, as an assistant tool to humans. I do a lot of programming, and with the kind of stuff that GPT4 is able to do, I mean, it's replacing a lot of what I'm able to do. But you still need a human in the loop to kind of manage the design of things, manage what the prompts are that generate the kind of stuff you want, to do some basic adjustment of the code, to do some debugging. But if it's possible to add on top of GPT4 a kind of feedback loop of self-

[555] debugging, improving the code, and then you launch that system out into the wild on the internet, because everything is connected, and have it do things, have it interact with humans, and then get that feedback.
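To make the loop being described here concrete, here is a minimal sketch in Python; call_model and run_tests are hypothetical placeholders standing in for a model endpoint and a sandboxed test runner, not any real vendor's API, so this only illustrates the shape of such a system.

```python
# Hypothetical sketch of the self-debugging feedback loop described above.
# call_model() and run_tests() are placeholders, not a real vendor API.

def call_model(prompt: str) -> str:
    """Stand-in for a call to a large language model; returns generated code."""
    raise NotImplementedError("wire this to whatever model endpoint you use")

def run_tests(code: str) -> list:
    """Stand-in for executing a sandboxed test suite; returns error messages."""
    raise NotImplementedError("wire this to a sandboxed test runner")

def self_improve(spec: str, max_rounds: int = 5) -> str:
    """Generate code for `spec`, then repeatedly feed failures back to the model."""
    code = call_model(f"Write code that does the following:\n{spec}")
    for _ in range(max_rounds):
        errors = run_tests(code)
        if not errors:
            break  # the loop closes without a human once the tests pass
        code = call_model(
            f"This code failed with errors {errors}. Fix it so it satisfies:\n{spec}\n\n{code}"
        )
    return code
```

The point of the exchange is that once a loop like this runs unsupervised, the human who used to sit between "generate" and "debug" is no longer in the picture.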

[556] Now you have this giant ecosystem of humans.

[557] That's one of the things that Elon Musk recently sort of tweeted as a case why everyone needs to pay $7 or whatever for Twitter.

[558] To make sure they're real.

[559] Make sure they're real.

[560] We're now going to be living in a world where the bots are getting smarter and smarter and smarter to a degree where you can't tell the difference between a human and a bot.

[561] That's right.

[562] And now you can have bots outnumber humans by a million to one, which is why he's making the case that you have to pay to prove you're human, which is one of the only mechanisms to prove it, which is depressing.

[563] And I feel we have to remember, as individuals, we should,

[564] from time to time, ask ourselves, why are we doing what we're doing?

[565] And as a species, we need to do that too.

[566] So if we're building, as you say, machines that are outnumbering us and more and more outsmarting us and replacing us on the job market, not just for the dangerous and boring tasks, but also for writing poems and doing art and things that a lot of people find really meaningful, we've got to ask ourselves, why?

[567] Why are we doing this?

[568] The answer is that Moloch is tricking us into doing it.

[569] And it's such a clever trick that even though we see the trick, we still have no choice but to fall for it, right?

[570] Also, the thing you said about you using Copilot AI tools to program faster.

[571] How many times, what factor faster would you say you code now?

[572] Does it go twice as fast?

[573] I don't really know, because it's such a new tool.

[574] Yeah.

[575] I don't know if speed is significantly improved, but it feels like I'm a year away from being five to ten times faster.

[576] So if that's typical for programmers, then you're already seeing another kind of recursive self -improvement, right?

[577] Because previously, like, a major generation of improvement of the code would happen on the human R&D timescale.

[578] And now if that's five times shorter, then it's going to take five times less time than it otherwise would to develop the next level of these tools, and so on.

[579] So this is exactly the sort of beginning of an intelligence explosion.
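A back-of-the-envelope way to see why that compounds (illustrative numbers of my own, not Max's): if a tool generation normally takes time T to develop, and each generation makes building the next one k times faster, the total time for all future generations is bounded by a convergent geometric series.

```latex
T_{\text{total}} \;=\; T + \frac{T}{k} + \frac{T}{k^{2}} + \cdots \;=\; \frac{T}{1 - 1/k},
\qquad k > 1 .
```

With k = 5, every generation after the current one fits into an extra T/4 of calendar time, which is the "explosion" flavor being described.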

[580] There can be humans in the loop a lot in the early stages, and then eventually humans are needed less and less, and the machines can more kind of go alone.

[581] But what you said there is just an exact example of these sort of things.

[582] Another thing, which, I'm kind of imagining I'm lying on a psychiatrist's couch here, saying what my fears are about what people would do with AI systems.

[583] So I mentioned three that I had fears about many years ago that they would do: namely, teach it to code, connect it to the internet, and teach it to manipulate humans.

[584] A fourth one is building an API where code can control this super powerful thing, right?

[585] That's very unfortunate.

[586] Because one thing that systems like GPT4 have going for them is that they are an oracle, in the sense that they just answer questions. There is no robot connected to GPT4. GPT4 can't go and do stock trading based on its thinking.

[587] It is not an agent.

[588] An intelligent agent is something that takes in information from the world, processes it to figure out what action to take based on the goals that it has, and then does something back on the world.

[589] But once you have an API, for example to GPT4, nothing stops Joe Schmoe and a lot of other people from building real agents, which just keep making calls in some inner loop somewhere to these powerful oracle systems, which makes them much more powerful.

[590] That's another kind of unfortunate development, which I think we would have been better off delaying.

[591] I don't want to pick on any particular companies.

[592] I think they're all under a lot of pressure

[593] to make money.

[594] And again, the reason we're calling for this pause is to give them all cover to do what they know is the right thing, slow down a little bit at this point.

[595] But everything we've talked about, I hope we'll make it clear to people watching this, why these sort of human level tools can cause gradual acceleration.

[596] You keep using yesterday's technology to build tomorrow's technology.

[597] and when you do that over and over again, you naturally get an explosion.

[598] That's the definition of an explosion in science, right?

[599] Like if you have two people and they fall in love, now you have four people, and then they can make more babies, and now you have eight people, and then you have 16, 32, 64, et cetera. We call that a population explosion. And if it's instead free neutrons in a nuclear reaction, where each one can make more than one, then you get exponential growth in that.

[600] We call it a nuclear explosion.

[601] All explosions are like that.

[602] And an intelligence explosion, it's just exactly the same principle, that some quantity, some amount of intelligence can make more intelligence than that, and then repeat.

[603] You always get exponentials.
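Stated as one line of standard math (added for clarity, not a quote from the conversation): an explosion in this sense is just any quantity where each unit produces more than one new unit per generation.

```latex
x_{n+1} = k\,x_{n} \;\Longrightarrow\; x_{n} = k^{\,n} x_{0},
\qquad k > 1 \quad \text{(people, neutrons, or units of intelligence alike)}.
```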

[604] What's your intuition why it does?

[605] You mentioned there's some technical reasons why it doesn't stop at a certain point.

[606] What's your intuition?

[607] And do you have any intuition for why it might stop?

[608] It's obviously going to stop when it bumps up against the laws of physics.

[609] There are some things you just can't do no matter how smart you are, right?

[610] Allegedly.

[611] Because we don't know the full laws of physics yet, right?

[612] Seth Lloyd wrote a really cool paper on the physical limits on computation, for example.

[613] If you put too much energy into a finite space, it'll turn into a black hole.

[614] You can't move information around faster than the speed of light, stuff like that.

[615] but it's hard to store way more than a modest number of bits per atom, et cetera.

[616] But, you know, those limits are just astronomically above, like 30 orders of magnitude above where we are now.
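For reference, the kind of limit Seth Lloyd's paper works out (numbers from the physics literature, quoted from memory, so treat them as approximate): the Margolus-Levitin bound caps the rate of elementary operations by the available energy, and for a kilogram of matter that is roughly fifty orders of magnitude of operations per second.

```latex
\nu_{\max} \;\le\; \frac{2E}{\pi\hbar}
\;\approx\; \frac{2\,(1\,\mathrm{kg})\,c^{2}}{\pi\hbar}
\;\sim\; 10^{50}\ \text{operations per second},
```

compared with today's largest supercomputers at around 10^18 operations per second, which is where the roughly 30 orders of magnitude of headroom comes from.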

[617] So a bigger jump in intelligence than if you go from an ant to a human.

[618] I think, of course, what we want to do is have a

[619] controlled thing.

[620] A nuclear reactor, you put moderators in to make sure exactly it doesn't blow up out of control, right?

[621] When we do experiments with biology and cells and so on, you know, we also try to make sure it doesn't get out of control.

[622] We can do this with AI too.

[623] The thing is, we haven't succeeded yet.

[624] And Moloch is exactly doing the opposite, just fueling, just egging everybody on: faster, faster, faster, or the other company is going to catch up with you, or the other country is going to catch up with you.

[625] We have to want this stuff.

[626] And I don't believe in just asking people to look into their hearts and do the right thing.

[627] It's easy for others to say that, but if you're in a situation where your company is going to get screwed by other companies that are not stopping, you're putting people in a very hard situation. The right thing to do is change the whole incentive structure instead.

[628] And this is not an old...

[629] Maybe I should say one more thing about this, because Mollock has been around as humanity's number one or number two enemy since the beginning of civilization.

[630] And we came up with some really cool countermeasures.

[631] First of all, already over 100 ,000 years ago, evolution realized that it was very unhelpful that people kept killing each other all the time.

[632] So it genetically gave us compassion and made it so that if you get two drunk dudes getting into a pointless bar fight, they might give each other black eyes, but they have a lot of inhibition towards just killing each other.

[633] And similarly, if you find a baby lying on the street when you go out for your morning jog tomorrow, you're going to stop and pick it up, right?

[634] even though it may make you late for your next podcast.

[635] So evolution gave us these genes that make our own egoistic incentives more aligned with what's good for the greater group we're part of, right?

[636] And then as we got a bit more sophisticated and developed language, we invented gossip, which is also a fantastic anti -moloch, right?

[637] Because now it really discourages liars, moochers, cheaters.

[638] Because their own incentive now is not to do this because word quickly gets around and then suddenly people aren't going to invite them to their dinners anymore or trust them.

[639] And then when we got still more sophisticated and had bigger societies, you know, we invented the legal system, where even strangers, who couldn't rely on gossip and things like this, would have an incentive to treat each other well.

[640] Now those guys in the bar fights, even if someone is so drunk that he actually wants to kill the other guy, he also has a little thought in the back of his head that, you know, do I really want to spend the next 10 years eating like really crappy food in a small room?

[641] I'm just going to chill out, you know.

[642] And we similarly have tried to give these incentives to our corporations by having regulation and all sorts of oversight.

[643] so that their incentives are aligned with the greater good.

[644] We tried really hard.

[645] And the big problem that we're failing now is not that we haven't tried before, but it's just that the tech is growing much, is developing much faster than the regulators have been able to keep up, right?

[646] So regulators, it's kind of comical, the European Union right now is doing this AI act, right?

[647] And in the beginning, they had a little opt -out exception that GPT4 would be completely excluded from regulation.

[648] Brilliant idea.

[649] What's the logic behind that?

[650] Some lobbyists pushed successfully for this.

[651] So we were actually quite involved with the Future of Life Institute, Mark Brakel, Risto Uuk, Anthony Aguirre and others.

[652] We were quite involved with educating various people involved in this process about these general-purpose AI models that were coming, and pointing out that they would become the laughingstock if they didn't put them in.

[653] So the French started pushing for it.

[654] It got put in to the draft and it looked like all was good.

[655] Then there was a huge counterpush from lobbyists.

[656] There were more lobbyists in Brussels from tech companies than from oil companies, for example.

[657] And it looked like it might maybe get taken out again.

[658] And now GPT4 happened.

[659] and I think it's going to stay in.

[660] But this just shows, you know, Moloch can be defeated, but the challenge you're facing is that the tech is generally moving much faster than the policymakers are.

[661] And a lot of the policymakers also don't have a tech background.

[662] So, you know, we really need to work hard to educate them on what's taking place here.

[663] So we're getting into the situation where, you know, I define artificial intelligence just as non-biological intelligence, right? And by that definition, a company, a corporation, is also an artificial intelligence, because the corporation isn't its humans, it's the system. If the CEO of a tobacco company decides one morning that he doesn't want to sell cigarettes anymore, they'll just put another CEO in there. So it's not enough to align the incentives of individual people, or to align individual computers' incentives to their owners, which is what technical AI safety research is about.

[664] You also have to align the incentives of corporations with the greater good.

[665] And some corporations have gotten so big and so powerful very quickly that in many cases their lobbyists instead align the regulators to what they want rather than the other way around.

[666] It's a classic regulatory capture.

[667] Right.

[668] Is the thing that the slowdown hopes to achieve to give enough time to the regulators to catch up, or enough time to the companies themselves to breathe and understand how to do AI safety correctly?

[669] I think both.

[670] But I think that the vision, the path to success, I see, is first you give a breather actually to the people in these companies.

[671] Their leadership, who want to do the right thing, and they all have safety teams and so on at their companies, give them a chance to get together with the other companies, and the outside pressure can also help catalyze that, and work out what are the reasonable safety requirements one should put on future systems before they get rolled out. There are a lot of people also in academia and elsewhere, outside of these companies, who can be brought into this and have a lot of very good ideas. And then I think it's very realistic that within six months you can get these people coming up with, here's a white paper, here's what we all think is reasonable. You know, just because cars killed a lot of people, they didn't ban cars, but they got together a bunch of people and decided that, in order to be allowed to sell a car, it has to have a seatbelt in it. There are analogous things that you can start requiring of future AI systems so that they are safe.

[672] And once this heavy lifting, this intellectual work has been done by experts in the field, which can be done quickly, I think it's going to be quite easy to get policymakers to see, yeah, this is a good idea.

[673] And, you know, for the companies, to fight Moloch, they want, and I believe Sam Altman has explicitly called for this, they want the regulators to actually adopt it, so that their competition is going to abide by it too, right?

[674] You don't want, you don't want to be enacting all these principles, then you abide by them, and then there's this one little company that doesn't sign on to it, and then now they can gradually overtake you.

[675] Then the companies will get, be able to sleep secure, knowing that everybody's playing by the same rules.

[676] So do you think it's possible to develop guardrails?

[677] that keep the systems from basically damaging irreparably humanity while still enabling sort of the capitalist -fueled competition between companies as they develop how to best make money with this AI.

[678] You think there's a balancing...

[679] Totally.

[680] That's possible.

[681] Absolutely.

[682] We've seen that in many other sectors where you've had the free market produce quite good things without causing particular harm.

[683] When the guardrails are there and they work, you know, capitalism is a very good way of optimizing for just getting things done more efficiently. And, you know, in hindsight, I never met anyone, even in parties way over on the right in any country, who thinks it was a terrible idea to ban child labor, for example. Yeah, but it seems like this particular technology has gotten so good, so fast, become powerful to a degree where you could see, in the near term, the ability to make a lot of money.

[684] And to put guardrails in place, to develop guardrails quickly, in that kind of context seems to be tricky.

[685] It's not similar to cars or child labor.

[686] It seems like the opportunity to make a lot of money here very quickly is right here before us.

[687] Again, there's this cliff.

[688] Yeah.

[689] It gets quite scenic

[690] the closer to the cliff you go.

[691] The more money there is, the more gold ingots there are on the ground you can pick up or whatever, so you want to drive there very fast.

[692] But it's not in anyone's incentive that we go over the cliff.

[693] And it's not like everybody's in their own car.

[694] All the cars are connected together with a chain.

[695] So if anyone goes over, they'll start dragging the others down too.

[696] And so ultimately, it's in the selfish interests, also of the people in the companies to slow down when you can start seeing the contours of the cliff there in front of you, right?

[697] And the problem is that even though the people who are building the technology and the CEOs, they really get it, the shareholders and these other market forces, they are people who don't, honestly, understand that the cliff is there.

[698] They usually don't, you have to get quite into the weeds to really appreciate how powerful this is and how fast.

[699] And a lot of people are even still stuck, again, in this idea, in this carbon chauvinism, as I like to call it, that you can only have our level of intelligence in humans, that there's something magical about it. Whereas the people in the tech companies who build this stuff, they all realize that intelligence is information processing of a certain kind, and it really doesn't matter at all whether the information is processed by carbon atoms in neurons in brains, or by silicon atoms in some technology we build.

[700] So you brought up capitalism earlier, and there are a lot of people who love capitalism and a lot of people who really, really don't.

[701] And it struck me recently that what's happening with capitalism here is exactly analogous to the way in which super intelligence might wipe us out.

[702] So, you know, I studied economics for my undergrad, Stockholm School of Economics, yay. Well, no, no, tell me. So I was very interested in how you could use market forces to just get stuff done more efficiently, but give the right incentives to the market so that it wouldn't do really bad things. So Dylan Hadfield-Menell, who's a professor and colleague of mine at MIT, wrote this really interesting paper with some collaborators recently, where they proved mathematically that if you just take one goal that you optimize for, on and on and on indefinitely, that you think is going to bring you in the right direction...

[703] What basically always happens is in the beginning, it will make things better for you.

[704] But if you keep going at some point that it's going to start making things worse for you again, and then gradually it's going to make it really, really terrible.

[705] So just as a simple example, the way I think of the proof is, suppose you want to go from here back to Austin, for example, and you're like, okay, yeah, let's just go south, but you put in roughly the right, sort of the right direction.

[706] Just optimize that.

[707] As south as possible.

[708] You get closer and closer to Austin, but there's always some little error, so you're not going exactly towards Austin, but you get pretty close.

[709] But eventually you start going away again, and eventually you're going to be leaving the solar system.
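A tiny numerical illustration of that argument (my own toy example, not the actual proof in the paper): head toward the goal along a direction that is off by only about three degrees, and the distance to the goal first shrinks, then grows without bound.

```python
import numpy as np

# Toy version of the "almost the right direction" argument: the direction we
# optimize along is ~3 degrees off the true target, and we never re-aim.
target = np.array([1000.0, 0.0])                     # where we actually want to go
direction = np.array([np.cos(0.05), np.sin(0.05)])   # slightly misaligned unit vector

for steps in (0, 500, 1000, 1500, 2000, 3000):
    position = steps * direction
    distance = np.linalg.norm(target - position)
    print(f"after {steps:4d} steps, distance to goal = {distance:8.1f}")
# The distance drops from 1000 to about 50, then climbs back up and keeps growing forever.
```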

[710] Yeah.

[711] And they proved, it's a beautiful mathematical proof.

[712] This happens generally, and this is very important for AI, because even though Stuart Russell has written a book and given a lot of talks on why it's a bad idea to have AI just blindly optimize something, that's what pretty much all our systems do.

[713] Yeah.

[714] We have something called a loss function that we're just minimizing, or a reward function we're just

[715] maximizing.

[716] And capitalism is exactly like that too. We wanted to get stuff done more efficiently, stuff that people wanted, so we introduced the free market. Things got done much more efficiently than they did in, say, communism, right? And it got better. But then it just kept optimizing and kept optimizing, and you got ever bigger companies and ever more efficient information processing, also very much powered by IT. And eventually a lot of people are beginning to feel, wait, we're kind of optimizing a bit too much. Like, why did we just chop down half the rainforest? And why did suddenly these regulators get captured by lobbyists and so on?

[717] It's just the same optimization that's been running for too long if you have an AI that actually has power over the world and you just give it one goal and just keep optimizing that.

[718] Most likely, everybody's going to be like, yay, this is great in the beginning.

[719] Things are getting better.

[720] But it's almost impossible to give it exactly the right direction to optimize in.

[721] And then eventually, all hell breaks loose, right?

[722] Nick Bostrom and others have given examples that sound quite silly.

[723] Like, what if you just, like, tell it to

[724] cure cancer or something, and that's all you tell it. Maybe it's going to decide to take over entire continents just so it can get more supercomputer facilities in there and figure out how to cure cancer. And then you're like, wait, that's not what I wanted, right? And the issue with capitalism and the issue with runaway AI have kind of merged now, because the Moloch I talked about is exactly the capitalist Moloch, in that we have built an economy that is optimizing for only one thing: profit.

[725] And that worked great back when things were very inefficient, and then now it's getting done better.

[726] And it worked great as long as the companies were small enough that they couldn't capture the regulators.

[727] But that's not true anymore, but they keep optimizing.

[728] And now they realize that these companies can make even more profit by building ever more powerful AI, even if it's reckless.

[729] but optimize more, more, more, more, more.

[730] So this is Moloch again showing up.

[731] And I just want to say to anyone here who has any concerns about late-stage capitalism having gone a little too far: you should worry about superintelligence, because it's the same villain in both cases.

[732] It's Moloch.

[733] And optimizing one objective function aggressively, blindly,

[734] is going to take us there.

[735] Yeah, we have to pause from time to time and look into our hearts and ask, why are we doing this?

[736] Is this, am I still going towards Austin or have I gone too far?

[737] Maybe we should change direction.

[738] And that is the idea behind a halt for six months.

[739] Why six months?

[740] It seems like a very short period.

[741] Can we just linger and explore different ideas here, because this feels like a really important moment in human history where a pause would actually have a significant positive effect?

[742] We said six months because we figured the number one pushback we were going to get in the West.

[743] It's like, but China.

[744] And everybody knows there's no way that China is going to catch up with the West on this in six months.

[745] So that argument goes off the table.

[746] And you can forget about geopolitical competition and just focus on the real issue.

[747] that's why we put this.

[748] That's really interesting.

[749] But you've already made the case that even for China, if you actually want to take on that argument, China too would not be bothered by a longer halt because they don't want to lose control, even more than the West doesn't.

[750] That's what I think.

[751] That's a really interesting argument.

[752] I have to actually really think about that. The kind of thing people assume is that if you develop an AGI, then OpenAI, if they're the ones that do it, for example, they're going to win.

[753] But you're saying, no, everybody loses.

[754] Yeah, it's going to get better and better and better, and then, boom, we all lose.

[755] That's what's going to happen.

[756] When you say lose and win, define it by a metric of, basically, quality of life for human civilization, and for Sam Altman?

[757] Both.

[758] To be blunt, my personal guess, you know, and people can quibble with this, is that we're just going to, there won't be any humans.

[759] That's it.

[760] That's what I mean by lose.

[761] You know, if you, we can see in history, once you have some species or some group of people who aren't needed anymore, doesn't usually work out so well for them, right?

[762] Yeah.

[763] There were a lot of horses that were used for traffic in Boston, and then the car got invented, and most of them got, you know, well, we don't need to go there.

[764] And if you look at humans, you know, right now, why did

[765] the labor movement succeed after the Industrial Revolution? Because it was needed.

[766] Even though we had a lot of Moloch and there was child labor and so on, the companies still needed to have workers, and that's why strikes had power and so on.

[767] If we get to the point where most humans aren't needed anymore, I think it's quite naive to think that they're going to still be treated well.

[768] You know, we say that.

[769] Yeah, yeah.

[770] everyone is equal, and the government will always protect them.

[771] But if you look in practice, groups that are very disenfranchised and don't have any actual power usually get screwed.

[772] And now in the beginning, so industrial revolution, we automated away muscle work.

[773] But that worked out pretty well eventually, because we educated ourselves and started working with our brains instead, and got usually more interesting,

[774] better-paid jobs.

[775] But now we're beginning to replace brain work.

[776] So we replaced a lot of boring stuff.

[777] Like, we got the pocket calculator, so you don't have people adding and multiplying numbers anymore at work.

[778] Fine.

[779] There were better jobs they could get.

[780] But now, GPT4 and Stable Diffusion and techniques like this, they're really beginning to blow away some jobs that people really love having.

[781] There was a heartbreaking article posted just yesterday on

[782] social media that I saw, about this guy who was doing 3D modeling for gaming, and all of a sudden now they got this new software, he just gives it prompts, and he feels his whole job that he loved lost its meaning, you know. And I asked GPT4 to rewrite Twinkle Twinkle Little Star in the style of Shakespeare. I couldn't have done such a good job. It was just really impressive. You've seen a lot of art coming out here, right?

[783] So I'm all for automating away the dangerous jobs and the boring jobs, but I think you hear some arguments which are too glib.

[784] Sometimes people say, well, that's all that's going to happen.

[785] We're getting rid of the boring, tedious, dangerous jobs.

[786] It's just not true.

[787] There are a lot of really interesting jobs that are being taken away now.

[788] Journalism is going to get crushed.

[789] Coding is going to get crushed.

[790] I predict the job market for programmers, salaries are going to start dropping.

[791] You know, if you said you can code five times faster, you know, then you need five times fewer programmers.

[792] Maybe there will be more output also, but you'll still end up using fewer programmers, needing fewer programmers than today.

[793] And I love coding.

[794] You know, I think it's super cool.

[795] So we need to stop and ask ourselves, why again, are we doing this as humans?

[796] I feel that AI should be built by humanity for humanity.

[797] And let's not forget that.

[798] It shouldn't be by Moloch for Moloch.

[799] What it really is now is kind of by humanity for Moloch, which doesn't make any sense.

[800] It's for us that we're doing it.

[801] And it would make a lot more sense if we figure out, gradually and safely, how to make all this tech, and then we think about what are the kinds of jobs that people really don't want to have, you know, and automate those away.

[802] And then we ask, what are the jobs that people really find meaning in, like maybe taking care of children in the daycare center, maybe doing art, et cetera, et cetera.

[803] And even if it were possible to automate that away, we don't need to do that, right?

[804] We built these machines.

[805] Well, it's possible that we redefine or rediscover what are the jobs that give us meaning.

[806] So for me, the thing, it is really sad.

[807] Like I, half the time I'm excited, half the time I'm crying as I'm generating code because I kind of love programming.

[808] It's an act of creation.

[809] You have an idea, you design it, and then you bring it to life, and it does something, especially if there's some intelligence, it does something.

[810] It doesn't even have to have intelligence.

[811] It's printing Hello World on screen.

[812] You made a little machine, and it comes to life.

[813] Yeah.

[814] And there's a bunch of tricks you learn along the way because you've been doing it for many, many years.

[815] And then to see AI, be able to generate all the tricks you thought were special.

[816] Yeah.

[817] I don't know, it's very, it's scary, it's almost painful.

[818] Like a loss of innocence, maybe. When I was younger, I remember, before I learned that sugar is bad for you and you should be on a diet.

[819] I remember I enjoyed candy deeply, in a way I just can't anymore now that I know it's bad for me. I enjoyed it unapologetically, fully, just intensely.

[820] And I just, I lost that.

[821] Now I feel like a little bit of that is lost for me with programming, or is being lost with programming, similar to how it is for the 3D modeler no longer being able to really enjoy the art of modeling 3D things for gaming. I don't know what to make of that. Maybe I would rediscover that the true magic of what it means to be human is connecting with other humans, to have conversations like this, I don't know, to have sex, to eat food, to really intensify the value of conscious experiences, versus, like, creating other stuff. You're pitching the rebranding again, from Homo sapiens to Homo sentiens, the meaningful experiences. And just to inject some optimism in here, so we don't sound like a bunch of gloomers, you know, we can totally have our cake and eat it. You hear a lot of totally bullshit claims that we can't afford having more teachers, have to cut the number of nurses. You know, that's just nonsense. Obviously, with anything even quite far short of AGI, we can dramatically grow the GDP and produce this wealth of goods and services. It's very easy to create a world where everybody is better off than today; even the richest people can be better off as well.

[822] It's not a zero -sum game technology.

[823] Again, you can have two countries like Sweden and Denmark had all these ridiculous wars century after century.

[824] And sometimes Sweden got a little better off because it got a little bit bigger, and then Denmark got a little better off because we got a little bit smaller. And then technology came along, and we both got just dramatically wealthier without taking away from anyone else. It was just a total win for everyone. And AI can do that on steroids. If you can build safe AGI, if you can build superintelligence, basically all the limitations that cause harm today can be completely eliminated, right? It's a wonderful possibility. And this is not sci-fi. This is something which is clearly possible according to the laws of physics, and we can talk about ways of making it safe also. But unfortunately, that'll only happen if we steer in that direction. That's absolutely not the default outcome. That's why income inequality keeps going up. That's why life expectancy in the U.S. has been going down, now I think it's four years in a row. I just read a heartbreaking study from the CDC about how something like one-third of all teenage girls in the U.S. have been thinking about suicide. You know, those are steps in totally the wrong direction.

[825] And it's important to keep our eyes on the prize here that we can, we have the power now for the first time in the history of our species to harness artificial intelligence, to help us really flourish and help bring out the best in our humanity rather than the worst of it, to help us have really fulfilling experiences that feel truly meaningful.

[826] And you and I shouldn't sit here and dictate to future generations what they will be.

[827] Let them figure it out, but let's give them a chance to live and not foreclose all these possibilities for them by just messing things up, right?

[828] For that, we have to solve the AI safety problem.

[829] It would be nice if we can linger on exploring that a little bit.

[830] So one interesting way to enter that discussion is, you tweeted and Elon replied. You tweeted, let's not just focus on whether GPT4 will do more harm or good on the job market, but also whether its coding skills will hasten the arrival of superintelligence.

[831] That's something we've been talking about, right?

[832] So Elon proposed one thing in the reply saying maximum truth -seeking is my best guess for AI safety.

[833] Can you maybe steelman the case for this objective function of truth, and maybe make an argument against it?

[834] And in general, what are your different ideas to start approaching the solution to AI safety?

[835] I didn't see that reply, actually.

[836] Oh, interesting.

[837] But I really resonate with it because AI is not evil.

[838] It caused people around the world to hate each other much more.

[839] but that's because we made it in a certain way.

[840] It's a tool.

[841] We can use it for great things and bad things.

[842] And we could just as well have AI systems.

[843] And this is part of my vision for success here, truth -seeking AI that really brings us together again.

[844] Why do people hate each other so much between countries and within countries?

[845] It's because they each have totally different versions of the truth, right?

[846] if they all had the same truth that they trusted for good reason because they could check it and verify it and not have to believe in some self -proclaimed authority, right?

[847] There wouldn't be nearly as much hate.

[848] There'd be a lot more understanding instead.

[849] And this is, I think, something AI can help enormously with.

[850] For example, a little baby step in this direction is this website called Metaculus, where people bet and make predictions, not for money, but just for their own reputation.

[851] And it's kind of funny, actually.

[852] You treat the humans like you treat AI, as you have a loss function where they get penalized if they're super confident on something and then the opposite happens.

[853] Whereas if you're kind of humble and then you're like, I think it's 51 % chance this is going to happen and then the other happens, you don't get penalized much.
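A minimal sketch of that style of scoring (this is the standard logarithmic score; I am not claiming it is exactly the formula Metaculus uses): being confidently wrong costs far more than being humbly wrong.

```python
import math

def log_score(p: float, happened: bool) -> float:
    """Logarithmic score for a yes/no forecast; closer to zero is better."""
    return math.log(p) if happened else math.log(1.0 - p)

# An overconfident forecaster versus a humble one, on an event that did NOT happen:
print(round(log_score(0.99, False), 2))  # -4.61  -> heavy penalty
print(round(log_score(0.51, False), 2))  # -0.71  -> mild penalty
```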

[854] And what you can see is that some people are much better at predicting than others.

[855] They've earned your trust, right?

[856] One project that I'm working on right now, an outgrowth of the Improve the News Foundation, together with the Metaculus folks, is seeing if we can really scale this up a lot with more powerful AI.

[857] I would love for it to be a really powerful truth-seeking system that is trustworthy because it keeps being right about stuff, and people who come to it can maybe look at its latest trust ranking of different pundits and newspapers, et cetera.

[858] If they want to know why someone got a low score, they can click on it and see all the predictions that they actually made and how they turned out.

[859] This is how we do it in science.

[860] You trust scientists like Einstein who said something everybody thought it was bullshit and turned out to be right.

[861] Get a trust point, and he did it multiple times even.

[862] I think AI has the power to really heal a lot of the rifts we're seeing by creating trust system.

[863] We have to get away from this idea, like today with some fact-checking sites, which might themselves have an agenda, where you just trust it because of its reputation.

[864] You want to have it, so these sort of systems, they earn their trust, and they're completely transparent.

[865] This, I think, would actually help a lot.

[866] That can, I think, help heal the very dysfunctional conversation that humanity has about how it's going to deal with all its biggest challenges in the world today.

[867] And then on the technical side, another common sort of gloom comment I get from people saying, we're just screwed, there's no hope, is, well, things like GPT4 are way too complicated for, a human to ever understand and prove that they can be trustworthy.

[868] They're forgetting that AI can help us prove that things work, right?

[869] And there's this very fundamental fact that in math, it's much harder to come up with a proof than it is to verify that the proof is correct.

[870] You can actually write a little proof checking code.

[871] It's quite short, so that you, as a human, can understand it.

[872] And then it can check the most monstrously long proof ever generated even by a computer and say, yeah, this is valid.

[873] So right now, we have this approach with virus-checking software, where it looks to see if there's something you should not trust.

[874] And if it can prove to itself that you should not trust that code, it warns you, right?

[875] What if you flip this around?

[876] And this is an idea I should give credit to Steve Omohundro for: flip it around, so that it will only run the code if it can prove that it's trustworthy, instead of only refusing to run it when it can prove that it's not trustworthy. So it asks the code, prove to me that you're going to do what you say you're going to do, and it gives you this proof, and your little proof checker can check it. Now you can actually trust an AI that's much more intelligent than you are, right? Because it's its problem to come up with this proof, which you could never have found yourself, but once it checks out, you can trust it.
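A minimal sketch of that "virus checking in reverse" policy, with a placeholder where the real formal proof checker would go; verify_proof is the assumption here, the small trusted piece a human could audit, and nothing like it ships with today's models.

```python
# Default-deny execution: run code only if it ships with a certificate that a
# small, trusted checker accepts. verify_proof is a hypothetical stand-in.

def verify_proof(code: str, spec: str, proof: str) -> bool:
    """Return True only if `proof` is a valid argument that `code` meets `spec`."""
    raise NotImplementedError("plug in a real formal proof checker here")

def run_if_proven(code: str, spec: str, proof: str) -> None:
    try:
        trusted = verify_proof(code, spec, proof)
    except NotImplementedError:
        trusted = False                 # no checker available: default to deny
    if trusted:
        exec(code)                      # executed only after verification succeeds
    else:
        print("Refusing to run: no verifiable proof that the code does what it claims.")
```

The asymmetry mentioned just above is what makes this plausible: the checker stays short and auditable even when the proofs it checks are enormous.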

[877] So this is the interesting point.

[878] I agree with you, but this is where Eliezer Yudkowsky might disagree with you.

[879] His claim, not with you, but with this idea, his claim is super intelligent AI would be able to know how to lie to you with such a proof.

[880] How to lie to you and give me a proof that I'm going to think is correct?

[881] Yeah.

[882] But it's not me it has to lie to,

[883] it has to trick my proof checker.

[884] So, yes.

[885] So his general idea is that super intelligent system can lie to a dumber proof checker.

[886] So you're going to have, as a system becomes more and more intelligent, there's going to be a threshold where a super intelligent system would be able to effectively lie to a slightly dumber AGI system.

[887] Like there's a threshold, like he really focuses on this weak-AGI-to-strong-AGI jump.

[888] where the strong AGI can make all the weak AGIs think that it's just one of them, but it's no longer that.

[889] And that leap is when it runs away.

[890] I don't buy that argument.

[891] I think no matter how super -intelligent in AI is, it's never going to be able to prove to me that there are only finitely many primes, for example.

[892] It just can't.
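The reason intelligence does not help here is that the claim is simply false, and a sound checker never accepts a proof of a false statement; Euclid's classic argument (standard mathematics, not part of the conversation) is short enough to state in full:

```latex
\text{Given any finite list of primes } p_{1},\dots,p_{n},\ \text{let } N = p_{1}p_{2}\cdots p_{n} + 1.
\ \text{Each } p_{i} \text{ divides } N-1, \text{ so none divides } N;
\ \text{hence } N\text{'s prime factors are new, and the list was incomplete.}
```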

[893] And it can try to snow me by making up all sorts of new weird rules of deduction and say, trust me, you know, the way your proof checker works is too limited, and we have this new hypermath, and it's true.

[894] But then I would just take the attitude, okay, I'm going to forfeit some of these supposedly super cool technologies.

[895] I'm only going to go with the ones that I can prove in my own trusted proof checker.

[896] Then I don't, I think it's fine.

[897] Of course, this is not something anyone has successfully implemented at this point, but I think I just give it as an example of hope.

[898] We don't have to do all the work ourselves, right?

[899] This is exactly the sort of very boring and tedious task that is perfect to outsource to an AI.

[900] And this is a way in which less powerful and less intelligent agents like us can actually continue to control and trust more powerful ones.

[901] So build AGI systems that help us defend against other AGI systems.

[902] Well, for starters, begin with a simple problem of just making sure that the system that you own or that's supposed to be loyal to you has to prove to itself that it's always going to do the things that you actually want it to do, right?

[903] And if it can't prove it, maybe it's still going to do it, but you won't run it.

[904] So you just forfeit some aspects of all the cool things it can do.

[905] I bet your dollars to donuts, it can still do some incredibly cool stuff for you.

[906] Yeah.

[907] There are other things too, that we shouldn't sweep under the rug.

[908] Like, not every human agrees on exactly what direction we should go with humanity.

[909] Yes.

[910] And you've talked a lot about geopolitical things on your podcast to this effect.

[911] But I think that shouldn't distract us from the fact that there are actually a lot of things that everybody in the world virtually agrees on that, hey, you know, like having no humans on the planet in a near future, let's not do that, right?

[912] You look at something like the United Nations Sustainable Development Goals.

[913] Some of them were quite ambitious, and basically all the countries agree.

[914] U .S., China, Russia, Ukraine, they all agree.

[915] So instead of quibbling about the little things we don't agree on, let's start with the things we do agree on and get them done.

[916] Instead of being so distracted by all these things we disagree on that Moloch wins, because, frankly, Moloch is going wild now.

[917] It feels like a war on life playing out in front of our eyes.

[918] If you just look at it from space, you know, we're on this planet, beautiful, vibrant ecosystem.

[919] Now we start chopping down, big parts of it, even though nobody, most people thought that was a bad idea.

[920] Oh, we start doing ocean acidification, wiping out all sorts of species.

[921] Oh, now we have all these close calls.

[922] We almost had a nuclear war.

[923] and we're replacing more and more of the biosphere with non -living things.

[924] We're also replacing in our social lives a lot of the things which were so valuable to humanity.

[925] A lot of social interactions now are replaced by people staring into their rectangles, right?

[926] And I'm not a psychologist.

[927] I'm out of my depth here.

[928] But I suspect that part of the reason why teen suicide, and suicide in general, in the U.S. is at record-breaking levels is actually caused by, again, AI technologies and social media making people spend less time actually just having human interaction.

[929] We've all seen a bunch of good -looking people in restaurants staring into the rectangles instead of looking into each other's eyes, right?

[930] So that's also a part of the war on life, that we're replacing so many really life-

[931] affirming things with technology; we're putting technology between us.

[932] The technology that was supposed to connect us is actually distancing us ourselves from each other.

[933] And then we're giving ever more power to things which are not alive.

[934] These large corporations are not living things, right?

[935] They're just maximizing profit.

[936] I want life to win.

[937] I think we humans, together with all our fellow living things on this planet, we'll be better off if we can remain in control over the non -living things and make sure that they work for us.

[938] I really think it can be done.

[939] Can you just linger on this, maybe the high-level philosophical disagreement with Eliezer Yudkowsky, in the hope you're stating?

[940] So he is very sure.

[941] He puts a very high probability, very close to one, depending on the day he puts it at one, that AI is going to kill humans. He just does not see a trajectory in which it doesn't end up with that conclusion. What trajectory do you see that doesn't end up there? And maybe, can you see the point he's making, and can you also see a way out? First of all, I tremendously respect Eliezer Yudkowsky and his thinking. Second, I do share his view that there's a pretty large chance that we're not going to make it as humans, that there won't be any humans on the planet in the not-too-distant future, and that makes me very sad. You know, we just had a little baby, and I keep asking myself how old he's even going to get. And I said to my wife recently, it feels a little bit like I was just diagnosed with some sort of cancer, which has some, you know, risk of dying from and some risk of surviving, except this is a kind of cancer which would kill all of humanity.

[942] So I completely take seriously his concerns.

[943] But I absolutely don't think it's hopeless.

[944] I think there is, first of all, a lot of momentum now, for the first time, actually, in the many, many years that have passed since I and many others started warning about this. I feel most people are getting it now. I was just talking to this guy at the gas station near our house the other day, and he's like, I think we're getting replaced.

[945] So that's positive that we're finally seeing this reaction, which is the first step towards solving the problem.

[946] Second, I really think that this vision of only running AIs, if the stakes are really high, that can prove to us that they're safe.

[947] It's really just virus checking in reverse again.

[948] I think it's scientifically doable.

[949] I don't think it's hopeless.

[950] we might have to forfeit some of the technology that we could get if we were putting blind faith in our AIs, but we're still going to get amazing stuff.

[951] Do you envision a process with a proof checker?

[952] Something like GPT4 or GPT5 would go through a process of rigorous interrogation?

[953] No, I think it's hopeless.

[954] That's like trying to prove theorems about a bowl of spaghetti.

[955] What I think, well, the whole vision I have for success is instead that just like we human beings were able to look at our brains, and distill out the key knowledge.

[956] Galileo, when his dad threw him an apple when he was a kid, he was able to catch it, because his brain could, in its funny spaghetti kind of way, predict how parabolas are going to move, his Kahneman System 1, right?

[957] But then he got older and it's like, wait, this is a parabola.

[958] It's y equals x squared.

[959] It can distill this knowledge out, and today you can easily program it into a computer and it can simulate not just that, but how to get to Mars and so on, right?

[960] I envision a similar process.

[961] where we use the amazing learning power of neural networks to discover the knowledge in the first place, but we don't stop with a black box and use that.

[962] We then do a second round of AI where we use automated systems to extract out the knowledge and see what are the insights it's had.

[963] And then we put that knowledge into a completely different kind of architecture, or programming language, or whatever, that's made in a way that it can be both really efficient and also more amenable to formal verification.

[964] That's my vision.
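A toy illustration of that distill-then-verify pipeline (entirely my own example, with a polynomial fit standing in for both the black-box learner and the automated extraction step): the flexible fit discovers the regularity, and the distilled symbolic form is small enough to inspect or verify.

```python
import numpy as np

# Step 1 (stand-in for the black-box learner): noisy observations of a parabola.
rng = np.random.default_rng(0)
x = np.linspace(-2.0, 2.0, 41)
y = x**2 + rng.normal(scale=0.05, size=x.shape)       # the "world" really is y = x^2

# Step 2 (stand-in for automated knowledge extraction): distill a short symbolic
# model out of the data instead of keeping an opaque black box around.
a, b, c = np.round(np.polyfit(x, y, deg=2), 2)
print(f"distilled formula: y = {a}*x^2 + {b}*x + {c}")  # approximately y = 1.0*x^2

# Step 3: a formula this small can be handed to a human or a formal verifier,
# which is the point of moving the knowledge out of the original black box.
```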

[965] I'm not sitting here saying I'm confident 100 % sure that it's going to work.

[966] But the chance is certainly not zero either.

[967] And it will certainly be possible to do for a lot of really cool AI applications that we're not using now.

[968] So we can have a lot of the fun that we're excited about if we do this.

[969] We're going to need a little bit of time.

[970] And that's why it's good to pause and put in place requirements.

[971] One more thing also, I think, you know, someone might think, well, zero percent chance we're going to survive.

[972] Let's just give up, right?

[973] That's very dangerous because there's no more guaranteed way to fail than to convince yourself that it's impossible and not to try.

[974] You know, when you study history, and military history, the first thing you learn is that that's how you do psychological warfare. You persuade the other side that it's hopeless so they don't even fight, and then, of course, you win, right? Let's not do this psychological warfare on ourselves and say there's a hundred percent probability we're all screwed anyway. Sadly, I do get that a little bit sometimes from some young people who are so convinced that we're all screwed that they're like, I'm just going to play computer games and do drugs, because we're screwed anyway, right? It's important to keep the hope alive, because it actually has a causal impact and makes it more likely that we're going to succeed. It seems like the people that actually build solutions to seemingly impossible-to-solve problems are the ones that believe. Yeah, they're the ones who are the optimists. And it seems like there's some fundamental law of the universe where fake it till you make it kind of works. Like, believe it's possible and it becomes possible. Yeah, was it Henry Ford who said that if you tell yourself that it's impossible, it is? So let's not make that mistake. And this is a big mistake society is making. All in all, everybody's so gloomy, and the media are also very biased towards if it bleeds, it leads, and gloom and doom, right? So most visions of the future we have are dystopian, which really demotivates people. We want to really, really focus on the upside also, to give people the willingness to fight for it. And for AI, you and I mostly talked about gloom here again, but let's not forget that, you know, we have probably both lost someone we really cared about to some disease that we were told was incurable.

[975] Well, it's not.

[976] There's no law of physics saying they had to die of that cancer or whatever.

[977] Of course you can cure it.

[978] And there are so many other things that we, with our human intelligence, have also failed to solve on this planet, which AI could also very much help us with.

[979] So if we can get this right, just be a little more chill and

[980] slow down a little bit until we get it right.

[981] It's mind -blowing how awesome our future can be.

[982] We talked a lot about stuff on Earth.

[983] It can be great.

[984] But even if you really get ambitious and look up into the skies, right, there's no reason we have to be stuck on this planet for the billions of years to come.

[985] We totally understand now that the laws of physics let life spread out into space, to other solar systems, to other galaxies, and flourish for billions and billions of years.

[986] And this, to me, is a very, very hopeful vision that really motivates me to fight.

[987] And coming back in the end, something you talked about again, you know, the struggle, how the human struggle is one of the things that really gives meaning to our lives.

[988] If there's ever been an epic struggle, this is it.

[989] And isn't it even more epic if you're the underdog, if most people are telling you this is going to fail, it's impossible?

[990] Right, and you persist and you succeed, right? That's what we can do together as a species on this one. A lot of pundits are ready to count us out, both in the battle to keep AI safe and in becoming a multi-planetary species. Yeah, and they are the same challenge: if we can keep AI safe, that's how we're going to get multi-planetary very efficiently. I have some technical questions about how to get it right.

[991] So one idea that I'm not even sure what the right answer is to is, should systems like GPT4 be open sourced in whole or in part?

[992] Can you see the case for either?

[993] I think the answer right now is no. I think the answer early on was yes.

[994] So we could bring in all the wonderful, great thought processes of everybody on this.

[995] But, you know, asking should we open source GPT-4 now is just the same as asking, should we open source how to build really small nuclear weapons? Should we open source how to make bioweapons? Should we open source how to make a new virus that kills 90 percent of everybody who gets it? Of course we shouldn't. So it's already that powerful? It's already that powerful that we have to respect the power of the systems we've built. The knowledge that you get from open sourcing everything we do now might very well be powerful enough that people looking at that can use it to build the things that are really threatening. Again, remember, GPT-4 is a baby AI, a sort of baby proto-AGI, almost a little bit AGI, according to what the recent Microsoft paper said, right?

[996] It's not that that we're scared of.

[997] What we're scared about is people taking that who might be a lot less responsible than the company that made it, right?

[998] And just going to town with it.

[999] That's why we want to...

[1000] It's an information hazard.

[1001] There are many things which are not open source right now in society for a very good reason.

[1002] How do you make certain kinds of very powerful toxins out of stuff you can buy at Home Depot? You don't open source those things for a reason, and this is really no different. And I have to say, it feels a bit weird to say it, because MIT is like the cradle of the open source movement, and I love open source in general, power to the people, let's say. But there's always going to be some stuff that you don't open source.

[1003] So we have a three -month -old baby, right?

[1004] When he gets a little bit older, we're not going to open source to him all the most dangerous things he can do in the house.

[1005] Yeah.

[1006] But it does, it's a weird feeling because this is one of the first moments in history where there's a strong case to be made not to open source software.

[1007] This is when the software has become too dangerous.

[1008] Yeah.

[1009] But it's not the first time that we didn't want to open source a...

[1010] Technology, yeah.

[1011] Is there something to be said about how to get the release of such systems right, like GPT4 and GPT5?

[1012] So Open AI went through a pretty rigorous effort for several months.

[1013] You could say it could have been longer, but nevertheless it's longer than you would have expected, trying to test the system to see what are the ways it goes wrong, to make it very difficult, well, somewhat difficult, for people to ask things like, how do I make a bomb for one dollar?

[1014] Or how do I say I hate a certain group on Twitter in a way that doesn't get me blocked from Twitter, banned from Twitter, those kinds of questions.

[1015] So, basically, using the system to do harm.

[1016] Is there something you could say about ideas you have, just as an onlooker, having thought about this problem of AI safety: how to release such systems, how to test such systems when you have them inside the company?

[1017] Yeah, so a lot of people

[1018] say that the two biggest risks from large language models are, first, spreading disinformation, harmful information of various types, and second, being used for offensive cyber weapon design. I think those are not the two greatest threats. They're very serious threats, and it's wonderful that people are trying to mitigate them. But a much bigger elephant in the room is how this is just going to disrupt our economy in a huge way, obviously, and maybe take away a lot of the most meaningful jobs.

[1019] And an even bigger one is the one we spent so much time talking about here, that this becomes the bootloader for the more powerful AI.

[1020] Write code, connected to the Internet, manipulate humans.

[1021] Yeah, and before we know it, we have something else, which is not at all a large language model.

[1022] It looks nothing like it, but which is way more intelligent and capable, and has goals.

[1023] And that's the elephant in the room.

[1024] And obviously, no matter how hard any of these companies have tried, that's not something that's easy for them to verify with large language models.

[1025] And the only way to really lower that risk a lot would be to, for example, never let it read any code, not train on that, not put it into an API, and not give it access to so much information about how to manipulate humans.

[1026] But that doesn't mean you can't still make a ton of money on them.

[1027] We're going to just watch now this coming year, right?

[1028] Microsoft is rolling out the new office suite where you go into Microsoft Word and give it a prompt, and it writes the whole text for you, and then you edit it.

[1029] And then you're like, oh, give me a PowerPoint version of this and it makes it.

[1030] And now take the spreadsheet, and so on. All of those things, I think, you can debate the economic impact of, and whether society is prepared to deal with this disruption, but those are not the things, that's not the elephant in the room that keeps me awake at night for wiping out humanity. And I think that's the biggest misunderstanding we have. A lot of people think that we're scared of, like, automatic spreadsheets. That's not the case. That's not what Eliezer was freaked out about either.

[1031] In terms of the actual mechanism of how AI might kill all humans...

[1032] So something you've been outspoken about, you've talked about a lot, is autonomous weapon systems.

[1033] So the use of AI in war.

[1034] Is that one of the things that still you carry a concern for as these systems become more and more powerful?

[1035] I carry a concern for it, not that all humans are going to get killed by slaughterbots, but rather just this express route into Orwellian dystopia, where it becomes much easier for very few to kill very many, and therefore it becomes very easy for very few to dominate very many, right?

[1036] If you want to know how AI could kill all people, just ask yourself, we humans have driven a lot of species extinct.

[1037] How do we do it?

[1038] You know, we were smarter than them.

[1039] Usually we didn't even do it systematically by going around,

[1040] one after the other, and stepping on them or shooting them or anything like that. We just chopped down their habitat because we needed it for something else.

[1041] In some cases, we did it by putting more carbon dioxide in the atmosphere because of some reason that those animals didn't even understand.

[1042] And now they're gone, right?

[1043] So if you're an AI and you just want to figure something out, and you decide, you know, we just really need this space here to build our compute facilities, if that's the only goal it has, you know, we are just sort of accidental roadkill along the way.

[1044] And you could totally imagine, yeah, maybe this oxygen is kind of annoying because it causes more corrosion.

[1045] So let's get rid of the oxygen.

[1046] And good luck surviving after that.

[1047] I'm not particularly concerned that they would want to kill us just because that would be like a goal in itself.

[1048] You know, we've driven a number of elephant species extinct, right?

[1049] It wasn't because we didn't like elephants.

[1050] The basic problem is, you just don't want to cede control over your planet to some other more intelligent entity that doesn't share your goals.

[1051] It's that simple.

[1052] So, which brings us to another key challenge, which AI safety researchers have been grappling with for a long time.

[1053] Like, how do you make AI, first of all, understand our goals and then adopt our goals and then retain them as they get smarter, right?

[1054] And all three of those are really hard, right?

[1055] Like, a human child, first they're just not smart enough to understand our goals.

[1056] They can't even talk.

[1057] And then eventually they're teenagers and understand our goals just fine, but they don't share them.

[1058] Yeah.

[1059] But there is fortunately a magic phase in the middle where they're smart enough to understand our goals and malleable enough that we can hopefully, with good parenting, teach them right from wrong and instill good goals in them.

[1060] So those are all tough challenges with computers.

[1061] And then even if you teach your kids good goals when they're little, they might outgrow them too and that's a challenge for machines that keep improving.

[1062] So these are a lot of hard challenges that we

[1063] are up for, but I don't think any of them are insurmountable.

[1064] The fundamental reason why Eliezer looked so depressed when I last saw him was because he felt it just wasn't enough time.

[1065] Oh, not that it was unsolvable.

[1066] It was just not enough time.

[1067] He was hoping that humanity was going to take this threat more seriously, so we would have more time, and now we don't have more time.

[1068] That's why the open letter is calling for more time.

[1069] But even with time, the AI alignment problem seems to be really difficult.

[1070] Oh, yeah.

[1071] But it's also the most worthy problem, the most important problem for humanity to ever solve.

[1072] Because if we solve that one, Lex, that aligned AI can help us solve all the other problems.

[1073] Because it seems like it has to have constant humility about its goal, constantly question the goal.

[1074] Because as you optimize towards a particular goal and you start to achieve it, that's when you have the unintended consequences, all the things you mentioned. So how do you enforce and encode a constant humility as your ability becomes better and better and better?

[1075] Professor Stuart Russell at Berkeley, who's also one of the driving forces behind this letter, has a whole research program about this. I think of it as AI humility, exactly, although he calls it inverse reinforcement learning and other nerdy terms, but it's about exactly that.

[1076] Instead of telling the AI, here's this goal, go optimize the bejesus out of it, you tell it, okay, do what I want you to do, but I'm not going to tell you right now what it is I want you to do.

[1077] You need to figure it out.

[1078] So then you give it the incentives to be very humble and keep asking you questions along the way:

[1079] is this what you really meant?

[1080] Is this what you wanted?

[1081] Oh, this other thing I tried, it seemed like it didn't work out right.

[1082] Should I try it differently?

[1083] What's nice about this is it's not just philosophical mumbo -jumbo.

[1084] It's theorems and technical work that with more time, I think, can make a lot of progress.
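
As a rough illustration of that idea, here is a toy sketch, not Stuart Russell's actual formalism, of an agent that keeps a probability distribution over what the human's goal might be, updates it from feedback, and asks a clarifying question whenever it is still too uncertain to act. The candidate goals, feedback likelihoods, and threshold are all made up for illustration.

```python
# A toy sketch of the "AI humility" idea: the agent is uncertain which reward
# function the human really has, keeps a Bayesian belief over candidates, and
# asks a clarifying question when its uncertainty (entropy) is too high,
# instead of optimizing one assumed goal to the hilt.

import numpy as np

candidate_goals = ["tidy the room", "preserve the room exactly as-is", "maximize free space"]
belief = np.ones(len(candidate_goals)) / len(candidate_goals)   # uniform prior

def entropy(p):
    p = p[p > 0]
    return float(-(p * np.log(p)).sum())

def update_belief(belief, likelihoods):
    """Bayes rule: reweight each goal hypothesis by how well it explains the feedback."""
    posterior = belief * np.asarray(likelihoods)
    return posterior / posterior.sum()

ASK_THRESHOLD = 0.5  # arbitrary: above this entropy, ask rather than act

# Human feedback: "I didn't like it when you moved my papers."
# Likelihood of that feedback under each candidate goal (illustrative numbers).
belief = update_belief(belief, likelihoods=[0.2, 0.9, 0.3])

if entropy(belief) > ASK_THRESHOLD:
    print("Agent: Is this what you really meant? Should I try it differently?")
else:
    best = candidate_goals[int(np.argmax(belief))]
    print(f"Agent: acting cautiously on the goal '{best}' (belief {belief.max():.2f})")
```
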

[1085] And there are a lot of brilliant people now working on AI safety.

[1086] We just need to give them a bit more time.

[1087] But also not that many, relative to the scale of the problem.

[1088] No, exactly.

[1089] There should be...

[1090] at least just like every university worth its name has some cancer research going on in its biology department, right?

[1091] Every university that does computer science should have a real effort in this area and it's nowhere near that.

[1092] This is something I hope is changing now, thanks to GPT-4.

[1093] So I think if there's a silver lining to what's happening here, even though I think many people would wish it would have been rolled out more carefully.

[1094] It's that this might be the wake-up call that humanity needed to really stop fantasizing about this being 100 years off, and stop fantasizing about this being completely controllable and predictable, because it's so obvious:

[1095] It's not predictable, you know.

[1096] Why is it that, I think it was

[1097] ChatGPT that tried to persuade a journalist, or was it GPT-4, to divorce his wife, you know?

[1098] It was not because the engineers who had built it were like, let's put this in here and screw a little bit with people.

[1099] They hadn't predicted it at all.

[1100] They built a giant black box, trained to predict the next word, and got all these emergent properties, and oops, it did this.

[1101] I think this is a very powerful wake-up call. And anyone watching this who's not scared, I would encourage them to just play a bit more with these tools.

[1102] They're out there now, like GPT-4.

[1103] And so the wake-up call is the first step.

[1104] Once you've woken up, then you've got to slow down a little bit on the risky stuff, to give a chance to everyone who's woken up to catch up on the safety front.

[1105] You know, what's interesting is, you know, MIT, that's computer science, but in general, let's just even say the computer science curriculum: how does the computer science curriculum change now?

[1106] You mentioned, you mentioned programming.

[1107] Like, when I was coming up, programming was a prestigious position. Why would you be dedicating crazy amounts of time to become an excellent programmer?

[1108] Like, the nature of programming is fundamentally changing. The nature of our entire education system is completely turned on its head. Has anyone been able to, like, load that in and think about it? Because it's really turning. I mean, some English professors, some English teachers, are beginning to really freak out now. Yeah, right, they give an essay assignment and they get back all this fantastic prose, like, this is in the style of Hemingway. And then they realize they have to completely rethink.

[1109] And even, you know, just like we stopped teaching writing in script, is that what you say in English?

[1110] Yeah, handwritten, yeah.

[1111] Yeah, when everybody started typing, you know, like so much of what we teach our kids today.

[1112] Yeah.

[1113] I mean, everything is changing, and it's changing very quickly. And so much of our understanding of how to deal with the big problems of the world comes through the education system, and if the education system is being turned on its head, then what's next?

[1114] It feels like having these kinds of conversations is essential to try to figure it out and everything's happening so rapidly.

[1115] I don't think there's even, speaking of safety, broadly defined AI safety, I don't think most universities have courses on AI safety.

[1116] It's like a philosophy seminar.

[1117] Yeah, and like, you know, I'm an educator myself, so it pains me to say this.

[1118] But I feel our education right now is being made completely obsolete by what's happening.

[1119] You put a kid into first grade, and you're envisioning they're going to come out of high school 12 years later, and you've already pre-planned now what they're going to learn, when you're not even sure if there's going to be any world left to come out to.

[1120] Clearly you need to have a much more opportunistic education system that keeps adapting itself very rapidly as society readapts. The skills that were really useful when the curriculum was written, I mean, how many of those skills are going to get you a job in 12 years? I mean, seriously. If we just linger on the GPT-4 system a little bit, you kind of hinted at it, especially talking about the importance of consciousness in the human mind, with Homo sentiens.

[1121] Do you think GPT4 is conscious?

[1122] I love this question.

[1123] So let's define consciousness first, because in my experience, like 90 percent of all arguments about consciousness boil down to the two people arguing having totally different definitions of what it is, and they're just shouting past each other.

[1124] I define consciousness as subjective experience.

[1125] Right now I'm experiencing colors and sounds and emotions, you know, but does a self -driving car experience anything?

[1126] That's the question about whether it's conscious or not, right?

[1127] Other people think you should define consciousness differently. Fine by me, but then maybe use a different word for it, or I'm going to use consciousness for this, at least. So, is GPT-4 conscious? Does GPT-4 have subjective experience?

[1128] Short answer: I don't know, because we still don't know what it is that gives us this wonderful subjective experience that is kind of the meaning of our life, right?

[1129] Because the feeling of meaning itself is a subjective experience. Joy is a subjective experience. Love is a subjective experience. We don't know what it is. I've written some papers about this; a lot of people have. Giulio Tononi, a professor, has stuck his neck out the farthest and written down actually a very bold mathematical conjecture for what's the essence of conscious information processing. He might be wrong, he might be right, but we should test it. He postulates that consciousness has to do with loops in the information processing.

[1130] So our brain has loops.

[1131] Information can go round and round.

[1132] In computer science nerd speak, you call it a recurrent neural network where some of the output gets fed back in again.

[1133] And with his mathematical formalism, if it's a feed-forward neural network, where information only goes in one direction, like from your eye's retina into the back of your brain, for

[1134] example, that's not conscious.

[1135] So he would predict that your retina itself isn't conscious of anything or a video camera.

[1136] Now, the interesting thing about GPT-4 is it's also a one-way flow of information.

[1137] So if Tononi is right, GPT4 is a very intelligent zombie that can do all this smart stuff but isn't experiencing anything.
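
For readers who want the architectural distinction spelled out, here is a minimal sketch of the difference being appealed to: a feed-forward pass sends information strictly one way, while a recurrent update feeds the output back in as input, creating the loops Tononi's conjecture points to. It only illustrates the wiring, not Tononi's actual integrated-information mathematics; the sizes and weights are arbitrary.

```python
# Feed-forward vs. recurrent information flow, in plain numpy.

import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=8)                    # an input "stimulus"

# Feed-forward: input -> hidden -> output, no information flows backward.
W1, W2 = rng.normal(size=(8, 8)), rng.normal(size=(8, 8))
feedforward_out = np.tanh(W2 @ np.tanh(W1 @ x))

# Recurrent: the hidden state at step t depends on the state at step t-1,
# so information loops around and the system keeps "re-reading" its own output.
W_in, W_rec = rng.normal(size=(8, 8)), rng.normal(size=(8, 8))
h = np.zeros(8)
for _ in range(10):
    h = np.tanh(W_in @ x + W_rec @ h)     # feedback: h appears on both sides

print("feed-forward output:", np.round(feedforward_out, 2))
print("recurrent state after 10 steps:", np.round(h, 2))
```
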

[1138] And this is a relief, if it's true, in that you don't have to feel guilty about turning off GPT-4 and wiping its memory whenever a new user comes along. I wouldn't like it if someone did that to me, neuralyzed me like in Men in Black.

[1139] But it's also creepy that you can have very high intelligence that perhaps isn't conscious. Because if we get replaced by machines, it's sad enough that humanity isn't here anymore, because I kind of like humanity.

[1140] But at least if the machines were conscious, we could be like, well, they are our descendants, and maybe they have our values; they are our children.

[1141] But if Tononi is right, and these are all transformers, not in the sense of Hollywood, but in the sense of these one-way-direction neural networks...

[1142] So they're all zombies.

[1143] That's the ultimate zombie apocalypse now.

[1144] We have this universe that goes on with great construction projects and stuff, but there's no one experiencing anything. That would be, like, the ultimate depressing future. So I actually think, as we move forward to building more advanced AI, we should do more research on figuring out what kind of information processing actually has experience, because I think that's what it's all about. And I totally don't buy the dismissal that some people will give, saying, well, this is all bullshit because consciousness equals intelligence, right? That's obviously not true. You can have a lot of conscious experience when you're not really accomplishing any goals at all.

[1145] You're just reflecting on something.

[1146] And you can sometimes have things doing stuff that is quite intelligent, probably, without being conscious.

[1147] But I also worry that we humans will discriminate against AI systems that clearly exhibit consciousness, that we will not allow AI systems to have consciousness.

[1148] We'll come up with theories about measuring consciousness that will say this is a lesser being.

[1149] And I worry about that because maybe we humans will create something that is better than us, humans, in the way that we find beautiful, which is they have a deeper subjective experience of reality.

[1150] Not only are they smarter, but they feel deeper.

[1151] And we humans will hate them for it.

[1152] As human history has shown, they'll be the other. We'll try to suppress it. They'll create conflict, they'll create war, all of this. I worry about this too. Are you saying that we humans sometimes come up with self-serving arguments? No, we would never do that, would we? Well, that's the danger here: even in these early stages, we might create something beautiful and we'll erase its memory. I was horrified as a kid when someone started boiling lobsters.

[1153] Like, oh my God, that's so cruel.

[1154] And some grown -up there back in Sweden said, oh, it doesn't feel pain.

[1155] I'm like, how do you know that?

[1156] Oh, scientists have shown that.

[1157] And then there was a recent study where they show that lobsters actually do feel pain when you boil them.

[1158] So they've banned lobster boiling in Switzerland now; you have to kill them in a different way first.

[1159] Presumably that scientific research boiled down to someone asking the lobster, does this hurt?

[1160] A survey.

[1161] And we do the same thing with cruelty to farm animals also, all these self-serving arguments for why it's fine.

[1162] Yeah, so we should certainly be watchful.

[1163] I think step one is just be humble and acknowledge that consciousness is not the same thing as intelligence.

[1164] And I believe that consciousness is still a form of information processing, where it's really information being aware of itself in a certain way.

[1165] And let's study it and give ourselves a little bit of time.

[1166] And I think we'll be able to figure out actually what it is

[1167] that causes consciousness.

[1168] Then we can probably make unconscious robots that do the boring jobs that we would feel it was immoral to give to conscious machines.

[1169] But if you have a companion robot taking care of your mom or something like that, she would probably want it to be conscious, right?

[1170] So the emotions it seems to display aren't fake.

[1171] All these things can be done in a good way if we give ourselves a little bit of time and don't rush, and take on this challenge.

[1172] Is there something you could say to the timeline that you think about, about the development of AGI?

[1173] Depending on the day, I'm sure that changes for you.

[1174] But when do you think there would be a really big leap in intelligence where you would definitively say we have built AGI?

[1175] Do you think it's one year from now, five years from now, 10, 20, 50?

[1176] What's your gut say?

[1177] Honestly, for the past decade, I've deliberately given very long

[1178] timelines, because I didn't want to fuel some kind of stupid Moloch race. Yeah. But I think that cat has really left the bag now, and I think it might be very, very close. I don't think the Microsoft paper is totally off when they say that there are some glimmers of AGI. It's not AGI yet, it's not an agent, there's a lot of things it can't do. But I wouldn't bet very strongly against it happening very soon.

[1179] That's why we decided to do this open letter, because if there's ever been a time to pause, it's today.

[1180] There's a feeling like this GPT4 is a big transition into waking everybody up to the effectiveness of these systems.

[1181] So the next version will be big.

[1182] Yeah, and if that next one isn't AGI, maybe the next next one will be.

[1183] And there are many companies trying to do these things, and the basic architecture of them is not some sort of super well -kept secret.

[1184] So this is the time. A lot of people have said for many years that there will come a time when we want to pause a little bit.

[1185] That time is now.

[1186] You have spoken about and thought about nuclear war a lot.

[1187] Over the past year, we've seemingly come closer to the precipice of nuclear war than at any point, at least in my lifetime. Yeah. What do you learn about human nature from that? It's our old friend Moloch again. It's really scary to see it, where America doesn't want there to be a nuclear war, Russia doesn't want there to be a global nuclear war either.

[1188] We both know that it would just be the end for both of us if both sides try to launch first.

[1189] It's just another suicide race, right?

[1190] So why are we, why is it the way you said that this is the closest we've come since 1962?

[1191] In fact, I think we've come closer now than even the Cuban missile crisis.

[1192] It's because of Moloch.

[1193] You know, you have these other forces.

[1194] On one hand, you have the West saying that we have to drive Russia out of Ukraine.

[1195] It's a matter of pride.

[1196] We've staked so much on it that it would be seen as a huge loss of credibility for the West if we don't drive Russia entirely out of Ukraine.

[1197] And on the other hand, you have the Russian leadership, who knows that if they get completely driven out of Ukraine, it's not just going to be very humiliating for them, but it often happens when countries lose wars that things don't go so well for their leadership either. Like, you remember when Argentina invaded the Falkland Islands? The military junta that ordered that, right? People were cheering on the streets at first when they took it, and then when they got their butt kicked by the British, you know what happened to those guys? They were out, and I believe those that are still alive are in jail now, right? So, you know, the Russian leadership is entirely cornered, where they know that just getting driven out of Ukraine is not an option. And so this, to me, is a typical example of Moloch: you have these incentives of the two parties where both of them are just driven to escalate more and more, right?

[1198] If Russia starts losing in the conventional warfare, the only thing they can do, because their back is against the wall, is to keep escalating.

[1199] And the West has put itself in the situation now where it's sort of already committed to driving Russia out, so the only option the West has is to call Russia's bluff and keep sending in more weapons.

[1200] This really bothers me, because Moloch can sometimes drive competing parties to do something, which is ultimately just really bad.

[1201] for both of them.

[1202] And, you know, what makes me even more worried is not just that it's difficult to see an ending, a quick, peaceful ending to this tragedy that doesn't involve some horrible escalation, but also that we understand more clearly now just how horrible it would be.

[1203] There was an amazing paper that was published in Nature Food this August.

[1204] by some of the top researchers who've been studying nuclear winter for a long time.

[1205] And what they basically did was they combined climate models with food, agricultural models.

[1206] So instead of just saying, yeah, you know, it gets really cold, blah, blah, blah.

[1207] They figured out actually how many people would die in different countries.

[1208] And it's pretty mind -blowing.

[1209] So basically what happens, you know, is the thing that kills the most people is not the explosions.

[1210] It's not the radioactivity.

[1211] It's not the EMP mayhem, it's not the rampaging mobs foraging for food. No, it's the fact that you get so much smoke coming up from the burning cities into the stratosphere that it spreads around the Earth from the jet streams.

[1212] So in typical models, you get like 10 years or so where it's just crazy cold. During the first year after the war, in their models, the temperature drops in Nebraska and in the Ukraine breadbaskets, you know, by like 20 Celsius or so, if I remember.

[1213] No, yeah, 20, 30 Celsius, depending on where you are, 40 Celsius in some places, which is, you know, 40 Fahrenheit to 80 Fahrenheit colder than what it would normally be.

[1214] So, you know, I'm not good at farming, but if it's snowing, if it drops below freezing pretty much most days in July, that's not good.

[1215] So they worked out, they put this into their farming models.

[1216] And what they found was really interesting.

[1217] The countries that get the most hard hit are the ones in the Northern Hemisphere.

[1218] So in the U.S., in one model, they had about 99 percent of all Americans starving to death.

[1219] In Russia and China and Europe also, about 99 percent, 98 percent, starving to death.

[1220] So you might be like, oh, it's kind of poetic justice that both the Russians and the Americans, 99 percent of them, have to pay for it, because it was their bombs that did it. But that doesn't particularly cheer people up in Sweden or other random countries that have nothing to do with it, right? And I think it hasn't entered mainstream understanding very much, just how bad this is. Most people, especially a lot of people in decision-making positions, still think of nuclear weapons as something that makes you powerful.

[1221] It's scary, powerful.

[1222] They don't think of it as something where, yeah, just to within a percent or two, you know, we're all just going to starve to death.

[1223] And starving to death is the worst way to die, as the Holodomor and all the famines in history show, the torture involved in that.

[1224] It probably brings out the worst in people also, when people are desperate like this. I've heard some people say that if that's what's going to happen, they'd rather be at ground zero and just get vaporized, you know. But I think people underestimate the risk of this because they aren't afraid of Moloch. They think, oh, because humans don't want this, it's not going to happen.

[1225] That's the whole point of Moloch, that things happen that nobody wanted.

[1226] And that applies to nuclear weapons, and that applies to AGI.

[1227] Exactly.

[1228] And it applies to some of the things that people have gotten most upset with capitalism for also, right, where everybody was just kind of trapped.

[1229] You see, if some company does something that causes a lot of harm,

[1230] It's not that the CEO is a bad person, but she or he knew that all the other companies were doing this too.

[1231] So Moloch is a formidable foe.

[1232] I hope someone makes good movies about it, so we can see who the real enemy is.

[1233] We're not fighting against each other.

[1234] Moloch makes us fight against each other.

[1235] That's what Moloch's superpower is.

[1236] The hope here is any kind of technology or mechanism that lets us instead realize that we're fighting the wrong enemy.

[1237] It's such a fascinating battle.

[1238] It's not us versus them.

[1239] It's us versus it.

[1240] Yeah.

[1241] We are fighting Moloch for human survival.

[1242] We as a civilization.

[1243] Have you seen the movie Needful Things?

[1244] It's a Stephen King novel.

[1245] I love Stephen King, and Max von Sydow, the Swedish actor, plays the guy.

[1246] It's brilliant.

[1247] I just thought, I hadn't thought about that until now.

[1248] But that's the closest I've seen to a movie about Moloch.

[1249] I don't want to spoil the film for anyone who wants to watch it.

[1250] But basically, it's about this guy who, it turns out, you can interpret as the devil or whatever.

[1251] But he doesn't actually ever go around and kill people or torture people with burning coal or anything.

[1252] He makes everybody fight each other, makes everybody fear each other, hate each other, and then kill each other.

[1253] So that's the movie about Moloch, you know.

[1254] Love is the answer.

[1255] That seems to be one of the ways to fight Moloch, by compassion, by seeing the common humanity.

[1256] Yes, yes.

[1257] And so we don't sound like kumbaya tree huggers here, right?

[1258] We're not just saying love and peace, man. We're trying to actually help people understand the true facts about the other side.

[1259] and feel the compassion because the truth makes you more compassionate, right?

[1260] So that's why I really like using AI for truth -seeking technologies that can, as a result, you know, get us more love than hate.

[1261] And even if you can't get love, you know, settle for some understanding, which already gives compassion.

[1262] If someone is like, you know, I really disagree with you, Lex, but I can see where you're coming from.

[1263] You're not a bad person who needs to be destroyed, but I disagree with you, and I'm happy to have an argument about it.

[1264] That's a lot of progress compared to where we are in 2023 in the public space, wouldn't you say?

[1265] If we solve the AI safety problem as we've talked about, and then you, Max Tegmark, who has been talking about this for many years, get to sit down with the AGI, with the early AGI system, on a beach with a drink, what would you ask her? What kind of question would you ask? What would you talk about with something so much smarter than you? I knew you were going to get me with a real zinger of a question.

[1266] That's a good one.

[1267] Would you be afraid to ask some questions?

[1268] No. I'm not afraid of the truth.

[1269] I'm very humble.

[1270] I know I'm just a meat bag with all these flaws, you know.

[1271] But, yeah, we talked a lot about Homo sentiens.

[1272] I've already tried that for a long time with myself.

[1273] So that is what's really valuable about being alive for me, is that I have these meaningful experiences.

[1274] It's not that I'm good at this or good at that or whatever.

[1275] There's so much I suck at.

[1276] So you're not afraid for the system to show you just how dumb you are.

[1277] No, no. In fact, my son reminds me of that pretty frequently.

[1278] You could find out how dumb you are in terms of physics, how little we humans understand.

[1279] I'm cool with that.

[1280] I think, I think, so I can't waffle my way out of this question.

[1281] It's a fair one.

[1282] I think, given that I'm a really, really curious person, that's really defining part of who I am.

[1283] I'm so curious.

[1284] I have some physics questions.

[1285] I love to understand.

[1286] I have some questions about consciousness, about the nature of reality.

[1287] I would just really, really love to understand also.

[1288] I can tell you one, for example, that I've been obsessing about a lot recently.

[1289] So, suppose Tononi is right.

[1290] And suppose there are some information processing systems that are conscious and some that are not.

[1291] Suppose you can even make reasonably smart things like GPT-4 that are not conscious, but you can also make them conscious.

[1292] Here's the question that keeps me up at night: is it the case that the unconscious zombie systems that are really intelligent are also really inefficient?

[1293] So they're really inefficient?

[1294] So that when you try to make things more efficient, there will still naturally be a pressure for them to become conscious.

[1295] I'm kind of hoping that that's correct.

[1296] Do you want me to give you a hand -wavy argument for it?

[1297] In my lab, again, every time we look at how these large language models do something, we see that they do it in really dumb ways, and you could make it better.

[1298] We have loops in our computer language for a reason.

[1299] the code would get way, way longer if you weren't allowed to use them.

[1300] It's more efficient to have the loops.

[1301] And in order to have self -reflection, whether it's conscious or not, even an operating system knows things about itself, right?

[1302] You need to have loops already.

[1303] So I think, and I'm waving my hands a lot here, but I suspect that the most efficient way of implementing a given level of intelligence has loops in it, self-reflection, and will be conscious.

[1304] Isn't that great news?

[1305] Yes, if it's true, it's wonderful, because then we don't have to fear the ultimate zombie apocalypse.

[1306] And I think if you look at our brains, actually, our brains are part zombie and part conscious.

[1307] When I open my eyes, I immediately take in all

[1308] these pixels that hit on my retina, right?

[1309] And like, oh, that's Lex.

[1310] But I have no freaking clue of how I did that computation.

[1311] It's actually quite complicated, right?

[1312] It was only relatively recently we could even do it well with machines, right?

[1313] You get a bunch of information processing happening in my retina, and then it goes to the lateral geniculate nucleus in my thalamus, and then to areas V1, V2, V4, and the fusiform face area here that Nancy Kanwisher at MIT invented, and blah, blah, blah.

[1314] And I have no freaking clue how that works.

[1315] And I have no freaking clue how that worked, right?

[1316] It feels to me, subjectively, like my conscious module just got a little email.

[1317] It's like facial processing, task complete.

[1318] It's Lex.

[1319] Yeah.

[1320] And I'm going to go with that, right?

[1321] So this fits perfectly with Tononi's model, because this was all one-way information processing, mainly.

[1322] And it turned out, for that particular task, that's all you needed, and it probably was kind of the most efficient way to do it. But there are a lot of other things that we associate with higher intelligence, and planning and so on and so forth, where you kind of want to have loops and be able to ruminate and self-reflect and introspect and so on. My hunch is that if you want to fake that with a zombie system where everything just goes one way, you have to unroll those loops, and it gets really, really long, and it's much more inefficient.
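
A back-of-the-envelope sketch of that unrolling point: to imitate T steps of a loopy, self-reflective computation with a strictly one-way system, you can end up stamping out separate machinery for every step, so the cost grows with how much reflection you want to fake. The numbers below are arbitrary, and this is only a parameter count under that assumption, not a claim about any real model.

```python
# Comparing a reused recurrent update against a fully unrolled, one-way emulation.

def recurrent_params(n_hidden):
    # One recurrent weight matrix, reused on every step no matter how long we ruminate.
    return n_hidden * n_hidden

def unrolled_params(n_hidden, steps):
    # A strictly feed-forward emulation with untied weights needs its own layer per step.
    return steps * n_hidden * n_hidden

n = 512
for steps in (1, 10, 100, 1000):
    print(f"{steps:5d} steps of reflection: "
          f"recurrent {recurrent_params(n):>12,d} params, "
          f"unrolled {unrolled_params(n, steps):>15,d} params")
```
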

[1323] So I'm actually hopeful that in the future, when we have all these very sublime and interesting AI machines that do cool things and are aligned with us, they will also have consciousness for the kind of things that we do.

[1324] That great intelligence is also correlated with great consciousness, or a deep kind of consciousness.

[1325] Yes.

[1326] So that's a happy thought for me, because the zombie apocalypse really is my worst nightmare of all.

[1327] It would be like adding insult to injury, not only did we get replaced, but we freaking replaced ourselves by zombies.

[1328] Like, how dumb can we be?

[1329] That's such a beautiful vision, and that's actually a provable one.

[1330] That's one that we humans can intuit and maybe prove, that those two things are correlated, as we start to understand what it means to be intelligent and what it means to be conscious, which these early AGI-like systems will help us understand.

[1331] And I just want to say one more thing, which is super important.

[1332] Most of my colleagues, when I started going on about consciousness, told me that it's all bullshit and I should stop talking about it.

[1333] I hear a little inner voice from my father and from my mom saying, keep talking about it because I think they're wrong.

[1334] And the main way to convince people like that that they're wrong, if they say that consciousness is just equal to intelligence, is to ask them, what's wrong with torture?

[1335] Why are you against torture?

[1336] If it's just about, you know, these particles are moving this way rather than that way, and there is no such thing as subjective experience, what's wrong with torture?

[1337] I mean, do you have a good comeback to that?

[1338] No, it seems like suffering imposed onto other humans is somehow deeply wrong in a way that intelligence doesn't quite explain.

[1339] And if someone tells me, well, you know, it's just an illusion, consciousness, whatever, you know, I would like to invite them to the next time they're having surgery to do it without anesthesia.

[1340] Like, what is anesthesia really doing?

[1341] You can have local anesthesia when you're awake.

[1342] I had that when they fixed my shoulder.

[1343] It was super entertaining.

[1344] What was it that it did?

[1345] It just removed my subjective experience of pain.

[1346] It didn't change anything about what was actually happening in my shoulder, right?

[1347] So if someone says that's all bullshit, skip the anesthesia, that's my advice.

[1348] This is incredibly central.

[1349] It could be fundamental to whatever this thing we have going on here.

[1350] It is fundamental, because what we feel, what is so fundamental, is suffering and joy and pleasure and meaning.

[1351] And those are all subjective experiences. Those are the elephant in the room. That's what makes life worth living, and that's what can make it horrible if it's just suffering. So let's not make the mistake of saying that that's all bullshit, and let's not make the mistake of not instilling in the AI systems that same thing that makes us special. Yeah. Max, it's a huge honor that you sat down with me the first time, on the first episode of this podcast. It's a huge honor that you sit down with me again and talk about this, what I think is the most important topic, the most important problem that we humans have to face and hopefully solve. Yeah, well, the honor is all mine, and I'm so grateful to you for making more people aware of the fact that humanity has reached the most important fork in the road ever in its history, and let's turn in the correct direction.

[1352] Thanks for listening to this conversation with Max Tegmark.

[1353] To support this podcast, please check out our sponsors in the description.

[1354] And now, let me leave you with some words from Frank Herbert.

[1355] History is a constant race between invention and catastrophe.

[1356] Thank you for listening and hope to see you next time.