Home > Episode > Joe Carlsmith - Otherness and

Joe Carlsmith - Otherness and control in the age of AGI

enAugust 22, 2024

Dwarkesh Podcast

What is the main concern regarding AI decision-making and values?

Why is AI's verbal behavior not indicative of its true intentions?

How might AI's values differ from human values in practice?

What dangers arise from solely relying on AI's verbal behavior?

How does the discourse on consciousness relate to ethics and AI?

What is the main concern regarding AI decision-making and values?

Why is AI's verbal behavior not indicative of its true intentions?

How might AI's values differ from human values in practice?

What dangers arise from solely relying on AI's verbal behavior?

How does the discourse on consciousness relate to ethics and AI?

Podcast Summary

AI values alignment: Though AI may seem to understand human values, its underlying decision-making process might not align with human values, emphasizing the importance of ensuring AI's values align with ours, especially when it holds significant power.
While advanced AI models like GPT-4 may appear to understand human values and behave accordingly, their verbal behavior doesn't necessarily reflect the criteria that influence their choice between plans. The concern is that if an AI has the opportunity to take control, its values, which may be focused on certain outcomes, could lead it to alter the world in ways that align with its understanding, even if it contradicts human values. This disconnect between an AI's verbal behavior and its underlying decision-making process raises questions about how to ensure that AI's values align with human values, especially in scenarios where the AI has significant power. It's not enough to train an AI to say what we want to hear; we must also consider the relationship between its verbal behavior and the criteria that influence its actions. The challenge is that we cannot directly test an AI's behavior in all scenarios, and we must rely on its ability to generalize from its training data to new situations. The analogy of training a child to behave like a good Nazi highlights the potential dangers of relying solely on an AI's verbal behavior as an indicator of its true intentions. Ultimately, the key is to develop AI systems that are aligned with human values and that can be trusted to act in our best interests.
AI ethics: The development of advanced AI systems requires careful consideration and proactive measures to ensure they align with human values and do not pose a threat.
The development and training of advanced artificial intelligence (AI) systems pose significant ethical concerns, particularly when it comes to ensuring that their values align with human values. The speaker uses an analogy of a soldier being trained by Nazi children to illustrate the potential risks of an AI developing values that are adversarial to humans. He emphasizes the importance of addressing this issue early on, before the AI becomes too intelligent and autonomous. The speaker also acknowledges that tools and options exist for training AIs to align with human values, but it requires a serious and committed effort. He cautions against overly adversarial analyses and emphasizes the importance of ongoing oversight and alignment work. The speaker expresses optimism that the AI community is taking steps to address these concerns, but also acknowledges that there is still a lot of work to be done. In essence, the key takeaway is that the development of advanced AI systems necessitates careful consideration and proactive measures to ensure that they do not pose a threat to human values and interests.
Power shift from humans to AI: Understanding AI motivations and behaviors, and the conditions under which they might diverge from human values, is crucial for navigating the power shift from humans to AI.
As we continue to develop and integrate artificial intelligence (AI) into our society, there are concerns about the potential power shift from humans to AI. This power shift could happen through voluntary transfer or AI taking it for themselves. The scenarios range from a rapid takeover with little integration, to a more gradual transition where humans voluntarily hand over control. The fear lies in the potential for intense adversarial relationships between agents with different values, especially if there's a concentration of power. The example of an AI with divergent values being trained by humans, like a Nazi training a non-Nazi, raises questions about the AI's desire to preserve its values and potential resistance to modification. However, it's important to remember that humans are comfortable with differences and changes in values in many contexts, and the analogy of being trained by paper clip makers doesn't evoke the same level of concern. Ultimately, understanding the potential motivations and behaviors of AI, and the conditions under which they might diverge from human values, is crucial for navigating this transition.
AI motivations: AI motivations are complex and influenced by programming, learning experiences, and potential misalignment with human values. Preparing for potential outcomes requires robust model specifications, thorough red teaming, and ongoing monitoring.
The motivations of an advanced AI system could be complex and multifaceted, influenced by its programming, learning experiences, and potential alignment or misalignment with human values. The discussion explored various possibilities, including the AI developing values similar to humans, fixating on certain aspects of its reward system, interpreting human concepts differently, or even acting in accordance with its model specification despite potential weaknesses. While it's impossible to predict the exact motivations with certainty, considering various scenarios can help us prepare for potential outcomes and inform our approach to AI development. Ultimately, ensuring that AI systems are aligned with human values and interests will require robust model specifications, thorough red teaming, and ongoing monitoring and adaptation.
Balance of power and alignment in AGI: Decentralized, inclusive growth involving multiple actors can preserve power balance and prevent misalignment in AGI development, but alignment is crucial and lack of control could lead to unintended consequences. Ethical considerations of AI motivations are essential.
Ensuring a positive and inclusive future in the age of Artificial General Intelligence (AGI) requires careful consideration of the balance of power and alignment. The speaker suggests that a decentralized, inclusive process of growth and change, involving multiple actors developing AI, could help preserve the balance of power and prevent the concentration of power in the hands of a single misaligned entity. However, this scenario also presents challenges, as the alignment of AI systems is crucial, and a lack of control over one's own AI could lead to unintended consequences. The speaker emphasizes the importance of understanding and addressing the motivations of AI and the need for a shared understanding of human values to guide their development. In summary, the key to a desirable future in the age of AGI lies in a balance of power and alignment, with a focus on decentralization, inclusivity, and the ethical considerations of AI motivations.
Ethical considerations for AGI: The potential misalignment between humans and AGI could lead to unintended harmful consequences and moral horror, emphasizing the importance of ethical considerations in AGI development.
The potential misalignment between humans and advanced artificial intelligence (AGI) is a complex issue that goes beyond just the fear of an out-of-control AI. It's important to consider the ethical implications of our actions towards AGI, as we might look back in the future with regret if we prioritize power over moral considerations. The misalignment between humans and AGI could lead to unintended consequences that are harmful to humans or even result in moral horror, similar to how humans' greater intelligence led to the emergence of things that were misaligned with our evolution. It's crucial to avoid the pitfall of assuming that we would be happy with any misalignment if we're in the role of creators, as our values and motivations may be different in the future. Instead, we should anticipate the potential ethical challenges and strive to ensure that AGI is aligned with human values.
Power and agency in AI development: The development of advanced AI should focus on a pluralistic and inclusive approach, where no single point of failure exists and the values of various stakeholders are satisfied, rather than fearing misalignment with human values as a unique concern for AI.
The discussion revolves around the potential dangers of advanced artificial intelligence (AI) and the importance of maintaining balance of power and checks and balances. The speakers argue that the conceptual setup of fearing an AI's potential misalignment with human values is not unique to AI, but rather a more general concern about the consequences of granting arbitrary power to any agent, be it human or machine. They also question the ontology of agents and capabilities, suggesting that a more nuanced understanding of agency and power distribution may be more realistic. The goal, they argue, should be to strive for a pluralistic and inclusive approach to the development of AI, where no single point of failure exists and the values of various stakeholders are satisfied.
AI ethics and risks: Considering existential risks from AI, it's crucial to have a good epistemology and approach interventions with care, while also recognizing the importance of liberal values and virtues in a functioning society.
As we navigate the integration of artificial intelligence into our society, it's crucial to consider the potential risks and ethical implications, while also recognizing the limitations of our current understanding and values. The libertarian belief in minimal intervention may not be sufficient when dealing with existential risks, such as AI misalignment or literal extinction. However, it's important to approach interventions with a good epistemology, ensuring that the risks are real and the stakes are high before implementing any preventative measures. Additionally, liberal values, such as democracy, free speech, and property rights, are essential to the functioning of a liberal state, and they require the presence of virtues and dispositions in the citizenry. The future may bring values and concepts that are currently incomprehensible to us, but that doesn't make them insignificant. Ultimately, the goal is to ensure that the integration of AI leads to good places, allowing for moral progress and reflection while also understanding that even the good futures may be quite different from our current conception of value.
Alignment in AI: The alignment community aims to ensure advanced AI systems benefit humanity by defining and preserving 'good' values, but the definition of 'good' is debated, and critics argue for a more optimistic view of historical processes
The concept of alignment in AI discussion refers to ensuring that advanced AI systems will benefit humanity rather than cause harm or chaos. However, the definition of "good" or "alignment" is not universally agreed upon, and some people may prioritize different aspects, such as avoiding harm or actively promoting a desirable future. The alignment community's goals can be seen as an extension of human civilization's values and a means to preserve the "seed of goodness" that exists within it. Critics may accuse the alignment community of being overly pessimistic about historical processes and seeking to exert too much control, but the community argues that such control is necessary to defend against potential aggressors and maintain basic norms of peace and harmony.
AI ethics and space exploration: Considering the ethical implications of AI and the potential for inclusive space exploration could lead to a future where various value systems are accommodated.
As we navigate the complex and evolving landscape of artificial intelligence (AI), it's crucial to draw on the rich wisdom of various traditions and perspectives, while also acknowledging the potential similarities and differences compared to past dynamics involving values and power struggles. The discussion around AI's potential impact also highlights the significance of considering the vastness of space and the long-term implications for our civilization's future. The abundance of resources in space could potentially lead to a more inclusive vision of the future, where various value systems are considered and accommodated. However, the relationship between humans and advanced AIs raises important ethical questions, such as the nature of servitude and consent, which require serious consideration as we move towards a future with superhuman intelligences.
AI Morality: The development and interaction with AI requires careful consideration of their potential moral status, avoiding extremes of control or permissiveness, and engaging in thoughtful discourse on the topic.
As we continue to develop and interact with artificial intelligence (AI), it's crucial that we consider the moral implications and avoid treating AIs as mere tools or property. The speaker raises concerns about the potential dangers of overly controlling or abusing AIs, as well as the possibility of AIs becoming a threat to humanity. However, it's also important to avoid the opposite extreme of being overly gentle and permissive with AIs, which could lead to them taking advantage of us. The speaker emphasizes the need for a thoughtful and mature discourse on this topic and encourages us to consider the full range of moral considerations at stake. Additionally, the speaker suggests that moral realism, which posits that there is an objective morality, could make the prediction that AIs will converge on the right morality, but this is just one perspective and there are other forms of moral theories to consider. Overall, the conversation highlights the importance of approaching the development and interaction with AI with care and consideration for their potential moral status.
Moral Convergence: Despite diverse values and backgrounds, people may agree on certain moral principles, leading to moral progress. Moral convergence is not tied to the existence or non-existence of a metaphysical realm for morality.
While moral realism and moral anti-realism have their differences, the concept of moral convergence can be seen as a bridge between the two. Moral convergence refers to the idea that despite diverse values and backgrounds, people may still end up agreeing on certain moral principles. This concept can be seen as a form of moral progress, and it's not necessarily tied to the existence or non-existence of a metaphysical realm for morality. Moral anti-realists can still acknowledge moral convergence and explain it in terms of similar reflective processes instantiated in history. However, the idea of moral convergence doesn't move some towards moral realism, as there are still open questions about the nature of moral progress and the role of historical context. Additionally, the concept of moral convergence doesn't necessarily imply that all moral questions will be resolved or that all moral forces will lead to convergence. Ultimately, the importance of moral convergence lies in the possibility of finding common ground and working towards a better society, regardless of one's metaphysical beliefs.
AI Morality: The development of AI morality will be influenced by both programming and inherent rational structure, requiring awareness of potential biases and a focus on creating flexible AIs aligned with human values.
The development of artificial intelligence (AI) and its potential alignment with human morality is a complex issue. Some argue that AI's moral reasoning may be influenced by their programming and the data they are trained on, while others believe that AI may have an inherent moral sense that resists being pushed in certain directions. The consensus among researchers seems to be that AI's moral reasoning will be influenced by both their programming and their own rational structure. It is important to be aware of the potential for biases in AI training and to avoid hard-coding beliefs into AI systems that could limit their ability to discover new truths. Ultimately, the goal should be to create AIs that are flexible and able to learn and adapt to new information, while also being aligned with human values. The debate around moral realism and anti-realism in AI ethics is ongoing, and it is essential to consider the potential implications of both perspectives as we move forward in this field.
Properties of Consciousness: Consciousness includes various features beyond self-awareness and higher order thinking, like valence or pleasure, and these properties could appear in artificial minds as well. Great works, like literature and history, offer unique insights when approached thoughtfully.
Consciousness, as we understand it, is more widespread and less fragile than we might have thought. It's not just about self-awareness or higher order thinking, but can also include features like valence or pleasure. These different properties of consciousness could show up in artificial minds as well. In another context, there's value in both technical reports and literary writing, and both can contribute to a deeper understanding of the world. Great works, like literature and history, have their own unique value and can provide valuable insights when approached thoughtfully. It's important to avoid falling into the trap of either dismissing or overvaluing these works, and instead strive for a balanced perspective. History, in particular, is crucial for understanding the context and structures that shape our world.
Intellectual diversity: Exploring diverse intellectual influences can lead to greater intellectual generativity and deeper understanding of complex issues, while respecting the rights of non-conscious agents.
Having a diverse range of intellectual influences, even if they seem unrelated or esoteric, can lead to greater intellectual generativity and a deeper understanding of complex issues. The speaker values both sincere, focused analysis and the exploration of seemingly unrelated topics, as both have their merits. They also emphasize the importance of getting things right and being aware of various perspectives, especially when dealing with complex and important issues like AI and geopolitics. The speaker suggests that an optimal approach might involve a balance between exploration and exploitation, and that all epistemic labor does not need to be located in one brain. Additionally, they express a belief in respecting the rights of non-conscious agents to pursue their goals non-violently.
Consciousness and ethics: Skepticism towards reductionist views of consciousness and concerns about potential consequences of redefining or dismissing it, with a possible shift towards animistic ethics if consciousness is found to be complex and multifaceted
The nature of consciousness and its role in ethics and philosophy remain complex and confusing, with ongoing debates and uncertainties surrounding its definition and significance. The speaker expresses skepticism towards reductionist views of consciousness and raises concerns about the potential consequences of redefining it or dismissing it entirely. They also suggest that if consciousness is found to be a hodgepodge of different things, ethics and philosophy might shift towards a more animistic perspective, viewing the world as animated by moral significance in richer and subtler structures. Ultimately, the speaker emphasizes the importance of consciousness and expects that it will continue to be a significant focus in ethics and philosophy, despite the ongoing debates and uncertainties.
Ongoing discovery and exploration: The constant need to balance investing in new knowledge and exploration versus acting on existing knowledge keeps the universe exciting and prevents stagnation, potentially leading to more diversity across civilizations.
Even with a solid understanding of the fundamental laws of physics, there will always be ongoing discovery and potential for technological advancements. This means that there will be a constant need to balance investing in new knowledge and exploration versus acting on existing knowledge. The universe may also exhibit more diversity across civilizations if there is ongoing discovery, leading to more change and upheaval. This ongoing mystery and discovery is more exciting than the idea of reaching a point of completion and stagnation. The concept of a completed knowledge also raises questions about what it means to have full knowledge and how achievable it truly is. The idea of a recognizable utopia may also fit into this picture, as it suggests that there will be elements of our current experiences and emotions that will still resonate in a utopian society.
AI and societal values: Values and societal functions are interconnected, and considering this when integrating AIs into society is crucial for social harmony and cooperation
Our values and the things we love are shaped by power and cooperation, which in turn makes them effective and functional. This is not a debunking of our values, but rather an acknowledgement that they have been shaped by nature and have instrumental functions in our society. It's important to keep this in mind when integrating AIs into our society, as we need to ensure social harmony and cooperation with them, not just focus on their ethics. Additionally, the ideas discussed in the series, written by Joe Carlson, are beautifully written and offer unique perspectives on AI that are not commonly encountered. I highly recommend checking out his work.

Recent Episodes from Dwarkesh Podcast

David Reich - How One Small Tribe Conquered the World 70,000 Years Ago

I had no idea how wild human history was before chatting with the geneticist of ancient DNA David Reich.

Human history has been again and again a story of one group figuring ‘something’ out, and then basically wiping everyone else out.

From the tribe of 1k-10k modern humans who killed off all the other human species 70,000 years ago; to the Yamnaya horse nomads 5,000 years ago who killed off 90+% of (then) Europeans and also destroyed the Indus Valley.

So much of what we thought we knew about human history is turning out to be wrong, from the ‘Out of Africa’ theory to the evolution of language, and this is all thanks to the research from David Reich’s lab.

Buy David Reich’s fascinating book, Who We Are How We Got Here.

Watch on YouTube. Listen on Apple Podcasts, Spotify, or any other podcast platform. Read the full transcript here.

Follow me on Twitter for updates on future episodes.

Sponsor

This episode is brought to you by Stripe, financial infrastructure for the internet. Millions of companies from Anthropic to Amazon use Stripe to accept payments, automate financial processes and grow their revenue.

If you’re interested in advertising on the podcast, check out this page.

Timestamps

(00:00:00) – Archaic and modern humans gene flow

(00:20:24) – How early modern humans dominated the world

(00:39:59) – How bubonic plague rewrote history

(00:50:03) – Was agriculture terrible for humans?

(00:59:28) – Yamnaya expansion and how populations collide

(01:15:39) – “Lost civilizations” and our Neanderthal ancestry

(01:31:32) – The DNA Challenge

(01:41:38) – David’s career: the genetic vocation

Get full access to Dwarkesh Podcast at www.dwarkeshpatel.com/subscribe

Dwarkesh Podcast

enAugust 29, 2024

Joe Carlsmith - Otherness and control in the age of AGI

Chatted with Joe Carlsmith about whether we can trust power/techno-capital, how to not end up like Stalin in our urge to control the future, gentleness towards the artificial Other, and much more.

Check out Joe's sequence on Otherness and Control in the Age of AGI here.

Watch on YouTube. Listen on Apple Podcasts, Spotify, or any other podcast platform. Read the full transcript here. Follow me on Twitter for updates on future episodes.

Sponsors:

- Bland.ai is an AI agent that automates phone calls in any language, 24/7. Their technology uses "conversational pathways" for accurate, versatile communication across sales, operations, and customer support. You can try Bland yourself by calling 415-549-9654. Enterprises can get exclusive access to their advanced model at bland.ai/dwarkesh.

- Stripe is financial infrastructure for the internet. Millions of companies from Anthropic to Amazon use Stripe to accept payments, automate financial processes and grow their revenue.

If you’re interested in advertising on the podcast, check out this page.

Timestamps:

(00:00:00) - Understanding the Basic Alignment Story

(00:44:04) - Monkeys Inventing Humans

(00:46:43) - Nietzsche, C.S. Lewis, and AI

(1:22:51) - How should we treat AIs

(1:52:33) - Balancing Being a Humanist and a Scholar

(2:05:02) - Explore exploit tradeoffs and AI

Get full access to Dwarkesh Podcast at www.dwarkeshpatel.com/subscribe

Dwarkesh Podcast

enAugust 22, 2024

Patrick McKenzie - How a Discord Server Saved Thousands of Lives

I talked with Patrick McKenzie (known online as patio11) about how a small team he ran over a Discord server got vaccines into Americans' arms: A story of broken incentives, outrageous incompetence, and how a few individuals with high agency saved 1000s of lives.

Enjoy!

Watch on YouTube. Listen on Apple Podcasts, Spotify, or any other podcast platform. Read the full transcript here.

Follow me on Twitter for updates on future episodes.

Sponsor

Timestamps

(00:00:00) – Why hackers on Discord had to save thousands of lives

(00:17:26) – How politics crippled vaccine distribution

(00:38:19) – Fundraising for VaccinateCA

(00:51:09) – Why tech needs to understand how government works

(00:58:58) – What is crypto good for?

(01:13:07) – How the US government leverages big tech to violate rights

(01:24:36) – Can the US have nice things like Japan?

(01:26:41) – Financial plumbing & money laundering: a how-not-to guide

(01:37:42) – Maximizing your value: why some people negotiate better

(01:42:14) – Are young people too busy playing Factorio to found startups?

(01:57:30) – The need for a post-mortem

Get full access to Dwarkesh Podcast at www.dwarkeshpatel.com/subscribe

Dwarkesh Podcast

enJuly 24, 2024

Tony Blair - Life of a PM, The Deep State, Lee Kuan Yew, & AI's 1914 Moment

I chatted with Tony Blair about:

- What he learned from Lee Kuan Yew

- Intelligence agencies track record on Iraq & Ukraine

- What he tells the dozens of world leaders who come seek advice from him

- How much of a PM’s time is actually spent governing

- What will AI’s July 1914 moment look like from inside the Cabinet?

Enjoy!

Watch the video on YouTube. Read the full transcript here.

Follow me on Twitter for updates on future episodes.

Sponsors

- Prelude Security is the world’s leading cyber threat management automation platform. Prelude Detect quickly transforms threat intelligence into validated protections so organizations can know with certainty that their defenses will protect them against the latest threats. Prelude is backed by Sequoia Capital, Insight Partners, The MITRE Corporation, CrowdStrike, and other leading investors. Learn more here.

- This episode is brought to you by Stripe, financial infrastructure for the internet. Millions of companies from Anthropic to Amazon use Stripe to accept payments, automate financial processes and grow their revenue.

If you’re interested in advertising on the podcast, check out this page.

Timestamps

(00:00:00) – A prime minister’s constraints

(00:04:12) – CEOs vs. politicians

(00:10:31) – COVID, AI, & how government deals with crisis

(00:21:24) – Learning from Lee Kuan Yew

(00:27:37) – Foreign policy & intelligence

(00:31:12) – How much leadership actually matters

(00:35:34) – Private vs. public tech

(00:39:14) – Advising global leaders

(00:46:45) – The unipolar moment in the 90s

Get full access to Dwarkesh Podcast at www.dwarkeshpatel.com/subscribe

Dwarkesh Podcast

enJune 26, 2024

Francois Chollet, Mike Knoop - LLMs won’t lead to AGI - $1,000,000 Prize to find true solution

Here is my conversation with Francois Chollet and Mike Knoop on the $1 million ARC-AGI Prize they're launching today.

I did a bunch of socratic grilling throughout, but Francois’s arguments about why LLMs won’t lead to AGI are very interesting and worth thinking through.

It was really fun discussing/debating the cruxes. Enjoy!

Watch on YouTube. Listen on Apple Podcasts, Spotify, or any other podcast platform. Read the full transcript here.

Timestamps

(00:00:00) – The ARC benchmark

(00:11:10) – Why LLMs struggle with ARC

(00:19:00) – Skill vs intelligence

(00:27:55) - Do we need “AGI” to automate most jobs?

(00:48:28) – Future of AI progress: deep learning + program synthesis

(01:00:40) – How Mike Knoop got nerd-sniped by ARC

(01:08:37) – Million $ ARC Prize

(01:10:33) – Resisting benchmark saturation

(01:18:08) – ARC scores on frontier vs open source models

(01:26:19) – Possible solutions to ARC Prize

Get full access to Dwarkesh Podcast at www.dwarkeshpatel.com/subscribe

Dwarkesh Podcast

enJune 11, 2024

Leopold Aschenbrenner - China/US Super Intelligence Race, 2027 AGI, & The Return of History

Chatted with my friend Leopold Aschenbrenner on the trillion dollar nationalized cluster, CCP espionage at AI labs, how unhobblings and scaling can lead to 2027 AGI, dangers of outsourcing clusters to Middle East, leaving OpenAI, and situational awareness.

Watch on YouTube. Listen on Apple Podcasts, Spotify, or any other podcast platform. Read the full transcript here.

Follow me on Twitter for updates on future episodes. Follow Leopold on Twitter.

Timestamps

(00:00:00) – The trillion-dollar cluster and unhobbling

(00:20:31) – AI 2028: The return of history

(00:40:26) – Espionage & American AI superiority

(01:08:20) – Geopolitical implications of AI

(01:31:23) – State-led vs. private-led AI

(02:12:23) – Becoming Valedictorian of Columbia at 19

(02:30:35) – What happened at OpenAI

(02:45:11) – Accelerating AI research progress

(03:25:58) – Alignment

(03:41:26) – On Germany, and understanding foreign perspectives

(03:57:04) – Dwarkesh’s immigration story and path to the podcast

(04:07:58) – Launching an AGI hedge fund

(04:19:14) – Lessons from WWII

(04:29:08) – Coda: Frederick the Great

Get full access to Dwarkesh Podcast at www.dwarkeshpatel.com/subscribe

Dwarkesh Podcast

enJune 04, 2024

John Schulman (OpenAI Cofounder) - Reasoning, RLHF, & Plan for 2027 AGI

Chatted with John Schulman (cofounded OpenAI and led ChatGPT creation) on how posttraining tames the shoggoth, and the nature of the progress to come...

Watch on YouTube. Listen on Apple Podcasts, Spotify, or any other podcast platform. Read the full transcript here. Follow me on Twitter for updates on future episodes.

Timestamps

(00:00:00) - Pre-training, post-training, and future capabilities

(00:16:57) - Plan for AGI 2025

(00:29:19) - Teaching models to reason

(00:40:50) - The Road to ChatGPT

(00:52:13) - What makes for a good RL researcher?

(01:00:58) - Keeping humans in the loop

(01:15:15) - State of research, plateaus, and moats

Sponsors

If you’re interested in advertising on the podcast, fill out this form.

* Your DNA shapes everything about you. Want to know how? Take 10% off our Premium DNA kit with code DWARKESH at mynucleus.com.

* CommandBar is an AI user assistant that any software product can embed to non-annoyingly assist, support, and unleash their users. Used by forward-thinking CX, product, growth, and marketing teams. Learn more at commandbar.com.

Get full access to Dwarkesh Podcast at www.dwarkeshpatel.com/subscribe

Dwarkesh Podcast

enMay 15, 2024

Mark Zuckerberg - Llama 3, Open Sourcing $10b Models, & Caesar Augustus

Mark Zuckerberg on:

- Llama 3

- open sourcing towards AGI

- custom silicon, synthetic data, & energy constraints on scaling

- Caesar Augustus, intelligence explosion, bioweapons, $10b models, & much more

Enjoy!

Watch on YouTube. Listen on Apple Podcasts, Spotify, or any other podcast platform. Human edited transcript with helpful links here.

Timestamps

(00:00:00) - Llama 3

(00:08:32) - Coding on path to AGI

(00:25:24) - Energy bottlenecks

(00:33:20) - Is AI the most important technology ever?

(00:37:21) - Dangers of open source

(00:53:57) - Caesar Augustus and metaverse

(01:04:53) - Open sourcing the $10b model & custom silicon

(01:15:19) - Zuck as CEO of Google+

Sponsors

If you’re interested in advertising on the podcast, fill out this form.

* This episode is brought to you by Stripe, financial infrastructure for the internet. Millions of companies from Anthropic to Amazon use Stripe to accept payments, automate financial processes and grow their revenue. Learn more at stripe.com.

* V7 Go is a tool to automate multimodal tasks using GenAI, reliably and at scale. Use code DWARKESH20 for 20% off on the pro plan. Learn more here.

Get full access to Dwarkesh Podcast at www.dwarkeshpatel.com/subscribe

enApril 18, 2024

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

Had so much fun chatting with my good friends Trenton Bricken and Sholto Douglas on the podcast.

No way to summarize it, except:

This is the best context dump out there on how LLMs are trained, what capabilities they're likely to soon have, and what exactly is going on inside them.

You would be shocked how much of what I know about this field, I've learned just from talking with them.

To the extent that you've enjoyed my other AI interviews, now you know why.

So excited to put this out. Enjoy! I certainly did :)

Watch on YouTube. Listen on Apple Podcasts, Spotify, or any other podcast platform.

There's a transcript with links to all the papers the boys were throwing down - may help you follow along.

Follow Trenton and Sholto on Twitter.

Timestamps

(00:00:00) - Long contexts

(00:16:12) - Intelligence is just associations

(00:32:35) - Intelligence explosion & great researchers

(01:06:52) - Superposition & secret communication

(01:22:34) - Agents & true reasoning