    #173 - Gemini Pro, Llama 400B, Gen-3 Alpha, Moshi, Supreme Court

    July 07, 2024
    What are the features of Google's Gemini 1.5 models?
    How does Google's context caching feature improve efficiency?
    What challenge does the reward model face in reinforcement learning?
    What innovation is Tencent AI Lab introducing with personas?
    How are publishers responding to AI summarization tools?

    Podcast Summary

    • Generative AI advancements: Google's Gemini 1.5 Flash and Pro offer larger context windows and context caching, allowing for more powerful and efficient handling of larger inputs. Meta is releasing a 400 billion parameter model, further advancing the field.

      There have been recent advancements in generative AI models, specifically from Google with the public release of Gemini 1.5 Flash and Pro. These models offer larger context windows, up to 2 million tokens, making them more powerful and capable of handling larger inputs. Google's context caching feature is also a notable addition, allowing models to store and reuse information, resulting in cost savings and improved efficiency. The industry is shifting towards making these AI models more user-friendly and productized, with Google having an edge due to its enterprise offerings on its cloud platform. Additionally, Meta is about to release its biggest LLM yet, a 400 billion parameter model, which is expected to be a significant advancement in the field. These developments demonstrate the rapid progress being made in generative AI and its increasing importance in various industries.
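The savings from context caching can be illustrated with a toy cost model: the first request pays full price to ingest a long shared prefix, and later requests that reuse the same prefix pay a discounted per-token rate. Everything here is an illustrative assumption (the class, the cache-hit discount, the numbers), not Google's actual API or pricing.

```python
import hashlib

class ContextCache:
    """Toy cost model for context caching: the first request pays full
    price to ingest a long shared prefix; later requests reusing the
    same prefix pay a discounted per-token rate. The 75% discount is an
    assumption for illustration, not Google's actual pricing."""
    def __init__(self, cached_rate=0.25):
        self.cached_rate = cached_rate
        self._seen = set()

    def cost(self, prefix_tokens, prefix_text, query_tokens):
        key = hashlib.sha256(prefix_text.encode()).hexdigest()
        if key in self._seen:  # cache hit: discounted prefix tokens
            return prefix_tokens * self.cached_rate + query_tokens
        self._seen.add(key)    # cache miss: pay full price, store key
        return prefix_tokens + query_tokens

cache = ContextCache()
first = cache.cost(1_000_000, "big shared corpus", 50)
second = cache.cost(1_000_000, "big shared corpus", 50)
assert first == 1_000_050
assert second == 250_050.0  # 1_000_000 * 0.25 + 50
assert second < first
```

The point is simply that once a large prefix (a codebase, a document collection) is cached, repeated queries against it no longer pay the full ingestion cost.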

    • AI model security: Meta faces challenges ensuring its 400 billion parameter model can't be easily misused or jailbroken, while companies like Runway and Google explore business models to create a competitive edge in the generative AI market.

      The release of larger language models like Meta's 400 billion parameter model comes with significant challenges, particularly in ensuring the models can't be easily jailbroken or misused. Meta is reportedly considering releasing the model, but the company faces the daunting task of implementing robust safeguards that can withstand unknown jailbreaking techniques. Meanwhile, Runway has released a paid version of its Gen-3 Alpha AI video model, which offers text-to-video generation and plans to add image-to-video and video-to-video modes in the future. Google is also integrating AI features into its Pixel 9 smartphone, and audio generation firm ElevenLabs has announced a reader app with famous voices. The generative AI market is heating up, and companies are exploring various business models, including offering paid versions of their products, to create a competitive moat. The potential benefits of these tools are significant, but so are the risks, making the decisions around their release a complex issue.

    • Text-to-Speech Innovation, AI Search: Companies license the voices of deceased actors through their estates for text-to-speech features, Perplexity upgrades Pro Search for better complex query understanding, and addressing latency and context window effectiveness remains crucial for optimal user experiences, while clear marketing messaging is important to manage user expectations regarding AI capabilities.

      Companies are exploring ways to provide text-to-speech features without facing backlash by licensing recorded content from the estates of deceased actors and actresses. This approach, while not without controversy, allows for a more professional and engaging user experience. Perplexity, an AI-powered search engine, has upgraded its Pro Search feature, enabling it to better understand complex queries and provide richer, more detailed answers. This advancement, along with Perplexity's polished product and positioning, makes it a compelling alternative to legacy search engines. However, as these technologies continue to develop, addressing issues like inference latency and context window effectiveness will be crucial for delivering optimal user experiences. Additionally, it was revealed that Gemini's data analyzing abilities may not be as accurate as Google claims, highlighting the importance of clear marketing messaging and realistic expectations.

    • AI and copyright infringement: The use of AI in accessing and summarizing content from websites with paywalls raises copyright infringement concerns, with publishers considering blocking downloads of data to protect intellectual property. Meanwhile, companies like OpenAI pursue media partnerships to differentiate their AI tools, and China continues to be a significant player in the global AI competition.

      The use of AI in accessing and summarizing content from websites, particularly those with paywalls, is becoming a contentious issue. The case of Poe, a summarization bot, raises questions about copyright infringement and the effectiveness of the robots exclusion protocol. Publishers are now considering blocking downloads of their data to protect their intellectual property, while companies like OpenAI are pursuing media partnerships to differentiate their AI tools. In the hardware realm, Huawei and Wuhan Xinxin are reportedly collaborating to develop high bandwidth memory chips in the face of US restrictions, highlighting the importance of China in the global AI competition. Additionally, Alibaba's large language model has entered the top ranks on the developer platform Hugging Face, indicating growing competition in the AI model space between the US and China. These developments underscore the complex and evolving landscape of AI technology and its implications for copyright law, data ownership, and international relations.

    • AI competition and collaboration: Chinese companies focus on open source models for AI research due to limited resources, while Meta pushes wearable AI technology boundaries with its Ray-Bans and Apple collaborates with OpenAI.

      The race for AI dominance continues, with Chinese companies focusing on open source models due to limited access to advanced GPUs and potential geopolitical leverage. Meanwhile, Meta is making strides in wearable AI with its Ray-Bans, which offer video recording and AI integration, a wearable paradigm that people may actually want, unlike previous failures in this space. A deeper partnership between Apple and OpenAI is also developing, with Phil Schiller reportedly joining OpenAI's board as an observer, underscoring the importance of collaboration between tech giants in the rapidly evolving AI landscape. Overall, the AI industry is witnessing significant advancements and strategic partnerships, with potential implications for geopolitical dynamics and consumer technology.

    • Microsoft-OpenAI relationship, Regulation: Microsoft's exclusivity agreement with OpenAI for GPT technologies may not last, and regulatory bodies play a crucial role in shaping technology development, particularly regarding AGI and potential misuse, while third-party model evaluations help ensure responsible use and promote AI safety and governance.

      The relationship between Microsoft and OpenAI, as well as the role of regulatory bodies in shaping technology development, continues to be a complex and evolving issue. The exclusivity agreement between Microsoft and OpenAI for GPT technologies may not be holding up, and the determination of when OpenAI achieves Artificial General Intelligence (AGI) is crucial, as it will impact Microsoft's access to the technology. Additionally, the evaluation of AI models and the development of third-party model evaluations are essential for ensuring responsible use and preventing potential misuse, such as identity theft and disinformation. Companies like Runway are raising significant funds to advance AI technology, particularly in the video domain, and new benchmarks are highlighting the gap between human and AI performance. Anthropic's push for third-party model evaluations is a response to the need for independent oversight and a part of the ongoing conversation about AI safety and governance.

    • AI safety regulations, misalignment risk: Anthropic advocates for government-mandated audits and certifications for AI models to address various risks, including misalignment risk, and Mozilla introduces llamafiles for easier deployment of models, while researchers explore reducing memory usage and increasing throughput in large language models.

      Anthropic, a leading AI safety research company, is pushing for government-mandated audits and certifications for AI models in both large and small companies. This initiative aims to address various risks, including cyber attacks, chemical, bio, radiological, and nuclear risks, autonomy, social manipulation, and misalignment risk. Anthropic's clear focus on misalignment risk as a separate category is noteworthy. Mozilla, a significant player in the open source space, has introduced llamafiles, which package together the weights of an AI model with the software needed to run it, making it easier to deploy models on various platforms and devices. Researchers are also working on eliminating matrix multiplication in large language models (LLMs) by using ternary values and addition instead, which could lead to reduced memory usage and increased throughput. While these advancements show promise, it's important to note that the research is still in its early stages and more testing is needed to confirm the benefits at larger scales.
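The matmul-free idea can be shown in miniature: when weights are restricted to ternary values {-1, 0, +1}, a matrix-vector product reduces to additions and subtractions. This toy sketch (not the paper's actual kernels) checks that the addition-only version matches an ordinary dense product.

```python
import math
import random

def ternary_matvec(W, x):
    """Matrix-vector product where W has entries in {-1, 0, +1}:
    every output element needs only additions and subtractions."""
    out = []
    for row in W:
        acc = 0.0
        for w, xi in zip(row, x):
            if w == 1:
                acc += xi
            elif w == -1:
                acc -= xi
        out.append(acc)
    return out

def dense_matvec(W, x):
    """Ordinary multiply-accumulate reference implementation."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

random.seed(0)
W = [[random.choice((-1, 0, 1)) for _ in range(8)] for _ in range(4)]
x = [random.uniform(-1.0, 1.0) for _ in range(8)]
result = ternary_matvec(W, x)
expected = dense_matvec(W, x)
assert all(math.isclose(a, b) for a, b in zip(result, expected))
```

In hardware terms this is why the approach promises lower memory use and higher throughput: ternary weights need under two bits each, and adders are far cheaper than multipliers.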

    • Simplifying complex architectures: Recent research challenges the assumption that complex architectures are always superior, showing that simple strategies like repeatedly calling a model or increasing its temperature can perform comparably on HumanEval coding tasks and often come with lower costs. Weight averaged rewarded policies (WARP) are also proposed as an alternative approach in reinforcement learning.

      Recent research challenges the assumption that complex architectures are always superior to simpler ones in the field of machine learning, specifically in the context of language models and agent architectures. One paper on agent evaluation demonstrates that simple baselines, such as repeatedly calling a model or increasing its sampling temperature, can perform comparably to more complex agent architectures on HumanEval coding tasks. Furthermore, these simple strategies often come with lower costs. The researchers also criticize current evaluation practices for agents, emphasizing the importance of reporting both accuracy and cost. Another paper, "WARP: On the Benefits of Weight Averaged Rewarded Policies," focuses on reinforcement learning and suggests an alternative to relying solely on KL regularization to prevent the model from forgetting pre-trained knowledge during training. It proposes merging the weights of multiple rewarded policies, allowing for better optimization in the RL stage while retaining more pre-trained knowledge. These papers serve as reminders that it's crucial to question assumptions and explore alternative approaches in the ever-evolving field of machine learning.
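The simple baseline discussed here, repeatedly sampling a model and taking a plurality vote, fits in a few lines. In this sketch the "model" is a stub that answers correctly 60% of the time; the point is that accuracy rises with extra calls while cost grows linearly with k. All names and numbers are illustrative, not from the paper.

```python
import random
from collections import Counter

def majority_vote(sample_fn, k=5):
    """Call the model k times and return the plurality answer.
    Cost grows linearly with k, so report it alongside the answer."""
    answers = [sample_fn() for _ in range(k)]
    best, _ = Counter(answers).most_common(1)[0]
    return best, k  # answer plus number of model calls (a cost proxy)

# Stub model: correct 60% of the time, otherwise picks a wrong
# answer at random. Purely illustrative.
rng = random.Random(0)
def noisy_model():
    return "42" if rng.random() < 0.6 else rng.choice(["41", "43"])

single_call_acc = 0.6  # by construction
votes = [majority_vote(noisy_model, k=9)[0] for _ in range(200)]
accuracy = votes.count("42") / len(votes)
assert accuracy > single_call_acc  # voting beats a single call
```

Because wrong answers rarely agree with each other, the plurality vote is right far more often than any single call, which is exactly why such baselines can rival elaborate agent scaffolding once cost is held fixed.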

    • Perverse optimizations in reinforcement learning: To prevent perverse optimizations in reinforcement learning from human feedback, multiple copies of the language model are trained independently against the reward model under a Kullback-Leibler (KL) anchor, then their weights are merged to create a more aligned model. Personas are also used to generate diverse synthetic data and elicit unique outputs from the model.

      In reinforcement learning from human feedback, training a language model against a learned reward model can lead to perverse optimizations, where the model finds ways to exploit the reward model rather than actually satisfying human preferences. To prevent this, researchers use an anchored Kullback-Leibler (KL) divergence penalty to ensure the model stays close to its original behavior. The procedure trains multiple copies of the language model independently against the reward model under the KL anchor, then merges their weights to create a more aligned model, and this process is repeated to gradually improve alignment. Another interesting paper discusses the challenge of generating synthetic data for AI models. The solution proposed is the use of personas, which are descriptions of different types of people. By tailoring prompts to these personas, the model provides unique outputs, eliciting a broader range of information from the model. The paper, by Tencent AI Lab Seattle, introduces a text-to-persona strategy to generate personas and a persona-to-persona strategy to derive additional personas based on interpersonal relationships. The authors have released over 200,000 personas and are open to releasing more, acknowledging the potential risks and concerns. Overall, these papers highlight the importance of understanding human feedback and generating diverse synthetic data to improve AI models.
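The merging step described above, combining independently trained copies of a model, can be sketched as a uniform average over parameter dictionaries. WARP itself uses more careful interpolation schemes (e.g. spherical interpolation of task vectors), so treat this purely as a minimal illustration of weight merging, with made-up toy parameters.

```python
def merge_policies(state_dicts):
    """Uniform linear average of parameters from independently
    trained runs. WARP uses more careful interpolation; this is
    only a minimal illustration of weight merging."""
    n = len(state_dicts)
    return {k: sum(sd[k] for sd in state_dicts) / n
            for k in state_dicts[0]}

# Two toy "policies" that drifted in different directions during RL.
runs = [
    {"w": 1.0, "b": 0.0},
    {"w": 3.0, "b": 1.0},
]
merged = merge_policies(runs)
assert merged == {"w": 2.0, "b": 0.5}
```

The intuition is that each run overfits the reward model in a different direction, so averaging keeps the shared, genuinely useful improvements while washing out run-specific reward hacking.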

    • AI regulation: The Supreme Court's decision to strike down Chevron deference may lead to more clear and detailed legislation from Congress regarding AI, but the technical nuances involved could present a significant challenge for implementation.

      The use of personas to generate synthetic data at scale can significantly improve the performance of large language models, as demonstrated by a recent study in which a 7 billion parameter model, Qwen2-7B, fine-tuned on persona-generated data surpassed Gemini Ultra on a math benchmark. Additionally, calibrating positional attention bias can help LLMs make better use of long contexts. However, the regulatory landscape for AI is shifting with the Supreme Court's decision to strike down Chevron deference, which means that courts will now have to interpret ambiguous laws related to AI regulation, potentially leading to a need for more clear and detailed legislation from Congress. This change could present a significant challenge for the implementation of AI legislation due to the technical nuances involved.

    • US-China tech competition: The Supreme Court's curbing of agency authority could hinder the US response to emerging technologies like AI, while strained US export control enforcement against Chinese tech companies leads to longer delays and fewer exports, potentially giving China an edge in the market.

      The Supreme Court's decision limiting agencies' authority to interpret ambiguous statutes could significantly hinder the country's agility in responding to emerging technologies like AI. This comes as the US and China continue to engage in export control measures, with the US relying on manual processes to oversee restrictions on Chinese tech companies. The manual processes at the Bureau of Industry and Security (BIS) have struggled to keep up with the increasing number of Chinese entities on their list, leading to longer delays and, by default, fewer exports. This situation could give China an edge in the market, as US companies face more obstacles in selling their products there. Additionally, Nvidia's H20 GPUs, the less powerful chips designed for the Chinese market, are still seeing significant sales due to the lagging effect of US export controls. Overall, these developments highlight the complex and evolving nature of the US-China tech competition and the importance of adaptability in navigating it.

    • Semiconductor, Finance: The semiconductor industry faces worker shortages due to growth, while finance adopts machine learning for investment decisions, highlighting the need to stay informed about technological advancements and their industry impacts.

      Both the semiconductor industry and the financial sector are experiencing significant changes driven by technological advancements. In the semiconductor industry, the US government is investing in workforce development programs to address projected worker shortages due to the industry's growth. Meanwhile, in finance, a billion-dollar fund run by Bridgewater Associates will use machine learning for decision making, potentially disrupting traditional investment strategies. These developments underscore the importance of staying informed about technological advancements and their potential impacts on various industries.

    Recent Episodes from Last Week in AI

    #181 - Google Chatbots, Cerebras vs Nvidia, AI Doom, ElevenLabs Controversy

    Our 181st episode with a summary and discussion of last week's big AI news!

    With hosts Andrey Kurenkov and Jeremie Harris

    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

    If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form.

    Email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai

    In this episode:

    - Google's AI advancements with Gemini 1.5 models and AI-generated avatars, along with Samsung's lithography progress.

    - Microsoft's Inflection usage caps for Pi, new AI inference services by Cerebras Systems competing with Nvidia.

    - Biases in AI, prompt leak attacks, and transparency in models and distributed training optimizations, including the 'DisTrO' optimizer.

    - AI regulation discussions including California's SB1047, China's AI safety stance, and new export restrictions impacting Nvidia's AI chips.

    Timestamps + Links:

    Last Week in AI
    September 15, 2024

    #180 - Ideogram v2, Imagen 3, AI in 2030, Agent Q, SB 1047

    Our 180th episode with a summary and discussion of last week's big AI news!

    With hosts Andrey Kurenkov (https://twitter.com/andrey_kurenkov) and Jeremie Harris (https://twitter.com/jeremiecharris)

    If you would like to get a sneak peek and help test Andrey's generative AI application, go to Astrocade.com to join the waitlist and the discord.

    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

    If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form.

    Email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai

    Episode Highlights:

    • Ideogram AI's new features, Google's Imagen 3, Dream Machine 1.5, and Runway's Gen-3 Alpha Turbo model advancements.
    • Perplexity's integration of Flux image generation models and code interpreter updates for enhanced search results. 
    • Exploration of the feasibility and investment needed for scaling advanced AI models like GPT-4 and Agent Q architecture enhancements.
    • Analysis of California's AI regulation bill SB1047 and legal issues related to synthetic media, copyright, and online personhood credentials.

    Timestamps + Links:

    Last Week in AI
    September 03, 2024

    #179 - Grok 2, Gemini Live, Flux, FalconMamba, AI Scientist

    Our 179th episode with a summary and discussion of last week's big AI news!

    With hosts Andrey Kurenkov (https://twitter.com/andrey_kurenkov) and Jeremie Harris (https://twitter.com/jeremiecharris)

    If you would like to get a sneak peek and help test Andrey's generative AI application, go to Astrocade.com to join the waitlist and the discord.

    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

    If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form.

    Email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai

    Episode Highlights:

    - Grok 2's beta release features new image generation using Black Forest Labs' tech.

    - Google introduces Gemini Voice Chat Mode available to subscribers and integrates it into Pixel Buds Pro 2.

    - Huawei's Ascend 910C AI chip aims to rival NVIDIA's H100 amidst US export controls.

    - Overview of potential risks of unaligned AI models and skepticism around SingularityNet's AGI supercomputer claims.

    Timestamps + Links:

    Last Week in AI
    August 20, 2024

    #178 - More Not-Acquihires, More OpenAI drama, More LLM Scaling Talk

    Our 178th episode with a summary and discussion of last week's big AI news!

    NOTE: this is a re-upload with fixed audio, my bad on the last one! - Andrey

    With hosts Andrey Kurenkov (https://twitter.com/andrey_kurenkov) and Jeremie Harris (https://twitter.com/jeremiecharris)

    If you would like to get a sneak peek and help test Andrey's generative AI application, go to Astrocade.com to join the waitlist and the discord.

    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

    If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form.

    Email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai

    In this episode:

    - Notable personnel movements and product updates, such as Character.ai leaders joining Google and new AI features in Reddit and Audible.

    - OpenAI's dramatic changes with co-founder exits, extended leaves, and new lawsuits from Elon Musk.

    - Rapid advancements in humanoid robotics exemplified by new models from companies like Figure in partnership with OpenAI, achieving amateur-level human performance in tasks like table tennis.

    - Research advancements such as Google's compute-efficient inference models and self-compressing neural networks, showcasing significant reductions in compute requirements while maintaining performance.

    Timestamps + Links:

    Last Week in AI
    August 16, 2024

    #177 - Instagram AI Bots, Noam Shazeer -> Google, FLUX.1, SAM2

    Our 177th episode with a summary and discussion of last week's big AI news!

    NOTE: apologies for this episode again coming out about a week late, next one will be coming out soon...

    With hosts Andrey Kurenkov (https://twitter.com/andrey_kurenkov) and Jeremie Harris (https://twitter.com/jeremiecharris)

    If you'd like to listen to the interview with Andrey, check out https://www.superdatascience.com/podcast

    If you would like to get a sneak peek and help test Andrey's generative AI application, go to Astrocade.com to join the waitlist and the discord.

    In this episode, hosts Andrey Kurenkov and Jeremie Harris dive into significant updates and discussions in the AI world, including Instagram's new AI features, Waymo's driverless cars rollout in San Francisco, and NVIDIA's chip delays. They also review Meta's AI Studio, Character.ai CEO Noam Shazeer's return to Google, and Google's Gemini updates. Additional topics cover NVIDIA's hardware issues, advancements in humanoid robots, and new open-source AI tools like OpenDevin. Policy discussions touch on the EU AI Act, the U.S. stance on open-source AI, and investigations into Google and Anthropic. The impact of misinformation via deepfakes, particularly one involving Elon Musk, is also highlighted, all emphasizing significant industry effects and regulatory implications.

    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

    If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form.

    Email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai

    Last Week in AI
    August 11, 2024

    #176 - SearchGPT, Gemini 1.5 Flash, Llama 3.1 405B, Mistral Large 2

    Our 176th episode with a summary and discussion of last week's big AI news!

    NOTE: apologies for this episode coming out about a week late, things got in the way of editing it...

    With hosts Andrey Kurenkov (https://twitter.com/andrey_kurenkov) and Jeremie Harris (https://twitter.com/jeremiecharris)

     

    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

    If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form.

    Email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai

    Last Week in AI
    August 03, 2024

    #175 - GPT-4o Mini, OpenAI's Strawberry, Mixture of A Million Experts

    Our 175th episode with a summary and discussion of last week's big AI news!

    With hosts Andrey Kurenkov (https://twitter.com/andrey_kurenkov) and Jeremie Harris (https://twitter.com/jeremiecharris)

    In this episode of Last Week in AI, hosts Andrey Kurenkov and Jeremie Harris explore recent AI advancements including OpenAI's release of GPT-4o Mini and Mistral's open-source models, covering their impacts on affordability and performance. They delve into enterprise tools for compliance, text-to-video models like Haiper 1.5, and YouTube Music enhancements. The conversation further addresses AI research topics such as the benefits of numerous small expert models, novel benchmarking techniques, and advanced AI reasoning. Policy issues including U.S. export controls on AI technology to China and internal controversies at OpenAI are also discussed, alongside Elon Musk's supercomputer ambitions and OpenAI's Prover-Verifier Games initiative.

    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

    If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form.

    Email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai

     

    Timestamps + links:

    Last Week in AI
    July 25, 2024

    #174 - Odyssey Text-to-Video, Groq LLM Engine, OpenAI Security Issues

    Our 174th episode with a summary and discussion of last week's big AI news!

    With hosts Andrey Kurenkov (https://twitter.com/andrey_kurenkov) and Jeremie Harris (https://twitter.com/jeremiecharris)

    In this episode of Last Week in AI, we delve into the latest advancements and challenges in the AI industry, highlighting new features from Figma and Quora, regulatory pressures on OpenAI, and significant investments in AI infrastructure. Key topics include AMD's acquisition of Silo AI, Elon Musk's GPU cluster plans for xAI, unique AI model training methods, and the nuances of AI copying and memory constraints. We discuss developments in AI's visual perception, real-time knowledge updates, and the need for transparency and regulation in AI content labeling and licensing.

    See full episode notes here.

    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

    If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form.

    Email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai

     

    Timestamps + links:

    Last Week in AI
    July 17, 2024

    #173 - Gemini Pro, Llama 400B, Gen-3 Alpha, Moshi, Supreme Court

    Our 173rd episode with a summary and discussion of last week's big AI news!

    With hosts Andrey Kurenkov (https://twitter.com/andrey_kurenkov) and Jeremie Harris (https://twitter.com/jeremiecharris)

    See full episode notes here.

    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

    If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form.

    Email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai

    In this episode of Last Week in AI, we explore the latest advancements and debates in the AI field, including Google's release of Gemini 1.5, Meta's upcoming LLaMA 3, and Runway's Gen 3 Alpha video model. We discuss emerging AI features, legal disputes over data usage, and China's competition in AI. The conversation spans innovative research developments, cost considerations of AI architectures, and policy changes like the U.S. Supreme Court striking down Chevron deference. We also cover U.S. export controls on AI chips to China, workforce development in the semiconductor industry, and Bridgewater's new AI-driven financial fund, evaluating the broader financial and regulatory impacts of AI technologies.  

    Timestamps + links:

    Last Week in AI
    July 07, 2024

    #172 - Claude and Gemini updates, Gemma 2, GPT-4 Critic

    Our 172nd episode with a summary and discussion of last week's big AI news!

    With hosts Andrey Kurenkov (https://twitter.com/andrey_kurenkov) and Jeremie Harris (https://twitter.com/jeremiecharris)

    Feel free to leave us feedback here.

    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

    Email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai

    Last Week in AI
    July 01, 2024