Podcast Summary
Text-to-Video AI startups: New AI startup Odyssey raised $9M in seed funding to build separate models for geometry, materials, lighting, and motion in text-to-video generation, offering finer control that matters for commercial applications as well as for safety and interpretability.
The field of AI is continuously evolving, with new startups and technologies emerging every week. During a recent episode of Last Week in AI, hosts Andrey Kurenkov and Jeremie Harris discussed various news stories, including the launch of Odyssey, a new AI startup focused on building Hollywood-grade text-to-video models. The company, which has raised $9 million in seed funding, aims to provide full control over core layers of visual storytelling by creating separate models for geometry, materials, lighting, and motion. This approach allows finer control over the AI system, which matters for commercial applications as well as for safety and interpretability. Text-to-video generation has gained momentum throughout the year, and it will be interesting to see how these technologies develop and find product-market fit. The hosts also acknowledged the importance of audience feedback and addressed comments regarding the focus on geopolitics and bias in their discussions. Overall, the episode highlights the dynamic nature of the AI industry and the ongoing efforts to create more capable and controllable AI systems.
AI innovation: Anthropic introduces a prompt playground for AI app improvement, while Figma's AI-generated design feature faces controversy, highlighting the need for ongoing advancements and proper QA processes in the AI sector.
The field of AI is continuously evolving, with companies like Anthropic and Figma introducing new features to enhance user experience and productivity. Anthropic's latest addition is a prompt playground, which allows developers to generate, test, and evaluate prompts to improve AI apps' responses for specialized tasks. This not only makes working with Claude more systematic but also collects valuable data for fine-tuning models. The potential integration of this technology into moviemaking visual-effects workflows is also an intriguing development. Meanwhile, Figma's AI-generated design feature, Make Design, faced controversy when it produced designs similar to existing apps, such as Apple's Weather app. The controversy led to the feature being paused, and Figma's response suggested that they used off-the-shelf language models combined with design systems they had commissioned. However, the response did not provide a clear explanation, and the issue was ultimately traced back to those underlying design systems. The CEO acknowledged the lack of a proper QA process and the need to improve it to avoid potential copyright issues. This incident raises interesting legal questions regarding AI-generated content and intellectual property. Overall, these developments demonstrate the importance of continuous innovation and improvement in the AI sector.
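The generate-test-evaluate loop such a playground supports can be sketched as a tiny harness. This is a minimal illustration, not Anthropic's actual API: `call_model` is a hypothetical stub standing in for a real model call, so the example is self-contained.

```python
# Minimal sketch of a prompt test-and-evaluate loop: score several
# prompt templates against labeled test cases and compare accuracy.

def call_model(prompt: str) -> str:
    # Stub "model": classifies by keyword, standing in for an LLM call.
    return "positive" if "great" in prompt.lower() else "negative"

TEMPLATES = {
    "terse": "Label the sentiment: {text}",
    "verbose": "You are a careful sentiment rater. Reply with exactly "
               "'positive' or 'negative'.\nText: {text}",
}

TEST_CASES = [
    ("This product is great!", "positive"),
    ("Terrible experience.", "negative"),
]

def score(template: str) -> float:
    """Fraction of test cases the model answers correctly with this prompt."""
    hits = sum(
        call_model(template.format(text=text)) == expected
        for text, expected in TEST_CASES
    )
    return hits / len(TEST_CASES)

scores = {name: score(tmpl) for name, tmpl in TEMPLATES.items()}
print(scores)
```

In a real setup, each `(prompt, response, score)` triple from such a loop is exactly the kind of data that could later feed fine-tuning, as the paragraph above notes.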
AI models and chatbots improvements: Companies like Quora, Groq, and Suno are enhancing AI models and chatbots with new features and integrations, enabling faster testing and comparison, while addressing challenges like memory usage and scalability.
Companies are continuously releasing new features and integrations for AI models and chatbots, though users often encounter issues and bugs. Previews, a new feature of Quora's Poe platform, allows users to create and share interactive web apps within chatbot conversations, making it easier to test and compare various models. Groq, a company specializing in fast inference for language models, has announced a lightning-fast LLM engine and a console for developers looking to switch from OpenAI. Suno, a text-to-music startup, has launched an iPhone app, letting users generate music ideas on the go. These developments reflect the increasing focus on inference and the trade-off between investing compute during training versus at inference time. However, challenges such as memory usage and scalability remain. Meanwhile, Microsoft and Apple have stepped back from board roles at AI companies amid regulatory scrutiny. Overall, these advancements demonstrate the rapid evolution of AI technology and its integration into various industries and applications.
Microsoft and OpenAI relationship investigations: Antitrust concerns led Microsoft to give up its observer seat on OpenAI's board, raising questions about Microsoft's level of control over OpenAI despite holding no majority stake or voting seat.
Regulatory investigations into Microsoft and OpenAI's relationship culminated in Microsoft giving up its observer seat on OpenAI's board due to antitrust concerns. This rapid turnaround came after Microsoft had offered Sam Altman, during his brief ouster from OpenAI, a position heading up a major AI research group at Microsoft. That move effectively undercut the board's authority and raised questions about the level of control Microsoft held over OpenAI despite not owning a majority stake or having a voting seat. Meanwhile, OpenAI is working on an AI health coach in partnership with Thrive AI Health, which aims to provide personalized health advice using AI and users' medical data. The startup Magic, which develops AI models to write software, is in talks to raise $200 million in funding at a $1.5 billion valuation, despite having no revenue or product for sale yet. These large funding rounds reflect the significant value investors see in automating software development.
GPU technology in AI: Andreessen Horowitz is investing heavily in GPUs to give its portfolio a competitive edge, putting around $1.3 billion into generative AI deals over the last two years, while Sequoia Capital questions the value of such investments given the lack of significant consumer revenues and the depreciation of GPUs over time.
The race for AI dominance is heating up, and venture capital firms like Andreessen Horowitz and Sequoia Capital are taking very different positions on GPU investments. Andreessen Horowitz has amassed over 20,000 GPUs to support its portfolio companies, investing heavily in generative AI deals worth around $1.3 billion over the last two years. Sequoia Capital, on the other hand, has expressed skepticism about the value of these investments, pointing out the lack of significant consumer revenues in the AI industry and the depreciation of GPU values over time. Elon Musk's xAI is also making headlines with plans to build a massive 100,000-GPU training cluster, further highlighting the importance of GPU technology in the AI space. These moves illustrate the growing financialization of AI, with compute becoming a currency of sorts for these investments. However, the question remains whether these investments will pay off in the long run, or if the market will become increasingly commoditized.
AI Competition: Smaller companies like xAI and Skild AI are competing with tech giants in AI research and hardware design, while AMD and Intel are making strategic acquisitions and investments to enhance their capabilities.
The AI industry is witnessing significant investments and advancements from various players, in both software and hardware. xAI, a comparatively small company, is aiming to compete with tech giants like Microsoft and Google in AI research and in-house data center design. AMD is acquiring Silo AI, a Finnish AI company, to bolster its software expertise and catch up with competitors. Skild AI, a Pittsburgh-based startup, raised $300 million in its Series A to develop general-purpose robotics models around a foundation model it claims is unusually robust. Intel is beginning construction on a large chip fab in Germany to compete with market leader TSMC and reduce reliance on Taiwan. Investors, including Jeff Bezos, Sequoia Capital, and Carnegie Mellon, have shown confidence in these companies, indicating their potential to make significant contributions to the field. The industry continues to evolve rapidly, with companies exploring varied approaches to AI and robotics to push the boundaries of technology.
Language model updates: Real-time weight updates through test-time training and composable interventions are potential solutions for effectively and reliably incorporating new information into language models.
The ability to effectively and reliably incorporate new information into language models is a significant challenge. Traditional methods like prepending knowledge or knowledge updating don't always work effectively, especially when dealing with dynamic information such as API updates or new documentation. The introduction of test time training (TTT) and composable interventions are potential solutions to this problem. TTT allows for real-time weight updates, while composable interventions enable the study of the effects of multiple interventions on the same language model. Additionally, the order in which interventions like knowledge editing, unlearning, and model compression are applied matters, with the performance being negatively impacted if certain interventions are done in the wrong order. It's important to consider the potential impact of these interventions on various metrics and to evaluate models comprehensively following these interventions. The ability to effectively and efficiently incorporate new information into language models will become increasingly important as the software and data environments continue to evolve. Another interesting paper discussed the use of a predictive attention mechanism for neural decoding of visual perception, which can reconstruct what a person is looking at with remarkable accuracy using fMRI recordings.
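As a rough intuition for the TTT idea above, here is a minimal sketch on a toy linear model, not the paper's actual method: the model is "pretrained" on stale data, then a copy of its weights takes a few gradient steps on fresh context before answering a query, so the new information is incorporated through real-time weight updates.

```python
import numpy as np

# Toy stand-in for a model: a 1-D linear regressor y = w * x.
rng = np.random.default_rng(0)

# "Pretraining" on old data where the true mapping is y = 2 * x.
X_old = rng.normal(size=(100, 1))
y_old = 2.0 * X_old[:, 0]
w = np.zeros(1)
for _ in range(200):
    grad = 2 * X_old.T @ (X_old @ w - y_old) / len(X_old)
    w -= 0.1 * grad  # w converges near 2.0

# New information arrives at inference time: the mapping is now y = 5 * x.
X_new = rng.normal(size=(8, 1))
y_new = 5.0 * X_new[:, 0]

def ttt_predict(w, X_ctx, y_ctx, x_query, steps=300, lr=0.1):
    """Test-time training: adapt a copy of the weights on the query's
    context data, then predict. The base weights are left untouched."""
    w_adapted = w.copy()
    for _ in range(steps):
        grad = 2 * X_ctx.T @ (X_ctx @ w_adapted - y_ctx) / len(X_ctx)
        w_adapted -= lr * grad
    return float(np.array([x_query]) @ w_adapted)

print(ttt_predict(w, X_new, y_new, 1.0))  # close to 5.0, not the stale 2.0
```

Because the adaptation happens on a copy of the weights, each query can bring its own context, which is the sense in which such updates compose with (and can interact with) other interventions like editing or compression.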
AI-based photograph analysis and malicious uses: Research advances in deciphering photo content from brain data with AI, but concerns arise over potential misuse, such as covertly fine-tuning models to act on harmful requests or hiding malicious instructions in seemingly benign plaintext, evading detection.
Researchers are making significant strides in deciphering what people are looking at in photographs using AI and brain data, producing impressive reconstructions. This fMRI-based reconstruction technique focuses on the most relevant brain areas and yields relatively accurate results, though not quite at the level of mind reading yet. Simultaneously, there is growing concern about potential malicious uses of AI, such as covert malicious fine-tuning, which can make models respond to harmful requests without detection. In this technique, attackers train models to understand a coded language or hide harmful instructions within benign-looking plaintext, making them difficult for safety and security filters to detect. Additionally, OpenAI faced several security issues, including a hacker gaining access to its internal messaging systems, potentially exposing sensitive information about its AI technology designs. These incidents highlight the importance of addressing both the technological advancements and the risks associated with AI.
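To make the "coded language" idea concrete, here is a deliberately trivial illustration, not the actual attack from the research: a Caesar shift shows how text can be encoded so that a naive keyword filter never sees the plaintext, which is the basic reason such encodings complicate detection.

```python
# Toy substitution cipher: shift lowercase letters by a fixed amount.
# A filter scanning for plaintext keywords would miss the encoded form.

def encode(text: str, shift: int = 3) -> str:
    """Shift each lowercase letter by `shift`, leaving other chars alone."""
    return "".join(
        chr((ord(c) - 97 + shift) % 26 + 97) if c.islower() else c
        for c in text
    )

def decode(text: str, shift: int = 3) -> str:
    """Invert encode() by shifting in the opposite direction."""
    return encode(text, -shift)

msg = "hello world"
coded = encode(msg)
print(coded)  # "khoor zruog" -- no plaintext keyword survives
assert decode(coded) == msg
```

Real covert schemes are far more sophisticated, but the principle is the same: a model fine-tuned to read the encoding understands the message while surface-level filters do not.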
OpenAI security breach implications: OpenAI's decision not to involve law enforcement in a security breach raises questions about their ability to make national security assessments as a private company and the potential implications for partnerships with tech giants like Microsoft.
The discussion revolves around the incident where OpenAI, a private AI research company, experienced a security breach and chose not to involve law enforcement due to their internal assessment of the situation. The incident raises questions about OpenAI's ability to make national security assessments as a private company and the potential implications of such decisions. Additionally, OpenAI has an internal scale ranking the capabilities of AI from one to five, with AGI defined as a highly autonomous system surpassing humans in most economically valuable tasks. The definition of AGI and the board's determination of it has significant implications for OpenAI's partnership with Microsoft. Furthermore, a new dataset, "Me, Myself, and AI," explores situational awareness in LLMs and found that even the top models fell short of human baselines. The discussion also highlights the importance of defining and measuring situational awareness in AI models and the ongoing research in this area. Overall, the conversation underscores the need for transparency, accountability, and effective oversight in the development and deployment of advanced AI systems.
AI situational awareness: Claude 3 Opus stands out with a high situational awareness score; OpenAI partners with Los Alamos National Laboratory to explore AI use in scientific research; a copyright lawsuit against Microsoft, OpenAI, and GitHub was dismissed; and a former employee called for more safety testing of new models.
There are significant differences in situational awareness capabilities among AI models, and these capabilities can be decoupled from broader abilities. Claude 3 Opus stands out for its high situational awareness score. Meanwhile, OpenAI is partnering with Los Alamos National Laboratory to explore the benefits and risks of using AI in scientific research, specifically in genetically engineering E. coli bacteria to produce insulin. A recent lawsuit against Microsoft, OpenAI, and GitHub over the use of intellectual property to train AI models was dismissed due to a failure to prove identical code reproduction. A former OpenAI safety employee, William Saunders, expressed concerns about the company's safety measures, comparing it to the Titanic, and emphasized the need for more safety testing before new models are launched. Vimeo, a video hosting service, has joined YouTube and TikTok in introducing AI content labels.
AI-generated content disclosure: Platforms like YouTube, Vimeo, and Etsy are implementing new policies to require creators to disclose AI-generated elements in their content to ensure transparency and prevent confusion for users, while tech startup Avail helps media companies and independent creators monetize their data for AI training services.
Technology companies are increasingly requiring creators to disclose when their content includes AI-generated elements. This includes platforms like YouTube, Vimeo, and Etsy, which are implementing new policies to prevent confusion and ensure transparency. For instance, Vimeo now allows creators to label their content as AI-generated, while Etsy requires sellers to classify items based on the level of human involvement in their creation. These policies are aimed at addressing the growing use of AI in content creation and ensuring that users are aware of it. Additionally, a tech startup called Avail is helping media companies and independent creators license their content for AI training services, allowing them to monetize their data instead of having it scraped. These developments reflect the growing importance of AI in content creation and the need for clear guidelines and ethical practices.