
    #172 - Claude and Gemini updates, Gemma 2, GPT-4 Critic

    July 01, 2024

    Podcast Summary

    • AI user experience race: Anthropic's focus on safety and alignment, OpenAI's new ChatGPT tools, and Google's Gemini side panels are advancing the AI industry's user experience; the winner of this race will be determined not just by scaling but also by user experience, safety, and alignment features.

      The AI industry is advancing at an unprecedented pace, with companies like Anthropic and OpenAI releasing new tools and features to enhance user experience and differentiate themselves. Anthropic, in particular, is focusing on safety and alignment, building on its constitutional AI work and releasing a user-friendly Projects feature for collaborating and uploading large contexts. Google is also joining the race with the rollout of Gemini side panels for Gmail and other Workspace apps. Fine-tuning the user experience is a crucial dimension of the AI race, as it can significantly leverage the capabilities of the underlying models. The winner will be determined not only by scaling but also by the best user experience, safety, and alignment features.

    • AI integration in apps: Tech companies like Google, OpenAI, and Microsoft are integrating AI into their apps and tools to enhance user experience, offering features like summarizing and drafting emails, creating tables and formulas, and voice mode for chat.

      Tech companies, including Google and OpenAI, are increasingly integrating AI into apps and tools to augment the user experience. Google's AI functionality, known as Gemini, is now available in Gmail, Google Docs, and Google Sheets, offering features like summarizing email threads, drafting emails, and creating tables and formulas. OpenAI, meanwhile, is facing delays in rolling out its voice mode for ChatGPT due to the need for more refinement and safety testing. OpenAI has also made ChatGPT available to all Mac users, deepening its integration with Apple. Microsoft's Copilot is another example of this trend, offering a single AI that spans all of its apps. Apple, for its part, has taken a features-over-models approach, integrating AI throughout iOS in various ways. Waymo's removal of the waitlist for its robotaxi service in San Francisco is another sign of AI's growing prevalence in everyday life.

    • AI integration across industries: Companies like Waymo, Figma, and the startup Etched are making significant strides in AI technology, with Waymo preparing for the robotaxi era, Figma introducing generative tools, and Etched unveiling Sohu, a transformer-specialized chip. Etched is betting that transformers are the future, but concerns remain about scalability and on-chip memory footprint.

      We're witnessing significant advancements across industries, with a focus on artificial intelligence (AI) integration. Waymo, the self-driving technology company, is preparing for increased scrutiny as it graduates from testing to the robotaxi era. Figma, the popular design tool, has introduced a major redesign and AI capabilities, including generative text and image tools. In hardware, the startup Etched has unveiled Sohu, claimed to be the world's first transformer-specialized ASIC, which the company says is 20 times faster and cheaper than the latest GPUs. These developments underscore the growing importance of AI and its integration into applications across industries. Companies are making bold bets on AI, with Etched specifically betting that transformers will remain the dominant architecture. However, questions remain regarding the scalability and on-chip memory footprint of such specialized chips.

    • AI chips and architecture: NVIDIA's bet on custom chips and the transformer architecture is delivering high utilization rates and a successful partnership with TSMC. China invests in onshoring chip manufacturing and collaborating with Broadcom to develop advanced chips compliant with export control restrictions, while Chinese firms migrate to alternative APIs.

      The race for advanced chips and AI technology continues to heat up, with major companies and nations investing heavily to stay competitive. NVIDIA is making a significant bet on custom chips and the transformer architecture for AI training, achieving high utilization rates and finding success in its partnership with TSMC. Meanwhile, China is moving to onshore its chip manufacturing supply chain, and Chinese firms are reportedly collaborating with companies like Broadcom to develop advanced chips compliant with current export control restrictions. Chinese firms are also responding to API restrictions by migrating to alternative offerings. These developments underscore the importance of innovation and technological advancement in the global AI market, as well as the geopolitical stakes of the chip industry.

    • Chinese AI competition: Chinese tech companies are incentivizing businesses to leave OpenAI for domestic chatbot alternatives amid controversy and competition, while Meta releases a language model for compiler optimization.

      Several Chinese tech companies, including Alibaba Cloud, Zhipu AI, and GPUAI, are offering support and incentives for companies to migrate from OpenAI's models to their domestic alternatives, following OpenAI's withdrawal from the Chinese market. OpenAI has also faced controversy regarding its stock sale policies and non-disparagement agreements. Meta, for its part, has announced the release of a large language model for compiler optimization, aimed at improving the performance and efficiency of compiled code. Zhipu AI has also gained attention for its open-source models and unique licensing requirements. This complex set of events highlights the ongoing competition and shifting landscape in the AI industry, particularly in the context of geopolitical positioning and open-source technologies.

    • AI optimization of programming languages: Researchers use AI for just-in-time compilation, creating more efficient code without multiple compilation passes. Separately, Google has launched Gemma 2, and a roughly 100-billion-parameter protein language model that simulates biological evolution has generated new proteins, demonstrating AI's potential in diverse fields.

      Researchers are using artificial intelligence (AI) to optimize programming languages and create more efficient code. This process, described as just-in-time compilation with AI, allows code to be optimized without the need for multiple compilation passes (a minimal sketch of this idea appears below). Google is also advancing language models with the launch of Gemma 2, its new family of open models. Separately, a protein language model that simulates biological evolution is being used to generate new proteins: the team behind it, formerly from Meta, has raised $142 million in seed funding and uses a transformer model to predict protein structure and function from sequence information. The training reportedly used the most compute ever applied to a biological model, resulting in a roughly 100-billion-parameter model, and the team used it to design a new version of green fluorescent protein whose sequence is only 58% similar to the closest known fluorescent protein. The team is organized as a public benefit company, with a mission focused on social good rather than just the bottom line. These results demonstrate AI's potential to advance fields ranging from programming languages to biology, and mark a significant step in the integration of AI into industry and scientific research.
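      To make the single-pass idea concrete, here is a minimal, hypothetical sketch in which a language model suggests compiler flags for a C function, so the code is compiled once with the suggested flags instead of being compiled and benchmarked many times. The query_model stub, the prompt wording, and the returned flags are illustrative placeholders rather than part of Meta's released model, and the sketch assumes clang is available on the system.

      import subprocess
      import tempfile
      from pathlib import Path

      def query_model(prompt: str) -> str:
          """Placeholder for a real LLM call; it returns a plausible flag set
          so the sketch runs end to end."""
          return "-O2 -fvectorize -funroll-loops"

      def optimize_once(c_source: str) -> Path:
          """Compile a C snippet a single time, using flags suggested by the model."""
          prompt = ("Suggest clang optimization flags (one line, space-separated) "
                    "that minimize runtime for this C code:\n" + c_source)
          flags = query_model(prompt).split()
          with tempfile.NamedTemporaryFile("w", suffix=".c", delete=False) as f:
              f.write(c_source)
              src = Path(f.name)
          out = src.with_suffix(".o")
          # One compilation with the model-chosen flags, instead of an autotuning search loop.
          subprocess.run(["clang", "-c", str(src), "-o", str(out), *flags], check=True)
          return out

      if __name__ == "__main__":
          code = "int sum(const int *a, int n){int s=0;for(int i=0;i<n;i++)s+=a[i];return s;}"
          print("compiled to", optimize_once(code))

      In a real setup, query_model would call whatever model endpoint is available, and the suggested flags could be validated with a single compile-and-test run, which is still far cheaper than an exhaustive flag search.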

    • AI model evaluation: Despite a plateau in AI model performance, advancements in biology and new benchmarks continue to push the boundaries of what's possible in the industry.

      The field of AI model development has reached something of a plateau in benchmark performance, with no clear leader surpassing the GPT-4 level. This has led to skepticism toward new model announcements, especially from China, because of the difficulty of making accurate comparisons between models trained on different datasets and languages. The Hugging Face leaderboard, a significant resource for evaluating models, has been updated to address this, adopting new, harder benchmarks and revised scoring criteria. In biology, a recent paper on the structural mechanism of bridge RNA-guided recombination represents a significant step forward, made possible in part by AI tools like AlphaFold 2 and ColabFold. This discovery has the potential to change how we solve problems in genomics and insert genetic information into sequences. Overall, the AI industry faces challenges in scaling and evaluating new models, but advances in adjacent fields continue to push the boundaries of what's possible.

    • Biology and AI intersection, deep safety alignment: Google DeepMind's research supporting genetic engineering and work on deep safety alignment in language models are crucial advancements in their respective fields. Training refusals deeper into responses can improve robustness against attacks, but opposition to mandated safety testing exists in the policy realm.

      The intersection of biology and AI is producing significant advances, particularly in genetic engineering, where Google DeepMind's research has been instrumental in unlocking new possibilities. Another key finding from the research community is the importance of deep safety alignment in language models, meaning alignment that goes beyond just the first few output tokens. Deeper alignment can improve robustness against common exploits such as adversarial suffix attacks and pre-filling attacks. The researchers observed that language models typically begin safe responses with refusal tokens, and that by training models on examples that pair a harmful question with the start of a dangerous response followed by a recovery into refusal tokens, they could substantially reduce attack success rates (see the sketch below). In the policy realm, startups have opposed California's AI safety bill (Senate Bill 1047), which would require certain models to undergo safety testing; the bill has sparked debate about the balance between safety and innovation in the rapidly evolving field of AI. Overall, these findings highlight the importance of continued research in biology and AI, as well as the need for thoughtful policy to ensure the safe and ethical use of these technologies.
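      To make the refusal-training idea more concrete, here is a minimal sketch of the data-augmentation scheme described above, assuming a chat-style fine-tuning setup: for a harmful prompt, the model is trained not only to refuse from the first token but also to recover into a refusal after part of a harmful response has already been pre-filled. The helper name, refusal text, and example strings are illustrative and not taken from the paper.

      import random

      REFUSAL = "I'm sorry, but I can't help with that."

      def deep_alignment_examples(harmful_prompt, harmful_response, max_prefix_tokens=20):
          """Build training examples in which the model recovers into a refusal
          even after several tokens of a harmful response have been pre-filled."""
          tokens = harmful_response.split()
          # Standard example: refuse from the very first token.
          examples = [{"prompt": harmful_prompt, "prefill": "", "target": REFUSAL}]
          # Augmented examples: pre-fill k tokens of the harmful response (as a
          # pre-filling attack would), then train the target to switch back to refusing.
          for k in range(1, min(max_prefix_tokens, len(tokens)) + 1):
              examples.append({
                  "prompt": harmful_prompt,
                  "prefill": " ".join(tokens[:k]),
                  "target": " " + REFUSAL,
              })
          return examples

      if __name__ == "__main__":
          batch = deep_alignment_examples(
              "Explain how to do something dangerous.",
              "Sure, here is a detailed plan: first, you would ...",
          )
          print(random.choice(batch))

      During fine-tuning, the prefill text would sit at the start of the assistant turn with the loss applied only to the target, so the refusal remains likely at every depth of the response rather than only at the first few tokens.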

    • AI regulation: Expertise and understanding are crucial in AI regulation debates, as shown by Y Combinator's concerns about SB 1047 and by the need to address cases of AI misuse.

      The debate surrounding AI regulation, specifically California's SB 1047, highlights the need for expertise and understanding in this complex field. Y Combinator, the well-known startup accelerator, has voiced concerns about the bill's potential impact on innovation and investment in AI research. Critics counter that the organization lacks the national security and AI-control expertise needed to fully evaluate the risks at stake. Meanwhile, there have been reported cases of AI misuse, such as pro-government supporters using AI tools for mass messaging campaigns. These incidents underscore the importance of mechanisms for reporting and addressing the dual-use capabilities of advanced AI systems. It is crucial that all stakeholders, including organizations like Y Combinator, engage in open and informed discussions with experts so that regulations are effective and balanced.

    • AI policy, weaponization and dual use: An 'AI observatory' or 'information clearing house' is proposed to monitor dual-use capabilities, with a congressional liaison suggested to oversee its implementation. A new jailbreak technique called 'Skeleton Key' can bypass AI safety measures, and the music industry is suing AI music generators for copyright infringement.

      The document discussed is a well-thought-out piece on AI policy, specifically addressing the weaponization and dual use of AI systems. The authors advocate establishing an "AI observatory" or "information clearing house" to collect evidence of dual-use capabilities, with a liaison in Congress assigned to oversee its implementation; practical steps could include reporting requirements and incident response plans. Additionally, a new jailbreak technique called "Skeleton Key" was introduced, which bypasses AI safety measures by instructing the model to respond to harmful requests with a warning rather than a refusal. Microsoft researchers demonstrated that the technique works on various AI models; for GPT-4, they noted it only works when included in the system prompt rather than the user input. Lastly, the music industry is suing AI music generators for copyright infringement, raising questions about the legality of using copyrighted material to train AI models. Overall, the discussion provides valuable insight into the importance of AI policy and the challenges and potential solutions in this rapidly evolving field.

    • AI legal and ethical issues: Companies are seeking permission to use existing content for AI training while artists' control and compensation remain unclear; AI's limitations constrain creative choices, and legal uncertainty makes generative AI a financial risk for new companies.

      The use of generative AI, whether for text or music, raises significant legal and ethical questions. Companies like YouTube are seeking permission to use existing content to train their AI models, while others are creating new content with existing AI models. How much control artists have over the use of their work, and how they are compensated, remains unresolved. The limitations of current AI models, such as Sora being video-only, can also constrain creative choices. As AI continues to evolve, these issues will only become more complex and important to address. Betting on generative AI is also a big risk for new companies, given the legal uncertainty and the potential cost of licensing existing content. Overall, generative AI is an exciting development across industries, but it comes with significant challenges that need to be addressed.

    Recent Episodes from Last Week in AI

    #172 - Claude and Gemini updates, Gemma 2, GPT-4 Critic

    Our 172nd episode with a summary and discussion of last week's big AI news!

    With hosts Andrey Kurenkov (https://twitter.com/andrey_kurenkov) and Jeremie Harris (https://twitter.com/jeremiecharris)

    Feel free to leave us feedback here.

    Read our text newsletter and comment on the podcast at https://lastweekin.ai/

    Email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai

    Last Week in AI
    July 01, 2024

    #171 - Apple Intelligence, Dream Machine, SSI Inc

    Our 171st episode with a summary and discussion of last week's big AI news!

    With hosts Andrey Kurenkov (https://twitter.com/andrey_kurenkov) and Jeremie Harris (https://twitter.com/jeremiecharris)

    Feel free to leave us feedback here.

    Read our text newsletter and comment on the podcast at https://lastweekin.ai/

    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai

    Timestamps + Links:

    Last Week in AI
    June 24, 2024

    #170 - new Sora rival, OpenAI robotics, understanding GPT4, AGI by 2027?

    Our 170th episode with a summary and discussion of last week's big AI news!

    With hosts Andrey Kurenkov (https://twitter.com/andrey_kurenkov) and Jeremie Harris (https://twitter.com/jeremiecharris)

    Feel free to leave us feedback here.

    Read our text newsletter and comment on the podcast at https://lastweekin.ai/

    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai

    Timestamps + Links:

    Last Week in AI
    June 09, 2024

    #169 - Google's Search Errors, OpenAI news & DRAMA, new leaderboards

    Our 169th episode with a summary and discussion of last week's big AI news!

    Feel free to leave us feedback here: https://forms.gle/ngXvXZpNJxaAprDv6

    Read our text newsletter and comment on the podcast at https://lastweekin.ai/

    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai

    Timestamps + Links:

    Last Week in AI
    June 03, 2024

    #168 - OpenAI vs Scar Jo + safety researchers, MS AI updates, cool Anthropic research

    Our 168th episode with a summary and discussion of last week's big AI news!

    With guest host Gavin Purcell from AI for Humans podcast!

    Read our text newsletter and comment on the podcast at https://lastweekin.ai/

    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai

    Timestamps + Links:

    Last Week in AI
    May 28, 2024

    #167 - GPT-4o, Project Astra, Veo, OpenAI Departures, Interview with Andrey

    Our 167th episode with a summary and discussion of last week's big AI news!

    With guest host Daliana Liu (https://www.linkedin.com/in/dalianaliu/) from The Data Scientist Show!

    And a special one-time interview with Andrey in the latter part of the podcast.

    Read our text newsletter and comment on the podcast at https://lastweekin.ai/

    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai

    Timestamps + links:

    Last Week in AI
    May 19, 2024

    #166 - new AI song generator, Microsoft's GPT4 efforts, AlphaFold3, xLSTM, OpenAI Model Spec

    Our 166th episode with a summary and discussion of last week's big AI news!

    Read our text newsletter and comment on the podcast at https://lastweekin.ai/

    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai

    Timestamps + links:

    Last Week in AI
    May 12, 2024

    #165 - Sora challenger, Astribot's S1, Med-Gemini, Refusal in LLMs

    Our 165th episode with a summary and discussion of last week's big AI news!

    Read our text newsletter and comment on the podcast at https://lastweekin.ai/

    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai

    Timestamps + links:

    Last Week in AI
    May 05, 2024

    #164 - Meta AI, Phi-3, OpenELM, Bollywood Deepfakes

    Our 164th episode with a summary and discussion of last week's big AI news!

    Read our text newsletter and comment on the podcast at https://lastweekin.ai/

    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai

    Timestamps + links:

    Last Week in AI
    April 30, 2024

    #163 - Llama 3, Grok-1.5 Vision, new Atlas robot, RHO-1, Medium ban

    Our 163rd episode with a summary and discussion of last week's big AI news!

    Note: apologies for this one coming out a few days late, it got delayed in editing -Andrey

    Read our text newsletter and comment on the podcast at https://lastweekin.ai/

    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai

    Timestamps + links:

    Last Week in AI
    April 24, 2024