
    #169 - Google's Search Errors, OpenAI news & DRAMA, new leaderboards

    June 03, 2024

    Podcast Summary

    • AI misinformation: Google's new AI search feature faced backlash due to inaccurate and misleading information it provided, highlighting the need for careful implementation and consideration when adding AI features.

      Last week saw significant developments in AI, but also raised concerns over potential misinformation and errors in AI-generated content. Google's new AI search feature, which generates summaries of sources at the top of search results, faced backlash due to inaccurate and misleading information it provided, including a false claim about a Muslim US president and a suggestion to eat rocks. Google's response was defensive, stating that the errors affected only a small minority of queries and that bad results were being removed manually. The incident highlights the need for careful implementation and consideration when adding AI features, especially as the pressure to innovate and ship faster increases. The conversation also touched on the importance of engaging critics and skeptics in AI discussions, as well as how AI regulation is taking shape differently in the EU and the US. Overall, the episode emphasized the importance of balancing the benefits and risks of AI technology.

    • AI chatbot integrations: Recent developments include Microsoft's Copilot on Telegram, Google's Gemini in the Opera browser, and Amazon's plan to overhaul Alexa, signifying a trend toward making AI chatbots ubiquitous while raising strategic considerations.

      There have been several recent developments in the integration of AI chatbots into various platforms, with Microsoft's Copilot bot being added to Telegram and Google's Gemini being integrated into the Opera browser. These integrations signal a trend toward making AI chatbots ubiquitous, available across many platforms for easy access. They also raise strategic considerations: such integrations insert an extra layer between users and the AI providers, with implications for data sharing and potential competition. Another notable development is Amazon's plan to give Alexa an AI overhaul and introduce a monthly subscription price for its enhanced features. These advancements demonstrate the growing importance of AI in everyday life and the ongoing competition among tech companies to offer the best AI experiences to users.

    • Amazon's push into generative AI: Amazon is forming a new team to compete in generative AI, facing challenges in hardware and reportedly planning not to bundle its upgraded Alexa with Prime membership, while Microsoft makes strides in real-time translation and PwC becomes the first reseller and largest enterprise customer of OpenAI's ChatGPT, expanding its reach in the business world.

      Amazon is making a significant push into the generative AI space, forming a new team to help it compete with industry leaders like Microsoft. While it already has a strong brand and distribution with Alexa, it faces challenges in hardware and needs to make up for lost time. The upgraded Alexa reportedly will not be bundled with Prime membership, suggesting that inference may still be too expensive to give away. Microsoft, meanwhile, is making strides in real-time translation, making language barriers less of an issue; this could be a game-changer for video content and communication. Another notable development is Iyo's new AI earbuds, which aim to succeed where other AI gadgets have failed: they offer intrinsic value as high-quality headphones at a reasonable price point, making them a safer bet than some previous flops in the market. PwC, the large consulting firm, has also entered the scene, becoming the first reseller and largest enterprise customer of ChatGPT Enterprise. The deal gives 100,000 PwC employees access to the enterprise version of ChatGPT, further expanding its reach in the business world. Overall, these developments highlight the growing importance of generative AI and its potential applications across industries.

    • OpenAI, PwC deal: OpenAI's deal with PwC marks its first foray into a resale model, granting 100,000 PwC employees access to advanced AI technology, generating substantial revenue and boosting PwC's reputation as an AI-driven consulting firm; similar partnerships could shift the balance of power in the media landscape.

      OpenAI, a leading AI research lab, has recently signed a significant deal with PwC, one of the world's largest professional services networks, granting 100,000 PwC employees access to OpenAI's advanced AI technology. This marks OpenAI's first foray into a resale model and a strategic move toward greater independence from tech giants like Microsoft, with whom it has a complex relationship. The deal is expected to bring substantial revenue to OpenAI and to boost PwC's reputation as a forward-thinking, AI-driven consulting firm. The related trend of media outlets partnering with AI companies for data licensing and enhanced content generation is likely to continue, potentially shifting the balance of power in the media landscape. As more revenue flows through these AI platforms, the implications for unbiased coverage and the role of social media platforms in shaping media narratives become crucial questions to consider.

    • AI industry partnerships with media outlets: Technology companies are partnering with media outlets for access to up-to-date information and training data, offering incentives such as discounted rates for schools and nonprofits, and China's advances in advanced-node process technology could position it as a long-term competitor in the AI industry.

      Technology companies, including those in the AI sector, are seeking access to up-to-date information and training data, leading to partnerships and incentive deals with media outlets. This trend matters from an economic and business perspective as AI continues to advance and become commoditized, with multiple options available to users. OpenAI, for instance, is offering discounted rates for schools and nonprofits to use its chatbot technology. Meanwhile, despite US sanctions, China is making strides toward advanced process nodes using multi-patterning lithography techniques, including quadruple patterning. These developments could position China to compete long-term in the AI industry. That said, the relationship between technology companies and media outlets, as well as the timing of certain announcements, raises legitimate questions and concerns.

    • Semiconductor and AI competition: Huawei and SMIC are pushing toward smaller node sizes with multi-patterning, while NVIDIA dominates with record revenue and effective use of market power. xAI joins the ranks of highly valued AI companies, though its long-term position is uncertain, and Scale AI introduces harder-to-game private leaderboards for AI model evaluation.

      Huawei and SMIC are making significant strides in semiconductor technology with their multi-patterning technique, aiming to reach the three-nanometer node. NVIDIA, meanwhile, continues to dominate the industry with record-breaking revenue and profits, accelerating shipping velocity, and effective use of its market dominance to secure fabrication capacity. Elon Musk's xAI has raised $6 billion in funding, joining the ranks of highly valued AI companies, though the long-term sustainability of its position remains uncertain. Scale AI's new private leaderboards offer a more reliable evaluation of AI model performance, as they are harder to game and involve human evaluators. These developments demonstrate the ongoing competition and innovation in the AI and semiconductor industries.

    • AI language model evaluation: Significant strides have been made in evaluating AI language models using Elo-style rankings and human evaluations. Multilingual models like Cohere's Aya 23 bring state-of-the-art language modeling to nearly half of the world's population, while efforts continue to make advanced models open source.

      There have been significant strides in evaluating and comparing AI language models, with models like Claude 3 Opus and GPT-4 leading in different areas. Elo-style rankings built from human pairwise evaluations have proven to be the most robust method for comparing these models. Recently, Cohere for AI launched Aya 23, a multilingual model released in 8 and 35 billion parameter versions, bringing state-of-the-art language modeling to the languages of nearly half the world's population. The focus on multilingual models is a key differentiator for Cohere, as smaller models often underperform on non-English languages. Additionally, there are ongoing efforts to make advanced models like AlphaFold open source, though there are concerns about exactly which use cases and capabilities will be made available. Overall, the field of AI language models is becoming increasingly crowded, with advances in specific areas like low-resource languages.
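
      To make the Elo-style ranking concrete, below is a minimal sketch of the pairwise rating update these leaderboards build on. The K-factor, starting ratings, and function names are illustrative assumptions, not any leaderboard's exact implementation; systems like Chatbot Arena use refinements of this basic scheme.

      ```python
      # Minimal sketch of an Elo-style update from one human-judged comparison.
      # K-factor and initial ratings are illustrative, not a leaderboard's values.

      def expected_score(rating_a: float, rating_b: float) -> float:
          """Probability that model A beats model B under the Elo model."""
          return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400))

      def elo_update(rating_a: float, rating_b: float, a_won: bool, k: float = 32.0):
          """Return updated (rating_a, rating_b) after one pairwise vote."""
          e_a = expected_score(rating_a, rating_b)
          score_a = 1.0 if a_won else 0.0
          return rating_a + k * (score_a - e_a), rating_b - k * (score_a - e_a)

      # Two models start at 1000; model A wins a head-to-head human vote.
      a, b = elo_update(1000.0, 1000.0, a_won=True)
      print(a, b)  # 1016.0 984.0 -- A gains exactly the points B loses
      ```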

    • AI advancements: Mistral's Codestral model generates code, completes functions, and answers questions about a codebase in English, while a new optimization method, 'The Road Less Scheduled,' eliminates the need for learning-rate schedules, making training more efficient.

      The recent advancements in AI, specifically Mistral's release of its first generative AI model for code and the introduction of a new optimization method, represent significant strides in the field, with potential implications for industries ranging from software to drug discovery. Mistral's model, Codestral, can generate code, complete functions, write and test code, and answer questions about a codebase in English. Although its weights are publicly available, its large size (22 billion parameters) and long context window (32,000 tokens) mean that running it is practical mainly for organizations with substantial compute infrastructure. The new optimization method, "The Road Less Scheduled," eliminates the need for learning-rate schedules, making training more efficient and less reliant on trial-and-error hyperparameter tuning. The approach has shown strong performance on various benchmarks, generating excitement in the engineering and machine learning communities. While these advancements are promising, there are concerns about the fairness of the comparisons Mistral published and the potential limitations of its models. As the field continues to evolve, it remains crucial to evaluate these developments critically and consider their implications for the future of AI research and application.
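
      As a rough sketch of the schedule-free idea, under our reading of the paper: gradients are taken at an interpolation between a fast SGD iterate and a running average, and the average is the point you evaluate, removing the need for a decay schedule. The toy objective, hyperparameters, and variable names below are our assumptions, not the authors' reference implementation.

      ```python
      # Sketch of Schedule-Free SGD ("The Road Less Scheduled"), our reading:
      # z is the base SGD iterate, x the equal-weight average used for eval,
      # and gradients are taken at an interpolation y of the two.
      # Toy quadratic objective and hyperparameters are illustrative only.

      def grad(w: float) -> float:
          return 2.0 * (w - 3.0)  # gradient of f(w) = (w - 3)^2, minimum at 3

      z = x = 0.0
      lr, beta = 0.1, 0.9
      for t in range(1, 101):
          y = (1 - beta) * z + beta * x   # gradient evaluation point
          z = z - lr * grad(y)            # plain SGD step with a constant lr
          x = x + (z - x) / t             # running average; no decay schedule

      print(x)  # approaches the minimizer w* = 3.0
      ```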

    • Learning rate adjustments and compute scaling: Historically, holding the learning rate constant throughout training has not been optimal; adjusting it across different stages of training yields better results. Compute used to train advanced AI models has been growing by an estimated 4-5x per year.

      The learning rate in machine learning plays a crucial role in determining how quickly and effectively a model adapts to new data: a larger learning rate means bigger updates to the model, while a smaller one implies more cautious adjustments. Historically, a constant learning rate throughout training has not been optimal, and strategies that adjust the learning rate across different stages of training have shown better results. Additionally, the compute used to train advanced AI models has grown significantly over the years, at an estimated 4-5x per year. This trend, which started around 2012 with models like AlexNet, has been driven by companies like Google, OpenAI, and Meta, with Meta scaling at a slightly more aggressive rate. It's worth noting, though, that the optimal learning-rate strategy and these compute-scaling trends are subject to change as new research and technologies emerge.
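
      For contrast, a typical staged schedule looks like the sketch below: a linear warmup followed by cosine decay, one common way to vary the learning rate across training stages. The step counts and peak rate are arbitrary example values, not figures from the episode.

      ```python
      import math

      # Illustrative warmup + cosine-decay learning-rate schedule.
      # Warmup length, total steps, and peak rate are arbitrary examples.

      def lr_at(step: int, total_steps: int = 10_000,
                warmup_steps: int = 500, peak_lr: float = 3e-4) -> float:
          if step < warmup_steps:                  # stage 1: linear warmup from 0
              return peak_lr * step / warmup_steps
          progress = (step - warmup_steps) / (total_steps - warmup_steps)
          return 0.5 * peak_lr * (1 + math.cos(math.pi * progress))  # stage 2: decay

      print(lr_at(250), lr_at(500), lr_at(10_000))  # mid-warmup, peak, ~0 at the end
      ```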

    • Model size vs. data size: More complex, less compressible data requires proportionally more training data to train a model effectively, challenging the belief that model size and data size should always scale one-to-one.

      The quality and density of data play a significant role in determining the optimal ratio of model size to data size when scaling up machine learning models. A recent research paper found that more complex, less compressible data requires proportionally more data (rather than more parameters) to train a model effectively, challenging the long-held belief that model size and data size should scale one-to-one. The findings have implications for theories of intelligence and learning, reinforcing that the more data and compute invested in a model, the more performant it becomes. In robotics, analogous scaling laws have been identified: more data and compute lead to better results, though deployment constraints may necessitate data-optimal training or other approaches. Another intriguing topic in language models is contextual position encoding, which lets models attend to specific tokens based on their context, enabling better performance on tasks like selective copying, counting, and flipping, and improving perplexity on language modeling and coding tasks. Overall, these findings underscore the importance of understanding the relationship between model size, data size, and data quality in machine learning and robotics research.
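
      The compressibility measurement behind this result can be approximated in a few lines: use gzip's compression ratio as a cheap proxy for how complex (less compressible) a dataset is. The toy corpora below are stand-ins of ours; the paper works with real pretraining datasets.

      ```python
      import gzip
      import random

      # Sketch: gzip compression ratio as a proxy for data complexity.
      # The two toy corpora stand in for real pretraining datasets.

      def compression_ratio(text: str) -> float:
          raw = text.encode("utf-8")
          return len(gzip.compress(raw)) / len(raw)  # lower = more compressible

      random.seed(0)
      simple = "the cat sat on the mat. " * 200  # highly repetitive text
      complex_ = "".join(chr(random.randint(33, 126)) for _ in range(4800))

      print(compression_ratio(simple))    # small ratio: compressible, "easy" data
      print(compression_ratio(complex_))  # near 1.0: incompressible, "hard" data
      # Less compressible data shifts the compute-optimal mix toward more data
      # relative to parameters, per the result described above.
      ```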

    • AI limitations, usage gap: Despite advances in AI research, limitations persist and a gap between hype and real-world usage remains. Meta's CoPE addresses positional-encoding challenges, but awareness and daily use of tools like ChatGPT remain low.

      While Meta's AI research is making significant strides, transformer models have limitations that make advanced reasoning challenging. Meta's Contextual Position Encoding (CoPE) addresses one such issue by making positional information context-dependent, improving performance on tasks requiring precise counting and understanding of context (see the sketch after this summary). Despite the hype surrounding new AI products, a recent survey revealed that only a small percentage of people in various countries use these tools daily; even ChatGPT registers only 58% awareness in the US and 53% in the UK. This gap between hype and usage is not surprising, having been observed in previous technology cycles. The true impact of AI, however, may be measured less by usage numbers than by the value it creates, such as Google's AI-driven search summaries. In AI governance, the ongoing drama at OpenAI took a new turn with former board member Helen Toner revealing more details about Sam Altman's ousting: the board allegedly learned of ChatGPT's launch on Twitter, and there were concerns over a lack of transparency and misleading information regarding safety processes. Accusations of a toxic culture and psychological abuse have also surfaced, with Altman not the only person accused. These revelations add to the ongoing narrative of the OpenAI leadership change.
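
      Returning to CoPE, here is a compressed, single-head sketch of the mechanism as the paper describes it: a query's "position" for each earlier token is the sum of sigmoid gates over the intervening keys, and the resulting fractional positions are interpolated between learned embeddings. The shapes and simplifications are ours, not Meta's implementation.

      ```python
      import torch

      # Single-head sketch of Contextual Position Encoding (CoPE), our reading.
      # Positions are context-dependent soft counts rather than fixed indices,
      # so the model can "count" words or sentences instead of raw tokens.

      def cope_logits(q, k, pos_emb):
          # q, k: (seq, d); pos_emb: (max_pos, d) learned position embeddings
          seq = q.size(0)
          gates = torch.sigmoid(q @ k.T)                    # soft per-token counts
          gates = gates * torch.tril(torch.ones(seq, seq))  # causal mask
          # p[i, j] = sum of gates[i, t] for t in [j..i] (reverse cumulative sum)
          pos = gates.flip(-1).cumsum(-1).flip(-1)
          # Fractional positions: interpolate between adjacent integer embeddings.
          low = pos.floor().long().clamp(max=pos_emb.size(0) - 1)
          high = (low + 1).clamp(max=pos_emb.size(0) - 1)
          frac = (pos - low.float()).unsqueeze(-1)
          e = (1 - frac) * pos_emb[low] + frac * pos_emb[high]  # (seq, seq, d)
          # Add the position bias to the usual content attention logits.
          return q @ k.T + torch.einsum("id,ijd->ij", q, e)

      q, k = torch.randn(8, 16), torch.randn(8, 16)
      print(cope_logits(q, k, torch.randn(9, 16)).shape)  # torch.Size([8, 8])
      ```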

    • OpenAI leadership concerns: Concerns about OpenAI's leadership, specifically Sam Altman, include inaccurate information given to the board, silencing of whistleblowers, and a culture of secrecy contradicting OpenAI's public messaging, raising questions about governance and potential impact on humanity.

      There have been concerns raised about OpenAI and its leadership, specifically Sam Altman, regarding transparency, accountability, and safety practices. These concerns include instances of inaccurate information given to the board, silencing of whistleblowers, and a culture of secrecy that contradicts OpenAI's public messaging. The revelations have raised questions about the governance and leadership of the organization, which is working on advanced artificial intelligence technology. The lack of transparency and accountability is particularly concerning given the potential impact of OpenAI's work on humanity. The resignation of a researcher over safety concerns and his move to a rival company further highlights these issues. OpenAI has responded with statements denying concerns regarding product safety or security, but the lack of clear communication and transparency leaves many questioning the validity of these statements. The ongoing debate highlights the importance of openness and accountability in organizations, especially those working on advanced technologies with significant implications for society.

    • OpenAI governance: OpenAI's use of non-disparagement clauses and lack of transparency in addressing criticisms has raised concerns about governance and potential impact on public trust and AI safety regulation.

      OpenAI, a leading AI research lab, has faced numerous criticisms regarding its governance and employment practices. These include strict non-disparagement clauses in employment agreements that could prevent former employees from speaking negatively about the company, on pain of clawback of their vested equity. CEO Sam Altman has been criticized for being aware of these practices yet not addressing them publicly until faced with significant scrutiny. The formation of a new safety and security committee, led by OpenAI insiders, has been met with skepticism, and OpenAI simultaneously announced that it had begun training its next frontier model (presumed to be GPT-5). These events raise concerns about transparency and oversight within OpenAI, potentially affecting public trust and the broader debate on AI safety and regulation.

    • AI governance and accountability: Ongoing concerns about OpenAI's governance and accountability persist, with fines for deepfake creators and hacked AI models highlighting the need for ethical and regulatory oversight in the industry.

      There are ongoing concerns about governance and accountability at OpenAI, despite its expertise and policy leaders; Sam Altman's influence and the composition of the committee meant to address objections have raised questions. In other news, the person behind the deepfake Biden robocalls faces a proposed $6 million FCC fine, setting a precedent. A hacker released a jailbroken version of ChatGPT, highlighting the challenges of aligning AI models. China announced a $47.5 billion chip fund, a significant investment in its semiconductor industry, and Alphabet and Meta are pursuing AI partnerships with Hollywood studios, potentially democratizing video production. These developments underscore the importance of addressing ethical and regulatory issues in AI and technology.

    Recent Episodes from Last Week in AI

    #171 - Apple Intelligence, Dream Machine, SSI Inc

    Our 171st episode with a summary and discussion of last week's big AI news!

    With hosts Andrey Kurenkov (https://twitter.com/andrey_kurenkov) and Jeremie Harris (https://twitter.com/jeremiecharris)

    Feel free to leave us feedback here.

    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai

    Timestamps + Links:

    Last Week in AI
    June 24, 2024

    #170 - new Sora rival, OpenAI robotics, understanding GPT4, AGI by 2027?

    Our 170th episode with a summary and discussion of last week's big AI news!

    With hosts Andrey Kurenkov (https://twitter.com/andrey_kurenkov) and Jeremie Harris (https://twitter.com/jeremiecharris)

    Feel free to leave us feedback here.

    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai

    Timestamps + Links:

    Last Week in AI
    June 09, 2024

    #169 - Google's Search Errors, OpenAI news & DRAMA, new leaderboards

    Our 169th episode with a summary and discussion of last week's big AI news!

    Feel free to leave us feedback here: https://forms.gle/ngXvXZpNJxaAprDv6

    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai

    Timestamps + Links:

    Last Week in AI
    June 03, 2024

    #168 - OpenAI vs Scar Jo + safety researchers, MS AI updates, cool Anthropic research

    Our 168th episode with a summary and discussion of last week's big AI news!

    With guest host Gavin Purcell from AI for Humans podcast!

    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai

    Timestamps + Links:

    Last Week in AI
    May 28, 2024

    #167 - GPT-4o, Project Astra, Veo, OpenAI Departures, Interview with Andrey

    Our 167th episode with a summary and discussion of last week's big AI news!

    With guest host Daliana Liu (https://www.linkedin.com/in/dalianaliu/) from The Data Scientist Show!

    And a special one-time interview with Andrey in the latter part of the podcast.

    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai

    Timestamps + links:

    Last Week in AI
    May 19, 2024

    #166 - new AI song generator, Microsoft's GPT4 efforts, AlphaFold3, xLSTM, OpenAI Model Spec

    Our 166th episode with a summary and discussion of last week's big AI news!

    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai

    Timestamps + links:

    Last Week in AI
    May 12, 2024

    #165 - Sora challenger, Astribot's S1, Med-Gemini, Refusal in LLMs

    Our 165th episode with a summary and discussion of last week's big AI news!

    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai

    Timestamps + links:

    Last Week in AI
    May 05, 2024

    #164 - Meta AI, Phi-3, OpenELM, Bollywood Deepfakes

    Our 164th episode with a summary and discussion of last week's big AI news!

    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai

    Timestamps + links:

    Last Week in AI
    April 30, 2024

    #163 - Llama 3, Grok-1.5 Vision, new Atlas robot, RHO-1, Medium ban

    Our 163rd episode with a summary and discussion of last week's big AI news!

    Note: apologies for this one coming out a few days late; it got delayed in editing. -Andrey

    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai

    Timestamps + links:

    Last Week in AI
    April 24, 2024

    #162 - Udio Song AI, TPU v5, Mixtral 8x22, Mixture-of-Depths, Musicians sign open letter

    Our 162nd episode with a summary and discussion of last week's big AI news!

    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai

    Timestamps + links:

    Last Week in AI
    April 15, 2024