Podcast Summary
Multimodal AI: Blending Language and Images: OpenAI's avocado armchair project showcases the trend towards multimodal AI, which combines language models and images to create a more nuanced understanding of data.
AI research is continually evolving, and one significant trend is the move toward multimodal models. During this special crossover episode of the Let's Talk AI Podcast, the hosts discussed some of the most intriguing AI stories from the first half of 2021. Jeremy, host of the Towards Data Science Podcast, shared his pick: an article about OpenAI's avocado armchair. This project combined language models and images to create a mapping from images to text descriptions, focusing on the semantic meaning behind the images rather than just labeling them. This multimodal approach reflects a growing tendency in the field to mix different operating modes, enabling AI to capture more nuanced and complex information.
Advancements in Text and Image Processing with CLIP and DALL-E: CLIP identifies images based on visual content, while DALL-E generates images from verbal descriptions, showcasing the power of large language models and the emergence of creative behavior in multimodal AI.
OpenAI's latest advancements, specifically CLIP and DALL-E, represent a significant leap forward at the intersection of text and image processing. These models build upon earlier experiments from 2016, in which researchers mapped images onto word vectors to achieve impressive, though less advanced, results. CLIP functions as a classifier, identifying images based on their visual content, while DALL-E is its generative counterpart, producing images from verbal descriptions and demonstrating both the power of large language models and the emergence of creative behavior. This multimodal approach is exciting because it allows for richer interaction between NLP and computer vision by embedding each modality into a single shared space. As humans, we naturally process information from different modalities as interconnected, and these advancements reflect that trend toward a more unified representation of data. Additionally, recent work has shown that individual neurons in these models can respond to both a photograph and a sketch of the same object, further underscoring the significance of this trend. Overall, the transformative power of transformers and pure scaling are driving these advancements, making it an exciting time for the intersection of text and image processing.
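The shared-embedding idea behind CLIP-style zero-shot classification can be sketched in a few lines. The encoders below are stand-ins (fixed random projections, purely illustrative, not OpenAI's actual architecture); the point is the mechanism itself: embed both modalities into one space, then score an image against candidate text prompts by cosine similarity.

```python
import numpy as np

rng = np.random.default_rng(0)

# Stand-ins for trained image and text encoders: fixed random projections
# into a shared 64-dimensional space. A real model like CLIP learns these
# so that matching image/text pairs end up close together.
image_encoder = rng.normal(size=(2048, 64))  # e.g. pooled CNN features -> shared space
text_encoder = rng.normal(size=(300, 64))    # e.g. pooled token features -> shared space

def embed(features, projection):
    """Project features into the shared space and L2-normalize."""
    v = features @ projection
    return v / np.linalg.norm(v)

def zero_shot_classify(image_features, label_prompts):
    """Score an image against text prompts by cosine similarity, then softmax."""
    img = embed(image_features, image_encoder)
    txt = np.stack([embed(p, text_encoder) for p in label_prompts])
    logits = txt @ img  # cosine similarities, since all vectors are unit length
    return np.exp(logits) / np.exp(logits).sum()  # one probability per label

# Toy inputs standing in for real encoder outputs.
image_features = rng.normal(size=2048)
prompts = [rng.normal(size=300) for _ in range(3)]  # e.g. "a photo of a {label}"
probs = zero_shot_classify(image_features, prompts)
print(probs)  # a probability per candidate label, summing to 1
```

With random projections the resulting probabilities are arbitrary, but the structure is the key point: no per-task training is needed, because new labels are handled simply by writing new text prompts.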
Scaling up AI through compute, data, and architecture: OpenAI's success with GPT-3, trained on a massive dataset, showcases the importance of scale in AI, surpassing the significance of model architecture
OpenAI's approach to achieving impressive AI results has been less about developing more sophisticated algorithms and more about scaling up in terms of compute, data, and architecture. This commitment to a repeatable set of building blocks, such as transformers, has led to models with impressive zero-shot performance across various tasks. OpenAI's success with GPT-3, which was trained on a massive dataset scraped from the internet, has significantly impacted the field of NLP and may even be expanding into the broader AI space. The infrastructure required to support this massive scale has become a competitive advantage for OpenAI, surpassing the importance of the model architecture itself. Despite initial skepticism, OpenAI's approach to scaling up through self-supervised learning has proven to be a game-changer in the world of AI.
The Shift in AI Focus: Simplifying Economic Inputs: The current AI race prioritizes simplifying economic inputs, leading to significant leaps in capabilities, but raises questions about potential consequences, including safety concerns when profit margins may compromise research.
The current race toward Artificial General Intelligence (AGI) is primarily focused on simplifying economic inputs such as compute and data, with companies like OpenAI aiming to remove the need for customized machine learning expertise. This shift, while leading to significant leaps in AI capabilities, raises questions about the potential consequences. For instance, EleutherAI, an open-source collective attempting to recreate and exceed GPT-3, is working toward models with a trillion parameters. Meanwhile, companies like Hugging Face and Google are promoting open-source models trained on TPUs. The debate revolves around ensuring that independent researchers can still conduct AI safety research, since profit motives may compromise safety when organizations are racing to scale.
Balancing Technology Scaling and Safety: Ensuring safety and addressing potential biases and ethical concerns are crucial as AI technologies advance, with the potential consequences illustrated by the Nijeer Parks case. Steps toward addressing these issues include ongoing research into fair language models and incentivizing commercial entities to prioritize safety.
As AI technologies, particularly language models and facial recognition, continue to advance and become more integrated into society, ensuring safety and addressing potential biases and ethical concerns become increasingly important. The hosts discussed the balance between scaling these technologies and ensuring safety in relation to OpenAI's models, highlighting the potential for misuse and ethical dilemmas, with aligning incentives toward responsible scaling raised as a possible solution. A real-world example of the consequences of inadequate attention to these issues is the case of Nijeer Parks, who was falsely arrested based on inaccurate facial recognition software. This incident underscores the importance of addressing potential problems with these technologies before they are deployed at larger scale. Ongoing research into training language models to be fair and unbiased, along with incentives for commercial entities to prioritize safety, were suggested as steps toward addressing these concerns.
Facial recognition technology raises concerns of racial bias: Facial recognition tech has misidentified Black men, perpetuating harm; broader issues of AI ethics and fairness need addressing.
There is a significant issue with the use of facial recognition technology, particularly in relation to racial bias. This technology, already deployed as a product, has misidentified Black men in three separate documented cases. This raises concerns about the alignment of AI with social values and the potential for harm to certain groups of people. The problem lies not just with the AI algorithm itself, but also with how it is used by human officers. Research has shown that facial recognition algorithms from major tech companies perform worse for Black people than for white people. These issues, however, are not unique to facial recognition technology. The broader question is where we want AI to stand in terms of morality and ethics, and whether we want to hold it to a higher or lower standard than ourselves. At the end of the day, there should be fairness across different groups of people, and that is not what we see today. It's crucial to address these issues to prevent potential harm and ensure that AI is used thoughtfully and effectively.
Understanding fairness in AI and its ethical implications: The lack of consensus on defining and quantifying fairness in AI leads to debates and disagreements, emphasizing the importance of ethical discussions and considerations in advancing AI research while minimizing negative consequences and ensuring fairness.
As we continue to develop and rely on artificial intelligence (AI), it's crucial to grapple with the ethical implications and potential biases in its use. Fairness is a complex concept, with interpretations ranging from equality of opportunity to equality of outcome. However, there's a lack of consensus on how to define and quantify fairness in AI. This leads to debates and disagreements, with different individuals or organizations making decisions based on their own preferences. It's important for the community to engage in ethical discussions and considerations, as regulatory efforts like the European Union's required impact assessments push for more thoughtfulness. Resources like NAACL's panel of ethics experts and Stanford's advisory board can help researchers navigate ethical dilemmas. The goal is to continue advancing AI research while minimizing negative consequences and ensuring fairness.
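The gap between these competing definitions can be made concrete with a small sketch. The data below is entirely hypothetical; it shows how a classifier can satisfy one fairness criterion (equal selection rates, i.e. demographic parity) while violating another (equal true-positive rates, i.e. equality of opportunity), which is exactly why the choice of definition matters.

```python
import numpy as np

# Hypothetical classifier outputs for two groups (toy data, illustration only).
# y_true: actual outcomes, y_pred: model decisions, group: protected attribute.
y_true = np.array([1, 0, 1, 1, 0, 0, 1, 0, 1, 0])
y_pred = np.array([1, 0, 1, 0, 0, 1, 1, 0, 0, 0])
group  = np.array(["A", "A", "A", "A", "A", "B", "B", "B", "B", "B"])

def selection_rate(pred):
    """Fraction of positive decisions (the quantity behind demographic parity)."""
    return pred.mean()

def true_positive_rate(true, pred):
    """Fraction of actual positives that receive a positive decision
    (the quantity behind equality of opportunity)."""
    positives = true == 1
    return pred[positives].mean()

for g in ("A", "B"):
    mask = group == g
    print(g,
          "selection rate:", selection_rate(y_pred[mask]),
          "TPR:", true_positive_rate(y_true[mask], y_pred[mask]))
```

On this toy data both groups are selected at the same 40% rate, yet qualified members of group B are approved only half the time versus two-thirds for group A: demographic parity holds while equality of opportunity fails, so "fair by one metric" does not imply "fair by another."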
AI's use in various fields raises concerns about epistemology and ethics: AI systems can have harmful consequences, particularly in areas where human judgment matters, and interdisciplinary research is necessary to understand complex human-computer interactions. Maintain human oversight to ensure AI aligns with values and goals.
The use of AI in various fields, including philosophy, policing, and healthcare, raises significant epistemological and ethical concerns. The people who engage with these AI systems are not a random, representative sample of the population, and the consequences of AI use can be harmful, particularly in areas where human judgment and intervention are crucial. The over-reliance on AI systems can lead to downstream issues, such as incorrect predictions or actions, and it's essential to involve interdisciplinary researchers to understand the complex human-computer interactions. A recent example of AI-generated content, a new Nirvana song created using AI software, highlights the potential and limitations of AI in creative fields. While it's an intriguing development, it underscores the importance of maintaining human oversight and involvement in AI systems to ensure they align with our values and goals. Ultimately, it's crucial to approach AI with humility, recognizing that it's not a panacea and that we must be aware of its limitations and potential risks.
AI-generated content in creative fields: music, speech, and voice acting: AI is making waves in creative industries like music, speech, and voice acting, raising ethical concerns and potential opportunities for artists and regulators.
We are witnessing an increasing trend of AI-generated content in various creative fields, including music, speech, writing, and voice acting. This was highlighted by a project that used Google's Magenta to generate a Nirvana-like song while raising awareness for mental health. The trend extends to the gaming industry, where an AI model was used to recreate a character's voice in a fan mod for The Witcher, sparking controversy among voice actors. As AI continues to advance, it will likely make its way further into generative speech and audio, potentially creating new opportunities and challenges for artists and regulators. Artists may need to adapt to this new landscape by learning about AI and its economics; for instance, they may need to decide whether to license their voices or protect their brand from potential misuse, adding another layer of complexity to their roles. Looking back, there are parallels to the past, when startups tried to license voices like Morgan Freeman's for automation. The counterintuitive nature of these deals and the potential for brand damage make the situation intriguing. As AI continues to advance, it will be crucial for regulators to step in and address the intellectual property implications and potential ethical concerns. This emerging field will require careful consideration and collaboration between artists, technologists, and policymakers.
AI in content creation: Opportunities and challenges: AI can enhance content creation with realistic and varied outputs, but raises concerns about brand identity, deep fakes, and potential misinformation or harm
As AI technology advances, particularly in areas like voice synthesis and deep fakes, it presents both opportunities and challenges. On the one hand, it can help small businesses and individuals create more realistic and varied content, such as generating dialogue for video games or music. On the other hand, it also raises concerns about brand identity, deep fakes, and the potential for misinformation or harm. The integration of AI into existing tools and the increasing accessibility of these technologies could make it harder to regulate and monitor their use, bringing up comparisons to past issues like the proliferation of pirated content. Ultimately, it's important to consider both the potential benefits and risks as AI continues to evolve in this area.
Exploring the Complex Relationship Between AI and Blockchain: AI empowers individuals to create new expressions and revolutionizes industries, while also raising challenges like copyright issues and job replacement. Balancing opportunities and challenges is crucial.
While AI and blockchain may be perceived as opposing forces, with AI centralizing and blockchain decentralizing, the reality is more complex. As it proliferates, AI empowers individuals to create new art, culture, and expressions, which is an exciting aspect of its future. Additionally, advancements in AI technology, such as generative models, voice synthesis, machine translation, and visual effects, have the potential to revolutionize industries and provide new opportunities. For instance, Morgan Freeman could potentially license and monetize his voice using AI technology. The future of AI holds both challenges and opportunities, and it's essential to navigate this complex landscape with an open mind. The discussion also touched on the potential copyright issues that may arise with AI-generated content and the possibility of AI replacing human jobs, such as voice acting. However, the potential benefits, such as increased efficiency and productivity, should not be overlooked. It's important to strike a balance between embracing the opportunities that AI presents and addressing the challenges it poses. In conclusion, the future of AI is an intriguing and complex topic that requires ongoing exploration and discussion: a mix of possibilities, challenges, and opportunities that we'll have to navigate in the coming years. So, let's keep the conversation going and continue to explore the exciting world of AI.
Support our podcasts for valuable insights: Subscribe, rate, and share our podcasts for the latest trends and advancements in AI and data science.
Engaging with Let's Talk AI and the Towards Data Science podcast is important to us and can provide value to you. By subscribing, rating, and tuning in to future episodes, you're helping to support our content and ensuring that you don't miss out on valuable insights and information. Remember, your engagement and feedback are crucial to our continued success and growth. So, don't forget to subscribe, rate, and share our podcasts with your network. Together, we can explore the latest trends and advancements in artificial intelligence and data science, and stay ahead of the curve in these rapidly evolving fields.