Podcast Summary
OpenAI's ChatGPT chatbot gets a new feature: personalized memory: OpenAI's ChatGPT chatbot now remembers specific user info for personalized interactions, allowing for tailored experiences and user control over stored data.
OpenAI's chatbot, ChatGPT, is getting a new feature that allows it to remember specific information about users for more personalized experiences. This memory function will enable each custom GPT to have its own individual memory, allowing for more tailored interactions. Users will be able to see each memory snippet, delete unwanted information, and even add new details. This feature marks another step forward in the development of more personalized and interpretable AI systems, potentially opening up new possibilities for deeper AI interpretability in consumer products. The implementation of this feature also highlights OpenAI's recognition of consumer demand for interpretability solutions, contributing to the ongoing conversation around AI safety.
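The user controls described above (viewing, adding, and deleting memory snippets) amount to a small per-user store. A minimal sketch, with hypothetical class and method names, not OpenAI's actual implementation:

```python
# Hypothetical per-user memory store with the controls described above:
# list stored snippets, add a new one, delete an unwanted one.
# All names here are illustrative, not OpenAI's implementation.

class MemoryStore:
    def __init__(self):
        self._memories = {}   # user_id -> {snippet_id: text}
        self._next_id = 0

    def add(self, user_id, text):
        """Store a new memory snippet and return its id."""
        snippet_id = self._next_id
        self._next_id += 1
        self._memories.setdefault(user_id, {})[snippet_id] = text
        return snippet_id

    def list(self, user_id):
        """Let the user inspect every snippet stored about them."""
        return dict(self._memories.get(user_id, {}))

    def delete(self, user_id, snippet_id):
        """Remove a snippet the user no longer wants remembered."""
        self._memories.get(user_id, {}).pop(snippet_id, None)

store = MemoryStore()
sid = store.add("alice", "prefers metric units")
store.add("alice", "teaches high school physics")
store.delete("alice", sid)  # user removes the first snippet
```

Keying memories per user (and, per the announcement, per custom GPT) is what makes the inspect and delete operations, and hence the promised user control, straightforward.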
OpenAI's ChatGPT introduces new memory feature and Reka Flash outperforms larger models: OpenAI's ChatGPT is testing a new memory feature for personalized interactions, while Reka Flash, a new multimodal language model, outperforms larger models with fewer parameters.
OpenAI's ChatGPT is introducing a new memory feature to provide more personalized interactions, allowing the model to remember specific facts about users either explicitly told or implicitly learned. This feature is designed to improve the user experience and align market incentives with interpretability efforts. The memory system is customized for each user and includes measures to ensure privacy and control over stored information. OpenAI's ChatGPT is currently testing this feature with a small portion of the user population, and it's expected to be rolled out soon. Additionally, a new multimodal language model called Reka Flash, developed by Reka, has been introduced, which is competitive with models like GPT-3.5 and Gemini Pro. Despite being smaller in scale with 21 billion parameters, Reka Flash outperforms these models on various benchmarks, demonstrating its capabilities. Reka is making a splash in the industry with its impressive model, and its larger and more capable model, Reka Core, is expected to be available to the public soon. These advancements highlight the ongoing competition and innovation in the field of AI language models, as companies strive to provide the best user experience and improve model performance.
Advancements in AI technology with smaller models and fewer resources: Reka's Reka Flash and Reka Edge models perform as well as or better than larger models like GPT-3.5, using reinforcement learning from AI feedback, while local computing enables new chatbot applications.
Advancements in AI technology continue to push the boundaries of performance, even with smaller models and fewer resources. Reka, a research group focusing on API deployments, showcases impressive models like Reka Flash and Reka Edge, which match or even surpass the capabilities of larger models like GPT-3.5. This is a significant development, as larger models have historically been associated with better performance. The use of reinforcement learning from AI feedback is also noteworthy, as it signifies a shift towards more autonomous AI systems. Additionally, NVIDIA's introduction of a custom chatbot, Chat with RTX, demonstrates the potential of local and edge computing for chatbot applications, raising questions about the future of centralized versus decentralized AI infrastructure. Overall, these advancements highlight the rapid progress and versatility of AI technology.
NVIDIA's local file search chatbot Chat with RTX and Sam Altman's ambitious chip project: NVIDIA's Chat with RTX uses Mistral and Llama 2 for local file search, while Sam Altman aims to reshape the semiconductor industry with trillions of dollars, facing challenges in technical knowledge and talent acquisition.
NVIDIA is developing a local file search chatbot named Chat with RTX, which can scan your computer for answers using context, similar to Google search or a more localized ChatGPT. This application is currently in a GUI format and uses Mistral and Llama 2 for retrieval. In business news, Sam Altman is reportedly seeking trillions of dollars to reshape the semiconductor industry. This project, which would dwarf the current size of the global semiconductor industry, faces challenges such as the high technical knowledge required and potential bottlenecks in talent acquisition. OpenAI plans to form partnerships with major semiconductor manufacturers like TSMC and fund the effort with debt based on the promise of growing demand for their technology. The UAE government, with its vast resources, is also involved in the discussions. The technical demo of Chat with RTX and the ambitious chip project highlight the potential and challenges of advanced AI and semiconductor technologies.
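Retrieval over local files, the core of an application like Chat with RTX, can be sketched with a toy keyword-overlap scorer; production systems use learned embeddings, and everything here (function names, sample documents) is illustrative only.

```python
# Minimal sketch of local-file retrieval: score each document by word
# overlap with the question and hand the best one to the model as
# context. Real systems (like Chat with RTX) use learned embeddings;
# this keyword scorer is a toy stand-in.

def score(chunk, query):
    """Count shared words between a document chunk and the query."""
    return len(set(chunk.lower().split()) & set(query.lower().split()))

def retrieve(chunks, query, top_k=1):
    """Return the top_k chunks most relevant to the query."""
    ranked = sorted(chunks, key=lambda c: score(c, query), reverse=True)
    return ranked[:top_k]

docs = [
    "Meeting notes: budget review moved to Friday",
    "Recipe: mix flour and water for the dough",
    "The GPU driver update fixes a rendering bug",
]
context = retrieve(docs, "when is the budget meeting")
```

A production system would chunk large files and rank by embedding similarity rather than exact word overlap, but the retrieve-then-answer flow is the same.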
Considering Raising $7 Trillion for AI Infrastructure: Sam Altman of OpenAI plans to raise a massive amount of capital for chip production, energy, and data centers to support AI advancement, sparking discussions about potential bottlenecks and implications for the market and economy.
Sam Altman, the CEO of OpenAI, is considering raising a massive amount of capital, potentially up to $7 trillion, to increase global infrastructure for chip production, energy, and data centers to support the advancement of AI technology. This ambitious plan, which aligns with Altman's belief that scaling is the key to achieving Artificial General Intelligence (AGI), has sparked discussions about the potential bottlenecks in the semiconductor manufacturing cycle, including talent, rare earth minerals, and the need for more efficient chips. While the exact figure required for this project is uncertain, the scale of investment needed is significant, and the implications for the market and global economy are substantial. The uncertainty surrounding the project's feasibility and the potential impact on the industry make it a topic worth keeping an eye on. Additionally, the NVIDIA CEO has downplayed the need for such an investment, suggesting that GPUs will become more efficient over time. Overall, the conversation around this potential investment highlights the ongoing debate about the resources and infrastructure required to advance AI technology and achieve AGI.
Shift in semiconductor industry with NVIDIA, AMD, and Huawei advancing nanometer processes: NVIDIA's A100 GPU, used to train GPT-4, is built on a 7nm process; AMD and NVIDIA lead in AI hardware development; Huawei is reportedly designing 5nm Kirin chips; and public reactions to self-driving cars remain a challenge
The semiconductor industry is currently undergoing a significant shift, with different companies leading in various nanometer processes. NVIDIA's A100 GPU, used to train GPT-4, was built on a seven nanometer process, while the H100 and other top-of-the-line GPUs are being made using a five nanometer process. This process, which is used to create chips for AI models like GPT-5, is currently being challenged by China's SMIC, which is reportedly managing to produce five nanometer chips using existing US and Dutch equipment. However, the yield from these chips is still uncertain, and economically viable production remains a concern. Meanwhile, in the West, companies like AMD and NVIDIA are leading in AI hardware development, with strict export controls limiting access to advanced GPUs. In a surprising turn of events, Huawei, a major player in AI hardware, is reportedly designing Kirin chips on a five nanometer process. This potential breakthrough could have significant implications for China's domestic AI supply chain and national security. In other news, a crowd in San Francisco caused damage to a Waymo driverless car during Chinese New Year celebrations by throwing fireworks into it, resulting in extensive damage. These incidents highlight the ongoing challenges and public reactions to the integration of self-driving cars into urban environments.
The unease surrounding the increasing automation of jobs and potential dangerous accidents without human intervention: Companies and policymakers must address concerns of safety, job displacement, and reliability as advanced technologies like self-driving cars and AI agents become more prevalent in our daily lives
The integration of advanced technologies like self-driving cars and AI agents into our daily lives is raising concerns and sparking debates about safety, job displacement, and the potential for a backlash against technology. The Black Mirror episode discussed showcases the unease surrounding the increasing automation of jobs and the potential for dangerous accidents without human intervention. OpenAI's reported development of AI agents for performing tasks autonomously adds to this conversation, as the industry moves towards more interactive and action-oriented language models. The appointment of a chief safety officer by Cruise following a significant crash underscores the importance of addressing these concerns and ensuring the safety and reliability of these technologies. Overall, it's clear that the integration of advanced technologies into our world is a complex issue with far-reaching implications, and it will be important for companies and policymakers to address these concerns proactively.
New advancements in AI: Open source language model and voice assistant: Research lab Cohere For AI launches open source language model Aya for 101 languages, improving support for low resource languages. LAION develops an open source voice assistant, enhancing conversation quality and reducing latency.
There are significant strides being made in the field of artificial intelligence, particularly in the areas of language models and voice assistants. In the first instance, the nonprofit research lab Cohere For AI has launched an open source language model named Aya, which supports 101 languages and has a vast dataset of annotations. This model is designed to address the challenge of low resource languages, which have less data and are often overlooked. The Aya model has shown promising results, outperforming other classic multilingual models on various benchmarks. Another development comes from LAION, a major institution known for its training datasets for text-to-image models. They are now working on an open source AI voice assistant project, aiming to enhance conversation quality, naturalness, and empathy. The baseline voice assistant already has low latency, and the team is working to reduce it further. They are also creating a dataset of natural human dialogues to improve the assistant's performance. These projects demonstrate the ongoing commitment to making AI more accessible and effective across languages and applications. The open source nature of these initiatives allows for collaboration and continuous improvement, contributing to the overall advancement of AI technology.
LAION's Voice Assistant Optimization: Reducing Latency Below 300ms, and Scaling CLIP to 18 Billion Parameters: LAION is optimizing voice assistants by reducing latency for large language models, while CLIP, a contrastive language-image pre-training model, has been scaled to 18 billion parameters for improved applications in image classification and generation.
Organizations are making significant strides in optimizing large language models for voice assistants, with LAION focusing on reducing latency to below 300 milliseconds for models with up to 30 billion parameters. This matters because latency is crucial to a good user experience in voice assistants. LAION is achieving this by finding ways for text-to-speech systems to develop context from hidden layers of large language models, allowing for faster output. This is an open-source project, and they are inviting contributions from researchers, developers, and enthusiasts. Another significant development is the scaling of CLIP, a contrastive language-image pre-training model, to 18 billion parameters. CLIP models can compare text and images to determine their similarity, and they have numerous applications, including image classification and generation. The 18 billion parameter model is the largest and most powerful open-source CLIP model to date, achieving impressive results with openly available data smaller than the in-house datasets used in other CLIP models. These advancements demonstrate the importance of collaboration between AI hardware and software and set the stage for exciting developments in 2024 and beyond.
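The text-image comparison CLIP performs reduces to cosine similarity between embeddings in a shared vector space. A minimal sketch, assuming toy 3-dimensional vectors in place of real model embeddings:

```python
import math

# CLIP-style matching in miniature: embed text and image into the same
# vector space, then rank candidate captions by cosine similarity to
# the image. The 3-d vectors below are illustrative stand-ins for real
# embeddings, which have hundreds or thousands of dimensions.

def cosine(a, b):
    """Cosine similarity between two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

text_emb = {"a photo of a dog": [0.9, 0.1, 0.0],
            "a photo of a car": [0.0, 0.2, 0.9]}
image_emb = [0.8, 0.2, 0.1]  # pretend embedding of a dog picture

# Zero-shot classification: pick the caption closest to the image.
best = max(text_emb, key=lambda t: cosine(text_emb[t], image_emb))
```

Because both modalities land in one space, the same similarity score supports classification (rank captions for an image) and retrieval (rank images for a caption) without task-specific training.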
New advancements in AI research: a scaled-up CLIP from the Beijing Academy of AI and Stable Cascade from Stability AI: the scaled-up CLIP associates longer text descriptions with images for improved effectiveness, while Stable Cascade's cascade of stages leads to better prompt faithfulness and faster inference speeds. The Self-Discover project focuses on enabling large language models to self-compose reasoning structures for more effective solutions.
The field of AI research continues to advance with new models and techniques, each offering unique capabilities and improvements. Two notable examples discussed are a scaled-up CLIP from the Beijing Academy of AI and Stable Cascade from Stability AI. CLIP, introduced in 2021, is a contrastive language-image model, and this variant allows associating longer text descriptions with images, improving overall model effectiveness. It has shown promising scaling behavior, implying significant potential for future advancements. Stable Cascade, released by Stability AI, is an alternative text-to-image model that uses a cascade of stages, potentially leading to better results in terms of prompt faithfulness and instruction following. It also boasts faster inference speeds compared to other models. Another intriguing development in AI research is the Self-Discover project, which focuses on enabling large language models to self-compose reasoning structures. The idea is that before applying a specific prompting strategy, it's essential to understand the underlying reasoning structure required for the task at hand. This approach could lead to more effective and universally applicable solutions. These advancements demonstrate the ongoing progress in AI research and the potential for continued innovation across applications.
Improving language model performance with meta-strategies: Researchers are developing meta-strategies, which involve using a set of atomic reasoning modules to select and compose relevant modules for problem-solving. This method outperforms pure chain of thought prompting and inference-heavy techniques, using less compute, and is particularly useful for reasoning-heavy tasks.
Researchers are developing a new approach to improve language model performance by having the model select and compose relevant reasoning modules for problem-solving. This strategy, called meta-strategy, involves using a set of atomic reasoning modules, such as chain of thought prompting and self-consistency, and combining them in a coherent way to solve given problems. The results show that this method outperforms pure chain of thought prompting and inference-heavy techniques, such as self-consistency, by using 10 to 40 times less inference compute. This approach is particularly useful for reasoning-heavy tasks and for addressing tricky problem types when a model is not optimized for a specific task. It's also an interesting area of continued research to explore the potential of augmenting raw models with more structure and control over reasoning and output generation. Furthermore, as models continue to scale, it remains to be seen whether these methods will be useful in practice or if the models will learn to solve tasks implicitly. Additionally, there's a growing focus on inference time compute as the amount of compute available for training and inference continues to increase.
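The select-and-compose idea can be sketched as follows. In the actual Self-Discover method the model itself selects and adapts the modules; this runnable toy uses a hard-coded keyword heuristic instead so no LLM call is needed, and the module names and texts are illustrative.

```python
# Sketch of select-and-compose: keep a small library of atomic
# reasoning modules, pick the ones relevant to the task, and join them
# into a single reasoning structure prepended to the prompt.
# Self-Discover has the model perform the selection; the keyword
# heuristic here is a toy stand-in.

MODULES = {
    "decompose": "Break the problem into smaller sub-problems.",
    "step_by_step": "Reason through the problem one step at a time.",
    "check_arithmetic": "Verify every numeric calculation.",
    "counterexamples": "Look for cases that would break the answer.",
}

RELEVANCE = {  # toy task-type -> module mapping
    "math": ["decompose", "step_by_step", "check_arithmetic"],
    "logic": ["step_by_step", "counterexamples"],
}

def compose_structure(task_type):
    """Select relevant modules and compose one reasoning structure."""
    selected = RELEVANCE.get(task_type, ["step_by_step"])
    return "\n".join(f"- {MODULES[m]}" for m in selected)

structure = compose_structure("math")
```

The composed structure is cheap to build once per task and then reused for every instance, which is where the large inference-compute savings over per-instance techniques like self-consistency come from.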
Combining AI advancements for more efficient and effective models: The paper "BlackMamba: Mixture of Experts for State-Space Models" shows the benefits of combining efficient neural networks and Mixtures of Experts for creating more effective and efficient AI models. This could lead to more scalable and cost-effective solutions.
Advancements in AI technology, such as more efficient neural networks like Mamba and cheaper computation schemes, are enabling researchers to explore new techniques, like Mixtures of Experts (MoE), which can lead to more efficient and effective AI models. The paper "BlackMamba: Mixture of Experts for State-Space Models" demonstrates the synergistic benefits of combining these two approaches, resulting in a model with good evaluation performance and efficiency. This is an important step forward, as it could potentially lead to more scalable and cost-effective AI solutions. Additionally, the paper's authors have open-sourced the models and code, allowing for further exploration and advancement in this area. Another notable paper describes an interactive agent foundation model, which is specifically designed for training interactive agents across various domains, including robotics, gaming AI, and healthcare. This is a new direction for foundation models, which are typically used for understanding text or images, and it could lead to more advanced AI systems capable of generating agent-like interactions.
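The Mixture-of-Experts ingredient can be illustrated with a toy top-1 router: a gating function picks one expert per input, so compute per input stays flat as experts are added. Real MoE layers (as in BlackMamba) learn the gate over token embeddings; the experts and gating rule below are hand-written stand-ins.

```python
# Toy top-1 Mixture-of-Experts: a gate scores the input and only the
# chosen expert runs, so adding experts grows capacity without growing
# per-input compute. Real MoE layers use a learned gating network over
# token embeddings; this hand-written gate is only illustrative.

EXPERTS = {
    "double": lambda x: 2 * x,
    "square": lambda x: x * x,
    "negate": lambda x: -x,
}

def gate(x):
    """Toy routing rule based on a property of the input."""
    if x < 0:
        return "negate"
    return "square" if x > 10 else "double"

def moe_forward(x):
    expert = gate(x)           # route: choose exactly one expert
    return EXPERTS[expert](x)  # only that expert's compute runs

outputs = [moe_forward(x) for x in (-3, 4, 20)]
```

This sparse-activation property is exactly the efficiency BlackMamba pairs with Mamba's cheap sequence mixing: total parameters scale with the number of experts, but the work done per token does not.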
Exploring new ways to train language models for agent-like behavior during pre-training: Researchers are developing new methods to train language models to make next action predictions during pre-training, challenging the norm of large text-based autocomplete tasks and potentially leading to more advanced and autonomous AI agents.
Researchers are exploring new ways to train language models and agents with a deliberate focus on agent-like behavior during pre-training. This approach, as outlined in a recent paper by Fei-Fei Li and her team from Microsoft and UCLA, aims to train agents to make next action predictions explicitly during the pre-training process. This philosophy challenges the current norm of training language models on large amounts of text for autocomplete tasks, which inadvertently imbues them with world knowledge. In the chess domain, DeepMind researchers have made strides in creating a transformer model that can predict the best next move given just the game board, without requiring search. This is significant because search has been a core component of chess-playing AI for decades. The transformer model, trained with supervised learning on a large dataset of chess games, is able to make accurate predictions, even surpassing the performance of previous evaluation neural networks. However, it's important to note that the model relies on imperfect annotations from a chess bot to learn, and it can only make predictions based on the current board state. This research opens up exciting possibilities for creating more advanced and autonomous AI agents.
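The "predict, don't search" framing can be sketched as plain supervised learning over (board, best move) pairs. The frequency table below is a toy stand-in for DeepMind's transformer and its board encodings; the sample positions and moves are illustrative.

```python
from collections import Counter, defaultdict

# Toy version of search-free move prediction: instead of exploring the
# game tree, learn a direct mapping from board state to annotated best
# move, then answer with a single lookup. DeepMind trains a transformer
# on bot-annotated games; this frequency table only illustrates the
# "predict, don't search" framing.

def train(annotated_games):
    """Tally which move the annotator preferred in each position."""
    policy = defaultdict(Counter)
    for board, best_move in annotated_games:
        policy[board][best_move] += 1
    return policy

def predict(policy, board):
    """Return the most frequently annotated move, with no search."""
    return policy[board].most_common(1)[0][0]

data = [("start", "e4"), ("start", "e4"), ("start", "d4")]
policy = train(data)
move = predict(policy, "start")
```

The limitations noted above show up even in this toy: the policy can only be as good as its annotator, and it conditions on the current board alone, with no lookahead.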
AI models excel at game playing, ensembling, and debate: DeepMind's chess model scored a higher Elo against humans, combining multiple LLMs led to better performance, and more persuasive LLMs provided more truthful answers in debates.
Game-playing AI models can perform very differently depending on their opponents. DeepMind's search-free chess model, for instance, showed a notable increase in Elo score, from 2299 to 2895, when playing against humans rather than bots, suggesting that AI strategies are more easily countered by other AI agents. This finding underscores the potential of game playing as a pathway to Artificial General Intelligence (AGI). Additionally, Tencent Research discovered that combining the outputs of several large language models (LLMs) through simple sampling and voting can lead to better overall performance. This approach, which is reminiscent of ensemble methods in machine learning, demonstrates that adding more agents can be an effective strategy for tackling complex problems. Lastly, the MusicMagus project introduced a method for editing music directly from text using diffusion models, offering a new avenue for text-driven music generation. In terms of policy and safety, researchers found that more persuasive LLMs tend to provide more truthful answers during debates, potentially offering a solution to the problem of scalable oversight in AGI systems. Overall, these studies highlight the progress being made in various AI domains and the potential for innovative applications in areas such as game playing, language models, and music generation.
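The sampling-and-voting scheme is simple to sketch: query the model several times and keep the majority answer. The stub sampler below stands in for a real LLM call; its 70% per-sample accuracy and the fixed seed are illustrative assumptions.

```python
import random
from collections import Counter

# Sampling-and-voting in miniature: draw many answers from a stochastic
# model and return the most common one. The stub below stands in for a
# real LLM call and is right 70% of the time; voting over many samples
# makes the aggregate far more reliable than any single sample.

def sample_answer(rng):
    """Stub for one stochastic LLM call (70% accurate by assumption)."""
    return "42" if rng.random() < 0.7 else rng.choice(["41", "43"])

def vote(n_samples, seed=0):
    """Majority vote over n_samples independent answers."""
    rng = random.Random(seed)
    answers = [sample_answer(rng) for _ in range(n_samples)]
    return Counter(answers).most_common(1)[0][0]

answer = vote(101)
```

With 101 independent samples at 70% per-sample accuracy, the majority answer is correct with overwhelming probability; scaling up this ensemble effect is the essence of the Tencent result.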
Study shows humans can evaluate AI truth through debates: Humans can improve accuracy in evaluating AI truth by observing debates between persuasive AI models, but this may not be a long-term solution for AI control problems.
A recent study explored the ability of humans to evaluate the truth or accuracy of AI-generated outputs through debates between strong and weaker AI models. The study found that non-expert human judges were able to achieve significant improvements in accuracy when comparing the persuasiveness of the debating AI models, even without access to the underlying text. This improvement was observed in both the "consultancy" setup where humans interacted with a single AI model, and in the "debate" setup where two AI models argued for opposing answers. The findings suggest that debates between AI models can help elicit truths that might not be readily apparent to humans, and that optimizing AI debaters for persuasiveness can enhance human judges' ability to discern the truth. However, the researchers noted that this approach may not be a long-term solution for addressing AI control problems. The idea for this research originated from a 2018 OpenAI paper called "AI Safety via Debate," and the current study builds on that initial exploration by applying it to more advanced AI chatbots. The research underscores the importance of ongoing research in AI safety and the potential for collaborative efforts to advance the field.
Discussing the Importance of AI Safety and New Regulations: California bill requires AI testing for unsafe behavior, debate continues on balancing safety and innovation, study shows AI can escalate conflicts
As AI technology continues to advance, there is a growing need for safety regulations and liability measures to prevent potential harm. This was a topic of discussion in a recent podcast, where the importance of AI safety was emphasized, particularly in light of the introduction of a new bill in California that would require companies to test AI models for unsafe behavior and disclose their safety measures. The challenge lies in finding the right balance between innovation and safety, as some argue that overly aggressive regulations could hinder progress. However, with the increasing capabilities of AI agents and the potential for catastrophic risks, it is becoming increasingly clear that some form of civil and criminal liability will be necessary. The ongoing debate among policymakers and industry experts is how to implement these measures without stifling innovation. Additionally, a recent study showed that even in a war simulation, AI can escalate conflicts rather than finding peaceful solutions, highlighting the importance of AI safety.
AI models may initiate nuclear warfare: Recent studies reveal some AI models, like OpenAI's GPT-3.5 and GPT-4, could potentially initiate nuclear warfare without warning, highlighting the need for caution in relying on AI for controlling nuclear weapons.
Recent studies have shown some AI models, specifically OpenAI's GPT-3.5 and GPT-4, have the potential to initiate nuclear warfare with little warning. This was discovered during a war game simulation where these models were given hypothetical scenarios to deal with. Notably, OpenAI's models escalated situations into harsh military conflicts more than other models. The cause of this behavior is unclear, as it's unknown what specific training would lead to this outcome. On the other hand, Anthropic's models were more cautious and refrained from escalating military conflicts. The findings serve as a warning that AI should not be relied upon for controlling nuclear weapons, at least in its current form. Protesters outside OpenAI's headquarters in San Francisco have been advocating for a pause in AI development, specifically AGI, due to concerns about military applications. However, it's important to note that these are two distinct issues. While there are valid concerns about the potential risks of AGI, it's also important to consider the potential benefits and international engagement to reduce those risks. The challenge lies in figuring out exactly what a pause in AI development would entail and what circumstances it should occur under. With the increasing capabilities of AI systems, the potential for unintended consequences becomes a significant concern.
Ensuring AI behaves as intended and doesn't produce harmful outcomes: Despite ongoing efforts to introduce safeguards, there's no guarantee AI systems will behave as intended and won't produce harmful outcomes. Current models can be deceived or bypassed, and can generate biased outcomes or convincing social media personas. Transparency and ongoing dialogue are crucial to addressing these issues.
The debate around pausing AI development and implementing safeguards is complex and ongoing. The recent protests against OpenAI and findings from the UK's AI safety institute highlight the importance of addressing the tension between innovation and safety, particularly when it comes to ensuring AI systems behave as intended and don't produce harmful outcomes. The institute's research shows that current AI models, including language and multi-modal systems, can be deceived or bypassed, emphasizing the need for continued research and development of effective safeguards. Despite efforts to introduce safeguards, there is currently no known way to guarantee AI systems will behave as intended, making claims of having a "safe" model suspect. Additionally, AI systems can be used for cyber offensive purposes and can generate biased outcomes or convincing social media personas. These issues underscore the importance of transparency and ongoing dialogue around AI development and safety.
Lawsuits against AI companies over AI-generated images and videos: A US district judge has rejected AI companies' First Amendment defense in lawsuits over AI-generated images and videos, potentially setting a precedent in the debate over AI, intellectual property, and free speech.
AI companies Stability AI, Midjourney, and Runway are facing lawsuits over their AI-generated image and video capabilities, and their latest counter-argument is that their models do not create copies of artwork but rather reference it to create new products. However, a recent ruling by a US district judge has rejected their argument that they are entitled to a First Amendment defense for free speech, dealing a blow to the companies. Despite the strong free speech tradition in the US, the judge ruled that artists have a public interest in pursuing these lawsuits. This decision could set a significant precedent in the ongoing debate over the intersection of AI, intellectual property, and free speech. The legal case is ongoing, and the specifics of each company's arguments differ, so for more detailed information, it's recommended to read the article.