Logo
    Search

    How Stack Overflow is partnering with Google to encourage socially responsible AI

    enMarch 12, 2024

    Podcast Summary

    • Stack Overflow AI strategyNew CPO Ryan Polk shares Stack Overflow's focus on trust, transparency, and community-driven development for their AI strategy to create a more inclusive and effective tool for developers

      Ryan Polk, Stack Overflow's new Chief Product Officer, brings a wealth of experience from various industries and roles within software development and product management to the company. Throughout his career, he's worked in gaming, banking, and cybersecurity industries, and even took a detour into venture capital. His passion for the community and AI strategy led him to join Stack Overflow six months ago. During our conversation, he shared his background and how it shaped his perspective. He also mentioned his long-term engagement as a Stack Overflow user and his plans to encourage more contributions from the community. As for the company's news, Ryan's first blog post outlined the principles for Stack Overflow's AI strategy, focusing on trust, transparency, and community-driven development. This approach aims to create a more inclusive and effective AI tool for developers while maintaining the integrity of the platform.

    • Stack Overflow-Google partnershipStack Overflow partners with Google to improve AI model performance by integrating community data, emphasizing transparency and accountability, and fostering a mutually beneficial relationship between AI companies and communities

      Stack Overflow, in partnership with Google, is integrating community data into Google's AI model, Gemini, to improve its performance and provide more accurate results. This collaboration came about due to the increasing prevalence of large language models (LLMs) and the recognition that these models rely on community knowledge for training. Stack Overflow, being at the heart of these discussions, aims to facilitate this interaction by making it easier for users to find answers, verify information, and engage with the community. The partnership also emphasizes the importance of transparency and accountability, as users should be able to trust the sources of information and the reasoning behind the answers they receive. This approach fosters a virtuous circle between AI companies and the communities they serve, ensuring that the benefits are mutual.

    • AI model attribution and community interactionGoogle integrates attribution and community interaction into their IDEs, allowing developers to ask questions and contribute answers directly, making the interaction with the community a natural part of the development process

      AI models, particularly Large Language Models (LLMs), are increasingly being integrated into enterprise environments, and providers must ensure transparency and provenance to meet enterprise requirements. Attribution and showing the reasoning behind answers is essential, as it's a requirement for the community and a key factor in building trust. Google is an example of a partner in this endeavor, as they're working to integrate attribution and community interaction directly into their IDEs. This brings the community into the development process, allowing developers to ask questions, research, and even contribute answers directly from their IDEs. The goal is to make the interaction with the community a natural part of the development process, rather than an afterthought. This approach not only makes the AI model a starting point for conversation but also a valuable tool for collaboration and knowledge sharing.

    • Stack Overflow data for LLM trainingStack Overflow's conversational data, with multiple possible answers and a focus on human interaction, is ideal for training Large Language Models, offering an advantage to companies using Stack Overflow for Teams

      Stack Overflow's unique community-driven question and answer format, which has been the backbone of its success for the last 15 years, is also an excellent resource for training Large Language Models (LLMs). The conversation-like structure of the data, where multiple possible answers are involved, offers a valuable training ground for these models. While LLMs cannot assign accuracy or recency scores to answers like human communities can, they can learn from the conversation flow and the chain of thought that mirrors the way humans challenge, reflect, and critique each other's ideas. This makes Stack Overflow's data especially well-suited for training LLMs, giving companies that use Stack Overflow for Teams a potential advantage. The focus remains on enhancing human interaction and collaboration within the community, with AI capabilities serving to make the system easier and faster, but never replacing the human role in driving answers and communication.

    • Stack Overflow improvementStack Overflow is improving the question asking process, prioritizing human community, and partnering with LLM providers to expand reach and knowledge sharing.

      Stack Overflow, a leading platform for programmers to learn, ask questions, and share knowledge, is focusing on improving the question asking process and community engagement. The platform aims to make it easier for users to ask valid questions and speed up the answering process, while ensuring that humans are answering. They are exploring how AI can support this process, but the human community remains the priority. Stack Overflow is also forming strategic partnerships with a select number of Language Model (LLM) providers, with Google being the first, to expand their reach and create a virtuous cycle of community interaction and knowledge sharing. The platform's latest developments, such as their conversational chatbot and summarization capability, have been successful but require careful consideration. The ultimate goal is to continue providing up-to-date answers to contemporary coding questions by maintaining an active and engaged community.

    • Chatbot interaction in online communitiesThe focus is shifting towards enhancing interactive capabilities on online communities, making it easier for users to ask questions and find answers, and providing tools for moderators and answerers to make the process faster and more efficient through search and summarization, question asking and answering tools, and personalization features.

      The use of conversational chatbots in online communities might hinder interaction among community members instead of facilitating it. Therefore, the focus is shifting towards enhancing interactive capabilities on the site, making it easier for users to ask questions and find answers, and providing tools for moderators and answerers to make the process faster and more efficient. This includes investing in search and summarization, question asking and answering tools, and personalization features. The ultimate goal is to make users' lives easier and quicker in finding the information they need within the community. The integration of AI capabilities into enterprise products like Stack Overflow for Teams is also a priority, with a focus on summarization, Slack integration, IDE integrations, and partnerships to help users get answers quickly within their corporate environments.

    • Stack Overflow communityStack Overflow is a valuable resource for seeking answers to technical questions with over 86,000 users benefiting from a single question. The community values knowledge sharing and encourages interaction with users like Ben Popper (@BenPopper) and Ryan on Twitter and LinkedIn.

      The Stack Overflow community is a valuable resource for seeking answers to technical questions, with over 86,000 people benefiting from a question about extracting all keys from a JSON column in PostgreSQL. The community values curiosity and knowledge sharing, as evidenced by the recognition of a user named "very good software, not virus" who asked and answered this question. Ben Popper, the director of content at Stack Overflow, and Ryan, the new chief product officer, emphasized the importance of interacting with the community and encouraged listeners to connect with them directly. Ben can be found on Twitter at @BenPopper, and Ryan can be found on LinkedIn. They welcome feedback and look forward to working with the community as they continue to grow. Additionally, the importance of attribution and understanding the reasoning behind solutions was highlighted during the discussion. Overall, the Stack Overflow community is a vibrant and supportive network of individuals who are dedicated to helping each other solve technical challenges and expand their knowledge.

    Recent Episodes from The Stack Overflow Podcast

    How to build open source apps in a highly regulated industry

    How to build open source apps in a highly regulated industry

    Before Medplum, Reshma founded and exited two startups in the healthcare space – MedXT (managing medical images online acquired by Box) and Droplet (at-home diagnostics company acquired by Ro). Reshma has a B.S. in computer science and a Masters of Engineering from MIT.

    You can learn more about Medplum here and check out their Github, which has over 1,200 stars, here.

    You can learn more about Khilnani on her website, GitHub, and on LinkedIn.

    Congrats to Stack Overflow user Kvam for earning a Lifeboat Badge with an answer to the question: 

    What is the advantage of using a Bitarray when you can store your bool values in a bool[]?

    A very special 5-year-anniversary edition of the Stack Overflow podcast!

    A very special 5-year-anniversary edition of the Stack Overflow podcast!

    Cassidy reflect on her time as a CTO of a startup and how the shifting environment for funding has created new pressures and incentives for founders, developers, and venture capitalists.

    Ben tries to get a bead on a new Moore’s law for the GenAI era: when will we start to see diminishing returns and fewer step factor jumps? 

    Ben and Cassidy remember the time they made a viral joke of a keyboard!

    Ryan sees how things goes in cycles. A Stack Overflow job board is back! And what do we make of the trend of AI assisted job interviews where cover letters and even technical interviews have a bot in the background helping out.

    Congrats to Erwin Brandstetter for winning a lifeboat badge with an answer to this question:  How do I convert a simple select query like select * from customers into a stored procedure / function in pg?

    Say goodbye to "junior" engineering roles

    Say goodbye to "junior" engineering roles

    How would all this work in practice? Of course, any metric you set out can easily become a target that developers look to game. With Snapshot Reviews, the goal is to get a high level overview of a software team’s total activity and then use AI to measure the complexity of the tasks and output.

    If a pull request attached to a Jira ticket is evaluated as simple by the system, for example, and a programmer takes weeks to finish it, then their productivity would be scored poorly. If a coder pushes code changes only once or twice a week, but the system rates them as complex and useful, then a high score would be awarded. 

    You can learn more about Snapshot Reviews here.

    You can learn more about Flatiron Software here.

    Connect with Kirim on LinkedIn here.

    Congrats to Stack Overflow user Cherry who earned a great question badge for asking: Is it safe to use ALGORITHM=INPLACE for MySQL?

    Making ETL pipelines a thing of the past

    Making ETL pipelines a thing of the past

    RelationalAI’s first big partner is Snowflake, meaning customers can now start using their data with GenAI without worrying about the privacy, security, and governance hassle that would come with porting their data to a new cloud provider. The company promises it can also add metadata and a knowledge graph to existing data without pushing it through an ETL pipeline.

    You can learn more about the company’s services here.

    You can catch up with Cassie on LinkedIn.

    Congrats to Stack Overflow user antimirov for earning a lifeboat badge by providing a great answer to the question: 

    How do you efficiently compare two sets in Python?

    The world’s most popular web framework is going AI native

    The world’s most popular web framework is going AI native

    Palmer says that a huge percentage of today’s top websites, including apps like ChartGPT, Perplexity, and Claude, were built with Vercel’s Next.JS. 

    For the second goal, you can see what Vercel is up to with its v0 project, which lets developers use text prompts and images to generate code. 

    Third, the Vercel AI SDK, which aims to to help developers build conversational, streaming, and chat user interfaces in JavaScript and TypeScript. You can learn more here.

    If you want to catch Jared posting memes, check him out on Twitter. If you want to learn more abiout the AI SDK, check it out 

    here.

    A big thanks to Pierce Darragh for providing a great answer and earning a lifeboat badge by saving a question from the dustinbin of history. Pierce explained: How you can split documents into training set and test set

    Can software startups that need $$$ avoid venture captial?

    Can software startups that need $$$ avoid venture captial?

    You can find Shestakofsky on his website or check him out on X.

    Grab a copy of his new book: Behind the Startup: How Venture Capital Shapes Work, Innovation, and Inequality. 

    As he writes on his website, the book:

    Draws on 19 months of participant-observation research to examine how investors’ demand for rapid growth created organizational problems that managers solved by combining high-tech systems with low-wage human labor. The book shows how the burdens imposed on startups by venture capital—as well as the benefits and costs of “moving fast and breaking things”—are unevenly distributed across a company’s workforce and customers. With its focus on the financialization of innovation, Behind the Startup explains how the gains generated by tech startups are funneled into the pockets of a small cadre of elite investors and entrepreneurs. To promote innovation that benefits the many rather than the few, Shestakofsky argues that we should focus less on fixing the technology and more on changing the financial infrastructure that supports it.

    A big thanks to our user of the week, Parusnik, who was awarded a Great Question badge for asking: How to run a .NET Core console application on Linux?

    An open-source development paradigm

    An open-source development paradigm

    Temporal is an open-source implementation of durable execution, a development paradigm that preserves complete application state so that upon host or software failure it can seamlessly migrate execution to another machine. Learn how it works or dive into the docs. 

    Temporal’s SaaS offering is Temporal Cloud.

    Replay is a three-day conference focused on durable execution. Replay 2024 is September 18-20 in Seattle, Washington, USA. Get your early bird tickets or submit a talk proposal!

    Connect with Maxim on LinkedIn.

    User Honda hoda earned a Famous Question badge for SQLSTATE[01000]: Warning: 1265 Data truncated for column.