    on-call

    Explore "on-call" with insightful episodes like "The On-Call Engineer: From Alarms to Incident Resolution in Software Engineering", "Things You Can't Do at Home, the Importance of Community and Networking at Your Workspace S2E2", "Slight Reliability Episode 66 - Building Digital Assistants for SRE with Kyle Forster", "Slight Reliability Episode 62 - On-Call with Matt Brown" and "Slight Reliability Episode 60 - From Zero to SRE with Amin Astaneh" from podcasts like "Remote Work Tech Brief By HackerNoon", "Business RadioX ® Network" and "Slight Reliability", and more!

    Episodes (8)

    The On-Call Engineer: From Alarms to Incident Resolution in Software Engineering

    This story was originally published on HackerNoon at: https://hackernoon.com/the-on-call-engineer-from-alarms-to-incident-resolution-in-software-engineering.
    Navigating on-call complexities: from alert management to incident resolution, unravel the process in software engineering.
    Check more stories related to remote-work at: https://hackernoon.com/c/remote-work. You can also check exclusive content about #on-call, #software-development, #software-engineering, #alert-management, #team-collaboration, #engineering-practices, #on-call-process, #software-developers, and more.

    This story was written by: @shlokashah. Learn more about this writer by checking @shlokashah's about page, and for more stories, please visit hackernoon.com.

    Things You Can't Do at Home, the Importance of Community and Networking at Your Workspace S2E2

    In this episode of the Future of Work Podcast, Kyle McIntosh and Al Loveland are joined by Ben Johnson and Austin Miller. The episode discusses various aspects of work, entrepreneurship, and the role of community at Mac 6. The conversation touches upon the journeys of both guests, who transitioned from corporate roles to entrepreneurship. Austin […] The post Things You Can't Do at Home, the Importance of Community and Networking at Your Workspace S2E2 appeared first on Business RadioX ®.

    Slight Reliability Episode 66 - Building Digital Assistants for SRE with Kyle Forster

    This week Stephen brings back Kyle Forster from RunWhen to talk about the purple elephant in the room… “AI”.

    What makes it GenAI, LLM, Advanced Statistics, or ML? Kyle shares his experience building AI-powered search engines for SRE troubleshooting commands and explains how to incorporate a (paid) open source community of experts rather than trusting AI by itself. They discuss what search looks like under the hood, why GenAI-powered chatbots will or won't take over the SaaS industry, how Digital Assistants can be utilised by SREs to increase productivity (hint: giving them to app developers!), how to make informed decisions when purchasing AI products, and *much* more.

    You can find Kyle at:

    LinkedIn: https://www.linkedin.com/in/kyforster/recent-activity/all/

    And you can find out more about RunWhen at:

    Website: https://www.runwhen.com/
    Product videos: https://www.youtube.com/@whatdoirunwhen
    RunWhen Local: https://github.com/runwhen-contrib/runwhen-local (RunWhen Local is an open source troubleshooting cheat sheet that suggests commands from the RunWhen community for all of the namespaces in your cluster, ready to copy and paste)

    You can find the official Slight Reliability podcast website at: https://slightreliability.com/

    You can find Stephen at:

    LinkedIn: https://www.linkedin.com/in/stephentownshend/
    X: https://twitter.com/the_kiwi_sre
    YouTube: https://www.youtube.com/c/SlightReliability
    Instagram: https://www.instagram.com/slight_reliability/
    TikTok: https://www.tiktok.com/@the_kiwi_sre

    Slight Reliability Episode 62 - On-Call with Matt Brown

    This week Stephen chats with former Google SRE Matt Brown about being on-call. They cover how to uplift junior engineers so they can be on-call, what a fair on-call schedule looks like, runbooks, and much more.

    As you heard, Matt believes flexibility is key to a healthy on-call rotation. Matt is exploring ideas for improving existing tooling and products in this space and would love to hear from as many listeners as possible about what they find useful or frustrating in the tools they use to support on-call in their teams. You can reach him at oncall-feedback@mkmba.nz or schedule a chat via https://zcal.co/mattb/oncall. Please don't be shy!

    You can also find Matt at:

    Website: https://www.mattb.nz/
    LinkedIn: https://www.linkedin.com/in/mattbrown/
    Mastodon: https://mastodon.nz/@mattb
    Twitter: https://twitter.com/xleem

    You can find the official Slight Reliability podcast website at: https://slightreliability.com/

    You can find Stephen at:

    LinkedIn: https://www.linkedin.com/in/stephentownshend/
    Twitter: https://twitter.com/the_kiwi_sre
    YouTube: https://www.youtube.com/c/SlightReliability
    Instagram: https://www.instagram.com/slight_reliability/
    TikTok: https://www.tiktok.com/@the_kiwi_sre

    Slight Reliability Episode 60 - From Zero to SRE with Amin Astaneh

    In this episode Amin Astaneh from Certo Modo discusses his experience undertaking an SRE transformation over several years.

    Stephen and Amin cover a lot of ground, including making ops work visible, measuring toil, the power of calculating the dollar value of work, getting developers on-call, the embedded model for SRE, SLOs, culture change, and a whole lot more.

    You can find Amin on his company website https://certomodo.io, LinkedIn: https://www.linkedin.com/in/aminastaneh/ and Twitter: https://twitter.com/aastaneh

    The books Amin mentions are:

    The Practice of Cloud System Administration: https://www.oreilly.com/library/view/practice-of-cloud/9780133478549/

    The Phoenix Project: https://www.oreilly.com/library/view/the-phoenix-project/9781457191350/

    You can find the official Slight Reliability podcast website at: https://slightreliability.com/

    You can find Stephen at:

    LinkedIn: https://www.linkedin.com/in/stephentownshend/

    Twitter: https://twitter.com/the_kiwi_sre
    Instagram: https://www.instagram.com/slight_reliability/

    Slight Reliability Episode 44 - Cognitive Overload with Paige Cruz

    In this episode we discuss cognitive overload in SRE with Paige Cruz from Chronosphere. We cover what cognitive load is, what causes it, and some potential antidotes and preventative measures.

    You can check out Chronosphere here: https://chronosphere.io/

    You can find Paige on LinkedIn: https://www.linkedin.com/in/paigerduty/

    You can find the official Slight Reliability podcast website at: https://slightreliability.com/

    You can find Stephen at:

    LinkedIn: https://www.linkedin.com/in/stephentownshend/

    Twitter: https://twitter.com/the_kiwi_sre

    Episode 18: On Call Engineering: How to do it and Smile with Charity Majors

    On-Call Engineering: How to do it and smile

    • Intro to the topic and Charity Majors
    • What is on-call, even? And why does it exist?
    • Our experiences with on-call
      • When did we start? How many years, some stories about our past, and why we are talking about it
    • How to onboard to on-call?
    • Can it be done and actually be positive?
    • What causes on-call to be negative, then?
      • What steps can be taken to improve it?
    • What is incident management?

    Charity Majors is the CTO of Honeycomb.io, is @mipsytipsy on Twitter, and blogs at https://charity.wtf/

    Links Raz shared with his team:


    Also Available on:
    Spotify, iTunes, Google, Stitcher, TuneIn, PlayerFM

    Music credits: Dan Lebowitz: Come and Get It! Licensed under Creative Commons: By Attribution 3.0 License 

    The 24/7 Job

    The mysterious infrastructure admin "Gareth" appears to discuss the challenges of working in a 24/7 profession.

    TOPICS

    • The Evolution to the IT Job
    • The 24/7 Cycle
    • On Call (good, bad, other)

    MUSIC FROM THE CCMIXTER COMMUNITY

    Snazzy Intro: On Top of the World by texasradiofish (c) copyright 2015 Licensed under a Creative Commons Attribution Noncommercial (3.0) license. http://dig.ccmixter.org/files/texasradiofish/50478 Ft: John Fletcher, Patricia Edwards, ElRon XChile, Speck, alexplaysguitar
    Live Show Starting Soon: cdk - Sunday by Analog By Nature (c) copyright 2016 Licensed under a Creative Commons Attribution (3.0) license. http://dig.ccmixter.org/files/cdk/53755
    Live Show Outro: MILLENNIALS by Analog By Nature (c) copyright 2018 Licensed under a Creative Commons Attribution (3.0) license. http://dig.ccmixter.org/files/cdk/57150

    Special Guest: Gareth Kingston.
    © 2024 Podcastworld. All rights reserved

    For any inquiries, please email us at hello@podcastworld.io