Feb
1

NDC London 2024

Actionable Observability

Previous approaches to monitoring have relied primarily on infrastructure and service metrics, but modern approaches have embraced the idea of monitoring based on metrics that reflect the user experiences more accurately.

This talk will elaborate on those differences and a new monitoring approach that actions on SLOs. We’ll cover the following:
1. Observability vs Monitoring
2. Metrics and Types of Metrics & Monitors
3. Why SLOs are a good monitoring strategy

View Event →
Nov
30
to Dec 8

YOW! Australia

Organisational Substainability with Platform Engineering

Engineering organizations often face the consequences of building software in a way that prioritizes short-term gains over long-term ones. This has a lot of sociotechnical consequences, including tech debt, retention issues, and, ultimately, business risk. This talk focuses on how Platform Engineering can drive sustainability through its DevOps based principles, strong support system, and standardized shared architecture.

We’ll begin by reviewing what organizational sustainability is and how Platform Engineering can facilitate it. The rest of the talk will be split into three primary sections:

  1. The sociotechnical principles provided by DevOps

  2. The robust support structures that enable platform adoption and faster delivery.

  3. The Platform architecture, its principles, common tensions, and a framework for how to build platform architectures that enable product engineers to do their best work.

By the end, these principles and practices will tie together to form a concrete case study on how organizations can benefit from Platform Engineering teams.

View Event →
Nov
8

GitHub Universe

Microservice Observability with Node.js and OpenTelemetry

In this talk, we'll focus on using the OpenTelemetry standard for Node.js observability, especially Node.js applications within distributed systems. We'll dive into OpenTelemetry concepts such as auto vs manual instrumentation and the different types of telemetry data, e.g. distributed tracing, metrics, and logging.

We'll discuss this topic through the the lens of a platformized approach to implementing observability, and how this approach enables organizations to adopt OpenTelemetry. We'll define best practices, influenced by DevOps, shared tooling, and the support needed to create robust, observable applications.

View Event →
Oct
24

Codemotion Milan

Driving Sustainability with Platform Engineering

Engineering organizations often suffer from building software in a way that prioritizes short-term gains over long-term ones. This has a lot of sociotechnical consequences, including tech debt, retention issues, etc. This talk focuses on how Delivery Engineering can drive sustainability through its DevOps based principles and scalable technical practices.

For example, we’ll review how standardization can drive sustainability and provide examples. By the end, these principles and practices will tie together to form a concrete case study on how organizations can benefit from Delivery Engineering teams.

View Event →
Oct
11

DevOpsDays Eindhoven

Driving Sustainability with Platform Engineering

Engineering organizations often face the consequences of building software in a way that prioritizes short-term gains over long-term ones. This has a lot of sociotechnical consequences, including tech debt, retention issues, and, ultimately, business risk. This talk focuses on how Platform Engineering can drive sustainability through its DevOps based principles, strong support system, and standardized shared architecture.

We’ll begin by reviewing what organizational sustainability is and how Platform Engineering can facilitate it. The rest of the talk will be split into three primary sections:

  1. The sociotechnical principles provided by DevOps

  2. The robust support structures that enable platform adoption and faster delivery.

  3. The Platform architecture, its principles, common tensions, and a framework for how to build platform architectures that enable product engineers to do their best work.

By the end, these principles and practices will tie together to form a concrete case study on how organizations can benefit from Platform Engineering teams.

View Event →
Oct
6

Stretch Con

Trauma Informed Reliability Management

Psychological safety is particularly important for teams that manage service reliability. The vulnerability that comes with mitigating failures in production requires principles of trust, transparency, and inclusion that can only come from cultures that minimize harm and enable empowerment.

Cultivating this kind of culture requires leaders to think proactively about how to build processes and systems that enable teams to be healthy, productive, and effective, while being adequately prepared for situations when failure inevitably happens.

We’ll review the cultural consequences of chronic issues and the strategies we can use as leaders to align with our shared goal of building excellent teams. We’ll touch upon themes of privilege, power, and accountability.

View Event →
DevOps Con New York
Sep
25

DevOps Con New York

  • 333 Adams Street Brooklyn, NY, 11201 United States (map)
  • Google Calendar ICS

Incorporating DevOps Principles to Product Engineering

In this talk, we'll be diving into the intersection between product engineering and Site Reliability Engineering (SRE). We'll explore how they're combined to create a culture of excellence and psychological safety, both within a team and across an entire organization.

We'll start by discussing the fundamentals of DevOps and SRE, and then we'll explore how product engineers can use these principles and practices to develop more reliable, scalable, and resilient systems. We'll cover topics such as Service Level Objectives (SLOs), and how to define and use them effectively to manage expectations and prioritize high-impact work.

We'll also touch on how to create feedback loops to continuously improve the quality and performance of your products, and how to get buy-in from key stakeholders. Ultimately, you'll leave this talk with a deeper understanding of how to foster a culture of excellence, accountability, and psychological safety in your product engineering teams, and drive better outcomes for your organization as a whole.

View Event →
DevOpsDays London
Sep
22

DevOpsDays London

  • Broad Sanctuary London, England, SW1 United Kingdom (map)
  • Google Calendar ICS

Psychologically Safe Reliability Management

Psychological safety is particularly important for teams that manage service reliability. The vulnerability that comes with mitigating failures in production requires principles of trust, transparency, and inclusion that can only come from cultures that minimize harm and enable empowerment. Cultivating this kind of culture requires leaders to think proactively about how to build processes and systems that enable teams to be healthy, productive, and effective, while being adequately prepared for situations when failure inevitably happens. We’ll review the cultural consequences of chronic issues and the strategies we can use as leaders to align with our shared goal of building excellent teams. We’ll touch upon themes of privilege, power, and accountability.

View Event →
Sep
14

DevOps World

Belonging in DevOps: The Importance of Intersectionality

This talk will discuss intersectionality, while uncovering the different types of intersectionality which are often overlooked by organizations. We shall also discuss tips, advice and best practices for tech professionals hoping to implement intersectional goals and make tangible changes within their organizations.

View Event →
Aug
31

NDC Copenhagen

  • 68 Store Kongensgade København, 1264 Denmark (map)
  • Google Calendar ICS

Driving Sustainability with Platform Engineering

Engineering organizations often suffer from building software in a way that prioritizes short-term gains over long-term ones. This has a lot of sociotechnical consequences, including tech debt, retention issues, etc. This talk focuses on how Delivery Engineering can drive sustainability through its DevOps based principles and scalable technical practices.

For example, we’ll review how standardization can drive sustainability and provide examples. By the end, these principles and practices will tie together to form a concrete case study on how organizations can benefit from Delivery Engineering teams.

View Event →
Aug
25

NDC Copenhagen

Driving Sustainability with Platform Engineering

Engineering organizations often suffer from building software in a way that prioritizes short-term gains over long-term ones. This has a lot of sociotechnical consequences, including tech debt, retention issues, etc. This talk focuses on how Delivery Engineering can drive sustainability through its DevOps based principles and scalable technical practices.

For example, we’ll review how standardization can drive sustainability and provide examples. By the end, these principles and practices will tie together to form a concrete case study on how organizations can benefit from Delivery Engineering teams.

View Event →
Jul
25
to Jul 26

O'Reilly Publishing

Microservice Observability

You’ll learn how to create an effective observability system in microservice architectures. Understand which observability practices and microservice architecture patterns align well and set microservice organizations up for success. We will examine observability patterns and strategies  that you may  use when building systems to enable effective production debugging. Debugging software requires us to understand the behavior of our software so we can identify causes and solutions. The complexity created by distributed systems makes debugging applications considerably more challenging than it already is. Only by developing robust observable software can we overcome the challenges we experience while debugging production services.

View Event →
Jul
13

REFACTOR DX

A Platformized Approach to Node.js

In this talk, we’ll define the challenges developers face when building Node.js applications, and how Platform Engineering can provide robust runtime experiences through best practices, shared tooling, and centralized support.

More specifically, we’ll explore how creating shared standardized platforms helps maintain consistency and reliability across distributed systems, more specifically those that have embraced Node.js as their primary runtime. By establishing a shared platform, platform engineers can ensure that all Node.js applications adhere to the same set of standards, simplifying management and maintenance.

We’ll begin by covering key principles of Platform engineering, influenced by DevOps, and provide specific examples from my experiences developing shared platforms for Node.js developers.

View Event →
Jun
23

Devopsdays Amsterdam

Psychologically Safe Reliability Management

Psychological safety is particularly important for teams that manage service reliability. The vulnerability that comes with mitigating failures in production requires principles of trust, transparency, and inclusion that can only come from cultures that minimize harm and enable empowerment.

Cultivating this kind of culture requires leaders to think proactively about how to build processes and systems that enable teams to be healthy, productive, and effective, while being adequately prepared for situations when failure inevitably happens.

We’ll review the cultural consequences of chronic issues and the strategies we can use as leaders to align with our shared goal of building excellent teams. We’ll touch upon themes of privilege, power, and accountability.

View Event →
Jun
15

QCon NY 2023

Scaling Organizations with Platform Engineering

Engineering organizations are often incentivized to prioritize short-term growth over long-term needs, leading to socio-technical consequences such as tech debt, organization siloes, and attrition. This talk focuses on how Platform Engineering can drive sustainability for growing organizations through DevOps principles, centralization, and scalable technical practices.

We'll cover these topics through the lens of standardization and provide guidance on when Platform Engineering is a practical approach. Using my experience at small and mid-size startups and large companies, we'll discuss what it means to build a sustainable organization and the natural evolution and role platform engineering plays in that organizational need. By the end of the talk, attendees will understand how Platform Engineering can provide long-term benefits to growing organizations so that short-term growth doesn't come at the expense of long-term sustainability.

View Event →
Jun
9

GothamGo 2023

Using Golang for Platform Engineering

We’ll explore how Golang can improve platform engineering with its robustness, concurrency model, and cross-platform support. We’ll talk about DevOps best practices and Golang examples for creating tools, deployment automation, integrations, and using infra-as-code.

View Event →
May
16

DevOps Summit Enterprise

Driving Sustainability with Platform Engineering

Engineering organizations often suffer from building software in a way that prioritizes short-term gains over long-term ones. This has a lot of sociotechnical consequences, including tech debt, retention issues, etc. This talk focuses on how Platform Engineering can drive sustainability through its DevOps based principles and scalable technical practices.

For example, we’ll review how standardization can drive sustainability and provide examples. By the end, these principles and practices will tie together to form a concrete case study on how organizations can benefit from Platform Engineering teams.

View Event →
May
10

O'Reilly Infra & Ops Superstream

Effective Observability Microservices Architectures

I discuss a standardized platform-focused approach to building effective observable architectures, including how the approach addresses the new organizational challenges specific to microservices. This approach encompasses three parts: the patterns, the organizational support, and the stack of tools. You'll explore these concepts at a high level, with practical examples, and learn how to address these organizational challenges.

View Event →
May
4

Devopsdays Zurich 2023

  • 8 Technikumstrasse Winterthur, ZH, 8400 Switzerland (map)
  • Google Calendar ICS

Effective Observability in Microservice Architectures

Managing microservice architectures requires navigating highly complex systems. This complexity informs how we approach making these systems observable.

We’ll review a standardized platform-focused approach to building effective observable architectures, including how it addresses the new organizational challenges specific to microservices. The platform-focused approach encompasses three parts: the patterns we use, the needed organizational support, and the stack we use.

View Event →
May
2

Devopsdays Copenhagen 2023

Effective Observability Microservices Architectures

Managing microservice architectures requires navigating highly complex systems. This complexity informs how we approach making these systems observable.

We’ll review a standardized platform-focused approach to building effective observable architectures, including how it addresses the new organizational challenges specific to microservices. The platform-focused approach encompasses three parts: the patterns we use, the needed organizational support, and the stack we use.

View Event →
Mar
23

The DEVOPS Conference

Psychologically Safe Reliability Management

Psychological safety is particularly important for teams that manage service reliability. The vulnerability that comes with mitigating failures in production requires principles of trust, transparency, and inclusion that can only come from cultures that minimize harm and enable empowerment.

Cultivating this kind of culture requires leaders to think proactively about how to build processes and systems that enable teams to be healthy, productive, and effective, while being adequately prepared for situations when failure inevitably happens.

We’ll review the cultural consequences of chronic issues and the strategies we can use as leaders to align with our shared goal of building excellent teams. We’ll touch upon themes of privilege, power, and accountability.

View Event →
Jan
27

NDC London

Psychologically Safe Reliability Management

Psychological safety is particularly important for teams that manage service reliability. The vulnerability that comes with mitigating failures in production requires principles of trust, transparency, and inclusion that can only come from cultures that minimize harm and enable empowerment.

Cultivating this kind of culture requires leaders to think proactively about how to build processes and systems that enable teams to be healthy, productive, and effective, while being adequately prepared for situations when failure inevitably happens.

We’ll review the cultural consequences of chronic issues and the strategies we can use as leaders to align with our shared goal of building excellent teams. We’ll touch upon themes of privilege, power, and accountability.

View Event →
Sep
21

Devopsdays Chicago 2022

  • 1213 South Halsted Street Chicago, IL, 60607 United States (map)
  • Google Calendar ICS

Effective Observability in Microservices

This talk will focus on how to create an effective monitoring system across an organization’s entire microservice architecture. This talk combines principals from effective monitoring with microservice architecture patterns to define criteria that will set microservice organizations up for success.

One example of effective techniques include having consistency across all services since this consistency can centralize the definition of observability & what it means to have an “observable” system, make it easier to set up monitoring for services, and enable engineers to troubleshoot production issues across their own services and service dependencies.

Another example is having SLOs that align well with microservice ownership. Much like any given team should not own more than a few services, any given team also should not own more than a few different SLOs. I will dive into the organizational benefits that microservices provides and SRE reinforces by implementing SLOs.

Thirdly, how the main units of an observable system (events) align well with event-driven microservices. I will go into specifics on how events, logs, metrics, and traces relate to one another.

View Event →
Apr
18

Devopsdays Atlanta 2022

Psychological Safety for SRE teams

This talk will talk about how principles from trauma-informed teaching & management can be applied to Site Reliability Engineering teams, making them more effective, productive, and happy. It will discuss both the concepts and implementation details of these concepts, making sure to highlight benefits and match them to existing SRE principles.

View Event →
Apr
18

Devopsdays Birmingham 2022

Effective Observability in a Microservices Architecture

This talk will focus on how to create an effective monitoring system across an organization’s entire microservice architecture. This talk combines principals from effective monitoring with microservice architecture patterns to define criteria that will set microservice organizations up for success.

View Event →
Apr
14

Devopsdays Raleigh

  • 1101 Gorman Street Raleigh, NC, 27606 United States (map)
  • Google Calendar ICS

Effective Observability in Microservices

This talk will focus on how to create an effective monitoring system across an organization’s entire microservice architecture. This talk combines principles from effective monitoring with microservice architecture patterns to define criteria that will set microservice organizations up for success.

One example of effective techniques include having consistency across all services since this consistency can centralize the definition of observability what it means to have an “observable” system, make it easier to set up monitoring for services, and enable engineers to troubleshoot production issues across their own services and service dependencies.

Another example is having SLOs that align well with microservice ownership. Much like any given team should not own more than a few services, any given team also should not own more than a few different SLOs. I will dive into the organizational benefits that microservices provides and SRE reinforces by implementing SLOs.

Thirdly, how the main units of an observable system (events) align well with event-driven microservices. I will go into specifics on how events, logs, metrics, and traces relate to one another.

View Event →
Apr
5

LeadDev New York 2022

Effective Observability in Microservice Architectures

This talk will focus on how to create an effective observability system in microservice architectures. This talk focuses on which observability practices and microservice architecture patterns align well and set microservice organizations up for success.

One example of effective techniques includes having consistency across all services since this consistency can centralize the definition of observability & what it means to have an “observable” system, make it easier to set up observability for services, and enable engineers to troubleshoot production issues across their own services and service dependencies.

Another example is having SLOs that align well with microservice ownership. Much like any given team should not own more than a few services, any given team also should not own more than a few different SLOs. I will dive into the organizational benefits that microservices provides and SRE reinforces by implementing SLOs.

Thirdly, how the main units of an observable system (events) align well with event-driven microservices. I will go into specifics on how events, logs, metrics, and traces relate to one another

View Event →
Mar
3

Devopsdays Austin 2021

Fitting Site Reliability & Developer Experience into Your Team’s Priorities

This talk will focus on how you can use principles & metrics from Site Reliability teams and Developer Experience teams to create a culture of technical excellence, high velocity, and psychological safety.

When thinking about metrics that guide technical & team strategy, there are two underlying priorities that ideally should align towards one goal: system and engineering team health. Metrics relating to system health provide insight on how stable the user experience is, whereas engineering team health metrics provide insight on the experience of the engineers building these systems.

Site Reliability Engineering (SRE) defines the guiding principles and processes for ensuring system health, whereas Developer Experience (DX) is less about the actual system and more about the tools, processes, and productivity levels related to the development cycle of that system.

View Event →