{"id":2129,"date":"2026-02-16T03:45:20","date_gmt":"2026-02-16T03:45:20","guid":{"rendered":"https:\/\/sreschool.com\/blog\/?p=2129"},"modified":"2026-02-16T03:45:21","modified_gmt":"2026-02-16T03:45:21","slug":"sre-keywords-and-terminology","status":"publish","type":"post","link":"https:\/\/sreschool.com\/blog\/sre-keywords-and-terminology\/","title":{"rendered":"SRE Keywords and Terminology"},"content":{"rendered":"\n<p>topic<br>SRE<br>Site Reliability Engineering<br>Reliability engineering<br>Production engineering<br>Operations<br>DevOps<br>Platform engineering<br>Service ownership<br>You build it you run it<br>Shared responsibility<br>Production readiness review PRR<br>Launch checklist<br>Operational excellence<br>Reliability<br>Resilience<br>Fault tolerance<br>High availability HA<br>Scalability<br>Elasticity<br>Durability<br>Maintainability<br>Observability<br>Monitoring<br>Telemetry<br>Instrumentation<br>Reliability culture<br>Toil<br>Toil budget<br>Automation<br>Autoremediation<br>Self healing<br>Runbook<br>Playbook<br>Operational runbook<br>Standard operating procedure SOP<br>On call<br>Pager duty<br>Primary on call<br>Secondary on call<br>Escalation policy<br>Escalation chain<br>Follow the sun<br>Incident response<br>Incident management<br>Major incident<br>SEV1<br>SEV2<br>SEV3<br>Incident commander IC<br>Communications lead<br>Scribe<br>War room<br>Bridge line<br>Incident timeline<br>Impact assessment<br>Customer impact<br>Blameless postmortem<br>Post incident review PIR<br>Root cause analysis RCA<br>5 Whys<br>Fishbone diagram<br>Action items<br>Corrective action<br>Preventive action<br>CAPA<br>Lessons learned<br>Incident retrospective<br>Change management<br>Change advisory board CAB<br>Change request<br>Change calendar<br>Maintenance window<br>Release management<br>Deployment<br>Rollback<br>Roll forward<br>Hotfix<br>Patch<br>Backport<br>Release train<br>Service catalog<br>System of record<br>Configuration management database CMDB<br>Asset inventory<br>Reliability testing<br>Load testing<br>Stress testing<br>Soak testing<br>Chaos engineering<br>Game day<br>Fault injection<br>Failure mode and effects analysis FMEA<br>Risk assessment<br>Risk register<br>Threat modeling<br>SLI<br>Service Level Indicator<br>SLO<br>Service Level Objective<br>SLA<br>Service Level Agreement<br>Error budget<br>Error budget burn<br>Burn rate<br>Multi window burn rate<br>Budget policy<br>Freeze policy<br>Availability<br>Uptime<br>Downtime<br>Outage<br>Partial outage<br>Degradation<br>Latency<br>Tail latency<br>P50 latency<br>P90 latency<br>P95 latency<br>P99 latency<br>P999 latency<br>Throughput<br>RPS<br>QPS<br>Error rate<br>Success rate<br>Apdex<br>Saturation<br>Capacity<br>Utilization<br>Headroom<br>MTTR<br>Mean Time to Recovery<br>Mean Time to Restore<br>Mean Time to Resolution<br>MTTD<br>Mean Time to Detect<br>MTTA<br>Mean Time to Acknowledge<br>MTBF<br>Mean Time Between Failures<br>Change failure rate<br>Deployment frequency<br>Lead time for changes<br>Metrics<br>Time series<br>Counter<br>Gauge<br>Histogram<br>Summary<br>Percentile<br>Cardinality<br>Dimensional metrics<br>Label<br>Tag<br>Metric namespace<br>Metric scraping<br>Pull model<br>Push model<br>Prometheus<br>Alertmanager<br>PromQL<br>Recording rule<br>Alerting rule<br>Service discovery<br>Grafana<br>Dashboard<br>Panel<br>Annotation<br>Templating<br>Variables<br>SLO dashboard<br>Golden signals<br>Four golden signals<br>RED method<br>USE method<br>Latency RED<br>Errors RED<br>Duration RED<br>Rate RED<br>Utilization USE<br>Saturation USE<br>Errors USE<br>Health check<br>Liveness check<br>Readiness check<br>Synthetic monitoring<br>Canary check<br>Black box monitoring<br>White box monitoring<br>Heartbeat<br>Uptime check<br>Alert<br>Alarm<br>Page<br>Notification<br>Alert routing<br>Alert deduplication<br>Alert suppression<br>Silence<br>Maintenance mode<br>Alert correlation<br>Noise reduction<br>Alert fatigue<br>Threshold alert<br>Anomaly detection<br>Dynamic threshold<br>Baseline<br>Seasonality<br>SLI query<br>SLO compliance<br>Error budget policy<br>Service level reporting<br>Logs<br>Structured logging<br>Unstructured logs<br>Log level<br>DEBUG<br>INFO<br>WARN<br>ERROR<br>FATAL<br>Log aggregation<br>Log shipping<br>Log forwarder<br>Log parsing<br>Log enrichment<br>Log sampling<br>Log retention<br>Log rotation<br>Centralized logging<br>Log indexing<br>Log search<br>Log analytics<br>Syslog<br>Journald<br>Fluentd<br>Fluent Bit<br>Logstash<br>Filebeat<br>Vector<br>OpenSearch<br>Elasticsearch<br>Kibana<br>OpenSearch Dashboards<br>Splunk<br>Graylog<br>Loki<br>Distributed tracing<br>Trace<br>Span<br>Span context<br>Trace ID<br>Span ID<br>Parent span<br>Root span<br>Context propagation<br>Baggage<br>Correlation ID<br>Request ID<br>Trace correlation<br>Log correlation<br>Sampling<br>Head based sampling<br>Tail based sampling<br>Probability sampling<br>Rate limiting sampler<br>Trace exporter<br>Span exporter<br>OTLP<br>OpenTelemetry<br>OTel<br>OpenTracing<br>OpenCensus<br>OpenTelemetry Collector<br>Receiver<br>Processor<br>Exporter<br>Sampler<br>Batch processor<br>Resource<br>Resource attributes<br>Semantic conventions<br>Instrumentation library<br>Auto instrumentation<br>Manual instrumentation<br>W3C Trace Context<br>traceparent<br>tracestate<br>B3 propagation<br>Jaeger<br>Zipkin<br>Tempo<br>Lightstep<br>Honeycomb<br>X Ray<br>APM<br>Application Performance Monitoring<br>RUM<br>Real User Monitoring<br>Synthetic transactions<br>Service map<br>Dependency graph<br>Flame graph<br>Profiling<br>Continuous profiling<br>eBPF<br>PagerDuty<br>Opsgenie<br>VictorOps<br>incidentio<br>Status page<br>StatusPage<br>Incident channel<br>Runbook automation<br>ChatOps<br>Slack<br>Teams<br>Zoom bridge<br>Incident bot<br>Circuit breaker<br>Bulkhead<br>Timeout<br>Retry<br>Exponential backoff<br>Jitter<br>Token bucket<br>Leaky bucket<br>Load shedding<br>Backpressure<br>Container<br>Container runtime<br>OCI<br>containerd<br>Image<br>Container image<br>Image registry<br>Kubernetes<br>Cluster<br>Control plane<br>API server<br>etcd<br>Scheduler<br>Node<br>Kubelet<br>Pod<br>Deployment<br>StatefulSet<br>DaemonSet<br>Service<br>Ingress<br>Namespace<br>ConfigMap<br>Secret<br>ServiceAccount<br>PersistentVolume PV<br>PersistentVolumeClaim PVC<br>StorageClass<br>CNI<br>CSI<br>Horizontal Pod Autoscaler HPA<br>Cluster Autoscaler<br>Pod disruption budget PDB<br>CustomResourceDefinition CRD<br>Operator<br>Admission controller<br>Helm<br>Helm chart<br>Kustomize<br>Service mesh<br>Sidecar<br>Envoy<br>Istio<br>Linkerd<br>CI<br>Continuous Integration<br>CD<br>Continuous Delivery<br>Continuous Deployment<br>Pipeline<br>GitOps<br>Argo CD<br>Flux<br>Jenkins<br>GitHub Actions<br>Infrastructure as Code IaC<br>Terraform<br>AWS CloudFormation<br>Pulumi<br>Ansible<br>Message queue<br>Kafka<br>Topic<br>Partition Kafka<br>Consumer group<br>Dead letter queue DLQ<br>Disaster recovery DR<br>RTO<br>RPO<br>Active active<br>Active passive<br>Multi region<br>Multi AZ<br>AWS<br>EC2<br>S3<br>EBS<br>EFS<br>RDS<br>Aurora<br>DynamoDB<br>ElastiCache<br>EKS<br>ECS<br>Fargate<br>Lambda<br>API Gateway<br>CloudFront<br>CloudWatch<br>CloudWatch Logs<br>CloudTrail<br>IAM policy<br>KMS key<br>Secrets Manager<br>SSM Parameter Store<br>Route 53<br>VPC Flow Logs<br>WAF AWS<br>Shield<br>Auto Scaling Group ASG<br>Elastic Load Balancing ELB<br>ALB<br>NLB<br>EventBridge<br>SQS<br>SNS<br>Kinesis<br>OpenSearch Service<br>GCP<br>Compute Engine<br>GKE<br>Cloud Run<br>Cloud Functions<br>App Engine<br>Cloud Storage<br>Persistent Disk<br>Cloud SQL<br>Spanner<br>Bigtable<br>Firestore<br>PubSub GCP<br>BigQuery<br>Cloud Monitoring<br>Cloud Logging<br>Cloud Trace<br>Cloud Profiler<br>Cloud IAM<br>Cloud Load Balancing<br>Cloud DNS<br>Cloud Armor<br>Azure<br>Virtual Machines Azure<br>AKS<br>Azure Functions<br>App Service<br>Blob Storage<br>Azure Files<br>Managed Disks<br>Azure SQL Database<br>Cosmos DB<br>Azure Cache for Redis<br>Service Bus<br>Event Hubs<br>Event Grid<br>Azure Monitor<br>Log Analytics<br>Application Insights<br>Azure Active Directory<br>Managed Identity<br>Key Vault<br>Virtual Network VNet<br>Network Security Group NSG<br>Application Gateway<br>Azure Firewall<br>Azure CDN<br>Datadog<br>New Relic<br>Dynatrace<br>AppDynamics<br>Elastic APM<br>Splunk Observability<br>Grafana Cloud<br>Thanos<br>Cortex<br>Mimir<br>VictoriaMetrics<br>InfluxDB<br>Telegraf<br>StatsD<br>Prometheus Remote Write<br>Graceful degradation<\/p>\n","protected":false},"excerpt":{"rendered":"<p>topicSRESite Reliability EngineeringReliability engineeringProduction engineeringOperationsDevOpsPlatform engineeringService ownershipYou build it you run itShared responsibilityProduction readiness review PRRLaunch checklistOperational excellenceReliabilityResilienceFault toleranceHigh availability [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-2129","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>SRE Keywords and Terminology - SRE School<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/sreschool.com\/blog\/sre-keywords-and-terminology\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"SRE Keywords and Terminology - SRE School\" \/>\n<meta property=\"og:description\" content=\"topicSRESite Reliability EngineeringReliability engineeringProduction engineeringOperationsDevOpsPlatform engineeringService ownershipYou build it you run itShared responsibilityProduction readiness review PRRLaunch checklistOperational excellenceReliabilityResilienceFault toleranceHigh availability [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/sreschool.com\/blog\/sre-keywords-and-terminology\/\" \/>\n<meta property=\"og:site_name\" content=\"SRE School\" \/>\n<meta property=\"article:published_time\" content=\"2026-02-16T03:45:20+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2026-02-16T03:45:21+00:00\" \/>\n<meta name=\"author\" content=\"Rajesh Kumar\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Rajesh Kumar\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/sreschool.com\/blog\/sre-keywords-and-terminology\/\",\"url\":\"https:\/\/sreschool.com\/blog\/sre-keywords-and-terminology\/\",\"name\":\"SRE Keywords and Terminology - SRE School\",\"isPartOf\":{\"@id\":\"https:\/\/sreschool.com\/blog\/#website\"},\"datePublished\":\"2026-02-16T03:45:20+00:00\",\"dateModified\":\"2026-02-16T03:45:21+00:00\",\"author\":{\"@id\":\"https:\/\/sreschool.com\/blog\/#\/schema\/person\/0ffe446f77bb2589992dbe3a7f417201\"},\"breadcrumb\":{\"@id\":\"https:\/\/sreschool.com\/blog\/sre-keywords-and-terminology\/#breadcrumb\"},\"inLanguage\":\"en\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/sreschool.com\/blog\/sre-keywords-and-terminology\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/sreschool.com\/blog\/sre-keywords-and-terminology\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/sreschool.com\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"SRE Keywords and Terminology\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/sreschool.com\/blog\/#website\",\"url\":\"https:\/\/sreschool.com\/blog\/\",\"name\":\"SRESchool\",\"description\":\"Master SRE. Build Resilient Systems. Lead the Future of Reliability\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/sreschool.com\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/sreschool.com\/blog\/#\/schema\/person\/0ffe446f77bb2589992dbe3a7f417201\",\"name\":\"Rajesh Kumar\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en\",\"@id\":\"https:\/\/sreschool.com\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/f901a4f2929fa034a291a8363d589791d5a3c1f6a051c22e744acb8bfc8e022a?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/f901a4f2929fa034a291a8363d589791d5a3c1f6a051c22e744acb8bfc8e022a?s=96&d=mm&r=g\",\"caption\":\"Rajesh Kumar\"},\"sameAs\":[\"http:\/\/sreschool.com\/blog\"],\"url\":\"https:\/\/sreschool.com\/blog\/author\/admin\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"SRE Keywords and Terminology - SRE School","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/sreschool.com\/blog\/sre-keywords-and-terminology\/","og_locale":"en_US","og_type":"article","og_title":"SRE Keywords and Terminology - SRE School","og_description":"topicSRESite Reliability EngineeringReliability engineeringProduction engineeringOperationsDevOpsPlatform engineeringService ownershipYou build it you run itShared responsibilityProduction readiness review PRRLaunch checklistOperational excellenceReliabilityResilienceFault toleranceHigh availability [&hellip;]","og_url":"https:\/\/sreschool.com\/blog\/sre-keywords-and-terminology\/","og_site_name":"SRE School","article_published_time":"2026-02-16T03:45:20+00:00","article_modified_time":"2026-02-16T03:45:21+00:00","author":"Rajesh Kumar","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Rajesh Kumar","Est. reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/sreschool.com\/blog\/sre-keywords-and-terminology\/","url":"https:\/\/sreschool.com\/blog\/sre-keywords-and-terminology\/","name":"SRE Keywords and Terminology - SRE School","isPartOf":{"@id":"https:\/\/sreschool.com\/blog\/#website"},"datePublished":"2026-02-16T03:45:20+00:00","dateModified":"2026-02-16T03:45:21+00:00","author":{"@id":"https:\/\/sreschool.com\/blog\/#\/schema\/person\/0ffe446f77bb2589992dbe3a7f417201"},"breadcrumb":{"@id":"https:\/\/sreschool.com\/blog\/sre-keywords-and-terminology\/#breadcrumb"},"inLanguage":"en","potentialAction":[{"@type":"ReadAction","target":["https:\/\/sreschool.com\/blog\/sre-keywords-and-terminology\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/sreschool.com\/blog\/sre-keywords-and-terminology\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/sreschool.com\/blog\/"},{"@type":"ListItem","position":2,"name":"SRE Keywords and Terminology"}]},{"@type":"WebSite","@id":"https:\/\/sreschool.com\/blog\/#website","url":"https:\/\/sreschool.com\/blog\/","name":"SRESchool","description":"Master SRE. Build Resilient Systems. Lead the Future of Reliability","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/sreschool.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en"},{"@type":"Person","@id":"https:\/\/sreschool.com\/blog\/#\/schema\/person\/0ffe446f77bb2589992dbe3a7f417201","name":"Rajesh Kumar","image":{"@type":"ImageObject","inLanguage":"en","@id":"https:\/\/sreschool.com\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/f901a4f2929fa034a291a8363d589791d5a3c1f6a051c22e744acb8bfc8e022a?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/f901a4f2929fa034a291a8363d589791d5a3c1f6a051c22e744acb8bfc8e022a?s=96&d=mm&r=g","caption":"Rajesh Kumar"},"sameAs":["http:\/\/sreschool.com\/blog"],"url":"https:\/\/sreschool.com\/blog\/author\/admin\/"}]}},"_links":{"self":[{"href":"https:\/\/sreschool.com\/blog\/wp-json\/wp\/v2\/posts\/2129","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/sreschool.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/sreschool.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/sreschool.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/sreschool.com\/blog\/wp-json\/wp\/v2\/comments?post=2129"}],"version-history":[{"count":1,"href":"https:\/\/sreschool.com\/blog\/wp-json\/wp\/v2\/posts\/2129\/revisions"}],"predecessor-version":[{"id":2130,"href":"https:\/\/sreschool.com\/blog\/wp-json\/wp\/v2\/posts\/2129\/revisions\/2130"}],"wp:attachment":[{"href":"https:\/\/sreschool.com\/blog\/wp-json\/wp\/v2\/media?parent=2129"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/sreschool.com\/blog\/wp-json\/wp\/v2\/categories?post=2129"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/sreschool.com\/blog\/wp-json\/wp\/v2\/tags?post=2129"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}