{"id":55,"date":"2025-04-28T07:49:38","date_gmt":"2025-04-28T07:49:38","guid":{"rendered":"https:\/\/sreschool.com\/blog\/?p=55"},"modified":"2025-04-28T07:51:10","modified_gmt":"2025-04-28T07:51:10","slug":"auto-remediation","status":"publish","type":"post","link":"https:\/\/sreschool.com\/blog\/auto-remediation\/","title":{"rendered":"Auto Remediation \u2013 Building Self-Healing Systems via Automation"},"content":{"rendered":"\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h2 class=\"wp-block-heading\">\ud83d\udd39 Part 1: Introduction \u2013 What is Auto Remediation?<\/h2>\n\n\n\n<p><strong>Auto Remediation<\/strong> refers to a system\u2019s ability to <strong>detect<\/strong> an issue and <strong>resolve<\/strong> it <strong>automatically<\/strong> without human intervention.<\/p>\n\n\n\n<p>\ud83d\udc49 <strong>Examples:<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Restarting a failed pod in Kubernetes.<\/li>\n\n\n\n<li>Rebooting an unhealthy server in AWS using CloudWatch alarms and Lambda.<\/li>\n\n\n\n<li>Replacing a bad disk in cloud storage via monitoring alerts.<\/li>\n<\/ul>\n\n\n\n<p><strong>Goal:<\/strong> Reduce downtime, improve system reliability, and save human effort.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h2 class=\"wp-block-heading\">\ud83d\udd39 Part 2: Why Auto Remediation? (Real-world Importance)<\/h2>\n\n\n\n<figure class=\"wp-block-table\"><table class=\"has-fixed-layout\"><thead><tr><th>Without Auto Remediation<\/th><th>With Auto Remediation<\/th><\/tr><\/thead><tbody><tr><td>Manual detection and fixes<\/td><td>Automated detection and fixes<\/td><\/tr><tr><td>Higher downtime<\/td><td>Lower downtime<\/td><\/tr><tr><td>Increased operational costs<\/td><td>Optimized costs<\/td><\/tr><tr><td>Human errors possible<\/td><td>Fewer human errors<\/td><\/tr><tr><td>Stressful on-call duty<\/td><td>Happier DevOps\/SRE teams<\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<p>\ud83d\udc49 Companies like Netflix, Google, AWS heavily depend on <strong>auto remediation<\/strong> for <strong>self-healing<\/strong>.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h2 class=\"wp-block-heading\">\ud83d\udd39 Part 3: Core Concepts to Understand First<\/h2>\n\n\n\n<p>Before diving into building, understand these basics:<\/p>\n\n\n\n<figure class=\"wp-block-table\"><table class=\"has-fixed-layout\"><thead><tr><th>Concept<\/th><th>Explanation<\/th><\/tr><\/thead><tbody><tr><td><strong>Monitoring<\/strong><\/td><td>Continuously check the health of systems.<\/td><\/tr><tr><td><strong>Alerting<\/strong><\/td><td>Notifying when something unusual happens.<\/td><\/tr><tr><td><strong>Incident Detection<\/strong><\/td><td>Identifying a deviation from normal behavior.<\/td><\/tr><tr><td><strong>Runbooks<\/strong><\/td><td>Step-by-step guides to fix common incidents manually.<\/td><\/tr><tr><td><strong>Automation Tools<\/strong><\/td><td>Scripts, workflows, or platforms that can trigger actions.<\/td><\/tr><tr><td><strong>State Management<\/strong><\/td><td>Knowing if a system is healthy, degraded, or failed.<\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h2 class=\"wp-block-heading\">\ud83d\udd39 Part 4: Basic Auto Remediation Example \u2013 Restarting a Service<\/h2>\n\n\n\n<p>Let&#8217;s start simple.<\/p>\n\n\n\n<p><strong>Scenario:<\/strong> Restart Apache web server automatically if it stops.<\/p>\n\n\n\n<p><strong>Steps:<\/strong><\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li><strong>Set Up Monitoring<\/strong> (Example: Using <code>monit<\/code>)<\/li>\n<\/ol>\n\n\n\n<pre class=\"wp-block-code\"><code>sudo apt install monit\n<\/code><\/pre>\n\n\n\n<ol start=\"2\" class=\"wp-block-list\">\n<li><strong>Configure monit<\/strong> to watch Apache:<\/li>\n<\/ol>\n\n\n\n<pre class=\"wp-block-code\"><code># \/etc\/monit\/conf-enabled\/apache\ncheck process apache2 with pidfile \/var\/run\/apache2\/apache2.pid\n    start program = \"\/etc\/init.d\/apache2 start\"\n    stop program  = \"\/etc\/init.d\/apache2 stop\"\n    if failed port 80 protocol http then restart\n<\/code><\/pre>\n\n\n\n<ol start=\"3\" class=\"wp-block-list\">\n<li><strong>Start monit service:<\/strong><\/li>\n<\/ol>\n\n\n\n<pre class=\"wp-block-code\"><code>sudo systemctl restart monit\nsudo monit status\n<\/code><\/pre>\n\n\n\n<p><strong>\u2705 Result:<\/strong> If Apache crashes, Monit detects and restarts it!<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h2 class=\"wp-block-heading\">\ud83d\udd39 Part 5: Intermediate Auto Remediation \u2013 Cloud Resource Healing (AWS Example)<\/h2>\n\n\n\n<p><strong>Scenario:<\/strong> If an EC2 instance becomes <strong>unhealthy<\/strong>, auto-replace it.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\">Step-by-Step:<\/h3>\n\n\n\n<p><strong>1. Monitoring:<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Set up <strong>CloudWatch Health Checks<\/strong> on EC2.<\/li>\n<\/ul>\n\n\n\n<p><strong>2. Alerting:<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Create an <strong>Alarm<\/strong>: If instance status = impaired for more than 5 minutes.<\/li>\n<\/ul>\n\n\n\n<p><strong>3. Remediation Trigger:<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Use <strong>AWS SNS<\/strong> to publish an alert.<\/li>\n<\/ul>\n\n\n\n<p><strong>4. Automation:<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Attach a <strong>Lambda Function<\/strong> to SNS that:\n<ul class=\"wp-block-list\">\n<li>Terminates the unhealthy EC2 instance.<\/li>\n\n\n\n<li>Launches a new instance using an <strong>Auto Scaling Group<\/strong>.<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n\n\n\n<pre class=\"wp-block-code\"><code># Example (Lambda Python code)\nimport boto3\n\ndef lambda_handler(event, context):\n    ec2 = boto3.client('ec2')\n    instance_id = event&#91;'detail']&#91;'instance-id']\n    \n    ec2.terminate_instances(InstanceIds=&#91;instance_id])\n    print(f\"Terminated {instance_id} successfully\")\n<\/code><\/pre>\n\n\n\n<p>\u2705 <strong>Result:<\/strong> EC2 automatically replaced.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h2 class=\"wp-block-heading\">\ud83d\udd39 Part 6: Building a Self-Healing Kubernetes Cluster<\/h2>\n\n\n\n<p><strong>Scenario:<\/strong> If a Pod crashes or goes into CrashLoopBackOff, it should self-heal.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<p><strong>1. Kubernetes already self-heals pods<\/strong> via <strong>Deployments<\/strong> and <strong>ReplicaSets<\/strong>.<\/p>\n\n\n\n<p>\u2705 Example:<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>apiVersion: apps\/v1\nkind: Deployment\nmetadata:\n  name: nginx\nspec:\n  replicas: 2\n  selector:\n    matchLabels:\n      app: nginx\n  template:\n    metadata:\n      labels:\n        app: nginx\n    spec:\n      containers:\n      - name: nginx\n        image: nginx:latest\n<\/code><\/pre>\n\n\n\n<ul class=\"wp-block-list\">\n<li>If a pod crashes, ReplicaSet ensures another one takes its place.<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<p><strong>2. Advanced Auto Remediation: Using Kubernetes Operators<\/strong><\/p>\n\n\n\n<p>\ud83d\udc49 Build a <strong>Custom Operator<\/strong> using <strong>Operator SDK<\/strong>.<\/p>\n\n\n\n<p>Example Use Case:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Detect custom application crash (not just pod crash).<\/li>\n\n\n\n<li>Perform custom logic (like cleaning temp files before restart).<\/li>\n<\/ul>\n\n\n\n<p>Tools:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Kubernetes Operator SDK<\/strong><\/li>\n\n\n\n<li><strong>Prometheus Operator<\/strong><\/li>\n\n\n\n<li><strong>Argo Rollouts<\/strong> for progressive delivery and automatic rollback.<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h2 class=\"wp-block-heading\">\ud83d\udd39 Part 7: Advanced Auto Remediation Design Patterns<\/h2>\n\n\n\n<h3 class=\"wp-block-heading\">1. <strong>Self-Healing Infrastructure<\/strong><\/h3>\n\n\n\n<figure class=\"wp-block-table\"><table class=\"has-fixed-layout\"><thead><tr><th>Component<\/th><th>Role<\/th><\/tr><\/thead><tbody><tr><td>Monitoring (Prometheus, Datadog)<\/td><td>Detect anomalies<\/td><\/tr><tr><td>Alerting (PagerDuty, Opsgenie)<\/td><td>Raise alarms<\/td><\/tr><tr><td>Orchestration (AWS Lambda, Azure Functions)<\/td><td>Take automated actions<\/td><\/tr><tr><td>Runbooks (Pre-written automation scripts)<\/td><td>Codify incident resolution<\/td><\/tr><tr><td>Auto Rollback (ArgoCD, Spinnaker)<\/td><td>Rollback bad deployments<\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\">2. <strong>Event-Driven Healing<\/strong><\/h3>\n\n\n\n<p>Instead of periodic checks, <strong>event-driven<\/strong> systems react instantly.<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Example: Use <strong>AWS EventBridge<\/strong> rules + Lambda automation.<\/li>\n\n\n\n<li>Example: Kubernetes <strong>Admission Controllers<\/strong> that reject unhealthy pods at runtime.<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\">3. <strong>AIOps + Machine Learning Based Healing<\/strong><\/h3>\n\n\n\n<p>Future of Auto Remediation:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Systems predict upcoming failures using AI\/ML.<\/li>\n\n\n\n<li>Pre-emptively heal or scale resources.<\/li>\n<\/ul>\n\n\n\n<p>Examples:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Dynatrace Davis AI<\/li>\n\n\n\n<li>IBM Watson AIOps<\/li>\n\n\n\n<li>Moogsoft<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h2 class=\"wp-block-heading\">\ud83d\udd39 Part 8: Best Practices for Auto Remediation<\/h2>\n\n\n\n<figure class=\"wp-block-table\"><table class=\"has-fixed-layout\"><thead><tr><th>Best Practice<\/th><th>Why It\u2019s Important<\/th><\/tr><\/thead><tbody><tr><td>Always verify before fixing<\/td><td>Avoid false positive actions<\/td><\/tr><tr><td>Start small and simple<\/td><td>Don\u2019t automate everything at once<\/td><\/tr><tr><td>Keep audit logs<\/td><td>For tracking automatic changes<\/td><\/tr><tr><td>Build safe rollbacks<\/td><td>In case auto-fix worsens things<\/td><\/tr><tr><td>Design idempotent remediations<\/td><td>Running the same fix multiple times shouldn&#8217;t break anything<\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h2 class=\"wp-block-heading\">\ud83d\udd39 Part 9: Tools &amp; Frameworks You Should Learn<\/h2>\n\n\n\n<figure class=\"wp-block-table\"><table class=\"has-fixed-layout\"><thead><tr><th>Tool<\/th><th>Purpose<\/th><\/tr><\/thead><tbody><tr><td>AWS Lambda \/ Azure Functions<\/td><td>Serverless automation<\/td><\/tr><tr><td>Kubernetes Operator SDK<\/td><td>Custom remediation controllers<\/td><\/tr><tr><td>PagerDuty Rundeck<\/td><td>Incident-driven workflows<\/td><\/tr><tr><td>Prometheus<\/td><td>Monitoring and alerting<\/td><\/tr><tr><td>Terraform + AWS Config Rules<\/td><td>Infrastructure compliance and healing<\/td><\/tr><tr><td>Ansible \/ Chef<\/td><td>Config-based auto remediation<\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h2 class=\"wp-block-heading\">\ud83d\udd39 Part 10: Hands-on Mini Project \ud83d\udee0\ufe0f<\/h2>\n\n\n\n<p>\ud83c\udfaf <strong>Build your first auto remediation project:<\/strong><\/p>\n\n\n\n<p>\u2705 Scenario:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Monitor a Linux server.<\/li>\n\n\n\n<li>If disk usage > 90%, automatically clean temp files.<\/li>\n\n\n\n<li>If not resolved within 5 minutes, send an escalation alert via email.<\/li>\n<\/ul>\n\n\n\n<p>\u2705 Hints:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Use <code>cron<\/code> + <code>bash<\/code> scripts initially.<\/li>\n\n\n\n<li>Later automate via AWS Systems Manager Automation Documents (SSM Documents).<\/li>\n\n\n\n<li>Implement escalation policy via SNS and Lambda.<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h1 class=\"wp-block-heading\">\u2705 Conclusion<\/h1>\n\n\n\n<p>Auto Remediation is not magic. It\u2019s a structured practice:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Detect<\/strong> \u2192 <strong>Decide<\/strong> \u2192 <strong>Act<\/strong> \u2192 <strong>Audit<\/strong><\/li>\n<\/ul>\n\n\n\n<p>With the right monitoring, alerting, and automation setup, you can build <strong>highly resilient<\/strong>, <strong>self-healing<\/strong> systems that minimize downtime and human intervention.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>\ud83d\udd39 Part 1: Introduction \u2013 What is Auto Remediation? Auto Remediation refers to a system\u2019s ability to detect an issue [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-55","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Auto Remediation \u2013 Building Self-Healing Systems via Automation - SRE School<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/sreschool.com\/blog\/auto-remediation\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Auto Remediation \u2013 Building Self-Healing Systems via Automation - SRE School\" \/>\n<meta property=\"og:description\" content=\"\ud83d\udd39 Part 1: Introduction \u2013 What is Auto Remediation? Auto Remediation refers to a system\u2019s ability to detect an issue [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/sreschool.com\/blog\/auto-remediation\/\" \/>\n<meta property=\"og:site_name\" content=\"SRE School\" \/>\n<meta property=\"article:published_time\" content=\"2025-04-28T07:49:38+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-04-28T07:51:10+00:00\" \/>\n<meta name=\"author\" content=\"Rajesh Kumar\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Rajesh Kumar\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/sreschool.com\/blog\/auto-remediation\/\",\"url\":\"https:\/\/sreschool.com\/blog\/auto-remediation\/\",\"name\":\"Auto Remediation \u2013 Building Self-Healing Systems via Automation - SRE School\",\"isPartOf\":{\"@id\":\"https:\/\/sreschool.com\/blog\/#website\"},\"datePublished\":\"2025-04-28T07:49:38+00:00\",\"dateModified\":\"2025-04-28T07:51:10+00:00\",\"author\":{\"@id\":\"https:\/\/sreschool.com\/blog\/#\/schema\/person\/0ffe446f77bb2589992dbe3a7f417201\"},\"breadcrumb\":{\"@id\":\"https:\/\/sreschool.com\/blog\/auto-remediation\/#breadcrumb\"},\"inLanguage\":\"en\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/sreschool.com\/blog\/auto-remediation\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/sreschool.com\/blog\/auto-remediation\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/sreschool.com\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Auto Remediation \u2013 Building Self-Healing Systems via Automation\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/sreschool.com\/blog\/#website\",\"url\":\"https:\/\/sreschool.com\/blog\/\",\"name\":\"SRESchool\",\"description\":\"Master SRE. Build Resilient Systems. Lead the Future of Reliability\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/sreschool.com\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/sreschool.com\/blog\/#\/schema\/person\/0ffe446f77bb2589992dbe3a7f417201\",\"name\":\"Rajesh Kumar\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en\",\"@id\":\"https:\/\/sreschool.com\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/f901a4f2929fa034a291a8363d589791d5a3c1f6a051c22e744acb8bfc8e022a?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/f901a4f2929fa034a291a8363d589791d5a3c1f6a051c22e744acb8bfc8e022a?s=96&d=mm&r=g\",\"caption\":\"Rajesh Kumar\"},\"sameAs\":[\"http:\/\/sreschool.com\/blog\"],\"url\":\"https:\/\/sreschool.com\/blog\/author\/admin\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Auto Remediation \u2013 Building Self-Healing Systems via Automation - SRE School","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/sreschool.com\/blog\/auto-remediation\/","og_locale":"en_US","og_type":"article","og_title":"Auto Remediation \u2013 Building Self-Healing Systems via Automation - SRE School","og_description":"\ud83d\udd39 Part 1: Introduction \u2013 What is Auto Remediation? Auto Remediation refers to a system\u2019s ability to detect an issue [&hellip;]","og_url":"https:\/\/sreschool.com\/blog\/auto-remediation\/","og_site_name":"SRE School","article_published_time":"2025-04-28T07:49:38+00:00","article_modified_time":"2025-04-28T07:51:10+00:00","author":"Rajesh Kumar","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Rajesh Kumar","Est. reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/sreschool.com\/blog\/auto-remediation\/","url":"https:\/\/sreschool.com\/blog\/auto-remediation\/","name":"Auto Remediation \u2013 Building Self-Healing Systems via Automation - SRE School","isPartOf":{"@id":"https:\/\/sreschool.com\/blog\/#website"},"datePublished":"2025-04-28T07:49:38+00:00","dateModified":"2025-04-28T07:51:10+00:00","author":{"@id":"https:\/\/sreschool.com\/blog\/#\/schema\/person\/0ffe446f77bb2589992dbe3a7f417201"},"breadcrumb":{"@id":"https:\/\/sreschool.com\/blog\/auto-remediation\/#breadcrumb"},"inLanguage":"en","potentialAction":[{"@type":"ReadAction","target":["https:\/\/sreschool.com\/blog\/auto-remediation\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/sreschool.com\/blog\/auto-remediation\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/sreschool.com\/blog\/"},{"@type":"ListItem","position":2,"name":"Auto Remediation \u2013 Building Self-Healing Systems via Automation"}]},{"@type":"WebSite","@id":"https:\/\/sreschool.com\/blog\/#website","url":"https:\/\/sreschool.com\/blog\/","name":"SRESchool","description":"Master SRE. Build Resilient Systems. Lead the Future of Reliability","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/sreschool.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en"},{"@type":"Person","@id":"https:\/\/sreschool.com\/blog\/#\/schema\/person\/0ffe446f77bb2589992dbe3a7f417201","name":"Rajesh Kumar","image":{"@type":"ImageObject","inLanguage":"en","@id":"https:\/\/sreschool.com\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/f901a4f2929fa034a291a8363d589791d5a3c1f6a051c22e744acb8bfc8e022a?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/f901a4f2929fa034a291a8363d589791d5a3c1f6a051c22e744acb8bfc8e022a?s=96&d=mm&r=g","caption":"Rajesh Kumar"},"sameAs":["http:\/\/sreschool.com\/blog"],"url":"https:\/\/sreschool.com\/blog\/author\/admin\/"}]}},"_links":{"self":[{"href":"https:\/\/sreschool.com\/blog\/wp-json\/wp\/v2\/posts\/55","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/sreschool.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/sreschool.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/sreschool.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/sreschool.com\/blog\/wp-json\/wp\/v2\/comments?post=55"}],"version-history":[{"count":3,"href":"https:\/\/sreschool.com\/blog\/wp-json\/wp\/v2\/posts\/55\/revisions"}],"predecessor-version":[{"id":58,"href":"https:\/\/sreschool.com\/blog\/wp-json\/wp\/v2\/posts\/55\/revisions\/58"}],"wp:attachment":[{"href":"https:\/\/sreschool.com\/blog\/wp-json\/wp\/v2\/media?parent=55"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/sreschool.com\/blog\/wp-json\/wp\/v2\/categories?post=55"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/sreschool.com\/blog\/wp-json\/wp\/v2\/tags?post=55"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}