Posted 4 hours ago
Job Description
Responsibilities
• Implement monitoring, alerting and incident response
• Automate operational processes using Terraform and Ansible
• Collaborate with development teams on SLOs and SLAs
• Drive post-incident reviews and continuous improvement
Requirements
• Proficiency with AWS or GCP services
• Strong scripting skills (Python, Bash)
• Experience with Kubernetes and Helm
• Knowledge of observability tools (Prometheus, Grafana, Datadog)
Login Required to Apply
To ensure a secure application process and track your progress, please create a free candidate account or sign in.
Similar Opportunities
Cloud Infrastructure Architect
HP Connect Ltd • Edinburgh, UK