Senior Site Reliability Engineer (Remote)

Remote-Global
Posted 3 hours, 29 minutes ago
Engineering

About the role

Job summary

As a Senior Site Reliability Engineer, you will be responsible for ensuring the operational excellence and infrastructure strategy of a platform designed for AI integration in HR and Finance. You will collaborate with various teams to maintain a reliable, secure, and scalable infrastructure.

Qualifications

  • Proven experience in Site Reliability Engineering, DevOps Engineering, or SysOps roles, with a track record of managing production systems at scale.
  • Deep hands-on experience with Kubernetes and AWS, including compute, networking, storage, and managed services.
  • Proficiency in Infrastructure-as-Code tools such as Terraform.
  • Experience with CI/CD tools like GitLab, GitHub Actions, or Jenkins.
  • Strong bash scripting skills and familiarity with Linux system-level issues.
  • Excellent communication skills for conveying complex infrastructure concepts to technical and non-technical stakeholders.

Responsibilities

  • Design, implement, and maintain infrastructure-as-code patterns using Terraform and Kubernetes.
  • Build and maintain monitoring, logging, and alerting systems; lead incident response and post-mortems.
  • Collaborate with the Security team to integrate security into the infrastructure and ensure compliance across multiple jurisdictions.
  • Optimize system performance, resource utilization, and cloud costs.
  • Identify and eliminate manual operational tasks, creating tools for efficient team operations.
  • Work with platform teams to enhance the reliability and observability of APIs and other services.

Skills

  • Experience with backend programming languages such as Elixir, Python, Go, Java, or Node.js is a plus.
  • Familiarity with observability tools like Datadog, Prometheus, ELK, or Grafana.
  • Experience in consultancy settings and with multi-tenant platforms is advantageous.

Education

  • Relevant degree or equivalent experience in a related field is preferred.

Tools

  • Terraform, Kubernetes, AWS, GitLab, GitHub Actions, Jenkins, and various observability tools.
Full Access

Ready to apply for this role?

Full Access gives you the company name, full job description, and a direct link to apply. The summary above helps you explore the role.

Share this job