Job Description

Do you like collaborating across teams to solve complex problems?

Do you enjoy building and maintaining cloud compute platforms?

Join our Compute Site Reliability team!

Our team is responsible for monitoring and measuring the reliability of our Cloud Native products. In collaboration with Engineering and Product teams, our focus is improving performance and reliability.

Partner with the best

As a Senior SRE specializing in Kubernetes and DevOps, you'll play a critical role in maintaining and enhancing our Cloud Native Product line and driving the adoption of best practices. You'll work closely with cross-functional teams to build, scale, and manage our Cloud Native offering, while fostering a culture of automation and collaboration.

As a Site Reliability Engineer Senior, you will be responsible for:

  • Managing and maintaining Kubernetes clusters, ensuring high availability and optimal performance
  • Developing and maintaining Infrastructure as Code (IaC) scripts and templates using tools like Terraform or Helm
  • Collaborating to integrate DevOps practices into the SDLC, including CI/CD pipelines, automated testing, and deployment automation.
  • Setting up and Configuring the monitoring and alerting systems for Kubernetes clusters
  • Optimizing application deployment processes and enhancing the Kubernetes infrastructure to improve reliability and reduce downtime
  • Creating and maintain comprehensive documentation related to Kubernetes configurations, DevOps processes, procedures, and troubleshooting guides
  • Working closely with cross-functional teams to align infrastructure requirements with application needs

Do what you love

To be successful in this role you will:

  • Have relevant experience and a Bachelor's diploma in Computer Science, Engineering, or related field
  • Have experience as a Site Reliability Engineer or similar role with a focus on Kubernetes and DevOps
  • Possess knowledge of Kubernetes architecture, administration, and DevOps best practices, and containerization technologies (i.e. Docker)
  • Have scripting and automation experience using languages like Python, Bash, or Go and IaC tools such as Terraform, Helm or Ansible
  • Have experience with cloud platforms (e.g., AWS, Azure, Linode GCP)
  • Demonstrate solid communication, teamwork and problem-solving skills and have a proactive approach to identifying and mitigating risks

Work in a way that works for you

FlexBase, Akamai's Global Flexible Working Program, is based on the principles that are helping us create the best workplace in the world. When our colleagues said that flexible working was important to them, we listened. We also know flexible working is important to many of the incredible people considering joining Akamai. FlexBase, gives 95% of employees the choice to work from their home, their office, or both (in the country advertised). This permanent workplace flexibility program is consistent and fair globally, to help us find incredible talent, virtually anywhere. We are happy to discuss working options for this role and encourage you to speak with your recruiter in more detail when you apply.

Learn what makes Akamai a great place to work

Connect with us on social and see what life at Akamai is like!

We power and protect life online, by solving the toughest challenges, together.

At Akamai, we're curious, innovative, collaborative and tenacious. We celebrate diversity of thought and we hold an unwavering belief that we can make a meaningful difference. Our teams use their global perspectives to put customers at the forefront of everything they do, so if you are people-centric, you'll thrive here.

Working for you

At Akamai, we will provide you with opportunities to grow, flourish, and achieve great things. Our benefit options are designed to meet your individual needs for today and in the future. We provide benefits surrounding all aspects of your life:

  • Your health
  • Your finances
  • Your family
  • Your time at work
  • Your time pursuing other endeavors

Our benefit plan options are designed to meet your individual needs and budget, both today and in the future.

About us

Akamai powers and protects life online. Leading companies worldwide choose Akamai to build, deliver, and secure their digital experiences helping billions of people live, work, and play every day. With the world's most distributed compute platform from cloud to edge we make it easy for customers to develop and run applications, while we keep experiences closer to users and threats farther away.

Join us

Are you seeking an opportunity to make a real difference in a company with a global reach and exciting services and clients? Come join us and grow with a team of people who will energize and inspire you!

#LI-Remote

Is a Remote Job?
Remote

We power and protect life online. Global companies trust us to build, deliver, and secure digital experiences — helping billions to live, work, and play online. Akamai’s intelligent edge platform...

Apply Now