Razer Jobs

Senior Site Reliability Engineer

Razer

Senior Site Reliability Engineer

Reposted 14 Days Ago

Be an Early Applicant

In-Office or Remote

Hiring Remotely in South

Senior level

In-Office or Remote

Hiring Remotely in South

Senior level

The Senior Site Reliability Engineer will design and maintain Infrastructure as Code solutions, enhance cloud infrastructure, lead incident responses, and mentor junior engineers.

The summary above was generated by AI

Joining Razer will place you on a global mission to revolutionize the way the world games. Razer is a place to do great work, offering you the opportunity to make an impact globally while working across a global team located across 5 continents. Razer is also a great place to work, providing you the unique, gamer-centric #LifeAtRazer experience that will put you in an accelerated growth, both personally and professionally.

Job Responsibilities :

We are seeking a skilled and driven Senior Site Reliability Engineer (SRE) to join our growing infrastructure and platform engineering team. The ideal candidate will have hands-on experience in Amazon Web Services (AWS), strong troubleshooting capabilities, and a passion for building scalable, observable, and resilient systems using modern Infrastructure as Code (IaC) and automation tools.

REQUIREMENTS:

Bachelor’s degree in Computer Science, Software Engineering, Information Technology, or a related field.
Minimum 3 years of experience in SRE, DevOps, cloud infrastructure, or system administration roles.
Hands-on expertise with AWS Cloud Services, including:
Compute & Containerization: EC2, Lambda, ECS, EKS, Auto Scaling
Networking: Load Balancers, VPC, Route 53, Security Groups, Firewalls
Storage & Databases: RDS, ElastiCache, Athena, S3
Messaging: SQS, SES
Deep understanding of Infrastructure as Code (IaC) tools such as Terraform and CloudFormation.
Proficiency in at least one programming/scripting language: Python, Node.js, Bash, Ruby, or related.
Experience operating and troubleshooting across Linux, Windows, and container-based environments.
Strong understanding of distributed systems, cloud networking (routers, switches), firewalls, DNS, and HTTP/TLS.
Experience implementing monitoring and alerting systems and working with incident management processes.
Experience with Zero Downtime Deployments, blue/green or canary deployments.
Familiarity with cost optimization and right-sizing AWS resources.
Exposure to multi-region, multi-account AWS architecture.
Understanding of API gateway, or edge networking (e.g., Akamai, CloudFront).

JOB DESCRIPTION:

Design, implement, and maintain Infrastructure as Code (IaC) solutions using Terraform and/or CloudFormation across multi-account AWS environments.
Collaborate with developers, architects, and DevOps teams to build scalable, secure, and observable cloud infrastructure.
Lead and participate in architecture design sessions, focusing on system reliability, scalability, security, and performance.
Implement and manage robust monitoring, alerting, and observability solutions (e.g., CloudWatch, Prometheus, ELK, Datadog).
Set and monitor Key Performance Indicators (KPIs) for system uptime, latency, throughput, and overall reliability.
Drive incident response processes, including coordination, triaging, resolution, documentation, and post-incident reviews (PIRs).
Supervise and mentor junior SREs and infrastructure engineers, fostering knowledge-sharing and team growth.
Collaborate across development, operations, and security teams to ensure secure and compliant deployments.
Automate manual tasks and workflows through scripting and tooling (Python, Node.js, Bash, Ruby, JSON/YAML).
Troubleshoot complex infrastructure issues across Linux, Windows, Docker, and cloud-native environments.
Provide IaC and CI/CD best practices to ensure repeatability, scalability, and compliance across all environments.
Provide on-call support, participate in incident rotations, and lead technical investigations during outages or degradations.
Strong understanding and experience for Disaster Recovery (DR).
Provide support and solution handling to incident and tickets assigned.

Pre-Requisites :

Razer is proud to be an Equal Opportunity Employer. We believe that diverse teams drive better ideas, better products, and a stronger culture. We are committed to providing an inclusive, respectful, and fair workplace for every employee across all the countries we operate in. We do not discriminate on the basis of race, ethnicity, colour, nationality, ancestry, religion, age, sex, sexual orientation, gender identity or expression, disability, marital status, or any other characteristic protected under local laws. Where needed, we provide reasonable accommodations - including for disability or religious practices - to ensure every team member can perform and contribute at their best.

Are you game?

Similar Jobs

Razer

Site Reliability Engineer

Yesterday

In-Office or Remote

Mid level

Gaming • Hardware

Seeking a Senior Site Reliability Engineer to design and manage AWS infrastructure, implement IaC, enhance reliability, and improve monitoring systems.

Top Skills: AthenaAWSAws CloudformationBashEc2EcsEksElasticacheElkGrafanaLambdaNode.jsPrometheusPythonRdsRubyS3SesSqsTerraform

Astro (astro.com)

Intern, Affiliate Marketing

2 Days Ago

Remote

Internship

News + Entertainment

Assist in proposal design, product catalog management, operational tasks, performance reporting, and creative input for affiliate marketing campaigns.

Top Skills: CanvaExcelGoogle SheetsKeynotePowerPoint

Razer

Data Engineer

2 Days Ago

Remote

Internship

Gaming • Hardware

The Data Engineer will design and maintain data pipelines, develop cloud-native solutions, and collaborate with stakeholders to enhance data governance and quality.

Top Skills: AirflowAWSDbtGitLinuxPythonSQL

What you need to know about the Sydney Tech Scene

From opera to comedy shows, the Sydney Opera House hosts more than 1,600 performances a year, yet its entertainment sector isn't the only one taking center stage. The city's tech sector has earned a reputation as one of the fastest-growing in the region. More specifically, its IT sector stands out as the country's third-largest, growing at twice the rate of overall employment in the past decade as businesses continue to digitize their operations to stay competitive.