There are NO limits to your career: come shape the future and be part of a truly unique global culture at OutSystems!
Hybrid Onsite in Menlo Park, CA
Site Reliability Engineering Function
Site Reliability Engineering (SRE) is a discipline that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals of SRE are to create scalable and highly reliable systems. Our SREs ensure our production systems' reliability, performance, and scalability while enabling rapid development and deployment of new features and services.
SREs at OutSystems work closely with development teams, acting as an extension of the team, in adopting the reliability tenets with the shared goal of meeting Service Level Objectives (SLOs) and thus delivering a smooth and frictionless Customer Experience.
Site Reliability Engineer Role
As an SRE at OutSystems here are your key responsibilities and duties:
Lead and onboard services and teams to the reliability tenets;
Establish and maintain Service Level Objectives (SLOs) and Service Level Agreements (SLAs);
Design and implement scalable, reliable, and secure infrastructure, while ensuring cloud-native best practices;
Collaborate with software development teams to ensure systems are resilient (observable, fault-tolerant, recoverable, scalable) and performant;
Implement monitoring, alerting, logging, and tracing solutions to detect and respond to incidents;
Lead incident response efforts, ensuring quick resolution and minimal downtime, and conduct RCA/post-mortems;
Automate every operational task, with a special focus on fast incident detection & recovery;
Programming in Python supported by Gen AI tooling to accelerate development of mission critical automation and tools.
Foster a culture of continuous improvement and knowledge sharing;
Communicate effectively with stakeholders, providing updates on system reliability and performance;
Participate in on-call rotation to provide 24/7 support for production systems.
Site Reliability Engineering Performance Indicators
The main KPIs that aid in understanding the impact and success of the SRE function at OutSystems are:
SLA and Service Level Objectives (SLO) compliance;
SLO Coverage and Detection Ratio;
MTTA - Mean time to acknowledge;
MTTR - Mean time to resolve.
Qualifications and Skills
To illustrate the desired profile for a Site Reliability Engineer. Nevertheless, the selection of candidates will always vary depending on specific knowledge of the field and prior experience.
Qualifications
BS/MS in Computer Science or Equivalent
6+ years of experience in Site Reliability Engineering, managing infrastructure and services at scale
History of end-to-end project delivery
Experience managing Hadoop and Kubernetes infrastructure and related services, or equivalent experience
Advanced knowledge of Linux, Networking, and Containers
Proficiency in at least one high-level programming language (Python, GoLang etc.).
Strong troubleshooting and debugging skills.
Fluency in English and excellent communication skills.
Soft Skills
Communication - able to communicate effectively (in English) both orally and written showing empathy for the other person;
Collaboration - Proactive collaboration and presentation skills to effectively communicate ideas and represent the deliverables and needs of the SRE team with leadership.
Humbleness - accepts mistakes and acts accordingly, with a humble attitude, apologizing for them and mitigating them ASAP to avoid higher impact.
Accountability - takes ownership of problems and makes sure to see them through. Even if he does not have all the necessary knowledge to move on alone, can involve the right people to reach closure.
Negotiation Skills - has tough and politically complex conversations with colleagues and customers, defusing disagreements and leading towards a mutual agreement and understanding of all parties involved.
Process Oriented - is organized and able to properly follow defined processes, whilst being able to properly challenge inefficient processes and suggest improvements.
Problem-solving - Has a top-down approach to problems, breaking them into smaller pieces and solving them by starting with a wider scope and narrowing it down as the analysis progresses. Has critical thinking, so can analyze information objectively and make a reasoned judgment.
Technical Skills
Experience in any of the following is valued, but not fully required:
Ability to establish, monitor, and improve Service Level Objectives (SLOs), Indicators (SLIs), and Agreements (SLAs) in line with business needs.
Containerization technologies and orchestration platforms, mainly Kubernetes and EKS
(CKA, CKAD, CKS certifications are valued);
Experience with automation and Infrastructure as Code (IaC) tools, such as AWS CloudFormation, Terraform, Puppet, Chef, Spacelift, etc;
Experience with Python, Go, Bash/Shell scripting, or other automation tools/languages;
Familiarity with AWS services like EC2, RDS, ELB, CloudFront, Lambda, etc;
Proficiency in monitoring and troubleshooting complex distributed systems;
Experience with Grafana, ELK stack, Prometheus, or others;
Strong understanding of designing resilient and fault-tolerant systems;
Expertise in debugging complex distributed systems.
More about OutSystemsOutSystems is a leading AI Development Platform built for the enterprise. Global organizations trust OutSystems to rapidly build mission-critical apps and agents, modernize legacy processes with agentic systems, and govern their entire AI portfolio across complex regulatory environments, all on one unified platform.
As the future becomes agentic, our customers need us now more than ever. While AI has opened the door to extraordinary possibilities, most large organizations find themselves stuck on one side of the "enterprise gap" because AI by itself doesn't solve their complex use cases and business challenges. OutSystems bridges the "enterprise gap" by combining the speed of generative AI with a deterministic, enterprise-grade framework. We provide the tools for teams of any size to deliver high-quality, reliable AI solutions that drive real business impact.
We are looking for passionate, talented, and motivated people to join us as we empower organizations to build, deploy, and scale the next generation of enterprise software. While we are leading the charge into the agentic era, our mission is broader: we are the platform enterprise leaders trust to evolve their entire business, accelerating innovation through secure, governed human-AI collaboration.
OutSystems is a global company, with more than 900k developer community members, 1,700 employees, more than 600 partners, and thousands of active customers in over 75 countries and across 21 industries. Founded in 2001, OutSystems now has offices in the United States, United Kingdom, the Netherlands, Portugal, Germany, the UAE, Japan, Hong Kong, Malaysia, Australia, India, and Singapore, and includes a thriving, worldwide community of remote employees.
Our customers are some of the world's most recognizable brands across diverse industries— such as Toyota, Heineken, Bosch, KeyBank, and UCLA—who trust OutSystems to deliver ROI and transformational impact.
Consistently recognized as a leader by top analyst firms Gartner, IDC and Forrester, OutSystems continues to shape the future of enterprise software development in the agentic era. We are proud to be named a leader in more than 100 categories on G2, including #1 in Customer Satisfaction in Enterprise Low Code Development, and most recently as a leader in AI Agent Building in the G2 Spring 2026 Reports.
Working at OutSystemsOur culture is built on our core values of Trust, Customer Success, Innovation, and Alignment. We operate as one global OutSystems team, taking ownership to pursue our vision of being the AI platform enterprise leaders trust to build, secure, and evolve their most critical applications and systems.
What do we have to offer you?
A company at the vanguard of the agentic revolution, where we don’t just react to AI innovation—we architect it. Joining OutSystems means stepping onto a high-growth rocket ship that combines the fearless agility of a startup with the sophisticated, global foundation of an enterprise powerhouse.
Real growth opportunities. We don't just talk about development; we invest in it through structured programs designed to scale your expertise. Whether you are aiming for vertical progression, exploring lateral moves into new domains, or mastering specialized AI skills through our Professional Development Fund and Internal Mobility Program, we provide the resources to get you there.
A global collective of world-class talent, where you’ll collaborate with enterprise software legends and sought-after thought leaders. At OutSystems, our industry experts aren't just visionaries—they are accessible, approachable mentors who are deeply invested in your growth as we architect the agentic future together.
OutSystems nurtures an inclusive culture where talented individuals from all backgrounds are empowered to learn, experiment and make an impact. . We believe that driving our next phase of growth requires the radical creativity that only comes from diverse perspectives. We are committed to building a team as global and diverse as the organizations we serve, ensuring every individual can perform to their full potential. As an equal opportunity employer, all qualified applicants receive equal consideration regardless of race, origin, religion, sex, sexual orientation, gender identity, disability, veteran status, or any other protected status.
Top Skills
OutSystems Sydney, New South Wales, AUS Office
333 George Street,, Sydney, New South Wales, Australia, 2000


