Xero

Senior Site Reliability Engineer (Technical Duty Officer)

Job Summary

This role involves owning and improving the incident management process to ensure service reliability across Xero's products and services. The candidate will lead technical responses to high-severity incidents, coordinate multiple teams, and promote best practices within the SRE culture. It requires hands-on experience with troubleshooting, automation, and technical leadership, particularly in cloud environments like AWS. The position emphasizes fostering a culture of continuous learning, operational excellence, and proactive incident prevention.

Required Skills

Python
Troubleshooting
Communication Skills
Technical Leadership
Networking
AWS
Automation
Incident Management
Site Reliability Engineering
Monitoring and Observability
Incident Response
SRE Principles

Benefits

Health Insurance
Career Development
Paid Parental Leave
Life Insurance
Flexible Working
Employee Assistance Program
Employee Resource Groups
Income Protection
Paid Leave
Employee Share Plan
Wellbeing Programs
Sports Programs

Job Description

Our Purpose

At Xero, we’re here to help you supercharge your business. We do this by automating routine tasks, surfacing actionable insights and connecting businesses with the right data, advisors and apps. When that happens, we’re not only making life better for small business, we’ll be building a stronger economy that can change the world.


About the team


Xero’s Incident and Problem Management team is part of the Site Reliability Engineering (SRE) organization and is responsible for building, delivering and maintaining robust processes and tooling for incident management.


The team drives enduring reliability at Xero through a robust, consistent and fast response to high-severity incidents. It is responsible for building a world-class process and ensuring that the process matures as the demands of the business grow.


About the roles


We're hiring for multiple roles at Senior Engineer level. These positions require experienced SRE professionals with a strong technical background, a passion for building and delivering robust processes, and extensive experience leading the technical response to high-severity cloud issues.


They will drive best practice across the business and contribute to the ongoing transformation of the Xero SRE culture. As expert communicators, they will lead technical discussions to identify and track the actions that arise during incidents.


Across our SRE function, we're looking for people who are keen to dig deep into the causes of incidents and to proactively examine the potential causes of future incidents, working with engineering teams to remove the risk of each failure scenario. They will build playbooks and automation to ensure quick and effective responses, and provide ongoing training across the business so that the process is well understood and followed.


These roles will form the backbone of a new team, providing a Technical Duty Officer (TDO) function within the business. TDOs are incident commanders who use SRE skill sets to drive fast mitigation and enduring resolution of impactful events.



What you'll do:
  • Own the incident management process, ensuring it drives enduring reliability across all products and services within Xero.
  • Provide expert leadership during critical outages, coordinating multiple teams to ensure streamlined decision-making and quick resolution.
  • Lead and advocate for the transformation to a world-leading SRE organization, promoting SRE principles within the Engineering Department.
  • Promote a customer-focused approach by addressing and mitigating global customer environment issues, and fostering a culture of continuous learning and technical excellence within the SRE team.
  • Develop and implement scalable process frameworks and observability strategies to ensure rapid problem diagnosis, response, and service reliability.
  • Collaborate with product teams to thoroughly analyze failures and integrate insights to improve service reliability, scalability, and operational efficiency.


What you'll bring:
  • Previous career experience as a Site Reliability Engineer, in an Operations or Engineering environment
  • Hands-on experience troubleshooting AWS-hosted services
  • Networking knowledge, including the ability to troubleshoot TCP/IP, SSL/TLS, DNSSEC, IPsec, and BGP issues
  • Coding experience (preferably Python) building tools, scripting, or automation
  • Strong communication (oral & written) skills including the ability to translate technical issues/concepts into agreed actions



Why Xero?

Xero offers very generous paid leave to use however you’d like (plus statutory holidays!), dedicated paid leave to care for your physical and mental wellbeing, and an Employee Assistance Program that gives you and your family access to mental health care. You'll also get health insurance, life insurance and income protection, wellbeing and sports programmes, employee resource groups, 26 weeks of paid parental leave for primary caregivers, an Employee Share Plan, beautiful offices, flexible working, career development, and many other benefits that reflect our human value. You’ll do the best work of your life at Xero.

Application deadline: Open until filled

Xero

Xero online accounting software for your business connects you to your bank, accountant, bookkeeper, and other business apps. Start a free trial today.

Date Posted: January 15th, 2025
Job Type: Full Time
Location: Remote
Salary: Competitive rates