FlexHired LogoFlexHired
Logo of Instacart

Instacart

Senior Site Reliability Engineer II

Job Summary

The role involves developing scalable infrastructure strategies, monitoring system performance, and leading incident management to ensure high system availability and reliability. Responsibilities include automating deployment processes, refining service level objectives, and collaborating with cross-functional teams for system improvements. Candidates should have proven experience in programming, incident management, and troubleshooting, with preferred skills in cloud platforms and containerization. The position is remote, with a focus on fostering a collaborative culture of innovation and continuous improvement.

Required Skills

Troubleshooting
Systems Design
Risk Assessment
Containerization
Automation
Monitoring
Incident Management
Site Reliability Engineering
Cloud Platforms
Infrastructure Strategy

Benefits

Remote Work Flexibility
Market-Competitive Compensation
Equity Grants
Annual Refresh Grants
Benefits Offerings

Job Description

We're transforming the grocery industry

At Instacart, we invite the world to share love through food because we believe everyone should have access to the food they love and more time to enjoy it together. Where others see a simple need for grocery delivery, we see exciting complexity and endless opportunity to serve the varied needs of our community. We work to deliver an essential service that customers rely on to get their groceries and household goods, while also offering safe and flexible earnings opportunities to Instacart Personal Shoppers.

Instacart has become a lifeline for millions of people, and we’re building the team to help push our shopping cart forward. If you’re ready to do the best work of your life, come join our table.

Instacart is a Flex First team

There’s no one-size fits all approach to how we do our best work. Our employees have the flexibility to choose where they do their best work—whether it’s from home, an office, or your favorite coffee shop—while staying connected and building community through regular in-person events. Learn more about our flexible approach to where we work.

Overview

About the Role

Join our team as a Senior Site Reliability Engineer II, where your expertise will play a crucial role in maintaining the backbone of our platform's operations. You'll take on challenges directly, ensuring optimal performance and growth while fostering a culture that prioritizes diligent and effective reliability practices. We're seeking someone eager to take ownership, skilled at addressing complex issues, and ready to explore innovative solutions to support the well-being of our teams and services.

About the Team

The Site Reliability Engineering (SRE) team combines software and systems engineering to design and manage large-scale, distributed, and fault-tolerant systems. This team is tasked with ensuring high reliability, optimal system performance, and continuous improvement for both Instacart's critical internal services and externally facing systems.

SRE focuses on optimizing existing systems, building robust infrastructure, and automating processes to minimize manual effort. Joining the SRE team means facing unique scaling challenges while leveraging expertise in coding, algorithms, complexity analysis, and large-scale system design.

The team thrives within a culture of intellectual curiosity, problem-solving, and collaboration. With members from diverse backgrounds and experiences, SRE fosters a supportive and risk-tolerant environment where individuals are encouraged to think big, take on impactful projects, and grow with mentorship and guidance.

About the Job

  • Develop scalable infrastructure strategies to ensure high availability, that align infrastructure planning with product roadmaps, and optimize cost, risk and performance with cloud providers.
  • Establish and lead incident management protocols and response plans to coordinate rapid responses, investigate root causes, prevent recurrence, and collaborate with security teams to test response readiness and address security risks.
  • Continuously monitor performance metrics and trends to proactively identify reliability risks. Regularly refine SLOs, SLIs, and Error Budgets to align with evolving standards and leverage data insights to propose improvement plans and suggest architectural updates to enhance system reliability.
  • Oversee regular system evaluations to pinpoint and refine process shortcomings and lead cross-functional projects that promote system optimization and minimize technical debt. Collaborate with product and engineering teams to ensure system enhancements align with user requirements.
  • Design and deploy automation tools to streamline deployment and operations, ensuring seamless processes while overseeing the continuous enhancement of automation scripts and frameworks, and rigorously monitor automated systems for performance and reliability. Address and tackle issues in automated environments promptly to reduce disruptions.
  • Provide technical guidance to junior colleagues, fostering a collaborative culture for problem-solving and innovation. Organize and lead knowledge-sharing sessions and coordinate training in site reliability best practices to enhance team proficiency.

About You

Minimum Qualifications

  • Proven experience in programming
  • Robust knowledge of incident management processes and tools
  • Exemplary troubleshooting and problem-solving skills
  • Ability to work under pressure and prioritize tasks during high-stress situations
  • Expertise in scaling application infrastructure for high availability

Preferred Qualifications

  • Proficient in Ruby or Go
  • Experience with cloud platforms (eg, AWS, GCP, Azure) and containerization (eg, Docker, Kubernetes)
  • Skill in risk assessment for foundational infrastructure changes
  • Experience in monitoring system performance and trend analysis

#LI-Remote

Instacart provides highly market-competitive compensation and benefits in each location where our employees work. This role is remote and the base pay range for a successful candidate is dependent on their permanent work location. Please review our Flex First remote work policy here. Currently, we are only hiring in the following provinces: Ontario, Alberta, British Columbia, and Nova Scotia.

Offers may vary based on many factors, such as candidate experience and skills required for the role. Additionally, this role is eligible for a new hire equity grant as well as annual refresh grants. Please read more about our benefits offerings here.

For Canadian based candidates, the base pay ranges for a successful candidate are listed below.

CAN
$183,000$203,000 CAD

Interested in this job?

Application deadline: Open until filled

Logo of Instacart

Instacart

A grocery delivery service allowing users to order from local stores and have items delivered by personal shoppers.

See more jobs
Date PostedJune 6th, 2025
Job TypeFull Time
LocationCanada - Remote (BC, ON, AB, or NS only)
Salary$183,000 - $203,000
Exciting remote opportunity (requires residency in Canada) for a Senior Site Reliability Engineer II at Instacart. Offering $183,000 - $203,000 (full time). Explore more remote jobs on FlexHired!

Safe Remote Job Search Tips

Verify Employer Thoroughly

Research the company's identity thoroughly before applying. Check for a professional website with contacts, active social media, and LinkedIn profiles. Verify details across platforms and look for reviews on Glassdoor or Trustpilot to confirm legitimacy.

Never Pay to Get a Job

Legitimate employers never require payment for applications, training, background checks, or equipment. Always reject upfront payment requests or demands for bank details, even if they claim it's for purchasing necessary work gear on your behalf.

Safeguard Your Personal Information

Protect sensitive data like SSN, bank details, or ID copies. Share this only after accepting a formal, written job offer. Ensure it's submitted via a secure company system or portal, never through insecure channels like standard email attachments.

Scrutinize Communication & Interviews

Watch for communication red flags: poor grammar, generic emails (@gmail), vague details, or undue pressure. Be highly suspicious of interviews held only via text or chat apps; legitimate companies typically use video or phone calls.

Beware of Unrealistic Offers

If an offer's salary or benefits seem unrealistically high for the work involved, be cautious. Research standard pay for similar roles. Offers that appear 'too good to be true' are often scams designed to lure you into providing information or payment.

Insist on a Formal Contract

Always secure and review a formal, written job offer or employment contract before starting work or sharing final personal details. Ensure it clearly defines your role, compensation, key terms, and conditions to avoid misunderstandings or scams.

Related Jobs

Full Time
$165,000 - $183,000
Canada - Remote (ON, AB, BC, or NS Only)
Full Time
$165,000 - $183,000
Canada - Remote (ON, AB, BC, or NS Only)
Full Time
$193,000 - $214,000
Canada - Remote (ON, AB, BC, or NS Only)
Full Time
$165,000 - $183,000
Canada - Remote (ON, AB, BC, or NS Only)
Full Time
$176,000 - $225,000
Canada - Remote (AB, BC, ON, NS ONLY)

Subscribe Newsletter

Never miss a remote job opportunity. Subscribe to our newsletter today and receive exclusive job alerts, career advice, and industry insights delivered straight to your inbox.