FlexHired LogoFlexHired
Logo of Upstart

Upstart

Principal Site Reliability Engineer

Job Summary

The Principal Site Reliability Engineer at Upstart is responsible for ensuring the reliability, resiliency, and observability of the company's production systems. The role involves leading the adoption of SRE best practices, mentoring engineering teams, and collaborating across departments such as Product Engineering, DevOps, and Data Engineering to improve system performance and incident response. Candidates should have extensive experience in software engineering and SRE disciplines, proficiency in programming languages like Python, Go, and JavaScript, and expertise with automation, observability tools, and cloud infrastructure. The position offers opportunities to influence technical strategies and drive enterprise-wide reliability initiatives in a fast-paced, digital-first environment.

Required Skills

Cloud Infrastructure
Python
JavaScript
Program Management
Full Stack Development
TypeScript
Go
Automation
Observability
Monitoring
Incident Management
Site Reliability Engineering
Infrastructure as Code
Performance Monitoring
Service Mesh
Distributed Tracing
Self-Healing Systems
ML/GenAI

Benefits

Health Insurance
Parental Leave
Dental Insurance
Vision Insurance
Life Insurance
Disability Insurance
Employee Stock Purchase Plan
401(k) Retirement Plan
Health Savings Account
Paid Leave
Family Care Leave
Military Leave
Wellness Reimbursements
Catered Meals
Technology & Ergonomic Reimbursements
Social Activities

Job Description

About Upstart

Upstart is the leading AI lending marketplace partnering with banks and credit unions to expand access to affordable credit. By leveraging Upstart's AI marketplace, Upstart-powered banks and credit unions can have higher approval rates and lower loss rates across races, ages, and genders, while simultaneously delivering the exceptional digital-first lending experience their customers demand. More than 80% of borrowers are approved instantly, with zero documentation to upload.

Upstart is a digital-first company, which means that most Upstarters live and work anywhere in the United States. However, we also have offices in San Mateo, California; Columbus, Ohio; and Austin, Texas.

Most Upstarters join us because they connect with our mission of enabling access to effortless credit based on true risk. If you are energized by the impact you can make at Upstart, we’d love to hear from you!

The Team

Upstart’s Site Reliability Engineering (SRE) team owns the reliability, resiliency, and observability of Upstart’s production systems. We build automation, tooling, and frameworks to ensure our infrastructure is healthy, scalable, and able to support a seamless experience for both engineers and customers. Our scope includes defining Upstart’s technology operations risk strategy, implementing disaster recovery planning, and setting company-wide reliability standards.

As a Principal Site Reliability Engineer at Upstart, you will serve as a thought leader and SRE evangelist - driving adoption of best practices, mentoring engineers across the organization, and influencing both technical and business decisions. Your impact will extend beyond SRE into cross-functional collaboration with Product Engineering, DevEx, Development Productivity (Quality), DevOps, Data Engineering, and Machine Learning teams to elevate operational excellence across the company.


How you’ll make an impact:

  • Lead the definition, advocacy, and adoption of SRE principles across engineering teams
  • Partner with leadership to shape long-term reliability, resiliency, and observability strategies
  • Champion distributed tracing, real user monitoring (RUM), and key performance metrics such as Largest Contentful Paint (LCP) to improve system visibility and user experience
  • Build and scale self-healing systems to minimize manual intervention and reduce downtime
  • Drive enterprise-wide improvements to incident response processes, including those related to Machine Learning systems
  • Collaborate closely with Development Productivity and Quality teams to improve engineering velocity without sacrificing reliability
  • Influence technical and operational roadmaps through data-driven insights and hands-on technical contributions
  • Own and deliver cross-functional initiatives from concept through execution, applying program management skills to align stakeholders and achieve results


What we’re looking for:

  • Minimum requirements:
    • 10+ years combined experience across Software Engineering and Site Reliability Engineering, with a balanced background in both disciplines
    • Proven track record as an SRE thought leader and evangelist, driving adoption of reliability best practices across organizations
    • Strong communication and mentoring skills to influence engineers across disciplines
    • Proficiency in Python, Go, and JavaScript/TypeScript
    • Proficiency with Infrastructure as Code (Terraform, CDK, CloudFormation, etc.)
    • Experience building internal tooling from scratch in agile development environments
    • Expertise with observability, distributed tracing, RUM, LCP, and performance monitoring tools (e.g., Datadog, Prometheus)
    • Experience with on-call and incident management, including large-scale or ML-related incidents
    • Strong background in automation and building self-healing systems
    • Hands-on experience with LLM/GenAI to improve SRE efficiency and processes
    • Program management skills, including the ability to propose innovative solutions, influence leadership, improve processes, and drive cross-functional projects to completion
  • Preferred qualifications:
    • Experience with service mesh
    • Full stack development skills
    • Experience building or extending observability platforms
    • Background in Development Productivity or Quality Platforms
    • Experience in high-scale SaaS, microservice-oriented cloud environments

Position Location - This role is available in the following locations: Remote, San Mateo, Columbus, Austin

Time Zone Requirements - This team operates across all U.S. time zones.

Travel Requirements - This team has regular on-site collaboration sessions. These occur 3 days per quarter at an Upstart office. If you need to travel to make these meetups, Upstart will cover all travel related expenses.

What you'll love:

  • Competitive Compensation (base + bonus & equity)
  • Comprehensive medical, dental, and vision coverage with Health Savings Account contributions from Upstart
  • 401(k) with 100% company match up to $4,500 and immediate vesting and after-tax savings
  • Employee Stock Purchase Plan (ESPP)
  • Life and disability insurance
  • Generous holiday, vacation, sick and safety leave
  • Supportive parental, family care, and military leave programs
  • Annual wellness, technology & ergonomic reimbursement programs
  • Social activities including team events and onsites, all-company updates, employee resource groups (ERGs), and other interest groups such as book clubs, fitness, investing, and volunteering
  • Catered lunches + snacks & drinks when working in offices

#LI-REMOTE

#LI-MidSenior

At Upstart, your base pay is one part of your total compensation package. The anticipated base salary for this position is expected to be within the below range. Your actual base pay will depend on your geographic location–with our “digital first” philosophy, Upstart uses compensation regions that vary depending on location. Individual pay is also determined by job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

In addition, Upstart provides employees with target bonuses, equity compensation, and generous benefits packages (including medical, dental, vision, and 401k).

United States | Remote - Anticipated Base Salary Range
$186,100$257,500 USD

Upstart is a proud Equal Opportunity Employer. We are dedicated to ensuring that underrepresented classes receive better access to affordable credit, and are just as committed to embracing diversity and inclusion in our hiring practices. We celebrate all cultures, backgrounds, perspectives, and experiences, and know that we can only become better together.

If you require reasonable accommodation in completing an application, interviewing, completing any pre-employment testing, or otherwise participating in the employee selection process, please email [email protected]

https://www.upstart.com/candidate_privacy_policy

Interested in this job?

Application deadline: Open until filled

Logo of Upstart

Upstart

A lending platform using AI to provide personal loans and credit solutions with a focus on fair and fast approvals.

See more jobs
Date PostedAugust 15th, 2025
Job TypeFull Time
LocationUnited States | Remote
Salary$186,100 - $257,500
Exciting remote opportunity (requires residency in United States) for a Principal Site Reliability Engineer at Upstart. Offering $186,100 - $257,500 (full time). Explore more remote jobs on FlexHired!

Safe Remote Job Search Tips

Verify Employer Thoroughly

Research the company's identity thoroughly before applying. Check for a professional website with contacts, active social media, and LinkedIn profiles. Verify details across platforms and look for reviews on Glassdoor or Trustpilot to confirm legitimacy.

Never Pay to Get a Job

Legitimate employers never require payment for applications, training, background checks, or equipment. Always reject upfront payment requests or demands for bank details, even if they claim it's for purchasing necessary work gear on your behalf.

Safeguard Your Personal Information

Protect sensitive data like SSN, bank details, or ID copies. Share this only after accepting a formal, written job offer. Ensure it's submitted via a secure company system or portal, never through insecure channels like standard email attachments.

Scrutinize Communication & Interviews

Watch for communication red flags: poor grammar, generic emails (@gmail), vague details, or undue pressure. Be highly suspicious of interviews held only via text or chat apps; legitimate companies typically use video or phone calls.

Beware of Unrealistic Offers

If an offer's salary or benefits seem unrealistically high for the work involved, be cautious. Research standard pay for similar roles. Offers that appear 'too good to be true' are often scams designed to lure you into providing information or payment.

Insist on a Formal Contract

Always secure and review a formal, written job offer or employment contract before starting work or sharing final personal details. Ensure it clearly defines your role, compensation, key terms, and conditions to avoid misunderstandings or scams.

Related Jobs

Full Time
$182,300 - $252,500
United States | Remote
Full Time
$186,100 - $257,500
United States | Remote
Full Time
$182,300 - $252,500
United States | Remote
Full Time
$187,900 - $260,000
United States | Remote
Full Time
$198,700 - $275,000
United States | Remote

Subscribe Newsletter

Never miss a remote job opportunity. Subscribe to our newsletter today and receive exclusive job alerts, career advice, and industry insights delivered straight to your inbox.