FlexHired LogoFlexHired
Logo of Nearsure

Nearsure

(1016) Staff Site Reliability Engineer

Job Summary

This role as a Staff Site Reliability Engineer focuses on owning and optimizing observability pipelines, supporting incident response, and building automation tools to enhance system reliability. The candidate should have extensive experience with monitoring tools, cloud infrastructure, and scripting, along with strong communication skills and a background in Kubernetes and IaC. The position offers flexible remote work, comprehensive employee benefits, and opportunities to influence architectural decisions and improve system resiliency across the organization.

Required Skills

Terraform
CI/CD
Automation
Reliability Engineering
Kubernetes
Monitoring
Security
Alerting
Data Ingestion
Incident Response
Infrastructure-as-Code
GitOps
Logging
Playbook Development
Observability Pipelines
SLO / SLA
Cloud (AWS)
Scripting (Python, Go)
Telemtry
Resiliency Design

Benefits

Health Insurance
Paid Time Off
Remote Work
Sick Leave
Birthday Day Off
National Holidays
Team-building Activities
Refundable Annual Credit

Job Description

Explore the Nearsure experience!

🌐 Join our close-knit LATAM remote team: Connect through fun activities like coffee breaks, tech talks, and games with your team-mates and management.

🍃 Say goodbye to micromanagement! We champion autonomy, open communication, and respect for diversity as our core values.

Your well-being matters: Our People Care team is here from day one to support you with everything from time-off requests to wellness check-ins.

Plus, our Accounts Management team ensures smooth, effective client relationships, so you can focus on what you do best.

Ready to grow with us? 🚀

Here’s what we offer you by joining us!

Competitive USD salary 💲 – We value your skills and contributions!

🌐 100% remote work 🏢 – While you can work from anywhere, you’re always welcome to connect with teammates and grow your network at our coworking spaces across LATAM!

💼 Paid time off – Take the time you need according to your country’s regulations, all while receiving your full salary. Rest, recharge, and come back stronger!

🎉 National Holidays celebrated 🌴 – Take time off to celebrate important events and traditions with loved ones, fully embracing your culture.

😷 Sick leave – Focus on your health without the stress. Take the necessary time to recover and feel better.

💸 Refundable Annual Credit – Spend it on the perks you love to enhance your work-life balance!

🤝 Team-building activities – Join us for coffee breaks, tech talks, and after-work gatherings to bond with your Nearsure family and feel part of our vibrant community.

🥳 Birthday day off 🎂 – Enjoy an extra day off during your birthday week to celebrate in style with friends and family!


About the project

As a Staff Site Reliability Engineer, you will own and optimize OpenTelemetry pipelines, enabling scalable and efficient observability. You’ll build tools that empower teams, support incident response, and drive best practices. Your work ensures a reliable, secure infrastructure and actionable alerting across the organization.


How your day-to-day work will look like

Design, implement, and maintain observability pipelines across the three main signals—logs, metrics, and traces—ensuring standardized, scalable, and efficient data ingestion. Optimize ingestion strategies to balance cost, performance, and usability.
Build self-service automation and tooling that enables development teams to instrument and leverage observability without requiring manual intervention from the SRE team. Drive adoption of best practices while ensuring teams own their telemetry.
Design the processes, playbooks, checklists, and automations for them and other engineers to follow during an incident.
Interact with members from almost all teams across the business to understand their monitoring, alerting, and SLO / SLA requirements and design systems and processes that ensure we meet or exceed these requirements. Influence architectural decisions during initial design stages to ensure resiliency and scale at the outset of software development.
Design the processes, playbooks, checklists, and automations for them and other engineers to follow during an incident.
Leverage Infrastructure-as-Code (IaC) to provision and manage monitoring tools, alerting rules, and our observability configurations across OTEL Pipelines.
Design base-level requirements for new and existing services to ensure that all client infrastructure and code are monitored consistently and accurately at a basic level.
Take full ownership of client infrastructure reliability, ensuring adherence to key availability and security KPIs.

This would make you the ideal candidate

Bachelor's Degree in Computer Science, Engineering, or a related field.
8+ Years of experience working as an SRE Engineer or in a very similar role, more focused on observability.
5+ Years of experience working with cloud (AWS).
5+ Years of experience working with IaC tools (Terraform) and GitOps CI/CD solutions (ArgoCD, GitHub Actions, or similar).
4+ Years of experience working with monitoring and logging OpenSource tools such as Grafana, Prometheus, Elastic/OpenSearch, Loki, Tempo.
4+ Years of experience working in Kubernetes, including its core components, deployment methodologies, and monitoring best practices.
Strong scripting abilities (Python, Go, or similar) for automating observability tasks.
✨ Experience in managing observability: SLI, SLOs, Log Transformation, Cardinality Management, Business and Resilience Metrics, 4 Golden Signals, Distributed Tracing.
Experience with automated alerting workflows.
Exposure with OpenTelemetry Pipelines.
Advanced English Level is required for this role as you will work with US clients. Effective communication in English is essential to deliver the best solutions to our clients and expand your horizons.

What to expect from our hiring process

1️. Let’s chat about your experience!
2. Impress our recruiters, and you’ll move on to a technical interview with our top developers.
3. Nail that, and you’ll meet our client - your final step to joining our amazing team!

🎯 At Nearsure, we’re dedicated to solving complex business challenges through cutting-edge technology and we believe in the power of tailored solutions. Whether you are passionate about transforming businesses with Generative AI, building innovative software products, or implementing comprehensive enterprise platform solutions, we invite you to be part of our dynamic team!

We would love to hear from you if you are eager to make an impact and join a collaborative team that values creativity and expertise.

Let’s work together to shape the future of technology!

🧑💻 Apply now!

By applying to this position, you authorize Nearsure to collect, store, transfer, and process your personal data in accordance with our Privacy Policy. For more information, please review our Privacy Policy.

Interested in this job?

Application deadline: Open until filled

Logo of Nearsure

Nearsure

Transform your digital journey with Nearsure. With expertise in 160+ technologies, a 90% retention rate, and an 85 NPS score, we deliver high-impact solutions.

See more jobs
Date PostedMay 26th, 2025
Job TypeFull Time
LocationLatin America - Remote
SalaryCompetitive rates
Exciting remote opportunity (requires residency in Canada, Mexico, United States) for a (1016) Staff Site Reliability Engineer at Nearsure. Offering competitive salary (full time). Explore more remote jobs on FlexHired!

Safe Remote Job Search Tips

Verify Employer Thoroughly

Research the company's identity thoroughly before applying. Check for a professional website with contacts, active social media, and LinkedIn profiles. Verify details across platforms and look for reviews on Glassdoor or Trustpilot to confirm legitimacy.

Never Pay to Get a Job

Legitimate employers never require payment for applications, training, background checks, or equipment. Always reject upfront payment requests or demands for bank details, even if they claim it's for purchasing necessary work gear on your behalf.

Safeguard Your Personal Information

Protect sensitive data like SSN, bank details, or ID copies. Share this only after accepting a formal, written job offer. Ensure it's submitted via a secure company system or portal, never through insecure channels like standard email attachments.

Scrutinize Communication & Interviews

Watch for communication red flags: poor grammar, generic emails (@gmail), vague details, or undue pressure. Be highly suspicious of interviews held only via text or chat apps; legitimate companies typically use video or phone calls.

Beware of Unrealistic Offers

If an offer's salary or benefits seem unrealistically high for the work involved, be cautious. Research standard pay for similar roles. Offers that appear 'too good to be true' are often scams designed to lure you into providing information or payment.

Insist on a Formal Contract

Always secure and review a formal, written job offer or employment contract before starting work or sharing final personal details. Ensure it clearly defines your role, compensation, key terms, and conditions to avoid misunderstandings or scams.

Related Jobs

Full Time
Latin America - Remote
Full Time
Latin America - Remote
Full Time
Latin America - Remote
Full Time
Latin America - Remote

Subscribe Newsletter

Never miss a remote job opportunity. Subscribe to our newsletter today and receive exclusive job alerts, career advice, and industry insights delivered straight to your inbox.