Rackspace
Site Reliability Engineer / Observability Engineer III (Fixed Night Shift Role)
Job Summary
The role involves ensuring system reliability, availability, and performance through collaboration with development and platform teams. Key responsibilities include building scalable systems, developing monitoring tools, analyzing metrics, troubleshooting issues, and improving automation. Candidates should have extensive experience with AWS, Terraform, CDN, monitoring, and scripting, along with a proactive problem-solving approach. The position requires strong communication skills and the ability to work effectively in a fast-paced, collaborative environment.
Job Description
Sr. Site Reliability Engineer III - Job Description
As a Site Reliability Engineer, you will play a key role in ensuring our systems remain reliable, available, and performant for both our customers and internal teams. Your expertise will directly impact our users' experience and the success of our business.
In this role, you'll collaborate closely with our product development and platform engineering teams to build scalable systems and create robust automation that supports our company's goals. Your day-to-day work will make a meaningful difference in how efficiently and effectively our technology operates.
We're looking for someone who has hands-on experience with technologies like AWS, CDN, Terraform, Packer, and Splunk. Keen troubleshooting abilities will be essential as you identify and solve complex issues in the critical applications our customers rely on daily.
The ideal candidate thrives on learning new technologies and approaches challenges with enthusiasm. You'll be joining a collaborative environment where your problem-solving skills will shine as you work across multiple teams. If you're self-motivated, passionate about quality, and ready to make an impact, we want to hear from you!
Responsibilities:
- Collaborate with development teams to implement and deploy new features that meet high standards for reliability, security, and performance.
- Partner with cross-functional teams to establish and enhance enterprise standards and best practices.
- Develop and maintain effective monitoring tools, alerts, and dashboards that provide clear visibility into system health and performance.
- Analyze metrics and logs to proactively detect anomalies, optimize performance, plan capacity, and isolate issues before customer impact occurs.
- Identify innovative solutions to complex problems and implement corrective actions decisively.
- Mentor junior team members while documenting and sharing solutions to build team knowledge.
Qualifications:
- Minimum of 5 years of experience in DevOps engineering roles such as SRE, DevOps, or CloudOps.
- Advanced proficiency with Terraform for infrastructure-as-code implementation (required).
- Extensive experience with AWS technologies and services, including EC2, S3, RDS, and IAM (required).
- Comprehensive understanding of HTTP protocols, web server technologies, and troubleshooting.
- Strong experience with load balancing solutions such as AWS ELB, NGINX, or HAProxy.
- Practical knowledge of caching technologies and CDN implementations.
- Working experience with Redis for in-memory data storage and caching.
- Demonstrated ability to implement and optimize CDN solutions for global content delivery (preferred).
- Expertise in monitoring and troubleshooting web application performance and availability.
- Practical experience with observability solutions such as Splunk, Datadog, or similar.
- Proficiency in one or more languages such as Java, Go, Python, or Linux Shell.
- Proven experience operating effectively in an agile software development environment.
- Strong understanding of AWS pricing/cost models across compute, storage, and database offerings.
- Experience implementing and maintaining CI/CD pipelines.
- Ability to multitask and adapt to changing priorities in a fast-paced, 24x7 environment.
- Collaborative approach to working with cross-functional teams of both technical and business professionals.
- Excellent communication, problem-solving, and customer service skills.
- Bachelor's degree in computer science, science, or engineering, or equivalent technical certifications, preferred.
Rackspace
As a cloud computing services pioneer, we deliver proven multicloud solutions across your apps, data, and security. Maximize the benefits of modern cloud.