FlexHired LogoFlexHired
Logo of Upwork

Upwork

Principal ML Infra Engineer

Job Summary

The role involves designing, developing, and maintaining scalable machine learning infrastructure to support large-scale AI and ML workflows. Responsibilities include implementing distributed systems for data ingestion, model training, deployment, and monitoring, while collaborating with researchers and data scientists to translate research into software solutions. The ideal candidate has senior or leadership experience in ML infrastructure engineering, with strong software engineering skills and a collaborative mindset. Upwork provides comprehensive benefits and promotes a remote-first, inclusive work environment committed to innovation and growth.

Required Skills

Cloud Technologies
Model Training
Team Collaboration
Distributed Systems
Software Engineering
Innovation
Model Deployment
Machine Learning Infrastructure
Feature Engineering
Monitoring
Data Ingestion
Code Reviews
ML Research Collaboration
ML Concepts
Impactful Solutions

Benefits

Paid Time Off
Paid Parental Leave
Medical Insurance
Employee Stock Purchase Plan
401(k) Plan

Job Description

Upwork ($UPWK) is the world’s largest work marketplace, connecting businesses with highly skilled professionals worldwide. From entrepreneurs to Fortune 100 enterprises, companies trust Upwork’s platform to access expert talent, leverage AI-powered work solutions, and drive meaningful business outcomes.

Upwork’s AI-powered platform has facilitated over $20 billion in economic opportunity for professionals worldwide. With professionals spanning 10,000+ skills, including AI and machine learning, software development, sales and marketing, customer support, finance and accounting, and more, Upwork empowers businesses of all sizes to scale, innovate, and build agile teams.


The Machine Learning Infrastructure & Data team is responsible for architecting and building the foundational ML systems and tools that enable efficient development, deployment, and management of machine learning models at scale.

As a Principal ML Infrastructure Engineer in the Machine Learning Infrastructure & Data team, you will play a pivotal role in designing, developing, and maintaining robust and scalable ML infrastructure components to support the company's machine learning initiatives. You will collaborate closely with cross-functional teams including machine learning researchers, data scientists, and software engineers to build state-of-the-art platforms and tools that accelerate the development and deployment of machine learning models.

Responsibilities:

  • Own technical workstreams from start to finish, contribute to the team’s product roadmap, and be responsible for major technical decisions and tradeoffs. Effectively participate in team’s planning, code reviews, and design discussions
  • Consider the effects of projects across multiple teams and proactively manage conflicts. Work together with partner teams to achieve cross-departmental goals and satisfy broad requirements
  • Design, implement, and optimize distributed systems and infrastructure components to support large-scale machine learning workflows, including data ingestion, feature engineering, model training, and serving.
  • Develop and maintain frameworks, libraries, and tools to streamline the end-to-end machine learning lifecycle, from data preparation, model training, evaluation, deployment, and monitoring.
  • Architect and implement highly available, fault-tolerant, and secure systems that meet the performance and scalability requirements of production machine learning workloads.
  • Collaborate and publish with machine learning researchers and data scientists on novel research and translate research into scalable and efficient software solutions.
  • Stay current with the latest advancements in machine learning infrastructure, distributed computing, and cloud technologies, and integrate them into our platform to drive innovation.
  • Mentor teammates, conduct code reviews, and uphold engineering best practices to ensure the delivery of high-quality software solutions.

What it takes to catch our eye:

  • Senior/Leadership level experience in ML infrastructure engineering, ideally at an innovative technology company.
  • Proven Impact: Show us your track record of delivering impactful solutions.
  • Innovative Thinker: Bring creativity and fresh ideas to the table.
  • Technical Proficiency: Solid foundation in software engineering and ML concepts.
  • Collaborative Mindset: Strong communication and teamwork skills are a must.
  • Continuous Learner: Stay updated with the latest advancements in the field of AI.
  • Our Team's Tech stack: Compute: AWS, EKS, Databricks - Data: Snowflake, S3, SQLMesh, Feast - Workflow Automation: Airflow - Experiment Tracking: Weights & Biases, MLflow - LLM Inference: Fireworks, in-house deployment on EKS

Come change how the world works.

At Upwork, you’ll shape talent solutions for how the world works today. We are a remote-first organization working together to create exciting remote work opportunities for a global community of professionals. While we have physical offices in San Francisco and Chicago, currently we also hire full-time employees in 19 states in the United States.

At the core of our vibrant culture are shared values that form the foundation of our organization. These values revolve around trust, risk-taking, customer focus, and excellence. Our overarching mission is to create economic opportunities so that people have better lives. We foster an environment where individuals are encouraged to bring their authentic selves to work, nurturing personal and professional growth through development opportunities, mentorship programs, and participation in Upwork Belonging Communities.

We take pride in providing exceptional benefits to our employees. These include comprehensive medical insurance coverage for both you and your family, unlimited paid time off, a 401(k) plan with matching contributions, 12 weeks of paid parental leave, and an Employee Stock Purchase Plan. To explore these benefits in detail, as well as gain insights into our company values, working principles, and the overall employee experience, we invite you to visit our Life at Upwork page.

Check out our Careers page to learn more about the employee experience.

Upwork is proudly committed to recruiting and retaining a diverse and inclusive workforce. As an Equal Opportunity Employer, we never discriminate based on race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical condition), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.

The annual base salary range for this position is displayed below. The range displayed reflects the minimum and maximum salary for this position, and individual base pay will depend on your skills, qualifications, experience, and location. Additionally, this position is eligible for the annual bonus plan or sales incentive plan and eligibility to participate in our long term equity incentive program.

Annual Base Compensation
$216,500$390,750 USD

To learn more about how Upwork processes and protects your personal information as part of the application process, please review our Global Job Applicant Privacy Notice

Interested in this job?

Application deadline: Open until filled

Logo of Upwork

Upwork

A freelancing platform connecting businesses with independent professionals for remote work across various industries.

See more jobs
Date PostedJune 6th, 2025
Job TypeFull Time
LocationRemote
Salary$216,500 - $390,750
Exciting fully remote opportunity for a Principal ML Infra Engineer at Upwork. Offering $216,500 - $390,750 (full time). Explore more remote jobs on FlexHired!

Safe Remote Job Search Tips

Verify Employer Thoroughly

Research the company's identity thoroughly before applying. Check for a professional website with contacts, active social media, and LinkedIn profiles. Verify details across platforms and look for reviews on Glassdoor or Trustpilot to confirm legitimacy.

Never Pay to Get a Job

Legitimate employers never require payment for applications, training, background checks, or equipment. Always reject upfront payment requests or demands for bank details, even if they claim it's for purchasing necessary work gear on your behalf.

Safeguard Your Personal Information

Protect sensitive data like SSN, bank details, or ID copies. Share this only after accepting a formal, written job offer. Ensure it's submitted via a secure company system or portal, never through insecure channels like standard email attachments.

Scrutinize Communication & Interviews

Watch for communication red flags: poor grammar, generic emails (@gmail), vague details, or undue pressure. Be highly suspicious of interviews held only via text or chat apps; legitimate companies typically use video or phone calls.

Beware of Unrealistic Offers

If an offer's salary or benefits seem unrealistically high for the work involved, be cautious. Research standard pay for similar roles. Offers that appear 'too good to be true' are often scams designed to lure you into providing information or payment.

Insist on a Formal Contract

Always secure and review a formal, written job offer or employment contract before starting work or sharing final personal details. Ensure it clearly defines your role, compensation, key terms, and conditions to avoid misunderstandings or scams.

Related Jobs

Full Time
$203,000 - $318,000
Remote, USA
Full Time
$232,500 - $325,500
Remote - United States
Full Time
$232,500 - $325,500
Remote - United States
Full Time
$230,000 - $322,000
Remote - United States
Full Time
$223,600 - $313,000
Remote - United States

Subscribe Newsletter

Never miss a remote job opportunity. Subscribe to our newsletter today and receive exclusive job alerts, career advice, and industry insights delivered straight to your inbox.