Senior MLOps Engineer Job at DeepRec.ai, San Jose, CA

  • DeepRec.ai
  • San Jose, CA

Job Description

Senior MLOps Engineer

We are hiring an MLOps Engineer for a fast-moving AI startup that is building a world-class AI-powered video platform.

We are looking for a skilled and hands-on MLOps Engineer to join their growing team. You will play a critical role in deploying, scaling, and maintaining their machine learning infrastructure, supporting a range of tools that enable the controlled generation of high-quality animated videos.

Key Responsibilities

  • Design, deploy, and maintain scalable training and data-processing pipelines on distributed compute clusters (e.g., Slurm, Kubernetes, or cloud-native equivalents).
  • Optimize inference systems for latency and cost in a production setting.
  • Collaborate closely with ML researchers and engineers to productionize deep learning models.
  • Implement robust monitoring, logging, and alerting systems for model performance and infrastructure reliability.
  • Automate model testing, validation, and deployment processes across staging and production environments.
  • Ensure efficient usage of compute resources, including GPU clusters, and help identify bottlenecks or cost-saving opportunities.

Requirements

  • Proven experience in MLOps, ML infrastructure, or related roles.
  • Deep expertise in deploying and maintaining ML training pipelines on distributed systems.
  • Strong knowledge of inference optimization techniques, especially in reducing latency and cost at scale.
  • Proficiency with cloud platforms (AWS, GCP, Azure) and orchestration tools (Kubernetes, Docker).
  • Experience working with GPU scheduling, distributed training (e.g., PyTorch DDP), and model serving frameworks (e.g., Triton, TorchServe).
  • Familiarity with CI/CD for ML workflows.
  • Strong Python skills and experience with ML/DL frameworks like PyTorch or TensorFlow.

Bonus Points

  • Experience working in the creative media or animation industry.
  • Exposure to video processing, generative AI, or large-scale content production systems.
  • Experience collaborating with research teams or integrating research code into production pipelines.

Please apply for more information.
