Job title: Senior MLOPs Engineer - RC
Job type: Permanent
Emp type: Full-time
Industry: Information Technology / IT
Functional Expertise: Technical < 技術系 >
Salary: Negotiable
Location: Tokyo
Job published: 2025-04-29
Job ID: 61279

Job Description

Hiring a Senior MLOps Engineer in Japan!

 

 

■ Senior MLOps Engineer

 

■ Company Overview

We Combine AI and conversation design expertise, provides brands the ability to easily create and integrate automated chat experiences, transforming  post, story, or ad into personalized interactive experiences.

 

■ Your Role and Responsibilities 

As a Senior MLOps Engineer, you will be performing deploying, optimizing, and monitoring LLMs in production environments. Your role will involve building and maintaining scalable pipelines, ensuring low-latency inference, and implementing best practices in monitoring and observability. You will also work with state-of-the-art tools like Hugging Face and MLFlow to fine-tune models and integrate them into AI solutions.

 

Key Responsibilities

Model Deployment & Management:
• Develop and maintain scalable pipelines for deploying LLMs, focusing on efficient, low-latency inference.
• Utilize tools like Hugging Face and MLFlow for seamless model integration and version control.

 

Monitoring & Observability:
• Implement comprehensive monitoring frameworks to track performance and reliability of models in production.
• Use advanced observability tools to proactively detect and address performance issues.

 

Infrastructure Optimization:
• Architect and optimize cloud and on-premise infrastructure to support large-scale LLM operations

 

Collaboration & Documentation:
• Partner with AI engineers and data scientists to align on project objectives and deployment strategies.

 

■ Experience and Qualifications

• 5+ years of experience in MLOps, DevOps, or related fields, with a focus on deploying and managing LLMs or other large-scale machine learning models.
• Proven experience with tools like Hugging Face, MLFlow, and containerization technologies (Docker, Kubernetes).
• Strong experience with cloud platforms (AWS, Azure, GCP) and infrastructure as code (Terraform).
• Hands-on experience in reducing inference latency and optimizing AI infrastructure.

 

Technical Skills:
• Proficiency in Python, with experience in ML libraries such as TensorFlow, PyTorch, and related frameworks.
• Expertise in CI/CD pipelines, version control (Git), and orchestration tools.
• Familiarity with Generative AI, prompt engineering, and deploying models at scale.

 

■ Good Reasons to Join

• Full remote position within Japan
• Flexible working: Highly flexible
• Will work with diversified employees

 

■ Work Location

Japan

 

 

 

Details will be provided during the meeting.

 

File types (doc, docx, pdf, rtf, png, jpeg, jpg, bmp, jng, ppt, pptx, csv, gif) size up to 5MB
File types (doc, docx, pdf, rtf, png, jpeg, jpg, bmp, jng, ppt, pptx, csv, gif) size up to 5MB