Job title:	Senior MLOps Engineer - RC
Job type:	Permanent
Emp type:	Full-time
Industry:	IT & Telecommunications / IT・通信
Functional Expertise:	Technical (IT) / 技術職（IT）
Salary:	Negotiable
Location:	Tokyo
Job published:	2025-04-29
Job ID:	61279

Job Description

Hiring a Senior MLOps Engineer in Japan!

■ Senior MLOps Engineer

■ Company Overview

We combine AI and conversation design expertise to give brands the ability to easily create and integrate automated chat experiences, transforming any post, story, or ad into a personalized interactive experience.

■ Your Role and Responsibilities

As a Senior MLOps Engineer, you will deploy, optimize, and monitor LLMs in production environments. Your role will involve building and maintaining scalable pipelines, ensuring low-latency inference, and implementing best practices in monitoring and observability. You will also work with state-of-the-art tools like Hugging Face and MLflow to fine-tune models and integrate them into AI solutions.

Key Responsibilities

Model Deployment & Management:
• Develop and maintain scalable pipelines for deploying LLMs, focusing on efficient, low-latency inference.
• Use tools like Hugging Face and MLflow for seamless model integration and version control.

Monitoring & Observability:
• Implement comprehensive monitoring frameworks to track performance and reliability of models in production.
• Use advanced observability tools to proactively detect and address performance issues.

Infrastructure Optimization:
• Architect and optimize cloud and on-premise infrastructure to support large-scale LLM operations.

Collaboration & Documentation:
• Partner with AI engineers and data scientists to align on project objectives and deployment strategies.

■ Experience and Qualifications

• 5+ years of experience in MLOps, DevOps, or related fields, with a focus on deploying and managing LLMs or other large-scale machine learning models.
• Proven experience with tools like Hugging Face, MLflow, and containerization technologies (Docker, Kubernetes).
• Strong experience with cloud platforms (AWS, Azure, GCP) and infrastructure as code (Terraform).
• Hands-on experience in reducing inference latency and optimizing AI infrastructure.

Technical Skills:
• Proficiency in Python, with experience in ML libraries such as TensorFlow, PyTorch, and related frameworks.
• Expertise in CI/CD pipelines, version control (Git), and orchestration tools.
• Familiarity with Generative AI, prompt engineering, and deploying models at scale.

■ Good Reasons to Join

• Fully remote position within Japan
• Flexible working: Highly flexible
• Opportunity to work with a diverse team

■ Work Location

Japan

Details will be provided during the meeting.
