Job Title: Senior MLOps Engineer - RC
Job Type: Permanent
Employment Type: Full-time
Industry: Information Technology / IT
Specialization: Technical
Salary: Negotiable
Location: Tokyo
Date Posted: 2025-04-29
Job ID: 61279

Job Description

Hiring a Senior MLOps Engineer in Japan!

■ Senior MLOps Engineer

■ Company Overview

We combine AI and conversation design expertise to give brands the ability to easily create and integrate automated chat experiences, transforming any post, story, or ad into a personalized interactive experience.

■ Your Role and Responsibilities 

As a Senior MLOps Engineer, you will deploy, optimize, and monitor LLMs in production environments. Your role will involve building and maintaining scalable pipelines, ensuring low-latency inference, and implementing best practices in monitoring and observability. You will also work with state-of-the-art tools like Hugging Face and MLflow to fine-tune models and integrate them into AI solutions.

Key Responsibilities

Model Deployment & Management:
• Develop and maintain scalable pipelines for deploying LLMs, focusing on efficient, low-latency inference.
• Use tools like Hugging Face and MLflow for seamless model integration and version control.

Monitoring & Observability:
• Implement comprehensive monitoring frameworks to track performance and reliability of models in production.
• Use advanced observability tools to proactively detect and address performance issues.

Infrastructure Optimization:
• Architect and optimize cloud and on-premise infrastructure to support large-scale LLM operations.

Collaboration & Documentation:
• Partner with AI engineers and data scientists to align on project objectives and deployment strategies.

■ Experience and Qualifications

• 5+ years of experience in MLOps, DevOps, or related fields, with a focus on deploying and managing LLMs or other large-scale machine learning models.
• Proven experience with tools like Hugging Face, MLflow, and containerization technologies (Docker, Kubernetes).
• Strong experience with cloud platforms (AWS, Azure, GCP) and infrastructure as code (Terraform).
• Hands-on experience in reducing inference latency and optimizing AI infrastructure.

Technical Skills:
• Proficiency in Python, with experience in ML libraries such as TensorFlow, PyTorch, and related frameworks.
• Expertise in CI/CD pipelines, version control (Git), and orchestration tools.
• Familiarity with Generative AI, prompt engineering, and deploying models at scale.

■ Good Reasons to Join

• Fully remote position within Japan
• Highly flexible working arrangements
• Work with a diverse team

■ Work Location

Japan

Details will be provided during the meeting.