Job Description
Hiring a Senior MLOps Engineer in Japan!
■ Senior MLOps Engineer
■ Company Overview
We combine AI and conversation design expertise to give brands the ability to easily create and integrate automated chat experiences, transforming any post, story, or ad into a personalized interactive experience.
■ Your Role and Responsibilities
As a Senior MLOps Engineer, you will deploy, optimize, and monitor LLMs in production environments. Your role will involve building and maintaining scalable pipelines, ensuring low-latency inference, and implementing best practices in monitoring and observability. You will also work with state-of-the-art tools like Hugging Face and MLflow to fine-tune models and integrate them into AI solutions.
Key Responsibilities
Model Deployment & Management:
• Develop and maintain scalable pipelines for deploying LLMs, focusing on efficient, low-latency inference.
• Use tools like Hugging Face and MLflow for seamless model integration and version control.
Monitoring & Observability:
• Implement comprehensive monitoring frameworks to track performance and reliability of models in production.
• Use advanced observability tools to proactively detect and address performance issues.
Infrastructure Optimization:
• Architect and optimize cloud and on-premises infrastructure to support large-scale LLM operations.
Collaboration & Documentation:
• Partner with AI engineers and data scientists to align on project objectives and deployment strategies.
■ Experience and Qualifications
• 5+ years of experience in MLOps, DevOps, or related fields, with a focus on deploying and managing LLMs or other large-scale machine learning models.
• Proven experience with tools like Hugging Face, MLflow, and containerization technologies (Docker, Kubernetes).
• Strong experience with cloud platforms (AWS, Azure, GCP) and infrastructure as code (Terraform).
• Hands-on experience in reducing inference latency and optimizing AI infrastructure.
Technical Skills:
• Proficiency in Python, with experience in ML libraries such as TensorFlow, PyTorch, and related frameworks.
• Expertise in CI/CD pipelines, version control (Git), and orchestration tools.
• Familiarity with Generative AI, prompt engineering, and deploying models at scale.
■ Good Reasons to Join
• Fully remote position within Japan
• Highly flexible working arrangements
• Work with a diverse team
■ Work Location
Japan
Details will be provided during the meeting.