Job Description
Career Opportunity for a Data Scientist (Creative Vision) in Japan!
■ Data Scientist (Creative Vision)
■ Company Overview
Japan-based advanced AI company focused on building next-generation intelligent systems. Backed by a strong global technology group, it works at the intersection of cutting-edge research and real-world application, developing scalable AI solutions with long-term impact.
■ Your Role and Responsibilities
● Design and operate large-scale multi-modal data pipelines (ingestion, deduplication, filtering, versioning)
● Build data APIs and high-throughput loaders (streaming, caching, sampling)
● Develop and manage captioning and annotation workflows, including multilingual support
● Oversee annotators, gold sets, QA metrics, and quality dashboards
● Curate and verify datasets using CLIP/VLM-assisted captioning
● Perform data quality control (duplicate detection, clustering, policy filtering such as NSFW/PII)
● Balance datasets across domains and regions; evaluate dense captions and synthetic data
● Conduct data ablation studies and create internal research reports
● Collaborate with research and product teams; define reusable schemas and SLAs
■ Experience and Qualifications
● Experience with large-scale data infrastructure and multi-modal datasets
● Strong background in data processing, curation, and annotation workflows
● Knowledge of data quality evaluation, policy filtering, and safety considerations
● Research-oriented mindset with cross-functional collaboration experience
■ Additional Preferred Qualifications
● Familiarity with CLIP metrics, aesthetic/safety evaluation, and test set management
● Knowledge of data governance, licensing, deletion workflows, and NSFW tracing
■ Good Reasons to Join
● Full remote work possible within Japan
■ Work Location
Tokyo, Japan
Details will be provided during the meeting.