Team Introduction
TikTok DCC (Data Cycling Centre):
The team focuses on building data-driven solutions and enhancing content understanding for both internal stakeholders and external users, aiming to solve significant problems through a combination of human and direct content understanding
About the role:
A Data Understanding Specialist (Indonesian speaking) is a critical player at the junction of data collection, model development, and project success. Their main responsibility is to act as a bridge between machine learning engineers and labellers, ensuring the accurate application of structure to unstructured data for Indonesian content such as video, images, text, etc. supporting the building of world class machine learning solutions.
Responsibilities
Labelling rules clarification and training
- Define clear and unambiguous labelling rules for Indonesian-content in the Indonesian market to ensure high quality labelling output
- Simplify labelling rules to reduce contextual knowledge for labelling while providing training to Indonesian labellers if needed
- Ensure that labelling rules meet model training requirements for the Indonesian market
Quality improvement and assurance
- Monitor and review quality of labelled data for the Indonesian market
- Timely root-cause identification of quality issues for remediation
- Identify opportunities and areas of improvement to optimize delivery
Project management
- Deliver projects on time and on target flagging any key risks that arise
- Deep dive into issues that arise during project lifecycle with solutions that ensure delivery
- Manage Indonesian-speaking stakeholders to drive transparency across the data delivery pipeline