Search Relevance Rating, Online Data Rating
Online Data Rating and Validation for Multinational Technology Company
The Challenge
다국적 기술 기업인 이 고객사는 오디오, 이미지, 텍스트, 영상 데이터 전반에 걸쳐 검색 쿼리 및 결과를 검증하고 평가하기 위해 평가자들로 구성된 대규모 현지 인력을 찾고 있었습니다. 프로젝트의 범위가 전 세계적이었기 때문에, 고객사는 담당 팀에게 높은 유연성과 확장성을 바탕으로 하루 24시간 업무 대응이 가능하도록 요청했습니다. 또한 고객사는 담당 팀이 새로운 트렌드와 검색에 익숙하고 세부 사항에 대한 예리한 시각을 갖도록 세부적인 가이드라인과 품질 표준을 개략적으로 제시했습니다. 다양한 데이터 유형이 작업 범위에 포함되어 있었기 때문에, 이러한 대규모 평가 및 검증 이니셔티브는 전 세계적으로 150여 개의 프로젝트로 확대되었습니다.
• • • •The Solution• • • •
In the initial stages of the project, DataForce assembled a group of fewer than 50 raters with a growth of 20% daily, climbing to a total of 1,500 active and trained raters within six months. As the project grows with advanced specifications, DataForce continues to source an in-country workforce, providing a variety of project-specific training programs. Aside from onboarding, the team is in constant communication with the raters, providing a ticketing system for feedback and issues should they arise. Adhering to the guidelines set by our client, an intensive quality assurance (QA) process was established,aligned with our client’s expectation. The guidelines include several different layers of QA to assess the raters’ speed and fraud prevention measures (IP checks, resume authentication, etc.), all while ensuring bi-weekly payments and performance reporting for the community members.
Some examples of the projects include:
- Point of Interest: the confirmation of location-specific data (operating hours, address, etc.)-Search: the validation of search results, confirming both accurate and up-to-date results.
- Product Identification: the validation and comparison of various products such as clothing, accessories, and food, ensuring the search results match the link, product description, color, style, etc.
- Auto Complete: the validation of system prediction when writing a query involving an abbreviation or finishing a statement, confirming the system understands what it is being asked.
- Sentiment Analysis: reviewing and tagging text data with positive, negative, and neutral classification to train the AI system to the emotion behind the text.
- Podcast: the rater is provided a text-based synopsis of the audio file and they must review the text and listen to the podcast to confirm the copy matches what is said on the podcast.
- Audio Quality Rating: the evaluation of sound quality for text-to-speech models, confirming if the output sounds natural (high quality) or robotic (low quality).
- Offensive Content Moderation: mitigating unsuitable online content such as adult material, hate speech, and other detrimental motives through diligently identifying and assessing image, video, audio, and text data that should appear in various search results.
As this project advances, DataForce will continue to facilitate the enhancement of our client’s model by meticulously following the specific guidelines provided, all while assembling and training a network of geographically diverse raters across the world.

DataForce has a global community of over 1,000,000 members from around the globe and linguistic experts in over 250 languages. DataForce is its own platform but can also use client or third-party tools. This way, your data is always under control.