Text Annotation
Social Media Categorization for Supermajor Oil Company
The Challenge
Our client, one of the seven "supermajor" oil companies in the world, was looking for a scalable partner to review and classify nearly 2,000 tweets to diagnose the public perception of the company and utilize the insights to tailor internal and external communications. The project covered seven reputational drivers including products and services, innovation, workplace, integrity and transparency, citizenship, leadership, and performance. The main challenge was handling highly subjective content and displaying the findings to ensure our client could make actionable decisions based on the uncovered opinion-based insights.
• • • •The Solution• • • •
利用我们超过130万贡献者的全球社群数据资源,创博数据仅在两周时间内就迅速组建并培训出一支由11名具备相关技能和背景人员所组成的团队。 由专门的项目经理和协调员负责确保满足时间、培训、预算和所有其他细节要求;此外还有七名标注员和两名审核员负责完成分类工作。 我们通过自有标注平台将每条推文分配给两名不同的标注员,确保每条推文都能够得到多角度的分析。 在出现意见分歧的情况下,不同结果将分配给质量保证(QA)审核员。 QA审核员会对初始分类进行消歧,并结合推文内容与标注员的观点进行分析,依据客户提供的指南确定推文的类别归属。
Given the high subjectivity of the task, we made sure to be on target by categorizing and delivering a fraction of the dataset first, which included 250 tweets. After completing the pilot and discussing the outcome with the client, we further fine-tuned the guidelines and specified training instructions before going into the full production of 1,926 tweets. It is important to note that this review of the first 250 tweets provided a mutual opportunity to dissect the guidelines and reassess the categorization breakdown, providing an opportunity to alter the project early on so our client could obtain the results they desired in the end.
Our client not only received the final data annotated within eight weeks of project kickoff, but they also had the ability to view the results in a simple way, thanks to a customized dashboard we built catered to their preferences.
Following the final delivery, we received excellent feedback from our client noting the measurable shift in the performance of their model thanks to the labeled and classified dataset. We were proud to report a 100% acceptance score, indicating our client was fully satisfied with the quality of the dataset results.

DataForce has a global community of over 1,000,000 members from around the globe and linguistic experts in over 250 languages. DataForce is its own platform but can also use client or third-party tools. This way, your data is always under control.