Jump to content
 







Main menu
   


Navigation  



Main page
Contents
Current events
Random article
About Wikipedia
Contact us
Donate
 




Contribute  



Help
Learn to edit
Community portal
Recent changes
Upload file
 








Search  

































Create account

Log in
 









Create account
 Log in
 




Pages for logged out editors learn more  



Contributions
Talk
 



















Contents

   



(Top)
 


1 About Toloka  



1.1  Origin of the platform's name  





1.2  Types of tasks and scope of results  



1.2.1  Machine learning  





1.2.2  Audit and marketing research  









2 Users  





3 Requesters  





4 Toloka Research  





5 References  





6 External links  














Toloka






Русский
 

Edit links
 









Article
Talk
 

















Read
Edit
View history
 








Tools
   


Actions  



Read
Edit
View history
 




General  



What links here
Related changes
Upload file
Special pages
Permanent link
Page information
Cite this page
Get shortened URL
Download QR code
Wikidata item
 




Print/export  



Download as PDF
Printable version
 
















Appearance
   

 






From Wikipedia, the free encyclopedia
 


Toloka

Type of site

Crowdsourcing, Microwork
Available inEnglish, Russian, Spanish, French, Arabic etc.[1]
Founded2014; 10 years ago (2014)
Country of originRussia, Switzerland[2][3]
OwnerYandexInc
Founder(s)Olga Megorskaya
URLtoloka.ai

Toloka is a crowdsourcing platform and microtasking project launched by Yandex in 2014[2] to quickly markup large amounts of data, which are then used for machine learning and improving search algorithms.[4] The proposed tasks are usually simple and do not require any special training from the performer.[2] Most of the tasks are designed to improve algorithms that are used by modern technologies spanning self-driving vehicles, smart web searches, advanced voice assistants and e-commerce.[citation needed] Upon completion of each task the performer receives a reward based on the volume of images, videos, and unstructured text.[3] The service has two app versions – for Android and iOS.

About Toloka

[edit]

Origin of the platform's name

[edit]

Atoloka used to be a form of mutual assistance among villagers of Russia, Ukraine, Belarus, Estonia, Latvia, and Lithuania. It was organized in villages to perform urgent work requiring a large number of workers, such as harvesting, logging, building houses, etc. Sometimes a toloka was used for community works (building churches, schools, roads, etc.).[3]

Types of tasks and scope of results

[edit]

Data labeling helps to improve search quality and effectively tune result ranking algorithms of search engine.[3]

Machine learning

[edit]

To train machine learning algorithm requires labeling of large volumes with positive and negative examples of data. Toloka performers receive tasks to determine the presence or absence of objects defined by a computer in a content item.[3][5] In tasks of another type, a context of the dialogue is given and a scale is proposed by which it is necessary to assess whether a chatbot's answer in this context is appropriate, interesting, and so on.[6] Another group of tasks in Toloka is translation verification performed by collecting examples of translations from different performers.[citation needed]

Audit and marketing research

[edit]

Checking the quality of the online store, delivery service, writing reviews about products and services. Such audits allow to control the quality of the service and identify weaknesses, over which work will be carried out in the future to improve and eliminate the identified problems.[citation needed]

Users

[edit]

Toloka users, also known as performers or tolokers, are people who earn money by completing system testing and improvement tasks on the Toloka crowdsourcing platform.[citation needed] In 2018, more than a million people participated in Toloka projects. Most performers are young people under 35 (usually engineering students or mothers on maternity leave). Performers mainly see Toloka as an additional source of income, but many of them note that they like to do meaningful work and clean up the internet. As of March 2022, Toloka has 245,000 monthly active performers in 123 countries. Tolokers generates over 15 million labels per day.[1][7]

Requesters

[edit]

All tasks in Toloka are placed by requesters. The main uses of Toloka are data collection and processing for machine learning, speech technology, computer vision, smart search algorithms, and other projects, as well as content moderation, field tasks, optimization of internal business processes.[3]

Toloka Research

[edit]

In May 2019, the service's team started publishing datasets for non-commercial and academic purposes to support the scientific community and attract researchers to Toloka. Such datasets are addressed to researchers in different directions like linguistics, computer vision, testing of result aggregation models, and chatbot training.[8] Toloka research has been showcased at a range of conferences, including the Conference on Neural Information Processing Systems (NeurIPS),[9] the International Conference on Machine Learning (ICML)[10] and the International Conference on Very Large Data Bases (VLDB).[11]

References

[edit]
  1. ^ a b "It helps me learn and earn: Toloka reports results of a global survey of Tolokers in 2022". toloka.ai. 2022-03-23. Retrieved 2022-09-16.
  • ^ a b c "Toloka rolls out 20000 new jobs opportunities for Ghanaians". Ghana Education News. 2021-06-15. Retrieved 2022-09-17.
  • ^ a b c d e f Alex Woodie (2021-04-27). "Toloka Expands Data Labeling Service". Datanami. Retrieved 2022-09-17.
  • ^ Daria Baidakova (2021-09-29). "Data-Labeling Instructions: Gateway to Success in Crowdsourcing and Enduring Impact on AI". Data Science Central. Retrieved 2022-09-17.
  • ^ Frederik Bussler (2021-12-07). "Data labeling will fuel the AI revolution". VentureBeat. Retrieved 2022-09-17.
  • ^ Kumar Gandharv (2021-04-29). "Why Are Data Labelling Firms Eyeing Indian Market?". Analytics India Magazine. Retrieved 2022-09-17.
  • ^ "Olga Megorskaya/Toloka: Practical Lessons About Data Labeling". TheSequence. 2021-10-27. Retrieved 2022-09-16.
  • ^ "Toloka to present new dataset at prestigious Data-Centric AI workshop launched by Andrew Ng". The AI Journal. Retrieved 2022-09-17.
  • ^ "Toloka to present new dataset at prestigious Data-Centric AI workshop launched by Andrew Ng". FE News. 2021-11-18. Retrieved 2022-02-10.
  • ^ "Toloka". icml.cc. Retrieved 2022-02-10.
  • ^ "VLDB 2021 Challenge". crowdscience.ai. Retrieved 2022-02-10.
  • [edit]
    Retrieved from "https://en.wikipedia.org/w/index.php?title=Toloka&oldid=1228414927"

    Categories: 
    Yandex
    Crowdsourcing
    Human-based computation
    Social information processing
    Web services
    Hidden categories: 
    Articles with short description
    Short description is different from Wikidata
    All articles with unsourced statements
    Articles with unsourced statements from January 2024
    Articles with unsourced statements from June 2024
    Official website different in Wikidata and Wikipedia
     



    This page was last edited on 11 June 2024, at 03:21 (UTC).

    Text is available under the Creative Commons Attribution-ShareAlike License 4.0; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization.



    Privacy policy

    About Wikipedia

    Disclaimers

    Contact Wikipedia

    Code of Conduct

    Developers

    Statistics

    Cookie statement

    Mobile view



    Wikimedia Foundation
    Powered by MediaWiki