Wikipédia:Projetos/Rotulagem/Sobre

Wiki labels is both the name for a software suite and a WikiProject. In this WikiProject, we produce datasets of labeled wiki artifacts and the software suite is designed to make that work easier. The name an be interpreted a either a noun

We work together on Wikipedia to produce wiki labels for important data.

or a as a very (ala Wiki loves *)

In order to get the data we need, wiki labels edit quality.

Objetivos e escopo editar

 
Labels logo

Our goal in this project is to produce labeled datasets for pressing needs of the Wikipedia community. Labeled datasets have a variety of uses inclusing research (e.g. qualitative analyses of newcomer quality[1] and editor interactions[2]) and the the development of advance wiki tools (e.g. the models used by User:ClueBot NG and WP:STiki). Generally, gathering these types of datasets is difficult as it requires substantial investment of time and effort by a small group of people to "hand-code" a suitably large dataset.

We are concerned with (1) identifying opportunities to produce important labeled datasets, (2) distributing the work as broadly as possible and (3) making it easy and efficient to "hand-code" large datasets. See our list of campaigns for what we're up to recently. If you would like to help out, sign the member list. If you have an idea for a labeled dataset you'd like to produce, inquire on the talk page.

Como ajudar? editar

Há algumas formas de ajudar neste projeto:

Rotulagem
A essência deste projeto é a rotulagem de artefatos na Wikipédia. Para a maioria das campanhas de rotulagem, será necessário rotular um número bem grande de observações para que o conjunto de dados possa ter alguma utilidade. Assim, um dos objetivos deste projeto é distribuir de forma eficaz este tipo de tarefa. Se estiver interessado em contribuir, inclua o seu nome na lista de participantes
Programação
Atividades como a correção de bugs, implementação de novos recursos e o aprimoramento do desempenho do sistema. Pull requests são bem-vindos! Acesse o repositório.
Manutenção
O carregamento de campanhas, ações para lidar com problemas no sistema e o auxílio aos novos participantes interessados em tarefas de rotulagem. Se estiver interessado em ajudar nas tarefas de manutenção do Wiki labels, entre em contato com EpochFail ou Helder.

Projetos relacionados editar

Serviço de pontuação de edições editar

 
Revision scoring logo

Many of Wikipedia's most powerful tools rely on machine classification of edit quality. In this project, we'll construct a public queryable API of machine classified scores for revisions. It's our belief that by providing such a service, we would make it much easier to build new powerful wiki tools and extend current tools to new wikis. In order to build powerful machine classifiers, we must start with high quality labeled data. That's where Wiki labels comes in. See WP:Labels/Edit quality.

 
ORES logo

The primary way that wiki tool developers will take advantage of this project is via a restful web service and scoring system we call ORES (Objective revision evaluation service). ORES provides a web service that will generate scores for revisions on request. For example, http://ores.wmflabs.org/scores/enwiki?revids=34854258&models=reverted asks for the score of the "reverted" model for revision #34854258 in English Wikipedia.

Referências editar

  1. Halfaker, A., Geiger, R. S., Morgan, J. T., & Riedl, J. (2012). The rise and decline of an open collaboration system: How Wikipedia’s reaction to popularity is causing its decline. American Behavioral Scientist, 0002764212469365. summary full paper
  2. m:Grants:IEG/Editor_Interaction_Data_Extraction_and_Visualization