How we’re using machine learning to visually enrich Wikidata – Wikimedia Blog

ab1630's bookmarks 2018-03-15

Summary:

"Only 2.5 million of 45 million Wikidata items have an image attached. A new algorithm helps people find relevant and high-quality images to add to Wikidata items.

Wikidata is a multilingual project by design. The project allows contributors to add structured knowledge in every human language, and acts as a central repository of structured data for Wikipedia and its sister projects. As powerful tools to share knowledge without language barriers, images are very important within Wikidata.

Images can also help illustrate the content of an item in a language-agnostic way to external data consumers. However, a large proportion of Wikidata items lack images: for example, as of today, more than 3.6 million Wikidata items are about humans but only 17 percent of them have an image. More generally, only 2.5 million of 45 million Wikidata items have an image attached.

We recently started a research project to help people find relevant images to add to Wikidata items. The project uses algorithmic image analysis and the richness of linked open data to discover and recommend relevant, high-quality, free-licensed pictures for Wikidata items that don’t already have an image attached...."

Link:

https://blog.wikimedia.org/2018/03/14/machine-learning-visually-enriching-wikidata/

From feeds:

Open Access Tracking Project (OATP) » ab1630's bookmarks

Tags:

oa.new oa.images oa.wikidata oa.tools oa.code4oa

Date tagged:

03/15/2018, 17:12

Date published:

03/15/2018, 13:12