Projects:Sketchy recognition

From DiVersions
Revision as of 15:53, 31 August 2019 by Michael Murtaugh (talk | contribs)
Jump to navigation Jump to search

Collections: Koninklijke Musea voor Kunst en Geschiedenis/MIM

Working sketch

Description

Bread, Nose, Kangaroo or Teddy Bear?

A photograph from the collection of the Museum of Musical Instrument is processed by a contour detector algorithm. The algorithm draws the lines it found on the image sequentially. While it is tracing the contours, another algorithm, a sketch detector, tries to guess what is being drawn. Is it bread? A kangaroo? It is a teddy bear.

Sketchy Recognition (working title) is an attempt to provoke a dialogue with, and between, algorithms, visitors and museum collections.

Cast:

  • Musical instruments: MIM collection, Brussels.
  • Line detector: The Hough algorithm in the OpenCV toolbox, originally developed to analyse bubble chamber photographs.
  • Sketch recognizer: an algorithm based on the research of Eitz, Hays and Alexa (2012), and the code and models by Jean-Baptiste Alayrac.
  • Data: from the hands of the many volunteers who contributed to Google's Quick, Draw! Dataset.
  • Special sauce, bugs and fixes: Michael and Nicolas

Installation

  • (single-board, ie Raspberry Pi) Computer
  • Camera
  • Drawing surface
  • (mini) projector and/or adjacent display

Interactive installation where participants / visitors can draw forms which are interpreted live by a closed-circuit computer vision (CCCV) system. Based on a "ready-made" model trained on sketches from one of 250 pre-determined categories. The model is also used to make connections between the visitors drawing and items from the collection of the Musical Intrument Museum (MIM).

bio

Nicolas Malevé is beeldend kunstenaar, computer programmeur en data-activist. Op dit moment woont en werkt hij tussen Brussel en Londen aan een onderzoek naar hoe en waarom machinale normen over ‘kijken’ in Computer Vision algoritmes worden geïmplementeerd.

Michael Murtaugh doet onderzoek naar community databases, interactieve documentaire en tools voor nieuwe vormen van online lezen en schrijven. Hij is als docent betrokken bij het Experimental Publishing Traject van de Media Design Master aan het Piet Zwart Instituut in Rotterdam.

Sources

  • MIM/Carmentis Saskia Willaerts
  • Contours / Algorithm (what is the algorithm in use in opencv?)
  • Sketch recognition database / programmer of model -- Using Jean-Baptiste Alayrac's pre-trained model

(Re)sources

Humans have used sketching to depict our visual world since prehistoric times. Even today, sketching is possibly the only rendering technique readily available to all humans. This paper is the first large scale exploration of human sketches. We analyze the distribution of non-expert sketches of everyday objects such as 'teapot' or 'car'. We ask humans to sketch objects of a given category and gather 20,000 unique sketches evenly distributed over 250 object categories. With this dataset we perform a perceptual study and find that humans can correctly identify the object category of a sketch 73% of the time. We compare human performance against computational recognition methods. We develop a bag-of-features sketch representation and use multi-class support vector machines, trained on our sketch dataset, to classify sketches. The resulting recognition method is able to identify unknown sketches with 56% accuracy (chance is 0.4%). Based on the computational model, we demonstrate an interactive sketch recognition system. We release the complete crowd-sourced dataset of sketches to the community.

Code

Working sketches

Early sketch

some "best of" links:

Rough notes (not for publication ;)

cf Saskia's story of misnaming an instrument. (The end of which was that the African museum contacted wanted not that the instrument be returned, but that the name be updated to reflect the fact that the name incorrectly referred to a larger class of instruments, and not the particular instrument in question)

How explicit do we need to be with our intentionality. Danger: Flatten the potential? Maybe keep it simple / straightforward

Meta data as interstitial frames introducing the sequences of images + sketch predictions.

Algorithms reading algorithms...