Structuring and Embedding Image Captions: the V.I.F. Multi-modal System

Vasconcelos, Cristina N.; Sá, Asla M.; Sá, Marcio I.; Carvalho, Paulo Cezar P.

View/Open

025-032.pdf (569.6Kb)

Date

2012

Author

Vasconcelos, Cristina N.

Sá, Asla M.

Sá, Marcio I.

Carvalho, Paulo Cezar P.

Pay-Per-View via TIB Hannover:

Try if this item/paper is available.

Metadata

Show full item record

Abstract

Within the context of historical photographic annotated collections, we observe the frequent occurrence of some subsets of important characters, usually described in captions. For many years, image captions were annotated using natural language texts intended to be read by humans. Today, the information retrieval of structured information is appealing and the migration of natural language captions to structured information is desirable in a variety of photographic collections. In this paper, we describe the Very Important Faces (V.I.F.) system, which is designed to graphically document the occurrence of distinguished characters within photographic collections and store this information in a structured format useful for retrieval purposes. The V.I.F. system implements face detection in the image data and detects proper names in previously inserted captions if any are present. The user matches names to faces throughout the software interface in order to produce a photo annotation that is stored considering structured information principles. Once the matching is done, an efficient verification tool is proposed, which helps the expert to review the annotation, taking advantage of such multi-modal databases. The concept of annotation maturity level is also introduced.

BibTeX

@inproceedings {10.2312:VAST:VAST12:025-032,
booktitle = {VAST: International Symposium on Virtual Reality, Archaeology and Intelligent Cultural Heritage},
editor = {David Arnold and Jaime Kaminski and Franco Niccolucci and Andre Stork},
title = {{Structuring and Embedding Image Captions: the V.I.F. Multi-modal System}},
author = {Vasconcelos, Cristina N. and Sá, Asla M. and Sá, Marcio I. and Carvalho, Paulo Cezar P.},
year = {2012},
publisher = {The Eurographics Association},
ISSN = {1811-864X},
ISBN = {978-3-905674-39-2},
DOI = {10.2312/VAST/VAST12/025-032}
}

URI

http://dx.doi.org/10.2312/VAST/VAST12/025-032

Collections

VAST12: The 13th International Symposium on Virtual Reality, Archaeology and Intelligent Cultural Heritage