Analyse the Impact of Text Representation Methods for Iconclass Label Recommendation

  • Links:PDF
  • ICONCLASS is the de facto global standard for the subject classification of cultural heritage content. It consists of alphanumeric “notations” which document the subjects in images in a language independent way. Each one of these notations has a set of textual labels to describe iconographic subjects in natural language.

    In your thesis, you will have the chance to evaluate different representation methods for text, such as word embeddings, in the task of retrieving correct ICONCLASS codes from textual queries.

    You will gain valuable experience in NLP and Information Retrieval techniques as well as provide fruitful insights on how to improve the current ICONCLASS text search system.