Audio Visual Speech Technologies

Multimedia Mining via Linguistic Associations


This AR project started as a collaboration between the Audio-Visual Speech Technologies Group and the Pervasive Media Management Group.

Our goal is to build a trainable system for detecting semantics in multimedia content. Specifically:

  • Given
    • Concept vocabulary: Events, Objects, Static scenes
    • Concept definition: Audio-visual constituents and their spatial and temporal relationships
    • Concept illustration: Examples or user specification
  • Build
    • Statistical models from low-level and semantic features to detect concepts 
    • Large, extensible inventory of low-level and semantic features
  • Detect
    • Temporal events, Static scenes, Objects in multimedia content
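
The Given / Build / Detect outline above can be illustrated with a minimal sketch. It is not the project's actual method: the page says only "statistical models", so the diagonal-Gaussian likelihood-ratio detector below is an assumed stand-in, and the class name, the concept name "outdoor", the two features, and all numbers are hypothetical.

```python
import math
from statistics import mean, pvariance


class GaussianConceptDetector:
    """Hypothetical per-concept detector (an assumption, not the project's model).

    Fits one diagonal Gaussian to positive examples and one to negative
    examples, then scores new feature vectors by log-likelihood ratio.
    """

    def __init__(self, min_var=1e-3):
        self.min_var = min_var  # variance floor to avoid division by zero
        self.models = {}        # label -> list of (mean, variance) per feature

    def fit(self, features, labels):
        # "Build": estimate per-feature mean and variance for each class.
        for label in (True, False):
            rows = [f for f, y in zip(features, labels) if y == label]
            self.models[label] = [
                (mean(col), max(pvariance(col), self.min_var))
                for col in zip(*rows)
            ]
        return self

    def _log_likelihood(self, x, label):
        return sum(
            -0.5 * (math.log(2 * math.pi * var) + (v - mu) ** 2 / var)
            for v, (mu, var) in zip(x, self.models[label])
        )

    def score(self, x):
        # Positive score: the concept is more likely present than absent.
        return self._log_likelihood(x, True) - self._log_likelihood(x, False)

    def detect(self, x, threshold=0.0):
        # "Detect": threshold the log-likelihood ratio.
        return self.score(x) > threshold


# "Given": a one-concept vocabulary illustrated by labeled examples.
# Made-up 2-D features, e.g. [fraction of sky-colored pixels, audio energy].
train_x = [[0.9, 0.2], [0.8, 0.3], [0.85, 0.25],
           [0.1, 0.7], [0.2, 0.8], [0.15, 0.75]]
train_y = [True, True, True, False, False, False]

detectors = {"outdoor": GaussianConceptDetector().fit(train_x, train_y)}
```

Calling `detectors["outdoor"].detect(feature_vector)` then returns a yes/no decision for one shot or frame. Keeping one detector per concept means the vocabulary can grow by adding entries to the dictionary without retraining the existing detectors.
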
Related:

Members:

Audio-Visual Speech Technologies:
  • Bill Adams
  • Giridharan Iyengar
  • Harriet Nock
  • Chalapathy Neti (Manager, AVSTG)

Pervasive Media Management:
  • Ching-Yung Lin
  • Milind Naphade
  • John R. Smith (Manager, PMM)

Presentations and Publications: