| Project |
| Research Home >> |
Video Dense Information Grinding (VideoDIG)
|
|||
|
Video Dense Information Grinding (VideoDIG) -- Large-Scale Video Semantic Filtering System In this project, we propose a novel semantic routing/filtering mechanism to reduce the amount of transmission loads based on the semantic user profiles. In other words, the system shall only transmit those video shots or stories that are of interest to the end users. Figure 1 shows an example of concept filtering and semantic routing for large-scale video streaming system. In the system, we deploy concept filters hierarchically based on the semantic trees. For instance, if an end user is interested at the basket clips, then the processing elements would first filter out all shots that are not sport-event, and then classify video packets to baseball, basketball, hockey, soccer, tennis, etc. Using this semantic routing structure, processing loads for each nodes can be reduced and thus make the overall system scale for large streaming environments. We propose to use the complexity-accuracy curves to optimally choose operating points in this semantic routing scenario. We also propose a set of novel video features, that result in better performance, in terms of both speed and accuracy, than our previous generic video concept classifiers. We have built one hundred concept classification filters. Experiments on 154 hours of video streams validated the effectiveness of the proposed system. ![]() Demo -- Visual Concept Filters: We have tested these 100 concept classifiers on five data sets. IBM TRECVID team divided the NIST TRECVID 2003 Development set into four sets: 38 hours of ConceptTrain set for training, 24 hours of ConceptValidate, ConceptFusion1, and ConceptFusion2 sets for testing or multi-modality fusion. The NIST TRECVID 2003 Development set was manually annotated (Video Collaborative Annotation Forum). These 100 VideoDIG Visual Concept Filters were trained using the ConceptTrain set and tested on ConceptValidate, ConceptFusion1, ConceptFusion2 and the NIST TRECVID 2003 and 2004 Test sets. [with Quantative Avearge Precisions Evaluations] Classification Results on IBM Internal "Official" Test Set => ConceptFusion2 (12 hours) Classification Results on Other Test sets => ConceptFusion1 (6 hours), ConceptValidate (6 hours). [without Quantative Average Precision Evaluations] Classification Results on NIST TRECVID 2003 Test Set => ConceptTest (66 hours) Classification Results on NIST TRECVID 2004 Test Set => ConceptTest2 (64 hours, download "high-level" feature donation of these classifiers for TRECVID 2004.) [with Quantative Evaluation -- TREC2005 Development Sets (58.1 hours of video)] Classification Results on Sets => ConceptSelect05 (8 hours), ConceptFusion05 (10.2 hours), ConceptValidate05 (10 hours), ConceptEvaluate05 (28.2 hours) [without Quantative Average Precision Evaluations] Classification Results on NIST Official TRECVID 2005 Test Set => ConceptTest05 (84.7 hours) Publications: Ching-Yung Lin, Olivier Verscheure, and Lisa Amini, "Semantic Routing and Filtering fro Large-Scale Video Streams Monitoring", IEEE Intl. Conf. on Multimedia & Expo (ICME), Amsterdam, Netherlands, July 2005. Exploratory Stream Processing Systems Group, IBM T. J. Watson Research Center If you have any question, please contact: Ching-Yung Lin, Olivier Verscheure, or Lisa Amini. Last Updated: 9/17/2005 |
|||
| Privacy | Legal | Contact | IBM Home | Research Home | Project List | Research Sites | Page Contact | ||||