Table of Contents
[ The numbers following the paper titles are the page numbers in the Proceedings. ]
Volume I Monday
MA0 Multimedia Education I (R)
Time & Place: 9:45am - 12:05pm, Beekman Parlor
MA0.01 Lectern: A Digital Desk System for Video-Free Course Lecture Capturing and Playback 3
Tzi-cker Chiueh and Peitao Deng, State University of New York at Stony Brook, USA
MA0.02 Multimedia Linguistic Learning 7
E. Gentile, P. Plantamura, and V. L. Plantamura, Università degli Studi di Bari, Italy
MA0.03 Virtual Microphones for Multichannel Audio Applications 11
C. Kyriakakis and A. Mouchtaris, University of Southern California, USA
MA0.04 Multi-modal Interaction in the Age of Information Appliances 15
Stéphane H. Maes and T. V. Raman, IBM T.J. Watson Research Center, USA
MA0.05 An Educational-Oriented Framework for Building On-Line Courses using XML 19
Florin Bota, University of Cluj, Romania; Laura Farinetti, Politecnico di Torino, Italy; Anca Rarau, Technical University of Cluj, Romania
MA0.06 A Web-Based CBIR-Assisted Learning Tool for Radiology Education
Anytime and Anyplace 23
C. R. Shyu, A. C. Kak, C. E. Brodley, C. Pavlopoulou, and M. F. Chyan, Purdue University, USA; L. S. Broderick, University of Wisconsin Hospital, USA
MA0.07 Complete System for Distance Learning over IP 27
Javier Durán-de-Jesus, Juan José Villacorta-Calvo, and Alberto Izquierdo-Fuente,
Universidad de Valladolid, Spain
MA1 Collaborative Networking Applications (R)
Time & Place: 9:45am - 12:05pm, Sutton North
MA1.01 An Efficient Near-Video-on-Demand System for Broadband Residential ADSL Network 33
Chi-Chien Hsueh, Wen-Cheung Cheng, and Chi-Shi Liu, Chungwa Telecom, Taiwan
MA1.02 A Standalone Video Communication System for Wire and Wireless Communications 37
HonCheong Ng, HaiBin Huang, Xiao Lin, HockLye Toh, Susanto Rahardja, Hui Lan, and Xiang Chen, Nanyang Technological University, Singapore
MA1.03 A Cooperative Playback System for On-Demand Multimedia Sessions over Internet 41
G. Fortino and L. Nigro, Università della Calabria, Italy
MA1.04 Protocol for Collaborative Multimedia Presentations 45
Eenjun Hwang, Ajou University, Korea; B. Prabhakaran, National
University of Singapore, Singapore
MA1.05 A Universal Distribution Protocol for Video-on-Demand 49
Jehan-François Pâris, University of Houston, USA; Steven W. Carter and Darrell D. E. Long, University of California, USA
MA1.06 Efficient Network Management for Collaborative Services and Application Development 53
Kevin H. Liu, Telcordia Technologies Inc., USA; Vassilios Th. Tsaoussidis, SUNY at
Stony Brook, USA
MA1.07 Dynamic Object Markers in a Collaborative Environment for Video Content Discussion 57
Candemir Toklu, Thomas Fisher and Shih-Ping Liou, Siemens Corporate Research, USA
MA2 Web and E-Commerce (R)
Time & Place: 9:45am - 12:05pm, Sutton Center
MA2.01 Reduction of Blocking Artifacts for Low Bit-Rate Video Coding using
Regularized Dequantization 63
S. Moon-Ho Song, Gunho Lee, Sanghoon Sull, and Sung-Jea Ko, Korea University,
South Korea
MA2.02 Efficient Representation and Streaming of XML Content over the Internet Medium 67
Marc Girardot and Neel Sundaresan, IBM Almaden Research Center, USA
MA2.03 Talking Heads and Synthetic Speech: An Architecture for Supporting Electronic Commerce 71
Jörn Ostermann and David Millen, AT&T Labs - Research, USA
MA2.04 On Feasibility of MPEG-4 for Mulimedia Integration for E-Commerce 75
A. Puri, R. L. Schmidt, and Q. Huang, AT&T Labs - Research, USA
MA2.05 Performance Evaluation of an Interactive Web-Based Multimedia Document System with Streaming Media 80
H. Fahmi, M. Latif, and A. Ghafoor, Purdue University, USA; P. Liu and L. Hsu, Siemens Corporate Research, USA
MA2.06 Internet Course Delivery Making it Easier and More Effective 84
D. Anderson, L. Harvel, and M. Hayes, Georgia Institute of Technology, USA; Y. Ishiguro, NEC Corp., Japan; J. Jackson, Georgia Institute of Technology, USA; M. Pimentel, Universidade de São Paolo, Brazil
MA2.07 E-Commerce Direct Marketing using Augmented Reality 88
Xiang Zhang, Nassir Navab, and Shih-Ping Liou, Siemens Corporate Research, USA
MA3 Multimedia Codec I (R)
Time & Place: 9:45am - 12:05pm, Beekman Parlor
MA3.01 An Advanced Center Biased Three Step Search Algorithm for Motion Estimation 95
Humaira Nisar and Tae-Sun Choi, Kwangju Institute of Science and Technology, Korea
MA3.02 Multimedia Image Coding using Adaptive Integer-to-Integer Wavelet Transforms 99
Subhasis Saha and Rao Vemuri, University of California Davis, USA
MA3.03 Software Optimization of H.263 Video Encoder on Pentium Processor with
MMX Technology 103
Pohsiang Hsu and K. J. Ray Liu, University of Maryland at College Park, USA
MA3.04 Region of Interest Coding for Low Bit Rate Image Transmission 107
O. Déforges and J. Ronsin, INSA, France
MA3.05 Simplified EZW Image Coder with Residual Data Transmission 111
Tanzeem Muzaffar and Tae-Sun Choi, Kwangju Institute of Science and Technology, Korea
MA3.06 PBS: A Predictive Block Sampling Algorithm for Desktop Multimedia Video Applications 115
Joseph Pasquale, Tom Nguyen, and Jon Kay, University of California, San Diego, USA
MA3.07 Efficient Multi-layer Coding and Encryption of MPEG Video Streams 119
Ali Saman Tosun and Wu-chi Feng, The Ohio State University, USA
MA4 Image Retrieval I (R)
Time & Place: 9:45am - 12:05pm, Regent Parlor
MA4.01 Image Retrieval Using Blob Histograms 125
Richard J. Qian, Peter J. L. van Beek, and M. Ibrahim Sezan, Intel Labs, USA
MA4.02 Indexing and Retrieval Scheme of the Image Database based on Color and Spatial Relations 129
Timothy K. Shih, Ching-Sheng Wang, Anthony Y. Chang, and Chuan-Ho Kao,
Tamkang University, Taiwan
MA4.03 Detection of Human Faces using Skin Color and Eyes 133
Jeonghee Park, Jungwon Seo, Dongun An, and Seongjong Chung, Chonbuk National University, South Korea
MA4.04 JaNeT: A Framework for Flexible Web-Content Retrieval 137
F. Bergenti, A. Poggi, and M. Somacher, Università degli Studi di Parma, Italy
MA4.05 Image Retrieval by Shape: A Comparative Study 141
Maytham Safar, Cyrus Shahabi, and Xiaoming Sun, University of Southern California, USA
MA4.06 Compass: An Image Retrieval System for Distributed Databases 145
R. Brunelli and O. Mich, ITC-irst, Italy
MA4.07 The ICOR Framework: A Top-Down Approach to Media Indexing and Retrieval 149
Ivica Rimac, Stephan Fischer, and Ralf Steinmetz, Darmstadt University of
Technology, Germany
MA5 Multimedia Authoring / Virtual Reality (P)
Time & Place: 9:45am - 12:05pm, Sutton Corridor
MA5.01 Variorum: A Multimedia-Based Program Documentation System 155
Tzi-cker Chiueh, Wei Wu, and Lap-Chung Lam, State University of New York at
Stony Brook, USA
MA5.02 Content-Based Browsing and Editing of Unstructured Video 159
Giridharan Iyengar, IBM T.J. Watson Research Center, USA; Andrew B. Lippmann, MIT Media Laboratory, USA
MA5.03 Design of Video Caption Markup Language VCML and Development of VCML Player 163
Katsuyuki Watanabe, Naohide Fukada, and Masahide Sugiyama, The University of Aizu, Japan
MA5.04 Tightly Coupling Authoring and Evaluation in an Integrated Tool to Support Iterative
Design of Educational Hypermedia 167
Selma Holmquist, Universidad de Los Andes, Venezuela; N. Hari Narayanan, Auburn University, USA
MA5.05 ProtoGMI MusicBrush - An Exercise in General Multimedia Instrument Interface Design 171
Timothy Chen, Kazushi Nishimoto, and Kenji Mase, ATR Media Integration & Communications Research Laboratories, Japan
MA5.06 MPEG-Pro: An Authoring System for MPEG-4 with Temporal Constraints and Template Guided Editing 175
Souhila Boughoufalah, Jean-Claude Dufourd, and Frederic Bouilhaguet, ENST Paris, France
MA5.07 Networked Virtual Environments for the Web: The WebTalk-I &
WebTalk-II Architectures 179
Thimoty Barbieri, HyperMedia Open Center, Italy
MA5.08 VGuide: Design and Performance Evaluation of a Synchronous Collaborative
Virtual Reality Application 183
J. M. Arango and P. K. McKinley, Michigan State University, USA
MA5.09 Architecture and Mechanisms of a Web-Based Video Data Management System 187
Shermann Sze-Man Chan and Qing Li, City University of Hong Kong, Hong Kong
MA5.10 Distributed Virtual Reality Authoring Interfaces for the WWW 191
I. Varlamis and M. Vazirgiannis, Athens University of Economics & Business, Greece;
I. Lazaridis, University of California, Irvine, USA
MA5.11 On the Choice of Tactile Code 195
Danilo P. Mandic and Richard Harvey, University of East Anglia, United Kingdom; Djemal H. Kolonic, University of Banjaluka, Bosnia-Herzegovina
MA5.12 Unified Multiple Media Interface for Robot Teleoperation 199
Tamhant Jain, Sambit K. Dash, Nishant Agrawal, Susmit Sen, and Amitabha Mukerjee, I.I.T. Kanpur, India
MA5.13 System Aspects of Copy Management for Digital Video 203
Jean-Paul Linnartz, Joop Talstra, Ton Kalkar, and Maurice Maes, Philips Research,
The Netherlands
MA5.14 Interactive Artificial Life based on Behavior and Perception in a Virtual Environment 207
Hyun Seung Yang, Hyun-jin Park, and Yong-jin Cho, Korea Advanced Institute of Science and Technology, Korea
MP0 Multimedia Codec II (R)
Time & Place: 2:15pm - 3:35pm, Sutton South
MP0.01 An Efficient Low-Bit Rate Motion Compensation Technique Based on Quadtree 213
Hanan A. Mahmoud and Magdy Bayoumi, University of Louisiana, USA
MP0.02 Selective Requantization for Transcoding of MPEG Compressed Video 217
Hani Sorial and William E. Lynch, Concordia University, Canada; André Vincent, Communications Research Centre, Canada
MP0.03 Fast Dihedral Symmetry Operations on Digital Images in the Compressed Domain 221
Viresh Ratnakar, Bhaskaran Vasudev, and Victor Ivashin, Epson Research and
Development, Inc., USA
MP0.04 Compressed Domain MPEG-2 Video Editing 225
Kai Wang and John W. Woods, Rensselaer Polytechnic Institute, USA
MP1 Multimedia Authoring (R)
Time & Place: 2:15pm - 3:35pm, Sutton North
MP1.01 Systematic Approach for Creating Animated Character Interfaces 231
Izumi Kohno, Shujun Yoshizaka, and Shin'ichi Uwakubo, NEC Corporation, Japan
MP1.02 Extending Databases to Support Image Editing 235
Greg Speegle, Allen M. Gao, and Shaowen Hu, Baylor University, USA; Le Gruenwald, Univeristy of Oklahoma, USA
MP1.03 Automatic Techniques for Insertion of Three-Dimensional Objects into a Video Sequence 239
Satyan R. Coorg, IBM T.J. Watson Research Center, USA
MP2 Audio Processing in Multimedia I (R)
Time & Place: 2:15pm - 3:35pm, Sutton Center
MP2.01 Selective Signal Cancellation for Multiple-Listener Audio Applications:
An Information Theory Approach 245
MP2.02 Towards Efficient and Scalable Speech Compression Schemes for Robust Speech
Recognition Applications 249
N. Srinivasamurthy and A. Ortega, USC, USA; Q. Zhu and A. Alwan, UCLA, USA
MP2.03 A Multiple Input Single Output Model for Rendering Virtual Sound Sources in Real Time 253
Panayiotis G. Georgiou and Chris Kyriakakis, University of Southern California, USA
MP2.04 A New Communication Paradigm: Action-to-Speech 257
Masanobu Abe and Tsubasa Shinozaki, NTT Cyber Space Labs, Japan
MP3 Streaming Video (R)
Time & Place: 2:15pm - 3:35pm, Sutton South
MP3.01 Optimal Streaming of Layer-Encoded Multimedia Presentations 263
David A. Turner and Keith W. Ross, Institut Eurécom, France
MP3.02 Segment Reencoding of Buffer Constrained Variable Bit Rate Video Streams 267
Larry Lu, Krishna Ratakonda, Rajesh Rajagopalan, Jack Kouloheris, and Cesar Gonzales, IBM T.J. Watson Research Center, USA
MP3.03 Streaming Video with Optimized Reconstruction-Based DCT 271
Xiao Su and Benjamin W. Wah, University of Illinois at Urbana-Champaign, USA
MP3.04 Transmission of Streaming Video over an EGPRS Wireless Network 275
Kapil Chawla, Zhimei Jiang, Xiaoxin Qiu, and Amy Reibman, AT&T Labs Research, USA
MP4 User Interface I (R)
Time & Place: 2:15pm - 3:35pm, Regent Parlor
MP4.01 Hand Gesture Recognition of a Mobile Device User 281
Vesa-Matti Mäntylä, Technical Research Centre of Finland, Finland; Jani Mäntyjärvi and Tapio Seppänen, University of Oulu, Finland; Esa Tuulari, Technical Research Centre of Finland, Finland
MP4.02 Gesture-Enhanced Information Retrieval and Presentation in a Distributed
Learning Environment 285
Shi-Kuo Chang, University of Pittsburgh, USA; Tsuhan Chen, Carnegie Mellon University, USA; C. S. Li, IBM Watson Research Center, USA
MP4.03 Object-Based Visual Effects by using Multi-focus Images and Its Real-Time Implementation 289
Kiyoharu Aizawa, Akira Kubota, and Conny Riani Gunadi, University of Tokyo, Japan
MP4.04 Multimodal Output for a Conversational Telephony System 293
M. Mast, C. Günther, S. Kunzmann, and T. Roß, IBM Germany, Germany
MP5 Video / Image Retrieval II (P)
Time & Place: 2:15pm - 3:35pm, Sutton Corridor Poster
MP5.01 Incorporate Discriminant Analysis with EM Algorithm in Image Retrieval 299
Qi Tian, Ying Wu, and Thomas S. Huang, University of Illinois at Urbana-Champaign, USA
MP5.02 An Architecture for Content-Based Retrieval of Remote Sensing Images 303
Luis M. del Val Cura, Neucimar Jeronimo Leite, and Claudia Bauzer Medeiros, Campinas State University, Brazil
MP5.03 A Modified Zernike Moment Shape Descriptor Invariant to Translation, Rotation and
Scale for Similarity-Based Image Retrieval 307
Hae-Kwang Kim, Sejong University, Korea; Jong-Deuk Kim, Dong-Gyu Sim, and Dae-Il Oh, Hyundai Electronics Industries Co., Korea
MP5.04 Color Based Retrieval and Recognition 311
Nicu Sebe and Michael S. Lew, Leiden Institute of Advanced Computer Science,
The Netherlands
MP5.05 Spatial Match Representation and Retrieval for Supporting Ranking in
Iconic Image Databases 315
Jae-Woo Chang, Yeon-Jung Kim, and Ki-Sung Jin, Chonbuk National University, South Korea
MP5.06 Perceptual Color on Internet Browser 319
Seiichiro Hangai, Takayuki Hamamoto, and Masato Kawamoto, Science University of
Tokyo, Japan
MP5.07 Augmented Album: Situation-Dependent System for a Personal Digital Video/Image Collection 323
K. Priyantha Hewagamage and Masahito Hirakawa, Hiroshima University, Japan
MP5.08 On Design of Adaptive Internet Streaming Applications: An Architectural Perspective 327
Reza Rejaie, AT&T Labs - Research, USA
MP5.09 Non-linear Relevance Feedback: Improving the Performance of
Content-Based Retrieval Systems 331
Nikolaos D. Doulamis, Anastasios D. Doulamis, and Stefanos D. Kollias, National Technical University of Athens, Greece
MP5.10 Using Multiple Examples for Content-Based Image Retrieval 335
J. Assfalg, A. Del Bimbo, and P. Pala, University of Florence, Italy
MP5.11 Adaptive Synthesis in Progressive Retrieval of Audio-Visual Data 339
John R. Smith and Chung-Sheng Li, IBM T.J. Watson Research Center, USA
MP5.12 A Novel Shape Matching Method using Biological-Sequence Dynamic Alignment 343
Shaojie Zhang and Kai-Kuang Ma, Nanyang Technological University, Singapore
MP5.13 A General Inference Network Based Architecture for Multimedia Information Retrieval 347
Campbell Wilson, Bala Srinivasan, and Maria Indrawan, Monash University, Australia
MP5.14 Using Category-Based Collaborative Filtering in the Active WebMuseum 351
Arnd Kohrs and Bernard Merialdo, Institute EURECOM, France
MP5.15 A Composite Histogram for Image Retrieval 355
Dong Kwon Park, Yoon Seok Jeon, and Chee Sun Won, Dongguk University, Korea; Soo-Jun Park, ETRI, Korea; Seong-Joon Yoo, SearchCast, Korea
MP5.16 Indexing for Linear Model-Based Information Retrieval 359
Yuan-Chi Chang and Chung-Sheng Li, IBM T. J. Watson Research Center, USA
MP5.17 Multi-clip Query Optimization in Video Databases 363
Ahmed Mostefaoui and Lionel Brunie, Insa de Lyon, France; Harald Kosch, and László Böszörményi, University Klagenfurt, Austria
MP5.18 Partial Image Matching by Measures from Connected Color Regions 367
TaeYong Kim, Chung-Ang University, Korea; Joon H. Han, Pohang University of Science and Technology, Korea
MP6 Data Hiding I (R)
Time & Place: 3:50pm - 5:30pm, Beekman Parlor
MP6.01 An Evaluation Method for Watermarking Techniques 373
Wei Zhihui and Xiao Liang, Nanjing University of Science and Technology, P.R. China
MP6.02 Perceptually Transparent Attachment of Content-Based Data to Audio-Visual Documents 377
Frank Kurth, University of Bonn, Germany
MP6.03 Video Access Control Via Multi-level Data Hiding 381
Min Wu, Princeton University, USA; Hong Heather Yu, Panasonic Information & Networking Technological Laboratories, USA
MP6.04 Event-Coupled Hidden Markov Models 385
T. T. Kristjansson and B. J. Frey, University of Waterloo, Canada; Thomas S. Huang, University of Illinois at Urbana-Champaign, USA
MP6.05 Blind Digital Watermarking for Images and Videos and Performance Analysis 389
Qiang Cheng and Thomas S. Huang, University of Illinois at Urbana-Champaign, USA
MP6.06 Data Hiding in Digital Binary Image 393
Min Wu, Princeton Univ., USA; Edward Tang, Johns Hopkins Univ., USA; Bede Liu, Princeton Univ., USA
MP6.07 Transparent and Robust Audio Data Hiding in Cepstrum Domain 397
Xin Li, Princeton University, USA; Hong Heather Yu, Panasonic Information and Networking Technologies Lab, USA
MP7 Face Video Analysis & Synthesis I (R)
Time & Place: 3:50pm - 5:30pm, Sutton North
MP7.01 Human Facial Expression Recognition based on Learning Subspace Method 403
Xilen Chen, Harbin Institute of Technology, China; Sam Kwong, City University of Hong Kong, Hong Kong; Yan Lu, Harbin Institute of Technology, China
MP7.02 Facial Model Generation Through Mono Image Sequence 407
Chung J. Kuo, National Chung Cheng University, Taiwan; Tsang G. Lin, Industrial Technology Research Institute, Taiwan
MP7.03 Accurate Extraction of Human Face Area using Subspace Method and Genetic Algorithm 411
Makoto Murakami, Masahide Yoneyama, and Katsuhiko Shirai, Waseda University, Japan
MP7.04 Virtual Talk: A Model-Based Virtual Phone using a Layered Audio-Visual Integration 415
Yao-Jen Chang, Chih-Chung Chen, Jen-Chung Chou, and Yung-Chang Chen, National Tsing Hua University, Taiwan
MP7.05 A Performance Based Parametric Model for Facial Animation 419
Anna Wojdel and Leon J. M. Rothkrantz, Delft University of Technology, The Netherlands
MP7.06 Emotional Expressions in Audiovisual Human Computer Interaction 423
Lawrence S. Chen and Thomas S. Huang, Univ. of Illinois at Urbana-Champaign, USA
MP7.07 Speech-to-Face Movement Synthesis based on HMMS 427
Kiyotsugu Kakihara, Satoshi Nakamura, and Kiyohiro Shikano, Nara Institute of Science & Technology, Japan
MP8 Speech /Audio over Network (R)
Time & Place: 3:50pm - 5:30pm, Sutton South
MP8.01 Improving the Performance of ITU-T G.729A for VoIP 433
C. Montminy and T. Aboulnasr, University of Ottawa, Canada
MP8.02 Effects of Vocoder Distortion on Network Echo Cancellation 437
Y. Huang and R. A. Goubran, Carleton University, Canada
MP8.03 Toward Naturalness in Narrow-Band Speech Compression 440
S. Ghaemmaghami, Sharif University of Technology, Iran
MP8.04 Multiple Description Speech Coding for Robust Communication over
Lossy Packet Networks 444
Wenqing Jiang and Antonio Ortega, University of Southern California, USA
MP8.05 Prosody Model in a Mandarin Text-to-Speech System based on a Hierarchical Approach 448
Neng-Huang Pan, Wen-Tsai Jen, Shyr-Shen Yu, Ming-Shing Yu, Shyh-Yang Hwang, and Ming-Jer Wu, National Chung-Hsing University, Taiwan
MP8.06 Automatic Audio Segmentation using a Measure of Audio Novelty 452
Jonathan Foote, FX Palo Alto Laboratory, Inc., USA
MP9 Video Retrieval (R)
Time & Place: 3:50pm - 5:30pm, Regent Parlor
MP9.01 Application-Specific File Prefetching for Multimedia Programs 459
Tulika Mitra, Chuan-Kai Yang, and Tzi-cker Chiueh, State University of New York at
Stony Brook, USA
MP9.02 Modeling and Querying Videos by Content Trajectories 463
Zaher Aghbari, Kunihiko Kaneko, and Akifumi Makinouchi, Kyushu University, Japan
MP9.03 A Probabilistic-Based Mechanism for Video Database Management Systems 467
Mei-Ling Shyu, University of Miami, USA; Shu-Ching Chen, Florida International University, USA; R. L. Kashyap, Purdue University, USA
MP9.04 NMPTE: Network Multimedia Programming Training Environment 471
Yuh-Huei Shyu and Peng-Wen Chen, Tamkang University, Taiwan
MP9.05 A Probabilistic Framework for Semantic Indexing and Retrieval in Video 475
Milind R. Naphade and Thomas S. Huang, University of Illinois at Urbana-Champaign, USA
MP9.06 Retrieval by Content of Commercials based on Dynamics of Color Flows 479
A. Del Bimbo, P. Pala, and L. Tanganelli, University of Florence, Italy
MP9.07 Content Based Annotation and Retrieval of News Videos 483
M. Bertini, A. Del Bimbo, and P. Pala, University of Florence, Italy
MP10 Authentication / Security / Traffic / Error Control (P)
Time & Place: 3:50pm - 5:30pm, Sutton Corridor Poster
MP10.01 On-Line Signature Verification based on Altitude and Direction of Pen Movement 489
Seiichiro Hangai, Shinji Yamanaka, and Takayuki Hamamoto, Science University of Tokyo, Japan
MP10.02 Information Access using Speech, Speaker and Face Recognition 493
M. Viswanathan, H. S. M. Beigi, and A. Tritschler, IBM T.J. Watson Research Center, USA; F. Maali, Signal Recognition Corporation, USA
MP10.03 Image Enhancement Towards Soft Image Authentication 497
Liehua Xie and Gonzalo R. Arce, University of Delaware, USA; Arianne Lewis and E. Bert Basch, GTE Laboratories Incorporated, USA
MP10.04 Multi-level Reliability in Multimedia Collaboration over Heterogeneous Networks 501
Liang Cheng and Ivan Marsic, Rutgers University, USA
MP10.05 Multirate Trellis Coded Modulation in Multimedia Communications 505
Pingyi Fan, Tsinghua University, China; Xiang-Gen Xia, University of Delaware, USA
MP10.06 A Compression-Efficient Forward Error Control Mechanism for Image Transmission over
ATM Networks 509
Rogelio Hasimoto-Beltran, University of Delaware, USA; Sohail A. Sheikh, Widener University, USA; Ashfaq A. Khokhar, University of Delaware, USA
MP10.07 Error Concealment for Image Transmission by Multiscale Markov
Random Field Modeling 513
Yong Zhang and Kai-Kuang Ma, Nanyang Technological University, Singapore
MP10.08 Adaptive QoS for Mobile Multimedia Services over Wireless Networks 517
Alejandra Mercado and K. J. Ray Liu, University of Maryland, USA
MP10.09 Design Space Exploration for Orividing QoS Within the Harmony Framework 521
A. M. Lele and S. K. Nandy, Indian Institute of Science, India; D. H. J. Epema, Delft University of Technology, The Netherlands
MP10.10 Descriptive-Procedural Configuration of Communication Bindings 525
Thorsten Kramp and Rainer Koster, University of Kaiserslautern, Germany
MP10.11 Multimedia Security Gateway Protocol to Achieve Anonymity in Delivering Multimedia
Data using Watermarking 529
Sangeeta Narang, Indraprastha University, India; P. S. Grover, Delhi University, India; Saroj Kaushik, IIT, India
MP10.12 A Petri-Net Based Multilevel Security Specification Model for Multimedia Documents 533
J. Joshi and A. Ghafoor, Purdue University, USA
MP10.13 Remotely Keyed Encryption with Java Cards: A Secure and Efficient Method to Encrypt Multimedia Streams 537
Rüdiger Weis, cryptolabs Amsterdam, The Netherlands; Wolfgang Effelsberg and Stefan Lucks, Universität Mannheim, Germany
MP10.14 Individual Single Source Authentication on the MBone 541
F. Bergadano and D. Cavagnino, University of Turin, Italy; B. Crispo, SRI International, United Kingdom
MP10.15 Scheduling and Routing of Real-Time Multimedia Traffic in Packet-Switched Networks 545
Sudhir M. Rao and Albert M. K. Cheng, University of Houston, USA
MP10.16 Experimental Evaluation of Design Tradeoff in Specialized Virtual Machine for Multimedia Traffic in Active Networks 549
Sheng-Yih Wang and Bharat Bhargava, Purdue University, USA
MP10.17 A Simple Model for MPEG Video Traffic 553
Hai Liu, Nirwan Ansari, and Yun Q. Shi, New Jersey Institute of Technology, USA
MP10.18 TBLB Algorithm for Servicing Real-Time Multimedia Traffic Streams 557
William K. Wong and Victor C. M. Leung, The University of British Columbia, Canada
MPS11 Special Session: Perceptual Interface (R)
(Dominic Massaro)
Time & Place: 3:50pm - 5:30pm, Sutton Center
MPS11.01 Perceptual Interfaces in Human Computer Interaction 563
Dominic W. Massaro, University of California, USA
MPS11.02 Neural Mechanisms for Integrating Information from Multiple Senses 567
Barry E. Stein, Paul J. Laurienti, Terrence R. Stanford, and Mark T. Wallace, Wake Forest University School of Medicine, USA
MPS11.03 Multimodal Speech Synthesis 571
J. Schroeter, J. Ostermann, H. P. Graf, M. Beutnagel, E. Cosatto, A. Syrdal, A. Conkie, and
Y. Stylianou, AT&T Labs Research, USA
MPS11.04 Approaches to Visual Speech Processing based on the MPEG-4 Face Animation Standard 575
Eric Petajan, face2face animation, inc., USA
MPS11.05 Learning from Multimodal Observations 579
Deb Roy, MIT Media Laboratory, USA
Author Index follows page 582
Volume II Tuesday
TA0 Multimedia Education II (R)
Time & Place: 9:45am - 12:05pm, Beekman Parlor
TA0.01 Development of CAI System Employing Synthesized Speech and CG Animated Agent 585
Tsubasa Shinozaki and Masanobu Abe, NTT Cyber Space Labs, Japan
TA0.02 Cyber Atelier: A Creative Learning Environment Assisting Non-professional
Multimedia Productions 589
Shoji Tanaka, Jun Kurumizawa, Keiko Nakao, and Yuichi Iwadate, ATR Media Integration & Communications Research Labs, Japan
TA0.03 The Multimedia Online Collaboration Architecture: Tools to Enable Distance Learning 593
J. Peden, W. Burleson, C. Leonardo, University of Massachusetts at Amherst, USA
TA0.04 Beehive - An Internet Integrated Collaborative Learning Environment 597
C. Zeve, J. V. de Lima, E. Polonia, H. Sloczinski, and J. Nitzke, Federal University of Rio Grande do Sul, Brazil
TA0.05 A Live Intranet Distance Learning System using MPEG-4 over RTP/RTSP 601
Peter Westerink, Lisa Amini, Sundar Veliah and Will Belknap, IBM T.J. Watson Research Center, USA
TA0.06 Prior Knowledge and Redundant Multimedia 605
Frank Vetere and Steve Howard, University of Melbourne, Australia
TA1 Face Video Analysis & Synthesis II (R)
Time & Place: 9:45am - 12:05pm, Sutton North
TA1.01 A Study on Subjective Evaluation of Facial Polygon Model 611
Daisuke Kase, Takayuki Hamamoto, and Seiichiro Hangai, Science University of Tokyo, Japan
TA1.02 Exploring the Time Course of Facial Expressions with a Fuzzy System 615
F. Piat and N. Tsapatsoulis, National Technical University of Athens, Greece
TA1.03 Audio-Visual Unit Selection for the Synthesis of Photo-Realistic Talking-Heads 619
Eric Cosatto, AT&T Labs - Research, USA; Gerasimos Potamianos, IBM T.J. Watson Research Center, USA; Hans Peter Graf, AT&T Labs - Research, USA
TA1.04 Automated Lip Synchronized Speech Driven Facial Animation 623
Zeki Melek and Lale Akarun, Bogaziçi University, Turkey
TA1.05 Talking Faces 627
Jun-yong Noh and Ulich Neumann, University of Southern California, USA
TA1.06 Realistic Video Avatar 631
Wing Ho Leung, Carnegie Mellon University, USA; Belle L. Tseng, Zon-Yin Shae, and Ferdinand Hendriks, IBM T.J. Watson Research Center, USA; Tsuhan Chen, Carnegie Mellon University, USA
TA1.07 Robustness Against Instability of Sensory Judgment in a Human Interface to Draw a Facial Image using a Psychometrical Space Model 635
F. Sugimoto and M. Yoneyama, Toyo University, Japan
TA2 Feature Extraction for Image and Video (R)
Time & Place: 9:45am - 12:05pm, Sutton Center
TA2.01 Towards Automatic Extraction of Expressive Elements from Motion Pictures: Tempo 641
Brett Adams, Curtin University of Technology, Australia; Chitra Dorai, IBM T. J. Watson Research Center, USA; Svetha Venkatesh, Curtin University of Technology, Australia
TA2.02 Semi-automatic Semantic Video Object Extraction by Active Contour Model 645
Zhitao Lu and W. A. Pearlman, Rensselaer Polytechnic Institute, USA
TA2.03 A Method for Color Content Matching of Images 649
Aleksandra Mojsilovic, IBM TJ Watson Research Center, USA; Jianying Hu, Lucent Technologies Bell Labs, USA
TA2.04 Unifying Conversational Multimedia Interfaces for Accessing Network Services Across Communication Devices 653
G. Di Fabbrizio, S. Narayanan, P. Ruscitti, C. Kamm, B. Buntschuh, J. Hubbell, J. Wright, and J. Hamaker, AT&T Labs - Research, USA
TA2.05 A Panorama-Based Technique for Annotation Overlay and Its Real-Time Implementation 657
Masakatsu Kourogi, Takeshi Kurata, Katsuhiko Sakaue, and Yoichi Muraoka, Waseda University, Japan
TA2.06 Locale-Based Object Search under Illumination Change using Chromaticity Voting and
Elastic Correlation 661
Ze-Nian Li, Zinovi Tauber and Mark S. Drew, Simon Fraser University, Canada
TA2.07 Content-Based Image Retrieval for Not-Well-Framed Images using
Mutiresolutional Eigen-Features 665
Young Bok Joo and Jesse Jin, The University of New South Wales, Australia
TAS3 Special Session: Content-Based Retrieval (R)
(Alberto Del Bimbo)
Time & Place: 9:45am - 12:05pm, Sutton South
TAS3.01 Expressive Semantics for Automatic Annotation and Retrieval of Video Streams 671
A. Del Bimbo, University of Florence, Italy
TAS3.02 Invariance in Content-Based Image Retrieval 675
Arnold Smeulders, Theo Gevers, Jan-Mark Geusebroek, and Marcel Worring, University of Amsterdam, The Netherlands
TAS3.03 From Features to Semantics: Some Preliminary Results 679
Rong Zhao and W. I. Grosky Wayne State University, USA
TAS3.04 Semantic Modalities in Content-Based Retrieval 683
Simone Santini, University of California, San Diego, USA
TAS3.05 Structural and Semantic Analysis of Video 687
Shi-Fu Chang and Hari Sundaram, Columbia University, USA
TA4 Image Retrieval II (R)
Time & Place: 9:45am - 12:05pm, Regent Parlor
TA4.01 iPURE: Perceptual and User-Friendly Retrieval of Images 693
Gaurav Aggarwal, Pradeep Dubey, Sugata Ghosal, Ashutosh Kulshreshtha, and Abhinanda Sarkar, Indian Institute of Technology, USA
TA4.02 Supporting Multi-Example Image Queries in Image Databases 697
Lei Zhu and Aidong Zhang, State University of New York at Buffalo
TA4.03 A Model for Multimodal Information Retrieval 701
Rohini K. Srihari, Aibing Rao, Benjamin Han, Srikanth Munirathnam, and Xiaoyun Wu, State University of New York at Buffalo, USA
TA4.04 Automatic Query Generation for Content-Based Image Retrieval 705
Christian Breiteneder, University of Vienna, Austria; Horst Eidenberger, Austrian Libraries Network, Austria
TA4.05 Similar Shape Retrieval in MARS 709
Kaushik Chakrabarti, Michael Ortega-Binderberger, Kriengkrai Porkaew, and Sharad Mehrotra, University of California, USA
TA4.06 Relevance Graph-Based Image Retrieval 713
Sanghoon Sull, Jeongtaek Oh, Sangwook Oh, and S. Moon-Ho Song, Korea University, Korea; Sang W. Lee, University of Michigan, USA
TA4.07 Image Retrieval with Sketches and Compositions 717
Raj Kumar Rajendran and Shih-Fu Chang, Columbia University, USA
TA5 Web Search / Retrieval & Applications (P)
Time & Place: 9:45am - 12:05pm, Sutton Corridor Poster
TA5.01 GEMS: A New Model of Parallelism in PC-Workstations for Multimedia Applications 723
Eric Debes and Fulvio Moschetti, Swiss Federal Institute of Technology, Switzerland
TA5.02 Effective Caching of Web Objects using Zipf's Law 727
D. N. Serpanos, University of Crete, Greece; G. Karakostas and W. H. Wolf,
Princeton University, USA
TA5.03 2-D Interleaving for Enhancing the Robustness of Watermark Signals Embedded in
Still Images 731
George F. Elmasry and Yun Q. Shi, New Jersey Institute of Technology, USA
TA5.04 A System of Integrating Videos and Maps for the Identification of Building Object 735
Haomin Jin, Xu Xu, Yaginuma Yoshitomo, and Masao Sakauchi, The University of Tokyo, Japan
TA5.05 Active Router Approach for Selective Packet Discard of Streamed MPEG Video under
Low Bandwidth Conditions 739
G. Ravindra, N. Balakrishnan, and K. R. Ramakrishnan, Indian Institute of Science, India
TA5.06 Cooperating Intelligent Mobile Agents Mechanism for Distributed
Multimedia Synchronization 743
Ying-Hong Wang and Hung-Zu Lin, TamKang University, Taiwan
TA5.07 WWW and Telecommunication Collaboration Service for Mandarin Automatic Personal Phonebook Inquire Dialogue System 747
Min-Jen Tsai, National Chiao Tung University, Taiwan; Tien-Hwa Ho,
Chinese Navy Academy, Taiwan
TA5.08 On Voice Quality of IP Voice over GPRS 751
A. Lakaniemi, J. Parantainen, Nokia Research Center, Finland
TA5.09 Framework Design and Implementation of Web-Based Tutorials in Spoken Language Engineering 755
Rüdiger Hoffmann and Matthias Wolff, Dresden University of Technology, Germany
TA5.10 Telop-on-Demand: Video Structuring and Retrieval based on Text Recognition 759
H. Kuwano, Y. Taniguchi, H. Arai, M. Mori, S. Kurakake, and H. Kojima, NTT Cyber Solutions Laboratories, Japan
TA5.11 A Robust Video Watermarking Method 763
Dug-Ryung Kim, Ansan College, Korea; Sung-Han Park, Hanyang University, Korea
TA6 Multimedia Feature Representations (P)
Time & Place: 9:45am - 12:05pm, Sutton Corridor Poster
TA6.01 Graphical Representation for Generating Musical Sequences 769
Atsushi Hiroike, Hitachi Ltd., Japan
TA6.02 A New Approach for High Level Video Structuring 773
Yong-Moo Kwon, Chang-Jun Song, and Ig-Jae Kim, Korea Institute of
Science & Technology, Korea
TA6.03 Solo: An MPEG-7 Optimum Search Tool 777
Jose A. Lay and Ling Guan, University of Sydney, Australia
TA6.04 Multimedia Documents Description by Ordered Hierarchies: The ToCAI Description Scheme 781
N. Adami, A. Bugatti, R. Leonardi, P. Migliorati and Lorenzo A. Rossi,
University of Brescia, Italy
TA6.05 Using Shape Feature Matching to Track Moving Objects in Image Sequences 785
D. Hsu, J. Leu, S. Chen, W. Chang, and W. Fang, University of Southern California, USA
TA6.06 Interactive Gesture Interface for Intelligent Wheelchairs 789
Yoshinori Kuno, Teruhisa Murashima, Nobutaka Shimada, and Yoshiaki Shirai, Osaka University, Japan
TA6.07 Description Schemes for Retrieval Applications Targeted to the Audiovisual Market 793
José M. Martínez, Julián Cabrera, Jesús Bescós, José M. Menéndez, and Guillermo Cisneros, Universidad Politécnica de Madrid, Spain
TA6.08 A Robust Algorithm for Text Extraction in Color Video 797
Edward K. Wong and Minya Chen, Polytechnic University, USA
TA6.09 Low-Level Motion Activity Features for Semantic Characterization of Video 801
Kadir A. Peker, A. Aydin Alatan, Ali N. Akansu, New Jersey Institute of Technology, USA
TP0 Audio Processing in Multimedia II (R)
Time & Place: 1:15pm - 3:35pm, Beekman Parlor
TP0.01 A 3D Sound using the Adaptive Head Model and Measured Pinna Data 807
Cheng-Ta Chang and Oscal T. -C. Chen, National Chung Cheng University, Taiwan
TP0.02 Source Segmentation for Structured Audio 811
Kathy Melih and Ruben Gonzalez, Griffith University, Gold Coast, Australia
TP0.03 A Progressive Approach for Perceptual Audio Coding 815
Ye Shen, Hongmei Ai, and C. -C. Jay Kuo, University of Southern California, USA
TP0.04 Evaluation of a Melody Transcription System 819
Rodger J. McNab, Digilib Systems, Ltd., New Zealand; Lloyd A. Smith, New Mexico Highlands University, USA
TP0.05 First Measurements of a Large-Aperture Microphone Array System for
Remote Audio Acquisition 823
Harvey F. Silverman, William R. Patterson III, and Joshua M. Sachar, Brown University, USA
TP0.06 Subband Audio Coding using a Perceptually Hybrid Vector-Scalar Quantization 827
Yu Rongshan, Nanyang Technological University, Singapore
TP0.07 Short-Time Kurtosis of Speech Signals with Application to Co-channel Speech Separation 831
Phillip L. De Leon, New Mexico State University, USA
TP1 QoS (Traffic Management / Protocol) I (R)
Time & Place: 1:15pm - 3:35pm, Sutton North
TP1.01 Trader's Quality of Service Specifications and Effects on System Performance for
Video-on-Demand 837
Edward Babulak, University of Ottawa, Canada
TP1.02 On TCP-Friendly Video Transfer with Consideration on Application-Level QoS 843
Naoki Wakamiya, Masayuki Murata and Hideo Miyahara, Osaka University, Japan
TP1.03 Impact of Protocol Stacks on Quality of Perception 847
G. Ghinea, Brunel University, United Kingdom; J. P. Thomas, Pace University, USA
TP1.04 Resource-Aware Configuration of Ubiquitous Multimedia Services 851
Dongyan Xu, Duangdao Wichadakul, and Klara Nahrstedt, University of Illinois at
Urbana-Champaign, USA
TP1.05 A Gateway-Assisted Approach Toward QoS Adaptations 855
William Kalter, Baochun Li, Won Jeon, Klara Nahrstedt, and Jun-Hyuk Seo, University of Illinois at Urbana-Champaign, USA
TP1.06 A Resource Broker Model with Integrated Reservation Scheme 859
Kihun Kim and Klara Nahrstedt, University of Illinois at Urbana-Champaign, USA
TP1.07 Mining User Behavior for Resource Prediction in Interactive Electronic Malls 863
Silvia Hollfelder, GMD - IPSI, Germany; Vincent Oria and M. Tamer Özsu, University of Alberta, Canada
TP2 Audio Retrieval (R)
Time & Place: 1:15pm - 3:35pm, Sutton Center
TP2.01 A Study on N-Gram Indexing of Musical Features 869
Chi Lap Yip and Ben Kao, The University of Hong Kong, Hong Kong
TP2.02 Query by Music Segments: An Efficient Approach for Song Retrieval 873
Arbee L. P. Chen, Maggie Chang, Jesse Chen, Jia-Lien, Hsu, Chih-How Hsu, and Spot Y. S. Hua, National Tsing Hua University, Taiwan
TP2.03 Content-Based Indexing and Retrieval-by-Example in Audio 877
Zhu Liu, Polytechnic University, USA; Qian Huang, AT&T Labs - Research, USA
TP2.04 Indexing Telephone Conversations by Speakers 881
Ivan Magrin-Chagnolleau, IRISA/INRIA Rennes, France
TP2.05 Content-Based Indexing and Retrieval of Audio Data using Wavelets 885
Guohui Li and Ashfaq A. Khokhar, University of Delaware, USA
TP2.06 Distance Metrics and Indexing Strategies for a Digital Library of Popular Music 889
C. Francu and C. G. Nevill-Manning, Rutgers University, USA
TP3 Multimedia Representations (R)
Time & Place: 1:15pm - 3:35pm, Sutton South
TP3.01 Efficient Representation and Comparison of Multimedia Content using DAG-Composition 895
I-Jong Lin, Princeton University, USA; Ajay Divakaran and Anthony Vetro, Mitsubishi Electric, USA; Sun-Yuan Kung, Princeton University, USA
TP3.02 Feature Representations for Image Retrieval: Beyond the Color Histogram 899
Nuno Vasconcelos and Andrew Lippman, MIT Media, USA
TP3.03 Discourse Structure Analysis for News Video 903
Yasuhiko Watanabe and Yoshihiro Okada, Ryukoku University, Japan; Sadao Kurohashi, Kyoto University, Japan; Eiichi Iwanari, Ryukoku University, Japan
TP3.04 Visual Segment Tree Creation for MPEG-7 Description Schemes 907
Philippe Salembier, Joan Llach, and Luis Garrido, Universitat Politècnica de Catalunya, Spain
TP3.05 Multi Layer Video Object Database based on Interactive Annotation and its Application 911
Tomoyuki Yatabe, Hiroshi Kawasaki, Hiroshi Mo, and Masao Sakauchi, University of
Tokyo, Japan
TP3.06 Conceptual Modeling of Audio-Visual Content 915
John R. Smith and Ana B. Benitez, IBM T. J. Watson Research Center, USA
TP4 User Interface II (R)
Time & Place: 1:15pm - 3:35pm, Regent Parlor
TP4.01 Different Modalities in Assembly Support System User Interface 921
Lauri Repokari, Marko Nieminen, Milla Hailikari, Jyrki Kasvi, Matti Vartiainen, Anneli Pulkkis and Ilpo Kari, Helsinki University of Technology, Finland
TP4.02 EMG-Based Human-Machine Interface System 925
Osamah A. Alsayegh, Kuwait Institute for Scientific Research, Kuwait
TP4.03 A Test Bed for Prototyping Human/Computer Interfaces used in
Mission Critical Environments 929
Michael D. Orosz and Walter J. Karplus, University of California at Los Angeles, USA; J. D. Balakrishnam, Purdue University, USA
TP4.04 A Framework for Creating Customized Multi-modal Interfaces for XML Documents 933
Sami Rollins, University of California at Santa Barbara, USA; Neel Sundaresan, IBM Almaden Research Center, USA
TP4.05 Automated Language Acquisition in Multimodal Environment 937
Daniel Nagy, Attila Medl, and James L. Flanagan, Rutgers University, USA
TP4.06 Temperament-Based Information Filtering: A Human Factors Approach to
Information Recommendation 941
Cha-Hwa Lin and Dennis McLeod, University of Southern California, USA
TP4.07 A Method to Make a Three-Dimensional Model of an Individual Face from a Front
View of a Facial Image Only 945
Y. Nakamura, Tokyo Denki University, Japan; F. Sugimoto and M. Yoneyama, Toyo University, Japan; S. Nakamura, Tokyo Denki University, Japan
TP5 Wireless Video / Communication (P)
Time & Place: 1:15pm - 3:35pm, Sutton Corridor Poster
TP5.01 High Quality Wideband Audio over DECT 951
Soledad Torres-Guijarro, Universidad Europea de Madrid, Spain; F. Javier Casajús-Quirós, Lino García-Morales and Ramón Sanchez-Perez, Universidad Politecnica de Madrid, Spain
TP5.02 A Novel Video Communication System Utilizing Adaptive and Integrated System Design for Mobile Wireless ATM 955
Jozsef Vass, Eyeball.com Network, Inc., Canada; Xinhua Zhuang, University of Missouri-Columbia, USA
TP5.03 A Demonstrator for Real-Time Multimedia Sessions over Third Generation
Wireless Networks 959
S. Gruhl, Bell Laboratories, Lucent Technologies, Germany; A. Echihabi and T. Rachidi, Alakhawayn University, Morocco; M. Link and M. Söllner, Bell Laboratories,
Lucent Technologies, Germany
TP5.04 Bandwidth Management Providing Guaranteed Call Dropping Rates for
Multimedia Mobile Networks 963
Jianping Jiang and Ten-Hwang Lai, The Ohio State University, USA
TP5.05 Pocket Pavilion: A Synchronous Collaborative Browsing Application for
Wireless Handheld Computers 967
Philip K. McKinley and Ji Li, Michigan State University, USA
TP5.06 Low Bit-Rate Compression based on LAR Method for Videoconference via Internet 971
O. Déforges, L. Bédat, and J. Ronsin, INSA, France
TP5.07 Post-processing for Real-Time Quality Enhancement of MPEG-Coded Video Sequences 975
L. Atzori, University of Cagliari, Italy; F. G. B. De Natale, University of Trento, Italy; Fabrizio Granelli, University of Genoa, Italy
TP5.08 A New Search Algorithm for Block Motion Estimation 979
Jau-Ling Chen and Pei-Yin Chen, Southern Taiwan University of Technology, Taiwan
TP5.09 A Very Low Bit-Rate Video Coding Algorithm by Focusing on Moving Regions 983
Kwok-Wai Wong, Kin-Man Lam, and Wan-Chi Siu, The Hong Kong Polytechnic University, Hong Kong
TP5.10 Sending an Image to a Large Number of Nodes in Short Time using TCP 987
Takayuki Hirahara, Takashi Yamanoue, Hiroyuki Anzai, and Itsujirou Arita, Kyushu Kyouritsu University, Japan
TP5.11 Reducing Bandwidth Requirement for Delivering Video over Wide Area
Networks with Proxy Server 991
Wei-hsiu Ma and David H. C. Du, University of Minnesota, USA
TP5.12 A Comprehensive Analysis of Energy Savings in Dynamic Supply Voltage Scaling
Systems using Data Dependent Voltage Level Selection 995
Lama H. Chandrasena and Michael J. Liebelt, The University of Adelaide, Australia
TP5.13 Adapting Network Video to Multi-time Scale Bandwidth Fluctuations 999
Yuan-Chi Chang and Chung-Sheng Li, IBM T.J. Watson Research Center, USA; David G. Messerschmitt, University of California at Berkeley, USA
TP5.14 Progressive Image Transmission over OFDM Systems using Multiple Antennas 1003
Jie Song and K. J. Ray Liu, University of Maryland at College Park, USA
TP5.15 Performance Evaluation in Multimedia CDMA Wireless Transmission 1007
M. R. Hueda, C. E. Rodríguez and C. A. Marqués, Universidad Nacional de Córdoba, Argentina
TP6 Data Hiding II (R)
Time & Place: 3:50pm - 5:30pm, Beekman Parlor
TP6.01 Audio Watermarking: Features, Applications, and Algorithms 1013
Michael Arnold, Fraunhofer-Institute for Computer Graphics, Germany
TP6.02 An Improved All-Pass Watermarking for Speech and Audio 1017
Tolga Çiloglu and S. Utku Karaaslan, Middle East Tech. University, Turkey
TP6.03 The Design and Implementation of a Streaming Application for MPEG Videos 1021
Aylin Kantarci and Turhan Tunali, Ege University, Turkey
TP6.04 Progressive Image Watermarking 1025
Trista Pei-chun Chen and Tsuhan Chen, Carnegie Mellon University, USA
TP6.05 A DCT Domain Visible Watermarking Technique for Images 1029
Saraju P. Mohanty, University of South Florida, USA; K. R. Ramakrishnan, Indian Institute of Science, India; Mohan S. Kankanhalli, National University of Singapore, Singapore
TP6.06 Enhancing Robustness of Digital Watermarking Against Geometric Attack based on
Fractal Transform 1033
Zhicheng Ni and Eric Sung, Nanyang Technological University, USA; Yun Q. Shi,
New Jersey Institute of Technology, USA
TP6.07 3-D Interleaving for Enhancing the Robustness of Watermark Signals Embedded in
Video Sequences 1037
George F. Elmasry and Yun Q. Shi, New Jersey Institute of Technology, USA
TP7 QoS (Traffic Management / Protocol) II (R)
Time & Place: 3:50pm - 5:30pm, Sutton North
TP7.01 MIDI Encoding Method based on Variable Frame-Length Analysis and Its
Evaluation of Coding Precision 1043
Toshio Modegi, Dai Nippon Printing Co., Ltd. Japan
TP7.02 A Novel ATM Traffic Scheduler for Real-Time Multimedia Data Transport with Improved Packet Level QoS 1047
Fu-Ming Tsou, National Taiwan University, Taiwan; Hong-Bin Chiou, Chunghwa Telecommunications Co., Ltd., Taiwan; Zsehong Tsai, National Taiwan University, Taiwan
TP7.03 Adaptive Reservation: A New Framework for Multimedia Adaptation 1051
Xin Wang and Henning Schulzrinne, Columbia University, USA
TP7.04 Network-Adaptive Rate Control With TCP-Friendly Protocol for Multiple Video Objects 1055
Qian Zhang, Wenwu Zhu, and Ya-Qin Zhang, Microsoft Research, China
TP7.05 Dynamic QoS and Routing Support for Real-Time Multimedia Applications in the Next Generation Internet 1059
Oliver T. W. Yu, University of Illinois at Chicago, USA
TP7.06 MQ: An Integrated Mechanism for Multimedia Multicasting 1063
De-Nian Yang, Wanjiun Liao, and Yen-Ting Lin, National Taiwan University, USA
TP7.07 Transport of MPEG-4 over IP/RTP 1067
A. Basso, AT&T Labs Research, USA; S. Varakliotis, University College London,
United Kingdom
TP8 Lip Synchronization / Speechreading (R)
Time & Place: 3:50pm - 5:30pm, Sutton Center
TP8.01 Speaker Independent Audio-Visual Speech Recognition 1073
You Zhang, Stephen Levinson, and Thomas Huang, University of Illinois at Urbana-Champaign, USA
TP8.02 Lip Synchronization using Linear Predictive Analysis 1077
Sumedha Kshirsagar, Nadia Magnenat-Thalmann, University of Geneva, Switzerland
TP8.03 Automatic Selection of Visemes for Image-Based Visual Speech Synthesis 1081
Jie Yang, Jing Xiao, and Max Ritter, Carnegie Mellon University, USA
TP8.04 A Hierarchical Segmentation Algorithm for Face Analysis Application for Lipreading 1085
M. Liévin and F. Luthon, Grenoble National Polytechnic Institute, France
TP8.05 Translingual Visual Speech Synthesis 1089
Tanveer A. Faruquie and Chalapathy Neti, IBM T.J. Watson Research Center, USA; Nitendra Rajput, L. Venkata Subramaniam, Ashish Verma, IBM India Research Lab, India
TP8.06 A New Approach to Integrate Audio and Visual Features of Speech 1093
Hao Pan, Ahi-Pei Liang, and Thomas Huang, University of Illinois at Urbana-Champaign, USA
TP8.07 A Cascade Image Transform for Speaker Independent Automatic Speech Reading 1097
G. Potamianos, IBM T.J Watson Research Center, USA; A. Verma, IBM India Research Lab, India; C. Neti, G. Iyengar, and S. Basu, IBM T.J Watson Research Center, USA
TPS9 Special Session: MPEG-4 Natural Hybrid Coding (R)
(Euee Jang)
Time & Place: 3:50pm - 5:30pm, Sutton South
TPS9.01 Efficient Modeling of Virtual Humans in MPEG-4 1103
Tolga K. Capin, Swiss Federal Institute of Technology, Switzerland; Eric Petajan, face2face animation, inc., USA; Joern Ostermann, AT&T Labs Research, USA
TPS9.02 Very Low Bitrate Coding of Virtual Human Animation in MPEG-4 1107
Tolga K. Capin, Swiss Federal Institute of Technology, Switzerland; Eric Petajan, face2face animation, inc., USA; Joern Ostermann, AT&T Labs Research, USA
TPS9.03 Photo-Realistic 3D Model Coding in MPEG-4 1111
Nicolas Aspert and Touradj Ebarahimi, Swiss Federal Institute of Technology, Switzerland
TPS9.04 Animation Framework for MPEG-4 Systems 1115
Mikaël Bourges-Sévenier, iVast Inc., USA; Euee S. Jang and James D. K. Kim,
Samsung AIT, Korea
TPS9.05 3D Animation Coding: Its History and Framework 1119
Euee S. Jang, Samsung AIT, Korea
TP10 Image / Video Segmentation / Summary I (R)
Time & Place: 3:50pm - 5:30pm, Regent Parlor
TP10.01 Automatic Image Event Segmentation and Quality Screening for Albuming Applications 1125
Alexander C. Loui, Eastman Kodak Company, USA; Andreas E. Savakis, Rochester Institute of Technology, USA
TP10.02 Labeling Update of Segmented Images using Conceptual Graphs and Dempster-Shafer
Theory of Evidence 1129
Philippe Mulhem, IPAL-CNRS, KRD, Singapore; Dezhong Hong and Jian Kang Wu,
KRDL, Singapore
TP10.03 A Knowledge Engineering Approach for Image Classification based on
Probabilistic Reasoning Systems 1133
Seungyup Paek and Shih-Fu Chang, Columbia University, USA
TP10.04 Structuring Personal Experiences -- Analyzing Views from a Head-Mounted Camera 1137
Yuichi Nakamura, Jun'ya Ohde, and Yuichi Ohta, University of Tsukuba, Japan
TP10.05 Real-Time Scene Change Detection on Compressed Multimedia Bitstream based on
Statistical Sequential Analysis 1141
Dan Lelescu and Dan Schonfeld, University of Illinois at Chicago, USA
TP10.06 Video Scene Segmentation using Video and Audio Features 1145
Hari Sundaram and Shih-Fu Chang, Columbia University, USA
TP10.07 Rotation Invariant Face Detection using a Model-Based Clustering Algorithm 1149
Byeong Hwan Jeon and Sang Uk Lee, Seoul National University, Korea; Kyung Mu Lee, Hongik University, Korea
TP11 Video / Image Retrieval I (P)
Time & Place: 3:50pm - 5:30pm, Sutton Corridor Poster
TP11.01 A Content Based Internet Search Engine for Analysis and Archival of MPEG-1
Compressed Newsfeeds 1155
Odej Kao and Gerhard R. Joubert, Technical University of Clausthal, Germany
TP11.02 A Video Searching System using MSP Exchanging Data and Its Evaluation of
Matching Methods 1159
Mei Kodama, Hiroshima University, Japan; Tomoji Ikeda, SATAKE Corporation, Japan
TP11.03 Video Composition and Retrieval 1163
V. Singla, Y. C. Park, S. Panchanathan, F. Golshani, Arizona State University, USA
TP11.04 An Efficient Technique for Summarizing Videos using Visual Contents 1167
JungHwan Oh and Kien A. Hua, University of Central Florida, USA
TP11.05 Efficient Camera Motion Characterization for MPEG Video Indexing 1171
Jae-Gon Kim, Hyun Sung Chang, and Jinwoong Kim, Electronics and Telecommunications Research Institute, Korea; Hyung-Myung Kim, Advanced Institute of Science and Technology, Korea
TP11.06 Real Time Storage and Simultaneous Retrieval for Surveillance and Patrol Video 1175
Fujio Tsutsumi, Central Research Institute of Electric Power Industry, Japan
TP11.07 An Architecture of the Distributed Multimedia Information Retrieval Network with
Query Routing Systems 1179
Yukiko Kawasaki and Hideki Sunahara, Nara Institute of Science and Technology, Japan
TP11.08 Local Web Advertisement Through Dynamic Active Proxy 1183
Jing Deng and Chi-Hung Chi, National University of Singapore, Singapore
TP11.09 Improving Visual Recognition using Color Normalization in Digital Video Applications 1187
Juan M. Sánchez and Xavier Binefa, Universitat Autònoma de Barcelona, Spain
TP11.10 Segmentation and Tracking of Video Objects for a Content-Based Video Indexing Context 1191
Magali Mazière and Françoise Chassaing, France Telecom CNET/DHI/HDM, France; Luis Garrido and Philippe Salembier, Universitat Politècnica de Catalunya, Spain
TP11.11 Index-Based Fast Search Algorithm of Image Database on Internet 1195
Chia H. Yeh and Chung J. Kuo, National Chung Cheng University, Taiwan
TP11.12 Update Relevant Image Weights for Content-Based Image Retrieval using
Support Vector Machines 1199
Qi Tian, Pengyu Hong, and Thomas S. Huang, University of Illinois at Urbana-Champaign, USA
TP11.13 A Content-Based Scheme for CT Lung Image Retrieval 1203
Chii Tung Liu, Pol Lin Tai, Arlene Y. -J. Chen, Chen-Hsing Peng, and Jia Shung Wang, National Tsing Hua University, Taiwan
TP11.14 Relevance Feedback for Content-Based Retrieval using the Choquet Integral 1207
YoungSik Choi and Daewon Kim, MTRL, Korea Telecom, Korea; Raghu Krishnapuram, Colorado School of Mines, USA
TP11.15 Dimension Reduction of Texture Features for Image Retrieval using Hybrid
Associative Neural Networks 1211
Jose Antonio Catalan and Jesse S. Jin, University of New South Wales, Australia
TP11.16 Benchmarking Access Structures for the Similarity Retrieval of High-Dimensional
Multimedia Data 1215
Nathan G. Colossi, State University of Campinas, Brazil; Mario A. Nascimento, University of Alberta, Canada
Author Index follows page 1218
Volume III Wednesday
WA0 Multimedia Home Appliance / Universal Access (R)
Time & Place: 9:45am - 12:05pm, Beekman Parlor
WA0.01 Clustering Source/Channel Rate Allocations for Receiver-Driven Multicast under a Limited Number of Streams 1221
Philip A. Chou, Microsoft Corporation, USA; Kannan Ramchandran,
University of California, USA
WA0.02 From Proven Office Technologies to the Intelligent Multimedia Home 1225
Christian Gran and Angela Scheller, GMD FOKUS, Germany
WA0.03 Back to the TV: Information Visualization Interfaces based on TV-Program Metaphors 1229
Katsumi Tanaka, Akiyo Nadamoto, Machiko Kusahara, Taeko Hattori, Hiroyuki Kondo, and Kazutoshi Sumiya, Kobe University, Japan
WA0.04 Issues in Data Embedding and Synchronization for Digital Television 1233
J. Brunheroto, R. Chernock, P. Dettori, X. Dong, J. Paraszczak, F. Schaffa, and D. Seidman, IBM Research, USA
WA0.05 A Live Video Imaging Method for Capturing Presentation Information in
Distance Learning 1237
Yoshinari Kameda, Kentaro Ishizuka, and Michihiko Minoh, Kyoto University, Japan
WA0.06 On the Application of Continuous Media Filters over Wireless Networks 1241
Margaritis Margaritidis and George C. Polyzos, University of California, San Diego, USA
WA0.07 Video Containers: A System for the On-Demand Storage, Delivery, and Management of Television Programs 1245
S. R. Subramanya, University of Missouri-Rolla, USA
WAS1 Special Session: Wireless Multimedia (R)
(David Goodman)
Time & Place: 9:45am - 12:05pm, Sutton North
WAS1.01 Wireless Communication of Vital Signs using the Georgia Tech Wearable Motherboard 1253
Babak Firoozbakhsh, Nikil Jayant, Sungmee Park, and Sundaresan Jayaraman, Georgia Institute of Technology, USA
WAS1.02 Filtering Wavelet based Video Streams for Wireless Inter-working 1257
A. Kassler, A. Neubeck, P. Schulthess, University of Ulm, Germany
WAS1.03 Issues of Mobile Ad-Hoc WANs 1261
Silvia Giordano, Maher Hamdi, Jean-Pierre Hubaux, Jean-Yves Le Boudec, and Ljubica Blazevic, Ecole Polytechnique Federale de Lausanne, Switzerland
WAS1.04 Admission and Flow Control for Multimedia CDMA 1265
Cristina Comaniciu, Narayan Mandayam, Rutgers University, USA; David Famolari and Prathima Agrawal, Telcordia Technologies, USA
WAS1.05 A Channel Predictor for Wireless Packet Networks 1269
Javier Gomez and Andrew T. Campbell, Columbia University, USA
WAS2 Special Session: Multimedia and Security (R)
(Jana Dittman and Martin Steinbach)
Time & Place: 9:45am - 12:05pm, Sutton Center
WAS2.01 Approaches to Multimedia and Security 1275
Klara Nahrstedt, University of Illinois at UrbanaChampaign, USA; Jana Dittman, GMD IPSI, Germany; Petra Wohlmacher, University of Klagenfurt, Australia
WAS2.02 Staganalysis of LSB Encoding in Color Images 1279
Jiri Fridrich, Rui Du, and Meng Long, SUNY Binghamton, USA
WAS2.03 Watermarking Through Color Image Bands Decorrelation 1283
A. Piva, F. Bartolini, L. Boccardi, V. Cappellini, and A. DeRosa, Università di Firenze, Italy; M. Barni, Università di Siena, Italy
WAS2.04 Water-Filling for Watermarking? 1287
Deepa Kundur, University of Toronto, Canada
WAS2.05 Geometric Distortion Correction Through Image Normalization 1291
Masoud Alghoniemy and Ahmed H. Tewfik, University of Minnesota, USA
WA3 Video over Network (R)
Time & Place: 9:45am - 12:05pm, Sutton South
WA3.01 Image Integrity and Correction using Parities of Error Control Coding 1297
Jaejin Lee and Chee Sun Won, Dongguk University, Korea
WA3.02 Joint Source/FEC Rate Selection for Optimal MPEG-2 Video Delivery 1301
Pascal Frossard, Swiss Federal Institute of Technology, Switzerland; Olivier Verscheure, IBM T.J. Watson Research Center, USA
WA3.03 Activity-Adaptive Modeling of Dynamic Multimedia Traffic 1305
Deepak Turaga and Tsuhan Chen, Carnegie Mellon University, USA
WA3.04 Modeling of the Coding Gain of Joint Coding for Multi-program Video Transmission 1309
A. Vincent, P. Corriveau, P. Blanchfield, and R. Renaud, Communications
Research Centre, Canada
WA3.05 Evaluation of Adaptive Filtering of MPEG System Streams in IP Networks 1313
Michael Hemy, Peter Steenkiste, and Thomas Gross, Carnegie Mellon University, USA
WA3.06 Non Linear Traffic Modeling of VBR MPEG-2 Video Sources 1318
Anastasios D. Doulamis, Nikolaos D. Doulamis and Stefanos D. Kollias, National Technical University of Athens, Greece
WA3.07 An Architecture based on IETF Protocols for the Transport of MPEG-4 Content
over the Internet 1322
Roberto Castagno and Serkan Kiranyaz, Nokia Mobile Phones, Finland; Florin Lohan and Irek Defee, Tampere University of Technology, Finland
WA4 Image / Video Segmentation / Summary II (R)
Time & Place: 9:45am - 12:05pm, Regent Parlor
WA4.01 A Genetic Algorithm for Video Segmentation and Summarization 1329
Patrick Chiu, Andreas Girgensohn, Wolf Polak, Eleanor Rieffel, and Lynn Wilcox, FX Palo Alto Laboratory, USA
WA4.02 Video Abstract: A Hybrid Approach to Generate Semantically Meaningful
Video Summaries 1333
Candemir Toklu and Shih-Ping Liou, Siemens Corporate Research, USA; Madirakshi Das, University of Massachusetts, USA
WA4.03 Generating Semantic Visual Templates for Video Databases 1337
William Chen and Shih-Fu Chang, Columbia University, USA
WA4.04 Design and Performance Study of Scalable Video Storage in a
Disk-Array-Based Video Server 1341
Zheng-Ru Lin and Ming-Syan Chen, National Taiwan University, Taiwan
WA4.05 TV Program Classification based on Face and Text Processing 1345
Gang Wei, Wayne State University, USA; Lalitha Agnihotri and Nevenka Dimitrova, Philips Research, USA
WA4.06 Dissolve Transition Detection using B-Splines Interpolation 1349
Jeho Nam and Ahmed H. Tewfik, University of Minnesota, USA
WA4.07 A Bayesian segmentation of Stereopairs 1353
G. A. Triantafyllidis, Aristotle University of Thessaloniki, Greece; D. Tzovaras and M. G. Strintzis, Informatics and Telematics Institute, Greece
WA5 Multimedia System / Hardware (P)
Time & Place: 9:45am - 12:05pm, Sutton Corridor Poster
WA5.01 Flexible Multimedia System for Multimedia Communication Services 1359
Koji Hashimoto and Yoshitaka Shibata, Iwate Prefectural University, Japan; Norio Shiratori, Tohoku University, Japan
WA5.02 Hardware/Software Co-design for Real-Time Physical Modeling 1363
B. Bishop, T. P. Kelliher, and M. J. Irwin, The Pennsylvania State University, USA
WA5.03 Supporting Audience and Player Interaction during Interactive Media Performances 1367
Nikitas M. Sgouros, University of Piraeus, Greece
WA5.04 Design and Implementation of a Programmable Stack Filter with FPGAs 1371
M. Hu, O. Vainio, and D. Gevorkian, Tampere University of Technology, Finland
WA5.05 A VLIW Architecture Simulator Innovative Approach for HW-SW Co-design 1375
Ivano Barbieri, Massimo Bariani, and Marco Raggio, University of Genova, Italy
WA5.06 Efficient Hardware-Software Co-design for the G.723.1 Algorithm Targeted at
VoIP Applications 1379
Shridhar Mubaraq Mishra, Infineon Technologies Asia Pacific Pte. Ltd., Singapore; Arjun Balaram, Nortel Networks, Canada
WA5.07 MMX-Like Architecture Extension to Support the Rotation Operation 1383
J. Villalba, J. Hormigo, M. A. González, and E. L. Zapata, University of Málaga, Spain
WA5.08 A Finite Field Processor Employing Dual Parallel Datapath for High-Speed/Low-Power
RS-ECC Applications 1387
Hyung-Joon Kwon, Young-Beom Jang, and Bangwon Lee, Samsung Electronics, Korea
WA5.09 A Multimedia Terminal Architecture for Dynamically Configurable Protocol Stacks 1391
Filip Vandermeulen, University of Ghent, Belgium; Frank Steegmans, Alcatel Corporate Research Center, Belgium; Brecht Vermeulen, University of Ghent, Belgium; Steven Vermeulen, Alcatel Corporate Research Center, Belgium
WA5.10 Programmable and Low Power VLSI Architecture for Full Search Motion Estimation in Multimedia Communications 1395
Luca Fanucci, National Research Council, Italy; Lorenzo Bertini and Sergio Saponara, University of Pisa, Italy
WA5.11 A VLSI Implementation Structure for Wavelet Decomposition Filter 1399
Wu Shunjun, Wang Chao, and Shang Yong, Xidian University, P.R. China
WA5.12 An Extensible Set-Top-Box Architecture for Interactive and Broadcast Services Offering Sophisticated User Guidance 1403
Frank Lonczewski and Rudolf Jaeger, BetaResearch, Germany
WA5.13 Interoperable Content Protection for Digital TV 1407
B. J. van Rijnsoever and J. P. Linnartz, Philips Research, The Netherlands
WA5.14 A Digital Television Service Architecture 1411
P. Vuorimaa, Helsinki University of Technology, Finland
WA5.15 Combined Watermarking for Image Authentication and Protection 1415
Chun-Shien Lu, Hong-Yuan Mark Liao, and Chwen-Jye Sze, Academia Sinica, Taiwan
WA5.16 FlyCam: Practical Panoramic Video and Automatic Camera Control 1419
Jonathan Foote and Don Kimber, FX Palo Alto Laboratory, Inc., USA
WP0 Multimedia System / Hardware (R)
Time & Place: 2:15pm - 3:35pm, Beekman Parlor
WP0.01 A Hardware Implementation for Approximate Text Search in Multimedia Applications 1425
H. -M. Blüthgen, P. Osterloh, H. Blume, and T. G. Noll, Aachen Institute of
Technology, Germany
WP0.02 Improved Data Layouts for Fault-Tolerant Multimedia Systems 1429
Martha L. Escobar-Molano and Lanfeng Hao, University of South Florida, USA; David A. Barrett, Asgard Systems, USA
WP0.03 LucentVision: Converting Real World Events into Multimedia Experiences 1433
Gopal Pingali, Yves Jean, Agata Opalach, and Ingrid Carlbom, Bell Laboratories, Lucent Technologies, USA
WP0.04 Image-Based Rendering via the Standard Graphics Pipeline 1437
Miles E. Hansard and Bernard F. Buxton, University College London, United Kingdom
WP1 Wireless Multimedia (R)
Time & Place: 2:15pm - 3:35pm, Sutton North
WP1.01 On the Capabilities of Error Concealment in MPEG-2 Communications over
Wireless ATM 1443
Francisco Delicado, Pedro Cuenca, and Antonio Garrido, Universidad de Castilla-La Mancha, Spain; Luis Orozco-Barbosa and Francisco Quiles, University of Ottawa, Canada
WP1.02 Source-Channel Matching Space-Time Diversity for Multimedia Communications 1447
H. Zheng, Bell-Labs, Lucent Technologies, USA; K. J. R. Liu, University of Maryland, USA
WP1.03 Foveation-Based Error Resilience for Video Transmission over Mobile Networks 1451
Sanghoon Lee, Lucent Technologies, USA; Chris Podilchuk, The University of Texas at Austin; Alan C. Bovik, Bell Labs, USA
WP1.04 Joint Downlink Beamforming, Power Control, and Data Rate Allocation for DS-CDMA
Mobile Radio with Multimedia Services 1455
Ying-Chang Liang and Francois P. S. Chin, Centre for Wireless Communications, Singapore; K. J. Ray Liu, University of Maryland, USA
WP2 Video on Demand (R)
Time & Place: 2:15pm - 3:35pm, Sutton Center
WP2.01 Transmitting Variable-Bit-Rate Videos on Clustered VoD Systems 1461
Chow-Sing Lin, Min-You Wu and Wei Shu, UCF, USA
WP2.02 Design and Implementation of VoD Server by using Clustered File System 1465
Chang-Soon Park, ETRI, Korea; Mann-Ho Lee, Chungnam National University, Korea; Young-Sung Son, ETRI, Korea; Oh-Young Kwon, Korea University of Technology and Education, Korea
WP2.03 Broadcast News Parsing using Visual Cues: A Robust Face Detection Approach 1469
Yannis Avrithis, Nicolas Tsapatsoulis and Stefanos Kollias, National Technical University of Athens, Greece
WP2.04 Fast-Forward Functions on Parallel Video Servers 1473
Zhiyong Ding and Chow-Sing Lin, University of Central Florida, USA; Min-You Wu, University of New Mexico, USA
WP3 Error Control (R)
Time & Place: 2:15pm - 3:35pm, Sutton South
WP3.01 A Study of Keyframe Reference Picture Selection Method for Error Resilient Multiple
Video Objects Distribution 1479
Hideaki Kimata, Yoshiyuki Yashima, and Naoki Kobayashi, NTT Cyber Space
Laboratories, Japan
WP3.02 DCT Coefficient-Based Error Detection Technique for Compressed Video Stream 1483
K. Bhattacharyya, H. S. Jamadagni, Indian Institute of Science, India
WP3.03 Scalable MPEG-4 Video Coding with Graceful Packet-Loss Resilience over
Bandwidth-Varying Networks 1487
M. van der Schaar and H. Radha, Philips Research USA, USA; C. Dufour, Philips Research LEP, France
WP4 Security / Authentication (R)
Time & Place: 2:15pm - 3:35pm, Regent Parlor
WP4.01 Multimedia Enhanced General-Purpose Processors 1493
Stephan Wong, Sorin Cotofana, and Stamatis Vassiliadis, Delft University of Technology,
The Netherlands
WP4.02 VeriNet Web Speaker Verification for the World Wide Web 1497
Kevin Farrell and William Mistretta, T-NETIX, Inc., USA
WP4.03 SMMM - A Secure MultiMedia Mail System 1501
Marcel Stanley A. de Moura, DI, PUC-Rio, Brazil; Guido Lemos de Souza Filho and Thaís Vasconcelos Batista, DIMAp UFRN, Brazil; Luiz Fernando G. Soares, DI, PUC-Rio, Brazil
WP5 Segmentation, Summarization & Indexing (P)
Time & Place: 2:15pm - 3:35pm, Sutton Corridor Poster
WP5.01 Video Segmentation with the Assistance of Audio Content Analysis 1507
Hao Jiang, Microsoft Research, China; Tong Lin, Peking University, China; Hong-Jiang Zhang, Microsoft Research, China
WP5.02 On the Segmentation of Text in Videos 1511
Axel Wernicke and Rainer Lienhart, Intel Corporation, USA
WP5.03 Unsupervised Color Image Segmentation for Content Based Application 1515
Chung Hui Kuo and Ahmed H. Tewfik, University of Minnesota, USA
WP5.04 Towards Abstracting Sports Video by Highlights 1519
Noboru Babaguchi, Osaka University, Japan
WP5.05 Dynamic Video Abstract Generation using an Object DBMS 1523
H. Martin and R. Lozano, Laboratoire Université Joseph Fourier, France
WP5.06 Video Segmentation using Spatial and Temporal Statistical Analysis Method 1527
Zhibin Lei, Wu Chou, Jialin Zhong, and Chin-Hui Lee, Lucent Technologies, USA
WP5.07 Spatiotemporal Segmentation of Moving Video Objects over MPEG Compressed Domain 1531
How-Lung Eng and Kai-Kuang Ma, Nanyang Technological University, Singapore
WP5.08 Automated Threshold Selection for the Detection of Dissolves in MPEG Videos 1535
G. Boccignone, M. De Santo, and G. Percannella, Università di Salerno, Italy
WP5.09 Visualization Methods for Personal Photo Collections: Browsing and
Searching in the PhotoFinder 1539
Hyunmo Kang and Ben Shneiderman, University of Maryland at College Park, USA
WP5.10 A Feature Point Based Scheme for Unsupervised Video Object Segmentation in
Stereoscopic Video Sequences 1543
Klimis S. Ntalianis, Nikolaos D. Doulamis, Anastasios D. Doulamis, and Stefanos D. Kollias, National Technical University of Athens, Greece
WP5.11 Visual and Audio Segmentation for Video Streams 1547
Takeshi Muramoto and Masahide Sugiyama, The University of Aizu, Japan
WP5.12 Joint Video Scene Segmentation and Classification based on Hidden Markov Model 1551
Jincheng Huang, Zhu Liu, and Yao Wang, Polytechnic University, USA
WP5.13 Video Object Segmentation and Tracking for Content-Based Video Coding 1555
J. Y. Zhou, National University of Singapore, Singapore; E. P. Ong, Institute of Microelectronics, Singapore; C. C. Ko, National University of Singapore, Singapore
WP5.14 Generating Optimal Video Summaries 1559
Yihong Gong and Xin Liu, NEC, USA
WP5.15 Tracking of Multiple Faces for Human-Computer Interfaces and Virtual Environments 1563
Fu Jie Huang and Tsuhan Chen, Carnegie Mellon University, USA
WP5.16 Browsing Images Based on Social and Content Similarity 1567
Junichi Tatemura, University of Tokyo, Japan
WP5.17 Toward a Retrieval of HTML Documents using a Semantic Approach 1571
Fernando Ferri, Istituto di Studi sulla Ricerca e sulla Documentazione Scientifica CNR, Italy; Cristina Ghiselli, Istituto per le Tecnologie Informatiche Multimediali CNR, Italy; Patrizia Grifoni, Istituto di Studi sulla Ricerca e sulla Documentazione Scientifica CNR, Italy; Marco Padula, Istituto per le Tecnologie Informatiche Multimediali CNR, Italy
WP6 Video Conferencing (R)
Time & Place: 3:50pm - 5:30pm, Beekman Parlor
WP6.01 Motion Detection and Segmentation using Image Mosaics 1577
Kiran S. Bhat, Mahesh Saptharishi, and Pradeep K. Khosla, Carnegie Mellon University, USA
WP6.02 Electronic Pan-Tilt-Zoom: A Solution for Intelligent Room Systems 1581
Mircea Nicolescu and Gerard Medioni, University of Southern California, USA
WP6.03 Robust Automatic Video-Conferencing with Multiple Cameras and Microphones 1585
Ce Wang, Scott Griebel, and Michael Brandstein, Harvard University, USA
WP6.04 Look Who's Talking: Speaker Detection using Video and Audio Correlation 1589
Ross Cutler and Larry Davis, University of Maryland, College Park, USA
WP6.05 Towards a Multimodal Meeting Record 1593
Ralph Gross, Michael Bett, Hua Yu, Xiaojin Zhu, Yue Pan, Jie Yang, and Alex Waibel, Carnegie Mellon University, USA
WP6.06 Smart Videoconferencing 1597
Dmitry Zotkin, Ramani Duraiswami, Vasanth Philomin, and Larry S. Davis, University of Maryland, College Park, USA
WP6.07 Rate-Distortion Optimization for Arbitrarily-Shaped Object Coding 1601
Guobin Shen, Bing Zeng, and Ming L. Liou, The Hong Kong University of Science and Technology, Hong Kong
WP7 QoS (Traffic Management / Protocol) III (R)
Time & Place: 3:50pm - 5:30pm, Sutton North
WP7.01 Explicit Rate Congestion Control of MPEG-2 Coded Video Traffic in ATM Networks 1607
Gajendra Sisodia, Ling Guan, Subrata De and Mehran Dowlatshahi, University of Sydney, Australia
WP7.02 Bandwidth Adaptive Smoothing for Multimedia Delivery 1611
Jae-Wook Kim and Rhan Ha, Hongik University, Korea; Hojung Cha, Kwangwoon University, Korea
WP7.03 An Adaptable Network Architecture for Multimedia Traffic Management and Control 1615
Sheng-Yih Wang and Bharat Bhargava, Purdue University, USA
WP7.04 LDA+: A TCP-Friendly Adaptation Scheme for Multimedia Communication 1619
Dorgham Sisalem and Adam Wolisz, GMD-Fokus, Germany
WP7.05 On the Quality of Service and Pricing in a Multiservice Network 1623
Tiina Keikkinen, University of Lund, Sweden
WP7.06 M3POC: A Multimedia Multicast Transport Protocol for Cooperative Applications 1627
T. Gayraud, P. Berthou, P. Owezarski, and M. Diaz, LAAS - CNRS, France
WP7.07 Distributed QoS Routing for Multimedia Traffic 1631
Venkatesh Sarangan, Donna Ghosh, and Raj Acharya, State University of New York at Buffalo, USA
WP8 Virtual / Augmented Reality (R)
Time & Place: 3:50pm - 5:30pm, Sutton Center
WP8.01 Camera Tracking for Augmented Reality Media 1637
Bolan Jiang, Suya You, and Ulrich Neumann, University of Southern California, USA
WP8.02 Mixing Realities in Shared Space: An Augmented Reality Interface for
Collaborative Computing 1641
Mark Billinghurst, University of Washington, USA; Ivan Poupyrev, ATR International,
Japan; Hirokazu Kato, Hiroshima City University, Japan; Richard May, University of Washington, USA
WP8.03 Networked Intelligent Collaborative Environment (NetICE) 1645
Wing Ho Leung, Khalid Goudeaux, Sooksan Panichpapiboon, Sy-Bor Wang, and Tsuhan Chen, Carnegie Mellon University, USA
WP8.04 Compression with Mosaic Prediction for Image-Based Rendering Applications 1649
Wing Ho Leung and Tsuhan Chen, Carnegie Mellon University, USA
WP8.05 Automatic 3D City Construction System using Omni Camera 1653
Hiroshi Kawasaki, Katsushi Ikeuchi, Masao Sakauchi, University of Tokyo, Japan
WP8.06 Virtual Me: A Virtual Communication Method that Enables Simultaneous Multiple
Existence as an Avatar and/or Agents 1657
Jun Ohya, Ryohei Nakatsu, Shinjiro Kawato, and Tatsumi Sakaguchi, ATR Media Integration & Communications Research Laboratories, Japan
WP8.07 Real Time 3D Navigation in a Static Virtualized Scene from a Limited Set of 2D Data 1661
Katia Fintzel and Jean-Luc Dugelay, Institut EURECOM, France
WP9 Synchronization (R)
Time & Place: 3:50pm - 5:30pm, Sutton South
WP9.01 An Adaptive Tutoring Machine based on Web Learning Assessment 1667
Timothy K. Shih, University of Aizu, Japan; Shi-Kuo Chang, University of Pittsburgh, USA; Ching-Sheng Wang, Tamkang University, Taiwan; Jianhua Ma and Runhe Huang , University of Aizu, Japan
WP9.02 An Approach to Checking Consistency in Multimedia Synchronization Constraints 1671
Huadong Ma, Beijing University of Posts & Telecommunications, China; Kang G. Shin, University of Michigan, USA
WP9.03 About the Semantic Verification of SMIL Documents 1675
P. N. M. Sampaio, C. A. S. Santos, and J. -P. Courtiat, LAAS CNRS, France
WP9.04 Common Time Reference for Interactive Multimedia Applications 1679
Mario Baldi and Yoram Ofek, Synchrodyne, Networks, Inc., USA
WP9.05 Extension of SMIL with QoS Control and its Implementation 1683
Yoshiki Terashima, Osaka University, Japan; Keiichi Yasumoto, Shiga University, Japan; Teruo Higashino, Osaka University, Japan; Kota Abe and Toshio Matsuura, Osaka City University, Japan; Kenichi Taniguchi, Osaka University, Japan
WP9.06 Skew Detection and Compensation for Internet Audio Applications 1687
Orion Hodson, Colin Perkins, and Vicky Hardman, University College London, United Kingdom
WP9.07 High-Level Multimedia Synchronisation Algorithm on Broadband Network 1691
Seng Bing Go, Yacine Atif and Qingping Lin, Nanyang Technological University, Singapore
WP10 Indexing (R)
Time & Place: 3:50pm - 5:30pm, Regent Parlor
WP10.01 Live Multimedia Adaptation Through Wireless Hybrid Networks 1697
Antti Koivisto, Pekka Pietikäinen, and Jaakko Sauvola, University of Oulu, Finland;
David Doermann, University of Maryland, USA
WP10.02 Performance Analysis of AB-Tree 1701
Sakti Pramanik, Jinhua Li and Jiandong Ruan, Michigan State University, USA
WP10.03 In Common Sense Rethinking Web Search Results 1705
E. Amitay, Macquarie University, Australia
WP10.04 Feature Based Indexing for Media Tracking 1709
Arun Hampapur and Ruud Bolle, IBM TJ Watson Research Center, USA
WP10.05 ClusterTree: Integration of Cluster Representation and Nearest Neighbor
Search for Image Databases 1713
Dantong Yu and Aidong Zhang, State University of New York at Buffalo, USA
WP10.06 Web-Based Searching and Browsing of Multimedia Data 1717
Wayne Niblack, Stanley Yue, Reiner Kraft, Arnon Amir, and Neel Sundaresan, Almaden Research Center, USA
WP10.07 A Look-Ahead Strategy for Graph Matching in Retrieval by Spatial Arrangement 1721
S. Berretti, A. Del Bimbo, and E. Vicario, Università di Firenze, Italy
WP11 Multimedia Codec III (P)
Time & Place: 3:50pm - 5:30pm, Sutton Corridor Poster
WP11.01 Real-Time Remote File System for Multimedia Application 1727
Shinzo Doi, Atsuhiro Tsuji, Yukiko Itoh, Kouji Kubota, and Tsutomu Tanaka, Matsushita Electric Industrial Co., Ltd., Japan
WP11.02 Fast Mesh Simplification for Progressive Transmission 1731
Wenlong Dong, Jiankun Li and C. -C. Jay Kuo, CheerMedia Corporation, USA
WP11.03 A Predictive H.263 Bit-Rate Control Scheme based on Scene Information 1735
Pohsiang Hsu and K. J. Ray Liu, University of Maryland at College Park, USA
WP11.04 Placement of Multi-rate Smoothed VBR Video Objects to MZR Disks 1739
Sooyong Kang and Heon Y. Yeom, Seoul National University, Korea
WP11.05 How to Measure Arithmetic Complexity of Compression Algorithms: A Simple Solution 1743
Julien Reichel and Marcus J. Nadenau, Swiss Federal Institute of Technology, Switzerland
WP11.06 Multiband Approach to Digital Audio FX 1747
Pablo Fernandez-Cid, Universidad Europea de Madrid, Spain; Javier Casajús-Quirós, Universidad Politécnica de Madrid, Spain
WP11.07 A Characteristics-Based Bandwidth Reduction Technique for Pre-recorded Videos 1751
Wallapak Tavanapong and Srikanth Krishnamohan, Iowa State University, USA
WP11.08 A Cost Function with Position Penalty for Motion Estimation in MPEG-2 Video Coding 1755
Hangu Yeo, Cesar A. Gonzales, Jack Kouloheris, and Wai-Man Lam, IBM T.J. Watson Research Center, USA
WP11.09 Image Denoising using Wiener Filtering and Wavelet Thresholding 1759
X. Huang and G. A. Woolsey, University of New England, Australia
WP11.10 Partial Update of Active Textures for Efficient Expression Synthesis in
Model-Based Coding 1763
Lijun Yin and Anup Basu, University of Alberta, Canada
WP11.11 Transmission of MPEG-4 Video over the Internet 1767
Steven Gringeri, Sami Iren, and Roman Egorov, GTE Laboratories Incorporated, USA
WP11.12 On Building an Internet Gateway for Internet Telephony 1771
Cheng-Yue Chang and Ming-Syan Chen, National Taiwan University, Taiwan
WP11.13 Lossless Compression for µ-Law (A-Law) and IMA ADPCM on the Basis of a
Fast RLS Algorithm 1775
Dawei Huang, Queensland University of Technology, Australia
Author Index follows page 1778