Table of Contents

[ The numbers following the paper titles are the page numbers in the Proceedings. ]


Volume I – Monday

MA0 Multimedia Education I (R)

Time & Place: 9:45am - 12:05pm, Beekman Parlor

MA0.01 Lectern: A Digital Desk System for Video-Free Course – Lecture Capturing and Playback 3

Tzi-cker Chiueh and Peitao Deng, State University of New York at Stony Brook, USA

MA0.02 Multimedia Linguistic Learning 7

E. Gentile, P. Plantamura, and V. L. Plantamura, Università degli Studi di Bari, Italy

MA0.03 Virtual Microphones for Multichannel Audio Applications 11

C. Kyriakakis and A. Mouchtaris, University of Southern California, USA

MA0.04 Multi-modal Interaction in the Age of Information Appliances 15

Stéphane H. Maes and T. V. Raman, IBM T.J. Watson Research Center, USA

MA0.05 An Educational-Oriented Framework for Building On-Line Courses using XML 19

Florin Bota, University of Cluj, Romania; Laura Farinetti, Politecnico di Torino, Italy; Anca Rarau, Technical University of Cluj, Romania

MA0.06 A Web-Based CBIR-Assisted Learning Tool for Radiology Education – Anytime and Anyplace 23

C. R. Shyu, A. C. Kak, C. E. Brodley, C. Pavlopoulou, and M. F. Chyan, Purdue University, USA; L. S. Broderick, University of Wisconsin Hospital, USA

MA0.07 Complete System for Distance Learning over IP 27

Javier Durán-de-Jesus, Juan José Villacorta-Calvo, and Alberto Izquierdo-Fuente, Universidad de Valladolid, Spain

MA1 Collaborative Networking Applications (R)

Time & Place: 9:45am - 12:05pm, Sutton North

MA1.01 An Efficient Near-Video-on-Demand System for Broadband Residential ADSL Network 33

Chi-Chien Hsueh, Wen-Cheung Cheng, and Chi-Shi Liu, Chunghwa Telecom, Taiwan

MA1.02 A Standalone Video Communication System for Wire and Wireless Communications 37

HonCheong Ng, HaiBin Huang, Xiao Lin, HockLye Toh, Susanto Rahardja, Hui Lan, and Xiang Chen, Nanyang Technological University, Singapore

MA1.03 A Cooperative Playback System for On-Demand Multimedia Sessions over Internet 41

G. Fortino and L. Nigro, Università della Calabria, Italy

MA1.04 Protocol for Collaborative Multimedia Presentations 45

Eenjun Hwang, Ajou University, Korea; B. Prabhakaran, National University of Singapore, Singapore

MA1.05 A Universal Distribution Protocol for Video-on-Demand 49

Jehan-François Pâris, University of Houston, USA; Steven W. Carter and Darrell D. E. Long, University of California, USA

MA1.06 Efficient Network Management for Collaborative Services and Application Development 53

Kevin H. Liu, Telcordia Technologies Inc., USA; Vassilios Th. Tsaoussidis, SUNY at Stony Brook, USA

MA1.07 Dynamic Object Markers in a Collaborative Environment for Video Content Discussion 57

Candemir Toklu, Thomas Fisher and Shih-Ping Liou, Siemens Corporate Research, USA

MA2 Web and E-Commerce (R)

Time & Place: 9:45am - 12:05pm, Sutton Center

MA2.01 Reduction of Blocking Artifacts for Low Bit-Rate Video Coding using Regularized Dequantization 63

S. Moon-Ho Song, Gunho Lee, Sanghoon Sull, and Sung-Jea Ko, Korea University, South Korea

MA2.02 Efficient Representation and Streaming of XML Content over the Internet Medium 67

Marc Girardot and Neel Sundaresan, IBM Almaden Research Center, USA

MA2.03 Talking Heads and Synthetic Speech: An Architecture for Supporting Electronic Commerce 71

Jörn Ostermann and David Millen, AT&T Labs - Research, USA

MA2.04 On Feasibility of MPEG-4 for Multimedia Integration for E-Commerce 75

A. Puri, R. L. Schmidt, and Q. Huang, AT&T Labs - Research, USA

MA2.05 Performance Evaluation of an Interactive Web-Based Multimedia Document System with Streaming Media 80

H. Fahmi, M. Latif, and A. Ghafoor, Purdue University, USA; P. Liu and L. Hsu, Siemens Corporate Research, USA

MA2.06 Internet Course Delivery – Making it Easier and More Effective 84

D. Anderson, L. Harvel, and M. Hayes, Georgia Institute of Technology, USA; Y. Ishiguro, NEC Corp., Japan; J. Jackson, Georgia Institute of Technology, USA; M. Pimentel, Universidade de São Paulo, Brazil

MA2.07 E-Commerce Direct Marketing using Augmented Reality 88

Xiang Zhang, Nassir Navab, and Shih-Ping Liou, Siemens Corporate Research, USA

MA3 Multimedia Codec I (R)

Time & Place: 9:45am - 12:05pm, Beekman Parlor

MA3.01 An Advanced Center Biased Three Step Search Algorithm for Motion Estimation 95

Humaira Nisar and Tae-Sun Choi, Kwangju Institute of Science and Technology, Korea

MA3.02 Multimedia Image Coding using Adaptive Integer-to-Integer Wavelet Transforms 99

Subhasis Saha and Rao Vemuri, University of California Davis, USA

MA3.03 Software Optimization of H.263 Video Encoder on Pentium Processor with MMX Technology 103

Pohsiang Hsu and K. J. Ray Liu, University of Maryland at College Park, USA

MA3.04 Region of Interest Coding for Low Bit Rate Image Transmission 107

O. Déforges and J. Ronsin, INSA, France

MA3.05 Simplified EZW Image Coder with Residual Data Transmission 111

Tanzeem Muzaffar and Tae-Sun Choi, Kwangju Institute of Science and Technology, Korea

MA3.06 PBS: A Predictive Block Sampling Algorithm for Desktop Multimedia Video Applications 115

Joseph Pasquale, Tom Nguyen, and Jon Kay, University of California, San Diego, USA

MA3.07 Efficient Multi-layer Coding and Encryption of MPEG Video Streams 119

Ali Saman Tosun and Wu-chi Feng, The Ohio State University, USA

MA4 Image Retrieval I (R)

Time & Place: 9:45am - 12:05pm, Regent Parlor

MA4.01 Image Retrieval Using Blob Histograms 125

Richard J. Qian, Peter J. L. van Beek, and M. Ibrahim Sezan, Intel Labs, USA

MA4.02 Indexing and Retrieval Scheme of the Image Database based on Color and Spatial Relations 129

Timothy K. Shih, Ching-Sheng Wang, Anthony Y. Chang, and Chuan-Ho Kao, Tamkang University, Taiwan

MA4.03 Detection of Human Faces using Skin Color and Eyes 133

Jeonghee Park, Jungwon Seo, Dongun An, and Seongjong Chung, Chonbuk National University, South Korea

MA4.04 JaNeT: A Framework for Flexible Web-Content Retrieval 137

F. Bergenti, A. Poggi, and M. Somacher, Università degli Studi di Parma, Italy

MA4.05 Image Retrieval by Shape: A Comparative Study 141

Maytham Safar, Cyrus Shahabi, and Xiaoming Sun, University of Southern California, USA

MA4.06 Compass: An Image Retrieval System for Distributed Databases 145

R. Brunelli and O. Mich, ITC-irst, Italy

MA4.07 The ICOR Framework: A Top-Down Approach to Media Indexing and Retrieval 149

Ivica Rimac, Stephan Fischer, and Ralf Steinmetz, Darmstadt University of Technology, Germany

MA5 Multimedia Authoring / Virtual Reality (P)

Time & Place: 9:45am - 12:05pm, Sutton Corridor

MA5.01 Variorum: A Multimedia-Based Program Documentation System 155

Tzi-cker Chiueh, Wei Wu, and Lap-Chung Lam, State University of New York at Stony Brook, USA

MA5.02 Content-Based Browsing and Editing of Unstructured Video 159

Giridharan Iyengar, IBM T.J. Watson Research Center, USA; Andrew B. Lippmann, MIT Media Laboratory, USA

MA5.03 Design of Video Caption Markup Language VCML and Development of VCML Player 163

Katsuyuki Watanabe, Naohide Fukada, and Masahide Sugiyama, The University of Aizu, Japan

MA5.04 Tightly Coupling Authoring and Evaluation in an Integrated Tool to Support Iterative Design of Educational Hypermedia 167

Selma Holmquist, Universidad de Los Andes, Venezuela; N. Hari Narayanan, Auburn University, USA

MA5.05 ProtoGMI MusicBrush - An Exercise in General Multimedia Instrument Interface Design 171

Timothy Chen, Kazushi Nishimoto, and Kenji Mase, ATR Media Integration & Communications Research Laboratories, Japan

MA5.06 MPEG-Pro: An Authoring System for MPEG-4 with Temporal Constraints and Template Guided Editing 175

Souhila Boughoufalah, Jean-Claude Dufourd, and Frederic Bouilhaguet, ENST Paris, France

MA5.07 Networked Virtual Environments for the Web: The WebTalk-I & WebTalk-II Architectures 179

Thimoty Barbieri, HyperMedia Open Center, Italy

MA5.08 VGuide: Design and Performance Evaluation of a Synchronous Collaborative Virtual Reality Application 183

J. M. Arango and P. K. McKinley, Michigan State University, USA

MA5.09 Architecture and Mechanisms of a Web-Based Video Data Management System 187

Shermann Sze-Man Chan and Qing Li, City University of Hong Kong, Hong Kong

MA5.10 Distributed Virtual Reality Authoring Interfaces for the WWW 191

I. Varlamis and M. Vazirgiannis, Athens University of Economics & Business, Greece; I. Lazaridis, University of California, Irvine, USA

MA5.11 On the Choice of Tactile Code 195

Danilo P. Mandic and Richard Harvey, University of East Anglia, United Kingdom; Djemal H. Kolonic, University of Banjaluka, Bosnia-Herzegovina

MA5.12 Unified Multiple Media Interface for Robot Teleoperation 199

Tamhant Jain, Sambit K. Dash, Nishant Agrawal, Susmit Sen, and Amitabha Mukerjee, I.I.T. Kanpur, India

MA5.13 System Aspects of Copy Management for Digital Video 203

Jean-Paul Linnartz, Joop Talstra, Ton Kalker, and Maurice Maes, Philips Research, The Netherlands

MA5.14 Interactive Artificial Life based on Behavior and Perception in a Virtual Environment 207

Hyun Seung Yang, Hyun-jin Park, and Yong-jin Cho, Korea Advanced Institute of Science and Technology, Korea

MP0 Multimedia Codec II (R)

Time & Place: 2:15pm - 3:35pm, Sutton South

MP0.01 An Efficient Low-Bit Rate Motion Compensation Technique Based on Quadtree 213

Hanan A. Mahmoud and Magdy Bayoumi, University of Louisiana, USA

MP0.02 Selective Requantization for Transcoding of MPEG Compressed Video 217

Hani Sorial and William E. Lynch, Concordia University, Canada; André Vincent, Communications Research Centre, Canada

MP0.03 Fast Dihedral Symmetry Operations on Digital Images in the Compressed Domain 221

Viresh Ratnakar, Bhaskaran Vasudev, and Victor Ivashin, Epson Research and Development, Inc., USA

MP0.04 Compressed Domain MPEG-2 Video Editing 225

Kai Wang and John W. Woods, Rensselaer Polytechnic Institute, USA

MP1 Multimedia Authoring (R)

Time & Place: 2:15pm - 3:35pm, Sutton North

MP1.01 Systematic Approach for Creating Animated Character Interfaces 231

Izumi Kohno, Shujun Yoshizaka, and Shin'ichi Uwakubo, NEC Corporation, Japan

MP1.02 Extending Databases to Support Image Editing 235

Greg Speegle, Allen M. Gao, and Shaowen Hu, Baylor University, USA; Le Gruenwald, University of Oklahoma, USA

MP1.03 Automatic Techniques for Insertion of Three-Dimensional Objects into a Video Sequence 239

Satyan R. Coorg, IBM T.J. Watson Research Center, USA

MP2 Audio Processing in Multimedia I (R)

Time & Place: 2:15pm - 3:35pm, Sutton Center

MP2.01 Selective Signal Cancellation for Multiple-Listener Audio Applications: An Information Theory Approach 245

MP2.02 Towards Efficient and Scalable Speech Compression Schemes for Robust Speech Recognition Applications 249

N. Srinivasamurthy and A. Ortega, USC, USA; Q. Zhu and A. Alwan, UCLA, USA

MP2.03 A Multiple Input Single Output Model for Rendering Virtual Sound Sources in Real Time 253

Panayiotis G. Georgiou and Chris Kyriakakis, University of Southern California, USA

MP2.04 A New Communication Paradigm: Action-to-Speech 257

Masanobu Abe and Tsubasa Shinozaki, NTT Cyber Space Labs, Japan

MP3 Streaming Video (R)

Time & Place: 2:15pm - 3:35pm, Sutton South

MP3.01 Optimal Streaming of Layer-Encoded Multimedia Presentations 263

David A. Turner and Keith W. Ross, Institut Eurécom, France

MP3.02 Segment Reencoding of Buffer Constrained Variable Bit Rate Video Streams 267

Larry Lu, Krishna Ratakonda, Rajesh Rajagopalan, Jack Kouloheris, and Cesar Gonzales, IBM T.J. Watson Research Center, USA

MP3.03 Streaming Video with Optimized Reconstruction-Based DCT 271

Xiao Su and Benjamin W. Wah, University of Illinois at Urbana-Champaign, USA

MP3.04 Transmission of Streaming Video over an EGPRS Wireless Network 275

Kapil Chawla, Zhimei Jiang, Xiaoxin Qiu, and Amy Reibman, AT&T Labs Research, USA

MP4 User Interface I (R)

Time & Place: 2:15pm - 3:35pm, Regent Parlor

MP4.01 Hand Gesture Recognition of a Mobile Device User 281

Vesa-Matti Mäntylä, Technical Research Centre of Finland, Finland; Jani Mäntyjärvi and Tapio Seppänen, University of Oulu, Finland; Esa Tuulari, Technical Research Centre of Finland, Finland

MP4.02 Gesture-Enhanced Information Retrieval and Presentation in a Distributed Learning Environment 285

Shi-Kuo Chang, University of Pittsburgh, USA; Tsuhan Chen, Carnegie Mellon University, USA; C. S. Li, IBM Watson Research Center, USA

MP4.03 Object-Based Visual Effects by using Multi-focus Images and Its Real-Time Implementation 289

Kiyoharu Aizawa, Akira Kubota, and Conny Riani Gunadi, University of Tokyo, Japan

MP4.04 Multimodal Output for a Conversational Telephony System 293

M. Mast, C. Günther, S. Kunzmann, and T. Roß, IBM Germany, Germany

MP5 Video / Image Retrieval II (P)

Time & Place: 2:15pm - 3:35pm, Sutton Corridor Poster

MP5.01 Incorporate Discriminant Analysis with EM Algorithm in Image Retrieval 299

Qi Tian, Ying Wu, and Thomas S. Huang, University of Illinois at Urbana-Champaign, USA

MP5.02 An Architecture for Content-Based Retrieval of Remote Sensing Images 303

Luis M. del Val Cura, Neucimar Jeronimo Leite, and Claudia Bauzer Medeiros, Campinas State University, Brazil

MP5.03 A Modified Zernike Moment Shape Descriptor Invariant to Translation, Rotation and Scale for Similarity-Based Image Retrieval 307

Hae-Kwang Kim, Sejong University, Korea; Jong-Deuk Kim, Dong-Gyu Sim, and Dae-Il Oh, Hyundai Electronics Industries Co., Korea

MP5.04 Color Based Retrieval and Recognition 311

Nicu Sebe and Michael S. Lew, Leiden Institute of Advanced Computer Science, The Netherlands

MP5.05 Spatial Match Representation and Retrieval for Supporting Ranking in Iconic Image Databases 315

Jae-Woo Chang, Yeon-Jung Kim, and Ki-Sung Jin, Chonbuk National University, South Korea

MP5.06 Perceptual Color on Internet Browser 319

Seiichiro Hangai, Takayuki Hamamoto, and Masato Kawamoto, Science University of Tokyo, Japan

MP5.07 Augmented Album: Situation-Dependent System for a Personal Digital Video/Image Collection 323

K. Priyantha Hewagamage and Masahito Hirakawa, Hiroshima University, Japan

MP5.08 On Design of Adaptive Internet Streaming Applications: An Architectural Perspective 327

Reza Rejaie, AT&T Labs - Research, USA

MP5.09 Non-linear Relevance Feedback: Improving the Performance of Content-Based Retrieval Systems 331

Nikolaos D. Doulamis, Anastasios D. Doulamis, and Stefanos D. Kollias, National Technical University of Athens, Greece

MP5.10 Using Multiple Examples for Content-Based Image Retrieval 335

J. Assfalg, A. Del Bimbo, and P. Pala, University of Florence, Italy

MP5.11 Adaptive Synthesis in Progressive Retrieval of Audio-Visual Data 339

John R. Smith and Chung-Sheng Li, IBM T.J. Watson Research Center, USA

MP5.12 A Novel Shape Matching Method using Biological-Sequence Dynamic Alignment 343

Shaojie Zhang and Kai-Kuang Ma, Nanyang Technological University, Singapore

MP5.13 A General Inference Network Based Architecture for Multimedia Information Retrieval 347

Campbell Wilson, Bala Srinivasan, and Maria Indrawan, Monash University, Australia

MP5.14 Using Category-Based Collaborative Filtering in the Active WebMuseum 351

Arnd Kohrs and Bernard Merialdo, Institute EURECOM, France

MP5.15 A Composite Histogram for Image Retrieval 355

Dong Kwon Park, Yoon Seok Jeon, and Chee Sun Won, Dongguk University, Korea; Soo-Jun Park, ETRI, Korea; Seong-Joon Yoo, SearchCast, Korea

MP5.16 Indexing for Linear Model-Based Information Retrieval 359

Yuan-Chi Chang and Chung-Sheng Li, IBM T. J. Watson Research Center, USA

MP5.17 Multi-clip Query Optimization in Video Databases 363

Ahmed Mostefaoui and Lionel Brunie, INSA de Lyon, France; Harald Kosch and László Böszörményi, University Klagenfurt, Austria

MP5.18 Partial Image Matching by Measures from Connected Color Regions 367

TaeYong Kim, Chung-Ang University, Korea; Joon H. Han, Pohang University of Science and Technology, Korea

MP6 Data Hiding I (R)

Time & Place: 3:50pm - 5:30pm, Beekman Parlor

MP6.01 An Evaluation Method for Watermarking Techniques 373

Wei Zhihui and Xiao Liang, Nanjing University of Science and Technology, P.R. China

MP6.02 Perceptually Transparent Attachment of Content-Based Data to Audio-Visual Documents 377

Frank Kurth, University of Bonn, Germany

MP6.03 Video Access Control Via Multi-level Data Hiding 381

Min Wu, Princeton University, USA; Hong Heather Yu, Panasonic Information & Networking Technological Laboratories, USA

MP6.04 Event-Coupled Hidden Markov Models 385

T. T. Kristjansson and B. J. Frey, University of Waterloo, Canada; Thomas S. Huang, University of Illinois at Urbana-Champaign, USA

MP6.05 Blind Digital Watermarking for Images and Videos and Performance Analysis 389

Qiang Cheng and Thomas S. Huang, University of Illinois at Urbana-Champaign, USA

MP6.06 Data Hiding in Digital Binary Image 393

Min Wu, Princeton Univ., USA; Edward Tang, Johns Hopkins Univ., USA; Bede Liu, Princeton Univ., USA

MP6.07 Transparent and Robust Audio Data Hiding in Cepstrum Domain 397

Xin Li, Princeton University, USA; Hong Heather Yu, Panasonic Information and Networking Technologies Lab, USA

MP7 Face Video Analysis & Synthesis I (R)

Time & Place: 3:50pm - 5:30pm, Sutton North

MP7.01 Human Facial Expression Recognition based on Learning Subspace Method 403

Xilin Chen, Harbin Institute of Technology, China; Sam Kwong, City University of Hong Kong, Hong Kong; Yan Lu, Harbin Institute of Technology, China

MP7.02 Facial Model Generation Through Mono Image Sequence 407

Chung J. Kuo, National Chung Cheng University, Taiwan; Tsang G. Lin, Industrial Technology Research Institute, Taiwan

MP7.03 Accurate Extraction of Human Face Area using Subspace Method and Genetic Algorithm 411

Makoto Murakami, Masahide Yoneyama, and Katsuhiko Shirai, Waseda University, Japan

MP7.04 Virtual Talk: A Model-Based Virtual Phone using a Layered Audio-Visual Integration 415

Yao-Jen Chang, Chih-Chung Chen, Jen-Chung Chou, and Yung-Chang Chen, National Tsing Hua University, Taiwan

MP7.05 A Performance Based Parametric Model for Facial Animation 419

Anna Wojdel and Leon J. M. Rothkrantz, Delft University of Technology, The Netherlands

MP7.06 Emotional Expressions in Audiovisual Human Computer Interaction 423

Lawrence S. Chen and Thomas S. Huang, Univ. of Illinois at Urbana-Champaign, USA

MP7.07 Speech-to-Face Movement Synthesis based on HMMs 427

Kiyotsugu Kakihara, Satoshi Nakamura, and Kiyohiro Shikano, Nara Institute of Science & Technology, Japan

MP8 Speech / Audio over Network (R)

Time & Place: 3:50pm - 5:30pm, Sutton South

MP8.01 Improving the Performance of ITU-T G.729A for VoIP 433

C. Montminy and T. Aboulnasr, University of Ottawa, Canada

MP8.02 Effects of Vocoder Distortion on Network Echo Cancellation 437

Y. Huang and R. A. Goubran, Carleton University, Canada

MP8.03 Toward Naturalness in Narrow-Band Speech Compression 440

S. Ghaemmaghami, Sharif University of Technology, Iran

MP8.04 Multiple Description Speech Coding for Robust Communication over Lossy Packet Networks 444

Wenqing Jiang and Antonio Ortega, University of Southern California, USA

MP8.05 Prosody Model in a Mandarin Text-to-Speech System based on a Hierarchical Approach 448

Neng-Huang Pan, Wen-Tsai Jen, Shyr-Shen Yu, Ming-Shing Yu, Shyh-Yang Hwang, and Ming-Jer Wu, National Chung-Hsing University, Taiwan

MP8.06 Automatic Audio Segmentation using a Measure of Audio Novelty 452

Jonathan Foote, FX Palo Alto Laboratory, Inc., USA

MP9 Video Retrieval (R)

Time & Place: 3:50pm - 5:30pm, Regent Parlor

MP9.01 Application-Specific File Prefetching for Multimedia Programs 459

Tulika Mitra, Chuan-Kai Yang, and Tzi-cker Chiueh, State University of New York at Stony Brook, USA

MP9.02 Modeling and Querying Videos by Content Trajectories 463

Zaher Aghbari, Kunihiko Kaneko, and Akifumi Makinouchi, Kyushu University, Japan

MP9.03 A Probabilistic-Based Mechanism for Video Database Management Systems 467

Mei-Ling Shyu, University of Miami, USA; Shu-Ching Chen, Florida International University, USA; R. L. Kashyap, Purdue University, USA

MP9.04 NMPTE: Network Multimedia Programming Training Environment 471

Yuh-Huei Shyu and Peng-Wen Chen, Tamkang University, Taiwan

MP9.05 A Probabilistic Framework for Semantic Indexing and Retrieval in Video 475

Milind R. Naphade and Thomas S. Huang, University of Illinois at Urbana-Champaign, USA

MP9.06 Retrieval by Content of Commercials based on Dynamics of Color Flows 479

A. Del Bimbo, P. Pala, and L. Tanganelli, University of Florence, Italy

MP9.07 Content Based Annotation and Retrieval of News Videos 483

M. Bertini, A. Del Bimbo, and P. Pala, University of Florence, Italy

MP10 Authentication / Security / Traffic / Error Control (P)

Time & Place: 3:50pm - 5:30pm, Sutton Corridor Poster

MP10.01 On-Line Signature Verification based on Altitude and Direction of Pen Movement 489

Seiichiro Hangai, Shinji Yamanaka, and Takayuki Hamamoto, Science University of Tokyo, Japan

MP10.02 Information Access using Speech, Speaker and Face Recognition 493

M. Viswanathan, H. S. M. Beigi, and A. Tritschler, IBM T.J. Watson Research Center, USA; F. Maali, Signal Recognition Corporation, USA

MP10.03 Image Enhancement Towards Soft Image Authentication 497

Liehua Xie and Gonzalo R. Arce, University of Delaware, USA; Arianne Lewis and E. Bert Basch, GTE Laboratories Incorporated, USA

MP10.04 Multi-level Reliability in Multimedia Collaboration over Heterogeneous Networks 501

Liang Cheng and Ivan Marsic, Rutgers University, USA

MP10.05 Multirate Trellis Coded Modulation in Multimedia Communications 505

Pingyi Fan, Tsinghua University, China; Xiang-Gen Xia, University of Delaware, USA

MP10.06 A Compression-Efficient Forward Error Control Mechanism for Image Transmission over ATM Networks 509

Rogelio Hasimoto-Beltran, University of Delaware, USA; Sohail A. Sheikh, Widener University, USA; Ashfaq A. Khokhar, University of Delaware, USA

MP10.07 Error Concealment for Image Transmission by Multiscale Markov Random Field Modeling 513

Yong Zhang and Kai-Kuang Ma, Nanyang Technological University, Singapore

MP10.08 Adaptive QoS for Mobile Multimedia Services over Wireless Networks 517

Alejandra Mercado and K. J. Ray Liu, University of Maryland, USA

MP10.09 Design Space Exploration for Providing QoS Within the Harmony Framework 521

A. M. Lele and S. K. Nandy, Indian Institute of Science, India; D. H. J. Epema, Delft University of Technology, The Netherlands

MP10.10 Descriptive-Procedural Configuration of Communication Bindings 525

Thorsten Kramp and Rainer Koster, University of Kaiserslautern, Germany

MP10.11 Multimedia Security Gateway Protocol to Achieve Anonymity in Delivering Multimedia Data using Watermarking 529

Sangeeta Narang, Indraprastha University, India; P. S. Grover, Delhi University, India; Saroj Kaushik, IIT, India

MP10.12 A Petri-Net Based Multilevel Security Specification Model for Multimedia Documents 533

J. Joshi and A. Ghafoor, Purdue University, USA

MP10.13 Remotely Keyed Encryption with Java Cards: A Secure and Efficient Method to Encrypt Multimedia Streams 537

Rüdiger Weis, cryptolabs Amsterdam, The Netherlands; Wolfgang Effelsberg and Stefan Lucks, Universität Mannheim, Germany

MP10.14 Individual Single Source Authentication on the MBone 541

F. Bergadano and D. Cavagnino, University of Turin, Italy; B. Crispo, SRI International, United Kingdom

MP10.15 Scheduling and Routing of Real-Time Multimedia Traffic in Packet-Switched Networks 545

Sudhir M. Rao and Albert M. K. Cheng, University of Houston, USA

MP10.16 Experimental Evaluation of Design Tradeoff in Specialized Virtual Machine for Multimedia Traffic in Active Networks 549

Sheng-Yih Wang and Bharat Bhargava, Purdue University, USA

MP10.17 A Simple Model for MPEG Video Traffic 553

Hai Liu, Nirwan Ansari, and Yun Q. Shi, New Jersey Institute of Technology, USA

MP10.18 TBLB Algorithm for Servicing Real-Time Multimedia Traffic Streams 557

William K. Wong and Victor C. M. Leung, The University of British Columbia, Canada

MPS11 Special Session: Perceptual Interface (R)
(Dominic Massaro)

Time & Place: 3:50pm - 5:30pm, Sutton Center

MPS11.01 Perceptual Interfaces in Human Computer Interaction 563

Dominic W. Massaro, University of California, USA

MPS11.02 Neural Mechanisms for Integrating Information from Multiple Senses 567

Barry E. Stein, Paul J. Laurienti, Terrence R. Stanford, and Mark T. Wallace, Wake Forest University School of Medicine, USA

MPS11.03 Multimodal Speech Synthesis 571

J. Schroeter, J. Ostermann, H. P. Graf, M. Beutnagel, E. Cosatto, A. Syrdal, A. Conkie, and Y. Stylianou, AT&T Labs - Research, USA

MPS11.04 Approaches to Visual Speech Processing based on the MPEG-4 Face Animation Standard 575

Eric Petajan, face2face animation, inc., USA

MPS11.05 Learning from Multimodal Observations 579

Deb Roy, MIT Media Laboratory, USA

Author Index follows page 582

Volume II – Tuesday

TA0 Multimedia Education II (R)

Time & Place: 9:45am - 12:05pm, Beekman Parlor

TA0.01 Development of CAI System Employing Synthesized Speech and CG Animated Agent 585

Tsubasa Shinozaki and Masanobu Abe, NTT Cyber Space Labs, Japan

TA0.02 Cyber Atelier: A Creative Learning Environment Assisting Non-professional Multimedia Productions 589

Shoji Tanaka, Jun Kurumizawa, Keiko Nakao, and Yuichi Iwadate, ATR Media Integration & Communications Research Labs, Japan

TA0.03 The Multimedia Online Collaboration Architecture: Tools to Enable Distance Learning 593

J. Peden, W. Burleson, and C. Leonardo, University of Massachusetts at Amherst, USA

TA0.04 Beehive - An Internet Integrated Collaborative Learning Environment 597

C. Zeve, J. V. de Lima, E. Polonia, H. Sloczinski, and J. Nitzke, Federal University of Rio Grande do Sul, Brazil

TA0.05 A Live Intranet Distance Learning System using MPEG-4 over RTP/RTSP 601

Peter Westerink, Lisa Amini, Sundar Veliah and Will Belknap, IBM T.J. Watson Research Center, USA

TA0.06 Prior Knowledge and Redundant Multimedia 605

Frank Vetere and Steve Howard, University of Melbourne, Australia

TA1 Face Video Analysis & Synthesis II (R)

Time & Place: 9:45am - 12:05pm, Sutton North

TA1.01 A Study on Subjective Evaluation of Facial Polygon Model 611

Daisuke Kase, Takayuki Hamamoto, and Seiichiro Hangai, Science University of Tokyo, Japan

TA1.02 Exploring the Time Course of Facial Expressions with a Fuzzy System 615

F. Piat and N. Tsapatsoulis, National Technical University of Athens, Greece

TA1.03 Audio-Visual Unit Selection for the Synthesis of Photo-Realistic Talking-Heads 619

Eric Cosatto, AT&T Labs - Research, USA; Gerasimos Potamianos, IBM T.J. Watson Research Center, USA; Hans Peter Graf, AT&T Labs - Research, USA

TA1.04 Automated Lip Synchronized Speech Driven Facial Animation 623

Zeki Melek and Lale Akarun, Bogaziçi University, Turkey

TA1.05 Talking Faces 627

Jun-yong Noh and Ulrich Neumann, University of Southern California, USA

TA1.06 Realistic Video Avatar 631

Wing Ho Leung, Carnegie Mellon University, USA; Belle L. Tseng, Zon-Yin Shae, and Ferdinand Hendriks, IBM T.J. Watson Research Center, USA; Tsuhan Chen, Carnegie Mellon University, USA

TA1.07 Robustness Against Instability of Sensory Judgment in a Human Interface to Draw a Facial Image using a Psychometrical Space Model 635

F. Sugimoto and M. Yoneyama, Toyo University, Japan

TA2 Feature Extraction for Image and Video (R)

Time & Place: 9:45am - 12:05pm, Sutton Center

TA2.01 Towards Automatic Extraction of Expressive Elements from Motion Pictures: Tempo 641

Brett Adams, Curtin University of Technology, Australia; Chitra Dorai, IBM T. J. Watson Research Center, USA; Svetha Venkatesh, Curtin University of Technology, Australia

TA2.02 Semi-automatic Semantic Video Object Extraction by Active Contour Model 645

Zhitao Lu and W. A. Pearlman, Rensselaer Polytechnic Institute, USA

TA2.03 A Method for Color Content Matching of Images 649

Aleksandra Mojsilovic, IBM TJ Watson Research Center, USA; Jianying Hu, Lucent Technologies Bell Labs, USA

TA2.04 Unifying Conversational Multimedia Interfaces for Accessing Network Services Across Communication Devices 653

G. Di Fabbrizio, S. Narayanan, P. Ruscitti, C. Kamm, B. Buntschuh, J. Hubbell, J. Wright, and J. Hamaker, AT&T Labs - Research, USA

TA2.05 A Panorama-Based Technique for Annotation Overlay and Its Real-Time Implementation 657

Masakatsu Kourogi, Takeshi Kurata, Katsuhiko Sakaue, and Yoichi Muraoka, Waseda University, Japan

TA2.06 Locale-Based Object Search under Illumination Change using Chromaticity Voting and Elastic Correlation 661

Ze-Nian Li, Zinovi Tauber and Mark S. Drew, Simon Fraser University, Canada

TA2.07 Content-Based Image Retrieval for Not-Well-Framed Images using Multiresolutional Eigen-Features 665

Young Bok Joo and Jesse Jin, The University of New South Wales, Australia

TAS3 Special Session: Content-Based Retrieval (R)

(Alberto Del Bimbo)

Time & Place: 9:45am - 12:05pm, Sutton South

TAS3.01 Expressive Semantics for Automatic Annotation and Retrieval of Video Streams 671

A. Del Bimbo, University of Florence, Italy

TAS3.02 Invariance in Content-Based Image Retrieval 675

Arnold Smeulders, Theo Gevers, Jan-Mark Geusebroek, and Marcel Worring, University of Amsterdam, The Netherlands

TAS3.03 From Features to Semantics: Some Preliminary Results 679

Rong Zhao and W. I. Grosky, Wayne State University, USA

TAS3.04 Semantic Modalities in Content-Based Retrieval 683

Simone Santini, University of California, San Diego, USA

TAS3.05 Structural and Semantic Analysis of Video 687

Shi-Fu Chang and Hari Sundaram, Columbia University, USA

TA4 Image Retrieval II (R)

Time & Place: 9:45am - 12:05pm, Regent Parlor

TA4.01 iPURE: Perceptual and User-Friendly Retrieval of Images 693

Gaurav Aggarwal, Pradeep Dubey, Sugata Ghosal, Ashutosh Kulshreshtha, and Abhinanda Sarkar, Indian Institute of Technology, USA

TA4.02 Supporting Multi-Example Image Queries in Image Databases 697

Lei Zhu and Aidong Zhang, State University of New York at Buffalo, USA

TA4.03 A Model for Multimodal Information Retrieval 701

Rohini K. Srihari, Aibing Rao, Benjamin Han, Srikanth Munirathnam, and Xiaoyun Wu, State University of New York at Buffalo, USA

TA4.04 Automatic Query Generation for Content-Based Image Retrieval 705

Christian Breiteneder, University of Vienna, Austria; Horst Eidenberger, Austrian Libraries Network, Austria

TA4.05 Similar Shape Retrieval in MARS 709

Kaushik Chakrabarti, Michael Ortega-Binderberger, Kriengkrai Porkaew, and Sharad Mehrotra, University of California, USA

TA4.06 Relevance Graph-Based Image Retrieval 713

Sanghoon Sull, Jeongtaek Oh, Sangwook Oh, and S. Moon-Ho Song, Korea University, Korea; Sang W. Lee, University of Michigan, USA

TA4.07 Image Retrieval with Sketches and Compositions 717

Raj Kumar Rajendran and Shih-Fu Chang, Columbia University, USA

TA5 Web Search / Retrieval & Applications (P)

Time & Place: 9:45am - 12:05pm, Sutton Corridor Poster

TA5.01 GEMS: A New Model of Parallelism in PC-Workstations for Multimedia Applications 723

Eric Debes and Fulvio Moschetti, Swiss Federal Institute of Technology, Switzerland

TA5.02 Effective Caching of Web Objects using Zipf's Law 727

D. N. Serpanos, University of Crete, Greece; G. Karakostas and W. H. Wolf, Princeton University, USA

TA5.03 2-D Interleaving for Enhancing the Robustness of Watermark Signals Embedded in Still Images 731

George F. Elmasry and Yun Q. Shi, New Jersey Institute of Technology, USA

TA5.04 A System of Integrating Videos and Maps for the Identification of Building Object 735

Haomin Jin, Xu Xu, Yaginuma Yoshitomo, and Masao Sakauchi, The University of Tokyo, Japan

TA5.05 Active Router Approach for Selective Packet Discard of Streamed MPEG Video under Low Bandwidth Conditions 739

G. Ravindra, N. Balakrishnan, and K. R. Ramakrishnan, Indian Institute of Science, India

TA5.06 Cooperating Intelligent Mobile Agents Mechanism for Distributed Multimedia Synchronization 743

Ying-Hong Wang and Hung-Zu Lin, TamKang University, Taiwan

TA5.07 WWW and Telecommunication Collaboration Service for Mandarin Automatic Personal Phonebook Inquire Dialogue System 747

Min-Jen Tsai, National Chiao Tung University, Taiwan; Tien-Hwa Ho, Chinese Navy Academy, Taiwan

TA5.08 On Voice Quality of IP Voice over GPRS 751

A. Lakaniemi, J. Parantainen, Nokia Research Center, Finland

TA5.09 Framework Design and Implementation of Web-Based Tutorials in Spoken Language Engineering 755

Rüdiger Hoffmann and Matthias Wolff, Dresden University of Technology, Germany

TA5.10 Telop-on-Demand: Video Structuring and Retrieval based on Text Recognition 759

H. Kuwano, Y. Taniguchi, H. Arai, M. Mori, S. Kurakake, and H. Kojima, NTT Cyber Solutions Laboratories, Japan

TA5.11 A Robust Video Watermarking Method 763

Dug-Ryung Kim, Ansan College, Korea; Sung-Han Park, Hanyang University, Korea

TA6 Multimedia Feature Representations (P)

Time & Place: 9:45am - 12:05pm, Sutton Corridor Poster

TA6.01 Graphical Representation for Generating Musical Sequences 769

Atsushi Hiroike, Hitachi Ltd., Japan

TA6.02 A New Approach for High Level Video Structuring 773

Yong-Moo Kwon, Chang-Jun Song, and Ig-Jae Kim, Korea Institute of Science & Technology, Korea

TA6.03 Solo: An MPEG-7 Optimum Search Tool 777

Jose A. Lay and Ling Guan, University of Sydney, Australia

TA6.04 Multimedia Documents Description by Ordered Hierarchies: The ToCAI Description Scheme 781

N. Adami, A. Bugatti, R. Leonardi, P. Migliorati and Lorenzo A. Rossi, University of Brescia, Italy

TA6.05 Using Shape Feature Matching to Track Moving Objects in Image Sequences 785

D. Hsu, J. Leu, S. Chen, W. Chang, and W. Fang, University of Southern California, USA

TA6.06 Interactive Gesture Interface for Intelligent Wheelchairs 789

Yoshinori Kuno, Teruhisa Murashima, Nobutaka Shimada, and Yoshiaki Shirai, Osaka University, Japan

TA6.07 Description Schemes for Retrieval Applications Targeted to the Audiovisual Market 793

José M. Martínez, Julián Cabrera, Jesús Bescós, José M. Menéndez, and Guillermo Cisneros, Universidad Politécnica de Madrid, Spain

TA6.08 A Robust Algorithm for Text Extraction in Color Video 797

Edward K. Wong and Minya Chen, Polytechnic University, USA

TA6.09 Low-Level Motion Activity Features for Semantic Characterization of Video 801

Kadir A. Peker, A. Aydin Alatan, Ali N. Akansu, New Jersey Institute of Technology, USA

TP0 Audio Processing in Multimedia II (R)

Time & Place: 1:15pm - 3:35pm, Beekman Parlor

TP0.01 A 3D Sound using the Adaptive Head Model and Measured Pinna Data 807

Cheng-Ta Chang and Oscal T.-C. Chen, National Chung Cheng University, Taiwan

TP0.02 Source Segmentation for Structured Audio 811

Kathy Melih and Ruben Gonzalez, Griffith University, Gold Coast, Australia

TP0.03 A Progressive Approach for Perceptual Audio Coding 815

Ye Shen, Hongmei Ai, and C.-C. Jay Kuo, University of Southern California, USA

TP0.04 Evaluation of a Melody Transcription System 819

Rodger J. McNab, Digilib Systems, Ltd., New Zealand; Lloyd A. Smith, New Mexico Highlands University, USA

TP0.05 First Measurements of a Large-Aperture Microphone Array System for Remote Audio Acquisition 823

Harvey F. Silverman, William R. Patterson III, and Joshua M. Sachar, Brown University, USA

TP0.06 Subband Audio Coding using a Perceptually Hybrid Vector-Scalar Quantization 827

Yu Rongshan, Nanyang Technological University, Singapore

TP0.07 Short-Time Kurtosis of Speech Signals with Application to Co-channel Speech Separation 831

Phillip L. De Leon, New Mexico State University, USA

TP1 QoS (Traffic Management / Protocol) I (R)

Time & Place: 1:15pm - 3:35pm, Sutton North

TP1.01 Trader's Quality of Service Specifications and Effects on System Performance for Video-on-Demand 837

Edward Babulak, University of Ottawa, Canada

TP1.02 On TCP-Friendly Video Transfer with Consideration on Application-Level QoS 843

Naoki Wakamiya, Masayuki Murata and Hideo Miyahara, Osaka University, Japan

TP1.03 Impact of Protocol Stacks on Quality of Perception 847

G. Ghinea, Brunel University, United Kingdom; J. P. Thomas, Pace University, USA

TP1.04 Resource-Aware Configuration of Ubiquitous Multimedia Services 851

Dongyan Xu, Duangdao Wichadakul, and Klara Nahrstedt, University of Illinois at Urbana-Champaign, USA

TP1.05 A Gateway-Assisted Approach Toward QoS Adaptations 855

William Kalter, Baochun Li, Won Jeon, Klara Nahrstedt, and Jun-Hyuk Seo, University of Illinois at Urbana-Champaign, USA

TP1.06 A Resource Broker Model with Integrated Reservation Scheme 859

Kihun Kim and Klara Nahrstedt, University of Illinois at Urbana-Champaign, USA

TP1.07 Mining User Behavior for Resource Prediction in Interactive Electronic Malls 863

Silvia Hollfelder, GMD - IPSI, Germany; Vincent Oria and M. Tamer Özsu, University of Alberta, Canada

TP2 Audio Retrieval (R)

Time & Place: 1:15pm - 3:35pm, Sutton Center

TP2.01 A Study on N-Gram Indexing of Musical Features 869

Chi Lap Yip and Ben Kao, The University of Hong Kong, Hong Kong

TP2.02 Query by Music Segments: An Efficient Approach for Song Retrieval 873

Arbee L. P. Chen, Maggie Chang, Jesse Chen, Jia-Lien Hsu, Chih-How Hsu, and Spot Y. S. Hua, National Tsing Hua University, Taiwan

TP2.03 Content-Based Indexing and Retrieval-by-Example in Audio 877

Zhu Liu, Polytechnic University, USA; Qian Huang, AT&T Labs - Research, USA

TP2.04 Indexing Telephone Conversations by Speakers 881

Ivan Magrin-Chagnolleau, IRISA/INRIA Rennes, France

TP2.05 Content-Based Indexing and Retrieval of Audio Data using Wavelets 885

Guohui Li and Ashfaq A. Khokhar, University of Delaware, USA

TP2.06 Distance Metrics and Indexing Strategies for a Digital Library of Popular Music 889

C. Francu and C. G. Nevill-Manning, Rutgers University, USA

TP3 Multimedia Representations (R)

Time & Place: 1:15pm - 3:35pm, Sutton South

TP3.01 Efficient Representation and Comparison of Multimedia Content using DAG-Composition 895

I-Jong Lin, Princeton University, USA; Ajay Divakaran and Anthony Vetro, Mitsubishi Electric, USA; Sun-Yuan Kung, Princeton University, USA

TP3.02 Feature Representations for Image Retrieval: Beyond the Color Histogram 899

Nuno Vasconcelos and Andrew Lippman, MIT Media Laboratory, USA

TP3.03 Discourse Structure Analysis for News Video 903

Yasuhiko Watanabe and Yoshihiro Okada, Ryukoku University, Japan; Sadao Kurohashi, Kyoto University, Japan; Eiichi Iwanari, Ryukoku University, Japan

TP3.04 Visual Segment Tree Creation for MPEG-7 Description Schemes 907

Philippe Salembier, Joan Llach, and Luis Garrido, Universitat Politècnica de Catalunya, Spain

TP3.05 Multi Layer Video Object Database based on Interactive Annotation and its Application 911

Tomoyuki Yatabe, Hiroshi Kawasaki, Hiroshi Mo, and Masao Sakauchi, University of Tokyo, Japan

TP3.06 Conceptual Modeling of Audio-Visual Content 915

John R. Smith and Ana B. Benitez, IBM T. J. Watson Research Center, USA

TP4 User Interface II (R)

Time & Place: 1:15pm - 3:35pm, Regent Parlor

TP4.01 Different Modalities in Assembly Support System User Interface 921

Lauri Repokari, Marko Nieminen, Milla Hailikari, Jyrki Kasvi, Matti Vartiainen, Anneli Pulkkis and Ilpo Kari, Helsinki University of Technology, Finland

TP4.02 EMG-Based Human-Machine Interface System 925

Osamah A. Alsayegh, Kuwait Institute for Scientific Research, Kuwait

TP4.03 A Test Bed for Prototyping Human/Computer Interfaces used in Mission Critical Environments 929

Michael D. Orosz and Walter J. Karplus, University of California at Los Angeles, USA; J. D. Balakrishnan, Purdue University, USA

TP4.04 A Framework for Creating Customized Multi-modal Interfaces for XML Documents 933

Sami Rollins, University of California at Santa Barbara, USA; Neel Sundaresan, IBM Almaden Research Center, USA

TP4.05 Automated Language Acquisition in Multimodal Environment 937

Daniel Nagy, Attila Medl, and James L. Flanagan, Rutgers University, USA

TP4.06 Temperament-Based Information Filtering: A Human Factors Approach to Information Recommendation 941

Cha-Hwa Lin and Dennis McLeod, University of Southern California, USA

TP4.07 A Method to Make a Three-Dimensional Model of an Individual Face from a Front View of a Facial Image Only 945

Y. Nakamura, Tokyo Denki University, Japan; F. Sugimoto and M. Yoneyama, Toyo University, Japan; S. Nakamura, Tokyo Denki University, Japan

TP5 Wireless Video / Communication (P)

Time & Place: 1:15pm - 3:35pm, Sutton Corridor Poster

TP5.01 High Quality Wideband Audio over DECT 951

Soledad Torres-Guijarro, Universidad Europea de Madrid, Spain; F. Javier Casajús-Quirós, Lino García-Morales and Ramón Sanchez-Perez, Universidad Politécnica de Madrid, Spain

TP5.02 A Novel Video Communication System Utilizing Adaptive and Integrated System Design for Mobile Wireless ATM 955

Jozsef Vass, Eyeball.com Network, Inc., Canada; Xinhua Zhuang, University of Missouri-Columbia, USA

TP5.03 A Demonstrator for Real-Time Multimedia Sessions over Third Generation Wireless Networks 959

S. Gruhl, Bell Laboratories, Lucent Technologies, Germany; A. Echihabi and T. Rachidi, Al Akhawayn University, Morocco; M. Link and M. Söllner, Bell Laboratories, Lucent Technologies, Germany

TP5.04 Bandwidth Management Providing Guaranteed Call Dropping Rates for Multimedia Mobile Networks 963

Jianping Jiang and Ten-Hwang Lai, The Ohio State University, USA

TP5.05 Pocket Pavilion: A Synchronous Collaborative Browsing Application for Wireless Handheld Computers 967

Philip K. McKinley and Ji Li, Michigan State University, USA

TP5.06 Low Bit-Rate Compression based on LAR Method for Videoconference via Internet 971

O. Déforges, L. Bédat, and J. Ronsin, INSA, France

TP5.07 Post-processing for Real-Time Quality Enhancement of MPEG-Coded Video Sequences 975

L. Atzori, University of Cagliari, Italy; F. G. B. De Natale, University of Trento, Italy; Fabrizio Granelli, University of Genoa, Italy

TP5.08 A New Search Algorithm for Block Motion Estimation 979

Jau-Ling Chen and Pei-Yin Chen, Southern Taiwan University of Technology, Taiwan

TP5.09 A Very Low Bit-Rate Video Coding Algorithm by Focusing on Moving Regions 983

Kwok-Wai Wong, Kin-Man Lam, and Wan-Chi Siu, The Hong Kong Polytechnic University, Hong Kong

TP5.10 Sending an Image to a Large Number of Nodes in Short Time using TCP 987

Takayuki Hirahara, Takashi Yamanoue, Hiroyuki Anzai, and Itsujirou Arita, Kyushu Kyouritsu University, Japan

TP5.11 Reducing Bandwidth Requirement for Delivering Video over Wide Area Networks with Proxy Server 991

Wei-hsiu Ma and David H. C. Du, University of Minnesota, USA

TP5.12 A Comprehensive Analysis of Energy Savings in Dynamic Supply Voltage Scaling Systems using Data Dependent Voltage Level Selection 995

Lama H. Chandrasena and Michael J. Liebelt, The University of Adelaide, Australia

TP5.13 Adapting Network Video to Multi-time Scale Bandwidth Fluctuations 999

Yuan-Chi Chang and Chung-Sheng Li, IBM T.J. Watson Research Center, USA; David G. Messerschmitt, University of California at Berkeley, USA

TP5.14 Progressive Image Transmission over OFDM Systems using Multiple Antennas 1003

Jie Song and K. J. Ray Liu, University of Maryland at College Park, USA

TP5.15 Performance Evaluation in Multimedia CDMA Wireless Transmission 1007

M. R. Hueda, C. E. Rodríguez and C. A. Marqués, Universidad Nacional de Córdoba, Argentina

TP6 Data Hiding II (R)

Time & Place: 3:50pm - 5:30pm, Beekman Parlor

TP6.01 Audio Watermarking: Features, Applications, and Algorithms 1013

Michael Arnold, Fraunhofer-Institute for Computer Graphics, Germany

TP6.02 An Improved All-Pass Watermarking for Speech and Audio 1017

Tolga Çiloglu and S. Utku Karaaslan, Middle East Technical University, Turkey

TP6.03 The Design and Implementation of a Streaming Application for MPEG Videos 1021

Aylin Kantarci and Turhan Tunali, Ege University, Turkey

TP6.04 Progressive Image Watermarking 1025

Trista Pei-chun Chen and Tsuhan Chen, Carnegie Mellon University, USA

TP6.05 A DCT Domain Visible Watermarking Technique for Images 1029

Saraju P. Mohanty, University of South Florida, USA; K. R. Ramakrishnan, Indian Institute of Science, India; Mohan S. Kankanhalli, National University of Singapore, Singapore

TP6.06 Enhancing Robustness of Digital Watermarking Against Geometric Attack based on Fractal Transform 1033

Zhicheng Ni and Eric Sung, Nanyang Technological University, Singapore; Yun Q. Shi, New Jersey Institute of Technology, USA

TP6.07 3-D Interleaving for Enhancing the Robustness of Watermark Signals Embedded in Video Sequences 1037

George F. Elmasry and Yun Q. Shi, New Jersey Institute of Technology, USA

TP7 QoS (Traffic Management / Protocol) II (R)

Time & Place: 3:50pm - 5:30pm, Sutton North

TP7.01 MIDI Encoding Method based on Variable Frame-Length Analysis and Its Evaluation of Coding Precision 1043

Toshio Modegi, Dai Nippon Printing Co., Ltd., Japan

TP7.02 A Novel ATM Traffic Scheduler for Real-Time Multimedia Data Transport with Improved Packet Level QoS 1047

Fu-Ming Tsou, National Taiwan University, Taiwan; Hong-Bin Chiou, Chunghwa Telecommunications Co., Ltd., Taiwan; Zsehong Tsai, National Taiwan University, Taiwan

TP7.03 Adaptive Reservation: A New Framework for Multimedia Adaptation 1051

Xin Wang and Henning Schulzrinne, Columbia University, USA

TP7.04 Network-Adaptive Rate Control With TCP-Friendly Protocol for Multiple Video Objects 1055

Qian Zhang, Wenwu Zhu, and Ya-Qin Zhang, Microsoft Research, China

TP7.05 Dynamic QoS and Routing Support for Real-Time Multimedia Applications in the Next Generation Internet 1059

Oliver T. W. Yu, University of Illinois at Chicago, USA

TP7.06 MQ: An Integrated Mechanism for Multimedia Multicasting 1063

De-Nian Yang, Wanjiun Liao, and Yen-Ting Lin, National Taiwan University, Taiwan

TP7.07 Transport of MPEG-4 over IP/RTP 1067

A. Basso, AT&T Labs – Research, USA; S. Varakliotis, University College London, United Kingdom

TP8 Lip Synchronization / Speechreading (R)

Time & Place: 3:50pm - 5:30pm, Sutton Center

TP8.01 Speaker Independent Audio-Visual Speech Recognition 1073

You Zhang, Stephen Levinson, and Thomas Huang, University of Illinois at Urbana-Champaign, USA

TP8.02 Lip Synchronization using Linear Predictive Analysis 1077

Sumedha Kshirsagar, Nadia Magnenat-Thalmann, University of Geneva, Switzerland

TP8.03 Automatic Selection of Visemes for Image-Based Visual Speech Synthesis 1081

Jie Yang, Jing Xiao, and Max Ritter, Carnegie Mellon University, USA

TP8.04 A Hierarchical Segmentation Algorithm for Face Analysis Application for Lipreading 1085

M. Liévin and F. Luthon, Grenoble National Polytechnic Institute, France

TP8.05 Translingual Visual Speech Synthesis 1089

Tanveer A. Faruquie and Chalapathy Neti, IBM T.J. Watson Research Center, USA; Nitendra Rajput, L. Venkata Subramaniam, Ashish Verma, IBM India Research Lab, India

TP8.06 A New Approach to Integrate Audio and Visual Features of Speech 1093

Hao Pan, Zhi-Pei Liang, and Thomas Huang, University of Illinois at Urbana-Champaign, USA

TP8.07 A Cascade Image Transform for Speaker Independent Automatic Speech Reading 1097

G. Potamianos, IBM T.J. Watson Research Center, USA; A. Verma, IBM India Research Lab, India; C. Neti, G. Iyengar, and S. Basu, IBM T.J. Watson Research Center, USA

TPS9 Special Session: MPEG-4 Natural Hybrid Coding (R)

(Euee Jang)

Time & Place: 3:50pm - 5:30pm, Sutton South

TPS9.01 Efficient Modeling of Virtual Humans in MPEG-4 1103

Tolga K. Capin, Swiss Federal Institute of Technology, Switzerland; Eric Petajan, face2face animation, inc., USA; Joern Ostermann, AT&T Labs – Research, USA

TPS9.02 Very Low Bitrate Coding of Virtual Human Animation in MPEG-4 1107

Tolga K. Capin, Swiss Federal Institute of Technology, Switzerland; Eric Petajan, face2face animation, inc., USA; Joern Ostermann, AT&T Labs – Research, USA

TPS9.03 Photo-Realistic 3D Model Coding in MPEG-4 1111

Nicolas Aspert and Touradj Ebrahimi, Swiss Federal Institute of Technology, Switzerland

TPS9.04 Animation Framework for MPEG-4 Systems 1115

Mikaël Bourges-Sévenier, iVast Inc., USA; Euee S. Jang and James D. K. Kim, Samsung AIT, Korea

TPS9.05 3D Animation Coding: Its History and Framework 1119

Euee S. Jang, Samsung AIT, Korea

TP10 Image / Video Segmentation / Summary I (R)

Time & Place: 3:50pm - 5:30pm, Regent Parlor

TP10.01 Automatic Image Event Segmentation and Quality Screening for Albuming Applications 1125

Alexander C. Loui, Eastman Kodak Company, USA; Andreas E. Savakis, Rochester Institute of Technology, USA

TP10.02 Labeling Update of Segmented Images using Conceptual Graphs and Dempster-Shafer Theory of Evidence 1129

Philippe Mulhem, IPAL-CNRS, KRDL, Singapore; Dezhong Hong and Jian Kang Wu, KRDL, Singapore

TP10.03 A Knowledge Engineering Approach for Image Classification based on Probabilistic Reasoning Systems 1133

Seungyup Paek and Shih-Fu Chang, Columbia University, USA

TP10.04 Structuring Personal Experiences -- Analyzing Views from a Head-Mounted Camera 1137

Yuichi Nakamura, Jun'ya Ohde, and Yuichi Ohta, University of Tsukuba, Japan

TP10.05 Real-Time Scene Change Detection on Compressed Multimedia Bitstream based on Statistical Sequential Analysis 1141

Dan Lelescu and Dan Schonfeld, University of Illinois at Chicago, USA

TP10.06 Video Scene Segmentation using Video and Audio Features 1145

Hari Sundaram and Shih-Fu Chang, Columbia University, USA

TP10.07 Rotation Invariant Face Detection using a Model-Based Clustering Algorithm 1149

Byeong Hwan Jeon and Sang Uk Lee, Seoul National University, Korea; Kyung Mu Lee, Hongik University, Korea

TP11 Video / Image Retrieval I (P)

Time & Place: 3:50pm - 5:30pm, Sutton Corridor Poster

TP11.01 A Content Based Internet Search Engine for Analysis and Archival of MPEG-1 Compressed Newsfeeds 1155

Odej Kao and Gerhard R. Joubert, Technical University of Clausthal, Germany

TP11.02 A Video Searching System using MSP Exchanging Data and Its Evaluation of Matching Methods 1159

Mei Kodama, Hiroshima University, Japan; Tomoji Ikeda, SATAKE Corporation, Japan

TP11.03 Video Composition and Retrieval 1163

V. Singla, Y. C. Park, S. Panchanathan, F. Golshani, Arizona State University, USA

TP11.04 An Efficient Technique for Summarizing Videos using Visual Contents 1167

JungHwan Oh and Kien A. Hua, University of Central Florida, USA

TP11.05 Efficient Camera Motion Characterization for MPEG Video Indexing 1171

Jae-Gon Kim, Hyun Sung Chang, and Jinwoong Kim, Electronics and Telecommunications Research Institute, Korea; Hyung-Myung Kim, Korea Advanced Institute of Science and Technology, Korea

TP11.06 Real Time Storage and Simultaneous Retrieval for Surveillance and Patrol Video 1175

Fujio Tsutsumi, Central Research Institute of Electric Power Industry, Japan

TP11.07 An Architecture of the Distributed Multimedia Information Retrieval Network with Query Routing Systems 1179

Yukiko Kawasaki and Hideki Sunahara, Nara Institute of Science and Technology, Japan

TP11.08 Local Web Advertisement Through Dynamic Active Proxy 1183

Jing Deng and Chi-Hung Chi, National University of Singapore, Singapore

TP11.09 Improving Visual Recognition using Color Normalization in Digital Video Applications 1187

Juan M. Sánchez and Xavier Binefa, Universitat Autònoma de Barcelona, Spain

TP11.10 Segmentation and Tracking of Video Objects for a Content-Based Video Indexing Context 1191

Magali Mazière and Françoise Chassaing, France Telecom CNET/DHI/HDM, France; Luis Garrido and Philippe Salembier, Universitat Politècnica de Catalunya, Spain

TP11.11 Index-Based Fast Search Algorithm of Image Database on Internet 1195

Chia H. Yeh and Chung J. Kuo, National Chung Cheng University, Taiwan

TP11.12 Update Relevant Image Weights for Content-Based Image Retrieval using Support Vector Machines 1199

Qi Tian, Pengyu Hong, and Thomas S. Huang, University of Illinois at Urbana-Champaign, USA

TP11.13 A Content-Based Scheme for CT Lung Image Retrieval 1203

Chii Tung Liu, Pol Lin Tai, Arlene Y.-J. Chen, Chen-Hsing Peng, and Jia Shung Wang, National Tsing Hua University, Taiwan

TP11.14 Relevance Feedback for Content-Based Retrieval using the Choquet Integral 1207

YoungSik Choi and Daewon Kim, MTRL, Korea Telecom, Korea; Raghu Krishnapuram, Colorado School of Mines, USA

TP11.15 Dimension Reduction of Texture Features for Image Retrieval using Hybrid Associative Neural Networks 1211

Jose Antonio Catalan and Jesse S. Jin, University of New South Wales, Australia

TP11.16 Benchmarking Access Structures for the Similarity Retrieval of High-Dimensional Multimedia Data 1215

Nathan G. Colossi, State University of Campinas, Brazil; Mario A. Nascimento, University of Alberta, Canada

Author Index follows page 1218

Volume III – Wednesday

WA0 Multimedia Home Appliance / Universal Access (R)

Time & Place: 9:45am - 12:05pm, Beekman Parlor

WA0.01 Clustering Source/Channel Rate Allocations for Receiver-Driven Multicast under a Limited Number of Streams 1221

Philip A. Chou, Microsoft Corporation, USA; Kannan Ramchandran, University of California, USA

WA0.02 From Proven Office Technologies to the Intelligent Multimedia Home 1225

Christian Gran and Angela Scheller, GMD FOKUS, Germany

WA0.03 Back to the TV: Information Visualization Interfaces based on TV-Program Metaphors 1229

Katsumi Tanaka, Akiyo Nadamoto, Machiko Kusahara, Taeko Hattori, Hiroyuki Kondo, and Kazutoshi Sumiya, Kobe University, Japan

WA0.04 Issues in Data Embedding and Synchronization for Digital Television 1233

J. Brunheroto, R. Chernock, P. Dettori, X. Dong, J. Paraszczak, F. Schaffa, and D. Seidman, IBM Research, USA

WA0.05 A Live Video Imaging Method for Capturing Presentation Information in Distance Learning 1237

Yoshinari Kameda, Kentaro Ishizuka, and Michihiko Minoh, Kyoto University, Japan

WA0.06 On the Application of Continuous Media Filters over Wireless Networks 1241

Margaritis Margaritidis and George C. Polyzos, University of California, San Diego, USA

WA0.07 Video Containers: A System for the On-Demand Storage, Delivery, and Management of Television Programs 1245

S. R. Subramanya, University of Missouri-Rolla, USA

WAS1 Special Session: Wireless Multimedia (R)

(David Goodman)

Time & Place: 9:45am - 12:05pm, Sutton North

WAS1.01 Wireless Communication of Vital Signs using the Georgia Tech Wearable Motherboard 1253

Babak Firoozbakhsh, Nikil Jayant, Sungmee Park, and Sundaresan Jayaraman, Georgia Institute of Technology, USA

WAS1.02 Filtering Wavelet based Video Streams for Wireless Inter-working 1257

A. Kassler, A. Neubeck, P. Schulthess, University of Ulm, Germany

WAS1.03 Issues of Mobile Ad-Hoc WANs 1261

Silvia Giordano, Maher Hamdi, Jean-Pierre Hubaux, Jean-Yves Le Boudec, and Ljubica Blazevic, Ecole Polytechnique Federale de Lausanne, Switzerland

WAS1.04 Admission and Flow Control for Multimedia CDMA 1265

Cristina Comaniciu, Narayan Mandayam, Rutgers University, USA; David Famolari and Prathima Agrawal, Telcordia Technologies, USA

WAS1.05 A Channel Predictor for Wireless Packet Networks 1269

Javier Gomez and Andrew T. Campbell, Columbia University, USA

WAS2 Special Session: Multimedia and Security (R)

(Jana Dittmann and Martin Steinebach)

Time & Place: 9:45am - 12:05pm, Sutton Center

WAS2.01 Approaches to Multimedia and Security 1275

Klara Nahrstedt, University of Illinois at Urbana-Champaign, USA; Jana Dittmann, GMD – IPSI, Germany; Petra Wohlmacher, University of Klagenfurt, Austria

WAS2.02 Steganalysis of LSB Encoding in Color Images 1279

Jiri Fridrich, Rui Du, and Meng Long, SUNY Binghamton, USA

WAS2.03 Watermarking Through Color Image Bands Decorrelation 1283

A. Piva, F. Bartolini, L. Boccardi, V. Cappellini, and A. DeRosa, Università di Firenze, Italy; M. Barni, Università di Siena, Italy

WAS2.04 Water-Filling for Watermarking? 1287

Deepa Kundur, University of Toronto, Canada

WAS2.05 Geometric Distortion Correction Through Image Normalization 1291

Masoud Alghoniemy and Ahmed H. Tewfik, University of Minnesota, USA

WA3 Video over Network (R)

Time & Place: 9:45am - 12:05pm, Sutton South

WA3.01 Image Integrity and Correction using Parities of Error Control Coding 1297

Jaejin Lee and Chee Sun Won, Dongguk University, Korea

WA3.02 Joint Source/FEC Rate Selection for Optimal MPEG-2 Video Delivery 1301

Pascal Frossard, Swiss Federal Institute of Technology, Switzerland; Olivier Verscheure, IBM T.J. Watson Research Center, USA

WA3.03 Activity-Adaptive Modeling of Dynamic Multimedia Traffic 1305

Deepak Turaga and Tsuhan Chen, Carnegie Mellon University, USA

WA3.04 Modeling of the Coding Gain of Joint Coding for Multi-program Video Transmission 1309

A. Vincent, P. Corriveau, P. Blanchfield, and R. Renaud, Communications Research Centre, Canada

WA3.05 Evaluation of Adaptive Filtering of MPEG System Streams in IP Networks 1313

Michael Hemy, Peter Steenkiste, and Thomas Gross, Carnegie Mellon University, USA

WA3.06 Non Linear Traffic Modeling of VBR MPEG-2 Video Sources 1318

Anastasios D. Doulamis, Nikolaos D. Doulamis and Stefanos D. Kollias, National Technical University of Athens, Greece

WA3.07 An Architecture based on IETF Protocols for the Transport of MPEG-4 Content over the Internet 1322

Roberto Castagno and Serkan Kiranyaz, Nokia Mobile Phones, Finland; Florin Lohan and Irek Defee, Tampere University of Technology, Finland

WA4 Image / Video Segmentation / Summary II (R)

Time & Place: 9:45am - 12:05pm, Regent Parlor

WA4.01 A Genetic Algorithm for Video Segmentation and Summarization 1329

Patrick Chiu, Andreas Girgensohn, Wolf Polak, Eleanor Rieffel, and Lynn Wilcox, FX Palo Alto Laboratory, USA

WA4.02 Video Abstract: A Hybrid Approach to Generate Semantically Meaningful Video Summaries 1333

Candemir Toklu and Shih-Ping Liou, Siemens Corporate Research, USA; Madirakshi Das, University of Massachusetts, USA

WA4.03 Generating Semantic Visual Templates for Video Databases 1337

William Chen and Shih-Fu Chang, Columbia University, USA

WA4.04 Design and Performance Study of Scalable Video Storage in a Disk-Array-Based Video Server 1341

Zheng-Ru Lin and Ming-Syan Chen, National Taiwan University, Taiwan

WA4.05 TV Program Classification based on Face and Text Processing 1345

Gang Wei, Wayne State University, USA; Lalitha Agnihotri and Nevenka Dimitrova, Philips Research, USA

WA4.06 Dissolve Transition Detection using B-Splines Interpolation 1349

Jeho Nam and Ahmed H. Tewfik, University of Minnesota, USA

WA4.07 A Bayesian Segmentation of Stereopairs 1353

G. A. Triantafyllidis, Aristotle University of Thessaloniki, Greece; D. Tzovaras and M. G. Strintzis, Informatics and Telematics Institute, Greece

WA5 Multimedia System / Hardware (P)

Time & Place: 9:45am - 12:05pm, Sutton Corridor Poster

WA5.01 Flexible Multimedia System for Multimedia Communication Services 1359

Koji Hashimoto and Yoshitaka Shibata, Iwate Prefectural University, Japan; Norio Shiratori, Tohoku University, Japan

WA5.02 Hardware/Software Co-design for Real-Time Physical Modeling 1363

B. Bishop, T. P. Kelliher, and M. J. Irwin, The Pennsylvania State University, USA

WA5.03 Supporting Audience and Player Interaction during Interactive Media Performances 1367

Nikitas M. Sgouros, University of Piraeus, Greece

WA5.04 Design and Implementation of a Programmable Stack Filter with FPGAs 1371

M. Hu, O. Vainio, and D. Gevorkian, Tampere University of Technology, Finland

WA5.05 A VLIW Architecture Simulator Innovative Approach for HW-SW Co-design 1375

Ivano Barbieri, Massimo Bariani, and Marco Raggio, University of Genova, Italy

WA5.06 Efficient Hardware-Software Co-design for the G.723.1 Algorithm Targeted at VoIP Applications 1379

Shridhar Mubaraq Mishra, Infineon Technologies Asia Pacific Pte. Ltd., Singapore; Arjun Balaram, Nortel Networks, Canada

WA5.07 MMX-Like Architecture Extension to Support the Rotation Operation 1383

J. Villalba, J. Hormigo, M. A. González, and E. L. Zapata, University of Málaga, Spain

WA5.08 A Finite Field Processor Employing Dual Parallel Datapath for High-Speed/Low-Power RS-ECC Applications 1387

Hyung-Joon Kwon, Young-Beom Jang, and Bangwon Lee, Samsung Electronics, Korea

WA5.09 A Multimedia Terminal Architecture for Dynamically Configurable Protocol Stacks 1391

Filip Vandermeulen, University of Ghent, Belgium; Frank Steegmans, Alcatel Corporate Research Center, Belgium; Brecht Vermeulen, University of Ghent, Belgium; Steven Vermeulen, Alcatel Corporate Research Center, Belgium

WA5.10 Programmable and Low Power VLSI Architecture for Full Search Motion Estimation in Multimedia Communications 1395

Luca Fanucci, National Research Council, Italy; Lorenzo Bertini and Sergio Saponara, University of Pisa, Italy

WA5.11 A VLSI Implementation Structure for Wavelet Decomposition Filter 1399

Wu Shunjun, Wang Chao, and Shang Yong, Xidian University, P.R. China

WA5.12 An Extensible Set-Top-Box Architecture for Interactive and Broadcast Services Offering Sophisticated User Guidance 1403

Frank Lonczewski and Rudolf Jaeger, BetaResearch, Germany

WA5.13 Interoperable Content Protection for Digital TV 1407

B. J. van Rijnsoever and J. P. Linnartz, Philips Research, The Netherlands

WA5.14 A Digital Television Service Architecture 1411

P. Vuorimaa, Helsinki University of Technology, Finland

WA5.15 Combined Watermarking for Image Authentication and Protection 1415

Chun-Shien Lu, Hong-Yuan Mark Liao, and Chwen-Jye Sze, Academia Sinica, Taiwan

WA5.16 FlyCam: Practical Panoramic Video and Automatic Camera Control 1419

Jonathan Foote and Don Kimber, FX Palo Alto Laboratory, Inc., USA

WP0 Multimedia System / Hardware (R)

Time & Place: 2:15pm - 3:35pm, Beekman Parlor

WP0.01 A Hardware Implementation for Approximate Text Search in Multimedia Applications 1425

H.-M. Blüthgen, P. Osterloh, H. Blume, and T. G. Noll, Aachen Institute of Technology, Germany

WP0.02 Improved Data Layouts for Fault-Tolerant Multimedia Systems 1429

Martha L. Escobar-Molano and Lanfeng Hao, University of South Florida, USA; David A. Barrett, Asgard Systems, USA

WP0.03 LucentVision: Converting Real World Events into Multimedia Experiences 1433

Gopal Pingali, Yves Jean, Agata Opalach, and Ingrid Carlbom, Bell Laboratories, Lucent Technologies, USA

WP0.04 Image-Based Rendering via the Standard Graphics Pipeline 1437

Miles E. Hansard and Bernard F. Buxton, University College London, United Kingdom

WP1 Wireless Multimedia (R)

Time & Place: 2:15pm - 3:35pm, Sutton North

WP1.01 On the Capabilities of Error Concealment in MPEG-2 Communications over Wireless ATM 1443

Francisco Delicado, Pedro Cuenca, and Antonio Garrido, Universidad de Castilla-La Mancha, Spain; Luis Orozco-Barbosa and Francisco Quiles, University of Ottawa, Canada

WP1.02 Source-Channel Matching Space-Time Diversity for Multimedia Communications 1447

H. Zheng, Bell Labs, Lucent Technologies, USA; K. J. R. Liu, University of Maryland, USA

WP1.03 Foveation-Based Error Resilience for Video Transmission over Mobile Networks 1451

Sanghoon Lee, Lucent Technologies, USA; Chris Podilchuk, Bell Labs, USA; Alan C. Bovik, The University of Texas at Austin, USA

WP1.04 Joint Downlink Beamforming, Power Control, and Data Rate Allocation for DS-CDMA Mobile Radio with Multimedia Services 1455

Ying-Chang Liang and Francois P. S. Chin, Centre for Wireless Communications, Singapore; K. J. Ray Liu, University of Maryland, USA

WP2 Video on Demand (R)

Time & Place: 2:15pm - 3:35pm, Sutton Center

WP2.01 Transmitting Variable-Bit-Rate Videos on Clustered VoD Systems 1461

Chow-Sing Lin, Min-You Wu and Wei Shu, UCF, USA

WP2.02 Design and Implementation of VoD Server by using Clustered File System 1465

Chang-Soon Park, ETRI, Korea; Mann-Ho Lee, Chungnam National University, Korea; Young-Sung Son, ETRI, Korea; Oh-Young Kwon, Korea University of Technology and Education, Korea

WP2.03 Broadcast News Parsing using Visual Cues: A Robust Face Detection Approach 1469

Yannis Avrithis, Nicolas Tsapatsoulis and Stefanos Kollias, National Technical University of Athens, Greece

WP2.04 Fast-Forward Functions on Parallel Video Servers 1473

Zhiyong Ding and Chow-Sing Lin, University of Central Florida, USA; Min-You Wu, University of New Mexico, USA

WP3 Error Control (R)

Time & Place: 2:15pm - 3:35pm, Sutton South

WP3.01 A Study of Keyframe Reference Picture Selection Method for Error Resilient Multiple Video Objects Distribution 1479

Hideaki Kimata, Yoshiyuki Yashima, and Naoki Kobayashi, NTT Cyber Space Laboratories, Japan

WP3.02 DCT Coefficient-Based Error Detection Technique for Compressed Video Stream 1483

K. Bhattacharyya and H. S. Jamadagni, Indian Institute of Science, India

WP3.03 Scalable MPEG-4 Video Coding with Graceful Packet-Loss Resilience over Bandwidth-Varying Networks 1487

M. van der Schaar and H. Radha, Philips Research USA, USA; C. Dufour, Philips Research LEP, France

WP4 Security / Authentication (R)

Time & Place: 2:15pm - 3:35pm, Regent Parlor

WP4.01 Multimedia Enhanced General-Purpose Processors 1493

Stephan Wong, Sorin Cotofana, and Stamatis Vassiliadis, Delft University of Technology, The Netherlands

WP4.02 VeriNet Web – Speaker Verification for the World Wide Web 1497

Kevin Farrell and William Mistretta, T-NETIX, Inc., USA

WP4.03 SMMM - A Secure MultiMedia Mail System 1501

Marcel Stanley A. de Moura, DI, PUC-Rio, Brazil; Guido Lemos de Souza Filho and Thaís Vasconcelos Batista, DIMAp – UFRN, Brazil; Luiz Fernando G. Soares, DI, PUC-Rio, Brazil

WP5 Segmentation, Summarization & Indexing (P)

Time & Place: 2:15pm - 3:35pm, Sutton Corridor Poster

WP5.01 Video Segmentation with the Assistance of Audio Content Analysis 1507

Hao Jiang, Microsoft Research, China; Tong Lin, Peking University, China; Hong-Jiang Zhang, Microsoft Research, China

WP5.02 On the Segmentation of Text in Videos 1511

Axel Wernicke and Rainer Lienhart, Intel Corporation, USA

WP5.03 Unsupervised Color Image Segmentation for Content Based Application 1515

Chung Hui Kuo and Ahmed H. Tewfik, University of Minnesota, USA

WP5.04 Towards Abstracting Sports Video by Highlights 1519

Noboru Babaguchi, Osaka University, Japan

WP5.05 Dynamic Video Abstract Generation using an Object DBMS 1523

H. Martin and R. Lozano, Laboratoire Université Joseph Fourier, France

WP5.06 Video Segmentation using Spatial and Temporal Statistical Analysis Method 1527

Zhibin Lei, Wu Chou, Jialin Zhong, and Chin-Hui Lee, Lucent Technologies, USA

WP5.07 Spatiotemporal Segmentation of Moving Video Objects over MPEG Compressed Domain 1531

How-Lung Eng and Kai-Kuang Ma, Nanyang Technological University, Singapore

WP5.08 Automated Threshold Selection for the Detection of Dissolves in MPEG Videos 1535

G. Boccignone, M. De Santo, and G. Percannella, Università di Salerno, Italy

WP5.09 Visualization Methods for Personal Photo Collections: Browsing and Searching in the PhotoFinder 1539

Hyunmo Kang and Ben Shneiderman, University of Maryland at College Park, USA

WP5.10 A Feature Point Based Scheme for Unsupervised Video Object Segmentation in Stereoscopic Video Sequences 1543

Klimis S. Ntalianis, Nikolaos D. Doulamis, Anastasios D. Doulamis, and Stefanos D. Kollias, National Technical University of Athens, Greece

WP5.11 Visual and Audio Segmentation for Video Streams 1547

Takeshi Muramoto and Masahide Sugiyama, The University of Aizu, Japan

WP5.12 Joint Video Scene Segmentation and Classification based on Hidden Markov Model 1551

Jincheng Huang, Zhu Liu, and Yao Wang, Polytechnic University, USA

WP5.13 Video Object Segmentation and Tracking for Content-Based Video Coding 1555

J. Y. Zhou, National University of Singapore, Singapore; E. P. Ong, Institute of Microelectronics, Singapore; C. C. Ko, National University of Singapore, Singapore

WP5.14 Generating Optimal Video Summaries 1559

Yihong Gong and Xin Liu, NEC, USA

WP5.15 Tracking of Multiple Faces for Human-Computer Interfaces and Virtual Environments 1563

Fu Jie Huang and Tsuhan Chen, Carnegie Mellon University, USA

WP5.16 Browsing Images Based on Social and Content Similarity 1567

Junichi Tatemura, University of Tokyo, Japan

WP5.17 Toward a Retrieval of HTML Documents using a Semantic Approach 1571

Fernando Ferri, Istituto di Studi sulla Ricerca e sulla Documentazione Scientifica – CNR, Italy; Cristina Ghiselli, Istituto per le Tecnologie Informatiche Multimediali – CNR, Italy; Patrizia Grifoni, Istituto di Studi sulla Ricerca e sulla Documentazione Scientifica – CNR, Italy; Marco Padula, Istituto per le Tecnologie Informatiche Multimediali – CNR, Italy

WP6 Video Conferencing (R)

Time & Place: 3:50pm - 5:30pm, Beekman Parlor

WP6.01 Motion Detection and Segmentation using Image Mosaics 1577

Kiran S. Bhat, Mahesh Saptharishi, and Pradeep K. Khosla, Carnegie Mellon University, USA

WP6.02 Electronic Pan-Tilt-Zoom: A Solution for Intelligent Room Systems 1581

Mircea Nicolescu and Gerard Medioni, University of Southern California, USA

WP6.03 Robust Automatic Video-Conferencing with Multiple Cameras and Microphones 1585

Ce Wang, Scott Griebel, and Michael Brandstein, Harvard University, USA

WP6.04 Look Who's Talking: Speaker Detection using Video and Audio Correlation 1589

Ross Cutler and Larry Davis, University of Maryland, College Park, USA

WP6.05 Towards a Multimodal Meeting Record 1593

Ralph Gross, Michael Bett, Hua Yu, Xiaojin Zhu, Yue Pan, Jie Yang, and Alex Waibel, Carnegie Mellon University, USA

WP6.06 Smart Videoconferencing 1597

Dmitry Zotkin, Ramani Duraiswami, Vasanth Philomin, and Larry S. Davis, University of Maryland, College Park, USA

WP6.07 Rate-Distortion Optimization for Arbitrarily-Shaped Object Coding 1601

Guobin Shen, Bing Zeng, and Ming L. Liou, The Hong Kong University of Science and Technology, Hong Kong

WP7 QoS (Traffic Management / Protocol) III (R)

Time & Place: 3:50pm - 5:30pm, Sutton North

WP7.01 Explicit Rate Congestion Control of MPEG-2 Coded Video Traffic in ATM Networks 1607

Gajendra Sisodia, Ling Guan, Subrata De and Mehran Dowlatshahi, University of Sydney, Australia

WP7.02 Bandwidth Adaptive Smoothing for Multimedia Delivery 1611

Jae-Wook Kim and Rhan Ha, Hongik University, Korea; Hojung Cha, Kwangwoon University, Korea

WP7.03 An Adaptable Network Architecture for Multimedia Traffic Management and Control 1615

Sheng-Yih Wang and Bharat Bhargava, Purdue University, USA

WP7.04 LDA+: A TCP-Friendly Adaptation Scheme for Multimedia Communication 1619

Dorgham Sisalem and Adam Wolisz, GMD-Fokus, Germany

WP7.05 On the Quality of Service and Pricing in a Multiservice Network 1623

Tiina Heikkinen, University of Lund, Sweden

WP7.06 M3POC: A Multimedia Multicast Transport Protocol for Cooperative Applications 1627

T. Gayraud, P. Berthou, P. Owezarski, and M. Diaz, LAAS - CNRS, France

WP7.07 Distributed QoS Routing for Multimedia Traffic 1631

Venkatesh Sarangan, Donna Ghosh, and Raj Acharya, State University of New York at Buffalo, USA

WP8 Virtual / Augmented Reality (R)

Time & Place: 3:50pm - 5:30pm, Sutton Center

WP8.01 Camera Tracking for Augmented Reality Media 1637

Bolan Jiang, Suya You, and Ulrich Neumann, University of Southern California, USA

WP8.02 Mixing Realities in Shared Space: An Augmented Reality Interface for Collaborative Computing 1641

Mark Billinghurst, University of Washington, USA; Ivan Poupyrev, ATR International, Japan; Hirokazu Kato, Hiroshima City University, Japan; Richard May, University of Washington, USA

WP8.03 Networked Intelligent Collaborative Environment (NetICE) 1645

Wing Ho Leung, Khalid Goudeaux, Sooksan Panichpapiboon, Sy-Bor Wang, and Tsuhan Chen, Carnegie Mellon University, USA

WP8.04 Compression with Mosaic Prediction for Image-Based Rendering Applications 1649

Wing Ho Leung and Tsuhan Chen, Carnegie Mellon University, USA

WP8.05 Automatic 3D City Construction System using Omni Camera 1653

Hiroshi Kawasaki, Katsushi Ikeuchi, and Masao Sakauchi, University of Tokyo, Japan

WP8.06 Virtual Me: A Virtual Communication Method that Enables Simultaneous Multiple Existence as an Avatar and/or Agents 1657

Jun Ohya, Ryohei Nakatsu, Shinjiro Kawato, and Tatsumi Sakaguchi, ATR Media Integration & Communications Research Laboratories, Japan

WP8.07 Real Time 3D Navigation in a Static Virtualized Scene from a Limited Set of 2D Data 1661

Katia Fintzel and Jean-Luc Dugelay, Institut EURECOM, France

WP9 Synchronization (R)

Time & Place: 3:50pm - 5:30pm, Sutton South

WP9.01 An Adaptive Tutoring Machine based on Web Learning Assessment 1667

Timothy K. Shih, University of Aizu, Japan; Shi-Kuo Chang, University of Pittsburgh, USA; Ching-Sheng Wang, Tamkang University, Taiwan; Jianhua Ma and Runhe Huang, University of Aizu, Japan

WP9.02 An Approach to Checking Consistency in Multimedia Synchronization Constraints 1671

Huadong Ma, Beijing University of Posts & Telecommunications, China; Kang G. Shin, University of Michigan, USA

WP9.03 About the Semantic Verification of SMIL Documents 1675

P. N. M. Sampaio, C. A. S. Santos, and J.-P. Courtiat, LAAS – CNRS, France

WP9.04 Common Time Reference for Interactive Multimedia Applications 1679

Mario Baldi and Yoram Ofek, Synchrodyne Networks, Inc., USA

WP9.05 Extension of SMIL with QoS Control and its Implementation 1683

Yoshiki Terashima, Osaka University, Japan; Keiichi Yasumoto, Shiga University, Japan; Teruo Higashino, Osaka University, Japan; Kota Abe and Toshio Matsuura, Osaka City University, Japan; Kenichi Taniguchi, Osaka University, Japan

WP9.06 Skew Detection and Compensation for Internet Audio Applications 1687

Orion Hodson, Colin Perkins, and Vicky Hardman, University College London, United Kingdom

WP9.07 High-Level Multimedia Synchronisation Algorithm on Broadband Network 1691

Seng Bing Go, Yacine Atif and Qingping Lin, Nanyang Technological University, Singapore

WP10 Indexing (R)

Time & Place: 3:50pm - 5:30pm, Regent Parlor

WP10.01 Live Multimedia Adaptation Through Wireless Hybrid Networks 1697

Antti Koivisto, Pekka Pietikäinen, and Jaakko Sauvola, University of Oulu, Finland; David Doermann, University of Maryland, USA

WP10.02 Performance Analysis of AB-Tree 1701

Sakti Pramanik, Jinhua Li and Jiandong Ruan, Michigan State University, USA

WP10.03 In Common Sense – Rethinking Web Search Results 1705

E. Amitay, Macquarie University, Australia

WP10.04 Feature Based Indexing for Media Tracking 1709

Arun Hampapur and Ruud Bolle, IBM T.J. Watson Research Center, USA

WP10.05 ClusterTree: Integration of Cluster Representation and Nearest Neighbor Search for Image Databases 1713

Dantong Yu and Aidong Zhang, State University of New York at Buffalo, USA

WP10.06 Web-Based Searching and Browsing of Multimedia Data 1717

Wayne Niblack, Stanley Yue, Reiner Kraft, Arnon Amir, and Neel Sundaresan, Almaden Research Center, USA

WP10.07 A Look-Ahead Strategy for Graph Matching in Retrieval by Spatial Arrangement 1721

S. Berretti, A. Del Bimbo, and E. Vicario, Università di Firenze, Italy

WP11 Multimedia Codec III (P)

Time & Place: 3:50pm - 5:30pm, Sutton Corridor Poster

WP11.01 Real-Time Remote File System for Multimedia Application 1727

Shinzo Doi, Atsuhiro Tsuji, Yukiko Itoh, Kouji Kubota, and Tsutomu Tanaka, Matsushita Electric Industrial Co., Ltd., Japan

WP11.02 Fast Mesh Simplification for Progressive Transmission 1731

Wenlong Dong, Jiankun Li, and C.-C. Jay Kuo, CheerMedia Corporation, USA

WP11.03 A Predictive H.263 Bit-Rate Control Scheme based on Scene Information 1735

Pohsiang Hsu and K. J. Ray Liu, University of Maryland at College Park, USA

WP11.04 Placement of Multi-rate Smoothed VBR Video Objects to MZR Disks 1739

Sooyong Kang and Heon Y. Yeom, Seoul National University, Korea

WP11.05 How to Measure Arithmetic Complexity of Compression Algorithms: A Simple Solution 1743

Julien Reichel and Marcus J. Nadenau, Swiss Federal Institute of Technology, Switzerland

WP11.06 Multiband Approach to Digital Audio FX 1747

Pablo Fernandez-Cid, Universidad Europea de Madrid, Spain; Javier Casajús-Quirós, Universidad Politécnica de Madrid, Spain

WP11.07 A Characteristics-Based Bandwidth Reduction Technique for Pre-recorded Videos 1751

Wallapak Tavanapong and Srikanth Krishnamohan, Iowa State University, USA

WP11.08 A Cost Function with Position Penalty for Motion Estimation in MPEG-2 Video Coding 1755

Hangu Yeo, Cesar A. Gonzales, Jack Kouloheris, and Wai-Man Lam, IBM T.J. Watson Research Center, USA

WP11.09 Image Denoising using Wiener Filtering and Wavelet Thresholding 1759

X. Huang and G. A. Woolsey, University of New England, Australia

WP11.10 Partial Update of Active Textures for Efficient Expression Synthesis in Model-Based Coding 1763

Lijun Yin and Anup Basu, University of Alberta, Canada

WP11.11 Transmission of MPEG-4 Video over the Internet 1767

Steven Gringeri, Sami Iren, and Roman Egorov, GTE Laboratories Incorporated, USA

WP11.12 On Building an Internet Gateway for Internet Telephony 1771

Cheng-Yue Chang and Ming-Syan Chen, National Taiwan University, Taiwan

WP11.13 Lossless Compression for µ-Law (A-Law) and IMA ADPCM on the Basis of a Fast RLS Algorithm 1775

Dawei Huang, Queensland University of Technology, Australia

Author Index follows page 1778