Computer Vision and Multimedia

IBM Research is a leading player in the quest to give AI systems sight. We’re enabling Watson, IBM's AI platform, to interpret visual content as easily as it does text.

Featured Projects

Image Captioning

Oral presentation at CVPR 2017, top entry in MS-COCO Captioning Challenge 2017

Dialog-based Interactive Image Retrieval

A new learning framework of multi-modal conversational agents

A Low Power, Fully Event-Based Gesture Recognition System

Powered by the first neuromorphic chip

Identifying skin cancer with computer vision

First machine learning system to demonstrate expert-level accuracy on a large public dermoscopy dataset

Automatic Curation of Sports Highlights

Officially used in the Masters, Wimbledon and US Open tournaments

Harnessing A.I. for Augmenting Creativity: Movie Trailer Creation

Best Paper Award, Brave New Ideas at ACM Multimedia 2017

Featured Papers

CVPR 2018 accepted publications

Learn about IBM Research AI at CVPR 2018 →

Benchmarks and Workshops

Learn more about IBM’s work in computer vision, including projects in all of these research areas →