Skip to main content
ESIP Home
Project
ESIP Product
Cluster
White Paper
Contact
iis header page

next up previous
 Next:Summary Up: Model-Based Mining of Previous:Combining results from searching
 

Model Revision

 

This section examines one model revision algorithm [5] to illustrate the process of revising the feature space using feedback from the user and the ground truth results. In this algorithm, a linear transform of the features is modified by the user's feedback. A gradient (steepest descent) method is employed to ensure rapid convergence, which makes the approach suitable for  an interactive query environment.

We assume that an image database consists of a set of N feature vectors. Each feature vector has n dimensions. The feature vectors potentially represent a combination of color, texture and shape information.

The user starts a query by selecting a particular query image, region, or object. The corresponding feature vector is extracted from the example, and the K best matches are retrieved using a Euclidean metric. The K results whose feature vectors are closest to the target feature vectors are then returned to the user for visual inspection or further processing.

The user specifies a refinement by selecting tex2html_wrap_inline937 of the K matches that are most similar to the desired match and reissuing the query. Based upon this feedback, the linear or nonlinear transform matrix is modified to better approximate the user's evaluation of similarity. A second set of matches is found and returned to the user. The user selects the tex2html_wrap_inline941 best matches and again reissues the query. This process is repeated until either the result set converges, or the user stops the process.

If the set of the feature vectors selected by the user up to step (i-1) is denoted as tex2html_wrap_inline945 , then

equation359 

where tex2html_wrap_inline947 is the set of feature vectors selected during step i.

The vectors that are NOT selected up to step i-1 is tex2html_wrap_inline953 , then

equation363 

where tex2html_wrap_inline955 is the set of feature vectors rejected during step i.

Algorithm The algorithm [5] for iterative refinement using nonlinear multidimensional scaling is as follows:

  1. Performing similarity search on a feature vector, v, retrieving the K most similar results in the feature space. Set i=1.
  2. Initialize tex2html_wrap_inline963 and tex2html_wrap_inline965 to those vectors which are considered to be similar. Also initialize tex2html_wrap_inline967 and tex2html_wrap_inline969 to those vectors that are not considered to be similar. If the number of vectors is less than a prescribed threshold, set tex2html_wrap_inline971 , where tex2html_wrap_inline973 is a fixed increment. Return to step 1.
  3. Perform the rescaling, described in the previous subsection, based on two classes of vectors: tex2html_wrap_inline747 and tex2html_wrap_inline763 where tex2html_wrap_inline747 include all the vectors that are considered similar, while tex2html_wrap_inline763 include all the vectors that are considered to be no similar (or irrelevant). Consequently, the class label for tex2html_wrap_inline747 is 1, while the class label for tex2html_wrap_inline763 is 2.
  4. Perform similarity search in the scaled feature space. The results are assigned either to set tex2html_wrap_inline947 containing similar (or relevant) results or to set tex2html_wrap_inline955 containing dissimilar (or irrelevant) results.
  5. Update tex2html_wrap_inline747 and tex2html_wrap_inline763 as follows
  6. equation372
  7. equation375
  8. if the difference between tex2html_wrap_inline747 and tex2html_wrap_inline945 is less than a prescribed threshold, a equilibrium has been reached and exit.
  9. set i=i+1, and return to step 3.


next up previous
 Next:Summary Up: Model-Based Mining of Previous:Combining results from searching
ESIP Home Project ESIP Product Cluster White Paper Contact

| Project home| Technical agenda| Publications| Contact|

[ Research home page | IBM home page | Order | Search | Contact IBM | Legal ]