IBM Logo Explore IBM Research
      Home   |   Products & services   |   Support & downloads   
 Select a country
 IBM Home
IBM Research
Think Research
Pen Technologies
About IBM Research
Resources
Search Research
Feedback
 
 

 

    
Pen Technologies
Glossary
To clarify some of the jargon on our web pages, we present here some of the terminology of handwriting recognition.

Writer-dependent Trained to recognize the handwriting of a particular writer. cf. writer-independent

Writer-independent Recognizing the handwriting of many different writers. cf. writer-dependent

Vocabulary, lexicon, word-list, dictionary A list of words known to the recognizer. This can be changed by the user for specific applications, or even for particular fields in a data-entry tool. Typically we have 23,000 words in the cursive note-taking application, 100,000 in the ThinkWrite product, and fewer in a form-filling application.

Training To improve the accuracy on a particular writer's handwriting, we collect some samples of handwriting from the writer, and automatically adjust the parameters of the recognition engine to arrive at a writer-dependent recognizer.

Adaptation Adaptation is training on a small amount of data to quickly increase the accuracy when a new writer starts to use the system. cf. Training

On-line On-line handwriting recognition is recognition of handwriting written with an electronic stylus onto a tablet or pen computer. From the digitizing device, the computer receives a stream of coordinates describing the path of the stylus tip. cf. Off-line.

Off-line Off-line handwriting recognition is recognition of handwriting that has already been written on a piece of paper, using a pen or pencil. The computer scans the paper with an optical scanner and recognizes the handwriting from the image, with no information about the timing or ordering of the strokes. cf. On-line.

Discrete Discrete handwriting is where the pen is lifted off the paper between each character. It is printing as opposed to cursive writing. IBM's ThinkWrite handwriting recognition product recognizes discrete handwriting, with occasional run-on.

Cursive Cursive handwriting is `joined-up' writing where consecutive letters are joined together. Recognition of cursive writing is harder than discrete writing because it is very difficult to work out where one letter ends and the next begins.

Unconstrained Unconstrained means that no restriction is placed on the style of handwriting that the recognizer will accept. In particular we use unonstrained to refer to a recognizer that will recognize cursive and discrete handwriting.

Electronic ink The lines on the screen to mark the path of an electronic pen. Words written in ink are on-line handwriting and can be recognized and transcribed into text.

Hidden Markov model This is the technique used by the cursive recognition engine. Statistics of training samples are used by the model to recognize unseen words.

    About IBM   |   Privacy   |   Legal   |   Contact