Profile

Masafumi Nishimura received his B.E. and M.E. degrees in Biophysical Engineering from Osaka University in 1981 and 1983, and his Dr. of Engineering degree from Toyohashi University of Technology in 1998. In 1983 he joined the IBM Japan Science Institute and worked in speech recognition area. Since 2003, he is a Senior Technical Staff Member of IBM. He is currently the STSM-Manager of Research of the Speech Technology Group at the IBM Research - Tokyo, Japan. His research interests include speech signal processing, robust speech recognition, statistical language modeling, expressive speech synthesis, emotion detection and speech analytics. Dr. Nishimura became Editor of the Institute of Electronics, Information and Communication Engineers Transactions in 2010. He received the SIG Research Award from the Information Processing Society of Japan in 1998, and the Prize for Outstanding Technological Development in Acoustics from the Acoustical Society of Japan in 1999. He is a senior member of the IEEE and the IEICE, and a member of the IPSJ and the ASJ.
Contact
Publications
Journal Papers
-
[1] "Monosyllable Recognition by Using Intermediate Cumulative Distance and
Normalized Distance Similarity,"
Masafumi Nishimura, Yasuhiro Matsuda,
Transactions of IPSJ, Vol. 27, No.1, pp.41-48, 1986. -
[2] "Speaker Adaptation Method for Fenonic Markov Model-based Speech
Recognition,"
Masafumi Nishimura,
IEICE Transactions on Information and Systems, D-II, Vol. J73-D-II, No.10, pp.1630-1638, 1990. -
[3] "Speaker adaptation method for fenonic Markov model-based speech recognition,"
Masafumi Nishimura,
Systems and Computers in Japan, Vol.22, No.13, pp.47-58, 1991. -
[4] "Large-vocabulary Speech Recognition on a General-purpose Speech Processing
Card,"
Akihiro Kuroda, Masafumi Nishimura,
Transactions of IPSJ, Vol. 35, No.8, pp.1549-1554, 1994. -
[5] "Word clustering for class-based language models,"
Shinsuke Mori, Masafumi Nishimura, Nobuyasu Itoh, Transactions of IPSJ, Vol. 38, No.11, pp.2200-2208, 1997. -
[6] "A word-based Japanese dictation system,"
Masafumi Nishimura, Nobuyasu Itoh,
IEICE Transactions on Information and Systems, D-II, Vol. J81-D-II, No.1, pp.1-8, 1998.1. -
[7] "A Word-based Japanese Language Model,"
Nobuyasu Itoh, Masafumi Nishimura, Shiho Ogino, Kazutaka Yamasaki,
Journal of Natural Language Processing, Vol.6, No.1, pp.9-28, Jan., 1999. -
[8] "Wavelet analysis for text-to-speech synthesis, "
Mei Kobayashi, Masaharu Sakamoto, Takeshi Saito, Yasuhide Hashimoto,
Masafumi Nishimura, Kazuhiro Suzuki,
IEEE Circuits & Systems, Vol. 45, No. 8, Aug. 1998, pp. 1125-1129. -
[9] "Word-based approach to large-vocabulary continuous speech recognition for
Japanese,"
Masafumi Nishimura, Nobuyasu Itoh, Kazutaka Yamasaki,
Transactions of IPSJ, Vol.40, No.4, pp.1395-1403, 1999-4. -
[10] "Large vocabulary spontaneous-speech recognition using a corpus of lectures,"
Masafumi Nishimura, Nobuyasu Itoh,
IEICE Transactions on Information and Systems, D-2, Vol.J83-D2, pp.2473-2480, 2000, 11. -
[11] "Large vocabulary spontaneous-speech recognition using a corpus of lectures,"
M.Nishimura, N.Itoh,
Electronics and Communications in Japan, Vol.86, No.9 , 2003.Sep. -
[12] "Speech enhancement by Profile Fitting method,"
O.Ichikawa, T.Takiguchi, M.Nishimura,
IEICE Transactions on Information and Systems, Vol.E86-D No.3, pp.514-521, 2003. -
[13] "Improved HMM Separation for Distant-Talking Speech Recognition,"
T.Takiguchi, M.Nishimura,
IEICE Trnsactions on Information and Systems, Vol.E87-D, No.5, pp.1127-1137, 2004. -
[14] "Sound source localization using a pinna-based Profile Fitting method,"
O.Ichikawa, T.Takiguchi, M.Nishimura,
IEICE Transactions on Information and Systems, Vol.E87-D No.5, pp.1138-1145, 2004. -
[15] "Simultaneous adaptation of echo cancellation and spectral subtraction for
in-car speech recognition,"
O.Ichikawa, M.Nishimura,
IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, Vol.E88-A No.7, pp.1732-1738, 2005. -
[16] “An N-gram-based Approach to Phoneme and Accent Estimation for TTS,”
Tohru Nagano, Shinsuke Mori, Masafumi Nishimura,
Transactions of IPSJ, Vol.47, No.6, 2006. -
[17] "Acoustic Model Adaptation Using First-Order Linear Prediction for
Reverberant Speech”,
T. Takiguchi, M. Nishimura, and Y. Ariki,
IEICE Transactions on Information and Systems, Vol. E89-D, No. 3, pp. 908-914, 2006. -
[18] "Automatic Prosody Labeling using Multiple Models for Japanese,"
R.Tachibana, T.Nagano, G.Kurata, M.Nishimura, N.Babaguchi,
IEICE Transactions on Information and Systems, Vol. E90-D, No.11, pp. 1805-1812, 2007. -
[19] "Unsupervised Adaptation of a Speech Recognition System Using a
Lecture-Related Corpus,"
Gakuto Kurata, Shinsuke Mori, Masafumi Nishimura,
IEICE Transactions on information and systems, Vol. J90-D, No.9, pp.2530-2540, 2007. -
[20] "Unsupervised Construction of Speech Recognition Lexicon from Speech and
Text,"
Gakuto Kurata, Shinsuke Mori, Nobuyasu Itoh, Masafumi Nishimura,
Transactions of IPSJ, Vol.49, No.8, pp.2900-2909, 2008. -
[21] "Local peak enhancement for in-car speech recognition in noisy environment,"
O.Ichikawa, T.Fukuda, M.Nishimura,
IEICE Transactions on Information and Systems, Vol.E91D No.3, pp.635-639, 2008. -
[22] "DOA Estimation with Local-Peak-Weighted CSP,"
Ichikawa, O., Fukuda, T., Nishimura, M.,
Trans. EURASIP, Volume 2010, Article ID 358729, 9 pages, 2010, May. -
[23] “Long-term spectro-temporal and static harmonic features for voice activity
detection,”
Fukuda, T., Ichikawa, O., Nishimura, M.,
IEEE journal of selected topics in signal processing, Vol., 4, Issue 5, 2010. -
[24] “Dynamic Features in the Linear-Logarithmic Hybrid Domain for Automatic
Speech Recognition in a Reverberant Environment,"
Ichikawa, O., Fukuda, T., Nishimura, M.,
IEEE journal of selected topics in signal processing, Vol., 4, Isuue 5, 2010. -
[25] "Speech Input Method in Automobiles Reflecting Analysis on How Users Speak,"
Kurata. G., Ichikawa, O.,Nishimura, M.,
IEICE transactions on information and systems, D-II Vol, J93-D, No.10, pp.2107-2117, 2010, 10. -
[26] "Corpus-based Text-to-Speech Front-end for Japanese,''
T. Nagano, R. Tachibana, M. Nishimura,
IEICE transactions on information and systems, D-II Vol. J93-D, No.10, pp.2096-2106, 2010, 10. -
[27] "Acoustically Discriminative Language Model Training with Pseudo-hypothesis,"
Gakuto Kurata, Abhinav Sethy, Bhuvana Ramabhadran, Ariya Rastrow, Nobuyasu Itoh, Masafumi Nishimura,
Speech Communication, Vol.54, Issue 2, pp.219-228, February 2012. -
[28] "Leveraging Word Confusion Networks for Named Entity Modeling and Detection from Conversational Telephone Speech,"
Gakuto Kurata, Nobuyasu Itoh, Masafumi Nishimura, Abhinav Sethy, Bhuvana Ramabhadran,
Speech Communication, Vol.54, Issue 3, pp.491-502, March 2012.
International Conference Papers
-
[1] "A Method for recognizing Japanese monosyllables by using intermediate cumulative distance,"
Yasuhiro Matsuda, Shu Tezuka, Mitsuhiko Kanoh, Masafumi Nishimura and Toyohisa Kaneko,
IEEE ICASSP'84, 9.3, 1984. -
[2] "Isolated word recognition using hidden Markov models,"
Kazuhide Sugawara, Masafumi Nishimura Koichi Toshioka Masaaki Okochi and Toyohisa Kaneko,
IEEE ICASSP'85, 1.1, 1985. -
[3] "Isolated word recognition using HMM with duration distribution,"
Masafumi Nishimura and Masakai Okochi,
ICA-12, A1-8, 1986. -
[4] "Speaker adaptation for a hidden Markov model,"
Kazuhide Sugawara, Masafumi Nishimura and Akihiro Kuroda,
IEEE ICASSP'86, 49.11, 1986. -
[5] "HMM-based speech recognition using multi-dimensional multi-labeling,"
Masafumi Nishimura and Koichi Toshioka,
IEEE ICASSP'87, 27.11, 1987. -
[6] "Speaker adaptation method for HMM-based speech recognition,"
Masafumi Nishimura and Kazuhide Sugawara,
IEEE ICASSP'88, S5.7, 1988. -
[7] "HMM-based speech recognition using dynamic spectral feature,"
Masafumi Nishimura,
IEEE ICASSP'89, S6.12, 1989. -
[8] "Word Clustering for a Word Bi-gram Model,"
Shinsuke Mori, Masafumi Nishimura, Nobuyasu Itoh
ICSLP 1998 -
[9] "Recognizing overlapping speech by using HMM composition,"
T.Takiguchi, M.Nishimura,
The seventh Western Pacific Regional Acoustics Conference, 2000. -
[10] "A method for sytle adaptation to spontaneous speech by using a semi-linear interpolation technique,"
N.Itoh, M.Nishimura,
Proc of 6th ICSLP, Oct, 2000. -
[11] "Integration of HMM composition and a microphone array for overlapping speech recognition,"
T.Takiguchi, M.Nishimura,
Workshop on Hands-free speech communication, pp.127-130, 2001. -
[12] "A Stochastic Parser Based on a Structural Word Prediction Model,"
Shinsuke MORI, Masafumi NISHIMURA, Nobuyasu ITOH, Shiho OGINO, Hideo WATANABE
Proc. of Coling 2000, pp. 558-564, 2000. -
[13] "Improvement of a Structured Language Model: Arbori-context Tree,"
Shinsuke MORI, Masafumi NISHIMURA, Nobuyasu ITOH
Proc. of EuroSpeech 2001, pp. 713-716, 2001. -
[14] "An automatic sentence boundary detector based on a structured language model,"
S.Mori, M.Nishimura and N.Itoh,
Proc. of ICSLP 2002., pp.921-924, Sep. 2002. -
[15] "Sound source localization using a pinna-based Profile Fitting method,"
O.Ichikawa, T.Takiguchi, M.Nishimura,
International Workshop on Acoustic Echo and Noise Control(IWAENC), pp.263-266, 2003. -
[16] "Reverberant Speech Recognition using First-Order Linear Prediction,"
T.Takiguchi, M.Nishimura,
Proc. of International Congress on Acoustics, pp.2829-2830. 2003. -
[17] "Language Model Adaptation Using Word Clustering,"
Shinsuke MORI, Masafumi NISHIMURA, Nobuyasu ITOH
Proc. of EuroSpeech 2003, pp.425-428, 2003. - [18] "Acoustic Model Adaptation using First Order Prediction for Reverberant Speech," T.Takiguchi, M.Nishimura, Proc. IEEE International Conf. on Acoustics, Speech and Signal Processing, pp.869-872. 2004.
-
[19] "A Stochastic Approach to Phoneme and Accent Estimation,"
Tohru NAGANO, Shinsuke MORI, Masafumi NISHIMURA
EuroSpeech 2005 -
[20] "Simultaneous adaptation of echo cancellation and spectral subtraction for in-car speech recognition,"
Osamu Ichikawa and Masafumi Nishimura,
Proc. of European Conference on Speech Communication and Technology (EuroSpeech / InterSpeech) 2005, pp.2293-2296, 2005. -
[21] "Unsupervised Adaptation of a Stochastic Language Model Using a Japanese Raw Corpus,"
Gakuto KURATA, Shinsuke MORI, Masafumi NISHIMURA
ICASSP 2006.6. -
[22] "Unsupervised Lexicon Acquisition from Speech and Text, "
G.KURATA, S.MORI, N.ITOH, M.NISHIMURA,
Proc. of ICASSP 2007, Vol.4, pp.421-424, 2007. -
[23] "Preliminary Experiments toward Automatic Generation of New TTS Voices from Recorded Speech Alone,"
R.Tachibana, T.Nagano, G.Kurata, M.Nishimura, N.Babaguchi,
Proc. of INTERSPEECH, 2007. -
[24] "Short- and Long-term Dynamic Features for Robust Speech Recognition,"
T.Fukuda, O.Ichikawa, M.Nishimura,
Proc of Interspeech 2008, pp.2262-2265, 2008. -
[25] "Phone-duration-dependent Long-term Dynamic Features for Stochastic Model-based Voice Activity Detection,"
T.Fukuda, O.Ichikawa, M.Nishimura,
Proc of Interspeech 2008, pp.1293-1296, 2008. -
[26] "Improving Phoneme and Accent Estimation by Leveraging a Dictionary for a Stochatic TTS Front-end,"
T.Nagano, R.Tachibana, N. Itoh, and M.Nishimura,
Proc.,IEEE ICASSP 2008, pp.4689-4692, 2008. -
[27]“Local Peak Enhancement Combined with Noise Reduction Algorithms for Robust Automatic Speech Recognition in Automobiles,”
O.Ichikawa, T.Fukuda, M.Nishimura,
IEEE ICASSP 2008, pp.4865-4868, 2008. -
[28]“Acoustically Discriminative Training for Language Models”,
Gakuto KURATA, Nobuyasu ITOH, Masafumi NISHIMURA,
Proc. Of ICASSP 2009, Apri. 2009 -
[29] "Japanese Pitch Conversion for Voice Morphing Based on Differential Modeling,"
Ryuki Tachibana, Zhiwei Shuang, Masafumi Nishimura,
InterSpeech 2009, Sep. 2009. -
[30]“Dynamic Features in the Linear Domain for Robust Automatic Speech Recognition in a Reverberant Environment”,
Osamu Ichikawa, Takashi Fukuda, Masafumi Nishimura,
Interspeech 2009, Sep. 2009 -
[31]“Improved voice activity detection using static harmonic features,”
Fukuda, T., Ichikawa, O., Nishimura, M.,
International conference on acoustic, speech, and signal processing (ICASSP), pp. 4482-4485, 2010, March. -
[32]“Speech Synthesis by Modeling Harmonics Structure with Multiple Function”,
Nakashika, T., Tachibana, R., Nishimura, M., Takiguchi, T., Ariki, Y,
INTERSPEECH 2010, pp.295-948, Sep., 2010. -
[33] "Named Entity Recognition from Conversational Telephone Speech Leveraging Word Confusion Networks for Training and Recognition,”
Gakuto Kurata, Nobuyasu Itoh, Masafumi Nishimura, Abhinav Sethy, Bhuvana Ramabhadran,
Proc. of ICASSP 2011, pp.5576-5579, May, 2011. -
[34] "Training of Error-Corrective Model for ASR without Using Audio Data,"
Gakuto Kurata, Nobuyasu Itoh, Masafumi Nishimura,
Proc. of ICASSP 2011, pp.5572-5575, May, 2011. -
[35] "Acoutic Model Training with Detecting Transcription Errors in the Training Data,"
Gakuto Kurata, Nobuyasu Itoh, Masafumi Nishimura,
INTERSPEECH 2011, Aug., 2011. -
[36] "Combining feature space discriminative training with long-term spectro-temporal features for noise-robust speech recognition,"
Takashi Fukuda, Osamu Ichikawa, Masafumi Nishimura,
INTERSPEECH 2011, Aug., 2011. -
[37] "Agglomerative Hierarchical Clustering of Emotions in Speech Based on Subjective Relative Similarity,"
Ryoichi Takashima, Tohru Nagano, Ryuki Tachibana, Masafumi Nishimura,
INTERSPEECH 2011, Aug., 2011. -
[38] "Breath-detection-based Telephony Speech Phrasing,"
Takashi Fukuda, Osamu Ichikawa, Masafumi Nishimura,
INTERSPEECH 2011, Aug., 2011. -
[39] "Continuous Digits Recognition Leveraging Invariant Structure,"
Masayuki Suzuki, Gakuto Kurata, Masafumi Nishimura, Nobuaki Minematsu,
INTERSPEECH 2011, Aug., 2011. -
[40] "Model-based noise reduction leveraging frequency-wise confidence metric for in-car speech recognition,"
Osamu Ichikawa, Steven Rennie, Takashi Fukuda, Masafumi Nishimura,
SP-P16, ICASSP 2012, March 2012. -
[41] "Disicriminative Reranking for LVCSR Leveraging Invariant Structure,"
Masayuki Suzuki, Gakuto Kurata, Masafumi Nishimura, Nobuaki Minematsu,
INTERSPEECH 2012, Sep., 2012. (To Appear)
IBM Research Report
-
[1] “A word-based Japanese dictation system,”
Masafumi Nishimura, Nobuyasu Itoh,
IBM Research Report, RT0219, 1997.9. -
[2] “A word-based Japanese Language Model,”
Nobuyasu Itoh, Masafumi Nishimura, Shiho Ogino, Kazutaka Yamasaki,
IBM Research Report, RT0288, 1998-12. -
[3] “Synthesizing Speech with Emphasis by Learning Prosody Change,”
Ryuki Tachibana, Masafumi Nishimura,
IBM Research Report, RT0608, 2005-4. -
[4] “Automatic Accent Labelling Using the Prosodic Structure of the Language,”
Ryuki Tachibana, Tohru Nagano, Gakuto Kurata, Masafumi Nishimura,
IBM Research Report, RT5273, 2006-11. -
[5] “New Speech Interface by Free Form Command,”
Gakuto Kurata, Osamu Ichikawa, Masafumi Nishimura,
IBM Research Report, RT5274, 2006-12. -
[6] “AFE: ASR Front-end for Speech Enhancement,”
Takashi Fukuda, Osamu Ichikawa, Masafumi Nishimura,
IBM Research Report, RT5281, 2007-7. -
[7] “Optimum F0 Adjustment for Concatenative TTS,”
Ryuki Tachibana, Masafumi Nishimura,
IBM Research Report, RT5287, 2008-1. -
[8] “Discriminative Reranking with Pseudo-ASR,”
Gakuto Kurata, Nobuyasu Itoh, Masafumi Nishimura,
IBM Research Report, RT5302, 2009-4. -
[9] “Pitch Conversion for Unit Selection TTS Using Combination of Direct and
Differential Features,”
Ryuki Tachibana, Zhiwei Shuang, Masafumi Nishimura,
IBM Research Report, RT0881, 2009-9. -
[10] “Corpus-based Text-to-Speech Front-end for Japanese,”
Tohru Nagano, Ryuki Tachibana, Masafumi Nishimura,
IBM Research Report, RT0916, 2010-8. -
[11] “POI Retrieval from Free Keywords,”
Nobuyasu Itoh, Osamu Ichikawa, Masafumi Nishimura,
IBM Research Report, RT0919, 2010-10. -
[12] “Emotion Detection in Call-Center Conversation,”
Gakuto KURATA, Nobuyasu Itoh, Masafumi Nishimura,
IBM Research Report, RT0928. 2011-2.
Patents
Patents Issued
- [1] "SPEECH ROCOGNITION SYSTEM," 1997-06-13, Pat. no. 2662120, Japan
- [2] "A METHOD FOR CONTROLING DICTATION-STYLE MODEL," 2006-03-17, Pat. no. 3782943, Japan
- [3] "A METHOD FOR PREDICTING DISFLUENCY WORDS BY N-GRAM MODEL," 2005-12-07, Pat. no. ZL00135969.X, China
- [4] "A METHOD FOR PREDICTING DISFLUENCY WORDS BY N-GRAM MODEL," 2003-05-09, Pat. no. 3426176, Japan
- [5] “Adaptation of Acoustic Prototype Vectors in a Speech Recognition System,” 1991-09-03, Pat. No. 5046099, United States
- [6] "A PITCH SYNCHRONOUS OVERLAP-ADD METHOD BASED ON GLOTTAL CLOSURE INSTANTS," 2000-07-28, Pat. no. 3093113, Japan
- [7] "HMM BASED SPECH RECOGNITION METHOD USING STATIC AND DYNAMIC FEATURES," 1994-12-26, Pat. no. 1892342, Japan
- [8] "METHOD OF SPEECH MODELLING AND A SPEECH RECOGNIZER," 1999-04-14, Pat. no. 69324428.3, Germany
- [9] "METHOD OF SPEECH MODELLING AND A SPEECH RECOGNIZER," 1999-04-14, Pat. no. 590925, France
- [10] "METHOD OF SPEECH MODELLING AND A SPEECH RECOGNIZER," 1999-04-14, Pat. no. 590925, United Kingdom
- [11] “METHOD, APPARATUS, COMPUTER SYSTEM AND STORAGE MEDIUM FOR SPEECH RECOGNITION,” 2005-07-12, Pat. No. 6917910, United States
- [12] "SPEAKER ADAPTATION FOR HMM BASED SPEECH RECOGNITION," 1992-08-11, Pat. no. 1689273, Japan
- [13] "SPEAKER ADAPTATION METHOD FOR VQ CODE BOOK," 1995-02-24, Pat. no. 1906392, Japan
- [14] “SPEECH RECOGNITION APPARATUS AND METHOD UTILIZING A LANGUAGE MODEL PREPARED FOR EXPRESSIONS UNIQUE SPONTANEOUS SPEECH,” 2006-01-10, Pat. no. 6985863, United States
- [15] “SPEECH RECOGNITION BY CONCATENATING FENONIC ALLOPHONE HIDDEN MARKOV MODELS IN PARALLEL AMONG SUBWORDS,” 1996-03-26, Pat. No. 5502791, United States
- [16] "SPEECH RECOGNITION METHOD," 1989-06-27, Pat.no. 1256562, Canada
- [17] "SPEECH RECOGNITION METHOD," 1991-09-18, Pat.no. 3773039808, Germany
- [18] "SPEECH RECOGNITION METHOD," 1991-09-18, Pat. no. 243009, France
- [19] "SPEECH RECOGNITION METHOD," 1991-09-18, Pat. no. 243009, United Kingdom
- [20] "SPEECH RECOGNITION METHOD," 1991-09-18, Pat. no. 243009, Italy
- [21] "SPEECH RECOGNITION METHOD," 1992-08-11, Pat. no. 1689246, Japan
- [22] "SPEECH RECOGNITION METHOD," 1996-04-09, Pat. no. 2044703, Japan
- [23] “SPEECH RECOGNITION METHOD,” 1989-05-09, Pat.no.4829577, United States
- [24] "SPEECH RECOGNITION SYSTEM USING MARKOV MODELS," 1992-11-25, Pat. no. 3876207208, Germany
- [25] "SPEECH RECOGNITION SYSTEM USING MARKOV MODELS," 1992-11-25, Pat. no. 312209, France
- [26] "SPEECH RECOGNITION SYSTEM USING MARKOV MODELS," 1992-11-25, Pat. no. 312209, United Kingdom
- [27] “SPEECH RECOGNITION SYSTEM USING MARKOV MODELS HAVING INDEPENDENT LABEL OUPUT SETS,” 1991-07-09, Pat.no.5031217, United States
- [28] "SPEECH ROCOGNITION," 1998-04-01, Pat. no. 69224953.2, Germany
- [29] "SPEECH ROCOGNITION," 1998-04-01, Pat. no. 535909, France
- [30] "SPEECH ROCOGNITION," 1998-04-01, Pat. no. 0535909, United Kingdom
- [31] “SPEECH ROCOGNITION SYSTEM HAVING AN INTEFACE TO A HOST COMPUTER BUS FOR DIRECT ACCESS TO THE HOST MEMORY,” 1994-10-04, Pat. No.5353377, United States
- [32] “SPEECH SYNTHESIS USING GLOTTAL CLOSURE INSTANTS DETERMINED FROM ADAPTIVELY-THRESHOLDED WAVELET TRANSFORMS,” 1997-09-23, Pat.no.5671330, United States
- [33] “SYSTEM INSERTION APPARATUS AND METHOD,” 2004-08-17, Pat.no.6778958, United States
- [34] "VOICE RECOGNITION APPARATUS," 1995-07-25, Pat. no. 1336458, Canada
- [35] "WORD-BASED JAPANESE DICTATION SYSTEM," 2000-10-20, Pat. no. 3121530, Japan
- [36] “SPEECH RECOGNITION METHOD,” 1991-09-17, Pat.no. 5050215, United States
- [37] "SPEECH RECOGNITION METHOD USING A TRAINABLE HMM-NETWORK," 1996-04-25, Pat. no. 2048523, Japan
- [38] "SPEECH RECOGNITION SYSTEM," 1994-07-20, Pat. no. 69010722.6, Germany
- [39] "SPEECH RECOGNITION SYSTEM," 1994-07-20, Pat. no. 388067, France
- [40] "SPEECH RECOGNITION SYSTEM," 1994-07-20, Pat. no. 388067, United Kingdom
- [41] "SYSTEM, PROGRAM, AND CONTROL METHOD FOR SPEECH SYNTHESIS," 2009-01-23, Pat. no. 4247564, Japan
- [42] "REVERBERANT SPEECH RECOGNITION BASED ON MODEL COMPENSATION APPROACH," 2006-08-04, Pat. no. 3836815, Japan
- [43] "APPARATUS, METHOD, AND PROGRAM FOR SUPPORTING SPEECH INTERFACE DESIGN," 2008-07-18, Pat. no. 4156639, Japan
- [44] "A METHOD TO DESIGN THE SHAPE OF OUTER-EAR SUITABLE FOR SOUND SOURCE LOCALIZATION," 2007-08-17, Pat. no. 3999689, Japan
- [45] "CONTROLS FOR AUTOMATIC-PUNCTUATING FUNCTION," 2001-09-14, Pat. no. 3232289, Japan
- [46] ”SPEECH RECOGNITION SYSTEM AND PROGRAM THEREOF,” 2008-07-22, Pat.no.7403896, United States
- [47] "SPEECH RECOGNITION BY FRAME-WISE SELECTION OF THE MODEL UNDER THE RAPID CHANGE OF NOISE," 2007-12-28, Pat. no. 4061094, Japan
- [48] "SPEECH RECORDING METHOD FOR COURT REPORT," 2008-02-22, Pat. no. 4082611, Japan
- [49] “SYSTEMS AND METHODS FOR NATURAL SPOKEN LANGUAGE WORD PREDICTION AND SPEECH RECOGNITION,” 2008-04-15, Pat.no. 7359852, United States
- [50] "STRUCTURAL LANGUAGE MODELING BASED ON DEPENDENCY," 2008-04-04, Pat. no. 4105841, Japan
- [51] "LOW-COST METHOD FOR DETERMINING FILTER COEFFICIENT IN DEREVERBERATION," 2008-04-11, Pat. no. 4107613, Japan
- [52] "SYSTEM FOR SUPPORTING TEXT-TO-SPEECH," 2008-05-30, Pat. no. 4129989, Japan
- [53] "MICROPHONE-ARRAY BASED NOISE SUPPRESSION METHOD," 2008-10-03, Pat. no. 4195267, Japan
- [54] "CONTEXT TREE FOR TREE-STRUCTURED HISTORY," 2008-11-14, Pat. no. 4215418, Japan
- [55] “SPEECH RECOGNITION APPARATUS, SPEECH RECOGNITION APPARATUS AND PROGRAM THEREOF,” 2009-1-13, Pat.no. 7478041, United States.
- [56] “WORD PREDICTING METHOD, VOICE RECOGNITION METHOD, AND VOICE RECOGNITION APPARATUS AND PROGRAM USING THE SAME METHODS,” 2009-01-20, Pat. No. 7480612, United States
- [57] “SIGNAL ENHANCEMENT VIA NOISE REDUCTION FOR SPEECH RECOGNITION,” 2009-05-12, Pat. No. 7533015, United States
- [58] “SIGNAL ENHANCEMENT VIA NOISE REDUCTION FOR SPEECH RECOGNITION,” 2011-02-22, Pat. No. 7895038, United States
- [59] "SPEECH RECOGNITION SYSTEM AND METHOD," 2011-08-26, Pat. No. 4808764, Japan
- [60] "RECORDING SYSTEM WITH IMPROVED SUPPRESSION OF INTERFERING TALKER," 2012-01-20, Pat. No. 4906908, Japan
- [61] "METHOD AND SYSTEM FOR POSITION DETECTION OF A SOUND SOURCE," 2012-04-24, Pat. No. 8165317, United States
- [62] "SYSTEM, METHOD, AND PROGRAM PRODUCT FOR PROCESSING SPEECH RATIO DIFFERENCE DATA VARIATIONS IN A CONVERSATION BETWEEN TWO PERSONS," 2012-04-24, Pat. No. 8165874, United States
IBM Tchinical Disclosure Bulletin
- [1] “Isolated word recognition method,” Masafumi Nishimura and Masaaki Okochi, IBM technical disclosure bulletin, Vol.29, No.4, Sep.1986.
- [2] “Speech recognition method using multiple fenemic baseforms of HMM,” Masafumi Nishimura and Koichi Toshioka, IBM technical disclosure bulletin, Vol.30, No.6, Nov.1987.
- [3] “Speech recognition method,” Masafumi Nishimura, IBM technical disclosure bulletin, Vol.34, No.5, Oct.1991.
- [4] “Speech recognition method using multi-labeling,” Masafumi Nishimura and Koichi Toshioka, IBM technical disclosure bulletin, Vol.29, No.10, Mar.1987.
- [5] “Speech recognition method,” Masafumi Nishimura, IBM technical disclosure bulletin, Vol.33, No.2, Jul.1990.
- [6] “Using information entropy to select leaning words,” Masafumi Nishimura, IBM technical disclosure bulletin, Vol.34, No.10B, Mar.1992.
- [7] “Improved endpoint detector for Japanese speech recognition,” Masafumi Nishimura, IBM technical disclosure bulletin, Vol.34, No.9, Feb.1992.
- [8] “Method of endpoint detection,” Yasuhide Hashimoto and Masafumi Nishimura, IBM technical disclosure bulletin, Vol.34, No.9, Feb.1992.
- [9] “Method for compressing a fast match table,” Masafumi Nishimura, IBM technical disclosure bulletin, Vol.34, No.1, Jun.1991.
- [10] “Speech recognition method using templates spoken by multiple speakers,” Yasuhide Hashimoto, Masafumi Nishimura and Masaharu Sakamoto, IBM technical disclosure bulletin, Vol.37, No.12, Dec.1994.
- [11] “Real-time word recognition method,” Masafumi Nishimura and Masaharu Sakamoto, IBM technical disclosure bulletin, Vol.38, No.8, Aug.1995.
- [12] “Method for segmenting texts into words,” Nobuyasu Itoh and Masafumi Nishimura, IBM technical disclosure bulletin, Vol.39, No.11, Nov.1996.
Other Publications
Chapters in Books
-
[1] "Wavelet Analysis of Speech Signals",
M.Kobayashi, M.Sakamoto, T.Saitoh, M.Nishimura
Approximation Theory VIII, Vol.2:Wavelets and Multilevel Approximation, pp209-215, Academic Press, NY, 1995 -
[2] "Wavelet Analysis for a Text-to-speech System,"
M.Kobayashi, M.Sakamoto, T.Saitoh, M.Nishimua,
Wavelets and their applications, pp.75-100, SIAM, Philadelphia, PA, 1998. -
[3] "IBM's Japanese Dictation System,"
Masafumi Nishimura
Spoken Language Systems, Chapter2, pp.47-58, Ohmusha/IOS Press, 2005. ISBN 4-274-90637-X.
Editorials
-
[1] "Trends on Japanese dictation systems,"
M.Nishimura,
The journal of the acoustical society of Japan, Vol.54, No.3,Mar,1998. -
[2] "Statistical spoken language processing,"
M.Nishimura, S.Mori,
The Journal of IEICE, Vol.5, 1999. -
[3] "Speech enabled word-processor, past, present, and tomorrow,"
M.Nishimura, N.Itoh,
Journal of IPSJ, Vol. 40, No. 2, 1999. -
[4] "Activity of IPSJ Trial Standard for Spoken Language Interface,"
T. Nitta,H.Matsuura,T.Nishimoto,M.Nishimura,
IPSJ(Information Society of Japan) Magazine, Vol.47,No. 7,pp.762-767, 2006.7. -
[5] "Current advances and possibility of innovation in speech interface technology,"
M.Nishimura, G.Kurata,
IPSJ(Information Society of Japan) Magazine, Vol.51, No.11, pp.1434-1439, 2010.11.
Technical Magazines
-
[1] "Japanese dictation sofware-IBM ViaVoice98,"
M.Nishimura, N.Itoh,
Electronics (Ohm-sha), June 1999. -
[2] "Trends on Japanese speech recognition",
M.Nishimura,
Bit (Kyoritsu-shuppan), Vol. 30,No.7, June 1998. -
[3] "Japanese speech recognition,"
M.Nishimura, N.Itoh,
Interface (CQ-shuppan), 1998-8, June 1998. -
[4] "Text-to-speech synthesis system with an easy and effective interface for tuning”,
Tachibana, R., Nishimura, M.,
IBM ProVision, No. 66, pp.67-73, 2010. -
[5]“Speech Phrasing Based on Breath Detection for Telephone Conversations in Call Center,”
Fukuda, T., Nishimura, M.,
IBM ProVision, No. 68, pp.80-87, 2011.
Other Activities
Teaching (Courses Taught)
-
[1] Osaka University,
Introduction to speech technology (Special lecture) , 1998.5 -
[2] Tokyo Metropolitan University,
Introduction to speech technology (Special lecture), 1998.6 -
[3] Ryukoku University,
Introduction to speech technology (Special lecture), 1999.11 -
[4] Shizuoka University,
Introduction to speech technology (Special lecture), 1999.11 -
[5] Tottori University,
Pattern recognition and speech technology (Part-time lecturer), 2000.4-9 -
[6] Aizu University,
Introduction to speech technology (Special lecture), 2000.3 -
[7] Nagoya University,
Introduction to speech technology (Par-time lecturer), 2000 -
[8] The University of Tokyo,
Current advances and possibility of innovation in speech technology (Special lecture), 2001.6,2002.6 -
[9] Osaka University,
Introduction to speech technologies, (Part-time lecturer), 2000, 2002, 2003, 2004
Editorial Board Member
- [1] Guest Editor of IEICE Transaction, I&S, Special Issue, 2007- 2008.
- [2] Member of Scientific Committee, IEEE workshop on Spoken Language Technology (SLT2008), 2008
- [3] Guest Editor of IEICE Transaction, I&S, Special Issue, 2009-May, 2010.
- [4] Associate Editor of IEICE Transaction, Information and Systems, May, 2004-May, 2010.
- [5] Editor of IEICE Transaction, Information and Systems, May, 2010- May, 2012.
- [6] Advisor to Editorial Board of IEICE Transaction, May, 2012 -
Invited Talks
-
[1] “Study about Japanese broadcast news transcription,”
IPSJ SIG-Spoken Language Processing, SLP99-25-6, pp.31-32, 1999.2. -
[2] "Japanese dictation system - past, present and future,"
IPSJ SIG-Spoken Language Processing, SLP99-29-2, 1999.12. -
[3] “Current and future trends in speech recognition market,”
IPSJ SIG-Spoken Language Processing, SLP55, No.12, 2005.2. -
[4] “Speech Technology – past, current and future,"
Toyohashi University of Technology, April, 2010. -
[5] “Voice interface for car navigation system – dream and reality,"
The Japanese Society for Artificial Intelligence (JSAI), 60th SIG-SLUD, October, 2010. -
[6] “Introduction of IBM Speech Technology Team in Tokyo,”
IPSJ SIG-Spoken Language Processing, SLP2011-085, 2011.2.
