IBMSkip to main content
  Home     Products & services     Support & downloads     My account  
  Select a country 
Journals Home 
 Systems Journal 
 ·  Current Issue 
 ·  Recent Issues 
 ·  Papers in Progress 
 ·  Search/Index 
 ·  Orders 
 ·  Description 
 ·  Author's Guide 
Journal of Research
and Development
 Staff 
 Contact Us 
  Related links:  
    IBM UIMA Project  
    IBM TAKMI  
    IBM TRL Text Mining  
    IBM BioTeKS Project  
    IBM alphaWorks:
  Semantics
 
IBM Systems Journal 
Volume 43, Number 3, 2004
Unstructured Information Management
 Table of contents: arrowHTML arrowPDF   This article: arrowHTML arrowPDF arrowCopyright info
  

A text-mining system for knowledge discovery from biomedical documents - References

by N. Uramoto, H. Matsuzawa, T. Nagano, A. Murakami, H. Takeuchi, and K. Takeda

Cited references

  1. S. Arlington, S. Barnett, S. Hughes, and J. Palo, Pharma 2010: The Threshold of Innovation, IBM Corporation, http://www-1.ibm.com/services/us/index.wss/xs/imc/a1001099.
  2. M. Hearst, “Untangling Text Data Mining,” Proceedings of the 37th Annual Meeting of the Association for Computer Linguistics (ACL'99), College Park, MD, ACL, East Stroudsburg, PA (1999), pp. 3–10.
  3. Gene Ontology Consortium, http://www.geneontology.org.
  4. D. Swanson, “Medical Literature as a Potential Source of New Knowledge,” Bulletin of the Medical Library Association 78, No. 1, 29–37 (1990).
  5. V. Brusic and J. Zeleznikow, “Knowledge Discovery and Data Mining in Biological Databases,” Knowledge Engineering Review 14, No. 3, 257–277 (1999).
  6. D. Swanson and N. Smalheiser, “An Interactive System for Finding Complementary Literatures: A Stimulus to Scientific Discovery,” Artificial Intelligence 91, No. 2, 183–203 (1997).
  7. R. Agrawal and R. Srikant, “Fast Algorithms for Mining Association Rules in Large Databases,” Proceedings of the 20th International Conference on Very Large Databases (VLDB), Santiago, Chile, ACM, New York (1994), pp. 487–499.
  8. MEDLINE, http://www.nlm.nih.gov/databases/databases_medline.html.
  9. P. Kankar, S. Adak, A. Sarkar, K. Murari, and G. Sharma, “MedMesh Summarizer: Text Mining for Gene Clusters,” Proceedings of the 2nd SIAM International Conference on Data Mining, Arlington, VA (2002), pp. 548–565.
  10. Medical Subject Headings (MeSH), http://www.nlm.nih.gov/mesh/meshhome.html.
  11. L. Tanabe, U. Scherf, L. Smith, J. Lee, L. Hunter, and J. Weinstein, “MedMiner: An Internet Tool for Filtering and Organizing Gene Expression and Pharmacological Information,” BioTechniques 27,1210–1217 (1999).
  12. PubMed, http://www.ncbi.nlm.nih.gov/entrez.
  13. T. Nasukawa and T. Nagano, “Text Analysis and Knowledge Mining System,” IBM Systems Journal 40, No. 4, 967–984 (2001).
  14. K. Fukuda, T. Tsunoda, A. Tamura, and T. Takagi, “Toward Information Extraction: Identifying Protein Names from Biological Papers,” Proceedings of the Pacific Symposium on Biocomputing '98 (PSB '98), Kapalua, HI, (1998), pp. 707–718.
  15. M. Krauthammer, A. Rzhetsky, P. Morozov, and C. Friedman, “Using BLAST for Identifying Gene and Protein Names in Journal Articles,” Gene 259, No. 1–2, 245–252 (2000).
  16. N. Collier, C. Nobota, and J. Tsujii, “Extracting the Names of Genes and Gene Products with a Hidden Markov Model,” Proceedings of the 18th International Conference on Computer Linguistics (COLING 2000), Saarbrücken, Germany, ACL, East Stroudsburg, PA (2000), pp. 201–207.
  17. V. Hatzivassiloglou, P. Duboue, and A. Rzhetsky, “Disambiguating Proteins, Genes, and RNA in Text: A Machine Learning Approach,” Bioinformatics 17, Suppl. 1, S97–106 (2001).
  18. K. Yamamoto, T. Kudo, A. Konagaya, and Y. Matsumoto, “Protein Name Tagging for Biomedical Annotation in Text,” Proceedings of the ACL 2003 Workshop on Natural Language Processing in Biomedicine (BioNLP), Sapporo, Japan, ACL, East Stroudsburg, PA (2003), pp. 65–72.
  19. Y. Tsuruoka and J. Tsujii, “Boosting Precision and Recall of Dictionary-Based Protein Name Recognition,” Proceedings of the ACL 2003 Workshop on Natural Language Processing in Biomedicine (BioNLP), Sapporo, Japan, ACL, East Stroudsburg, PA (2003), pp. 41–48.
  20. C. Blaschke, M. Andrade, C. Ouzounis, and A. Valencia, “Automatic Extraction of Biological Information from Scientific Text: Protein-Protein Interactions,” Proceedings of the AAAI Conference on Intelligent Systems in Microbiology (ISMB '99), Heidelberg, Germany, AAAI, Menlo Park, CA (1999), pp. 60–77.
  21. T. Sekimizu, H. Park, and J. Tsujii, “Identifying the Interaction between Genes and Gene Products Based on Frequently Seen Verbs in MEDLINE Abstracts,” Genome Informatics, 62–71 (1998).
  22. T. Rindflesch J. Rajah, and L. Hunter, “Extracting Molecular Binding Relationships from Biomedical Text,” Proceedings of the 6th Applied Natural Language Processing Conference (ANLP-NAACL 2000), Seattle, WA, ACL, East Stroudsburg, PA (2000), pp. 188–195.
  23. A. Yakushiji, Y. Tateisi, Y. Miyao, and J. Tsujii, “Event Extraction from Biomedical Papers Using a Full Parser,” Proceedings of the Pacific Symposium on Biocomputing '01 (PSB'01), Kohala Coast, HI (2001), pp. 408–419.
  24. C. Friedman, P. Kra, H. Yu, M. Krauthammer, and A. Rzhetsky, “GENIES: A Natural-Language Processing System for the Extraction of Molecular Pathways from Journal Articles,” Bioinformatics 17, Suppl. 1, S74–82 (2001).
  25. J. Park, H. Kim, and J. Kim, “Bidirectional Incremental Parsing for Automatic Pathway Identification with Combinatory Categorical Grammar,”Proceedings of the Pacific Symposium on Biocomputing '01 (PSB'01), Kohala Coast, HI (2001), pp. 396–407.
  26. J. Hosaka, J. Koh, and A. Konagaya, “Effect of Utilizing Terminology on Extraction of Protein-Protein Interaction Information from Biomedical Literature,” Proceedings of the 10th Conference of the European Chapter of the Association for Computer Linguistics (EACL'03), Budapest, Hungary, ACL, East Stroudsburg, PA (2003), pp. 107–110.
  27. S. Novichkova, S. Egorov, and N. Daraselia, “MedScan, a Natural Language Processing Engine for MEDLINE Abstracts,” Bioinformatics 19, No. 13, 1699–1706 (2003).
  28. J. Thomas, D. Milward, C. Ouzounis, and S. Pulman, “Automatic Extraction of Protein Interactions from Scientific Abstracts,” Proceedings of the Pacific Symposium on Biocomputing '00 (PSB'00), Honolulu, HI (2000), pp. 538–549.
  29. K. Humphreys, G. Demetriou, and R. Gaizauskas, “Automatically Extracting Enzyme Interaction and Protein Structure Information from Biological Science Journal Articles,” Proceedings of the Symposium on Artificial Intelligence in Bioinformatics of the 2000 Convention of the Society for the Study of Artificial Intelligence and Simulation of Behavior (AISB'00), Birmingham, UK, AISB, Brighton, UK (2000).
  30. K. Humphreys, G. Demetriou, and R. Gaizauskas, “Two Applications of Information Extraction to Biological Science Journal Articles: Enzyme Interactions and Protein Structures,” Proceedings of the Pacific Symposium on Biocomputing '00 (PSB'00), Honolulu, HI (2000), pp. 502–513.
  31. G. Leroy and H. Chen, “Filling Preposition-Based Templates to Capture Information from Medical Abstracts,” Proceedings of the Pacific Symposium on Biocomputing '02 (PSB'02), Lihue, HI (2000), pp. 350–361.
  32. M. Craven and J. Kumlien, “Constructing Biological Knowledge Bases by Extracting Information from Text Sources,” Proceedings of the AAAI Conference on Intelligent Systems in Microbiology (ISMB '99), Heidelberg, Germany, AAAI, Menlo Park, CA (1999), pp. 77–86.
  33. E. Marcotte, I. Xenarios, and D. Eisenberg, “Mining Literature for Protein-Protein Interactions”Bioinformatics 17, No. 4, 359–363 (2001).
  34. See note 17.
  35. B. Stapley and G. Benoit, “Biobibliometrics: Information Retrieval and Visualization from Co-occurrences of Gene Names in MEDLINE Abstracts,” Proceedings of the Pacific Symposium on Biocomputing '00 (PSB'00), Honolulu, HI (2000), pp. 529–540.
  36. National Center for Biotechnology Information, http://www.ncbi.nlm.nih.gov/.
  37. United States National Library of Medicine, http://www.nlm.nih.gov/.
  38. MeSH Browser, http://www.nlm.nih.gov/mesh/MBrowser.html.
  39. E. Charniak, Statistical Language Learning, MIT Press, Cambridge, MA (1993).
  40. Unified Medical Language System, http://www.nlm.nih.gov/research/umls/umlsmain.html.
  41. C. Pollard and I. Sag, Head-Driven Phrase Structure Grammar, University of Chicago Press and CSLI Publications, Chicago, IL (1994).
  42. R. Berwick, Robert, “Computational Complexity and Lexical Functional Grammar,” Proceedings of the 19th Annual Meeting of the Association for Computational Linguistics, Stanford, CA, ACL, East Stroudsburg, PA (1981), pp. 7–12.
  43. H. Akaike, “A New Look at the Statistical Model Identification,” IEEE Transactions on Automatic Control 19,716–723 (1974).
  44. LocusLink, http://www.ncbi.nih.gov/LocusLink/.
  45. H. Tran, C. Kim, S. Faham, M. Siddall, and J. Bowie, “Native Interface of the SAM Domain Polymer of TEL,” BMC Structural Biology 2, No. 1, 5–11 (2002).
  46. H. Sattler, R. Lensch, V. Rohde, E. Zimmer, E. Meese, H. Bonkhoff, M. Retz, T. Zwergel, A. Bex, M. Stoeckle, and B. Wullich, “Novel Amplification Unit at Chromosome 3q25-q27 in Human Prostate Cancer,” Prostate 45, No. 3, 207–15 (2000).
  47. I. Foster, C. Kesselman, and S. Tuecke, “The Anatomy of the Grid: Enabling Scalable Virtual Organizations,” International Journal of Supercomputer Applications 15, No. 3, 200–222 (2001), http://www.globus.org/research/papers/anatomy.pdf.
  48. The Grid 2: Blueprint for a New Computing Infrastructure (2nd Edition), I. Foster and C. Kesselman, Editors, Morgan Kaufmann Publishing, Inc., San Francisco, CA (2003).
  49. D. Ferrucci and A. Lally, “Building an Example Application with the Unstructured Information Management Architecture,” IBM Systems Journal43, No. 3, 455-475 (2004, this issue).
  50. R. Mack, S. Mukherjea, A. Soffer, N. Uramoto, E. Brown, A. Coden, J. Cooper, A. Inokuchi, B. Iyer, K. Kummamuru, Y. Maas, H. Matsuzawa, and L. Subramaniam, “Text Analytics for Life Science using the Unstructured Information Management Architecture,” IBM Systems Journal43, No. 3, 490-515 (2004, this issue).