|
Sign In to gain access to subscriptions and/or personal tools.
|
Patent surrogate extraction and evaluation in the context of patent mapping
Yuen-Hsien Tseng
National Taiwan Normal University, Taipei, Taiwan, samtseng{at}ntnu.edu.tw
Yeong-Ming Wang
LungHwa University of Science and Technology, Guishan, Taoyuan County, Taiwan
Yu-I Lin
Taipei Municipal University of Education, Taipei, Taiwan
Chi-Jen Lin
WebGenie Information LTD., Taipei, Taiwan
Dai-Wei Juang
WebGenie Information LTD., Taipei, Taiwan
Patent documents contain important research results. They are often collectively analyzed and organized in a visual way to support decision making. However, they are lengthy and rich in technical terminology, and thus require a lot of human effort for analysis. Automatic tools for assisting patent engineers or decision makers in patent analysis are in great demand. This paper describes a summarization method for patent surrogate extraction, intended to efficiently and effectively support patent mapping, which is an important subtask of patent analysis. Six patent maps were used to evaluate its relative usefulness. The experimental results confirm that the machine generated summaries do preserve more important content words than some other patent sections or even than the full patent texts when only a few terms are to be considered for classification and mapping. The implication is that if one were to determine a patent's category based on only a few terms at a quick pace, one could begin by reading the section summaries generated automatically.
Key Words: Text mining summarization feature extraction patent classification patent clustering
References
- A.F. Breitzman and M.E. Mogee, The many applications of patent analysis , Journal of Information Science, 28(3) (2002) 187—206 .[Abstract]
- S. Jung, Importance of Using Patent Information (2003). Available at http://www.wipo.int/sme/en/activities/meetings/china_most_03/wipo_ip_bis_ge_03_13.1.pdf (accessed 29 May 2006).
- R.S. Campbell, Patent trends as a technological forecasting tool , World Patent Information 5(3) (1983) 137—43 .[CrossRef]
- S.-J. Liu, Patent map — a route to a strategic intelligence of industrial competitiveness . In: Proceedings of the first Asia-Pacific Conference on Patent Maps (Taipei, 2003) 2—13.
- Y.-M. Bay, Development and applications of patent map in korean high-tech industry . In: Proceedings of the first Asia-Pacific Conference on Patent Maps (Taipei, 2003) 13—23.
- B. Chen, Introduction to patent map (Unpublished lecture notes for the training of patent mapping and patent analysis, National Science Council, Taipei , 1999, in Chinese).
- F.-D. Mai, F. Hwang, K.-M. Chien, Y.-M. Wang and C.-Y. Chen, Patent Map and Patent Analysis for Carbon Nanotube ( Science and Technology Information Center, National Science Council, Taipei , 2002).
- United States Patent and Trademark Office, http://www.uspto.gov/ (accessed 29 May 2006).
- M.A. Hearst, Untangling text data-mining . In: Proceedings the 37th Annual Meeting of the Association for Computational Linguistics (1999) 3—10.
- P. Losiewicz, D.W. Oard and R.N. Kostoff, Textual data mining to support science and technology management, Journal of Intelligent Information Systems 15(2) (2000) 99—119 .
- U. Fayyad, G. Piatetsky-Shapiro, P. Smyth and R. Uthurasamy, Advances in Knowledge Discovery and Data Mining ( AAAI Press / The MIT Press , 1996).
- C.J. Fall, A. Torcsvari, K. Benzineb and G. Karetka, Automated categorization in the international patent classification , ACM SIGIR Forum 37(1) (2003) 10—25 .[CrossRef]
- I. Mani, Automatic Summarization ( John Benjamins , 2001).
- H.P. Edmundson, New methods in automatic extracting , Journal of the ACM 16(2) (1969) 264—85 .[CrossRef][Web of Science]
- Document Understanding Conferences, http://www.-nlpir.nist.gov/projects/duc/ (accessed 29 May 2006)
- T. Hirao, M. Okumura, T. Fukushima and H. Nanba, Text summarization challenge 3 . In: Proceedings of the Fourth NTCIR Workshop on Evaluation of Information Retrieval, Automatic Text Summarization and Question Answering (Tokyo, 2004).
- W.T. Chuang and J. Yang, Extracting sentence segments for text summarization: a machine learning approach . In: Proceedings of the 23rd International ACM SIGIR Conference on Research and Development in Information Retrieval (Athens, 2000) 152—9.
- A. Shinmori, M. Okumura, Y. Marukawa and M. Iwayama, Patent claim processing for readability: structure analysis and term explanation . In: Proceedings of the ACL Workshop on Patent Corpus Processing, (Sapporo, 2003) 56—65.
- S. Sheremetyeva, Natural language analysis of patent claims . In: Proceedings of the ACL Workshop on Patent Corpus Processing (Sapporo, 2003) 66—73.
- Y.-H. Tseng, Automatic thesaurus generation for Chinese documents , Journal of the American Society for Information Science and Technology 53(13) (2002) 1130—8 .[CrossRef][Web of Science]
- Y.-H. Tseng, Content-based retrieval for music collections . In: Proceedings of the 22nd International ACM SIGIR Conference on Research and Development in Information Retrieval (Berkeley, 1999) 176—82.
- Y. Yang and J. Pedersen, A comparative study on feature selection in text categorization . In: Proceedings of the International Conference on Machine Learning (1997) 412—20.
- H.T. Ng, W.B. Goh and K.L. Low, Feature selection, perception learning, and a usability case study for text categorization . In: Proceedings of the 20th International Conference on Research and Development in Information Retrieval ( 1997) 67—73.
- K.-A. Cheng, R.-H. Maa, T.-C. Lin, Y.-F. Huang and H.-I. Liu, Patent Maps and Analyses: Quantum-Dot Optoelectronic Applications ( Science and Technology Information Center, National Science Council, Taipei , 2003).
- K.-H. Kuo, C.-H. Kuo, J.-H. Yin, T.-C. Lin and B. Chiang, Patent Maps and Analyses: Biomolecular Motors ( Science and Technology Information Center, National Science Council, Taipei , 2003).
- K.-M. Chien, D.-S. Wu, C.-C. Hung, Y.-M. Wang and Y.-P. Lan, Patent Maps and Analyses: Titanium Dioxide ( Science and Technology Information Center, National Science Council, Taipei , 2003).
- R. Bekkerman, R. El-Yaniv, Y. Winter and N. Tishby, On feature distributional clustering for text categorization . In: Proceedings of the 24th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval (New Orleans, 2001) 146—53.
- J.B. Kruskal, Multidimensional scaling and other methods for discovering structure. In: K. Enslein, A. Ralston and H.S. Wilf (eds), Statistical Methods for Digital Computers ( Wiley, New York , 1977 ).
- Y.-H. Tseng, C.-J. Lin, H.-H. Chen and Y.-I. Lin, Toward generic title generation for clustered documents . In: Proceedings of the Asia Information Retrieval Symposium (Singapore, 2006) 145—57.
- Y.-H. Tseng, C.-J. Lin and Y.-I. Lin, Text mining techniques for patent analysis Information Processing and Management (2007) in press.
This version was published on December
1, 2007
Journal of Information Science, Vol. 33, No. 6,
718-736 (2007)
DOI: 10.1177/0165551507077406

CiteULike Complore Connotea Del.icio.us Digg Reddit Technorati Twitter What's this?
|
|