Korean Title |
순차패턴에 기반한 XML 문서 클러스터링 |
English Title |
XML Document Clustering Based on Sequential Pattern |
Author |
황정희 (Jeong Hee Hwang)
류근호 (Keun Ho Ryu)
|
Citation |
VOL 10-D NO. 07 PP. 1093 ~ 1102 (2003. 12) |
Korean Abstract (English translation) |
As Internet use increases, the amount of information is growing exponentially, and because XML, the standard for web data, is flexible in how it represents data, systems that use web-based electronic documents, such as EDMS (Electronic Document Management System) and ebXML (e-business eXtensible Markup Language), are adopting XML as their document exchange method and standard document format. Research is therefore needed on efficient management and retrieval of the increasingly widespread XML documents. To classify structural similarity among multiple documents, this paper targets XML documents in which the order of elements is meaningful, uses sequential patterns to extract representative structures that reflect the characteristics of each document, and presents a method for clustering structurally similar documents based on the extracted structures. The proposed algorithm uses a cost computation that considers both cluster cohesion and inter-cluster similarity, and can thereby improve clustering accuracy. |
English Abstract |
As Internet use grows, the amount of information is increasing rapidly, and XML, a standard for web data, offers flexible data representation. Web-based electronic document systems such as EDMS (Electronic Document Management System) and ebXML (e-business eXtensible Markup Language) have therefore been adopting XML as their document exchange method and standard document format. Research is thus needed on methods that can manage and search structured XML documents effectively. In this paper we propose a clustering method based on structural similarity among many XML documents, using typical structures extracted from each document by sequential pattern mining in a pre-clustering process. The proposed algorithm improves clustering accuracy by computing a cost that considers both cluster cohesion and inter-cluster similarity. |
Keyword |
Document Clustering
XML Document
Sequential Pattern
Structural Similarity
|