WEKO3
アイテム
{"_buckets": {"deposit": "2941aac8-2d2b-47a2-a27d-00cccd2f5a75"}, "_deposit": {"id": "48957", "owners": [], "pid": {"revision_id": 0, "type": "depid", "value": "48957"}, "status": "published"}, "_oai": {"id": "oai:repository.dl.itc.u-tokyo.ac.jp:00048957", "sets": ["15", "96"]}, "item_2_biblio_info_7": {"attribute_name": "書誌情報", "attribute_value_mlt": [{"bibliographicIssueDates": {"bibliographicIssueDate": "2001-09-01", "bibliographicIssueDateType": "Issued"}, "bibliographicIssueNumber": "9", "bibliographicPageEnd": "2318", "bibliographicPageStart": "2311", "bibliographicVolumeNumber": "E84-A", "bibliographic_titles": [{"bibliographic_title": "IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences"}]}]}, "item_2_description_5": {"attribute_name": "抄録", "attribute_value_mlt": [{"subitem_description": "When we search from a huge amount of documents, we often specify several keywords and use conjunctive queries to narrow the result of the search. Though the searched documents contain all keywords, positions of the keywords are usually not considered. As a result, the search result contains some meaningless documents. It is therefore effective to rank documents according to proximity of keywords in the documents. This ranking is regarded as a kind of text data mining. In this paper, we propose two algorithms for finding documents in which all given keywords appear in neighboring places. One is based on plane-sweep algorithm and the other is based on divide-and-conquer approach. Both algorithms run in O(n log n) time where n is the number of occurrences of given keywords. We run the algorithms on a large collection of html files and verify its effectiveness.", "subitem_description_type": "Abstract"}]}, "item_2_description_6": {"attribute_name": "内容記述", "attribute_value_mlt": [{"subitem_description": "PAPER", "subitem_description_type": "Other"}]}, "item_2_publisher_20": {"attribute_name": "出版者", "attribute_value_mlt": [{"subitem_publisher": "Institute of Electronics, Information and Communication Engineers"}]}, "item_2_relation_25": {"attribute_name": "関係URI", "attribute_value_mlt": [{"subitem_relation_type_id": {"subitem_relation_type_id_text": "https://search.ieice.org/", "subitem_relation_type_select": "URI"}}]}, "item_2_rights_12": {"attribute_name": "権利", "attribute_value_mlt": [{"subitem_rights": "copyright©2001 IEICE"}]}, "item_2_select_14": {"attribute_name": "著者版フラグ", "attribute_value_mlt": [{"subitem_select_item": "publisher"}]}, "item_2_text_4": {"attribute_name": "著者所属", "attribute_value_mlt": [{"subitem_text_value": "Graduate School of Information Sciences, Tohoku University"}, {"subitem_text_value": "Department of Information Science, University of Tokyo"}]}, "item_creator": {"attribute_name": "著者", "attribute_type": "creator", "attribute_value_mlt": [{"creatorNames": [{"creatorName": "SADAKANE, Kunihiko"}], "nameIdentifiers": [{"nameIdentifier": "145477", "nameIdentifierScheme": "WEKO"}]}, {"creatorNames": [{"creatorName": "IMAI, Hiroshi"}], "nameIdentifiers": [{"nameIdentifier": "145478", "nameIdentifierScheme": "WEKO"}]}]}, "item_files": {"attribute_name": "ファイル情報", "attribute_type": "file", "attribute_value_mlt": [{"accessrole": "open_date", "date": [{"dateType": "Available", "dateValue": "2017-12-15"}], "displaytype": "detail", "download_preview_message": "", "file_order": 0, "filename": "Fast Algorithms for k-Word Proximity Search 2001.pdf", "filesize": [{"value": "268.8 kB"}], "format": "application/pdf", "future_date_message": "", "is_thumbnail": false, "licensetype": "license_free", "mimetype": "application/pdf", "size": 268800.0, "url": {"label": "Fast Algorithms for k-Word Proximity Search 2001.pdf", "url": "https://repository.dl.itc.u-tokyo.ac.jp/record/48957/files/Fast Algorithms for k-Word Proximity Search 2001.pdf"}, "version_id": "64fa44ed-caa9-49a9-ab7c-6a9dd85a0f5e"}]}, "item_language": {"attribute_name": "言語", "attribute_value_mlt": [{"subitem_language": "eng"}]}, "item_resource_type": {"attribute_name": "資源タイプ", "attribute_value_mlt": [{"resourcetype": "journal article", "resourceuri": "http://purl.org/coar/resource_type/c_6501"}]}, "item_title": "Fast Algorithms for k-Word Proximity Search", "item_titles": {"attribute_name": "タイトル", "attribute_value_mlt": [{"subitem_title": "Fast Algorithms for k-Word Proximity Search"}]}, "item_type_id": "2", "owner": "1", "path": ["15", "96"], "permalink_uri": "http://hdl.handle.net/2261/00074082", "pubdate": {"attribute_name": "公開日", "attribute_value": "2017-12-14"}, "publish_date": "2017-12-14", "publish_status": "0", "recid": "48957", "relation": {}, "relation_version_is_last": true, "title": ["Fast Algorithms for k-Word Proximity Search"], "weko_shared_id": null}
Fast Algorithms for k-Word Proximity Search
http://hdl.handle.net/2261/00074082
http://hdl.handle.net/2261/00074082390d402a-cdca-4b36-aeae-188fcf49279b
名前 / ファイル | ライセンス | アクション |
---|---|---|
Fast Algorithms for k-Word Proximity Search 2001.pdf (268.8 kB)
|
|
Item type | 学術雑誌論文 / Journal Article(1) | |||||
---|---|---|---|---|---|---|
公開日 | 2017-12-14 | |||||
タイトル | ||||||
タイトル | Fast Algorithms for k-Word Proximity Search | |||||
言語 | ||||||
言語 | eng | |||||
資源タイプ | ||||||
資源 | http://purl.org/coar/resource_type/c_6501 | |||||
タイプ | journal article | |||||
著者 |
SADAKANE, Kunihiko
× SADAKANE, Kunihiko× IMAI, Hiroshi |
|||||
著者所属 | ||||||
著者所属 | Graduate School of Information Sciences, Tohoku University | |||||
著者所属 | ||||||
著者所属 | Department of Information Science, University of Tokyo | |||||
抄録 | ||||||
内容記述タイプ | Abstract | |||||
内容記述 | When we search from a huge amount of documents, we often specify several keywords and use conjunctive queries to narrow the result of the search. Though the searched documents contain all keywords, positions of the keywords are usually not considered. As a result, the search result contains some meaningless documents. It is therefore effective to rank documents according to proximity of keywords in the documents. This ranking is regarded as a kind of text data mining. In this paper, we propose two algorithms for finding documents in which all given keywords appear in neighboring places. One is based on plane-sweep algorithm and the other is based on divide-and-conquer approach. Both algorithms run in O(n log n) time where n is the number of occurrences of given keywords. We run the algorithms on a large collection of html files and verify its effectiveness. | |||||
内容記述 | ||||||
内容記述タイプ | Other | |||||
内容記述 | PAPER | |||||
書誌情報 |
IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences 巻 E84-A, 号 9, p. 2311-2318, 発行日 2001-09-01 |
|||||
権利 | ||||||
権利情報 | copyright©2001 IEICE | |||||
著者版フラグ | ||||||
値 | publisher | |||||
出版者 | ||||||
出版者 | Institute of Electronics, Information and Communication Engineers | |||||
関係URI | ||||||
識別子タイプ | URI | |||||
関連識別子 | https://search.ieice.org/ |