专利名称:A Generic Architecture for Indexing
Document Groups in an Inverted Text Index
发明人:Andrei Z. Broder,Marcus Felipe
Fontoura,Michael Herscovici,RonnyLempel,John Ai McPherson,AndreasNeumann,Runping Qi,Eugene Jon Shekita
申请号:US10905604申请日:20050112
公开号:US20060155739A1公开日:20060713
专利附图:
摘要:A method for indexing a plurality of documents, that includes a plurality ofduplicate documents, first identifies one or more duplicate groups of documents fromamong the plurality of documents. Then, one index of content for the duplicate group iscreated instead of indexing the content from every document within the duplicate group.However, in contrast to the content index, an index of metadata for each of the
documents in the duplicate group is created. Thus the content of each duplicate group isindexed only once, while a search engine using such indexing techniques retains thecapability to answer queries as if the duplicated content was indexed for each documentof the group.
申请人:Andrei Z. Broder,Marcus Felipe Fontoura,Michael Herscovici,RonnyLempel,John Ai McPherson,Andreas Neumann,Runping Qi,Eugene Jon Shekita
地址:630 West 246th St. Apt. #927 Bronx NY 10471 US,205 Charter Oaks Circle LosGatos CA 95032 US,14 Got Levin Street Haifa 32922 IL,1 Moshe Sneh Street Haifa 34987IL,6586 Graystone Meadow Circle San Jose CA 95120 US,Goerlitzer Str. 9 Muelheim an derRuhr 45470 DE,7588 Barnhart Place Cupertino CA 95014 US,6599 Winterset Way San JoseCA 95120 US
国籍:US,US,IL,IL,US,DE,US,US
更多信息请下载全文后查看
因篇幅问题不能全部显示,请点此查看更多更全内容