WO2017064561A3 - Selection of initial document collection for visual interactive search - Google Patents

Selection of initial document collection for visual interactive search

Info

Publication number
WO2017064561A3
WO2017064561A3 PCT/IB2016/001590 IB2016001590W WO2017064561A3 WO 2017064561 A3 WO2017064561 A3 WO 2017064561A3 IB 2016001590 W IB2016001590 W IB 2016001590W WO 2017064561 A3 WO2017064561 A3 WO 2017064561A3
Authority
WO
WIPO (PCT)
Prior art keywords
documents
initial
user
selection
space
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/IB2016/001590
Other languages
French (fr)
Other versions
WO2017064561A2 (en)
Inventor
Diego Guy M. LEGRAND
Philip M. Long
Nigel Duffy
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sentient Technologies Barbados Ltd
Original Assignee
Sentient Technologies Barbados Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sentient Technologies Barbados Ltd
Publication of WO2017064561A2
Publication of WO2017064561A3
Anticipated expiration
Ceased

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/34 Browsing; Visualisation therefor

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Document Processing Apparatus (AREA)

Abstract

Roughly described, a system for user identification of a desired document. A database identifies a catalog of documents in an embedding space, in which the distance between documents corresponds to a measure of their dissimilarity. The system presents an initial collection of the documents toward the user from an initial candidate space which is part of the embedding space, then in response to iterative user input, refines the candidate space and subsequent collections of documents presented toward the user. The initial collection is determined using a weighted cost-based iterative addition to the initial collection of documents from the initial candidate space, trading off between two sub-objectives of representativeness and diversity.
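The abstract names the selection strategy but gives no formula, so the following is only a minimal sketch of one way a weighted, cost-based greedy addition could trade off representativeness against diversity. Everything specific here is an assumption for illustration, not the method claimed in the application: the function name `select_initial_collection`, the Euclidean metric, the representativeness cost (mean distance from each candidate to its nearest selected document), the diversity term (distance from a new document to its nearest already-selected document), and the single `weight` parameter.

```python
import math
from typing import List, Sequence

Point = Sequence[float]


def select_initial_collection(
    candidates: List[Point], k: int, weight: float = 0.5
) -> List[int]:
    """Greedily pick k candidate indices (hypothetical helper).

    Assumed cost of adding document j to the current selection:
        weight * representativeness_cost - (1 - weight) * diversity
    where lower cost is better.
    """
    n = len(candidates)
    if not 0 <= k <= n:
        raise ValueError("k must be between 0 and the number of candidates")

    selected: List[int] = []
    # nearest[i]: distance from candidate i to its closest selected document.
    nearest = [math.inf] * n

    while len(selected) < k:
        best_idx, best_cost = -1, math.inf
        for j in range(n):
            if j in selected:
                continue
            # Representativeness: how well the selection plus j covers the
            # whole candidate space (mean distance to nearest selected doc).
            rep_cost = sum(
                min(nearest[i], math.dist(candidates[i], candidates[j]))
                for i in range(n)
            ) / n
            # Diversity: how far j is from what is already selected
            # (0 for the first pick, when nothing is selected yet).
            diversity = min(
                (math.dist(candidates[j], candidates[s]) for s in selected),
                default=0.0,
            )
            cost = weight * rep_cost - (1.0 - weight) * diversity
            if cost < best_cost:
                best_idx, best_cost = j, cost

        selected.append(best_idx)
        # Update the nearest-selected distances after the addition.
        for i in range(n):
            d = math.dist(candidates[i], candidates[best_idx])
            if d < nearest[i]:
                nearest[i] = d

    return selected


# Usage: two well-separated clusters of three documents each.
points = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (5.0, 5.0), (5.1, 5.0), (5.0, 5.1)]
picked = select_initial_collection(points, k=2)
```

In this sketch, raising `weight` toward 1 favors coverage of the candidate space, while lowering it toward 0 favors documents that are far from those already chosen. With the two clusters above, a selection of two documents takes one from each cluster.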

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201562242258P 2015-10-15 2015-10-15
US62/242,258 2015-10-15

Publications (2)

Publication Number Publication Date
WO2017064561A2 (en) 2017-04-20
WO2017064561A3 (en) 2017-07-06

Family

ID=58517053

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2016/001590 Ceased WO2017064561A2 (en) 2015-10-15 2016-10-17 Selection of initial document collection for visual interactive search

Country Status (1)

Country Link
WO (1) WO2017064561A2 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020201866A1 (en) 2019-03-29 2020-10-08 Semiconductor Energy Laboratory Co., Ltd. (株式会社半導体エネルギー研究所) Image search system and image search method

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6098034A (en) * 1996-03-18 2000-08-01 Expert Ease Development, Ltd. Method for standardizing phrasing in a document
US6286018B1 (en) * 1998-03-18 2001-09-04 Xerox Corporation Method and apparatus for finding a set of documents relevant to a focus set using citation analysis and spreading activation techniques
US6353825B1 (en) * 1999-07-30 2002-03-05 Verizon Laboratories Inc. Method and device for classification using iterative information retrieval techniques
US20020164078A1 (en) * 2001-03-23 2002-11-07 Fujitsu Limited Information retrieving system and method
US20050165600A1 (en) * 2004-01-27 2005-07-28 Kas Kasravi System and method for comparative analysis of textual documents
US20080243842A1 (en) * 2007-03-28 2008-10-02 Xerox Corporation Optimizing the performance of duplicate identification by content
US7814107B1 (en) * 2007-05-25 2010-10-12 Amazon Technologies, Inc. Generating similarity scores for matching non-identical data strings
US20130212090A1 (en) * 2012-02-09 2013-08-15 Stroz Friedberg, LLC Similar document detection and electronic discovery
US8972394B1 (en) * 2009-07-20 2015-03-03 Google Inc. Generating a related set of documents for an initial set of documents

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
STASKO ET AL.: "Jigsaw: supporting investigative analysis through interactive visualization.", 2008, XP031221446, Retrieved from the Internet <URL:http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.1014.9375&rep=rep1&type=pdf> [retrieved on 20170419] *

Also Published As

Publication number Publication date
WO2017064561A2 (en) 2017-04-20

Similar Documents

Publication Publication Date Title
Nguyen et al. Human detection from images and videos: A survey
GB2544660A (en) Visual interactive search
JP2017503273A5 (en)
Afsar et al. Automatic visual detection of human behavior: A review from 2000 to 2014
HK1222726A1 (en) Intelligent automated assistant
WO2012142553A3 (en) Identifying query formulation suggestions for low-match queries
Li et al. Categorisation of visualisation methods to support the design of Human-Computer Interaction Systems
WO2008060919A3 (en) Image recognition system for use in analysing images of objects and applications thereof
WO2015138497A3 (en) Systems and methods for rapid data analysis
WO2017098332A3 (en) Method and system for inputting information
WO2010110880A3 (en) Shape based picture search
WO2018014109A8 (en) System and method for analyzing and searching for features associated with objects
WO2009099798A3 (en) System and method for utilizing tiles in a search results page
WO2014111944A8 (en) Systems and methods for identifying explosives
WO2016202214A3 (en) Method and device for displaying keyword
JP2015153013A5 (en)
WO2009099947A3 (en) Methods and apparatus to generate smart text
JO3514B1 (en) System and method for accessing images with a captured query image
EP4300501A3 (en) Methods of sequencing data read realignment
WO2014185651A3 (en) Method for providing integrated management service for creative products in cultural arts reflecting needs of requester
WO2017064561A3 (en) Selection of initial document collection for visual interactive search
Mondéjar-Guerra et al. Keypoint descriptor fusion with Dempster–Shafer theory
WO2014159111A3 (en) Clustering of ads with organic map content
WO2016085527A8 (en) Method and system for storage retrieval
WO2009013818A1 (en) Character recognition processing method and device

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application

Ref document number: 16855013

Country of ref document: EP

Kind code of ref document: A2

ENP Entry into the national phase

Ref document number: 2017545913

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

122 EP: PCT application non-entry in European phase

Ref document number: 16855013

Country of ref document: EP

Kind code of ref document: A2