[go: up one dir, main page]

HK1197945B - Media object metadata association and ranking - Google Patents

Media object metadata association and ranking Download PDF

Info

Publication number
HK1197945B
HK1197945B HK14111500.4A HK14111500A HK1197945B HK 1197945 B HK1197945 B HK 1197945B HK 14111500 A HK14111500 A HK 14111500A HK 1197945 B HK1197945 B HK 1197945B
Authority
HK
Hong Kong
Prior art keywords
metadata
image
user
ranking
input
Prior art date
Application number
HK14111500.4A
Other languages
Chinese (zh)
Other versions
HK1197945A (en
Inventor
丹尼尔.斯图尔特.布特尔菲尔德
埃里克.科斯特洛
凯特里纳.法克
卡鲁姆.詹姆斯.汉德尔森-比格
施吉.穆哈朝夫
乔舒亚.埃里.斯沙彻尔
Original Assignee
Jollify Management Limited
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jollify Management Limited filed Critical Jollify Management Limited
Publication of HK1197945A publication Critical patent/HK1197945A/en
Publication of HK1197945B publication Critical patent/HK1197945B/en

Links

Description

媒体对象元数据关联和排名Media object metadata association and ranking

分案申请说明Divisional Application Instructions

本申请是申请日为2006年4月20日、题为“媒体对象元数据关联和排名”的中国发明专利申请No.200680022266.X(PCT国际申请PCT/US2006/015263)的分案申请。This application is a divisional application of Chinese invention patent application No. 200680022266.X (PCT international application PCT/US2006/015263), filed on April 20, 2006, and entitled “Media Object Metadata Association and Ranking”.

相关申请交叉引用Cross-reference to related applications

本申请要求2005年4月21日提交的题为“GENERATION AND USE OF METADATA FORMEDIA OBJECTS”的美国申请No.60/674,109的优先权,该申请整体通过引用结合于此。This application claims priority to U.S. Application No. 60/674,109, filed April 21, 2005, entitled "GENERATION AND USE OF METADATA FORMEDIA OBJECTS," which is incorporated herein by reference in its entirety.

本申请与同时提交的题为“INTERESTINGNESS RANKING OF MEDIA OBJECTS”的美国申请No._____相关,该申请整体通过引用结合于此。This application is related to concurrently filed U.S. Application No. _____, entitled “INTERESTINGNESS RANKING OF MEDIA OBJECTS,” which is incorporated herein by reference in its entirety.

技术领域Technical Field

本发明一般地涉及媒体对象的组织和显示,具体地涉及元数据与媒体对象(例如,图像)的关联及其排名。The present invention relates generally to the organization and display of media objects, and more particularly to the association of metadata with media objects (eg, images) and their ranking.

背景技术Background Art

现有的网站允许用户对诸如书籍或电影之类的媒体进行注释和品论。但是,传统的网站在下述方面受到限制:用户直接提供或者通过用户的行动间接提供的关于媒体对象的信息的类型,以及使用该信息来生成搜索结果。Existing websites allow users to annotate and review media such as books or movies. However, traditional websites are limited in the type of information that users can provide directly or indirectly through user actions about media objects, and in the use of that information to generate search results.

搜索结果依赖于项目的排名(ranking)来确定要呈现给搜索者的最相关的项目。这些排名可以是基于例如这样的标准的:特定项目被用户“点击”或者查看的次数。希望使多种关于媒体对象的用户导出的信息可用,并且开发基于这种信息的用于媒体对象的更相关的排名。Search results rely on rankings of items to determine the most relevant items to present to the searcher. These rankings can be based on criteria such as the number of times a particular item has been "clicked" or viewed by users. It would be desirable to make available a variety of user-derived information about media objects and to develop more relevant rankings for media objects based on this information.

发明内容Summary of the Invention

本发明的实施例通过提供一个或多个媒体对象以显示给一个或多个用户,并且接受来自用户的关于这一个或多个媒体对象的输入,从而使源数据能够与媒体对象关联。媒体对象可以包括图像(例如,静止或运动图像)或者音频媒体对象(例如,音轨)。输入可以包括至少两种不同类型的元数据,这些类型包括标签、评价、注释、描述和添加到最爱(“设为最爱”)或者播放列表(“设到播放列表”)。标签元数据可以包括位置元数据。这里实现该实施例的逻辑可以位于服务器处,并且每个用户可以被与相应的客户端计算机相关联。Embodiments of the present invention enable source data to be associated with media objects by providing one or more media objects for display to one or more users and accepting input from the users regarding the one or more media objects. The media objects may include images (e.g., still or moving images) or audio media objects (e.g., audio tracks). The input may include at least two different types of metadata, including tags, ratings, annotations, descriptions, and adding to favorites ("Set to Favorites") or playlists ("Set to Playlist"). Tag metadata may include location metadata. The logic for implementing this embodiment may be located at a server, and each user may be associated with a corresponding client computer.

本发明的实施例也可以利用至少一个元数据集合(metadatum)执行对媒体对象的搜索,并且对从该搜索返回的一幅或者多幅图像排名。该排名可以至少部分基于与该媒体对象相关的用户行动,包括用户输入的与该媒体对象相关的元数据的量、已向该媒体对象指派了元数据的用户的数目、和/或与该媒体对象相关的访问模式。访问模式可以至少部分基于对该媒体对象的点入(click through)或者查看的次数。排名也可以至少部分基于媒体对象的发布者和发起搜索的用户之间的关系和/或与该媒体对象相关的时间流逝的。本发明的实施例可以使排名至少部分基于一个或多个标签对媒体对象的适当性(relevance),其中适当性自身是基于来自一个或多个用户的适当性输入的。Embodiments of the present invention may also perform a search for media objects using at least one metadata set and rank one or more images returned from the search. The ranking may be based at least in part on user actions associated with the media object, including the amount of metadata associated with the media object input by the user, the number of users who have assigned metadata to the media object, and/or access patterns associated with the media object. The access patterns may be based at least in part on the number of click-throughs or views of the media object. The ranking may also be based at least in part on the relationship between the publisher of the media object and the user who initiated the search and/or the passage of time associated with the media object. Embodiments of the present invention may base the ranking at least in part on the relevance of one or more tags to the media object, where the relevance itself is based on relevance input from one or more users.

本发明的实施例可以包括统计逻辑,用于至少部分基于第一元数据集合和第二元数据集合共同被指派给相同媒体对象的频度确定度量。统计逻辑可以基于与多个媒体对象相关联的多个元数据的度量确定相关元数据的集群,并且对每个集群内的相关联的多个媒体对象进行排名。统计逻辑还可以如果所确定的度量超过阈值相关度值,则将第一元数据和第二元数据作为相关元数据提供来显示。本发明的实施例可以至少部分基于从用户输入的适当性确定第一元数据和第二元数据对图像的适当性,在这种情形中,统计逻辑可以至少部分基于所确定的适当性改变所述相关度度量。Embodiments of the present invention may include statistical logic for determining a metric based, at least in part, on how often a first set of metadata and a second set of metadata are commonly assigned to the same media object. The statistical logic may determine clusters of related metadata based on the metric for a plurality of metadata associated with a plurality of media objects, and rank the associated plurality of media objects within each cluster. The statistical logic may also provide the first metadata and the second metadata for display as related metadata if the determined metric exceeds a threshold relevance value. Embodiments of the present invention may determine the appropriateness of the first metadata and the second metadata for an image based, at least in part, on an appropriateness input from a user, in which case the statistical logic may change the relevance metric based, at least in part, on the determined appropriateness.

本发明的实施例可以确定在预定时间段中至少一个特定元数据集合被指派给多个媒体对象的频度,并且/或者确定该至少一个特定元数据集合被指派给的媒体对象的数目。Embodiments of the present invention may determine the frequency with which at least one specific metadata set is assigned to a plurality of media objects in a predetermined time period and/or determine the number of media objects to which the at least one specific metadata set is assigned.

本发明的实施例可以提供由广告服务器使用的信息,该广告服务器用于将广告与图像或者其他媒体对象相关联。响应于可以导致对媒体对象的访问的搜索或者其他用户动作,媒体对象可以被提供来向用户显示。原数据逻辑可以接受来自多个用户的关于媒体对象的元数据。统计逻辑可以至少部分基于至少一个元数据被指派给媒体对象的频度确定相关度度量。可以使元数据集合和度量对广告服务器可用。Embodiments of the present invention may provide information used by an ad server to associate advertisements with images or other media objects. In response to a search or other user action that results in access to the media object, the media object may be provided for display to a user. Metadata logic may accept metadata about the media object from multiple users. Statistics logic may determine a relevance metric based, at least in part, on the frequency with which at least one metadata item is assigned to the media object. The metadata set and metric may be made available to the ad server.

广告服务器可以至少部分基于元数据并且可选地基于相关度度量,将广告与媒体对象相关联。在本发明的实施例中,统计逻辑可以至少部分基于第一元数据集合和第二元数据集合被共同指派给媒体对象的频度确定度量。可以使第一和第二元数据对广告服务器可用。The ad server may associate an advertisement with the media object based at least in part on the metadata and, optionally, on a relevance metric. In an embodiment of the present invention, the statistical logic may determine the metric based at least in part on how often the first set of metadata and the second set of metadata are jointly assigned to the media object. The first and second metadata may be made available to the ad server.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1示出了根据本发明实施例的客户端-服务器系统。FIG1 shows a client-server system according to an embodiment of the present invention.

图2是示出了根据本发明实施例的标签元数据到媒体对象的入口的屏幕快照。FIG. 2 is a screenshot illustrating entry of tag metadata into a media object according to an embodiment of the present invention.

图3示出了根据本发明实施例添加注释元数据。FIG. 3 illustrates adding annotation metadata according to an embodiment of the present invention.

图4示出了根据本发明实施例设置许可。FIG. 4 illustrates setting permissions according to an embodiment of the present invention.

具体实施方式DETAILED DESCRIPTION

给出下面的说明使本领域普通技术人员能够实现和使用本发明。提供对特定设备、技术和应用的描述仅作为示例。本领域普通技术人员将清楚对这里所述的示例的各种修改,并且这里限定的一般的原理可以被应用于其他示例和应用,而部脱离本发明的精神和范围。因此,本发明不是要被限于这里描述并示出的示例,而是要符合与权利要求书一致的范围。The following description is given to enable those skilled in the art to implement and use the present invention. The description of specific devices, techniques and applications is provided as an example only. It will be clear to those skilled in the art that various modifications to the examples described herein, and the general principles defined herein can be applied to other examples and applications without departing from the spirit and scope of the present invention. Therefore, the present invention is not intended to be limited to the examples described and shown herein, but rather to conform to the scope consistent with the claims.

图1示出了根据本发明实施例的客户端-服务器系统。根据本发明实施例的媒体服务器可以包括http web服务器逻辑、脚本引擎(例如,PHP脚本引擎)、数据库和汇聚引擎。媒体服务器可以通过网络(例如因特网)与多个客户端通信。FIG1 illustrates a client-server system according to an embodiment of the present invention. A media server according to an embodiment of the present invention may include HTTP web server logic, a scripting engine (e.g., a PHP scripting engine), a database, and a convergence engine. The media server may communicate with multiple clients over a network (e.g., the Internet).

脚本引擎可以包括认证逻辑、上传逻辑、元数据处理逻辑和许可处理逻辑。认证逻辑对登入到媒体服务器网站的用户进行认证。上传逻辑可以用来将符合任何媒体格式的数据从客户端上传到服务器,所述媒体格式例如是静止照片(例如,JPEG、TIFF)、视频(例如,MPEG、AVI)、或者音频(例如,MP3、OGG)。汇聚引擎可以包括统计引擎和搜索引擎。用于与本发明的服务器一起使用的客户端可以包括典型的web浏览器应用。在www.flickr.com处可以观察到本发明的许多功能,其整体通过引用被结合于此。The scripting engine may include authentication logic, upload logic, metadata processing logic, and license processing logic. The authentication logic authenticates users logging into the media server website. The upload logic may be used to upload data from the client to the server in any media format, such as still photos (e.g., JPEG, TIFF), video (e.g., MPEG, AVI), or audio (e.g., MP3, OGG). The aggregation engine may include a statistics engine and a search engine. Clients for use with the server of the present invention may include typical web browser applications. Many of the features of the present invention can be observed at www.flickr.com, which is incorporated herein by reference in its entirety.

根据本发明的实施例,在客户端处的用户使用浏览器来访问媒体服务器,并且请求对媒体对象的上传。在响应中,上传逻辑将来自客户端的媒体对象存储到数据库中。为了方便,我们将频繁使用图像作为系统操纵的媒体对象的示例,但是本领域技术人员将认识到,在适当的情况下本发明在经过适当的修改并且使用其他功能后,也适用于其他媒体对象(例如,查看媒体对象可以应用于查看静止图像或者运动图像,或者听音频媒体对象,例如音轨)。According to an embodiment of the present invention, a user at a client uses a browser to access a media server and request an upload of a media object. In response, the upload logic stores the media object from the client in a database. For convenience, we will frequently use images as an example of media objects manipulated by the system, but those skilled in the art will recognize that the present invention is also applicable to other media objects, with appropriate modifications and using other functions, where appropriate (for example, viewing a media object can be applied to viewing still images or moving images, or listening to an audio media object, such as a soundtrack).

元数据处理逻辑许可用户输入元数据来描述每幅图像。(如这里所述,“元数据”可以指一个元数据集合或者多个元数据。)参见图2,元数据可以采取用于每幅图像的一个或多个标签的形式,例如,针对飞行中的海鸥的图像的作为一个空间限定的列表“cloudsseagull birds sky”输入的四个不同标签。其他类型的元数据包括标题(例如,“Last gull(目前)”)、对图像的描述、注释和评论。注释是直接在被注释的图像的一部分上显示的描述性注解。注释可以被隐藏不被查看,直到用户使光标经过被注释的部分。参见图3,例如,用户可以在海鸥的翅膀附近添加注释,例如“注意,日光穿过了翅膀”。评论可以被输入到文本输入框中,该文本输入框与用于在消息板上输入评论的类似。针对一个媒体对象,可以作出并且显示来自任何被许可用户的多条评论。The metadata processing logic allows users to enter metadata to describe each image. (As described herein, "metadata" can refer to a metadata set or multiple metadata.) Referring to FIG2 , the metadata can take the form of one or more tags for each image, for example, four different tags entered as a spatially bounded list "cloudsseagullbirdssky" for an image of seagulls in flight. Other types of metadata include titles (e.g., "Last gull (for now)"), descriptions of the image, annotations, and comments. Annotations are descriptive notes displayed directly on a portion of the annotated image. Annotations can be hidden from view until the user moves the cursor over the annotated portion. Referring to FIG3 , for example, a user can add a note near the seagull's wings, such as "Notice the sunlight shining through the wings." Comments can be entered into a text input box similar to that used for entering comments on a message board. Multiple comments from any authorized user can be made and displayed for a media object.

参见图4,许可逻辑使用户能够设置关于允许谁查看每幅图像的许可。例如,用户可以设置许可,用于允许仅用户自身、受限的人群(例如,家庭成员和/或朋友)、或者公众(例如,整个用户基础)访问。另外,许可逻辑允许其他人提供关于每幅图像的元数据。例如,用户可以允许朋友和/或家庭成员、任何其他用户、来自该用户的联系人列表的某些人添加标签、评论(例如,“我喜欢海鸥盘旋的方式”)或者注释,或者不允许任何人添加标签、评论或者注释。Referring to FIG4 , the permission logic enables the user to set permissions regarding who is allowed to view each image. For example, the user can set permissions to allow access only to the user, a limited group of people (e.g., family members and/or friends), or the public (e.g., the entire user base). Additionally, the permission logic allows others to provide metadata about each image. For example, the user can allow friends and/or family members, any other users, or certain people from the user's contact list to add tags, comments (e.g., "I love the way the seagulls are circling"), or annotations, or not allow anyone to add tags, comments, or annotations.

数据库使得可以以多种方式对媒体对象进行组织。例如,可以按时间组织用户的媒体对象,这允许用户按日期搜索媒体对象。这种组织允许沿时间线或者按照日历格式(例如,选自每天的图像被显示在相应的日期条目处)在用户的显示器上呈现媒体对象(例如,照片图像、用于代表视频文件的静止图像、或者代表音频文件的图标)。可以按照用户的显示格式选择,根据媒体对象被创建的日期、或上传日期来显示媒体对象。此外,脚本引擎允许以幻灯片格式显示媒体对象。The database makes it possible to organize media objects in a variety of ways. For example, a user's media objects can be organized by time, which allows the user to search for media objects by date. This organization allows media objects (e.g., photo images, still images representing video files, or icons representing audio files) to be presented on the user's display along a timeline or according to a calendar format (e.g., images selected from each day are displayed at the corresponding date entry). Media objects can be displayed according to the date the media object was created, or the date it was uploaded, according to the user's display format selection. In addition, a script engine allows media objects to be displayed in a slideshow format.

对于照片,创建日期可以是基于设备提供的元数据的,例如来自拍摄该照片的相机的元数据,包括关于光圈、快门速度和其他设置的元数据。可以从例如许多数码相机上传的文件内存储的EXIF(可交换图像文件)头部,或者由许多照片管理和组织软件包上传的文件内存储的IPTC(国际报业电信委员会)头部提取这种元数据。照片图像的时间组织可以被称作“照片流(photostream)”。For photos, the creation date can be based on metadata provided by the device, such as metadata from the camera that took the photo, including information about aperture, shutter speed, and other settings. This metadata can be extracted from, for example, the EXIF (Exchangeable Image File) headers stored in files uploaded by many digital cameras, or the IPTC (International Press Telecommunications Council) headers stored in files uploaded by many photo management and organization software packages. The temporal organization of photo images can be referred to as a "photostream."

数据库还允许用户将用户上传的媒体对象组织成由用户提供的集合标识符和描述来标识和描述的集合。每个图像集合例如类似于一个影集。每个媒体对象可以属于多个集合。集合标识符和描述自身也是元数据。The database also allows users to organize uploaded media objects into collections, identified and described by user-provided collection identifiers and descriptions. Each image collection is similar to a photo album, for example. Each media object can belong to multiple collections. The collection identifiers and descriptions themselves are also metadata.

与包括仅来自一个用户的媒体对象的集合相对,来自多个用户的媒体对象可以利用数据库被汇集成“群组”。每个群组由建立该群组的用户提供的群组标识符来标识。将所有群组分组到一起代表在容纳媒体对象的媒体服务器上可访问的特定类型(例如,图像)的所有媒体对象。群组创建者可以设置用于访问和添加媒体对象到群组的各种许可级别。许可级别可以包括例如对整个用户基础是公开的,或者对朋友/家庭成员或者用户定义的社交网络是私密的。被许可访问一个群组的用户也被许可添加标签、评论和/或注释。与显示和组织个体用户的媒体对象类似,数据库使得能够以按照日期安排的时间线或者日历格式来组织和显示群组媒体对象,也可以以幻灯片方式组织和显示媒体对象。In contrast to collections that include media objects from only one user, media objects from multiple users can be aggregated into "groups" using a database. Each group is identified by a group identifier provided by the user who created the group. Grouping all groups together represents all media objects of a particular type (e.g., images) accessible on the media server that hosts the media objects. The group creator can set various permission levels for accessing and adding media objects to the group. Permission levels can include, for example, public to the entire user base or private to friends/family members or a user-defined social network. Users who are granted access to a group are also permitted to add tags, comments, and/or annotations. Similar to displaying and organizing an individual user's media objects, the database enables group media objects to be organized and displayed in a timeline or calendar format arranged by date, or in a slideshow format.

媒体服务器可以包括RSS提要(feed)发生器,其允许用户预订媒体对象的“提要”,例如属于特定分组的媒体对象,或者由特定标签标识的媒体对象,这些媒体对象例如是按照发布它们的日期排序的。(“分组”可以指任何集合,例如,媒体对象的所有群组、多个用户的媒体对象的单个群组、个体用户的所有媒体对象,或者个体的媒体对象的集合(即,子集))在用户的客户端计算机处的RSS阅读器可以被配置为使得仅最近访问的媒体对象之后发布的新媒体对象(例如,更新)被呈现给该阅读器。类似地,阅读器可以被配置为使得仅与特定媒体对象相关的最常见评论串可以利用该功能来显示。A media server may include an RSS feed generator that allows a user to subscribe to a "feed" of media objects, such as media objects belonging to a particular group or identified by a particular tag, sorted, for example, by the date they were published. (A "group" may refer to any collection, such as all groups of media objects, a single group of media objects for multiple users, all media objects for an individual user, or a collection (i.e., a subset) of an individual's media objects.) An RSS reader on a user's client computer may be configured so that only new media objects (e.g., updates) published after the most recently accessed media object are presented to the reader. Similarly, the reader may be configured so that only the most popular comment threads related to a particular media object are displayed using this functionality.

统计引擎基于汇聚的元数据生成统计数据和其他度量。在一个实施例中,统计引擎确定在预定时间段内在一个媒体对象的分组中元数据(例如,标签)的流行度(popularity)。例如,统计引擎可以确定在最近24小时中,向系统中的所有群组中、单个群组中、或者媒体对象的集合中的一个或多个媒体对象指派了特定标签的不同用户的数目。汇聚引擎可以确定(并且显示)标签的直方图,并且可以通过确定频度超过最小阈值频度或者属于预定数目个最流行的标签的那些标签,从而确定最频繁地被指派的标签(在任意时间点或者在预定的时间段中)。The statistics engine generates statistics and other metrics based on the aggregated metadata. In one embodiment, the statistics engine determines the popularity of metadata (e.g., tags) within a group of media objects over a predetermined time period. For example, the statistics engine can determine the number of unique users who have assigned a particular tag to one or more media objects in all groups, a single group, or a collection of media objects in the system over the last 24 hours. The aggregation engine can determine (and display) a histogram of the tags and can determine the most frequently assigned tags (at any point in time or over a predetermined time period) by determining those tags whose frequency exceeds a minimum threshold frequency or falls within a predetermined number of the most popular tags.

在本发明的一个实施例中,预定数目的元数据(例如,标签)或者元数据内的项目(terms)(例如,评论内的项目)可以具有由用于显示它们的字体的大小指示的它们的频度。例如,所有群组内的100个最流行的标签可以按照字母顺序被布置在用户的显示器上,并且由增大的字体大小指示增大的流行度。In one embodiment of the present invention, a predetermined number of metadata (e.g., tags) or terms within metadata (e.g., terms within comments) may have their frequency indicated by the size of the font used to display them. For example, the 100 most popular tags within all groups may be arranged in alphabetical order on the user's display, with increasing popularity indicated by increasing font size.

在另一个实施例中,统计引擎可以确定元数据的“相关度(relatedness)”,即,特定元数据集合(例如,标签)(或者元数据集合内(例如,评论内)的项目)被指派给一个媒体对象的频度和至少一个其他特定元数据集合(或者元数据集合内的项目)的共同出现测度。在一个实施例中,共同出现测度可以确定相同类型的元数据共同出现的频度。例如,在添加有标签“Italy”的所有100幅图像中,这些图像中的50幅也可以被添加有标签“Rome”、25个被添加有标签“Venice”、10个被添加有标签“Florence”、2个被添加有标签“Sienna”。共同出现索引分别是:50个“Italy-Rome”、25个“Italy-Venice”、10个“Italy-Florence”和2个“Italy-Sienna”。总言之,包括作为标签的子集的位置,标签MD可以包括位置。In another embodiment, the statistics engine can determine the "relatedness" of metadata, i.e., the frequency with which a particular set of metadata (e.g., a tag) (or items within a metadata set (e.g., a comment)) is assigned to a media object and a co-occurrence measure with at least one other particular set of metadata (or items within a metadata set). In one embodiment, the co-occurrence measure can determine the frequency with which metadata of the same type co-occurs. For example, of all 100 images tagged with "Italy," 50 of these images can also be tagged with "Rome," 25 with "Venice," 10 with "Florence," and 2 with "Sienna." The co-occurrence indices are: 50 "Italy-Rome," 25 "Italy-Venice," 10 "Italy-Florence," and 2 "Italy-Sienna," respectively. In summary, a tag MD can include locations, including locations as a subset of tags.

在另一个实施例中,可以使相关度度量是特定于用户的,使得其是将至少一个媒体对象指派给特定元数据集合(例如,标签)(或者元数据集合内的项目(例如,评论))的用户的数目的频度和至少一个其他特定元数据集合(或者元数据集合内的项目)的共同出现测度。例如,在用词语“Italy”给至少一幅图像加了标签的所有100个用户中,50个用户可能还用“Rome”给已加有标签“Italy”的图像加了标签、25个还用“Venice”加了标签、10个用“Florence”加了标签、并且2个用“Sienna”加了标签。共同出现索引分别是:50个“Italy-Rome”、25个“Italy-Venice”、10个“Italy-Florence”和2个“Italy-Sienna”。In another embodiment, the relevance metric can be user-specific, such that it is a frequency measure of the number of users who assigned at least one media object to a particular metadata set (e.g., a tag) (or an item within a metadata set (e.g., a comment)) and a co-occurrence measure with at least one other particular metadata set (or item within a metadata set). For example, of all 100 users who tagged at least one image with the word "Italy," 50 users might have also tagged images tagged "Italy" with "Rome," 25 with "Venice," 10 with "Florence," and 2 with "Sienna." The co-occurrence indices are: 50 "Italy-Rome," 25 "Italy-Venice," 10 "Italy-Florence," and 2 "Italy-Sienna," respectively.

具有最高共同出现索引的预定数目个元数据(例如,标签)或者具有超过预定阈值的共同出现索引的那些元数据可以作为“相关的”元数据(例如,标签)显示给用户,同时不满足预定条件的至少一个元数据集合(例如,标签)被显示在“See alos”下面。预定阈值可以作为索引的最大可能值的百分比而被计算出。所有这种显示的元数据都可以充当到被指派了指定的元数据的所有媒体对象的超链接。相关度测度可以被应用到所有“公开”媒体对象(即,对系统上的任何人都可用的那些),或者应用到较小的分组(例如,群组或集合内的那些)。A predetermined number of metadata (e.g., tags) with the highest co-occurrence index, or those with a co-occurrence index exceeding a predetermined threshold, can be displayed to the user as "related" metadata (e.g., tags), while at least one set of metadata (e.g., tags) that does not meet the predetermined criteria is displayed under "See also." The predetermined threshold can be calculated as a percentage of the maximum possible value of the index. All such displayed metadata can serve as hyperlinks to all media objects assigned the specified metadata. The relevance measure can be applied to all "public" media objects (i.e., those available to anyone on the system) or to smaller groups (e.g., those within a group or collection).

作为相关度计算的一部分,统计引擎可以采用本领域已知的统计集群分析来确定元数据(例如,标签)之间的统计近似度,并且根据相应的集群将元数据和关联的媒体对象分组。例如,在用词语“Vancouver”加标签的10,000幅图像中,阈值近似度水平内的一个统计集群可以包括也用“Canada”和“British Columbia”加标签的图像。阈值近似度水平内的另一个统计集群可以相反用“Washington”和“space needle”与“Vancouver”一起加标签。集群分析允许统计引擎将“Vancouver”与“Vancouver-Canada”集群和“Vancouver-Washington”集群二者相关联。媒体服务器可以向用户提供两组相关的标签以显示,这两组相关标签指示例如它们属于与不同的主题领域相对应的不同的集群。As part of the relevance calculation, the statistical engine can employ statistical cluster analysis, known in the art, to determine the statistical similarity between metadata (e.g., tags) and group the metadata and associated media objects according to corresponding clusters. For example, among 10,000 images tagged with the word "Vancouver," one statistical cluster within a threshold similarity level may include images also tagged with "Canada" and "British Columbia." Another statistical cluster within the threshold similarity level may instead be tagged with "Washington" and "space needle" along with "Vancouver." The cluster analysis allows the statistical engine to associate "Vancouver" with both the "Vancouver-Canada" cluster and the "Vancouver-Washington" cluster. The media server can present the two sets of related tags to the user for display, indicating, for example, that they belong to different clusters corresponding to different subject areas.

本发明的一个实施例许可用户确定标签对媒体对象的适当性(relevance),尤其是对由其他用户发布的媒体对象的适当性。适当性设置图标或者其他输入图形可以被显示在每个标签旁。例如,图标可以包括“+”和“-”按钮,用于分别指示用户相信该标签对于所显示的媒体对象是适当的或者不适当的。统计引擎可以针对每个媒体对象收集适当性条目,以确定该对象的适当性度量。例如,该度量可以仅是输入的“+”的数目除以每个媒体对象的适当性条目的总数。统计引擎将每个投票与投票用户相关联,来防止“塞满投票箱”,即,统计引擎避免计数单个用户关于标签对媒体对象的适当性的多次投票。One embodiment of the present invention allows a user to determine the relevance of a tag to a media object, particularly to media objects posted by other users. An appropriateness setting icon or other input graphic may be displayed next to each tag. For example, the icon may include "+" and "-" buttons to indicate, respectively, that the user believes the tag is appropriate or inappropriate for the displayed media object. A statistical engine may collect the appropriateness entries for each media object to determine a measure of the object's appropriateness. For example, the measure may simply be the number of "+"s entered divided by the total number of appropriateness entries for each media object. The statistical engine associates each vote with the voting user to prevent "stuffing the ballot box," i.e., the statistical engine avoids counting multiple votes by a single user regarding the appropriateness of a tag to a media object.

统计引擎可以将适当性值考虑到集群分析中来影响相关度度量。例如,具有低适当性值的标签将被作为与和相同媒体对象相关联的其他标签不相关(即,被加权为具有较长的统计距离)。The statistical engine can factor the appropriateness value into the cluster analysis to influence the relevance metric. For example, a tag with a low appropriateness value will be treated as irrelevant to other tags associated with the same media object (i.e., weighted as having a longer statistical distance).

根据本发明的实施例,元数据处理逻辑118可以为每个媒体对象计算“兴趣”度量。兴趣可以是与媒体对象相关的用户动作的函数,包括例如用户输入的和/或用户编辑的元数据的量和/或媒体对象的访问模式。作为替换,或者除了那些因素之外,兴趣可以是时间、系统设置和/或用户与元数据的发布者的关系的函数。According to embodiments of the present invention, metadata processing logic 118 may calculate an "interest" metric for each media object. Interest may be a function of user actions associated with the media object, including, for example, the amount of user-entered and/or user-edited metadata and/or access patterns for the media object. Alternatively, or in addition to those factors, interest may be a function of time, system settings, and/or the user's relationship with the publisher of the metadata.

上述每个因素可以受到系统设计者所设置的最大值钳制,这是对每个因素加权的一种方式。作为替换,或者另外,在任何钳制之前,每个因素可以利用乘以该因素的加权系数而被更直接地加权。在任意情形中,因素(加权或者未加权的)可以被一起求和来创建兴趣分数(即,排名)。加权和钳制当然可以在更精细的级别上应用到对任意这些因素作出贡献的参数(下面将描述)。Each of the above factors can be clamped to a maximum value set by the system designer, which is one way to weight each factor. Alternatively, or in addition, each factor can be weighted more directly by multiplying the factor by a weighting coefficient before any clamping. In any case, the factors (weighted or unweighted) can be summed together to create an interest score (i.e., a ranking). Weighting and clamping can of course be applied at a more granular level to the parameters that contribute to any of these factors (described below).

可以针对任何分组的任何媒体对象计算兴趣分数,例如,来自包含该媒体对象的所有群组的分组,来自包含该媒体对象的一个群组的分组,来自与该媒体对象的发布者相关联的网站的区域的分组、或者来自例如包含正被评分的媒体对象的该用户的媒体对象的集合中的分组。An interest score may be calculated for any media object of any grouping, e.g., a grouping from all groups that contain the media object, a grouping from one group that contains the media object, a grouping from areas of a website associated with the publisher of the media object, or a grouping from, for example, a set of media objects of the user that contains the media object being scored.

用户输入的元数据的量可以包括例如这样的参数,例如,指派给媒体对象的标签、评论和/或注释的数目,和/或已将该媒体对象添加到它们的喜好/书签的用户的数目。(将音频媒体对象添加到用户的喜好可以包括将该媒体对象添加到用户的播放列表。)作为替换,或者除了那些参数之外,用户输入的元数据的量可以是用户相关的,并且包括例如已向该媒体对象添加了标签、评论和/或注释,和/或已将该媒体对象添加到它们的喜好/书签的用户的数量。The amount of user-entered metadata may include, for example, parameters such as the number of tags, comments, and/or annotations assigned to the media object, and/or the number of users who have added the media object to their favorites/bookmarks. (Adding an audio media object to a user's favorites may include adding the media object to the user's playlist.) Alternatively, or in addition to those parameters, the amount of user-entered metadata may be user-dependent and include, for example, the number of users who have added tags, comments, and/or annotations to the media object, and/or the number of users who have added the media object to their favorites/bookmarks.

作为替换,或者除了那些参数之外,元数据处理逻辑118可以将兴趣分数考虑到媒体对象的访问模式中,例如,媒体对象的查看(或者回放)和/或点入的次数,和/或已查看(或者回放)和/或点入了该媒体对象或者与该媒体对象相关的标签的用户的数目。不管兴趣算法是否将用户的动作作为“点入”,或者,相反,对媒体对象的“查看”或者“观看”可以取决于用户访问该媒体对象的路线,即,访问模式。例如,搜索指派了特定标签的图像可以返回多个略缩图图像。算法可以将用户点击那些略缩图中的特定一个作为“点入”。Alternatively, or in addition to those parameters, the metadata processing logic 118 can take into account the access pattern of the media object, such as the number of times the media object has been viewed (or played back) and/or clicked on, and/or the number of users who have viewed (or played back) and/or clicked on the media object or a tag associated with the media object. Whether the interest algorithm considers the user's action as a "click-through," or, conversely, the "viewing" or "viewing" of the media object can depend on the route the user took to access the media object, i.e., the access pattern. For example, a search for images assigned a particular tag can return multiple thumbnail images. The algorithm can consider the user clicking on a particular one of those thumbnails as a "click-through."

相反,例如,用电子邮件从一个用户发送到一个用户的图像可以认为被该用户“查看”。在另一个示例中,当用户访问图像群组池时,用户的浏览器可以将这些图像作为略缩图呈现。用户可以点击一个略缩图来“查看”该图像。因此,可见,点击略缩图这一相同的动作可以被作为“查看”或者“点入”对待,这取决于用户到达该图像所采取的路径,即,访问模式。基于心理学观察、市场研究或者其他因素,系统设计者可能希望将某些访问模式作为指示了比其他访问模式高的用户兴趣程度对待,并且在计算兴趣分数时向这种访问模式指派较高的权重。作为也许更有说服力的一个示例,如果用户基于支付$10.00达到并点击一个略缩图来访问图像,则系统设计者更可能向这种访问路径指派比免费访问图像更高的加权系数。相反,某些流量的源、搜索项、标签查询显示略缩图的其他前兆可能被确定为与较高兴趣不一致的动机相关联,从而系统设计者可能向这种访问路径指派相对较低的加权系数。Conversely, for example, an image sent by email from one user to another user can be considered "viewed" by that user. In another example, when a user accesses a pool of image groups, the user's browser may present these images as thumbnails. The user can click on a thumbnail to "view" the image. Thus, the same action of clicking on a thumbnail can be treated as either a "view" or a "click-through," depending on the path the user took to reach the image, i.e., the access mode. Based on psychological observations, market research, or other factors, system designers may wish to treat certain access modes as indicating a higher level of user interest than other access modes and assign a higher weight to such access modes when calculating interest scores. As a perhaps more compelling example, if a user accesses an image by paying $10.00 to reach and click on a thumbnail, the system designer is more likely to assign a higher weighting factor to this access path than to an image accessed for free. Conversely, certain traffic sources, search terms, tag queries, or other precursors to displaying thumbnails may be determined to be associated with motivations inconsistent with higher interest, and thus the system designer may assign a relatively lower weighting factor to such access paths.

另外,元数据处理逻辑118可以将媒体对象的发布者与用户(例如,输入搜索查询的用户)之间的关系考虑到分数中。用户可以例如是被允许访问发布者的媒体对象的私有群组的成员(例如,朋友和家庭成员,兴趣群组或者社交网络),或者在发布者的联系人列表中列出的用户。假定这种用户和发布者之间的兴趣相似性相对于其他用户可能较高,则这种关系可以被加权,并且被加到兴趣分数中来增大该分数。Additionally, metadata processing logic 118 may factor relationships between the publisher of a media object and the user (e.g., the user who entered the search query) into the score. The user may, for example, be a member of a private group (e.g., friends and family members, an interest group, or a social network) that is allowed to access the publisher's media objects, or a user listed in the publisher's contact list. Given that the similarity of interests between such a user and the publisher is likely high relative to other users, such a relationship may be weighted and added to the interest score to increase it.

上面的功能是“个性化”兴趣分数的示例。一般而言,该分数可以是基于媒体对象的兴趣分数的请求者的身份的。(如这里所使用的,对“请求者”或者对请求媒体对象的兴趣分数或者排名的人的指代是指,明确请求该分数或者通过任何访问模式采取行动(例如,输入搜索查询)的人,该访问模式导致呈现媒体对象,并且由元数据处理逻辑118计算兴趣分数,而不管该分数自身是否被提供给请求者。)具体而言,该分数可以是基于媒体对象的发布者和请求该分数的用户之间的关系的。The above functionality is an example of a "personalized" interest score. In general, the score can be based on the identity of the requester of the media object's interest score. (As used herein, reference to a "requester," or someone requesting a media object's interest score or ranking, refers to someone who explicitly requests the score or takes action through any access mode (e.g., entering a search query) that results in the media object being presented and the interest score being calculated by metadata processing logic 118, regardless of whether the score itself is provided to the requester.) Specifically, the score can be based on the relationship between the publisher of the media object and the user requesting the score.

在另一个实施例中,与用户相关联的媒体对象的个性化分数可以是基于由该用户和分数请求者指派了相同类型的元数据(例如,标签或喜好)的媒体对象的数目的。可以通过例如由用户指派元数据或者由用户公开从而可以使媒体对象与用户相关联。例如,假设第一用户和第二用户分别在他们的在线相册100和200中存储照片图像。第二用户可以搜索与特定标签相关联的图像。搜索引擎111可以返回存储在第一用户的影集中的图像。元数据处理逻辑118可以向该图像指派分数,该分数是第一和第二用户的影集中已被第一和第二用户共同指派为喜好或者被共同加标签的其他图像的数目的增函数,这是基于这种共享的行为充当了第二用户可能对第一用户的影集中第二用户尚未“加入喜好”或者加标签的图像尤其感兴趣的预报符的理论作出的。In another embodiment, the personalized score of a media object associated with a user can be based on the number of media objects that have been assigned the same type of metadata (e.g., tags or favorites) by both the user and the score requester. Media objects can be associated with users by, for example, assigning metadata or making it public. For example, assume a first user and a second user store photo images in their online photo albums 100 and 200, respectively. The second user can search for images associated with a particular tag. Search engine 111 can return images stored in the first user's album. Metadata processing logic 118 can assign a score to the image that is an increasing function of the number of other images in the first and second user's albums that the first and second users have jointly assigned as favorites or tagged, based on the theory that this shared behavior serves as a predictor that the second user may be particularly interested in images in the first user's album that the second user has not yet "favorited" or tagged.

在另一个实施例中,元数据处理逻辑118可以基于与媒体对象相关联的位置和与请求该媒体对象的分数的用户相关联的位置计算兴趣分数。例如,元数据处理逻辑118可以指示,如果与媒体对象相关联的位置被关联到特定用户的住所(例如,接近该用户的住所或者在与该用户的住所相同的地理区域中),或者被关联到与该用户具有预定关系的另一个用户的住所(例如,朋友或者家庭成员),或者被关联到自身与已被该用户指派了元数据(例如,加标签或者加入喜好)的阈值数目个媒体对象相关联的位置,则该用户对该媒体对象更感兴趣。In another embodiment, the metadata processing logic 118 may calculate an interest score based on a location associated with the media object and a location associated with the user requesting the score for the media object. For example, the metadata processing logic 118 may indicate that a user is more interested in a media object if the location associated with the media object is associated with a particular user's residence (e.g., near the user's residence or in the same geographic area as the user's residence), or is associated with the residence of another user with whom the user has a predetermined relationship (e.g., a friend or family member), or is associated with a location that is associated with a threshold number of media objects that have been assigned metadata by the user (e.g., tagged or favorited).

在后一情形中,例如,元数据处理逻辑118可以对于特定的用户,将该用户已将与华盛顿特区区域相关的大量图像指定为喜好这一事实正面考虑到华盛顿纪念碑的图像的兴趣分数中。这假设当向该图像指派元数据时,例如该图像的发布者或者通过标签字段或者分离的“位置”字段输入了该位置的另一个用户,已将指示华盛顿区域的位置的元数据与该纪念碑的图像相关联。In the latter case, for example, the metadata processing logic 118 may positively factor into the interest score of an image of the Washington Monument, for a particular user, the fact that the user has designated a large number of images related to the Washington, D.C. area as favorites. This assumes that metadata indicating a location in the Washington, D.C. area was associated with the image of the monument when the metadata was assigned to the image, e.g., by the publisher of the image or another user who entered the location via a tag field or a separate "location" field.

系统设计者也可以设置其他兴趣分数分量。例如,一些媒体对象可以作为不希望的媒体对象对待,因为它们包含令人讨厌的内容,例如,淫秽的图像或者竞争对手的产品的推广。系统设计者可以例如建立分数计算来将迄今积累的分数递减预定的分数抵消百分比,其中该预定的分数抵消百分比是指派给具有“黑名单”上的标签或者其他元数据的媒体对象的百分比。因为媒体对象可以被与多于一个列入黑名单的标签相关联,所以将分数抵消值挑选为与那些标签相关联的最大分数抵消值。System designers can also set other interest score components. For example, some media objects may be treated as undesirable because they contain objectionable content, such as obscene images or promotions of competitor products. System designers can, for example, set up a score calculation that decrements the previously accumulated score by a predetermined score offset percentage, where the predetermined score offset percentage is the percentage assigned to media objects with "blacklisted" tags or other metadata. Because a media object can be associated with more than one blacklisted tag, the score offset value is selected as the maximum score offset value associated with those tags.

还可以考虑另一个分数分量。例如,系统设计者可以建立分数计算来在从该媒体对象被发布起的时间上将迄今积累的分数递减预定的百分比。例如,该时间衰减可以使分数从发布日起每天递减2%。这种和其他手段可以被用来防止在利用兴趣自身对媒体对象进行的排序偏离结果时同时发生“正反馈环”,导致那些相同的媒体对象更频繁地被访问,从而不自然地增大它们的兴趣分数。Another score component can also be considered. For example, the system designer can set up the score calculation to decrement the accumulated score by a predetermined percentage over the time since the media object was published. For example, this time decay can cause the score to decrease by 2% per day starting from the publication date. This and other measures can be used to prevent a "positive feedback loop" from occurring when ranking media objects by interest itself deviates from the results, causing the same media objects to be visited more frequently, thereby artificially increasing their interest scores.

最终的兴趣分数可以被归一化,使得例如兴趣分数总是落在0和100或者0和1之间。实现归一化的一种方法是将真实分数值除以最大可能分数值。The final interest score may be normalized so that, for example, the interest score always falls between 0 and 100 or 0 and 1. One way to achieve normalization is to divide the true score value by the maximum possible score value.

搜索引擎111允许用户根据各种元数据对数据库中的媒体对象进行搜索。例如,用户可以在对该用户可访问的所有媒体对象中执行标签的布尔搜索。或者,用户可以执行对评论、注释、标题或者描述中的项目的全文布尔搜索。对一个用户可访问的媒体对象包括例如公共媒体对象、群组内的媒体对象、用户另一个私密群组的朋友/家庭成员或者成员的媒体对象、由该用户发布的所有媒体对象、或者用户定义的集合内的该用户的媒体对象。Search engine 111 allows users to search media objects in the database based on various metadata. For example, a user can perform a Boolean search for a tag in all media objects accessible to the user. Alternatively, a user can perform a full-text Boolean search for an item in a comment, annotation, title, or description. Media objects accessible to a user include, for example, public media objects, media objects within a group, media objects of friends/family members or members of another private group of the user, all media objects posted by the user, or media objects of the user within a user-defined collection.

从搜索返回的媒体对象可以根据兴趣被排名。例如,在一个实施例中,媒体服务器可以仅向搜索用户提供下述媒体对象来显示:兴趣分数大于预定阈值的媒体对象,或者预定数目个最高得分的媒体对象。The media objects returned from the search can be ranked according to interest. For example, in one embodiment, the media server can only provide the following media objects to the searching user for display: media objects with interest scores greater than a predetermined threshold, or a predetermined number of the highest-scoring media objects.

响应于利用标签的搜索,例如,统计引擎109可以(根据相关度度量)确定与搜索查询中的一个或多个标签(或者其他元数据)最相关的标签(或者其他元数据)。媒体服务器100可以将这些最相关的标签(或者其他元数据)与检索出的媒体对象一起返回给客户端处的用户。如果相关度计算导致两个相关标签(或者其他元数据)的集群,则与这两个集群相关联的媒体对象可以按照兴趣顺序被排名(并显示)。In response to a search using tags, for example, the statistics engine 109 can determine (based on a relevance metric) the tags (or other metadata) that are most relevant to one or more tags (or other metadata) in the search query. The media server 100 can return these most relevant tags (or other metadata) to the user at the client along with the retrieved media objects. If the relevance calculation results in two clusters of related tags (or other metadata), the media objects associated with these two clusters can be ranked (and displayed) in order of interest.

在广告上下文中,可以将广告与它们自己的元数据/关键字相关联,例如,对于意大利宾馆广告是“Rome Italy Hotels tourism travel”。基于那些相关的关键字,广告服务器122可以利用媒体服务器经由web服务器102提供的相关标签或者其他元数据的集合和相关度度量,来确定赞助该网站的哪些广告被与和相关标签的集合最接近匹配的预定义的元数据/关键字相关联。(广告服务器可以是网络112上的第三方服务器。)广告服务器可以向用户提供最接近匹配的广告显示。例如,广告服务器可以在用户的客户端计算机处提供意大利宾馆的广告来显示,并且显示来自媒体服务器的具有高度相关的标签“Rome ItalyHoneymoon”的图片。这样,广告服务器利用相关度度量和高度相关的标签的集合来提供与所显示的媒体对象最高度相关的广告。In the context of advertising, ads can be associated with their own metadata/keywords, such as "Rome Italy Hotels tourism travel" for an Italian hotel ad. Based on those associated keywords, ad server 122 can utilize a collection of related tags or other metadata and a relevance metric provided by the media server via web server 102 to determine which ads sponsoring the website are associated with predefined metadata/keywords that most closely match the collection of related tags. (The ad server can be a third-party server on network 112.) The ad server can then display the most closely matching ad to the user. For example, the ad server can display an ad for an Italian hotel on the user's client computer and a picture from the media server with the highly relevant tag "Rome Italy Honeymoon." In this way, the ad server utilizes the relevance metric and the collection of highly relevant tags to provide the ad that is most relevant to the displayed media object.

在另一个实施例中,响应于利用标签(或者其他元数据库)的搜索,统计引擎也可以防止指派了相关度分数落到相关度阈值之下的标签(或者其他元数据集合)的媒体对象作为搜索结果被返回。In another embodiment, in response to a search utilizing tags (or other metadata databases), the statistical engine may also prevent media objects assigned tags (or other metadata sets) whose relevance scores fall below a relevance threshold from being returned as search results.

将意识到,上面的清楚说明结合不同的功能单元描述了本发明的实施例。但是,应当清楚,在不脱离本发明的情况下,可以使用不同功能单元之间的任何功能分配。因此,对特定功能单元的指代仅应被看作对用于提供所述功能的适当装置的指代,而不是说明严格的逻辑或者物理结构或组织。It will be appreciated that the above description clearly describes embodiments of the present invention in conjunction with different functional units. However, it should be understood that any distribution of functionality between different functional units may be used without departing from the present invention. Therefore, references to specific functional units should only be considered as references to appropriate means for providing the functionality, rather than describing a strict logical or physical structure or organization.

可以以任何适当的形式实现本发明,包括硬件、软件、固件或者它们的任意组合。本发明的不同方面可以至少部分被实现为在一个或多个数据处理器和/或数字信号处理器上运行的计算机软件或固件。本发明实施例的元件和组件可以以任何合适的方式在物理上、功能上和逻辑上被实现。实际上,可以在单个单元中,在多个单元中实现功能,或者功能可以被实现为其他功能单元的一部分。这样,本发明可以被实现在单个单元中,或者可以在物理上和功能上分配在不同的单元和处理器之间。The present invention can be realized in any suitable form, including hardware, software, firmware or any combination thereof. Different aspects of the present invention can be implemented at least in part as computer software or firmware running on one or more data processors and/or digital signal processors. The elements and components of the embodiments of the present invention can be implemented physically, functionally and logically in any suitable manner. In fact, functions can be implemented in a single unit, in multiple units, or functions can be implemented as a part for other functional units. Like this, the present invention can be implemented in a single unit, or can be physically and functionally distributed between different units and processors.

尽管已结合一些实施例描述了本发明,但是不是要将本发明限制于这里所阐述的特定形式。相反,本发明的范围仅由权利要求书限制。另外,尽管可能看起来结合特定实施例描述了某个特征,但是本领域技术人员将认识到根据本发明可以对所描述的实施例的各种特征进行组合。此外,结合某个实施例描述的本发明的多个方面可以单独作为一个发明。Although the present invention has been described in conjunction with certain embodiments, it is not intended to limit the invention to the specific forms set forth herein. Rather, the scope of the invention is limited solely by the claims. Furthermore, although it may appear that a feature has been described in conjunction with a specific embodiment, those skilled in the art will recognize that various features of the described embodiments may be combined in accordance with the present invention. Furthermore, various aspects of the invention described in conjunction with an embodiment may stand alone as an invention.

此外,将意识到,在不脱离本发明的精神和范围的情况下,本领域技术人员可以进行修改和替换。本发明不由前述说明性细节限制,而是如权利要求书限定。Furthermore, it will be appreciated that modifications and substitutions may be made by those skilled in the art without departing from the spirit and scope of the present invention.The present invention is not to be limited by the foregoing illustrative details, but is to be defined by the claims.

Claims (43)

1.一种装置,包括:1. An apparatus comprising: 处理器;processor; 用于存储计算机指令的存储设备,所述指令包括:A storage device for storing computer instructions, the instructions including: 由所述处理器执行、用于提供至少一幅图像来显示给至少一个用户的逻辑;以及Logic executed by the processor for providing at least one image to be displayed to at least one user; and 由所述处理器执行、用于接受来自多个用户的关于所述至少一幅图像的输入的逻辑,其中,所述输入包括至少两种不同类型的元数据,其中用户输入的元数据包括标签;The processor executes logic for accepting input from multiple users regarding the at least one image, wherein the input includes at least two different types of metadata, and the metadata input by the users includes tags; 由所述处理器执行、用于利用至少一个元数据集合对图像进行搜索的逻辑;The logic executed by the processor for searching an image using at least one set of metadata; 由所述处理器执行、用于在所述搜索被执行之后,对从所述搜索返回的至少一幅图像进行排名的逻辑,所述用于排名的逻辑至少部分基于关于所述至少一幅图像的用户输入的元数据的量来对所述至少一幅图像进行排名,所述用于排名的逻辑还至少部分基于以下一项或两项来对所述至少一幅图像进行排名:第一元数据集合和第二元数据集合被共同指派给所述至少一幅图像的频度,与所述至少一幅图像相关的访问模式,所述访问模式包括所述用户访问所述至少一幅图像的路线;以及The processor executes logic for ranking at least one image returned from the search after the search is executed, the ranking logic ranking the at least one image based at least in part on the amount of metadata of user input about the at least one image, and the ranking logic ranking the at least one image also based at least in part on one or both of the following: the frequency with which a first metadata set and a second metadata set are jointly assigned to the at least one image, and access patterns associated with the at least one image, the access patterns including the route taken by the user to access the at least one image; and 由所述处理器执行、用于基于从至少一个用户输入的适当性来确定所述标签中的至少一个标签对于所述至少一幅图像的适当性的逻辑,其中所述排名是至少部分基于所述至少一个标签对于所述至少一幅图像的适当性的。The processor executes logic for determining the appropriateness of at least one of the tags for the at least one image based on appropriateness from at least one user input, wherein the ranking is at least partially based on the appropriateness of the at least one tag for the at least one image. 2.如权利要求1所述的装置,其中,所述至少两种不同类型的元数据被从包括下述项的元数据类型组中选出:标签、评论、描述、喜好和注释。2. The apparatus of claim 1, wherein the at least two different types of metadata are selected from a group of metadata types including: tags, comments, descriptions, preferences, and annotations. 3.如权利要求2所述的装置,其中,所述标签包括位置元数据。3. The apparatus of claim 2, wherein the tag includes location metadata. 4.如权利要求1所述的装置,其中,所述至少一幅图像包括视频图像。4. The apparatus of claim 1, wherein the at least one image comprises a video image. 5.如权利要求1所述的装置,其中,所述装置位于服务器中,并且每个用户与相应的客户端相关联。5. The apparatus of claim 1, wherein the apparatus is located in a server, and each user is associated with a corresponding client. 6.如权利要求1所述的装置,其中,所述用户输入的元数据被从包括下述项的元数据类型组中选出:标签、评论和注释。6. The apparatus of claim 1, wherein the user-input metadata is selected from a group of metadata types including tags, comments, and annotations. 7.如权利要求1所述的装置,其中,所述用于排名的逻辑还至少部分基于已对所述至少一幅图像指派了元数据的用户的数目来对图像进行排名。7. The apparatus of claim 1, wherein the ranking logic further ranks the images based at least in part on the number of users who have assigned metadata to the at least one image. 8.如权利要求7所述的装置,其中,所述用户输入的元数据是至少部分基于已将所述至少一幅图像指定为喜好的用户的数目的。8. The apparatus of claim 7, wherein the user-input metadata is at least partially based on the number of users who have designated the at least one image as a favorite. 9.如权利要求1所述的装置,其中,所述访问模式是至少部分基于对所述至少一幅图像的点入次数的。9. The apparatus of claim 1, wherein the access mode is at least partially based on the number of clicks on the at least one image. 10.如权利要求1所述的装置,其中,所述访问模式是至少部分基于对所述至少一幅图像的查看次数的。10. The apparatus of claim 1, wherein the access mode is at least partially based on the number of times the at least one image is viewed. 11.如权利要求1所述的装置,其中,所述用于排名的逻辑还根据与所述至少一幅图像相关的时间流逝来对所述至少一幅图像进行排名。11. The apparatus of claim 1, wherein the ranking logic further ranks the at least one image based on the passage of time associated with the at least one image. 12.如权利要求11所述的装置,其中,与所述至少一幅图像相关的所述时间流逝是自所述至少一幅图像被上传起的时间流逝。12. The apparatus of claim 11, wherein the elapsed time associated with the at least one image is the elapsed time since the at least one image was uploaded. 13.如权利要求1所述的装置,其中,所述用于排名的逻辑还至少部分基于所述至少一幅图像的发布者与发起所述搜索的用户之间的关系来对所述至少一幅图像进行排名。13. The apparatus of claim 1, wherein the ranking logic further ranks the at least one image based at least in part on the relationship between the publisher of the at least one image and the user who initiated the search. 14.如权利要求1所述的装置,其中所述计算机指令还包括统计逻辑,用于至少部分基于所述第一元数据集合和所述第二元数据集合被共同指派给相同图像的频度来确定度量。14. The apparatus of claim 1, wherein the computer instructions further include statistical logic for determining a metric based at least in part on the frequency with which the first metadata set and the second metadata set are jointly assigned to the same image. 15.如权利要求14所述的装置,其中,所述统计逻辑是用于以下操作的逻辑:15. The apparatus of claim 14, wherein the statistical logic is logic for the following operations: 基于与多幅图像相关联的多个元数据的度量确定相关元数据的集群;以及Clusters of relevant metadata are determined based on metrics associated with multiple images; and 对每个集群内的相关联的多幅图像进行排名。Rank the associated multiple images within each cluster. 16.如权利要求14所述的装置,其中,所述第一元数据和第二元数据是标签。16. The apparatus of claim 14, wherein the first metadata and the second metadata are tags. 17.如权利要求14所述的装置,其中,所述统计逻辑是用于如果所确定的度量超过阈值相关度值则将所述第一元数据和第二元数据作为相关元数据提供以用于显示的逻辑。17. The apparatus of claim 14, wherein the statistical logic is configured to provide the first metadata and the second metadata as relevant metadata for display if the determined metric exceeds a threshold relevance value. 18.如权利要求14所述的装置,其中所述计算机指令包括用于至少部分基于来自至少一个用户的适当性输入来确定所述第一元数据和第二元数据对所述图像的适当性的逻辑,其中所述统计逻辑是用于至少部分基于所确定的适当性来改变所述度量的逻辑。18. The apparatus of claim 14, wherein the computer instructions include logic for determining the suitability of the first metadata and the second metadata for the image based at least in part on suitability input from at least one user, wherein the statistical logic is logic for changing the metric based at least in part on the determined suitability. 19.如权利要求1所述的装置,其中所述计算机指令包括用于确定在预定时间段中至少一个特定元数据集合被指派给多幅图像的频度的逻辑。19. The apparatus of claim 1, wherein the computer instructions include logic for determining the frequency at which at least one particular set of metadata is assigned to multiple images within a predetermined time period. 20.如权利要求19所述的装置,其中,所述至少一个元数据集合包括至少一个标签。20. The apparatus of claim 19, wherein the at least one metadata set includes at least one tag. 21.如权利要求1所述的装置,其中所述计算机指令包括用于确定至少一个特定元数据集合被指派给的图像的数目的逻辑。21. The apparatus of claim 1, wherein the computer instructions include logic for determining the number of images to which at least one particular set of metadata is assigned. 22.一种方法,包括:22. A method comprising: 由计算设备提供至少一幅图像来显示给至少一个用户;At least one image is provided by a computing device to be displayed to at least one user; 由所述计算设备接受来自多个用户的关于所述至少一幅图像的输入,其中,所述输入包括至少两种不同类型的元数据,并且其中用户输入的元数据包括标签;The computing device accepts input from multiple users regarding the at least one image, wherein the input includes at least two different types of metadata, and wherein the metadata input by the users includes tags; 由所述计算设备利用至少一个元数据集合对图像进行搜索;The computing device searches the image using at least one set of metadata; 在所述搜索被执行之后,由所述计算设备对从所述搜索返回的至少一幅图像进行排名,所述排名包括:至少部分基于关于所述至少一幅图像的用户输入的元数据的量来对所述至少一幅图像进行排名,还至少部分基于以下一项或两项来对所述至少一幅图像进行排名:第一元数据集合和第二元数据集合被共同指派给所述至少一幅图像的频度,与所述至少一幅图像相关的访问模式,所述访问模式包括所述用户访问所述至少一幅图像的路线;以及After the search is executed, the computing device ranks at least one image returned from the search, the ranking comprising: ranking the at least one image at least partially based on the amount of metadata input by the user about the at least one image, and ranking the at least one image at least partially based on one or both of the following: the frequency with which a first metadata set and a second metadata set are jointly assigned to the at least one image, and access patterns associated with the at least one image, the access patterns including the routes taken by the user to access the at least one image; and 由所述计算设备基于从至少一个用户输入的适当性来确定所述标签中的至少一个标签对于所述至少一幅图像的适当性,其中所述排名是至少部分基于所述至少一个标签对于所述至少一幅图像的适当性的。The computing device determines the appropriateness of at least one of the tags for the at least one image based on appropriateness from at least one user input, wherein the ranking is at least in part based on the appropriateness of the at least one tag for the at least one image. 23.如权利要求22所述的方法,其中,所述至少两种不同类型的元数据被从包括下述项的元数据类型组中选出:标签、评论、描述、喜好和注释。23. The method of claim 22, wherein the at least two different types of metadata are selected from a group of metadata types including: tags, comments, descriptions, preferences, and annotations. 24.如权利要求22所述的方法,其中,提供至少一幅图像和接受输入在服务器中发生,并且每个用户与相应的客户端相关联。24. The method of claim 22, wherein providing at least one image and accepting input occur in the server, and each user is associated with a corresponding client. 25.如权利要求22所述的方法,其中,所述用户输入的元数据被从包括下述项的元数据类型组中选出:标签、评论和注释。25. The method of claim 22, wherein the user-input metadata is selected from a group of metadata types including tags, comments, and annotations. 26.如权利要求22所述的方法,其中,排名还包括至少部分基于已对所述至少一幅图像指派了元数据的用户的数目来对图像进行排名。26. The method of claim 22, wherein the ranking further includes ranking the images based at least in part on the number of users who have assigned metadata to the at least one image. 27.如权利要求26所述的方法,其中,所述用户输入的元数据是至少部分基于已将所述至少一幅图像指定为喜好的用户的数目的。27. The method of claim 26, wherein the user-input metadata is at least partially based on the number of users who have designated the at least one image as a favorite. 28.如权利要求22所述的方法,其中,排名还包括根据与所述至少一幅图像相关的时间流逝来对所述至少一幅图像进行排名。28. The method of claim 22, wherein the ranking further includes ranking the at least one image based on the passage of time associated with the at least one image. 29.如权利要求22所述的方法,其中,排名还包括至少部分基于所述至少一幅图像的发布者与发起所述搜索的用户之间的关系来对所述至少一幅图像进行排名。29. The method of claim 22, wherein the ranking further includes ranking the at least one image based at least in part on the relationship between the publisher of the at least one image and the user who initiated the search. 30.如权利要求22所述的方法,还包括至少部分基于所述第一元数据集合和所述第二元数据集合被共同指派给相同图像的频度来确定度量。30. The method of claim 22, further comprising determining a metric based at least in part on the frequency with which the first metadata set and the second metadata set are jointly assigned to the same image. 31.如权利要求30所述的方法,其中,确定度量还包括:31. The method of claim 30, wherein determining the metric further comprises: 基于与多幅图像相关联的多个元数据的度量来确定相关元数据的集群;以及Clusters of relevant metadata are determined based on metrics of multiple metadata associated with multiple images; and 对每个集群内的相关联的多幅图像进行排名。Rank the associated multiple images within each cluster. 32.如权利要求30所述的方法,还包括如果所确定的度量超过阈值相关度值,则将所述第一元数据和第二元数据作为相关元数据提供以用于显示。32. The method of claim 30, further comprising providing the first metadata and the second metadata as relevant metadata for display if the determined metric exceeds a threshold relevance value. 33.一种包括计算机可执行程序的非暂时性计算机可读存储介质,所述计算机可执行程序由处理器执行以用于:33. A non-transitory computer-readable storage medium including a computer-executable program, said computer-executable program being executed by a processor to: 由所述处理器提供至少一幅图像以显示给至少一个用户;以及The processor provides at least one image to be displayed to at least one user; and 由所述处理器接受来自多个用户的关于所述至少一幅图像的输入,其中,所述输入包括至少两种不同类型的元数据,并且其中用户输入的元数据包括标签;The processor accepts input from multiple users regarding the at least one image, wherein the input includes at least two different types of metadata, and wherein the metadata input by the users includes tags; 由所述处理器利用至少一个元数据集合对图像进行搜索;The processor searches the image using at least one set of metadata; 在所述搜索被执行之后,由所述处理器对从所述搜索返回的至少一幅图像进行排名,所述排名包括:至少部分基于关于所述至少一幅图像的用户输入的元数据的量来对所述至少一幅图像进行排名,还至少部分基于以下一项或两项来对所述至少一幅图像进行排名:第一元数据集合和第二元数据集合被共同指派给所述至少一幅图像的频度,与所述至少一幅图像相关的访问模式,所述访问模式包括所述用户访问所述至少一幅图像的路线;以及After the search is executed, the processor ranks at least one image returned from the search, the ranking comprising: ranking the at least one image at least partially based on the amount of metadata input by the user about the at least one image, and ranking the at least one image at least partially based on one or both of the following: the frequency with which a first metadata set and a second metadata set are jointly assigned to the at least one image, and access patterns associated with the at least one image, the access patterns including the routes taken by the user to access the at least one image; and 由所述处理器基于从至少一个用户输入的适当性来确定所述标签中的至少一个标签对于所述至少一幅图像的适当性,其中所述排名是至少部分基于所述至少一个标签对于所述至少一幅图像的适当性的。The processor determines the appropriateness of at least one of the tags for the at least one image based on appropriateness from at least one user input, wherein the ranking is at least in part based on the appropriateness of the at least one tag for the at least one image. 34.如权利要求33所述的非暂时性计算机可读存储介质,其中,所述至少两种不同类型的元数据被从包括下述项的元数据类型组选出:标签、评论、描述、喜好和注释。34. The non-transitory computer-readable storage medium of claim 33, wherein the at least two different types of metadata are selected from a group of metadata types including tags, comments, descriptions, preferences, and annotations. 35.如权利要求33所述的非暂时性计算机可读存储介质,其中,提供至少一幅图像和接受输入发生在服务器中,并且每个用户与相应的客户端相关联。35. The non-transitory computer-readable storage medium of claim 33, wherein providing at least one image and receiving input occur in a server, and each user is associated with a corresponding client. 36.如权利要求33所述的非暂时性计算机可读存储介质,其中,所述用户输入的元数据被从包括下述项的元数据类型组中选出:标签、评论和注释。36. The non-transitory computer-readable storage medium of claim 33, wherein the user-input metadata is selected from a group of metadata types including tags, comments, and annotations. 37.如权利要求33所述的非暂时性计算机可读存储介质,其中,排名还包括至少部分基于已对所述至少一幅图像指派了元数据的用户的数目来对图像进行排名。37. The non-transitory computer-readable storage medium of claim 33, wherein the ranking further includes ranking the images based at least in part on the number of users who have assigned metadata to the at least one image. 38.如权利要求37所述的非暂时性计算机可读存储介质,其中,所述用户输入的元数据是至少部分基于已将所述至少一幅图像指定为喜好的用户的数目的。38. The non-transitory computer-readable storage medium of claim 37, wherein the user-input metadata is based at least in part on the number of users who have designated the at least one image as a favorite. 39.如权利要求33所述的非暂时性计算机可读存储介质,其中,排名还包括根据与所述至少一幅图像相关的时间流逝来对所述至少一幅图像进行排名。39. The non-transitory computer-readable storage medium of claim 33, wherein the ranking further includes ranking the at least one image based on the passage of time associated with the at least one image. 40.如权利要求33所述的非暂时性计算机可读存储介质,其中,排名还包括至少部分基于所述至少一幅图像的发布者与发起所述搜索的用户之间的关系来对所述至少一幅图像进行排名。40. The non-transitory computer-readable storage medium of claim 33, wherein the ranking further includes ranking the at least one image based at least in part on the relationship between the publisher of the at least one image and the user who initiated the search. 41.如权利要求33所述的非暂时性计算机可读存储介质,其中,所述计算机可执行程序还用于:至少部分基于所述第一元数据集合和所述第二元数据集合被共同指派给相同图像的频度来确定度量。41. The non-transitory computer-readable storage medium of claim 33, wherein the computer-executable program is further configured to: determine a metric based at least in part on the frequency with which the first metadata set and the second metadata set are jointly assigned to the same image. 42.如权利要求41所述的非暂时性计算机可读存储介质,其中,确定度量还包括:42. The non-transitory computer-readable storage medium of claim 41, wherein determining the metric further comprises: 基于与多幅图像相关联的多个元数据的度量确定相关元数据的集群;以及Clusters of relevant metadata are determined based on metrics associated with multiple images; and 对每个集群内的相关联的多幅图像进行排名。Rank the associated multiple images within each cluster. 43.如权利要求41所述的非暂时性计算机可读存储介质,其中,所述计算机可执行程序还用于:如果所确定的度量超过阈值相关度值,则将所述第一元数据和第二元数据作为相关元数据提供以用于显示。43. The non-transitory computer-readable storage medium of claim 41, wherein the computer-executable program is further configured to: provide the first metadata and the second metadata as relevant metadata for display if the determined metric exceeds a threshold relevance value.
HK14111500.4A 2005-04-21 2014-11-13 Media object metadata association and ranking HK1197945B (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US60/674,109 2005-04-21
US11/350,635 2006-02-08

Publications (2)

Publication Number Publication Date
HK1197945A HK1197945A (en) 2015-02-27
HK1197945B true HK1197945B (en) 2021-01-15

Family

ID=

Similar Documents

Publication Publication Date Title
CN103902656B (en) Media object metadata association and ranking
US10216763B2 (en) Interestingness ranking of media objects
US11681736B2 (en) System and method for tagging a region within a frame of a distributed video file
US9971842B2 (en) Computerized systems and methods for generating a dynamic web page based on retrieved content
US9588992B2 (en) Displaying images interesting to a user
US9805123B2 (en) System and method for data privacy in URL based context queries
US8060492B2 (en) System and method for generation of URL based context queries
US20100125604A1 (en) System and method for url based query for retrieving data related to a context
CN104217030B (en) A kind of method and apparatus that user's classification is carried out according to server search daily record data
TW201243632A (en) Search assistant system and method
AU2013263220A1 (en) Knowledge panel
Tjondronegoro et al. A study and comparison of multimedia Web searching: 1997–2006
HK1197945B (en) Media object metadata association and ranking
HK1197945A (en) Media object metadata association and ranking
HK1127660A (en) Media object metadata association and ranking
van Zwol et al. Multimedia information retrieval: " new challenges in audio visual search"
HK1212483B (en) Tagged search result maintenance