[go: up one dir, main page]

EP2052336A1 - Improved storage performance - Google Patents

Improved storage performance

Info

Publication number
EP2052336A1
EP2052336A1 EP07801606A EP07801606A EP2052336A1 EP 2052336 A1 EP2052336 A1 EP 2052336A1 EP 07801606 A EP07801606 A EP 07801606A EP 07801606 A EP07801606 A EP 07801606A EP 2052336 A1 EP2052336 A1 EP 2052336A1
Authority
EP
European Patent Office
Prior art keywords
pieces
digital object
nodes
node
network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP07801606A
Other languages
German (de)
French (fr)
Inventor
John Sullivan
David Ferguson
Jarrod Lowe
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alcatel Lucent SAS
Original Assignee
Velocix Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from GB0615964A external-priority patent/GB2440760A/en
Application filed by Velocix Ltd filed Critical Velocix Ltd
Publication of EP2052336A1 publication Critical patent/EP2052336A1/en
Withdrawn legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/104Peer-to-peer [P2P] networks
    • H04L67/1074Peer-to-peer [P2P] networks for supporting data block transmission mechanisms
    • H04L67/1078Resource delivery mechanisms
    • H04L67/108Resource delivery mechanisms characterised by resources being split in blocks or fragments
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/17Details of further file system functions
    • G06F16/172Caching, prefetching or hoarding of files
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/18File system types
    • G06F16/182Distributed file systems
    • G06F16/1834Distributed file systems implemented based on peer-to-peer networks, e.g. gnutella
    • G06F16/1837Management specially adapted to peer-to-peer storage networks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/104Peer-to-peer [P2P] networks
    • H04L67/1061Peer-to-peer [P2P] networks using node-based peer discovery mechanisms
    • H04L67/1068Discovery involving direct consultation or announcement among potential requesting and potential source peers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/56Provisioning of proxy services
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/56Provisioning of proxy services
    • H04L67/568Storing data temporarily at an intermediate stage, e.g. caching
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/104Peer-to-peer [P2P] networks

Definitions

  • the invention relates to a computer network having a plurality of nodes in which the access of digital objects at a nodes is optimised and a method for optimising the access of the digital objects at a node.
  • Caches for the intermediate storage of data transferred about the Internet are known in the art.
  • the most common type of cache used in the Internet is a proxy cache.
  • the proxy cache operates at the application level, passing some messages unaltered between a client and a server, changing other ones of the messages and sometimes responding to the messages itself rather than relaying the messages.
  • a web proxy cache sits between web servers and one or more clients and watches requests for HTML pages, music or audio files, video files, image files .and data files (collectively known as digital objects) pass through.
  • the web proxy cache saves a copy of the HTML pages, images and files for itself. Subsequently if there is another request for the same object, the web proxy cache will use the copy that was saved instead of asking an origin server to resend the request.
  • proxy caches are used: i) In order to reduce latency - in this case, the request is satisfied from the proxy cache (which is closer to the client) instead of the origin server. It therefore takes less time for the client to get the object and display the object. This makes web sites seem more responsive to the client. ii) To reduce traffic - Each object is only retrieved once from the server once, the proxy cache reduces the amount of bandwidth used by an Internet Service Provider to the outside world and by the client. This saves money if the client is paying for the traffic and keeps the client's bandwidth requirements lower and more manageable. iii) To increase delivery speed.
  • the proxy caches may be provided by an Internet Service Provider at an access point and can continually store digital objects accessed by the ISP customers.
  • Caches generally have both a fast access solid state memory and disk memory. It is known that the access time to the disk memory is substantially slower than the access time to the solid state memory. This is because access to data on the disk memory requires the mechanical movement of a reading head.
  • a cache may have some local memory (solid state and/or disk) but may also have access to remote memory (solid state and/or disk). Accessing remote memory on a remote machine is also more expensive than accessing memory on the immediate machine
  • a peer-to-peer (also termed P2P) computer network is a network that relies primarily on the computing power and bandwidth of the participants in the computer network rather than concentrating computing power and bandwidth in a relatively low number of servers.
  • P2P computer networks are typically used for connecting nodes of the computer network via largely ad hoc connections.
  • the P2P computer network is useful for many purposes. Sharing content files containing, for example, audio, video and data is very common. Real time data, such as telephony traffic, is also passed using the P2P network.
  • a pure P2P network does not have the notion of clients or servers, but only equal peer nodes that simultaneously function as both "clients” and “servers” to the other nodes on the network.
  • This model of network arrangement differs from the client-server model in which communication is usually to and from a central server.
  • a typical example for a non P2P file transfer is an FTP server where the client and server programs are quite distinct. In the FTP server clients initiate the download/uploads and the servers react to and satisfy these requests from the clients.
  • Some networks and channels such as Napster, OpenNAP, or IRC @find, use a client-server structure for some tasks (e.g., searching) and a P2P structure for other tasks.
  • Networks such as Gnutella or Freenet use the P2P structure for all purposes, and are sometimes referred to as true P2P networks, although Gnutella is greatly facilitated by directory servers that inform peers of the network addresses of other peers.
  • BitTorrent One of the most popular file distribution programmes used in P2P networks is currently BitTorrent which was created by Bram Cohen. BitTorrent is designed to distribute large amounts of data widely without incurring the corresponding consumption in costly server and bandwidth resources.
  • BitTorrent files contain an "announce” section, which specifies the URL of a tracker, and an "info” section which contains (suggested) names for the files, their lengths, the piece length used, and a SHA-I hash code for each piece, which clients should use to verify the integrity of the data they receive.
  • the tracker is a server that keeps track of which seeds (i.e. a node with the complete file or group of files) and peers (i.e. nodes that do not yet have the complete file or group of files) are in a swarm (the expression for all of the seeds and peers involved in the distribution of a single file or group of files).
  • Nodes report information to the tracker periodically and from time-to-time request and receive information about other nodes to which they can connect.
  • the tracker is not directly involved in the data transfer and is not required to have a copy of the file.
  • Nodes that have finished downloading the file may also choose to act as seeds, i.e. the node provides a complete copy of the file.
  • a link to the torrent file is placed on a website or elsewhere, and it is normally registered with the tracker.
  • BitTorrent trackers maintain lists of the nodes currently participating in each torrent.
  • the computer with the initial copy of the file is referred to as the initial seeder.
  • users navigate to a site listing the torrent, download the torrent, and open the torrent in a BitTorrent client stored on their local machines. After opening the torrent, the BitTorrent client connects to the tracker, which provides the BitTorrent client with a list of clients currently downloading the file or files.
  • the BitTorrent protocol breaks down files into a number of much smaller pieces, typically a quarter of a megabyte (256 KB) in size. Larger file sizes typically have larger pieces. For example, a 4.37 GB file may have a piece size of 4 MB (4096 KB). The pieces are checked as they are received by the BitTorrent client using a hash algorithm to ensure that they are error free.
  • peers may download pieces in a random order and may prefer to download the pieces that are rarest amongst its peers, to increase the opportunity to exchange data. Exchange of data is only possible if two peers have different subsets of the file. It is known, for example, in the BitTorrent protocol that a peer initially joining the swarm will send to other members of the swarm a content availability message in the form of a BitField message which indicates an initial set of pieces of the digital object which the peer has available for download by other ones of the peers. On receipt of further ones of the pieces, the peer will send further content availability messages in the form of Have messages to the other peers to indicate that the further ones of the pieces are available for download.
  • US-patent application No. US 2004/0148344 (Serenade Systems, Mountain View, California) 10 teaches a hierarchical cached media distribution system for the delivery of digital objects.
  • This US patent application of Serenade teaches a so-called push mechanism for distributing a list of completed files available on one of the peers to another one of the peers in a P2P network. The peer receiving the list will respond to the sending peer if it lacks packets from the listed files.
  • a peer in the Serenade system which 15 receives the file list but already has a complete settled package for each of the listed files will respond with the file name and packet number for other ones of the packets that the peer needs. Should the content pushing peer have the requested packets, the retrieval process is initiated.
  • the network has at least a first node for storing pieces of a digital object and has a storage. There are a plurality of connections between the first node and the plurality of nodes.
  • the first node has a requester for requesting one or more pieces of the digital object, a transmitter for transferring one or more pieces of the digital object to at least one further node among the plurality of nodes, a receiver for receiving content availability messages from at least some of the plurality of nodes and a sender for sending at least one limited content availability message to at least one further node among the plurality of nodes.
  • the at least one limited content availability message indicates a subset but not all of the pieces of the digital object available through the first node.
  • the content availability messages in effect defers the delivery of full information about those pieces of the digital object which are currently present or available through the storage.
  • the limited content availability messages preferably indicate those pieces of the digital object preferentially transferable by the transmitter to another node (peer) in the network. Such preferential transfer would be because the piece is, for example, in the fast access memory (i.e. in the solid state memory) or on local disk.
  • the nodes can be either other peers or caches, such as proxy caches.
  • Logic can be incorporated into the network to decide which ones of the pieces of the digital object should be preferentially transferred.
  • the invention can be employed in a proxy cache and also in a peer in a peer-to-peer network.
  • the object of the invention could also be solved by providing a method of providing pieces of digital object from a first node to a plurality of nodes in a computer network.
  • the method comprises the following steps: - a first step of sending to at least one further node among the plurality of nodes at least one limited content availability message, whereby the at least one limited content availability message indicates a subset but not all of the pieces of the digital object available through the first node; - a second step of receiving a request for pieces of the digital object from at least one of the plurality of nodes;
  • Fig. 1 shows a first embodiment of the invention with a proxy cache.
  • Fig. 2 shows the access of pieces of the digital object through the proxy cache in a P2P network.
  • Fig. 3 shows a second embodiment of the invention in which multiple peers are connected to a cache.
  • Fig. 4 is a block diagram illustrating an exemplary embodiment of the modules of a proxy
  • Fig. 1 shows an example of a peer-to-peer network 10 with a plurality of peers 20a, 20b, 20c and 2Od - collectively referred to as peers 20 — connected to each other.
  • the peers 20 are connected to a memory 22 which includes both a fast storage system (e.g. solid state memory or RAM) 23 and a slow storage system, e.g. a disk 24.
  • a fast storage system e.g. solid state memory or RAM
  • a slow storage system e.g. a disk 24.
  • FIG. 1 only one memory 22 is shown. It will, however, be known to the skilled person_that each-of the peers 20 will probably have the memory 22.
  • the peer-to-peer network 10 includes in one embodiment of the invention one or more proxy cache, such as, proxy cache 30a and 30b - collectively referred to as proxy cache 30, a central server 60 and a tracker 70. At least one of the peers 20a is attached to at least one proxy cache 30. In the embodiment of the invention shown in Fig. 1 two proxy caches 30a, 30b are shown. However, this is not limiting of the invention and the invention could use none, one or more than one proxy cache 30.
  • the invention is described with respect to one protocol - the BitTorrent protocol - that has been described in the introduction. The invention is, however, not limited to the use of the BitTorrent protocol. Other protocols include similar messages to the BitField and Have messages, but name them differently.
  • the proxy caches 30 have both solid state memory 40 and a disk 50 as is shown in Fig. 2. It is known in the art that the access times for data stored on the disk 50 are substantially longer than the access times for data stored in the solid state memory 40. It is also possible that the proxy caches 30 have both local (immediate) memory and access to remote memory. Accessing data from the local memory is substantially faster than accessing data from the remote memory.
  • the peer 20a requests a digital object from the peer-to-peer network 10.
  • the peer 20a does this by downloading a file from a central server 60 containing meta- information relating to the digital object.
  • the meta-information includes the address of a tracker 70
  • the tracker 70 provides a list of peers 20 in the swarm.
  • the peer 20a issues a request to the other peers 20 in the swarm indicating an interest to download the available pieces of the digital object.
  • a BitField message will initially indicate that the peer 20a has no pieces of the digital object to contribute.
  • Other BitField messages are received from all of the peers 20 of the swarm.
  • Each of the peers 20 indicates the pieces of the digital object available with each of the peers 20.
  • the peer 20a indicates to the peers 20 of the pieces of the digital object that the peer 20a wishes to download. After downloading, the peer 20a sends a Have message to the members of the swarm which indicate the presence of the downloaded pieces of the digital object available with the peer 20a.
  • the memory 22 has one or more pieces of the digital object stored either in the solid state memory 23 or on the disk 24.
  • the peer 2Od would issue a content availability message (BitField message or Have message) to inform the peer 20a which pieces are available from the peer 2Od.
  • Some of the pieces are quickly accessible from the fast access memory in the form of the solid state memory 23.
  • Other ones of the pieces are accessible more slowly because they are stored on the disk 24.
  • a limited content availability message is sent which does not indicate all of the available pieces of the digital object, but a limited subset of the pieces. This limited subset of the pieces will be the pieces of the digital object which the peer 2Od is prepared to transfer to the peer 20a.
  • the limited subset of pieces would, in one embodiment of the invention, be those pieces of the digital object currently stored in the fast access portion of the memory 22.
  • the limited subset of pieces might also be the most requested pieces or those pieces which the peer 2Od will soon retrieve from the disk 22 (or has recently retrieved from the disk 22 because the piece was offered to another one of the peers 20).
  • the members of the limited subset of pieces could change with time as the pieces are stored to and retrieved from the disk 22.
  • the limited subset of pieces could also include all of the pieces of the digital object currently stored in the memory 22 of the peer 2Od and indeed ultimately might be all of the pieces that make up the digital object.
  • the peer 2Od continually offers to other peers 20 the pieces of the digital object which the requesting peer 20a does not currently have (as will be indicated by the content availability messages received from the requesting peer 20a. This ensures that the requesting peer 20a will offer the peer 2Od pieces of the digital object. It should also be noted that, if the peer 2Od has offered the piece to the requesting peer 20a, then it is advantageous to offer the piece to other ones of the peers 20 at this stage, since this would be the most efficient way of supplying the piece to the other members of the swarm.
  • the proxy cache 30 (when present) can be utilised to improve the transfer of the pieces of the digital object. All of the messages in this embodiment of the invention are communicated through the proxy cache 30. In the example illustrated in Fig. 1, two proxy caches 30a and 30b are shown and the messages are transferred through both of these. The received pieces of the digital object are transferred back to the requesting peer 20a through the proxy cache 30. In this embodiment, the pieces of the . digital object are also stored in the proxy cache 30. The stored pieces of the digital object are then transferred to the requesting peer 20a.
  • the proxy cache 30 stores the pieces of the digital object initially in a solid state memory 40 (fast-access memory) and subsequently stores the pieces of the digital object to the disk 50.
  • the proxy cache 30 may comprise more than one cache 30a, 30b as is indicated in Fig. 1.
  • different pieces of the digital object are stored on different ones of the proxy caches 30, such as the proxy caches 30a and 30b. So, for example, all "even-numbered" pieces of the digital object may be stored on the proxy cache 30a and all "odd-numbered" pieces of the digital object may be stored on the proxy cache 30b.
  • This method has the advantage that the storage on the proxy cache 30 is balanced between the two proxy caches 30a and 30b.
  • every third piece may be stored on the same one of the proxy caches 30.
  • two of the three caches may store copies of the same pieces.
  • a first one of the proxy caches 30 would have pieces 0 and 1
  • a second one of the proxy caches may store pieces 1 and 2
  • a third one of the proxy caches 30 may store pieces 2 and 0.
  • the proxy cache 30 has some stored pieces of the digital object in the solid- state memory 40 and some pieces of the digital object on the disk 50.
  • the peers 20b-d are outside of the Internet Service Provider, then it will be more efficient to supply the pieces from the disk 50 than from the peers 20b-d.
  • the BitField messages sent by the proxy cache 30 (acting as a peer) would include all of the possible pieces stored in the proxy cache 30 and/or the pieces available on peers 20b-d.
  • the proxy cache 30 may also in the prior art send Have messages indicating newly acquired pieces to the other peers 20 in the swarm.
  • This arrangement of the caches is particularly suitable for clusters of caches, for example at the edge of an ISP's network.
  • the logic of the proxy cache 30 is illustrated in Fig. 4. It should be understood that the blocks shown in Fig. 4 represent an exemplary embodiment of the logic implemented. It is not intended that the separate components are provided. It should be understood that these blocks may be implemented in an appropriate manner including virtual implementation.
  • the proxy cache 30 includes a sender 32 for sending content availability messages to the other peers 20.
  • the content availability messages include messages such as the BitField message and the Have message. In an embodiment of the invention, the content availability messages will be modified in a particular manner.
  • the proxy cache 30 also includes a receiver 34 for receiving content availability messages from the other peers 20.
  • the proxy cache 30 further includes a requester 36 for requesting one or more pieces of the digital object from at least one of the peers 20 and a transmitter 38 for transferring one or more pieces of the digital object to at least one of the peers 20.
  • the proxy cache 30 informs the other peers 20 of the swarm that the proxy cache 30 only has a limited number of pieces of the digital object (whereas, in fact, the proxy cache 30 has more pieces of the digital object).
  • This information is communicated in the form of a modified BitField message or Have message to the other peers 20 (i.e. a modified content availability message from sender 32). Therefore, the other peers 20 will only request from the proxy cache 30 the pieces of the digital object which the other peers 20 consider the proxy cache 30 to have.
  • the proxy cache 30 may ensure that the pieces of the digital object are in the solid state memory 40 and thus fast accessible. This reduces the number of disk accesses which not only improves the access speed but also reduces the risk of failure of the disk 50.
  • the proxy cache 30 can be proactive by examining the BitField and Have messages received from the other peers 20 to determine the pieces of the digital object most likely to be requested. If it is identified that a particular piece is likely to be requested and is present on the disk 50, the proxy cache 30 may retrieve the likely request piece and load the likely requested piece into the solid state memory 40 in order to allow fast access in the future. The proxy cache 30 can thereafter issue the Have message or the BitField message to the other peers 20 informing them of the availability of the piece.
  • the proxy cache 30 uses the received BitField and Have messages to determine the pieces most likely to be requested and may actively seek the most likely to be requested pieces from other peers 20. On retrieval, the requested pieces are held in the solid state memory 40 (as well as being stored to disk 50) to allow fast retrieval by other peers 20 - in particular the peers 20 connected to the proxy cache 30. So, for example, the proxy cache 30 may know that some of the pieces are obtainable from one of the peers 20 and these pieces are requested. The proxy cache 30 can issue a Have message to tell the other peers 20 that the requested piece is, in fact, present in the proxy cache 30 when the piece has yet to be loaded. The proxy cache 30 is likely to receive requests from the other peers 20 for the requested piece which the proxy cache 30 can fulfil once the proxy cache downloads the requested piece.
  • Fig 2 illustrates the proxy cache 30a with the disk 50 and the solid state memory 40 storing only odd-numbered pieces, such as
  • the peer 20a already has pieces 1-5 and still requires pieces 6-20.
  • the peer 20b has pieces 6-10 and 16-20.
  • the proxy cache 30a has pieces 7 and 9 in the solid state memory 40. It therefore sends a BitField (BF) and/or a Have (H) message to the peer 20a informing the peer 20a that the proxy cache 30a only has pieces 7 and 9.
  • the proxy cache 30a does not at this stage inform the peer 20a that the proxy cache 30a has pieces 11, 13 and 15 since these are stored on the disk 50 and as a result are not fast accessible.
  • the peer 20b sends a BitField message indicating that the peer 20b has pieces 6-10 and 16-20.
  • the proxy cache 30a does not need pieces 6-10 and, therefore, ignores this part of the BitField message.
  • the proxy cache 30a requires the pieces 17 and 19 (it will be recalled that the proxy cache 30a. in this example, does not store even-numbered pieces).
  • the proxy cache 30a sends a Require message to the peer 20b to retrieve the pieces 17 and 19.
  • the proxy cache 30a sends a Have message indicating that the proxy cache 30a has pieces 17 and 19 available.
  • the pieces 17 and 19 should be kept in solid state memory 40 to allow fast access as it is highly likely that the peers 20, not having 17 and 19, will react to the Have message.
  • FIG. 3 Another embodiment of the invention is illustrated in Fig. 3 in which one or more peers 120a- 120c (collectively referred to as cache 120) are connected to a cache 130.
  • the cache 130 does not function as a proxy cache but rather downloads the pieces of the digital object from.another server 190 and uploads the pieces of the digital object from the peers 120.
  • the function of the cache 130 is described in co-pending UK patent 0616049.3.
  • the cache 130 also a disk 150 and a solid-state memory 140 for faster access. Similar to the embodiment described above ⁇ pieces of the digital object requested by the peers 120 may be stored in the solid-state memory 140 and/or the disk 150.
  • the cache 130 also has a similar modules as illustrated in Fig. 4. The cache 130 issues BitField or Have messages indicating the pieces of the digital object which the cache 130 has stored. The cache 130 will not inform the peers 120 about all of the available pieces of the digital object and preferentially inform about the pieces of the digital object which are available in the fast access solid state memory 140 rather than the pieces of the digital object which are stored on the disk 150.
  • the peers 120 in their requests, will request from the cache 130 only the pieces of the digital object which the cache 130 has indicated to be available (even if the cache 130 has only issued a limited subset of the available pieces).
  • the requests from the peers 120 will be served from the solid-state memory 140.
  • the cache 130 can analyse the BitFields or the Have messages sent from the peers 120 to analyse the pieces of the digital object which are most requested. If the most requested pieces of the digital object are on the disk 150, but not in the solid-state memory 140, the cache 130 can read the pieces of the digital object into the solid-state memory 140 and can then issue a Have message to indicate that the cache 130 now has the most requested pieces of the digital object. Subsequently, the cache 130 will serve any requests for access to the most requested pieces of the digital object.
  • the cache 130 will retain the recently acquired piece of the digital object in the solid-state memory 140 and issue a Have message to allow the peers 120 to download the piece from the cache 130.
  • the cache can also retrieve pieces of the digital object from both local memory and more distant memory in a further embodiment of the invention.
  • local memory is that memory which is in the same location as the cache.
  • Remote memory is memory that is at a more distant location and thus requires longer to access.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)
  • Computer And Data Communications (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A network (10), such as a computer network, is disclosed. The network (10) has a plurality of nodes (20; 120) and includes a first node (30, 130) for storing pieces of a digital object and having a storage (40, 50; 140, 150). A plurality of connections are present between the first node (30, 130) and the plurality of nodes (20; 120). The first node (30, 130) has a requester for requesting one or more pieces of the digital object, a transmitter for transferring one or more pieces of the digital object to at least one further node among the plurality of nodes (20; 120), a receiver for receiving content availability messages from at least some of the plurality of nodes, and a sender for sending a limited content availability message to at least one further node (20; 120) among the plurality of nodes (20, 120). The limited content availability message indicates a subset but not all of the pieces of the digital object available through the first node (30; 130). A method of providing pieces of digital object from a first node (30, 130) to a plurality of nodes (20, 120) in a network (10) is also disclosed which includes a first step of sending to at least one further node (20b, 120b) among the plurality of nodes the limited content availability message. In the second step a request for pieces of the digital object is received from at least one of the plurality of nodes (20, 120) and finally in the third step one or more pieces of the digital object are transferred to the at least one of the plurality of nodes (20,120).

Description

Description
FIELD OF THE INVENTION
The invention relates to a computer network having a plurality of nodes in which the access of digital objects at a nodes is optimised and a method for optimising the access of the digital objects at a node.
BACKGROUND TO THE INVENTION
Caches for the intermediate storage of data transferred about the Internet are known in the art. The most common type of cache used in the Internet is a proxy cache. The proxy cache operates at the application level, passing some messages unaltered between a client and a server, changing other ones of the messages and sometimes responding to the messages itself rather than relaying the messages. A web proxy cache sits between web servers and one or more clients and watches requests for HTML pages, music or audio files, video files, image files .and data files (collectively known as digital objects) pass through. The web proxy cache saves a copy of the HTML pages, images and files for itself. Subsequently if there is another request for the same object, the web proxy cache will use the copy that was saved instead of asking an origin server to resend the request.
There are three main reasons why proxy caches are used: i) In order to reduce latency - in this case, the request is satisfied from the proxy cache (which is closer to the client) instead of the origin server. It therefore takes less time for the client to get the object and display the object. This makes web sites seem more responsive to the client. ii) To reduce traffic - Each object is only retrieved once from the server once, the proxy cache reduces the amount of bandwidth used by an Internet Service Provider to the outside world and by the client. This saves money if the client is paying for the traffic and keeps the client's bandwidth requirements lower and more manageable. iii) To increase delivery speed. The proxy caches may be provided by an Internet Service Provider at an access point and can continually store digital objects accessed by the ISP customers. For example, CacheLogic, Cambridge, UK, provide solutions which can be used by ISPs and other to reduce their traffic. These solutions are documented briefly in the document "the Impact of P2P and the CacheLogic P2P Management Solution" (available 1 August 2006 at http://www.cachelogic.com/products/resource/Intro_CacheLogic_P2P_Mgmt_Solution_v3.0. pdf)
Caches generally have both a fast access solid state memory and disk memory. It is known that the access time to the disk memory is substantially slower than the access time to the solid state memory. This is because access to data on the disk memory requires the mechanical movement of a reading head. Alternatively, a cache may have some local memory (solid state and/or disk) but may also have access to remote memory (solid state and/or disk). Accessing remote memory on a remote machine is also more expensive than accessing memory on the immediate machine
One solution to speed up the access time would be to have solely solid state memory. However this is extremely expensive. Give the large sizes of the caches used in the ISPs the cost is likely to be prohibitive for many ISPs except for very special applications. It would therefore be advantageous to provide a management system to improve the access times.
A peer-to-peer (also termed P2P) computer network is a network that relies primarily on the computing power and bandwidth of the participants in the computer network rather than concentrating computing power and bandwidth in a relatively low number of servers. P2P computer networks are typically used for connecting nodes of the computer network via largely ad hoc connections. The P2P computer network is useful for many purposes. Sharing content files containing, for example, audio, video and data is very common. Real time data, such as telephony traffic, is also passed using the P2P network.
A pure P2P network does not have the notion of clients or servers, but only equal peer nodes that simultaneously function as both "clients" and "servers" to the other nodes on the network. This model of network arrangement differs from the client-server model in which communication is usually to and from a central server. A typical example for a non P2P file transfer is an FTP server where the client and server programs are quite distinct. In the FTP server clients initiate the download/uploads and the servers react to and satisfy these requests from the clients.
Some networks and channels, such as Napster, OpenNAP, or IRC @find, use a client-server structure for some tasks (e.g., searching) and a P2P structure for other tasks. Networks such as Gnutella or Freenet use the P2P structure for all purposes, and are sometimes referred to as true P2P networks, although Gnutella is greatly facilitated by directory servers that inform peers of the network addresses of other peers.
One of the most popular file distribution programmes used in P2P networks is currently BitTorrent which was created by Bram Cohen. BitTorrent is designed to distribute large amounts of data widely without incurring the corresponding consumption in costly server and bandwidth resources. To share a file or group of files through BitTorrent, clients first create a "torrent file". This is a small file which contains meta-information about the files to be shared and about the host computer (the "tracker") that coordinates the file distribution. Torrent files contain an "announce" section, which specifies the URL of a tracker, and an "info" section which contains (suggested) names for the files, their lengths, the piece length used, and a SHA-I hash code for each piece, which clients should use to verify the integrity of the data they receive.
The tracker is a server that keeps track of which seeds (i.e. a node with the complete file or group of files) and peers (i.e. nodes that do not yet have the complete file or group of files) are in a swarm (the expression for all of the seeds and peers involved in the distribution of a single file or group of files). Nodes report information to the tracker periodically and from time-to-time request and receive information about other nodes to which they can connect. The tracker is not directly involved in the data transfer and is not required to have a copy of the file. Nodes that have finished downloading the file may also choose to act as seeds, i.e. the node provides a complete copy of the file. After the torrent file is created, a link to the torrent file is placed on a website or elsewhere, and it is normally registered with the tracker.
BitTorrent trackers maintain lists of the nodes currently participating in each torrent. The computer with the initial copy of the file is referred to as the initial seeder. Using a web browser, users navigate to a site listing the torrent, download the torrent, and open the torrent in a BitTorrent client stored on their local machines. After opening the torrent, the BitTorrent client connects to the tracker, which provides the BitTorrent client with a list of clients currently downloading the file or files.
Initially, there may be no other peers in the swarm, in which case the client connects directly to the initial seeder and begins to request pieces. The BitTorrent protocol breaks down files into a number of much smaller pieces, typically a quarter of a megabyte (256 KB) in size. Larger file sizes typically have larger pieces. For example, a 4.37 GB file may have a piece size of 4 MB (4096 KB). The pieces are checked as they are received by the BitTorrent client using a hash algorithm to ensure that they are error free.
As further peers enter the swarm, all of the peers begin sharing pieces with one another, instead of downloading directly from the initial seeder. Clients incorporate mechanisms to optimize their download and upload rates. Peers may download pieces in a random order and may prefer to download the pieces that are rarest amongst its peers, to increase the opportunity to exchange data. Exchange of data is only possible if two peers have different subsets of the file. It is known, for example, in the BitTorrent protocol that a peer initially joining the swarm will send to other members of the swarm a content availability message in the form of a BitField message which indicates an initial set of pieces of the digital object which the peer has available for download by other ones of the peers. On receipt of further ones of the pieces, the peer will send further content availability messages in the form of Have messages to the other peers to indicate that the further ones of the pieces are available for download.
The substantial increase in traffic over P2P networks in the past few years has increased the demand for P2P caches and also for alternative P2P management techniques. In particular there is a need to ensure that those pieces of the digital object required are preferably available with short access times.
Several different patent applications disclose methods for ensuring that pieces or portions of the digital objects are preferably available within short access times. For example, US Patent Application No. US 2003/0158958 (Philips Electronics) teaches an electronic content delivery system which uses a network of end users, each one of which has a storage capability. The digital object - referred to as content in the Philips application - is stored in a distributed fashion on the network of the end user and is made available to individual ones of the end users in a P2P manner in order to cut download time and to reduce management errors. A look-up service is disclosed in the Philips application which holds information about the 5 network location of the selected digital object or its pieces. Information about the presence and access of the digital object in a local storage at the end user is written into the look-up service by a controlled system.
US-patent application No. US 2004/0148344 (Serenade Systems, Mountain View, California) 10 teaches a hierarchical cached media distribution system for the delivery of digital objects. This US patent application of Serenade, teaches a so-called push mechanism for distributing a list of completed files available on one of the peers to another one of the peers in a P2P network. The peer receiving the list will respond to the sending peer if it lacks packets from the listed files. According to the patent application, a peer in the Serenade system which 15 receives the file list but already has a complete settled package for each of the listed files will respond with the file name and packet number for other ones of the packets that the peer needs. Should the content pushing peer have the requested packets, the retrieval process is initiated.
20. Neither, of the Serenade system.patent application nor the Philips patent application disclose . the use of differential storage in which some of the pieces of the digital objects are held in fast storage whilest other ones of the digital objects are held in slow access storage.
SUMMARY OF THE INVENTION 25
It is therefore an object of the invention to provide a system for improving access times for data stored at nodes in a network.
It is furthermore an object of the invention to reduce the number of disk accesses to data at 30 nodes in a network.
These and other objects of the invention are solved by providing a network having a plurality of nodes. The network has at least a first node for storing pieces of a digital object and has a storage. There are a plurality of connections between the first node and the plurality of nodes. The first node has a requester for requesting one or more pieces of the digital object, a transmitter for transferring one or more pieces of the digital object to at least one further node among the plurality of nodes, a receiver for receiving content availability messages from at least some of the plurality of nodes and a sender for sending at least one limited content availability message to at least one further node among the plurality of nodes. In the invention the at least one limited content availability message indicates a subset but not all of the pieces of the digital object available through the first node.
The content availability messages in effect defers the delivery of full information about those pieces of the digital object which are currently present or available through the storage. The limited content availability messages preferably indicate those pieces of the digital object preferentially transferable by the transmitter to another node (peer) in the network. Such preferential transfer would be because the piece is, for example, in the fast access memory (i.e. in the solid state memory) or on local disk.
In the prior art systems, for example the Philips system and the Serenade system disclosed above, there is no requirement for producing limited available messages as neither of the systems teach the storage of the pieces in memories with different access speed.
The nodes can be either other peers or caches, such as proxy caches.
Logic can be incorporated into the network to decide which ones of the pieces of the digital object should be preferentially transferred.
The invention can be employed in a proxy cache and also in a peer in a peer-to-peer network.
The object of the invention could also be solved by providing a method of providing pieces of digital object from a first node to a plurality of nodes in a computer network. The method comprises the following steps: - a first step of sending to at least one further node among the plurality of nodes at least one limited content availability message, whereby the at least one limited content availability message indicates a subset but not all of the pieces of the digital object available through the first node; - a second step of receiving a request for pieces of the digital object from at least one of the plurality of nodes;
- a third step of transferring one or more pieces of the digital object to the at least one of the plurality of nodes.
DESCRIPTION OF THE DRAWINGS
Fig. 1 shows a first embodiment of the invention with a proxy cache. Fig. 2 shows the access of pieces of the digital object through the proxy cache in a P2P network. Fig. 3 shows a second embodiment of the invention in which multiple peers are connected to a cache. Fig. 4 is a block diagram illustrating an exemplary embodiment of the modules of a proxy
DETAILED DESCRIPTION OF THE INVENTION
Fig. 1 shows an example of a peer-to-peer network 10 with a plurality of peers 20a, 20b, 20c and 2Od - collectively referred to as peers 20 — connected to each other. The peers 20 are connected to a memory 22 which includes both a fast storage system (e.g. solid state memory or RAM) 23 and a slow storage system, e.g. a disk 24. In Fig. 1 , only one memory 22 is shown. It will, however, be known to the skilled person_that each-of the peers 20 will probably have the memory 22.
The peer-to-peer network 10 includes in one embodiment of the invention one or more proxy cache, such as, proxy cache 30a and 30b - collectively referred to as proxy cache 30, a central server 60 and a tracker 70. At least one of the peers 20a is attached to at least one proxy cache 30. In the embodiment of the invention shown in Fig. 1 two proxy caches 30a, 30b are shown. However, this is not limiting of the invention and the invention could use none, one or more than one proxy cache 30. The invention is described with respect to one protocol - the BitTorrent protocol - that has been described in the introduction. The invention is, however, not limited to the use of the BitTorrent protocol. Other protocols include similar messages to the BitField and Have messages, but name them differently. Other protocols may have messages requesting for offers of pieces available. The proxy caches 30 have both solid state memory 40 and a disk 50 as is shown in Fig. 2. It is known in the art that the access times for data stored on the disk 50 are substantially longer than the access times for data stored in the solid state memory 40. It is also possible that the proxy caches 30 have both local (immediate) memory and access to remote memory. Accessing data from the local memory is substantially faster than accessing data from the remote memory.
Suppose now that the peer 20a requests a digital object from the peer-to-peer network 10. The peer 20a does this by downloading a file from a central server 60 containing meta- information relating to the digital object. The meta-information includes the address of a tracker 70 The tracker 70 provides a list of peers 20 in the swarm. Thereafter the peer 20a then issues a request to the other peers 20 in the swarm indicating an interest to download the available pieces of the digital object. A BitField message will initially indicate that the peer 20a has no pieces of the digital object to contribute. Other BitField messages are received from all of the peers 20 of the swarm. Each of the peers 20 indicates the pieces of the digital object available with each of the peers 20. Subsequently the peer 20a indicates to the peers 20 of the pieces of the digital object that the peer 20a wishes to download. After downloading, the peer 20a sends a Have message to the members of the swarm which indicate the presence of the downloaded pieces of the digital object available with the peer 20a.
Consider now the peer 2Od connected to the memory 22. The memory 22 has one or more pieces of the digital object stored either in the solid state memory 23 or on the disk 24. hi the existing prior art systems, the peer 2Od would issue a content availability message (BitField message or Have message) to inform the peer 20a which pieces are available from the peer 2Od. Some of the pieces are quickly accessible from the fast access memory in the form of the solid state memory 23. Other ones of the pieces are accessible more slowly because they are stored on the disk 24. In the invention, therefore a limited content availability message is sent which does not indicate all of the available pieces of the digital object, but a limited subset of the pieces. This limited subset of the pieces will be the pieces of the digital object which the peer 2Od is prepared to transfer to the peer 20a.
The limited subset of pieces would, in one embodiment of the invention, be those pieces of the digital object currently stored in the fast access portion of the memory 22. The limited subset of pieces might also be the most requested pieces or those pieces which the peer 2Od will soon retrieve from the disk 22 (or has recently retrieved from the disk 22 because the piece was offered to another one of the peers 20). The members of the limited subset of pieces could change with time as the pieces are stored to and retrieved from the disk 22. Finally, it should be noted that it is possible that the limited subset of pieces could also include all of the pieces of the digital object currently stored in the memory 22 of the peer 2Od and indeed ultimately might be all of the pieces that make up the digital object. It is preferable that the peer 2Od continually offers to other peers 20 the pieces of the digital object which the requesting peer 20a does not currently have (as will be indicated by the content availability messages received from the requesting peer 20a. This ensures that the requesting peer 20a will offer the peer 2Od pieces of the digital object. It should also be noted that, if the peer 2Od has offered the piece to the requesting peer 20a, then it is advantageous to offer the piece to other ones of the peers 20 at this stage, since this would be the most efficient way of supplying the piece to the other members of the swarm.
In another embodiment of the invention, the proxy cache 30 (when present) can be utilised to improve the transfer of the pieces of the digital object. All of the messages in this embodiment of the invention are communicated through the proxy cache 30. In the example illustrated in Fig. 1, two proxy caches 30a and 30b are shown and the messages are transferred through both of these. The received pieces of the digital object are transferred back to the requesting peer 20a through the proxy cache 30. In this embodiment, the pieces of the . digital object are also stored in the proxy cache 30. The stored pieces of the digital object are then transferred to the requesting peer 20a.
The proxy cache 30 stores the pieces of the digital object initially in a solid state memory 40 (fast-access memory) and subsequently stores the pieces of the digital object to the disk 50. The proxy cache 30 may comprise more than one cache 30a, 30b as is indicated in Fig. 1. In an embodiment of the invention, different pieces of the digital object are stored on different ones of the proxy caches 30, such as the proxy caches 30a and 30b. So, for example, all "even-numbered" pieces of the digital object may be stored on the proxy cache 30a and all "odd-numbered" pieces of the digital object may be stored on the proxy cache 30b. This method has the advantage that the storage on the proxy cache 30 is balanced between the two proxy caches 30a and 30b. In an embodiment of the present invention, if three proxy caches 30 are included then every third piece may be stored on the same one of the proxy caches 30. In an alternate embodiment of the invention, two of the three caches may store copies of the same pieces. In this embodiment, a first one of the proxy caches 30 would have pieces 0 and 1, a second one of the proxy caches may store pieces 1 and 2 and a third one of the proxy caches 30 may store pieces 2 and 0.
Suppose now that the proxy cache 30 has some stored pieces of the digital object in the solid- state memory 40 and some pieces of the digital object on the disk 50. On account of the different access times, as discussed above, it would be most efficient to supply to the other peers 20 in the network 10 the pieces of the digital object that are in the solid-state memory 40 rather than the pieces of the digital object that are stored on the disk 50. Similarly, it is would be more efficient to supply the peer 20a with the pieces stored on the disk 50 rather than fetching other ones of the pieces from peers 20b-d. In particular, if the peers 20b-d are outside of the Internet Service Provider, then it will be more efficient to supply the pieces from the disk 50 than from the peers 20b-d. In the prior art the BitField messages sent by the proxy cache 30 (acting as a peer) would include all of the possible pieces stored in the proxy cache 30 and/or the pieces available on peers 20b-d. The proxy cache 30 may also in the prior art send Have messages indicating newly acquired pieces to the other peers 20 in the swarm.
This arrangement of the caches is particularly suitable for clusters of caches, for example at the edge of an ISP's network.
The logic of the proxy cache 30 is illustrated in Fig. 4. It should be understood that the blocks shown in Fig. 4 represent an exemplary embodiment of the logic implemented. It is not intended that the separate components are provided. It should be understood that these blocks may be implemented in an appropriate manner including virtual implementation. The proxy cache 30 includes a sender 32 for sending content availability messages to the other peers 20. The content availability messages include messages such as the BitField message and the Have message. In an embodiment of the invention, the content availability messages will be modified in a particular manner. The proxy cache 30 also includes a receiver 34 for receiving content availability messages from the other peers 20. The proxy cache 30 further includes a requester 36 for requesting one or more pieces of the digital object from at least one of the peers 20 and a transmitter 38 for transferring one or more pieces of the digital object to at least one of the peers 20. In the modification of the invention described herein the proxy cache 30 informs the other peers 20 of the swarm that the proxy cache 30 only has a limited number of pieces of the digital object (whereas, in fact, the proxy cache 30 has more pieces of the digital object). This information is communicated in the form of a modified BitField message or Have message to the other peers 20 (i.e. a modified content availability message from sender 32). Therefore, the other peers 20 will only request from the proxy cache 30 the pieces of the digital object which the other peers 20 consider the proxy cache 30 to have. In an embodiment of the invention, the proxy cache 30 may ensure that the pieces of the digital object are in the solid state memory 40 and thus fast accessible. This reduces the number of disk accesses which not only improves the access speed but also reduces the risk of failure of the disk 50.
The proxy cache 30 can be proactive by examining the BitField and Have messages received from the other peers 20 to determine the pieces of the digital object most likely to be requested. If it is identified that a particular piece is likely to be requested and is present on the disk 50, the proxy cache 30 may retrieve the likely request piece and load the likely requested piece into the solid state memory 40 in order to allow fast access in the future. The proxy cache 30 can thereafter issue the Have message or the BitField message to the other peers 20 informing them of the availability of the piece.
In another embodiment of the invention, the proxy cache 30 uses the received BitField and Have messages to determine the pieces most likely to be requested and may actively seek the most likely to be requested pieces from other peers 20. On retrieval, the requested pieces are held in the solid state memory 40 (as well as being stored to disk 50) to allow fast retrieval by other peers 20 - in particular the peers 20 connected to the proxy cache 30. So, for example, the proxy cache 30 may know that some of the pieces are obtainable from one of the peers 20 and these pieces are requested. The proxy cache 30 can issue a Have message to tell the other peers 20 that the requested piece is, in fact, present in the proxy cache 30 when the piece has yet to be loaded. The proxy cache 30 is likely to receive requests from the other peers 20 for the requested piece which the proxy cache 30 can fulfil once the proxy cache downloads the requested piece.
An example will serve to illustrate this in detail. Fig 2 illustrates the proxy cache 30a with the disk 50 and the solid state memory 40 storing only odd-numbered pieces, such as
1,3,5,7,9,11,13 and 15. The peer 20a already has pieces 1-5 and still requires pieces 6-20. The peer 20b has pieces 6-10 and 16-20. The proxy cache 30a has pieces 7 and 9 in the solid state memory 40. It therefore sends a BitField (BF) and/or a Have (H) message to the peer 20a informing the peer 20a that the proxy cache 30a only has pieces 7 and 9. The proxy cache 30a does not at this stage inform the peer 20a that the proxy cache 30a has pieces 11, 13 and 15 since these are stored on the disk 50 and as a result are not fast accessible.
The peer 20b sends a BitField message indicating that the peer 20b has pieces 6-10 and 16-20. The proxy cache 30a does not need pieces 6-10 and, therefore, ignores this part of the BitField message. On the other hand the proxy cache 30a requires the pieces 17 and 19 (it will be recalled that the proxy cache 30a. in this example, does not store even-numbered pieces). Thus, the proxy cache 30a sends a Require message to the peer 20b to retrieve the pieces 17 and 19. On retrieval the pieces 17 and 19 are stored in the proxy cache 30a and the proxy cache 30a sends a Have message indicating that the proxy cache 30a has pieces 17 and 19 available. The pieces 17 and 19 should be kept in solid state memory 40 to allow fast access as it is highly likely that the peers 20, not having 17 and 19, will react to the Have message.
Another embodiment of the invention is illustrated in Fig. 3 in which one or more peers 120a- 120c (collectively referred to as cache 120) are connected to a cache 130. In this case the cache 130 does not function as a proxy cache but rather downloads the pieces of the digital object from.another server 190 and uploads the pieces of the digital object from the peers 120.. The function of the cache 130 is described in co-pending UK patent 0616049.3.
Further, the cache 130 also a disk 150 and a solid-state memory 140 for faster access. Similar to the embodiment described above^ pieces of the digital object requested by the peers 120 may be stored in the solid-state memory 140 and/or the disk 150. The cache 130 also has a similar modules as illustrated in Fig. 4. The cache 130 issues BitField or Have messages indicating the pieces of the digital object which the cache 130 has stored. The cache 130 will not inform the peers 120 about all of the available pieces of the digital object and preferentially inform about the pieces of the digital object which are available in the fast access solid state memory 140 rather than the pieces of the digital object which are stored on the disk 150. The peers 120, in their requests, will request from the cache 130 only the pieces of the digital object which the cache 130 has indicated to be available (even if the cache 130 has only issued a limited subset of the available pieces). The requests from the peers 120 will be served from the solid-state memory 140. The cache 130 can analyse the BitFields or the Have messages sent from the peers 120 to analyse the pieces of the digital object which are most requested. If the most requested pieces of the digital object are on the disk 150, but not in the solid-state memory 140, the cache 130 can read the pieces of the digital object into the solid-state memory 140 and can then issue a Have message to indicate that the cache 130 now has the most requested pieces of the digital object. Subsequently, the cache 130 will serve any requests for access to the most requested pieces of the digital object.
Similarly if the cache 130 has recently acquired a piece of the digital object which the cache 130 previously did not have, the cache 130 will retain the recently acquired piece of the digital object in the solid-state memory 140 and issue a Have message to allow the peers 120 to download the piece from the cache 130.
It should be noted that the cache can also retrieve pieces of the digital object from both local memory and more distant memory in a further embodiment of the invention. In this sense, local memory is that memory which is in the same location as the cache. Remote memory is memory that is at a more distant location and thus requires longer to access.
The invention has been described with respect to various embodiments. It will be understood that the invention is not limited thereto. Numerous charges and modifications are known to the person skilled in the art and the invention is not limited to the details shown and described herein but is intended instead to cover all such charges as are obvious to one of ordinary skill of the art.

Claims

Claims
1. A network (10) having a plurality of nodes (20; 120) comprising
- at least a first node (30, 130) for storing pieces of a digital object and having a storage (40, 50; 140, 150)
- a plurality of connections between the first node (30, 130) and the plurality of nodes (20; 120), wherein the first node (30, 130) has:
- a requester (36) for requesting one or more pieces of the digital object;
- a transmitter (38) for transferring one or more pieces of the digital object to at least one further node among the plurality of nodes (20; 120); - a receiver (34) for receiving content availability messages from at least some of the plurality of nodes; and
- a sender (32) for sending at least one limited content availability message to at least one further node (20; 120) among the plurality of nodes (20, 120), whereby the at least one limited content availability message indicates a subset but not all of the pieces of the digital object available through the first node (30; 130).
2. The network ( 10) of claim 1 , wherein the subset comprises the pieces of the digital object preferentially transferable by the transmitter (38) to at least one further node.
3. The network (10) of any of the above claims, wherein the storage has a fast access storage (40) and a disk storage (50) and the subset comprises at least some of the pieces of the digital object stored in the fast access storage (40).
4. The network (10) of any of the above claims, wherein the subset comprises at least some of the pieces of the digital object recently accessed by the disk storage (50).
5. The network (10) of any of the above claims, wherein the subset comprises at least some of the pieces of the digital object accessible in a local storage.
6. The network (10) of any of the above claims, wherein the subset comprises at least some of the requested pieces of the digital object identified by the receiver (34) from the received content availability messages from the plurality of nodes (20).
7. The network (10) of any of the above claims, wherein the subset comprises at least some of the pieces available on further storages (30a, 30b, 130).
8. The network (10) of any of the above claims, wherein the subset comprises at least some of the pieces of the digital object sent in limited content availability messages to at least one further node among the plurality of nodes (20).
9. The network (10) of any of the above claims, wherein the subset is selected such that the number of pieces communicated to a member of the plurality of nodes (20, 120) through the limited content availability message is based on previously received content availability messages from the member of the plurality of nodes (20, 120).
10. The network (10) of any of the above claims, wherein the first node is a proxy cache
(30).
11. The network (10) of any of the above claims, wherein the first node comprises a plurality of proxy caches (30).
12. The network (10) of any one of claims 10 to 11, wherein at least two of the plurality of proxy caches (30a, 30b) stores a different subset of the pieces of the digital object.
13. The network (10) of any one of claims 1 to 9, wherein the first node is a peer or seed (20) in a peer-to-peer network.
14. A method of providing pieces of digital object from a first node (30, 130) to a plurality of nodes (20, 120) in a network (10), the method comprising the following steps:
- a first step of sending to at least one further node (20b, 120b) among the plurality of nodes at least one limited content availability message, whereby the at least one limited content availability message indicates a subset but not all of the pieces of the digital object available through the first node (30, 130);
- a second step of receiving a request for pieces of the digital object from at least one of the plurality of nodes (20, 120);
- a third step of transferring one or more pieces of the digital object to the at least one of the plurality of nodes (20, 120).
15. The method of claim 14, wherein the first step of sending to at least one further node among the plurality of nodes (20, 120) at least one limited content availability message comprises a step of including in the at least one limited content availability message at least some of the pieces of the digital object that are present in a fast access storage (40).
16. The method of claim 14, wherein the first step of sending to at least one further nodes among the plurality of nodes (20, 120) at least one limited content availability message comprises a step of including in the at least one limited content availability message at least some of the pieces of the digital object that are present in a local storage (40).
17. The method of one of claims 14 to 16, further comprising a step of accessing at least some of the pieces of the digital object stored in a slow access storage (50) and transferring the accessed pieces of the digital object to the fast access storage (40).
18. The method of any one of claims 14 to 17, further comprising a step of analysing a plurality of the requests for pieces of the digital object from the plurality of nodes (20, 120), thereby identifying most likely requested pieces of the digital object, and a step of ensuring that at least some of the most likely requested pieces of the digital object are in the fast access storage.
19. The method of any of claims 14 to 18 further comprising a step of requesting one or more pieces of the digital object and a step of transferring the requested one or more pieces of the digital object and storing the requested one or more pieces of the digital object into a storage device (40, 50, 140, 150).
20. The method of any of claims 14 to 19, wherein the first node (30) comprises at least two memories (30a, 30b) and different pieces of the digital object are stored in different ones of the at least two memories (30a, 30b).
EP07801606A 2006-08-11 2007-08-10 Improved storage performance Withdrawn EP2052336A1 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
GB0615964A GB2440760A (en) 2006-08-11 2006-08-11 Network and method of transferring data over the network by nodes sending messages containing a subset of list of data available at the node
US59611306A 2006-11-09 2006-11-09
PCT/EP2007/007107 WO2008017505A1 (en) 2006-08-11 2007-08-10 Improved storage performance

Publications (1)

Publication Number Publication Date
EP2052336A1 true EP2052336A1 (en) 2009-04-29

Family

ID=38566843

Family Applications (1)

Application Number Title Priority Date Filing Date
EP07801606A Withdrawn EP2052336A1 (en) 2006-08-11 2007-08-10 Improved storage performance

Country Status (3)

Country Link
EP (1) EP2052336A1 (en)
IL (1) IL197008A (en)
WO (1) WO2008017505A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2929792B1 (en) * 2008-04-02 2010-05-21 Groupe Ecoles Telecomm METHOD FOR SAVING / RESTORING FILES IN A PAIR NETWORK.
EP2748982B1 (en) * 2012-08-09 2018-01-10 Telefonaktiebolaget LM Ericsson (publ) Method and apparatus for use in p2p streaming system over mobile network
CN108964845B (en) 2018-07-03 2021-04-16 网宿科技股份有限公司 A method and device for acquiring BT resource information

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6785767B2 (en) * 2000-12-26 2004-08-31 Intel Corporation Hybrid mass storage system and method with two different types of storage medium
US20060168318A1 (en) * 2003-02-12 2006-07-27 Adam Twiss Methods and apparatus for traffic management in peer-to-peer networks

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6785767B2 (en) * 2000-12-26 2004-08-31 Intel Corporation Hybrid mass storage system and method with two different types of storage medium
US20060168318A1 (en) * 2003-02-12 2006-07-27 Adam Twiss Methods and apparatus for traffic management in peer-to-peer networks

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
ANONYMOUS: "Disk Cache, Hybrid Disk Cache and RamDisk", 12 July 2006 (2006-07-12), XP055488692, Retrieved from the Internet <URL:https://web.archive.org/web/20060712120704/http://www.superspeed.com:80/servers/supercache.php> [retrieved on 20180628] *
See also references of WO2008017505A1 *
WIERZBICKI A ET AL: "Cache replacement policies revisited: the case of P2P traffic", CLUSTER COMPUTING AND THE GRID, 2004. CCGRID 2004. IEEE INTERNATIONAL SYMPOSIUM ON CHICAGO, IL, USA APRIL 19-22, 2004, PISCATAWAY, NJ, USA,IEEE, 19 April 2004 (2004-04-19), pages 182 - 189, XP010725940, ISBN: 978-0-7803-8430-9 *
XINYAN ZHANG ET AL: "Coolstreaming/DONet: a data-driven overlay network for peer-to-peer live media streaming", INFOCOM 2005. 24TH ANNUAL JOINT CONFERENCE OF THE IEEE COMPUTER AND CO MMUNICATIONS SOCIETIES. PROCEEDINGS IEEE MIAMI, FL, USA 13-17 MARCH 2005, PISCATAWAY, NJ, USA,IEEE, PISCATAWAY, NJ, USA, vol. 3, 13 March 2005 (2005-03-13), pages 2102 - 2111, XP010829109, ISBN: 978-0-7803-8968-7, DOI: 10.1109/INFCOM.2005.1498486 *

Also Published As

Publication number Publication date
IL197008A (en) 2013-02-28
WO2008017505A1 (en) 2008-02-14
IL197008A0 (en) 2009-11-18

Similar Documents

Publication Publication Date Title
EP2057823B1 (en) Cache structure
US7995473B2 (en) Content delivery system for digital object
KR101359081B1 (en) Performance aware peer-to-peer content-on-demand
US7908362B2 (en) Method and apparatus for the delivery of digital data
US8321494B2 (en) Method and apparatus for reporting and invoicing of data downloads
CN113453038B (en) Effectiveness optimal collaborative cache management method under CDN-P2P hybrid architecture
US20060224687A1 (en) Method and apparatus for offline cooperative file distribution using cache nodes
US20080016201A1 (en) Methods and apparatus for transferring data
US20100115613A1 (en) Cacheable Mesh Browsers
WO2009152754A1 (en) Method, system and device for providing contents based on peer to peer network
WO2008017502A1 (en) Content distribution network
US20080208996A1 (en) Methods and apparatus for data transfer in networks using distributed file location indices
US20080040420A1 (en) Content distribution network
US20020078461A1 (en) Incasting for downloading files on distributed networks
EP2057822A1 (en) System and method for the location of caches
US9241032B2 (en) Storage performance
WO2008017505A1 (en) Improved storage performance
US20060236386A1 (en) Method and apparatus for cooperative file distribution in the presence of firewalls
Skevik et al. Analysis of bittorrent and its use for the design of a p2p based streaming protocol for a hybrid cdn
Johnsen et al. Peer-to-peer networking with BitTorrent
Jernberg et al. Doh: A content delivery peer-to-peer network
Cheng et al. Peer-assisted view-dependent progressive mesh streaming
Toole et al. Bittorrent architecture and protocol
Pacitti et al. Content Distribution in P2P Systems
Cameron et al. An ID-based approach to the caching and distribution of peer-to-peer, proxy-based video content

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20090303

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC MT NL PL PT RO SE SI SK TR

AX Request for extension of the european patent

Extension state: AL BA HR MK RS

17Q First examination report despatched

Effective date: 20090602

DAX Request for extension of the european patent (deleted)
RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: ALCATEL LUCENT

111Z Information provided on other rights and legal means of execution

Free format text: AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU LV MC MT NL PL PT RO SE SI SK TR

Effective date: 20130703

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: ALCATEL LUCENT

D11X Information provided on other rights and legal means of execution (deleted)
RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: ALCATEL LUCENT

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN WITHDRAWN

18W Application withdrawn

Effective date: 20180829