[go: up one dir, main page]

WO2016062632A1 - A method for designing a minimal aggregation topology for scalable computing - Google Patents

A method for designing a minimal aggregation topology for scalable computing Download PDF

Info

Publication number
WO2016062632A1
WO2016062632A1 PCT/EP2015/074014 EP2015074014W WO2016062632A1 WO 2016062632 A1 WO2016062632 A1 WO 2016062632A1 EP 2015074014 W EP2015074014 W EP 2015074014W WO 2016062632 A1 WO2016062632 A1 WO 2016062632A1
Authority
WO
WIPO (PCT)
Prior art keywords
nodes
layer
aggregation
aggregation topology
topology
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/EP2015/074014
Other languages
French (fr)
Inventor
Erwan Le Merrer
Bao-Duy TRAN
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Thomson Licensing SAS
Original Assignee
Thomson Licensing SAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Thomson Licensing SAS filed Critical Thomson Licensing SAS
Publication of WO2016062632A1 publication Critical patent/WO2016062632A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/12Discovery or management of network topologies

Definitions

  • the present invention generally relates to a network of computers or processing units, such as, but not limited to cloud computing.
  • an aggregation topology comprises L layers, each layer having at least one node and wherein the last layer has only one node and wherein no data is lost and the number of layers, L, minimizes latency.
  • a computer executes a design tool to design an aggregation topology comprising L layers, each layer having more than one node and wherein the last layer has only one node and wherein no data is lost and the number of layers, L, minimizes latency.
  • FIG. 3 shows an illustrative computer for use in executing the flow chart of FIG. 2.
  • Aggregation topology 10 comprises a number of layers, L.
  • the input data to the aggregation topology 10 is provided by a number of source nodes (denoted by the label "100") n 0 (where the subscript "0" indicates a source node) and n 0 > 1. It is assumed that each source node emits data (packets) according to a homogeneous Poisson process with the same mean emission rate ⁇ (where, again, the subscript "0" indicates a source node). As illustrated in FIG.
  • each source node uniformly routes data to every node in layer 1 (denoted by the label "101"), i.e., with a probability ri i ' where n is the number of nodes in layer 1 and n 1 > I.
  • each node in layer 1 uniformly routes data to every node in layer 2 (denoted by the label "102'), i.e., with a probability / n , where n 2 is the number of nodes in layer 2 and n 2 ⁇ 1. This pattern continues for the remaining layers.
  • the number of nodes in each layer decreases until, for the last layer, L (denoted by the label "105"), there is only one node (the sink node) that provides the output from the aggregation topology. However, it should be noted that it may be the case that adjacent layers (other than layer L) may have the same numbers of nodes.
  • the number of all nodes in the aggregation topology 10 is N.
  • Each node in Layers 1 through L processes their incoming data according to the same given aggregator function and provides the resulting output data to Othe node(s) in the next layer.
  • An example of aggregator functions are average, maximum value, minimum value, sum, count, etc.
  • Every node in aggregation topology 10 imposes the same ingest rate constraint, ⁇ .
  • the ingest rate constraint, ⁇ is the number of items/second (e.g., packets/second, that can correspond to an information quantity per unit of time, as IMB/s) that can be processed by a node before data is rejected, and thus data loss starts to occur.
  • Each node deals with the aggregation of non-overlapping time windows.
  • Each time window can, e.g., be identified by a "KEY".
  • Each node accepts and aggregates data corresponding to one time window at a time. If the "KEY" corresponding to the time windows changes, the aggregation result is sent to the next layer and a new aggregation begins.
  • An illustrative process for use in a node is shown below:
  • T is a key-value tuple, where T is a time period (often called
  • T 0 is the current time period in a node
  • the node emits a tuple to the next layer with the values of T 0 and b and then sets T 0 to the new value of T and a new value is stored in b to begin again to accumulate the aggregator function results for the new time period.
  • the last layer is the sink node 105. This is the last operational node and provides the overall aggregation result 111.
  • an aggregation topology is designed such that no data is lost while employing the least number nodes, N, in the least number of layers, L, to reduce complexity and latency.
  • N the least number of layers
  • L the number of nodes can be determined for each layer, I, and the minimum number of layers, L, can be determined such that the resulting aggregation topology does not drop packets and reduces latency by iteratively executing equation (2), below:
  • I represents the current layer, initially set to 1 ;
  • n 0 represents the number of source nodes
  • represents the mean Poisson emission rate
  • n t represents the determined number of nodes for the
  • L represents the number of layers when it is determined that
  • a layer has only one node.
  • variable, I representing the current layer is initially set to a value of one.
  • equation (2) is executed to determine the number of nodes, n in that layer, I.
  • step 215 a check is made to determine if there is only one node in that layer. If there is more than one node, then the value of I is incremented in step 225 and equation (2) is executed again in step 210 to determine the number of nodes for the next layer, etc.
  • step 215 there is only one node in a layer, then further execution of equation (2) stops and a value for the number of layers, L, is provided as well as values for the number of nodes in each layer.
  • the inputs are at least a given number of source nodes, n 0 , each having the same mean Poisson emission rate ⁇ , and a known ingest rate constraint, ⁇ , for nodes in the aggregation topology.
  • This approach caps, or limits, the value for the number of layers, L, to 1 ⁇ L ⁇ L max .
  • L max is now also an input as shown in FIG. 4.
  • the variable m z is a binary mask such that for 1 ⁇ I ⁇ L, the value of m z is one and for L + 1 ⁇ I ⁇ L max the value of m z is zero.
  • the output is the number of layers, L, and the number of nodes in each layer.
  • the remainder of FIG. 4 is the problem statement for input to, e.g., either the "SCIP" or "MIDACO" MINLP solvers as known in the art.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

An aggregation topology comprises L layers, each layer having at least one node and wherein the last layer has only one node and wherein no data is lost by the aggregation topology and the number of layers, L, minimizes latency in the aggregation topology.

Description

A METHOD FOR DESIGNING A MINIMAL AGGREGATION TOPOLOGY FOR
SCALABLE COMPUTING
BACKGROUND OF THE INVENTION
[0001] The present invention generally relates to a network of computers or processing units, such as, but not limited to cloud computing.
[0002] The rise of the Internet of Things and the general migration of IT (information technology) services to the cloud push for the adoption of practical low-latency processing solutions. Current applications include the monitoring of potentially thousands or millions of devices that stream information to a collection point in the background. A device analyzer may collect hundreds of thousands of data points per day per monitored device (e.g., see D. T. Wagner, A. Rice, and A. R. Beresford; "Device analyzer: Large-scale mobile data collection"; SIGMETRICS Perform. Eval. Rev., 41(4):53-56, Apr. 2014).
[0003] Real-time computation is made possible by the progress of stream processing platforms such as "Storm", and of algorithms requiring small space and update time (e.g., see O. Papapetrou, M. Garofalakis, and A. Deligiannakis; "Sketch-based querying of distributed sliding-window data streams"; Proc. VLDB Endow., 5(10):992_1003, June 2012). The support for stream aggregation operations is required for advanced analytics, since it allows for important and more advanced applications like identification of heavy hitters ( e.g., see G. Cormode, F. Korn, S. Muthukrishnan, and D. Srivastava; " Finding hierarchical heavy hitters in data streams"; Proceedings of the 29th International Conference on Very Large Data Bases - Volume 29, VLDB Ό3, pages 464_475; VLDB Endowment, 2003), or anomaly detection (e.g., see Q. Huang and P. P. Lee; "LD-Sketch: A distributed sketching design for accurate and scalable anomaly detection in network data streams"; INFOCOM, 2014).
[0004] There are of course technical challenges raised by this increasing amount of monitored resources. It is particularly well understood that no single computing unit can sustain millions of connections for aggregating device information. As a consequence, computing units (also referred to as processing units or nodes) are arranged in a network topology such as "aggregation topologies". In an aggregation topology, there are "L" layers, where each layer comprises a number of nodes. Devices stream data (packets) to the aggregation topology for processing. The first layer of the aggregation topology receives the data. Generally, the number of nodes in each layer decreases until, for the last layer, there is only one node that provides the output from the aggregation topology. As such, the information received is reduced layer by layer in a scalable fashion, until the desired result is obtained (e.g., see Q. Zhang, J. Liu, and W. Wang; "Approximate clustering on distributed data streams"; Data Engineering, 2008; ICDE 2008; IEEE 24th International Conference on, pages 1131-1139, April 2008).
[0005] Currently, there is no systematic way to design an aggregation topology for a particular service (application), other than by using trial and error. One can for instance provision a tangibly high number of nodes, which will suffice for the service to run without losing device data. However, this may result in a high operational cost since there will likely be more nodes then required for the service. At the other extreme, if one designs an aggregation topology with too few nodes, then it is likely that the service will not be able to ingest the service load - resulting in packet loss.
SUMMARY OF THE INVENTION
[0006] Acknowledging the commercial or physical limits of processing nodes available for operation, there is thus a need for a systematic approach to design an aggregation topology on demand. This topology must be derived from the input characteristics (number of devices and their data sending rate) and on the operations to be achieved. Meanwhile, this topology should maintain the invariant that no node (also referred to as a shard) must be overwhelmed by received data from its neighbors in the topology. Trivially deriving the number of nodes from the raw traffic (divided by the rate limit) does not help, as it gives an indication on a minimal number of nodes to operate. It does not give information on the form of the topology, so nodes beyond the first layer can be overwhelmed too.
[0007] Therefore, and in accordance with the principles of the invention, an aggregation topology is designed such that no data is lost while employing the least number nodes in the least number of layers possible to reduce complexity and latency.
[0008] In an illustrative embodiment of the invention, an aggregation topology comprises L layers, each layer having at least one node and wherein the last layer has only one node and wherein no data is lost and the number of layers, L, minimizes latency.
[0009] In another illustrative embodiment of the invention, a computer executes a design tool to design an aggregation topology comprising L layers, each layer having more than one node and wherein the last layer has only one node and wherein no data is lost and the number of layers, L, minimizes latency. [0010] In view of the above, and as will be apparent from reading the detailed description, other embodiments and features are also possible and fall within the principles of the invention.
BRIEF DESCRIPTION OF THE DRAWINGS
[0011] FIG. 1 shows an illustrative aggregation topology in accordance with the principles of the invention;
[0012] FIG. 2 shows an illustrative flow chart for use in a computer for providing a design tool to design an aggregation topology in accordance with the principles of the invention; and
[0013] FIG. 3 shows an illustrative computer for use in executing the flow chart of FIG. 2.
DETAILED DESCRIPTION
[0014] Other than the inventive concept, the elements shown in the figures are well known and will not be described in detail. For example, other than the inventive concept, an aggregation topology, a processing unit (or node), and the components thereof, such as a transceiver (communications block), processor, etc., are well known and not described in detail herein. Further, other than the inventive concept, familiarity with the Internet and cloud computing is assumed and not described herein. It should also be noted that the inventive concept may be implemented using conventional programming techniques, e.g., APIs (application programming interfaces) which, as such, will not be described herein. Finally, like-numbers on the figures represent similar elements.
[0015] An illustrative aggregation topology 10 in accordance with the principles of the invention is shown in FIG. 1. As described further below, the number of layers, L, and the total number of nodes spread across the topology are selected such that no data is lost while employing the least number nodes in the least number of layers possible to reduce complexity and latency.
[0016] Aggregation topology 10 comprises a number of layers, L. The input data to the aggregation topology 10 is provided by a number of source nodes (denoted by the label "100") n0 (where the subscript "0" indicates a source node) and n0 > 1. It is assumed that each source node emits data (packets) according to a homogeneous Poisson process with the same mean emission rate ο (where, again, the subscript "0" indicates a source node). As illustrated in FIG. 1, each source node uniformly routes data to every node in layer 1 (denoted by the label "101"), i.e., with a probability rii' where n is the number of nodes in layer 1 and n1 > I. Likewise, each node in layer 1 uniformly routes data to every node in layer 2 (denoted by the label "102'), i.e., with a probability /n , where n2 is the number of nodes in layer 2 and n2≥ 1. This pattern continues for the remaining layers. Generally, the number of nodes in each layer decreases until, for the last layer, L (denoted by the label "105"), there is only one node (the sink node) that provides the output from the aggregation topology. However, it should be noted that it may be the case that adjacent layers (other than layer L) may have the same numbers of nodes. The number of all nodes in the aggregation topology 10 is N. Each node in Layers 1 through L, processes their incoming data according to the same given aggregator function and provides the resulting output data to Othe node(s) in the next layer. An example of aggregator functions are average, maximum value, minimum value, sum, count, etc. Further, every node in aggregation topology 10 imposes the same ingest rate constraint, Θ. The ingest rate constraint, Θ, is the number of items/second (e.g., packets/second, that can correspond to an information quantity per unit of time, as IMB/s) that can be processed by a node before data is rejected, and thus data loss starts to occur. Each node deals with the aggregation of non-overlapping time windows. Each time window can, e.g., be identified by a "KEY". Each node accepts and aggregates data corresponding to one time window at a time. If the "KEY" corresponding to the time windows changes, the aggregation result is sent to the next layer and a new aggregation begins. An illustrative process for use in a node is shown below:
- null; b <- agg(6);
each tuple (T, v)received do
if TQ = null then
T0 *- T; b <- agg ({ })
else if T ≠ T0 then
emit tuple (TQ, b);
T0 ^ T; b <- agg{{v});
else
b <- accumaqq (b, v); In terms of the variables shown in process (1), the following definitions are provided:
(T, v) is a key-value tuple, where T is a time period (often called
an epoch) used as the KEY and v is a payload value;
T0 is the current time period in a node;
agg({v}) is an aggregator function used by a node on a received
payload, v;
b stores the accumulation of the aggregator function results in a
node;
accumagg (b, v) accumulates the aggregator function results in b;
With the above definitions, process (1) operates in a node as follows. Initially, T0 is set to a null value and b is set to a value of 0. For each data (tuple) received, if T0 is equal to a null value, then T0 is set to the value of the time period, T, in the received tuple (T, v) and b is set equal to the value of the aggregator function result . As long as the value of T0 is equal to the value of the time period, T, in a received tuple (T, v), (in other words the time period hasn't changed) then b accumulates the aggregator function results via accur agg b, v). However, once the time period changes, and T ≠ T0 then the node emits a tuple to the next layer with the values of T0 and b and then sets T0 to the new value of T and a new value is stored in b to begin again to accumulate the aggregator function results for the new time period. Turning back to FIG.l, the last layer is the sink node 105. This is the last operational node and provides the overall aggregation result 111.
[0017] As noted earlier, there is no systematic way to design an aggregation topology for a particular service (application), other than by using trial and error. One can for instance provision a tangibly high number of nodes, which will suffice for the service to run without losing device data. However, this may result in a high operational cost since there will likely be more nodes then required for the service. At the other extreme, if one designs an aggregation topology with too few nodes, then it is likely that the service will not be able to ingest the service load - resulting in packet loss. It should also be noted that if packets are dropped later in the aggregation process rather than sooner in the aggregation process, there is a loss of more aggregation information.
[0018] Therefore, and in accordance with the principles of the invention, an aggregation topology is designed such that no data is lost while employing the least number nodes, N, in the least number of layers, L, to reduce complexity and latency. In particular, for a given number of source nodes, n0, each having the same mean Poisson emission rate λο, and a known ingest rate constraint, Θ, for each node in the aggregation topology, the number of nodes can be determined for each layer, I, and the minimum number of layers, L, can be determined such that the resulting aggregation topology does not drop packets and reduces latency by iteratively executing equation (2), below:
<- 1 to + oo do
Figure imgf000007_0001
return L, nx , n , ... nL
In terms of the variables in equation (2) the following definitions are provided:
I represents the current layer, initially set to 1 ;
n0 represents the number of source nodes;
ο represents the mean Poisson emission rate;
Θ represents the ingest rate constraint;
nt represents the determined number of nodes for the
current layer, I; and
L represents the number of layers when it is determined that
a layer has only one node.
[0019] Equation (2) is iteratively performed as illustrated in the flow chart of FIG. 2. In step 205, the following data is input:
the number of source nodes, n0 ;
the mean Poisson emission rate, ο; and
the ingest rate constraint, Θ .
In addition, the variable, I, representing the current layer is initially set to a value of one. In step 210, equation (2) is executed to determine the number of nodes, n in that layer, I. In step 215, a check is made to determine if there is only one node in that layer. If there is more than one node, then the value of I is incremented in step 225 and equation (2) is executed again in step 210 to determine the number of nodes for the next layer, etc. However, if, in step 215, there is only one node in a layer, then further execution of equation (2) stops and a value for the number of layers, L, is provided as well as values for the number of nodes in each layer. As a result, a systematic design approach is provided for designing an aggregation topology where there is no data loss and the number of layers is minimized to reduce latency.
[0020] Turning briefly to FIG. 3, an illustrative high level block diagram of a computer 500 for executing a design tool in accordance with the principles of the invention, as illustrated by the flow chart of FIG. 2, is shown. Only those portions relevant to the inventive concept are shown. As such, computer 500 can perform other functions. Computer 500 is a processor based system as represented by processor 505. The latter represents one, or more, stored-program controlled processors as known in the art. In other words, processor 505 executes programs stored in memory 510. The latter represents volatile and/or non-volatile memory, e.g., hard disk, CD-ROM, DVD, random access memory (RAM), etc.) for storing program instructions and data, e.g., for performing the illustrative flow chart shown in FIG. 2 for providing aggregation topology design. Computer 540 also has communications block 130, which supports communications of data over a data connection 541 as known in the art. Data communications can be wired, or wireless, utilizing 802.11, 3G LTE, 4G LTE, etc. Finally, mobile device 505 includes a display and keyboard 530 for providing information to a user, e.g., the output data from step 220 and receiving information from a user e.g., the input data in step 205.
[0021] Other approaches in accordance with the principles of the invention for determining an aggregation topology as a function of a given number of source nodes, n0, each having the same mean Poisson emission rate λο, and a known ingest rate constraint, Θ, for nodes in the aggregation topology can also be used. For example, a process can be used that utilizes mixed-integer non-linear programming (MINLP) and employs off-the-shelf MINLP solvers like "SCIP" (Solving Constraint Integer Programs) or "MIDACO (Mixed Integer Distributed Ant Colony Optimization) Solver". One such approach is shown in FIG. 4. As shown in FIG. 4, the inputs are at least a given number of source nodes, n0, each having the same mean Poisson emission rate λο, and a known ingest rate constraint, Θ, for nodes in the aggregation topology. This approach caps, or limits, the value for the number of layers, L, to 1 < L < Lmax. As such, Lmax, is now also an input as shown in FIG. 4. The variable mz is a binary mask such that for 1 < I < L, the value of mz is one and for L + 1 < I≤ Lmax the value of mz is zero. As shown in FIG. 4, the output is the number of layers, L, and the number of nodes in each layer. Other than the inventive concept, the remainder of FIG. 4 is the problem statement for input to, e.g., either the "SCIP" or "MIDACO" MINLP solvers as known in the art.
[0022] In view of the above, the foregoing merely illustrates the principles of the invention and it will thus be appreciated that those skilled in the art will be able to devise numerous alternative arrangements which, although not explicitly described herein, embody the principles of the invention and are within its spirit and scope. It is therefore to be understood that numerous modifications may be made to the illustrative embodiments and that other arrangements may be devised without departing from the spirit and scope of the present invention.

Claims

1. A method for use in determining an aggregation topology, the method comprising: receiving data representing a number of source nodes, a mean Poisson emission rate and an ingest rate constraint;
determining from the received data a number of nodes for a layer of the aggregation topology;
wherein if the number of nodes for the layer is greater than one, repeating the determining step for the next layer and if the number of nodes for the layer is equal to one, providing the number of layers for the aggregation topology and the number of nodes in each of the layers of the aggregation topology.
2. The method of claim 1, wherein the number of nodes in each layer of the aggregation topology results in no data loss and the number of layers for the aggregation topology minimizes latency for the aggregation topology.
3. The method of claim 1, wherein the determining step executes the following equation:
Figure imgf000010_0001
wherein I represents the current layer, initially set to 1 ;
n0 represents the number of source nodes;
ο represents the mean Poisson emission rate;
Θ represents the ingest rate constraint; and
Ui represents the determined number of nodes for the current
layer, I .
4. The method of claim 1, wherein the determining step uses a mixed- integer nonlinear program solver.
5. An aggregation topology comprising:
a first layer comprising a plurality of nodes for receiving data from a number of source nodes having the same mean Poisson emission rate;
a plurality of other layers, each layer having a plurality of nodes for receiving aggregated data from a previous layer; and
a last layer having only one node for receiving aggregated data from a previous layer and providing an aggregation result;
wherein the number of all layers in the aggregation topology is L; and
wherein the number of nodes in the aggregation topology across all L layers is N, and each node has the same ingest rate constraint, and each node performs the same aggregator function for providing the aggregation result;
wherein the number of layers, L, and the number of nodes in each layer are selected as a function of the number of source nodes, the mean Poisson emission rate and the ingest rate constraint.
6. The aggregation topology of claim 5, wherein the number of nodes in each layer of the aggregation topology results in no data loss and the number of layers for the aggregation topology minimizes latency for the aggregation topology.
7. The aggregation topology of claim 5, wherein the aggregator function is at least one of average, maximum value, minimum value, sum, and count functions.
8. The aggregation topology of claim 5, wherein the function is:
Figure imgf000011_0001
wherein I represents the current layer, initially set to 1 ;
n0 represents the number of source nodes;
ο represents the mean Poisson emission rate;
Θ represents the ingest rate constraint; and
Ui represents the determined number of nodes for the current
layer, I .
9. A computer for use in determining an aggregation topology, the computer comprising:
a memory for storing a program and data representing a number of source nodes, a mean Poisson emission rate and an ingest rate constraint; and
a processor for executing the stored program, wherein the processor determines from the stored data a number of nodes for a layer of the aggregation topology; and wherein if the number of nodes for the layer is greater than one, repeats the determining step for the next layer and if the number of nodes for the layer is equal to one, provides the number of layers for the aggregation topology and the number of nodes in each of the layers of the aggregation topology.
10. The computer of claim 9, wherein the number of nodes in each layer of the aggregation topology results in no data loss and the number of layers for the aggregation topology minimizes latency for the aggregation topology.
11. The computer of claim 9, wherein the stored program includes instructions to evaluate the following equation:
Figure imgf000012_0001
wherein I represents the current layer, initially set to 1 ;
n0 represents the number of source nodes;
ο represents the mean Poisson emission rate;
Θ represents the ingest rate constraint; and
Ui represents the determined number of nodes for the current
layer, I .
12. The computer of claim 9, wherein the stored program uses a mixed-integer nonlinear program solver.
PCT/EP2015/074014 2014-10-22 2015-10-16 A method for designing a minimal aggregation topology for scalable computing Ceased WO2016062632A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP14306679.3 2014-10-22
EP14306679 2014-10-22

Publications (1)

Publication Number Publication Date
WO2016062632A1 true WO2016062632A1 (en) 2016-04-28

Family

ID=51893955

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2015/074014 Ceased WO2016062632A1 (en) 2014-10-22 2015-10-16 A method for designing a minimal aggregation topology for scalable computing

Country Status (1)

Country Link
WO (1) WO2016062632A1 (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2367324A1 (en) * 2010-03-15 2011-09-21 Samsung Electronics Co., Ltd. Techniques for self-organizing activity-diffusion-based wireless sensor network
US8392575B1 (en) * 2011-03-31 2013-03-05 Amazon Technologies, Inc. Clustered device dispersion in a multi-tenant environment

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2367324A1 (en) * 2010-03-15 2011-09-21 Samsung Electronics Co., Ltd. Techniques for self-organizing activity-diffusion-based wireless sensor network
US8392575B1 (en) * 2011-03-31 2013-03-05 Amazon Technologies, Inc. Clustered device dispersion in a multi-tenant environment

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
D. T. WAGNER; A. RICE; A. R. BERESFORD: "Device analyzer: Large-scale mobile data collection", SIGMETRICS PERFORM. EVAL. REV., vol. 41, no. 4, April 2014 (2014-04-01), pages 53 - 56
G. CORMODE; F. KORN; S. MUTHUKRISHNAN; D. SRIVASTAVA: "Finding hierarchical heavy hitters in data streams", PROCEEDINGS OF THE 29TH INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, vol. 29, pages 464 - 475
O. PAPAPETROU; M. GAROFALAKIS; AND A. DELIGIANNAKIS: "Sketch-based querying of distributed sliding-window data streams", PROC. VLDB ENDOW., vol. 5, no. 10, June 2012 (2012-06-01), pages 992 - 1003
Q. HUANG; P. P. LEE: "LD-Sketch: A distributed sketching design for accurate and scalable anomaly detection in network data streams", INFOCOM, 2014
Q. ZHANG; J. LIU; W. WANG: "Data Engineering, 2008; ICDE 2008; IEEE 24th International Conference", April 2008, article "Approximate clustering on distributed data streams", pages: 1131 - 1139

Similar Documents

Publication Publication Date Title
Li et al. Coded terasort
US20200364607A1 (en) Systems and methods for unsupervised anomaly detection using non-parametric tolerance intervals over a sliding window of t-digests
US10355949B2 (en) Behavioral network intelligence system and method thereof
US20120226723A1 (en) Approximate order statistics of real numbers in generic data
US10601671B1 (en) Creating and displaying a graph representation of a computer network topology for an executing application
US12323443B2 (en) Attack behavior detection method and apparatus, and attack detection device
US12476875B2 (en) Discovering a computer network topology for an executing application
US10997517B2 (en) Methods and systems for aggregating distribution approximations
US11831492B2 (en) Group-based network event notification
US10678610B2 (en) Using and updating topological relationships amongst a set of nodes in event clustering
US10963346B2 (en) Scalable methods and systems for approximating statistical distributions
WO2020053792A1 (en) Malchain detection
CN116210211A (en) Anomaly Detection in Network Topologies
US20230403197A1 (en) Optimizing the transfer of massive amounts of data using AI strategies
Jiang et al. Towards max-min fair resource allocation for stream big data analytics in shared clouds
KR20220074819A (en) Graph Stream Mining Pipeline for Efficient Subgraph Detection
US10609206B1 (en) Auto-repairing mobile communication device data streaming architecture
CN113391907A (en) Task placement method, device, equipment and medium
US12007759B2 (en) Geometric aging data reduction for machine learning applications
JP2015156529A (en) Flow aggregation device, method and program
US11126667B2 (en) Streaming method for the creation of multifaceted statistical distributions
WO2016062632A1 (en) A method for designing a minimal aggregation topology for scalable computing
Rygielski et al. Context change detection for resource allocation in service-oriented systems
Elsayed et al. On the impact of network delays on Time-to-Live caching
Ghandour et al. Scalable overload prediction in cloud computing using a hybrid queuing-theoretic and machine learning framework

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15784319

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 15784319

Country of ref document: EP

Kind code of ref document: A1