WO2001006396A2 - Data mining software to determine customer potential - Google Patents
Data mining software to determine customer potential Download PDFInfo
- Publication number
- WO2001006396A2 WO2001006396A2 PCT/US2000/018843 US0018843W WO0106396A2 WO 2001006396 A2 WO2001006396 A2 WO 2001006396A2 US 0018843 W US0018843 W US 0018843W WO 0106396 A2 WO0106396 A2 WO 0106396A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- prospect
- value
- prospects
- models
- model
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/02—Marketing; Price estimation or determination; Fundraising
Definitions
- This invention relates to data mining software.
- Data mining software extracts knowledge that may be suggested by a set of data. For example, data mining software can be used to maximize a return on investment in collecting marketing data, as well as other applications such as credit risk assessment, fraud detection, process control, medical diagnoses and so forth.
- data mining software uses one or a plurality of different types of modeling algorithms in combination with a set of test data to determine what types of characteristics are most useful in achieving a desired response rate, behavioral response or other output from a targeted group of individuals represented by the data.
- data mining software executes complex data modeling algorithms such as linear regression, logistic regression, back propagation neural network, Classification and Regression Trees (CART) and Automatic Interaction Detection (CHAID) decision trees, as well as other types of algorithms on a set of data.
- CART Classification and Regression Trees
- CHID Automatic Interaction Detection
- One objective in using data mining software is to identify customers or prospects for target marketing.
- One concept used in data mining software is determining the value of a customer. By most practices, value is essentially equal to current value. For example, a common method of treating the customer is anybody who purchased from a company most recently and spent the most money with the company is a person considered to be of high value.
- marketing organizations will target people that are identified as having a past history of frequent or high value purchasing. Such persons are considered the best customers of a marketing organization. These best customers are often aggressively marketed.
- a method of determining a prospect's potential value as a customer includes scoring a group of prospects on a first valuation model that models prospects that are marketed to in a first manner and a second valuation model that models prospects that are marketed to m a second, different manner and determining a difference in scores of the group of prospects scored on the first valuation model and the second valuation model.
- a method of determining a prospect's true value as a customer includes determining a prospect's potential value and adding the prospect's potential value to an estimate of the prospect's current value to provide the prospect's true value.
- a computer program product for determining a prospect's true value as a customer includes instructions for causing a computer to determine a prospect's potential value and add the prospect's potential value to an estimate of the prospects current value to provide the prospect's true value.
- This invention takes the approach that value does not simply equate to current value and thus the customer with the highest current value may not actually be the best customer to aggressively pursue in a new marketing campaign. Rather, a true value is determined.
- the true value is related to a customer's current value and the customer's potential value.
- the data mining software includes a process to determine potential value. The concept of potential value adds a new dimension to how marketing is done.
- a fundamental approach used in the data mining software is that the software determines a customer's true value to a company based on an estimate of a current value which is observable and a determined potential value component.
- the data mining software uses an algorithm that scores prospects based on their calculated "true value" which is equal to current value plus potential value (e.g., untapped potential) .
- FIG. 1 is a block diagram of a computer system executing data mining software that determines true value of a customer.
- FIG. 2 is a block diagram of a data set.
- FIG. 2A is a diagram of a record.
- FIG. 3 is a block diagram of a training process for the data mining software used m FIG. 1.
- FIG. 4 is a flow chart of the data mining software that determines customer potential.
- FIG. 5 is a flow chart of an alternative technique to determine customer potential value.
- FIG. 6 is a flow chart of another alternative technique to determine customer potential value.
- a computer system 10 includes a CPU 12, main memory 14 and persistent storage device 16 all coupled via a computer bus 18.
- the system 10 also includes output devices such as a display 20 and a printer 22, as well as user input devices such as a keyboard 24 and a mouse 26.
- output devices such as a display 20 and a printer 22, as well as user input devices such as a keyboard 24 and a mouse 26.
- m FIG. 1 but necessarily included m a system of FIG. 1 are software drivers and hardware interfaces to couple all the aforementioned elements to the CPU 12.
- the computer system 10 also includes data mining software.
- the data mining software 30 may reside on the computer system 10 or may reside on a server 28, as shown, which is coupled to the computer system 10 m a conventional manner such as m a client-server arrangement. The details on how this data mining software is coupled to this computer system 10 are not important to understand the present invention.
- data mining software 30 executes complex data modeling algorithms such as linear regression, logistic regression, back propagation neural network, Classification and Regression Trees (CART) and Chi squared Automatic Interaction Detection (CHAID) decision trees, as well as other types of algorithms that operate on a data set.
- the data mining software 30 can use any one of these algorithms with different modeling parameters to produce different results.
- the data mining software 30 can render a visual representation of the results on the display 20 or printer 22 to provide a ⁇ ecision maker with the results.
- the results that are returned can be based on different algorithm types or different sets of parameters used with the same algorithm.
- One type of result that the software can return is in the form of a lift chart.
- the results can also be returned without a visual depiction of the results such as the score itself, calculating an RMS value, and so forth.
- One approach is to render a graph or other visual depiction of the results. A preferred arrangement for providing such lift curves on a lift chart is described m U.S. Patent Application, serial no.
- the data mining software 30 described below determines a true value of a customer.
- the data mining software includes a potential value determination process 32 that allows for execution of multiple evaluation models.
- the models are trained by evaluating different groups of randomly selected customers sampled from a larger data set.
- the value determining process 32 produces a true value based on a current value that can be observed and a calculated potential value.
- a data set 50 includes a plurality of records 51.
- the data set 50 often includes a very large number of such records 51.
- the records are divided into groups 52a-52b whose members are randomly selected from the larger data set 50.
- the records 51 (FIG. 2A) can include an identifier field 53a, as well as one or a plurality of fields 53b corresponding to input variable values that are used in the value determining process 32.
- the records 51 also include one or more result fields 53c that are used by the value determining process 32 to record scores for the record that measure the current value and potential value and determine the true value of a prospect represented by the record.
- the result fields 53c include a current value field 64a, a potential value field 65a and a true value field 66a.
- the data mining software 30 or user randomly partitions this data set 50 into a series of data segments, i.e., at least two 52a, 52b.
- a test marketing campaign is conducted.
- Individuals represented by the records of one of the groups e.g., 52a are treated differently from individuals represented by the records of another one of the groups, e.g., group 52b.
- treating differently is meant that one group e.g., the first group 52a can be exposed to intensive, i.e., aggressive marketing 57, whereas the second group 52b could be exposed to normal or less intensive marketing 59.
- Examples of aggressive marketing would include high incentives such as strong discounts or premiums, or higher cost collateral materials .
- value determining process 32 includes multiple valuation models 60a-60b (at least two) trained using the results of the test marketing directed at the respective groups of customers 52a-52b (FIG. 2) .
- the individual multiple models 60a-60b are trained with data for the respective one of the groups 52a-52b.
- the groups 52a-52b are used to train the models based on the marketing results. For example, a customer who spends $1,000 every year for the last 5 years, may be considered to have a current value of $1,000 per year. But, the potential value of this customer may not be $1000.
- tne value determining process 32 determines the potential value V P of a customer.
- the potential value is a measure of how much more this person can spend if the person is aggressively marketed.
- the value determining process 32 estimates the potential value by scoring each record 51 of the dataset 50 through a valuation model 60a for the special group and a valuation model 60b for the normal group. Each customer record 51 scored through models 60a and 60b yields scores SI and S2.
- the potential for each customer is determined 66 by taking the difference in the scores e.g., S1-S2.
- the potential value i.e., untapped potential V P is related to the difference between how well a prospect matches to the profiles of the specially treated group and the normally treated group.
- the determined score is a measure of the customer's potential value.
- the models return potential values as amounts of money e.g., dollar amounts, e.g. a measure of the amount of money that a prospect potentially may spend.
- Another potential determining process 70 uses the concept that customers with the highest potential are those that most resemble current best customers.
- the data mining software 30 models the probability density of each customer over their possible values that they might take on. Tne customer's potential can be approximated by integrating over a density function, e.g., multiplying the probability at each value by that value and integrating over the whole density of the customer's potential value.
- a is a scaling factor that can be tuned empirically
- p (v) is the probability density associated with value v
- C is the current value
- One possible way to build the probability density model would be to divide up the range of values into segments, and build j models, each learning a probability for one segment.
- the value determining process 70 builds models 70a-70j for each of a plurality of segments.
- the evaluation process 70 therefore has a model 70a for the top 10%, a second model 70b for between 10 and 20%, a third model 70c between 20 and 30% and so forth.
- the models each learn a segment of the probability density of possible customer values given customer attributes, p(V) .
- the outputs of the j models are used to build 76 a probability density.
- the outputs are combined and renormalized to produce a composite probability density model.
- non-parametric density function estimators include kernel estimators, orthogonal series estimators, histograms, splines, and mixture models.
- the training used m the evaluation process 70 examines the customer base and for each one of the customers provides a measure of how much money the customer had spent with the marketing organization in the past. That measure is used as a current value.
- the evaluation process 70 sorts the customer base into segments where the top segment would be customers that spend the most and the bottom segment would be customers that spend the least.
- the process 70 segments the customers into such groups and builds models 70a-70j of the probability of membership of a customer in each segment. Positive examples for each segment model are a sample of customers currently m that segment, and negative examples are a sample of customers not m that segment.
- the value determining process 70 uses models 70a-70j to give an estimate of the probability that a customer is in each segment. To score a new prospect, the process 70 computes the prospect's expected value (by integrating over the output of the density model) multiplied by the value associated with each segment (either an average or the output of a valuation model for that segment) . The process 70 subtracts the current value of the prospect (obtained from either a calculation or a model) from the prospect's expected value, and scales the difference by a factor alpha ⁇ (which can be tuned based on sample data) . The result is an estimate of the potential. The potential value process 70 uses the set of models 70a-70j where each model 70a-70] provides an estimate of the probability of a prospect moving into the segment represented by the model.
- the model process 70 will take a record corresponding to the prospect and score it on the other nine (9) models 70a-70 ⁇ .
- the average valuations for each segment would give an indication of how much money the prospect would spend if the prospect belonged to that segment.
- the model process 70 uses the score in order to estimate the likelihood of the prospect being a member of the each of the segments. Therefore, the estimated value of the prospect is equal to the sum of the probability of the prospect becoming a member of the top segment times the value of the prospect in that segment, plus the probability of the prospect becoming a member of the second segment times the value of the prospect in the second segment, and so forth down to the segment that is the one above the prospect's current segment.
- model 70a for the top 10%
- a third model 70c between 20 and 30%, and so forth model 70a would be the same, but model 70b would be for the entire top 20%
- model 70c would be for the entire top 30%, and so forth.
- One advantage of this embodiment is the larger sample sizes available for building the models. In this case, if for example, a prospect was m the fourth segment, rather than integrating over the outputs from models 70a through 70c, only the output from model 70c would need to be used.
- the potential value for each customer is determined from equation 2.
- the potential value i.e., untapped potential V P is related to the difference between how well a prospect matches to the profiles of each of the segments above the segment that the prospect is classified into .
- the process rather than multiplying the density by the average value for each segment, the process builds a plurality of valuation models (one for the customers from each segment), and multiplies the density by the score from the valuation model for each segment.
- a heuristic can be used as an alternative to producing a large number of models of probability of classification and determining the actual value for each segment.
- a heuristic based value determining process 80 can be further approximated by building a model of the probability of high value customers rather than of the entire probability density function.
- the heuristic based value determining process 80 has a model of the top n % of customers, using top customers as positive examples, and a random sample of other customers as negative examples. That is, a group of customers can be segmented into a plurality of segments based on how much money they currently spend with the organization.
- the heuristic based value determining process 80 takes the top segment (e.g., the top 10% if 10 segments) to build a model that maps their background historical information to the actual amount of money that is spent while in that top segment.
- the heuristic based value determining process 80 also builds a model that gives the probability of the prospect becoming a member of the top segment.
- the heuristic based value determining process 80 is a further approximation of potential value.
- the prospect's potential value therefore, is equal to the probability of that person moving to the top segment times the value of that person at the top segment.
- the value determining process 80 computes 82 the prospect's probability score from the top n % e.g., 10% model and multiplies 84 that by the mean value of the top n % e.g., 10% or by the output of a valuation model for that segment.
- the process 80 subtracts 86 the current value of the prospects (obtained from either a calculation or a model) , and scales 88 the difference by a factor ⁇ (which can be tuned based on sample data) .
- the result is an estimate of the potential.
- the estimate of potential can be added 90 to current value to determine the true value as in Equation 1 above.
- the process 32 (using any of the potential determining algorithms 32, 70, 80) can segment customers into a two-dimensional grid arranged as customers with high current value, high potential value, low current value, h gh potential value or who have low current value low potential value.
- the process 32 can tailor a marketing strategy to each one of them. For example, a maintenance or retention program could be geared to customers who have high current value but low potential value, whereas, prospects who have low current value but high potential value can be good candidates for aggressive marketing .
Landscapes
- Business, Economics & Management (AREA)
- Engineering & Computer Science (AREA)
- Accounting & Taxation (AREA)
- Development Economics (AREA)
- Strategic Management (AREA)
- Finance (AREA)
- Game Theory and Decision Science (AREA)
- Entrepreneurship & Innovation (AREA)
- Economics (AREA)
- Marketing (AREA)
- Physics & Mathematics (AREA)
- General Business, Economics & Management (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Description
Claims
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| AU59275/00A AU5927500A (en) | 1999-07-16 | 2000-07-11 | Data mining software to determine customer potential |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US35487299A | 1999-07-16 | 1999-07-16 | |
| US09/354,872 | 1999-07-16 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| WO2001006396A2 true WO2001006396A2 (en) | 2001-01-25 |
| WO2001006396A8 WO2001006396A8 (en) | 2002-06-13 |
Family
ID=23395268
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/US2000/018843 Ceased WO2001006396A2 (en) | 1999-07-16 | 2000-07-11 | Data mining software to determine customer potential |
Country Status (2)
| Country | Link |
|---|---|
| AU (1) | AU5927500A (en) |
| WO (1) | WO2001006396A2 (en) |
Cited By (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8965839B2 (en) | 2012-12-19 | 2015-02-24 | International Business Machines Corporation | On the fly data binning |
| CN118247026A (en) * | 2024-05-20 | 2024-06-25 | 财信证券股份有限公司 | Screening method, system, terminal and storage medium for potential customers of financial products |
| CN119599706A (en) * | 2024-12-18 | 2025-03-11 | 中国建设银行股份有限公司河北省分行 | A method and device for generating a marketing plan |
-
2000
- 2000-07-11 AU AU59275/00A patent/AU5927500A/en not_active Abandoned
- 2000-07-11 WO PCT/US2000/018843 patent/WO2001006396A2/en not_active Ceased
Non-Patent Citations (1)
| Title |
|---|
| No Search * |
Cited By (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8965839B2 (en) | 2012-12-19 | 2015-02-24 | International Business Machines Corporation | On the fly data binning |
| US8977589B2 (en) | 2012-12-19 | 2015-03-10 | International Business Machines Corporation | On the fly data binning |
| CN118247026A (en) * | 2024-05-20 | 2024-06-25 | 财信证券股份有限公司 | Screening method, system, terminal and storage medium for potential customers of financial products |
| CN119599706A (en) * | 2024-12-18 | 2025-03-11 | 中国建设银行股份有限公司河北省分行 | A method and device for generating a marketing plan |
Also Published As
| Publication number | Publication date |
|---|---|
| AU5927500A (en) | 2001-02-05 |
| WO2001006396A8 (en) | 2002-06-13 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| EP0681249B1 (en) | Fuzzy logic entity behavior profiler | |
| Sánchez et al. | Association rules applied to credit card fraud detection | |
| US8032448B2 (en) | Detecting and measuring risk with predictive models using content mining | |
| Geweke et al. | Bayesian inference for hospital quality in a selection model | |
| Finlay | Credit scoring for profitability objectives | |
| Bravo et al. | Granting and managing loans for micro-entrepreneurs: New developments and practical experiences | |
| US20200118136A1 (en) | Systems and methods for monitoring machine learning systems | |
| CN113706258A (en) | Product recommendation method, device, equipment and storage medium based on combined model | |
| CN111882420A (en) | Generation method of response rate, marketing method, model training method and device | |
| JP2003526147A (en) | Cross-correlation tool to automatically calculate portfolio description statistics | |
| CN117236996B (en) | User behavior prediction method and system based on big data analysis | |
| WO2001029692A2 (en) | Combining value and probability models in database mining | |
| Pehlivanli et al. | Detection of fraud risks in retailing sector using MLP and SVM techniques | |
| Linder et al. | Artificial neural networks, classification trees and regression: Which method for which customer base? | |
| Knight | Retail demand interdependence and chain store closures | |
| Gao et al. | Survival analysis: theory and application in finance | |
| Liu | The evaluation of classification models for credit scoring | |
| Cox et al. | The mathematics of banking and finance | |
| WO2001006396A2 (en) | Data mining software to determine customer potential | |
| Lawrence et al. | Judgement or models: The importance of task differences | |
| CN112837167A (en) | System and method for predicting high-net-value client wealth level increase probability | |
| CN111461865A (en) | Data analysis method and device | |
| Ahn | Predicting Customer Attrition Using Binge Trading Patterns: A Machine Learning Approach | |
| Foong et al. | The interaction of incentives and information disclosure: The case of confidence interval information | |
| Karimi et al. | Analyzing the impact of drug price liberalization on poverty and inequality and proposing a new framework for supporting vulnerable groups in Iran's economy on health reform |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AK | Designated states |
Kind code of ref document: A2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW |
|
| AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
| REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
| AK | Designated states |
Kind code of ref document: C1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW |
|
| AL | Designated countries for regional patents |
Kind code of ref document: C1 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
| D17 | Declaration under article 17(2)a | ||
| 122 | Ep: pct application non-entry in european phase | ||
| NENP | Non-entry into the national phase in: |
Ref country code: JP |