-
2007-10-09
10/324,281
2002-12-19
US 7,280,974 B2
2007-10-09
-
-
Romain Jeanty | Justin M Pats
2025-09-21
A system and a method for selecting potential purchasers from a historical collection of confirmed purchasers. The method allows definition of a set of purchasing variables in relation to the confirmed purchasers, and computation of both a plurality of re-purchasing ratios and a plurality of purchasing amounts using the set of purchasing variables. Potential purchasing amounts are generated by combining the previous results.
Get notified when new applications in this technology area are published.
The present invention generally relates to targeted marketing and more particularly to a system and method for selecting purchasers from a large pool of confirmed purchasers.
An important aspect of commercial marketing is the identification of potential purchasers either from new prospective customers or from confirmed purchasers that have already bought items. To better target their marketing campaigns, more and more companies use statistical models based on customer behavior. They store purchasing data including for example the date of a last purchase, the product purchased and the monetary value of the purchase.
Targeted marketing is one of the applications of predictive modeling. Some solutions are mainly focused on targeting prospective customers, i.e. customers without a first-time purchase. Other solutions are more specifically linked to product category or brand to configure promotion packages such as in U.S. Pat. No. 6,026,370 from Jermyn.
However, there is a need for a generic solution that is independent of any product category and that improves purchaser knowledge both at the individual level as well as at a general level. Such a solution would be an efficient tool for driving short and mid-term growth.
Therefore, it is an object of the present invention to provide a system and method for determining potential purchasers with respect to a confirmed purchaser profile. Another object of the invention is to determine a profile of confirmed purchasers that are most likely to respond positively to marketing solicitation.
In a preferred embodiment, the method of the invention allows selection of potential purchasers from a historical collection of confirmed purchasers, and comprises the steps of:
The invention, as well as these and other related objects and advantages thereof, will be best understood by reference to the following detailed description to be read in conjunction with the accompanying drawings.
FIG. 1 is a flow chart to illustrate the general steps according to the present invention.
FIG. 2 is a timing diagram illustrating variables to be used by the method of the invention.
FIG. 3 is a detailed flow chart of the purchasing ratio step of FIG. 1.
FIG. 4 is a detailed flow chart of the purchasing monetary step of FIG. 1.
FIG. 5 is a block diagram showing a hardware platform on which the invention may be implemented.
Referring to the drawings, and more particularly to FIG. 1, steps of the present invention are now described. Initially, a purchasing history database to be observed over a particular duration βDhistβ is collected in step 102. The historical database contains at least identifiers for each customer and purchasing data associating with the respective customer. It is to be appreciated that the term customer is employed in the description with the meaning of a confirmed purchaser, i.e. someone that has already bought at least an item having a non-nul monetary value. The purchasing data may include at least the date and the monetary value of the purchase.
In step 104, several variables to be further used by the present method are defined and preferably comprise:
In the context of the invention and as illustrated in FIG. 2, the cycle βCtβ is defined as a time unit within the time observation of the purchasing history database. It is to be noted that the duration of the cycle is set from the second cycle, and that the very first cycle βC1β is equal to an initial duration cycle.
In a commercial application, the initial duration cycle is fixed to one day, so the first cycle is equivalent to the date of the first purchase.
The generation variable allows grouping a plurality of customers βCustjβ into homogeneous generation groups βGiβ in relation to the date of their first purchase. The maximum number βNβ of generation groups depends on βDhistβ and βDgenβ, and is equal to
N = INT β‘ [ Dhist Dgen ] , ( 1 )
wherein the βINTβ function returns the largest integer value that is less than or equal to the argument of the function.
As an example, for a history database having a starting date of January 1st 1995 and an ending date of Dec. 31, 2000 (which means that Dhist=72), the maximum number of full observed cycles is equal to βT=11β cycles for a duration of the cycles βDcycleβ fixed to six months. If the generation duration βDgenβ is fixed to one month, the maximum number of generation is then equal to βN=71β. Then, for a customer having a first purchase date of Aug. 15, 1997, the generation group to which he belongs is βG32β. Each other customer that has a first purchasing date occurring during August 1997 belongs to the same generation group. It is to be appreciated that this example is just for illustration and does not limit the invention, as any other duration values could be defined according to the vendor activity.
The recency frequency class variable βRFxyβ is defined as a binary description of a customer behavior concatenated over a predefined number of cycles. The index βxyβ reflects two consecutive cycles, and βxβ and βyβ are respectively coded as a binary value of β1β if a purchase has occurred during the respective cycle and coded as β0β if not. In a preferred implementation, a four-bit coding is chosen for describing the recency frequency class over four consecutive cycles. Thereby, the purchasing prediction is based on the behavior of customers over the four last cycles.
As shown in the timing diagram 200 of FIG. 2, a different number βnGiβ of full cycles occurs during the time observation βDhistβ for each generation group βGiβ. The maximum value of βnGiβ is observed for the first generation group βG1β; for sake of clarity this will be denoted βTβ in the rest of the description. Thus to recall, the cycles list may vary from βC1β to βCTβ and the generation group list may vary from ββG1β to βGNβ.
The setting of the variables in step 102 allows customization of the variables βDGenβ and βDCycleβ to better fit the vendor field activity.
Next, two processes are performed in steps 106 and 108 taking into account the variables set in step 104.
In step 106, a re-purchasing ratio is computed for each customer. The re-purchasing ratio indicates the probability of purchasing during the next cycle. It is to be noted that several customers may have the same probability even if they belong to different generation groups. The result of the purchasing ratio operation is a plurality of ratios.
In step 108, a purchasing amount is computed for each customer. It is to be noted that several customers may have the same purchasing amount even if they belong to different generation groups. The result of the purchasing amount operation is a plurality of amounts.
Finally, in step 110 for each customer, a potential purchasing amount is estimated by cross-computing the results of the two previous steps 106 and 108. A scorecard is generated that may be displayed on a display screen. The scorecard indicates score values with respect to the recency frequency class and the cycles that may be interpreted by a user to better target a marketing campaign.
Referring now to FIG. 3, a detailed sequence 300 of the steps to generate the re-purchasing ratio is described. The sequence begins in step 302 by generating a purchasing history table taking into account the different variables set during step 104 as described above. The result is a table having a first column for identifying the customer, and a second column for indicating the generation group of the customer. The table further comprises a third column for indicating the current cycle βCtβ with βtβ varying from 1 to βnGiβ for each customer, a fourth column for indicating the recency frequency class associated with the current cycle, and a fifth column for indicating the amount spent by the customer during the current cycle.
In next step 304, a probability of re-purchasing is computed from the purchasing history table using the recency frequency class βRFxyβ that has been previously defined, according to the following formula:
P β‘ [ G i , RF xy , C t ] = N cust β‘ [ G i , RF 1 β’ x , C t + 1 ] N cust β‘ [ G i , RF xy , C t ] ( 2 )
It is to be appreciated that the recency frequency class denoted as βRF1xβ for the numerator means that a purchase has occurred during the cycle βCt+1β (coded as the β1β); it does not matter whether a purchase has occurred during the cycle βCtβ (coded as βxβ which may be β1β or β0β). Similarly, the recency frequency class denoted as βRFxyβ for the denominator means that a purchase has occurred or not during the cycle βCtβ (coded as βxβ) and that a purchase has occurred or not during the cycle βCtβ1β (coded as βyβ).
In the cases where the number of customers by cross-category of RF class, by generation group and by cycle is not enough to compute a robust probability, the recency frequency classes are preferably concatenated together.
In the next step 306, the average value of the previous re-purchasing ratios is computed over the number of generation groups. The result is then noted as:
P[RFxy,Ct]ββ(3)
wherein βtβ varies from 1 to βTβ1β.
Those skilled in the art will appreciate that the average value could be estimated with any other appropriate analytical function such as the normal or the polynomial regression for example.
Going to FIG. 4, a detailed sequence 400 of the purchasing amount step 108 of FIG. 1 is now described. The sequence begins in step 402 by generating the purchasing history table as mentioned previously with reference to step 302.
In step 404, the average values of the purchasing amounts of customers belonging to an identical generation group and having an identical recency frequency class within the same cycle are computed. The result is then noted as:
M[Gi,RFxy,Ct]ββ(4)
wherein βiβ varies from 1 to βNβ and βtβ varies from 1 to βnGiβ1β.
Finally in step 406, the average value of the previous average values of the purchasing amounts is computed over the number of generation groups. The result is noted as:
M[RFxy,Ct]ββ(5),
wherein βtβ varies from 1 to βTβ.
Furthermore, a minimum value and a maximum value of the purchasing amounts are estimated. These values refer to the boundaries of a confidence interval at 95% of the purchasing amount. The results are then noted as:
Mmin[RFxy,Ct] and Mmax[RFxy,Ct]ββ(6),
wherein βtβ varies from 1 to βTβ.
As in the method steps of FIG. 3, for the cases where the number of customers by cross-category of RF class, by generation group and by cycle is not enough to compute a robust probability, the recency frequency classes are preferably concatenated together.
The final step of sequences 300 and 400 is the potential purchasing amount computation referred to as step 110. Three matrices are then generated to respectively indicate the minimum, the maximum and the value of a potential purchasing amount for a given recency frequency class and a given cycle.
In commercial application, a customer is characterized by his recency frequency class and his cycle. From these characteristics, a minimum, a maximum and a value are thus determined using the predictive matrices.
The invention is preferably implemented as a computer process executed on a computer system, including, for example, an IBM Personal Computer (PC), and an IBM RS6000 workstation, an IBM RS6000 SP2 multiprocessor system, or on an IBM S/390 system. The system, shown in generalized form in FIG. 5, includes at least one input device 507, at least a central processor unit 501, at least one storage device storing a database 502, memory 505 and at least one output device 503. The input device 507 may be a workstation or terminal having means for operating the appropriate software for allowing a user to select the data used by the process of the invention, to set the variables, and to issue commands to control the process steps in accordance of the present invention. Processor 501 may be of a conventional type sufficient to perform the calculation and to retrieve/store information from/to the database stored in the storage device 502 and memory 505 according to signals from the input device. Storage device 502 may be any conventional storage device, such as a direct access storage device (DASD), such as a hard disk drive, a tape, compact disc (CD) read only memory (ROM), or the like, capable of storing data records therein of the type to be used in accordance with the present invention. Preferably, the customer and purchasing history records are stored in a database facilitating high speed data retrieval. As shown in FIG. 5, the system processor 501 accesses the database on storage device 502 and performs the ratio calculations to generate the potential purchasing amounts that may be output on the output unit 503. The output unit 503 may be a character or graphical display terminal, or various types of printing devices. If desired, the connections between input devices 507 and processor 501 may be bi-directional in nature, so that the input devices 507 may also serve as the output devices 503.
While the invention has been described in terms of a preferred embodiment, those skilled in the art will recognize that the invention can be practiced with modifications within the spirit and scope of the appended claims.
1. A computer implemented method for determining ranges of purchasing amounts, said method comprising:
accessing purchasing data from a historical database of confirmed purchasers such that each confirmed purchaser has purchased at least one item having a non-null monetary value, wherein the historical database comprises the purchasing data for each confirmed purchaser within a historical duration that includes consecutive generations and consecutive cycles, wherein each generation is associated with a generation group consisting of all confirmed purchasers who have made a first purchase of an item of the at least one item within the associated generation, and wherein each cycle of the consecutive cycles is a time unit for analysis of the purchasing data for each confirmed purchaser;
determining from the accessed purchasing data a plurality of re-purchasing ratios and a plurality of purchasing amounts, wherein the re-purchase ratios and the purchasing amounts are each indexed according to generation group, cycle, and recency frequency class which is a binaiy descriptor of purchases over predefined consecutive cycles;
computing an average re-purchase ratio indexed according to cycle and recency frequency class, by averaging the plurality of re-purchase ratios over the generation groups according to which the plurality of re-purchase ratios are indexed;
computing an average purchasing amount indexed according to cycle and recency frequency class, by averaging the plurality of purchasing amounts over the generation groups according to which the plurality of purchasing amounts are indexed;
computing a minimum purchasing amount and a maximum purchasing amount each indexed according to cycle and recency frequency class, by utilizing the average purchasing amount indexed according to cycle and recency frequency class and the average re-purchase ratio indexed according to cycle and recency frequency class; and
outputting, to an output device of a computing system, the average purchasing amount, the minimum purchasing amount, and the maximum purchasing amount, each indexed according to cycle and recency frequency class,
wherein the re-purchase ratio for each generation group is a ratio of A to B:
wherein A is the total number of confirmed purchasers in said each generation group, for the respective cycle, and for the respective recency frequency whose predefined consecutive cycles comprises a latest cycle being a next cycle following the respective cycle; and
wherein B is the total number of confirmed purchasers in said each generation group, for the respective cycle, and for the respective recency frequency whose predefined consecutive cycles comprises a latest cycle being the respective cycle.
2. Computer implemented method for determining ranges of purchasing amounts, said method comprising:
accessing purchasing data from a historical database of confirmed purchasers such that each confirmed purchaser has purchased at least one item having a non-null monetary value, wherein the historical database comprises the purchasing data for each confirmed purchaser within a historical duration that includes consecutive generations and consecutive cycles, wherein each generation is associated with a generation group consisting of all confirmed purchasers who have made a first purchase of an item of the at least one item within the associated generation, and wherein each cycle of the consecutive cycles is a time unit for analysis of the purchasing data for each confirmed purchaser;
determining from the accessed purchasing data a plurality of re-purchasing ratios and a plurality of purchasing amounts, wherein the re-purchase ratios and the purchasing amounts are each indexed according to generation group, cycle, and recency frequency class which is a binary descriptor of purchases over predefined consecutive cycles;
computing an average re-purchase ratio indexed according to cycle and recency frequency class, by averaging the plurality of re-purchase ratios over the generation groups according to which the plurality of re-purchase ratios are indexed;
computing an average purchasing amount indexed according to cycle and recency frequency class, by averaging the plurality of purchasing amounts over the generation groups according to which the plurality of purchasing amounts are indexed;
computing a minimum purchasing amount and a maximum purchasing amount each indexed according to cycle and recency frequency class, by utilizing the average purchasing amount indexed according to cycle and recency frequency class and the average re-purchase ratio indexed according to cycle and recency frequency class; and
outputting, to an output device of a computing system, the average purchasing amount, the minimum purchasing amount, and the maximum purchasing amount, each indexed according to cycle and recency frequency clas, wherein the recency frequency is a concatenation of binary values respectively corresponding to different cycles of the predefined consecutive cycles, wherein the binary value for the respective cycle is 1 if a purchase of one or more items of the at least one item occurred in the respective cycle, and wherein the binary value for the respective cycle is 0 if a purchase of one or more items of the at least one item did not occur in the respective cycle.