-
2006-08-22
10/144,021
2002-05-13
US 7,095,875 B2
2006-08-22
-
-
Joseph Mancuso | Craig Kronenthal
2024-01-01
A method to arbitrate between results obtained from observationally derived data by several procedures, where the results are items characteristic of the observationally derived data is disclosed. Each procedure is given a ranking according to the confidence on the procedure. The results and characteristics derived from the results are used in a plurality of rules, where the rules are used to arbitrate between the results.
Get notified when new applications in this technology area are published.
G06K9/00 IPC
Methods or arrangements for recognising patterns
This application claims priority of U.S. Provisional Application No. 60/290,892, entitled âAddress Recognition Results Arbitrationâ, filed on May 15, 2001, which is hereby incorporated by reference herein.
This invention relates generally to the arbitration between results obtained by more than one method for matching observationally derived data to data from a number of databases, and, more particularly to arbitrating or deciding between address recognition methods for automatically recognizing addresses on items in an automated delivery system.
Due to the ever increasing volume of postal items and packages being delivered, postal services and delivery services are increasingly relying on optical character recognition to recognize the addresses of the items to be delivered. As address recognition performance levels improve, it becomes increasingly difficult to make significant advances in performance with a single address recognition system. Consequently, two or more independent address recognition systems are applied and a method is required to arbitrate competing results from these systems to select on a case by case basis the result most likely to be accurate.
Present methods and systems for arbitrating competing results from two or more independent address recognition systems utilize a small subset of result characteristics. Present methods and systems use the level of âtrustâ associated with the address recognition engine that produced the result as the primary arbitration factor. In addition present methods and systems also use a coarse indication of an alphanumerical âpostal codeâ (for example, ZIP Code in the U.S. Postal Service) matching against a limited set of results when from one or two other systems.
A method and system for arbitrating competing results from address recognition systems that are extendable to an arbitrary number of address recognition systems and which utilize a number of the characteristics of the results from the address recognition systems are needed.
A method and system to arbitrate between results obtained from observationally derived data by several procedures, where the results are items characteristic of the observationally derived data are disclosed. In the method, each procedure is given a ranking according to the confidence on the procedure. The results and characteristics derived from the results are used in a plurality of rules, where the rules are used to arbitrate between the results. The method generically comprises the steps of:
The method can also include, but is not limited to, the steps of: comparing the results to a number of known incorrect results, and, discarding any of the results that matches any one of said known incorrect results, in which case, only the non-discarded results are arbitrated.
In one embodiment, observationally derived data is data derived from mail pieces, such as addresses, and the result is an alphanumerical code, such as a ZIP Code in the U.S. Postal Service. However, the invention need not be limited to delivery systems for addressed pieces and could be applied to any system where arbitration of results between several procedures is required.
For a better understanding of the present invention, together with other and further objects thereof, reference is made to the accompanying drawings and detailed description and its scope will be pointed out in the appended claims.
FIG. 1 is a flowchart of an embodiment of the method of this invention;
FIG. 2 is a graphical representation of an envelope as input to an address recognition system;
FIG. 3 is a flowchart of an embodiment of the method of this invention applied to arbitration of results of address recognition; and
FIG. 4 is a block diagram of an embodiment of a system of this invention used for arbitration of results of address recognition.
A method to arbitrate between results obtained from observationally derived data by several procedures, where the results are items characteristic of the observationally derived data is disclosed. An example of procedures that generate results from observationally derived data are address recognition systems utilizing optical character recognition recognizing the addresses of the items to be delivered in a postal system or a delivery system. The method of this invention is extendable to an arbitrary number of procedures and utilizes a number of the characteristics of the results from the procedures.
A flowchart of an embodiment of the method of this invention is shown in FIG. 1. Results from several procedures 10 are compared to data from one of several databases 40 and classified according to a level of accuracy at which it matches a datum from one of the several databases 40 (step 70, FIG. 1).
The method can include a pre-processing series of steps in which the results 10 are compared to known incorrect results 30 (steps 50 and 60, FIG. 1). Some of procedures can generate known incorrect results as part of the normal course of operation of the procedure. For example, a procedure can have access to several databases of a different precision. Results from such a procedure that are inconsistent when compared to data bases of a different precision are considered incorrect results. In the first of the pre-processing steps, the results 10 are compared to known incorrect results 30 (step 50, FIG. 1). Any of the results 10 that matches one of the known incorrect results 30 is discarded (step 60, FIG. 1). The non discarded results are then compared to data from one of several databases 40 and classified according to a level of accuracy at which it matches a datum from one of the several databases 40 (step 70, FIG. 1).
Each result 10 corresponding to one observationally derived datum is compared to results 10 for the same observationally derived datum obtained from other ones of the procedures (step 80, FIG. 1). At least one indicator, indicating a degree of success of the comparison of results for each result corresponding to the same observationally derived datum, is assigned to each result 10 (step 90, FIG. 1).
Each procedure has a given ranking 120 according to the confidence on that procedure. An indicator of the ranking of confidence of the procedure that produced the result (hereafter called a confidence value) is assigned to each result (step 130, FIG. 1).
A data structure (not shown) can be constructed including the results, an identifier for each of the procedures that produced the results, the ranking according to confidence for each of the procedures, the confidence value for each result, and, at least one indicator for each result indicating the degree of success of the comparison of results for each result.
A set of rules 20, aimed at hierarchically obtaining a maximum number of successful results, is utilized in the arbitration process. The rules codify known practices and desired procedures for obtaining successful results or eliminating unsuccessful results. The rules utilize as inputs the results 10 and characteristics derived from the results, the characteristics including the at least one indicator and the results of the comparison of results (performed in step 80, FIG. 1). The results 10 (not including the discarded ones if pre-processing was performed), the at least one indicator, indicating the degree of success of the comparison of results for each result, the confidence value for each result 10 and selected ones of the rules 20 are inputs to an algorithm applying the selected rules to arbitrate between results corresponding to the same observationally derived datum but obtained by different procedures (step 100, FIG. 1). The outcome is the determination, from the arbitration, of a procedure, from which results most likely to be accurate are obtained (step 110, FIG. 1). Another indicator indicative of the outcome of applying the selected rules is appended to each result (step 120, FIG. 1). An embodiment of the method as applied to arbitrating competing results from two or more independent address recognition systems used in postal systems or package delivery systems is described below.
When two or more independent address recognition systems utilizing optical character recognition are used in postal services or package delivery services, the result used for arbitration is an alphanumerical code characterizing each address. (In the U.S. Postal Service, the alphanumerical code is a ZIP Code, a five, nine or eleven digit code, the first five of which are most commonly used.) In such systems, an addressed piece, such as envelope 135 shown in FIG. 2, is scanned by scanning means and the scanned output is provided as input to the address recognition systems utilizing optical character recognition. The optical character recognition (OCR) modules provide data on the delivery address 140, the alphanumerical code 150 (herein after referred to as the ZIP Code) and the return address 160 as well as other possible items on the addressed piece 135, such as a bar code (not shown).
A flowchart of an embodiment of the method of this invention applied to arbitration of results of address recognition systems is shown in FIG. 3. Result ZIPs from several procedures 210 are compared to data from one of several databases 240 and classified according to a level of accuracy at which it matches a datum from one of the several databases 40 (step 270, FIG. 3). Step 270 is hereinafter referred to as classifying as to sort depth.
The method can include a pre-processing series of steps in which the result ZIPs 210 are compared to known âBad ZIPsâ 230 (steps 250 and 260, FIG. 3). Some address recognition systems are able to assert as part of their input to the arbitration process âbadâ ZIP Codes developed internally as a possible delivery address, but rejected for specific attributes that indicate an error situation. For example, an address recognition system may have access to multiple address databases 240 of different precision. One address database (low precision) might represent a range between starting and ending address numbers on a street where a second database (high precision) lists the addresses only for existing dwellings on the street. An address recognition system with access to both databases can provide as input to the arbitration process ZIPS which are âbadâ in the sense that they are inconsistent with the high precision database, but are likely outcomes from engines that have access only to a low precision database. In the first of the pre-processing steps, the results 10 are compared to known incorrect results 30 (step 50, FIG. 1). Any of the result ZIPs 210 that matches one of the known âBadâ ZIPs 230 is discarded (step 260, FIG. 3). The non discarded ZIPs are then compared to data from one of several databases 240 and classified according to âsort depthâ (step 270, FIG. 3).
Each result ZIP 210 corresponding to one observationally derived datum is compared to results 210 for the same observationally derived datum obtained from other ones of the procedures (step 280, FIG. 3). The arbitration method considers the quantity and extent of matches and mismatches between ZIP results from different address recognition procedures (also referred to as systems). For USA ZIP Codes, a distinction is made in the arbitration process between mismatches in four sub-fields of the ZIP Code:âHigh order 3 digits (which identify a large region, sometimes incorporating multiple cities or towns)âHigh order 5 digits (typically unique within a single city) High order 9 digits (typically unique to a carrier route)â11 digits (identifying a specific delivery point). As an example of arbitration process sensitivity to the extent of mismatch, the process considers mismatches in the high order 3 digits as a possible indication that one of the engines has asserted a return address result instead of the intended delivery address. At least one indicator, indicating a degree of success of the comparison of results for each result corresponding to the same observationally derived datum, is assigned to each result ZIP 210 (step 290, FIG. 3).
Implementations of such indicators are a 9-Match Count and a 5-Match Count defined below. The 9-Match Count is obtained by comparing 9 digits of each Result ZIP 210 to result ZIPs 210 from other processes, including ZIPS of lower or finest depth. The 9-Match is incremented for each match against a ZIP created by a different database matching technique (hereinafter referred to as a DLU). Each different DLU is counted only once. The 5-Match Count is defined by comparing 5 digits of each Result ZIP 210 to Result ZIPs 210 from other processes. Increment the 5-Match for each match against a ZIP created by a different Address Block Location (ABL) algorithm.
Each procedure has a given ranking 120 according to the confidence on that procedure. An indicator of the ranking of confidence of the procedure that produced the result (hereafter called a confidence value) is assigned to each result ZIP (step 330, FIG. 3).
A data structure can be constructed including the result ZIPs 210, an identifier for each of the procedures that produced the results, the ranking 120 according to confidence for each of the procedures, the 5-Match Count, the 9-Match Count, and the sort depth. An implementation of the data structure is shown in Table 1 below.
| TABLE 1 |
| Result structure |
| Sort | |||||||||
| Depth | |||||||||
| (Finest, | |||||||||
| and | 5- | 9- | |||||||
| Result | Finalized | DLU | ABL | enumerated | Match | Match | |||
| ZIP | ZIP | Source | Class | Subclass | type | type | types) | Count | Count |
| DS | 0 | 0 | RCR | RCR | ||
| Charâ | 0 | 1 | RCR | MLOCR | ||
| M | ||||||
| Charâ | 0 | 2 | RCR | RCP | ||
| C | ||||||
| B | 0 | 3 | B | B | ||
| Gray | ||||||
| CSAR | 1 | 0 | USPS | RCP or | ||
| or B | MLOCR | |||||
| D | 2 | 0 | B | MLOCR | ||
| M | 2 | 1 | USPS | MLOCR | ||
| C | 2 | 2 | USPS | RCP | ||
The Class and Subclass items in the Result structure of Table 1 are indicators of the quality or precision of the results expected (or confidence for each of the procedures). Class 0 indicates the highest precision or highest confidence results; Class 2 indicates a lower precision or lower confidence result. That is, the lower the class number the higher the accuracy (confidence) of the result. Subclasses serve to provide a finer ranking of results according to precision (confidence). Subclass 0 indicates the highest precision (confidence) among the subclasses; Subclass 2 indicates a lower precision (confidence) result among the subclasses. Thus, class 0 and subclass 0 would produce the ranking of highest confidence. Class 2, subclass 2 indicates a lower precision (confidence) result (the lowest shown in Table 1). The indicator of the ranking of confidence of the procedure that produced the result, the confidence value, can be obtained from the class, subclass values.
A set of rules 20, aimed at hierarchically obtaining a maximum number of successful results, is utilized in the arbitration process. The rules codify known practices and desired procedures for obtaining successful results or eliminating unsuccessful results. The result ZIPs 210 (not including the discarded ones if pre-processing was performed), the at least one indicator, indicating the degree of success of the comparison of results for each result, the confidence value for each result 310 and selected ones of the rules 20 are inputs to an algorithm applying the selected rules to arbitrate between results corresponding to the same observationally derived datum but obtained by different procedures (step 300, FIG. 3). The outcome is the determination, from the arbitration, of a procedure, from which results most likely to be accurate are obtained (step 310, FIG. 3). A second indicator, a Delivery Print Code (DPC), indicative of the outcome of applying the selected rules is appended to each result (step 320, FIG. 3). As an outcome of applying the selected rules, a ZIP Code is selected corresponding to the result selected from arbitration among procedures. This ZIP Code provides a major datum for address recognition.
One embodiment of the rules is given below.
Return-to-Sender/Forwarding Detection:
Selected ones of the above rules are applied together according to a decision algorithm in order arbitrate between the results of several independent address recognition systems applied to the same data. The implementations given below operate on an ordered sequence of selected ones of the above rules.
Algorithm A1
Rule Sequence: 1, 21, 18, 7, 3, 6, 9, 13
Comments:
The rules and algorithms given above are not an exhaustive list. Other rules and algorithms can be applied and other characteristics of the results can be utilized as further inputs to the rules. For example, counts for mismatches of varying significance (5 digit, 9 digit) between competing results could be utilized as inputs to the rules or the algorithms. Similarly, a second confidence value associated with each result, asserted by the procedure that produces the ZIP result, could also be used as input to the rules or the algorithms. The second confidence value would be asserted by the procedure based on information regarding the observationally derived data (perhaps based on knowing that OCR results themselves were poor for that mail piece image, or that words in the address are apparently misspelled). The second confidence value would provide another input that could be utilized by an algorithm to determine a procedure, from which results most likely to be accurate are obtained.
The above algorithms implement a sequential application of selected ones of the above rules in order to codify known operational or observed performance. A known method for achieving the same is through the application of the techniques of artificial intelligence. The algorithms and rules are already in a form akin to an inference engine and can be cast into a rule based system for decisions or an âExpert Systemâ (see, for example, P. Harmon, D. King, Expert SystemsâArtificial Intelligence in Business, ISBN 0-471-80824-5, 1985). Following more recent trends in artificial intelligence, the algorithms can be cast in the form of a neural network (see, for example, S. K. Rogers, M. Kabrisky, An Introduction to Biological and Artificial Neural Networks for Pattern Recognition, ISBN 0-8194-0534-5, 1991) or a genetic algorithm (also known as an evolutionary algorithm). In either the neural network or the genetic algorithm, the rules can be generated by the neural network or the genetic algorithm. It should be apparent that, as other methods are developed in artificial intelligence or machine aided cognition, the algorithms can also be cast in the form of those methods.
The above method for arbitration between results obtained by more than one method for matching observationally derived data to data from a number of databases can be implemented as a computer readable code embodied in a computer usable or readable medium.
A block diagram of an embodiment of a system 400 of this invention used for arbitration of results of address recognition is shown in FIG. 4. The scanned output is provided as digitized input 420 to the address recognition system. A processor 410 operates on the input 420 and, utilizing the OCR systems implemented as computer readable code stored in a memory 460 and the database 425 generates address results. The processor 410, then, executes the arbitration method implemented as a computer readable code embodied in the memory 460. Other Memory 440 is used for other system functions (for example, control of scanning and transport, data and event logging, user interface, and other âhousekeepingâ functions) and could be implemented by means of any computer readable media. An interface to electromechanical systems 470 allows communication and control of scanning and transport functions. Such an interface could be specific to the actual design of the scanning and transport functions but such interfaces are well known in the art.
In general, the techniques described above may be implemented, for example, in hardware, software, firmware, or any combination thereof. The techniques described above may be implemented in one or more computer programs executing on a programmable computer including a processor, a storage medium readable by the processor (including, for example, volatile and non-volatile memory and/or storage elements), at least one input device, and at least one output device. Program code may be applied to data entered using the input device to perform the functions described and to generate output information. The output information may be applied to one or more output devices.
Elements and components described herein may be further divided into additional components or joined together to form fewer components for performing the same functions.
Each computer program within the scope of the claims below may be implemented in any programming language, such as assembly language, machine language, a high-level procedural programming language, or an object-oriented programming language. The programming language may be a compiled or interpreted programming language.
Each computer program may be implemented in a computer program product tangibly embodied in a computer-readable storage device for execution by a computer processor. Method steps of the invention may be performed by a computer processor executing a program tangibly embodied on a computer-readable medium to perform functions of the invention by operating on input and generating output.
Common forms of computer-readable or usable media include, for example, a floppy disk, a flexible disk, hard disk, magnetic tape, or any other magnetic medium, a CDROM, any other optical medium, punched cards, paper tape, any other physical medium with patterns of holes, a RAM, a PROM, and EPROM, a FLASH-EPROM, any other memory chip or cartridge, a carrier wave, or any other medium from which a computer can read.
Although the invention has been described with respect to various embodiments, it should be realized this invention is also capable of a wide variety of further and other embodiments within the spirit and scope of the appended claims.
1. A method to arbitrate between results to be arbitrated in order to effectuate a desired result, said results to be arbitrated obtained by one of a plurality of procedures from observationally derived data, each of said results to be arbitrated having a representative characteristic, said representative characteristic herinafter referred to as a type, said method comprising the steps of:
comparing the results to be arbitrated to a plurality of known incorrect results; and
discarding any of said results to be arbitrated that matches any one of said known incorrect results;
applying, after discarding any of said results to be arbitrated, steps comprising;
a. ranking each of the procedures according to confidence on each of said procedures;
b. classifying each of the results to be arbitrated according to a level of accuracy at which each of said results to be arbitrated matches a datum from a database of a plurality of databases;
c. comparing each of the results to be arbitrated to each other;
d. generating rules, said rules utilizing as inputs said results to be arbitrated and information derived from said results to be arbitrated;
e. selecting at least one of said rules;
f. applying said at least one of said rules according to an algorithm;
g. selecting, from the application of said at least one of said rules, said desired result, said desired result being one of the results to be arbitrated; and said results to be arbitrated being of a same type of result;
âsteps (a) through (g) being applied only to said results to be arbitrated that are not discarded.
2. The method of claim 1 further comprising the step of:
appending an indicator to each result to be arbitrated, said indicator indicating an outcome of applying said at least one rule.
3. The method of claim 1 further comprising the steps of:
obtaining at least one indicator for each result to be arbitrated, said at least one indicator indicating a degree of success of said comparative step for each observationally derived datum.
4. The method of claim 3 further comprising the steps of:
creating a data structure, said data structure comprising:
said results to be arbitrated;
said desired result;
one other indicator indicating said ranking according to confidence for each of said results to be arbitrated;
said at least one indicator for each of the results.
5. The method of claim 1 further comprising the step of:
assigning to each of said results to be arbitrated yet another indicator of said ranking of confidence of the procedure that produced said result to be arbitrated.
6. The method of claim 1 wherein said observationally derived data comprises data derived from addressed pieces and wherein said result to be arbitrated comprises an alphanumerical code.
7. The method of claim 1 wherein said rules comprise detecting specific information in observationally derived data.
8. The method of claim 1 wherein said algorithm is implemented using techniques of artificial intelligence.
9. A method to arbitrated between results to be arbitrated in order to effectuate a desired result, each of said results to be arbitrated obtained by one of a plurality of procedures from observationally derived data, the method comprising the steps of:
a. ranking each of the procedures according to confidence on each of said procedures;
b. classifying each of the results to be arbitrated according to a level of accuracy at which each of said results to be arbitrated matches a datum from a database of plurality of databases;
c. comparing each of the results to be arbitrated to each other;
d. generating rules, said rules utilizing as inputs said results to be arbitrated and information derived from said results to be arbitrated;
e. selecting at least one of said rules;
f. applying said at least one of said rules according to an algorithm;
g. selecting, from the application of said at least one of said rules, said desired result, said desired result being one of the results to be arbitrated;
âsaid results to be arbitrated being of a same type of result;
âwherein said observationally derived data comprises data derived from addressed pieces and wherein said result to be arbitrated comprises an alphanumerical code, and
h. obtaining at least one indicator for one of said alphanumerical codes, said one of said alphanumerical codes resulting from applying one of said procedures to specific observationally derived data, said at least one indicator indicating the number of alphanumerical codes resulting from applying other ones of said plurality of procedures to said specific observationally derived data that are congruent to said one of said alphanumerical codes for a given number digits of said one of said alphanumerical codes.
10. The method of claim 9 wherein said rules comprise detecting return or forwarding information in said data derived from addressed pieces.
11. The method of claim 9 wherein said rules comprise detecting a lack of congruency of a group of digits in each one of said alphanumerical codes obtained from specific observationally derived data by several ones of said plurality of procedures.
12. The method of claim 9 wherein said rules comprise detecting congruency of a group of digits in each one of said alphanumerical codes obtained from specific observationally derived data by several ones of said plurality of procedures.
13. The method of claim 9 wherein said rules comprise ignoring, when more than one result to be arbitrated is obtained corresponding to one observationally derived datum by applying a plurality of procedures, the results to be arbitrated corresponding to said one observationally derived datum and obtained from other ones of said plurality of procedures having a lower ranking.
14. The method of claim 9 wherein said rules comprise a relationship between a time period, said time period comprising the difference between when an addressed piece was sent and when said observationally derived data was obtained, and a probability of inaccuracy of said results obtained from the observationally derived data corresponding to said addressed piece.
15. The method of claim 9 wherein said rules comprise utilizing knowledge of details of each of said plurality of procedures to determine a confidence level to each said comparison of each result to be arbitrated corresponding to one observationally derived datum to another result to be arbitrated for said one observationally derived datum obtained from another one of said plurality of procedures.
16. A computer program product comprising:
a computer usable medium having computer readable code embodied therein for arbitrating between results to be arbitrated in order to effectuate a desired result, each one of said results to be arbitrated obtained from observationally derived data by one procedure from a plurality of procedures, each of said results to be arbitrated having a representative characteristic, said representative characteristic herinafter referred to as a type and each of said results being of a same type of result, said code causing a computer system to:
compare the results to be arbitrated to a plurality of known incorrect results;
discard any of said results to be arbitrated that matches any one of said known incorrect results;
(a) rank each of the procedures according to confidence on each said procedure;
(b) classify each of the results to be arbitrated according to a level of accuracy at which each of the results to be arbitrated matches a datum from a database of a pluralities of databases;
(c) compare each of the results to be arbitrated to each other;
(d) select at least one rule, said at least one rule utilizing as inputs said results to be arbitrated and information derived from said results to be arbitrated;
(e) apply said at least one rule according to an algorithm;
(f) select, by applying said selected at least one rules, said desired result, said desired result being one of the results to be arbitrated;
said code being capable of causing said computer system to apply steps (a) through (f) to said results to be arbitrated that are not discarede.
17. The computer program product of claim 16 where the computer readable code further causes a computer system to:
append an indicator to each result to be arbitrated, said indicator indicating an outcome of applying said at least one rule.
18. The computer program product of claim 16 where the computer readable code further causes a computer system to:
obtain at least one indicator for each result to be arbitrated, said at least one indicator indicating a degree of success of said comparative step.
19. The computer program product of claim 18 where the computer readable code further causes a computer system to:
create a data structure, said data structure comprising:
said results to be arbitrated;
said desired result;
one other indicator indicating said ranking according to confidence for each of said results to be arbitrated;
said at least one other indicator for each result to be arbitrated.
20. The computer program product of claim 16 wherein said observationally derived data comprises data derived from addressed pieces and wherein said result to be arbitrated comprises an alphanumerical code.
21. The computer program product of claim 16 where the computer readable code further causes a computer system to:
generate said at least one rule.
22. The computer program product of claim 21 wherein said algorithm is implemented using techniques of artificial intelligence.
23. The computer program product of claim 16 wherein said algorithm is implemented using techniques of artificial intelligence.
24. A computer program product comprising:
a computer usable medium having computer readable code embodied therein for arbitrating between results to be arbitrated in order to effectuate a desired result, each one of said results to be arbitrated obtained from observationally derived data by one procedure from a plurality of procedures, said code being capable of causing a computer system to:
(a) rank each of the procedures according to confidence one each said procedure;
(b) classify each of the results to be arbitrated according to a level of accuracy at which each of the results to be arbitrated matches a datum from a database of a pluralities of databases;
(c) compare each of the results to be arbitrated to each other;
(d) select at least one rule, said at least one rule utilizing as inputs said results to be arbitrated and information derived from said results to be arbitrated;
(e) apply said at least one rule according to an algorithm;
(f) select, by applying said selected at least one rule, said desired result, said desired result being one of the results to be arbitrated, wherein said observationally derived data comprises data derived from addressed pieces and wherein said results to be arbitrated comprises an alphanumerical code; and
(g) obtain at least one indicator for one of said alphanumerical codes, said one of said alphanumerical codes resulting from applying one of said procedures to specific observationally derived data, said at least one indicator indicating the number of alphanumerical codes resulting from applying other ones of said plurality of procedures to said specific observationally derived data that are congruent to said one of said alphanumerical codes for a given number digits of said one of said alphanumerical codes.
25. A system including arbitration between results to be arbitrated in order to effectuate a desired result, each one of said results to be arbitrated obtained by one procedure from a plurality of procedures from observationally derived data, the result to be arbitrated being items characteristic of the observationally derived data, each of said results to be arbitrated having a representative characteristic, said representative characteristic hereinafter referred to as a type, and each of said results to be arbitrated being of a same type of result, said system comprising:
at least one processor;
at least one computer readable memory having instructions embodied therein, said instructions causing said at least one processor to:
compare the results to be arbitrated to a plurality of known incorrect results;
discard any of said results to be arbitrated that matches any one of said known incorrect results
said instructions causing said at least one processor for said results to be arbitrated that are not discarded to;
rank each of the procedures according to confidence on each said procedure;
classify each of the results to be arbitrated according to a level of accuracy at which it matches a datum from a database of a plurality of databases;
compare each of the results to be arbitrated to each other;
select at least one rule, said at least one rule utilizing as inputs said results to be arbitrated and characteristics derived from said results to be arbitrated;
apply said at least one rule according to an algorithm;
select, by applying said selected at least one rule, said desired result, said desired result being one of the results to be arbitrated.
26. The system of claim 25 wherein the instructions embodied in the at least one computer readable memory further cause said at least one processor to:
append an indicator to each result to be arbitrated, said indicator indicating an outcome of applying said at least one rule.
27. The system of claim 25 wherein the instructions embodied in the at least one computer readable memory further cause said at least one processor to:
obtain at least one indicator for each of the results to be arbitrated, said at least one indicator indicating a degree of success of said comparison of results to be arbitrated for each of the results to be arbitrated.
28. The system of claim 27 wherein the instructions embodied in the at least one computer readable memory further cause said at least one processor to:
create a data structure, said data structure comprising:
said results to be arbitrated;
said desired result obtained by applying said selected at least one rule;
one other indicator indicating said ranking according to confidence for each of said results to be arbitrated;
said at least one indicator for each of the results to be arbitrated.
29. The system of claim 25 wherein said observationally derived data comprises data derived from addressed pieces and wherein said results said results to be arbitrated comprise alphanumerical codes.
30. The system of claim 25 wherein the instructions embodied in the at least one computer readable memory further cause said at least one processor to:
generate said at least one rule.
31. The system of claim 30 wherein said algorithm is implemented using techniques of artificial intelligence.
32. The system of claim 25 wherein said algorithm is implemented using techniques of artificial intelligence.
33. A system including arbitration between results to be arbitrated in order to effectuate a desired result, each one of said results to be arbitrated obtained by one procedure from a plurality of procedures from observationally derived data, the result to be arbitrated being items characteristic of the observationally derived data, said system comprising:
at least one processor;
at least one computer readable memory having instructions embodied therein, said instructions causing said at least one processor to:
rank each of the procedures according to confidence on each said procedure;
classify each of the results to be arbitrated according to a level of accuracy at which it matches a datum from a database of a plurality of databases;
compare each of the results to be arbitrated to each other;
select at least one rule, said at least one rule utilizing as inputs said results to be arbitrated and characterisitcs derived from said results to be arbitrated;
apply said at least one rule according to an algorithm;
select by applying said selected at least one rules, said desired result;
wherein said observationally derived data comprises data derived from addressed pieces and wherein said results said results to be arbitrated comprise alphanumerical codes; and
obtain at least one indicator for one of said alphanumerical codes, said one of said alphanumerical codes resulting from applying one of said procedures to specific observationally derived data, said at least one indicator indicating the number of alphanumerical codes resulting from applying other ones of said plurality of procedures to said specific observationally derived data that are congruent to said one of said alphanumerical codes for a given number digits of said one of said alphanumerical codes.