You are not currently logged in.
Access JSTOR through your library or other institution:
Outcome Probabilities for a Record Matching Process with Complete Invariant Information
Journal of the American Statistical Association
Vol. 62, No. 318 (Jun., 1967), pp. 454-469
Stable URL: http://www.jstor.org/stable/2283974
Page Count: 16
You can always find the topics here!Topics: Probabilities, Unbiased estimators, Conditional probabilities, Population estimates, Classification systems, Statistical estimation, Statistics, Information retrieval, Information classification, Population size
Were these topics helpful?See somethings inaccurate? Let us know!
Select the topics that are inaccurate.
Preview not available
Record matching processes, which compare sets of identifying information, to decide whether or not a pair of records relate to the same individual or population item, are basic in a wide range of applications in social research, maintenance of files and information retrieval. Such processes may be conveniently described in terms of matching a single incoming record against a master file or list. In order to evaluate different record matching processes, in terms of matching costs and error losses, it is necessary to evaluate the outcome probabilities. It is shown that this can be done for a simple model which assumes that the information used for matching is complete and invariant but, possibly, insufficient to distinguish between all population items, by considering only the class-size probability distributions. The latter can be estimated directly from the list or from a sub-sample drawn from it, by the application of Goodman's  results concerning the estimation of the number of classes in a population. The outcome probabilities can then be evaluated by considering the incoming record as randomly drawn from the list, if it should match some item on the list, and as added to the list, to form a new list with approximately the same classification probabilities, if it ought to match no item on the list. A numerical example illustrates an application of the model.
Journal of the American Statistical Association © 1967 American Statistical Association