29 Oct 2023
Golf ball, P. (2000). Inside P. Baseball, H. F. Spirer, & L. Spirer (Eds.), Deciding to make the Instance: Exploring Large-scale Individual Rights Abuses Playing with Information Expertise and you will Analysis Data. AAAS.
Belin, T. R., & Rubin, D. B. (1995). A strategy getting calibrating untrue-fits costs when you look at the number linkage. Journal of your American Mathematical Association, 90(430), 694–707.
Bilenko, M., & Mooney, R. J. (2003). Adaptive Backup Identification Playing with Learnable String Resemblance Measures. When you look at the KDD ’03 (pp. 39–48). ACM.
Christen, P. (2008). Automatic Checklist Linkage Having fun with Seeded Nearby Neighbour and you will Assistance Vector Machine Class. When you look at the KDD ’08 (pp. 151–159). ACM.
Christen, P. (2012). A study out of indexing approaches for scalable listing linkage and you may deduplication. IEEE Deals to your Training and Investigation Technology, 24(9), 1537–1555.
Cohen, W., Raviku). A comparison from string metrics to tavata KambodЕѕa-naisia own complimentary labels and facts. Within the KDD workshop with the research cleanup and you may object integration (Vol. step three, pp. 73–78).
Copas, J., & Hilton, F. (1990). Checklist linkage: Statistical models to own coordinating computers facts. Log of the Regal Statistical Area, Show A good, 153(3), 287–320.
Dai, An excellent. Yards., & Storkey, An effective. J. (2011). The newest classified author-topic model to own unsupervised organization resolution. Inside Artificial sensory networking sites and you will servers understanding–icann 2011 (pp. 241–249). Springer.
Fortini, Meters., Liseo, B., Nuccitelli, An effective., & Scanu, Yards. (2001). On Bayesian Checklist Linkage. Research in Formal Statistics, 4(1), 185–198.
Gutman, R., Afendulis, C., & Zaslavsky, A. (2013). A great bayesian means of file connecting to research stop- of-existence medical can cost you. Log of one’s Western Statistical Organization, 108(501), 34–47.
Hsu, W., Lee, Yards. L., Liu, B., & Ling, T. W. (2000). Exploration Mining from inside the Diabetic patients Databases: Conclusions and you can Results. Inside KDD ’00 (pp. 430–436). ACM.
A torn-blend Markov chain Monte Carlo procedure of the new Dirichlet techniques blend model
Jewell, N. P., Spagat, Yards., & Jewell, B. L. (2013). MSE and you will Casualty Counts: Assumptions, Translation, and you may Challenges. During the T. B. Seybolt, J. D. Aronson, & B. Fischhoff (Eds.), Counting Civilian Casualties: An introduction to Recording and you may Quoting Nonmilitary Deaths in conflict. Oxford, UK: Oxford College Press.
Larsen, Yards. D. (2002)ments to your Hierarchical Bayesian Number Linkage. For the Proceedings of your combined statistical group meetings, part with the survey research strategies (pp. 1995–2000). Brand new Western Statistical Association.
Larsen, M. D. (2005). Advances inside the Record Linkage Theory: Hierarchical Bayesian Record Linkage Principle. Into the Procedures of the mutual statistical group meetings, point for the questionnaire search strategies (pp. 3277–3284). The fresh new Western Analytical Organization.
Larsen, Meters. D., & Rubin, D. B. (2001). Iterative automated list linkage using mix habits. Journal of Western Statistical Association, 96(453), 32–41.
Lum, K., Rates, M. Elizabeth., & Banking companies, D. (2013). Applications away from Several Possibilities Estimate in the Individual Liberties Look. New American Statistician, 67(4), 191–200.
Marchant, Letter. Grams., C., Kaplan, A., Rubinstein, B. We. P., & Elazar, D. Letter. (2019). D-blink: Delivered stop-to-prevent bayesian organization quality.
McCallum, A great., & Wellner, B. (2004). Conditional Different types of Title Uncertainty with App in order to Noun Coreference. When you look at the Enhances inside the sensory recommendations handling systems (nips ’04) (pp. 905–912). MIT Drive.
Miller, P. L., Frawley, S. J., & Sayward, F. Grams. (2000). IMM/Scrub: A website-Particular Tool toward Deduplication off Vaccination Records Information in Teens Immunization Registriesputers and Biomedical Lookup, 33(2), 126–143.
Murphy, J., Brackbill, R. M., Thalji, L., Dolan, M., Pulliam, P., & Walker, D. J. (2007). Computing and Enhancing Exposure worldwide Trade Cardio Health Registry. Analytics within the Drug, 26(8), 1688–1701.
Murray, J. S. (2016). Probabilistic listing linkage and you will deduplication after indexing, blocking, and you will filtering. Journal away from Privacy and Privacy, 7(1), 3–24.
Newcombe, H. B., Kennedy, J. Yards., Axford, S. J., & James, A great. P. (1959). Automatic linkage regarding public information hosts can be used to pull” follow-up” statistics off parents off records out-of techniques details. Research, 130(3381), 954–959.
Sadinle, Yards. (2014). Finding Copies within the a murder Registry Playing with a beneficial Bayesian Partitioning Method. Annals out of Used Statistics, 8(4), 2404–2434.
Sariyar, Yards., Borg, A great., & Pommerening, K. (2012). Active Understanding Approaches for the newest Deduplication out of Electronic Diligent Investigation Using Group Woods. Journal off Biomedical Informatics, 45(5), 893–900.
C., Hall, R., & Fienberg, S. Age. (2016). A great Bayesian Method to Graphical Checklist Linkage and you can Deduplication. Diary of one’s American Statistical Association, 111(516), 1660–1672.
Tancredi, A good., & Liseo, B. (2011). A great hierarchical Bayesian method of checklist linkage and population dimensions issues. Annals off Applied Statistics, 5(2B), 1553–1585.