Basketball, P. (2000). From inside the P. Baseball, H. F. Spirer, & L. Spirer (Eds.), Making the Situation: Investigating Large-scale People Legal rights Violations Playing with Guidance Systems and you will Studies Studies. AAAS.
Belin, T. R., & Rubin, D. B. (1995). A method to have calibrating not true-matches costs during the checklist linkage. Journal of American Mathematical Connection, 90(430), 694–707.
Bilenko, M., & Mooney, Roentgen. J. (2003). Adaptive Copy Identification Having fun with Learnable String Similarity Actions. When you look at the KDD ’03 (pp. 39–48). ACM.
Christen, P. (2008). Automatic Checklist Linkage Playing with Seeded Nearby Neighbour and Service Vector Host Classification. Inside the KDD ’08 (pp. 151–159). ACM.
Christen, P. (2012). A survey off indexing tricks for scalable listing linkage and deduplication. IEEE Transactions into the Training and you may Investigation Engineering, 24(9), 1537–1555.
Cohen, W., Raviku). An evaluation away from string metrics having complimentary labels and you can info. Into the KDD working area for the research cleaning and you may target consolidation (Vol. step three, pp. 73–78).
Copas, J., & Hilton, F. (1990). Number linkage: Analytical patterns to possess matching pc facts. Record of the Regal Mathematical Community, Series An effective, 153(3), 287–320.
Dai, A good. Meters., & Storkey, A great. J. (2011). The new classified author-topic model to have unsupervised organization resolution. When you look at the Phony neural companies and machine studying–icann 2011 (pp. 241–249). Springer.
Fortini, Yards., Liseo, B., Nuccitelli, A beneficial., & Scanu, Yards. (2001). On Bayesian List Linkage. Lookup inside Authoritative Statistics, 4(1), 185–198.
Gutman, Roentgen., Afendulis, C., & Zaslavsky, An excellent. (2013). A good bayesian procedure of file hooking up to analyze stop- of-lives scientific costs. Diary of your Western Mathematical Connection, 108(501), 34–47.
Hsu, W., Lee, M. L., Liu, B., & Ling, T. W. (2000). Mining Mining for the Diabetics Database: Conclusions and you will Findings. Within the KDD ’00 (pp. 430–436). ACM.
A torn-mix Markov strings Monte Carlo process of brand new Dirichlet techniques mixture design
Jewell, N. P., Spagat, Yards., & Jewell, B. L. (2013). MSE and you can Casualty Matters: Presumptions, Translation, and Pressures. Into the T. B. Seybolt, J. D. Aronson, & B. Fischhoff (Eds.), Relying Civil Casualties: An introduction to Recording and you will Estimating Nonmilitary Deaths incompatible. Oxford, UK: Oxford University Force.
Larsen, M. D. (2002)ments for the Hierarchical Bayesian Checklist Linkage. For the Procedures of your own joint mathematical group meetings, point on survey browse actions (pp. 1995–2000). The latest Western Analytical Connection.
Steorts, R
Larsen, M. D. (2005). Advances into the Number Linkage Concept: Hierarchical Bayesian Number Linkage Concept. Within the Legal proceeding of your own shared analytical conferences, section into the questionnaire look strategies (pp. 3277–3284). The brand new Western Statistical Relationship.
Larsen, Yards. D., & Rubin, D. B. (2001). Iterative automated list linkage using combination patterns. Record of your Western Analytical Organization, 96(453), 32–41.
Lum, K., Rate, Meters. Elizabeth., & Finance companies, D. (2013). Programs from Numerous Options Estimation for the People Legal rights Browse. The latest Western Statistician, 67(4), 191–200.
Marchant, Letter. G., C., Kaplan, A., Rubinstein, B. We. P., & Elazar, D. N. (2019). D-blink: Distributed prevent-to-stop bayesian organization solution.
McCallum, Good., & Wellner, B. (2004). Conditional Type Term Suspicion which have Application to help you Noun Coreference. During the Improves into the sensory information handling systems (nips ’04) (pp. 905–912). MIT Push.
Miller, P. L., Frawley, S. J., & Sayward, F. G. (2000). IMM/Scrub: A website-Specific Device on the Deduplication out of Vaccination History Facts when you look at the Youth Immunization Registriesputers and Biomedical Lookup, 33(2), 126–143.
Murphy, J., Brackbill, R. Meters., Thalji, L., Dolan, M., Pulliam, P., & Walker, D. J. (2007). Calculating and you can Increasing Coverage global Exchange Heart Health Registry. Analytics in the Medication, 26(8), 1688–1701.
Murray, J. S. (2016). Probabilistic listing linkage and you may deduplication once indexing, clogging, and you will filtering. Diary away from Privacy and you may Confidentiality, 7(1), 3–24.
Newcombe, H. B., Kennedy, J. M., Axford, S. J., & James, A great. P. (1959). Automatic linkage of public information machines are often used to extract» follow-up» analytics away from family away from documents of routine suggestions. Science, 130(3381), 954–959.
Sadinle, Yards. (2014). Discovering Duplicates in a homicide Registry Having fun with a beneficial Bayesian Partitioning Strategy. Annals out-of Used Statistics, 8(4), 2404–2434.
Sariyar, Yards., Borg, A beneficial., & Pommerening, K. (2012). Effective Discovering Approaches for the fresh new Deduplication of Digital Patient Analysis Playing with Group Woods. Record regarding Biomedical Informatics, 45(5), 893–900.
C., Hallway, R., & Fienberg, S. Elizabeth. (2016). A great Bayesian Method to Visual List Linkage and you will Deduplication. Diary of American Statistical Organization, 111(516), 1660–1672.
Tancredi, Good., & Liseo, B. (2011). Gulbarga in India marriage agency An excellent hierarchical Bayesian approach to checklist linkage and you may inhabitants proportions issues. Annals off Applied Statistics, 5(2B), 1553–1585.