automatedRecLin: Record Linkage Based on an Entropy-Maximizing Classifier

The goal of 'automatedRecLin' is to perform record linkage (also known as entity resolution) in unsupervised or supervised settings. It compares pairs of records from two datasets using selected comparison functions to estimate the probability or density ratio between matched and non-matched records. Based on these estimates, it predicts a set of matches that maximizes entropy. For details see: Lee et al. (2022) <https://www150.statcan.gc.ca/n1/pub/12-001-x/2022001/article/00007-eng.htm>, Vo et al. (2023) <https://ideas.repec.org/a/eee/csdana/v179y2023ics0167947322002365.html>, Sugiyama et al. (2008) <doi:10.1007/s10463-008-0197-x>.

Version: 1.0.0
Depends: R (≥ 4.1.0)
Imports: data.table, densityratio, FixedPoint, methods, nleqslv, purrr, reclin2, stats, utils
Suggests: tinytest, xgboost
Published: 2025-11-18
DOI: 10.32614/CRAN.package.automatedRecLin (may not be active yet)
Author: Adam Struzik [aut, cre], Maciej Beręsewicz ORCID iD [aut, ctb]
Maintainer: Adam Struzik <adastr5 at st.amu.edu.pl>
BugReports: https://github.com/ncn-foreigners/automatedRecLin/issues
License: GPL-3
URL: https://github.com/ncn-foreigners/automatedRecLin, http://ncn-foreigners.ue.poznan.pl/automatedRecLin/
NeedsCompilation: no
Materials: README, NEWS
CRAN checks: automatedRecLin results

Documentation:

Reference manual: automatedRecLin.html , automatedRecLin.pdf

Downloads:

Package source: automatedRecLin_1.0.0.tar.gz
Windows binaries: r-devel: not available, r-release: not available, r-oldrel: not available
macOS binaries: r-release (arm64): not available, r-oldrel (arm64): not available, r-release (x86_64): automatedRecLin_1.0.0.tgz, r-oldrel (x86_64): automatedRecLin_1.0.0.tgz

Linking:

Please use the canonical form https://CRAN.R-project.org/package=automatedRecLin to link to this page.