Returns the list of tunable thresholds used by frame(). Every field can
be overridden through the ... argument of frame(). The defaults are
chosen to be conservative on small data and to scale to millions of rows
through progressive subsampling.
Fields
min_obsMinimum complete cases required to screen a pair.
strong_threshold,moderate_threshold,weak_thresholdAbsolute correlation cut-offs used to classify pair strength.
near_constant_ratioMaximum allowed share of the dominant value before a column is flagged as near-constant.
id_unique_ratioIf a character column has more than this share of unique values, it is treated as an identifier.
skew_thresholdAbsolute skewness above which a column is flagged as right- or left-skewed.
subsample_thresholdRow count above which numeric pairs are screened by progressive subsampling.
subsample_probe,subsample_confirmProbe and confirmation sample sizes for the two-stage subsampling.
compositional_cvMaximum coefficient of variation of
x + yfor the pair to be flagged as a constrained complement.observer_min_levelsMinimum number of categorical levels before a strong group effect counts as an observer-style concern.
min_level_nLevels with fewer than this many rows are flagged as rare.
seedSeed used by the subsampling layer.
adjustmentOptional column names to partial out before numeric pair screening.