Skip to content

SampleSelectR Pre-Submission #726

@szimmer

Description

@szimmer

Submitting Author Name: Stephanie Zimmer
Submitting Author Github Handle: @szimmer
Repository: https://github.com/RTIInternational/SampleSelectR
Submission type: Pre-submission
Language: English


Package: SampleSelectR
Title: Randomly select samples for various probability-based methods
Version: 1.0.0
Authors@R: c(
    person("Stephanie", "Zimmer", , "[email protected]", role = c("aut", "cre")),
    person("John David", "Bunker, Jr.", role = "aut"),
    person("Thomas", "Burkett", role = "aut"),
    person("Philip", "Lee", role = "aut"),
    person("Haby", "Sow", role = "aut"),
    person(, "RTI International", role = "fnd")
  )
Description: Randomly select samples with SRS, systematic, and various PPS methods. Also includes functionality to select within strata and allocate sample sizes.
License: GPL (>= 3)
Encoding: UTF-8
Roxygen: list(markdown = TRUE)
RoxygenNote: 7.3.2
Depends: 
    R (>= 4.1.0)
Imports: 
    data.table,
    stats,
    rlang,
    tidytable
LazyData: true
URL: https://github.com/RTIInternational/SampleSelectR, https://rtiinternational.github.io/SampleSelectR/
BugReports: https://github.com/RTIInternational/SampleSelectR/issues
Suggests: 
    testthat (>= 3.0.0)
Config/testthat/edition: 3

Scope

  • Please indicate which category or categories from our package fit policies or statistical package categories this package falls under. (Please check one or more appropriate boxes below):

    Data Lifecycle Packages

    • data retrieval
    • data extraction
    • data munging
    • data deposition
    • data validation and testing
    • workflow automation
    • version control
    • citation management and bibliometrics
    • scientific software wrappers
    • field and lab reproducibility tools
    • database software bindings
    • geospatial data
    • translation

    Statistical Packages

    • Bayesian and Monte Carlo Routines
    • Dimensionality Reduction, Clustering, and Unsupervised Learning
    • Machine Learning
    • Regression and Supervised Learning
    • Exploratory Data Analysis (EDA) and Summary Statistics
    • Spatial Analyses
    • Time Series Analyses
    • Probability Distributions
    • Survey statistics - random sampling
  • Explain how and why the package falls under these categories (briefly, 1-2 sentences). Please note any areas you are unsure of:

This did not fit exactly under any pre-defined category so I added my own. It implements statistical sampling algorithms

This has not been done

  • Who is the target audience and what are scientific applications of this package?

Statisticians who are drawing random samples from sampling frames. Primary audience is those in official statistics and public opinion research.

Some methods of sampling are included in other packages but not the Chromy sequential PPS method. We also wanted to make a unified API for sampling that we can easily add additional methods of sampling in the future.

Not applicable

  • Any other questions or issues we should be aware of?:

In early discussions of submitting to rOpenSci and discussing whether it is appropriate.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions