Package: groupdata2 2.0.5

Ludvig Renbo Olsen

groupdata2: Creating Groups from Data

Methods for dividing data into groups. Create balanced partitions and cross-validation folds. Perform time series windowing and general grouping and splitting of data. Balance existing groups with up- and downsampling or collapse them to fewer groups.

Authors: Ludvig Renbo Olsen [aut, cre]

groupdata2_2.0.5.tar.gz
groupdata2_2.0.5.zip (r-4.7), groupdata2_2.0.5.zip (r-4.6), groupdata2_2.0.5.zip (r-4.5)
groupdata2_2.0.5.tgz (r-4.6-any), groupdata2_2.0.5.tgz (r-4.5-any)
groupdata2_2.0.5.tar.gz (r-4.7-any), groupdata2_2.0.5.tar.gz (r-4.6-any)
groupdata2_2.0.5.tgz (r-4.6-emscripten)

# Install 'groupdata2' in R:
install.packages('groupdata2', repos = c('https://ludvigolsen.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker: https://github.com/ludvigolsen/groupdata2/issues

balance, cross-validation, data, data-frame, fold, group-factor, groups, participants, partition, split, staircase

9.01 score | 26 stars | 7 packages | 472 scripts | 1.6k downloads | 22 exports | 26 dependencies

Last updated from: 56cdc19b77. Checks: 9 OK. Indexed: yes.

Target                 Result  Time
linux-devel-x86_64     OK      261
source / vignettes     OK      195
linux-release-x86_64   OK      266
macos-release-arm64    OK      185
macos-oldrel-arm64     OK      172
windows-devel          OK      192
windows-release        OK      199
windows-oldrel         OK      170
wasm-release           OK      119

Exports: %primes%, %staircase%, all_groups_identical, balance, collapse_groups, collapse_groups_by_ids, collapse_groups_by_levels, collapse_groups_by_numeric, collapse_groups_by_size, differs_from_previous, downsample, find_missing_starts, find_starts, fold, group, group_factor, partition, ranked_balances, splt, summarize_balances, summarize_group_cols, upsample

Dependencies: backports, checkmate, cli, cpp11, dplyr, generics, glue, lifecycle, magrittr, numbers, pillar, pkgconfig, plyr, purrr, R6, Rcpp, rearrr, rlang, stringi, stringr, tibble, tidyr, tidyselect, utf8, vctrs, withr

Cross-validation with groupdata2
Introduction | groupdata2 functions in focus | What is cross-validation? | Why training and test sets? | The data | Creating train/test sets | What is leakage? | Creating folds for cross-validation | Cross-validation | Cross-validation function | Linear regression models | Outro

Last update: 2022-11-19
Started: 2017-04-28

Description of groupdata2
Installing groupdata2 | CRAN version | GitHub development version | Attach packages | General information | group_factor() | group() | splt() | fold() | partition() | collapse_groups() | balance() | Groups, windows or folds? | Introduction vignettes | Use of kable() | Grouping Methods | Method: 'greedy' | Method: 'n_dist' (Default) | Method: 'n_fill' | Method: 'n_last' | Method: 'n_rand' | Method: 'l_sizes' | Method: 'l_starts' | Skipping | Automatically find group starts | find_starts() | find_missing_starts() | Method: 'every' | Method: 'staircase' | Find remainder - %staircase% | Method: 'primes' | Find remainder - %primes% | Balancing ID Methods | ID method: 'n_ids' | ID method: 'n_rows_c' | ID method: 'distributed' | ID method: 'nested' | Arguments | Grouping arguments | data | n | method | starts_col | force_equal | allow_zero | descending | randomize | col_name | remove_missing_starts | k | p | cat_col | num_col | id_col | id_aggregation_fn | extreme_pairing_levels | num_fold_cols | unique_fold_cols_only | max_iters | handle_existing_fold_cols | parallel | list_out | Balancing arguments | size | id_method | mark_new_rows | new_rows_col_name | Using Functions | Extra arguments showcase | Examples of method differences | n_ methods | Vector with 57 elements divided into 6 groups | Vector with 117 elements divided into 11 groups | Greedy | Vector with 100 elements with sizes of 8, 15, 20 | Staircasing | Vector with 1000 elements with step sizes of 2, 5, 11 | Primes | Vector with 1000 elements with n (start at) as 2, 5, 11 | The End

Last update: 2022-11-19
Started: 2017-01-22

Time series with groupdata2
Windowing time series | Attach packages | Load time series data | Reduce number of datapoints | Staircase groups | Outro

Last update: 2022-11-19
Started: 2017-04-28

Automatic groups with groupdata2
Introduction | Attach packages | Data | Outro

Last update: 2021-07-03
Started: 2017-04-28

Introduction to groupdata2
Introduction | Use cases | Outro

Last update: 2021-07-03
Started: 2017-01-25