Generate synthetic dataset for testing

gen_datatable_synthetic(N = 2e+08, K = 100)

Arguments

N

number of rows. Defaults to 200 million

K

controls the number of unique values for id. Some ids will have K distinct values while others have N/K distinct values