disk.frame 0.2.1 Unreleased

  • got rid of benchmarkme as a dependency
  • added hard_arrange
  • added more .progress options for joins
  • Using data.table::getDTthreads() as default number of workers
  • multisession instead of multiprocess as default {future} backend

disk.frame 0.2.0 2019-10-05

  • deprecated group_by, arrange, summarise
  • added chunk_group_by, chunk_arrange, chunk_summarise
  • fit GLMs with dfglm
  • fixed so that dplyr functions also work in mutate, even with ~ in the name
  • fixed disk.frame so that it works in functions too

disk.frame 0.1.1 2019-09-15

  • Allowed map to accept multiple arguments. Thanks to Knut Jägersberg for the suggestion
  • Fixed bug where reading a CSV larger than RAM failed, by adding a {LaF} backend
  • Added {bigreadr} backend for reading large files by first splitting the file. This is the default behaviour
  • Added {LaF} and {readr} chunk readers to csv_to_disk.frame
  • fixed write_disk.frame(...., shardby) and other shardby functions including rechunk and shard
  • added df_ram_size() to accurately determine RAM size for RStudio in R 3.6
  • added show_ceremony and show_boiler_plate to show setup code