disk.frame 0.7.2
disk.frame 0.7.1
- Bugfixes
- And ability to convert to Parquet
disk.frame 0.7
- Partitioned by folder
- Updated R version to 4
disk.frame 0.6.1
- Fixed bug with data.table syntax
- Auto detection of srckeep in group by
- Global detection for group by and summarise
disk.frame 0.6
- Much better NSE support in disk.frame!
- removed
hard_arrange
and hard_group_by
- various API updates
disk.frame 0.5
disk.frame 0.4.1
- removed use of
sysctl
which was violating CRAN policy
disk.frame 0.4.0
- Removed
count
and tally
- Fixed package compatibility
disk.frame 0.3.7
- Bugfix - add_chunk with date column not working
- Minor - added compression and other aprameters to
add_chunk
disk.frame 0.3.6
- General update to dplyr v1
- Remove all
_all/if/at
functions for dplyr v1
- Updated
pull
function to conform to dplyr
disk.frame 0.3.5
- Support for !!! in summarise; github #250
- Added
dplyr::pull
verb
disk.frame 0.3.4
- fixed vignette bug and all vignette are now in static pdf
- fixed bugs with group-by when run with Rscript
- also fixed bugs with multiple group-by columns
disk.frame 0.3.3
- removed bloom filters as they were causing issues with CRAN solaris system for some reason
disk.frame 0.3.2
- removed usage proc/meminfo inline with CRAN policy
- added (experimental) bloomfilter
disk.frame 0.3.1
- urgent bug fix for group-by failing when the number chunks is 1; see Github #241
disk.frame 0.3.0
- experimental one-stage group-by framework!
- bug fixes for data.table trigger by integration with tidyfast
- removed assertthat from imports
- add benchmarkme to Suggests
disk.frame 0.2.1
- got rid of benchmarkme as a dependency
- added
hard_arrange
thanks to Jacky Poon
- added more .progress options for joins
- Using data.table::getDTthreads() as default number of workers
- multisession instead of multiprocess as default backend for data.table
- added support for R3.4
- fixed df_get_ram for R < 3.6
disk.frame 0.2.0
- deprecated group_by, arrange, summarise
- add chunk_group_by, chunk_arrange, chunk_summarise
- fit GLMs with
dfglm
- fixed so that dplyr function also work in mutate even with ~ in the name
- fixed disk.frame so that in works in functions too
disk.frame 0.1.1
- Allowed
map
to accept multiple arguments. Thanks Knut Jägersberg for suggestion
- Fixed bug where if the CSV is larger than RAM then it fails by adding {LaF} backend
- Added {bigreadr} backend for reading large files by first splitting the file. This is the default behaviour
- Added {LaF} and {readr} chunk readers to
csv_to_disk.frame
- fixed
write_disk.frame(...., shardby)
and other shardby
functions including rechunk
and shard
- added
df_ram_size()
to accurate determine RAM size for RStudio in R3.6
- added
show_ceremony
and show_boiler_plate
to show setup code