Product Launch: activeH5 on-disk and in-memory data storage
Active Analytics Ltd have created their first software product, activeH5. activeH5 is an R package for big data storage and access. It allows a user to store and access very large data frames and matrices, both on file and in memory, with the data stored in chunks. On disk it uses the HDF5 file format, which allows very fast I/O. In memory, chunks of data are accessed by pointers, allowing very large data sets to be accessed and processed in R very efficiently.
Representing data as small subsets is an important feature of big data analysis, and activeH5 offers new opportunities to improve the performance of such analysis. The package is now fully tested and documented.
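To make the idea of working on small subsets concrete, here is a minimal sketch in base R. It does not use the activeH5 API, which is not shown in this announcement; it assumes plain .rds files in place of HDF5 and uses only split, saveRDS and readRDS. The chunk size, file names and example data are arbitrary choices made for illustration.

```r
# Illustrative only: base R, not the activeH5 API.
# Split a data frame into row chunks, store each chunk on disk,
# then compute a column mean by reading one chunk at a time.

set.seed(1)
df <- data.frame(x = rnorm(1000), y = runif(1000))

chunk_size <- 250                                    # rows per chunk (arbitrary)
chunk_id <- ceiling(seq_len(nrow(df)) / chunk_size)  # chunk index for each row
chunks <- split(df, chunk_id)

# Write each chunk to its own file in a temporary directory.
chunk_dir <- tempfile("chunks_")
dir.create(chunk_dir)
chunk_files <- file.path(chunk_dir,
                         sprintf("chunk_%03d.rds", seq_along(chunks)))
for (i in seq_along(chunks)) {
  saveRDS(chunks[[i]], chunk_files[i])
}

# Keep a running sum and row count so that only one chunk
# needs to be held in memory at any time.
total <- 0
n <- 0
for (f in chunk_files) {
  chunk <- readRDS(f)
  total <- total + sum(chunk$x)
  n <- n + nrow(chunk)
}
chunked_mean <- total / n

# Compare the chunked result with the mean computed on the whole data frame.
all.equal(chunked_mean, mean(df$x))
```

The same pattern applies to any statistic that can be built up from per-chunk summaries, such as sums, counts, minima and maxima. Statistics that need all the data at once, such as an exact median, require a different approach.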
There will be a series of demonstrations of the capabilities of the activeH5 package. These will be listed here:
Dr. Chibisi Chima-Okereke, R Training, Statistics and Data Analysis.