Product Launch activeH5 on-disk and in-memory data storage

Active Analytics Ltd: posted 26 Jan 2014 10:43 by Chibisi Chima-Okereke [ updated 22 Mar 2014 08:33 ]

Introduction

Active Analytics Ltd have created their first software product called activeH5. activeH5 is an R package for big data storage and access, it allows a user to store and access very large data frames and matrices both on file and in memory and data is stored in chunks. It uses the HDF5 file format on disk allowing very fast I/O speed on disk. In memory chunks of data are accessed by pointers allowing very large data sets to be accessed & processed in R very efficiently.

Representation of data as small subsets is an important feature of big data analysis and activeH5 offers new opportunities to improve the performance of big data analysis. The package is now fully tested and documented.

Demonstrations

There will be a series of demonstrations of the capabilities of the activeH5 package. These will be listed here:

  1. Big data chain ladder analysis

Data Science Consulting & Software Training

Active Analytics Ltd. is a data science consultancy, and Open Source Statistical Software Training company. Please contact us for more details or to comment on the blog.

Dr. Chibisi Chima-Okereke, R Training, Statistics and Data Analysis.