You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is there a motivation to support RleListMatrix? For the same use case as above, I'm using VariantAnnotation to build a CompressedVcf object and it has matrices of lists. The list elements are in many cases NA so it may be efficient to be able to store these as an Rle-derived object. I can't go as far as to verify that such a structure would benefit from Rle - would the elements be sufficiently contiguous?
My workaround at the moment is to collapse the list elements into single delimited strings, in which case DelayedArray or RleMatrix work out of the box. In this case the string concatenation results in the matrix object decreasing in size by a factor of ~8 (potentially due to global string pooling). Converting to RleMatrix reduces it again by an additional factor of ~16. Total compression from matrix of lists to character RleMatrix is 128x. If RleListMatrix was able provide a comparable benefit without converting to string then that could be very useful.
I'll link another issue to this one specific to VariantAnnotation, but I thought I'd check if this was a) possible; b) useful; and c) of interest.
Ping @lawremi who first proposed investigating support for this structure.
The text was updated successfully, but these errors were encountered:
Related to #27, though I note that the following now works
Created on 2020-02-28 by the reprex package (v0.3.0)
Is there a motivation to support
RleListMatrix
? For the same use case as above, I'm usingVariantAnnotation
to build aCompressedVcf
object and it has matrices of lists. The list elements are in many cases NA so it may be efficient to be able to store these as anRle
-derived object. I can't go as far as to verify that such a structure would benefit fromRle
- would the elements be sufficiently contiguous?My workaround at the moment is to collapse the list elements into single delimited strings, in which case
DelayedArray
orRleMatrix
work out of the box. In this case the string concatenation results in the matrix object decreasing in size by a factor of ~8 (potentially due to global string pooling). Converting toRleMatrix
reduces it again by an additional factor of ~16. Total compression from matrix of lists to characterRleMatrix
is 128x. IfRleListMatrix
was able provide a comparable benefit without converting to string then that could be very useful.I'll link another issue to this one specific to
VariantAnnotation
, but I thought I'd check if this was a) possible; b) useful; and c) of interest.Ping @lawremi who first proposed investigating support for this structure.
The text was updated successfully, but these errors were encountered: