Space efficiency #33

Open
drevell opened this issue Aug 7, 2012 · 2 comments

drevell commented Aug 7, 2012

We spend a lot of space on storing row keys. We should figure out a way to reduce this. Possibilities:

  • Combine multiple values under a single key
  • Use variable-width key fields
  • Pack key fields across byte boundaries, so that each field is allocated a number of bits rather than a whole number of bytes

We'll want to preserve backward compatibility.
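
As a rough illustration of the third possibility, here is a minimal sketch of packing fixed-width fields at bit granularity. It is not datacube's actual key format: the function names `pack_fields` and `unpack_fields` and the bit widths are hypothetical, and the sketch assumes every field is a non-negative integer whose maximum width is known in advance.

```python
def pack_fields(values, bit_widths):
    """Pack non-negative integers into bytes, giving values[i] exactly bit_widths[i] bits.

    Hypothetical helper for illustration only; not part of datacube.
    """
    if len(values) != len(bit_widths):
        raise ValueError("values and bit_widths must have the same length")
    acc = 0
    total_bits = 0
    for value, width in zip(values, bit_widths):
        if value < 0 or value >= (1 << width):
            raise ValueError(f"value {value} does not fit in {width} bits")
        # Append this field directly after the previous one, ignoring byte boundaries.
        acc = (acc << width) | value
        total_bits += width
    n_bytes = (total_bits + 7) // 8
    # Left-align the fields and zero-pad the unused low bits of the last byte.
    acc <<= n_bytes * 8 - total_bits
    return acc.to_bytes(n_bytes, "big")


def unpack_fields(key, bit_widths):
    """Inverse of pack_fields: recover the field values from a packed key."""
    total_bits = sum(bit_widths)
    n_bytes = (total_bits + 7) // 8
    # Drop the zero padding that pack_fields added at the low end.
    acc = int.from_bytes(key, "big") >> (n_bytes * 8 - total_bits)
    values = []
    for width in reversed(bit_widths):
        values.append(acc & ((1 << width) - 1))
        acc >>= width
    values.reverse()
    return values
```

With this layout, three fields of 3, 5, and 12 bits fit in 3 bytes, where rounding each field up to whole bytes would take 4. Existing byte-aligned keys would not be readable by `unpack_fields`, so a real change along these lines would still need some way to tell old keys from new ones.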

eonnen commented Aug 7, 2012

Do we really though? Won't compression at the block level kick in and effectively remove a lot of the redundancy in keys?
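
To make the question concrete, here is a small sketch that measures how well a general-purpose compressor handles a run of keys sharing a long prefix. It uses Python's `zlib` purely as a stand-in for HBase block compression, and the key layout in `make_keys` is a made-up example, not datacube's real row key format, so the ratio it reports says nothing definite about a real table.

```python
import zlib


def make_keys(n, prefix=b"cube1|hourly|"):
    """Build n hypothetical row keys: a shared prefix plus an 8-byte big-endian counter."""
    return [prefix + i.to_bytes(8, "big") for i in range(n)]


def compressed_ratio(keys):
    """Return compressed size divided by raw size for the concatenated keys."""
    raw = b"".join(keys)
    return len(zlib.compress(raw)) / len(raw)
```

Calling `compressed_ratio(make_keys(1000))` gives the fraction of the original size that remains after compression; the more redundancy adjacent keys share, the smaller that fraction.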

drevell commented Aug 7, 2012

Yeah, we can expect HBase compression to remove a lot of the cruft. In theory, though, datacube is not HBase-specific :)

This is something I'll code on my own time rather than UA's time since space consumption isn't a problem for us, and we don't have any plans for non-HBase backends.
