Implement HBase connector #6010

damiencarol · 2016-09-01T19:07:28Z

Throw an issue here as I'm working on it and will push a PR soon.

Plan to have a production ready plugin for the end of the year.

Main design choices:

Tables defined in JSON configuration files (like kafka/redis/mongodb connector)
Split aligned to region with HBase key pruning
parallelism aligned to region to scale
Manage configuration file for HBase (auto-updater)

Any advices, questions or tips are welcomes (specialy if you had started a plugin like me)

adamjshook · 2016-09-01T19:22:38Z

@damiencarol You may find some of the work for Apache Accumulo helpful, currently sitting in #5030.

damiencarol · 2016-09-01T19:29:39Z

@adamjshook yeah, I'm reading the PR code right now

adamjshook · 2016-09-01T19:44:59Z

@damiencarol I'm happy to answer any questions or give you any pointers on some BigTable-esque optimizations I've built to improve query times.

damiencarol · 2016-09-01T20:05:26Z

@adamjshook did your connector run in production ? also did you implemented insert/update/delete ?

adamjshook · 2016-09-01T20:14:03Z

@damiencarol Yes, it's been in production since March/April or so. INSERT is supported, but we use the Java APIs and some tools I've built for higher throughput. Presto doesn't support UPDATE (as far as I know), but you can issue another INSERT statement that shares the same Accumulo row ID and it effectively acts as an update. I haven't implemented DELETE yet -- haven't had a use case come up to drive the effort of implementing it.

damiencarol · 2016-09-05T16:10:10Z

First naive version here #6037 .
Please be kind with me, it is a work in progress.

yxydde · 2017-08-10T02:44:43Z

what the progress ?

nemo326 · 2017-12-27T06:06:48Z

what the progress now,please ? ~~~

ganeshjothikumar · 2018-08-28T22:24:30Z

We are exploring ways of trying to use Presto to query a HBase table. Can I get an update on where we are w.r.t HBase connector for Presto and any references for the same ?

GrigorievNick · 2018-08-29T06:55:45Z

Any news about this part?

JamesRTaylor · 2018-08-29T17:40:44Z

There's a PR for an Apache Phoenix connector here [1] which would allow you to read an HBase table. [1] #10536

…

On Tue, Aug 28, 2018 at 11:56 PM Nick ***@***.***> wrote: Any news about this part? — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub <#6010 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AF4-K5-4rZznp3otBVsWDXhEDyxucxGaks5uVjsHgaJpZM4JzBqN> .

ganeshjothikumar · 2018-08-31T00:37:52Z

@JamesRTaylor Is it fair to say the Presto -> HBase (through Phoenix connector) is fairly nascent and probably not widely used in large production systems. Looking at doing something like this for a fairly large web scale production system. So hence wanted to know how hardened this is and current usage.

JamesRTaylor · 2018-09-04T20:23:05Z

Sounds like the author of the Phoenix connector is using it in production, @ganeshjothikumar, but you should ask him to confirm. Since neither the HBase connector nor the Phoenix connector are part of Presto yet, I'd imagine that they're both similar. FWIW, the SQL abstraction and query push down that Phoenix provides will make for a better fit as a Presto connector unless you're either 1) ok with many serial, full table scans by HBase, or 2) you try to do what Phoenix is doing within the HBase connector. Neither of these is a good option IMHO.

willshen · 2018-12-17T22:52:42Z

What's the latest on this PR (i.e., is it moving forward)?

ShawshankLin · 2019-03-18T02:01:39Z

what the progress now??

stale · 2021-06-22T17:34:19Z

This issue has been automatically marked as stale because it has not had any activity in the last 2 years. If you feel that this issue is important, just comment and the stale tag will be removed; otherwise it will be closed in 7 days. This is an attempt to ensure that our open issues remain valuable and relevant so that we can keep track of what needs to be done and prioritize the right things.

Crossoverrr · 2021-10-14T06:50:13Z

https://github.com/analysys/presto-hbase-connector

damiencarol mentioned this issue Sep 1, 2016

HBase Support #3992

Closed

stale bot added the stale label Jun 22, 2021

stale bot closed this as completed Jul 21, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement HBase connector #6010

Implement HBase connector #6010

damiencarol commented Sep 1, 2016 •

edited

Loading

adamjshook commented Sep 1, 2016

damiencarol commented Sep 1, 2016

adamjshook commented Sep 1, 2016

damiencarol commented Sep 1, 2016

adamjshook commented Sep 1, 2016

damiencarol commented Sep 5, 2016

yxydde commented Aug 10, 2017

nemo326 commented Dec 27, 2017

ganeshjothikumar commented Aug 28, 2018

GrigorievNick commented Aug 29, 2018

JamesRTaylor commented Aug 29, 2018 via email

ganeshjothikumar commented Aug 31, 2018 •

edited

Loading

JamesRTaylor commented Sep 4, 2018

willshen commented Dec 17, 2018

ShawshankLin commented Mar 18, 2019

stale bot commented Jun 22, 2021

Crossoverrr commented Oct 14, 2021

Implement HBase connector #6010

Implement HBase connector #6010

Comments

damiencarol commented Sep 1, 2016 • edited Loading

adamjshook commented Sep 1, 2016

damiencarol commented Sep 1, 2016

adamjshook commented Sep 1, 2016

damiencarol commented Sep 1, 2016

adamjshook commented Sep 1, 2016

damiencarol commented Sep 5, 2016

yxydde commented Aug 10, 2017

nemo326 commented Dec 27, 2017

ganeshjothikumar commented Aug 28, 2018

GrigorievNick commented Aug 29, 2018

JamesRTaylor commented Aug 29, 2018 via email

ganeshjothikumar commented Aug 31, 2018 • edited Loading

JamesRTaylor commented Sep 4, 2018

willshen commented Dec 17, 2018

ShawshankLin commented Mar 18, 2019

stale bot commented Jun 22, 2021

Crossoverrr commented Oct 14, 2021

damiencarol commented Sep 1, 2016 •

edited

Loading

ganeshjothikumar commented Aug 31, 2018 •

edited

Loading