Skip to content

Conversation

@debugger87
Copy link
Contributor

@debugger87 debugger87 commented Jul 16, 2017

What changes were proposed in this pull request?

  • Add a private method registerCurrentOperationLog in SparkExecuteStatementOperation
  • Call registerCurrentOperationLog before execute() and unregister it after execute() like implementation in SQLOperation.java#L204

How was this patch tested?

Add unittest in HiveThriftBinaryServerSuite

build/mvn -Phive-thriftserver -Dtest=none -DwildcardSuites=org.apache.spark.sql.hive.thriftserver.HiveThriftBinaryServerSuite test

HiveThriftBinaryServerSuite tests passed

@debugger87
Copy link
Contributor Author

@cloud-fan Could you please help me to review this PR? Thanks a lot!

@cloud-fan
Copy link
Contributor

ok to test

@SparkQA
Copy link

SparkQA commented Jul 17, 2017

Test build #79657 has finished for PR 18649 at commit b5c963f.

  • This patch fails Scala style tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@debugger87
Copy link
Contributor Author

@cloud-fan I just fix scala style issue, pls re-test again.

@SparkQA
Copy link

SparkQA commented Jul 17, 2017

Test build #79659 has finished for PR 18649 at commit fa4ba3d.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@debugger87
Copy link
Contributor Author

@cloud-fan Any suggestions?

@cloud-fan
Copy link
Contributor

cc @jerryshao

@jerryshao
Copy link
Contributor

So it looks like a basic copy-paste from hive code to support this feature in STS (like what I did before to support spnego). It looks fine to me based on my limited knowledge :).

@debugger87
Copy link
Contributor Author

@jerryshao Yes, it's just copied from SQLOperation in Hive. However, those code lines are the key point that HiveServer2 can return operation log to client via TFetchResultsReq which fetchType is FetchType.LOG. If hive-thriftserver in Spark SQL support complete HiveServer2 protocol, registerCurrentOperationLog should be introduced and consistent with Hive.

@jerryshao
Copy link
Contributor

Sorry I'm not familiar with this part, I cannot give you a valid comment, you could ask others to help reviewing your patch 😄 .

@debugger87
Copy link
Contributor Author

debugger87 commented Jul 19, 2017

@gatorsmile Could you please help me to review this PR?

@debugger87
Copy link
Contributor Author

@cloud-fan This patch is very simple and clear, is there any problem to review or merge it?

@cloud-fan
Copy link
Contributor

I'm not familiar with this code and don't know who I should ping, cc @srowen do we have a maintainer for thrift-server?

@debugger87
Copy link
Contributor Author

@cloud-fan @srowen If we can't find maintainer of hive-thriftserver in Spark, I may have to close this PR in few days later.

@cloud-fan
Copy link
Contributor

CC @vanzin

@vanzin
Copy link
Contributor

vanzin commented Jul 24, 2017

Really not familiar with this area of the code...

@debugger87
Copy link
Contributor Author

cc @davies

@dilipbiswal
Copy link
Contributor

@debugger87 Hello,
There have been a few posts about hive, where this setting causes too many open files issue.
https://community.hortonworks.com/questions/48351/hiveserver2-hive-users-nofile-ulimit-above-64000.html
there are a few concerns on this setting causing too many open file handles..
Does this also impact spark the same way or we have taken care of it already.

@debugger87
Copy link
Contributor Author

debugger87 commented Aug 15, 2017

@dilipbiswal

Thanks for your reply!
In my eyes, there have been some mechanism or configuration to control the number of opening files generated by SQL Operation. e.g:

hive.server2.idle.session.check.operation
hive.server2.session.check.interval
hive.server2.idle.session.timeout
hive.server2.idle.operation.timeout
hive.server2.logging.operation.enabled

If we mind the increase of opening files, we can adjust those configurations. However, in spark sql, there is no any methods or solutions to generate operation logs to provide the standard API: FetchResults with FetchType.LOG.

@SparkQA
Copy link

SparkQA commented Aug 16, 2017

Test build #80744 has finished for PR 18649 at commit 542e427.

  • This patch fails Scala style tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@gatorsmile
Copy link
Member

@liufengdb

@AmplabJenkins
Copy link

Can one of the admins verify this patch?

@debugger87
Copy link
Contributor Author

#19721 Fixed the same issue, i will close it.

@debugger87 debugger87 closed this Jan 29, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

8 participants