Integration with Ultrawarm - Follow up #97

kaituo · 2020-05-01T05:39:23Z

Issue #, if available:

Description of changes:
This is a follow up PR to address comments.

See context in #95

Testing done:

gradle build passes
Verified AD runs only in hot nodes.
stats API and HourlyCron still work.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

wnbts · 2020-05-01T16:38:51Z

src/main/java/com/amazon/opendistroforelasticsearch/ad/util/DiscoveryNodeFilterer.java

+
+    static class HotDataNodePredicate implements Predicate<DiscoveryNode> {
+        @Override
+        public boolean test(DiscoveryNode discoveryNode) {


Question. Just to confirm this if-and-only-if behavior, since it differs from the last change: are data nodes that are marked, but not marked hot, ineligible? If a data node is marked with some unknown new value, it would be ineligible under this rule.

Currently, there are only hot and warm data nodes.
Previously, if a data node was marked as warm, we ignored it.
Now, if a data node is marked as hot or not marked at all, we use it.
If a data node is marked with some unknown new value in the future, it will be ineligible.

Suggestion. Since this business logic has important effects on system behavior, it should be documented. Also, the documentation at line 49 is outdated after this change.
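For reference, the eligibility rule discussed in this thread can be sketched as below. This is only an illustration, not the code in the PR: `NodeInfo` is a hypothetical stand-in for `DiscoveryNode`, and the attribute key `box_type` and the value `hot` are assumptions (the PR reads its constants from `CommonName`). The data-node check is likewise inferred from the predicate's name.

```java
import java.util.Map;
import java.util.function.Predicate;

// Hypothetical stand-in for Elasticsearch's DiscoveryNode, reduced to the two
// properties the rule needs so the sketch compiles without Elasticsearch.
final class NodeInfo {
    final boolean dataNode;
    final Map<String, String> attributes;

    NodeInfo(boolean dataNode, Map<String, String> attributes) {
        this.dataNode = dataNode;
        this.attributes = attributes;
    }
}

// A data node is eligible if and only if it is marked hot or not marked at all.
final class HotDataNodeRule implements Predicate<NodeInfo> {
    // Assumed attribute key and value, for illustration only.
    static final String BOX_TYPE_KEY = "box_type";
    static final String HOT_BOX_TYPE = "hot";

    @Override
    public boolean test(NodeInfo node) {
        if (!node.dataNode) {
            return false;
        }
        String boxType = node.attributes.get(BOX_TYPE_KEY);
        // Unmarked nodes count as hot; warm or unknown values are rejected.
        return boxType == null || HOT_BOX_TYPE.equals(boxType);
    }
}
```

Under this sketch, a node marked with any value other than `hot` (including a value introduced later) is rejected, which matches the behavior confirmed above.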

wnbts · 2020-05-01T16:42:44Z

src/main/java/com/amazon/opendistroforelasticsearch/ad/util/DiscoveryNodeFilterer.java

+     * @return whether we should use this node for AD
+     */
+    public boolean isEligibleNode(DiscoveryNode node) {
+        return new HotDataNodePredicate().test(node);


Issue. This predicate should be instantiated once, perhaps injected, and reused across calls rather than instantiated on every call.

Good point. Changed.
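One way the reuse could look is sketched below. The class name `ReusingNodeFilter` and the type parameter `N` (standing in for `DiscoveryNode`) are hypothetical, and the actual change in the follow-up commit may differ.

```java
import java.util.function.Predicate;

// Hypothetical, generic stand-in for DiscoveryNodeFilterer: N plays the role
// of DiscoveryNode so the sketch compiles without Elasticsearch.
final class ReusingNodeFilter<N> {
    // Created once by the caller and shared by every call.
    private final Predicate<N> eligibleNodeFilter;

    ReusingNodeFilter(Predicate<N> eligibleNodeFilter) {
        this.eligibleNodeFilter = eligibleNodeFilter;
    }

    boolean isEligibleNode(N node) {
        // No allocation here: the injected predicate is reused.
        return eligibleNodeFilter.test(node);
    }
}
```

Injecting the predicate through the constructor also lets a test substitute a different rule without touching the filter class.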

wnbts · 2020-05-01T16:44:17Z

src/main/java/com/amazon/opendistroforelasticsearch/ad/util/DiscoveryNodeFilterer.java

+        ClusterState state = this.clusterService.state();
+        final List<DiscoveryNode> eligibleNodes = new ArrayList<>();
+        final HotDataNodePredicate eligibleNodeFilter = new HotDataNodePredicate();
+        for (DiscoveryNode node : state.nodes()) {


Suggestion. A stream would improve efficiency (no intermediary data structures or extra variables to create and operate on) and readability.

It is a matter of personal coding style. Thanks for the suggestion.
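To make the two styles concrete, here is a minimal side-by-side sketch. `EligibleNodeSelection` and its method names are hypothetical, `N` stands in for `DiscoveryNode`, and the node collection is assumed to be an `Iterable`, as the for-each loop in the diff suggests.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Predicate;
import java.util.stream.Collectors;
import java.util.stream.StreamSupport;

final class EligibleNodeSelection {
    // Loop form, in the shape of the diff above.
    static <N> List<N> withLoop(Iterable<N> nodes, Predicate<N> eligible) {
        final List<N> eligibleNodes = new ArrayList<>();
        for (N node : nodes) {
            if (eligible.test(node)) {
                eligibleNodes.add(node);
            }
        }
        return eligibleNodes;
    }

    // Stream form, as suggested in the review comment.
    static <N> List<N> withStream(Iterable<N> nodes, Predicate<N> eligible) {
        return StreamSupport.stream(nodes.spliterator(), false)
            .filter(eligible)
            .collect(Collectors.toList());
    }
}
```

Both methods make a single pass over the nodes and return the eligible ones in iteration order, so in this sketch the choice between them is mostly one of readability.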

wnbts · 2020-05-01T16:52:50Z

src/main/java/com/amazon/opendistroforelasticsearch/ad/util/DiscoveryNodeFilterer.java

+
+import com.amazon.opendistroforelasticsearch.ad.constant.CommonName;
+
+public class DiscoveryNodeFilterer {


Minor. Class documentation is missing. For public classes and methods, the responsibilities should be summarized for clients and readers.

sohami · 2020-05-01T16:50:00Z

src/main/java/com/amazon/opendistroforelasticsearch/ad/util/DiscoveryNodeFilterer.java

+     *   model partitions to all data nodes in the cluster randomly, which
+     *    could cause a model performance downgrade issue once warm nodes
+     *     are throttled due to resource limitations. The PR excludes warm nodes to place model partitions.
+     * @return an array of eligible data nodes


Minor: indentation is not needed in Javadoc.

sohami · 2020-05-01T16:51:56Z

src/main/java/com/amazon/opendistroforelasticsearch/ad/ml/ModelManager.java

@@ -119,7 +119,7 @@ public String getName() {
    /**
     * Constructor.
     *
-     * @param clusterStateUtils cluster info
+     * @param nodeFilter utility class to select nodesr


Minor: typo, "nodesr" should be "nodes".

sohami · 2020-05-01T17:00:14Z

src/main/java/com/amazon/opendistroforelasticsearch/ad/util/DiscoveryNodeFilterer.java

+
+import com.amazon.opendistroforelasticsearch.ad.constant.CommonName;
+
+public class DiscoveryNodeFilterer {


I don't see any UT for this class. Would be great to add some.

We have a coverage tool that checks coverage (75% line and 60% branch coverage). Without enough coverage, the build fails. This class is covered by its callers' tests.

* Integration with Ultrawarm (#95)

Ultrawarm introduces warm nodes into the ES cluster. Currently, we distribute model partitions to all data nodes in the cluster randomly, which could cause a model performance downgrade issue once warm nodes are throttled due to resource limitations. The PR excludes warm nodes to place model partitions.

Since index shards are hosted on hot nodes, AD's coordinating nodes are in hot nodes as well. We don't need to send HourlyCron job and stats requests to warm nodes anymore. This PR implements those changes.

Testing done:
1. Verified AD runs only in hot nodes.
2. stats API and HourlyCron still work.

* Integration with Ultrawarm - Follow up (#97)

This is a follow up PR to address comments.

Testing done:
1. gradle build passes
2. Verified AD runs only in hot nodes.
3. stats API and HourlyCron still work.

Integration with Ultrawarm - Follow up

840fc07

kaituo requested a review from wnbts May 1, 2020 05:39

wnbts reviewed May 1, 2020

sohami reviewed May 1, 2020

Address comments

837b44f

wnbts approved these changes May 4, 2020

sohami approved these changes May 4, 2020

kaituo merged commit fbc8a4e into opendistro-for-elasticsearch:opendistro-1.4 May 4, 2020
