Commit 9e66a18

update spark-k8s-anomaly-detection-taxi-data demo documentation
1 parent 5911d01 commit 9e66a18

1 file changed: +24 -24 lines changed

docs/modules/demos/pages/spark-k8s-anomaly-detection-taxi-data.adoc

Lines changed: 24 additions & 24 deletions
@@ -59,28 +59,26 @@ image::spark-k8s-anomaly-detection-taxi-data/overview.png[]
 
 To list the installed Stackable services run the following command:
 
-// TODO(Techassi): Update console output
-
 [source,console]
 ----
 $ stackablectl stacklet list
-PRODUCT NAME NAMESPACE ENDPOINTS EXTRA INFOS
-
-hive hive spark-k8s-ad-taxi-data hive 172.18.0.2:31912
-metrics 172.18.0.2:30812
-
-hive hive-iceberg spark-k8s-ad-taxi-data hive 172.18.0.4:32133
-metrics 172.18.0.4:32125
-
-opa opa spark-k8s-ad-taxi-data http http://172.18.0.3:31450
-
-superset superset spark-k8s-ad-taxi-data external-superset http://172.18.0.2:31339 Admin user: admin, password: adminadmin
-
-trino trino spark-k8s-ad-taxi-data coordinator-metrics 172.18.0.3:32168
-coordinator-https https://172.18.0.3:31408
 
-minio minio-trino spark-k8s-ad-taxi-data http http://172.18.0.3:30589 Third party service
-console-http http://172.18.0.3:31452 Admin user: admin, password: adminadmin
+┌──────────┬───────────────┬───────────┬───────────────────────────────────────────────┬─────────────────────────────────┐
+│ PRODUCT ┆ NAME ┆ NAMESPACE ┆ ENDPOINTS ┆ CONDITIONS │
+╞══════════╪═══════════════╪═══════════╪═══════════════════════════════════════════════╪═════════════════════════════════╡
+│ hive ┆ hive ┆ default ┆ ┆ Available, Reconciling, Running │
+├╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┤
+│ hive ┆ hive-iceberg ┆ default ┆ ┆ Available, Reconciling, Running │
+├╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┤
+│ opa ┆ opa ┆ default ┆ ┆ Available, Reconciling, Running │
+├╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┤
+│ superset ┆ superset ┆ default ┆ external-http http://172.18.0.2:30562 ┆ Available, Reconciling, Running │
+├╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┤
+│ trino ┆ trino ┆ default ┆ coordinator-metrics 172.18.0.2:31980 ┆ Available, Reconciling, Running │
+│ ┆ ┆ ┆ coordinator-https https://172.18.0.2:32186 ┆ │
+├╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┤
+│ minio ┆ minio-console ┆ default ┆ http http://172.18.0.2:32276 ┆ │
+└──────────┴───────────────┴───────────┴───────────────────────────────────────────────┴─────────────────────────────────┘
 ----
 
 include::partial$instance-hint.adoc[]
@@ -89,8 +87,8 @@ include::partial$instance-hint.adoc[]
 
 === List Buckets
 
-The S3 provided by MinIO is used as persistent storage to store all the data used. Open the endpoint `console-http`
-retrieved by `stackablectl stacklet list` in your browser (http://172.18.0.3:31452 in this case).
+The S3 provided by MinIO is used as persistent storage to store all the data used. Open the endpoint `http`
+retrieved by `stackablectl stacklet list` in your browser (http://172.18.0.2:32276 in this case).
 
 image::spark-k8s-anomaly-detection-taxi-data/minio_0.png[]
 
@@ -107,16 +105,16 @@ Here, you can see the two buckets the S3 is split into:
 
 === Inspect raw data
 
-Click on the blue button `Browse` on the bucket `demo`.
+Click on the bucket `demo` and then on `ny-taxi-data` and `raw` respectively.
 
 image::spark-k8s-anomaly-detection-taxi-data/minio_3.png[]
 
-A folder (called prefixes in S3) contains a dataset of similarly structured data files. The data is partitioned by month
+This folder (called prefixes in S3) contains a dataset of similarly structured data files. The data is partitioned by month
 and contains several hundred MBs, which may seem small for a dataset. Still, the model is a time-series model where the
 data has decreasing relevance the "older" it is, especially when the data is subject to multiple external factors, many
 of which are unknown and fluctuating in scope and effect.
 
-The second bucket prediction contains the output from the model scoring process:
+The second bucket prediction contains the output from the model scoring process under `prediction/anomaly-detection/iforest/data`:
 
 image::spark-k8s-anomaly-detection-taxi-data/minio_4.png[]
 
@@ -147,7 +145,9 @@ image::spark-k8s-anomaly-detection-taxi-data/spark_job.png[]
 
 == Dashboard
 
-The anomaly detection dashboard is pre-defined and accessible under `Dashboards` when logged in to Superset:
+Open the `external-http` Superset endpoint found in the output of the `stackablectl stacklet list` command. The anomaly detection
+dashboard is pre-defined and accessible under the `Dashboards` tab when logged in to Superset using the username `admin`
+password `adminadmin`:
 
 image::spark-k8s-anomaly-detection-taxi-data/superset_anomaly_scores.png[]
 
