Some text updates, new reference (#236)

SeldonIO · May 25, 2022 · 25246b7
1 parent 3e48d1b
Showing 1 changed file with 10 additions and 9 deletions.
diff --git a/docs/source/contents/architecture/index.md b/docs/source/contents/architecture/index.md
@@ -10,36 +10,37 @@ The core components are:
  * Pipeline gateway : handles REST/gRPC calls to pipelines.
  * Dataflow engine : handles the flow of data between components in a pipeline.
  * Model gateway : handles the flow of data from models to inference requests on servers and passes on the responses.
- * Agent : manages the loading and unloading of models on a Server and access to the server over REST/gRPC.
+ * Agent : manages the loading and unloading of models on a server and access to the server over REST/gRPC.
  * Envoy : manages the proxying of requests to the correct servers including load balancing.
 
 All the above are Kubernetes agnostic and can run locally, e.g. on Docker Compose.
 
 We also provide a Kubernetes Operator to allow Kubernetes usage.
 
-Kafka is used as the backbone for Pipelines allowing a decentralized, syncronous and asynchronous usage.
+Kafka is used as the backbone for Pipelines allowing a decentralized, synchronous and asynchronous usage.
 
 ## Kafka
 
-Kafka is used as the backbone for allowing Pipelines of Models to be connected together into arbtrary directed acyclic graphs. Models can be reused in different Pipelines. The flow of data between models is handled by the datafloe engine using [KStreams](https://docs.confluent.io/platform/current/streams/concepts.html).
+Kafka is used as the backbone for allowing Pipelines of Models to be connected together into arbitrary directed acyclic graphs. Models can be reused in different Pipelines. The flow of data between models is handled by the dataflow engine using [KStreams](https://docs.confluent.io/platform/current/streams/concepts.html).
 
 ![kafka](kafka.png)
 
 ## Dataflow Architecture
 
-Seldon V2 follows a dataflow architecture and its part of the current movement for data centric machine learning. By taking a decentralized route that focuses on the flow of data users can have more flexibility and insight in building complex applications containing machine learning and traditional components. This contrasts with a more centralized orchestration more traditional in service orientated architectures.
+Seldon V2 follows a dataflow design paradigm and it's part of the current movement for data centric machine learning. By taking a decentralized route that focuses on the flow of data users can have more flexibility and insight in building complex applications containing machine learning and traditional components. This contrasts with a more centralized orchestration more traditional in service orientated architectures.
 
 ![dataflow](dataflow.png)
 
 By focusing on the data we allow users to join various flows together using stream joining concepts as shown below.
 
 ![joins](joins.png)
 
-We support inner joins where all inputs need to be present for a transaction to join the tensors passed through the Pipeline; outer joins where only a subset need to be available during the join window as well as triggers in which data flows need to wait until one or more trigger data flows appear. The data in these triggers is not passed onwards from the join.
+We support several types of joins:
+ * _inner joins_, where all inputs need to be present for a transaction to join the tensors passed through the Pipeline;
+ * _outer joins_, where only a subset needs to be available during the join window
+ * _triggers_, in which data flows need to wait until records on one or more trigger data flows appear. The data in these triggers is not passed onwards from the join.
 
-We allowing these techniques complex pipeline flows of data between machine learning components can be created.
+These techniques allow users to create complex pipeline flows of data between machine learning components.
 
-More discussion on the data flow view of machine learning can be found in a paper by [Paleyes et al](https://arxiv.org/abs/2108.04105).
-
-.
+More discussion on the data flow view of machine learning can be found in a paper by [Paleyes et al](https://arxiv.org/abs/2204.12781).
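
Note on the join semantics described in this change: the three join types can be illustrated with a minimal sketch. This is not Seldon's implementation or API. The function `join_step`, its parameters, and the dictionary-based "join window" are hypothetical names introduced only for illustration, and the sketch assumes that every listed trigger must have arrived before a step fires.

```python
from typing import Any, Dict, Optional, Sequence


def join_step(
    arrived: Dict[str, Any],
    inputs: Sequence[str],
    triggers: Sequence[str] = (),
    join_type: str = "inner",
) -> Optional[Dict[str, Any]]:
    """Return the joined payload for one join window, or None if the step does not fire.

    `arrived` maps a data flow name to the record seen within the window (hypothetical model).
    """
    # Triggers gate the join (assumption: all of them must be present).
    if any(name not in arrived for name in triggers):
        return None

    # Only records from `inputs` are forwarded; trigger data is dropped.
    present = {name: arrived[name] for name in inputs if name in arrived}

    if join_type == "inner":
        # Inner join: every input must be present.
        return present if len(present) == len(inputs) else None
    if join_type == "outer":
        # Outer join: any non-empty subset of inputs is enough.
        return present if present else None
    raise ValueError(f"unknown join type: {join_type}")
```

For example, `join_step({"a": [1]}, ["a", "b"])` does not fire under an inner join, while the same call with `join_type="outer"` forwards only the record from `a`.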