[docs][serve][llm] added touch ups (ray-project#58406)

kouroshHakha · YoussefEssDS · commit f41c84d5f81e · 2025-11-07T19:03:05.000-05:00
Signed-off-by: Kourosh Hakhamaneshi &lt;kourosh@anyscale.com&gt;
diff --git a/doc/source/serve/llm/architecture/serving-patterns/prefill-decode.md b/doc/source/serve/llm/architecture/serving-patterns/prefill-decode.md
@@ -1,7 +1,7 @@
 (serve-llm-architecture-prefill-decode)=
 # Prefill-decode disaggregation
 
-Prefill-decode (PD) disaggregation is a serving pattern that separates the prefill phase (processing input prompts) from the decode phase (generating tokens). This pattern optimizes resource utilization by scaling each phase independently based on its specific requirements.
+Prefill-decode (PD) disaggregation is a serving pattern that separates the prefill phase (processing input prompts) from the decode phase (generating tokens). This pattern was first pioneered in [DistServe](https://hao-ai-lab.github.io/blogs/distserve/) and optimizes resource utilization by scaling each phase independently based on its specific requirements.
 
 ## Architecture overview