docs: change docs to default port 8000 #2876

PeaBrane · 2025-09-04T18:56:34Z

Overview:

As titled; a follow-up to #2853.

Summary by CodeRabbit

Documentation
- Updated default HTTP port from 8080 to 8000 across READMEs, guides, deployment docs, and examples.
- Adjusted curl commands, server invocation examples, and router quick start to use :8000.
- Refreshed diagrams and metrics docs to reflect :8000 endpoints and /metrics scrape path.
- Aligned multimodal, planner benchmark, system metrics, and tests docs with the new port.
- No functional or API changes.

Signed-off-by: PeaBrane <yanrpei@gmail.com>

coderabbitai · 2025-09-04T19:01:56Z

Walkthrough

Documentation-only updates change the example/default HTTP port for the OpenAI-compatible frontend and related references from 8080 to 8000 across READMEs, guides, diagrams, and curl examples. No code, APIs, or behavior modified.

Changes

| Cohort | File(s) | Summary of changes |
|---|---|---|
| Root and Frontend READMEs | `README.md`, `components/frontend/README.md` | Updated example frontend HTTP port from 8080 to 8000 in commands and curl examples. |
| Backends (Mocker and vLLM docs) | `components/backends/mocker/README.md`, `components/backends/vllm/deepseek-r1.md`, `components/backends/vllm/deploy/README.md` | Switched example/test request ports from 8080 to 8000 in documentation code blocks. |
| Router docs | `docs/components/router/README.md` | Changed quick-start `--http-port` from 8080 to 8000 and updated descriptive text. |
| Deployment/Metrics docs | `deploy/metrics/README.md`, `docs/guides/metrics.md` | Updated topology/mermaid references and scrape paths from `:8080` to `:8000` and `/metrics` examples accordingly. |
| Architecture doc | `docs/architecture/dynamo_flow.md` | Adjusted text and diagram annotations from port 8080 to 8000. |
| Guides (Deploy/Run) | `docs/guides/dynamo_deploy/create_deployment.md`, `docs/guides/dynamo_run.md` | Updated frontend launch and curl examples to use port 8000. |
| Guides (Planner Benchmark) | `docs/guides/planner_benchmark/README.md` | Changed service endpoint URLs from `http://localhost:8080` to `http://localhost:8000` in two profiles. |
| Examples (Multimodal) | `examples/multimodal/README.md` | Replaced all client curl example ports from 8080 to 8000 across modalities. |
| Runtime Example (System Metrics) | `lib/runtime/examples/system_metrics/README.md` | Updated frontend port to 8000 in run and metrics queries; kept system metrics server at 8081. |
| Tests (LMCache docs) | `tests/lmcache/README.md` | Changed diagrams and curl example from 8080 to 8000. |
| Support Matrix | `docs/support_matrix.md` | Updated referenced frontend port from 8080 to 8000 in a GPU support note. |

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~10 minutes

Possibly related PRs

chore: Change vllm K8s from dynamo-run to python -m dynamo.frontend #2055 — Changes Kubernetes deployment args/YAMLs to use port 8000; complements this docs-only port update.
docs: update router docs #2148 — Edits docs/components/router/README.md, overlapping with this PR’s router README port change.
feat: FT enable DCGM and optional Prometheus and Grafana, plus fixes #1488 — Updates metrics topology and scrape targets; related to this PR’s metrics docs port adjustments.

Poem

I nudge my whiskers at port eight-thousand’s door,
Hop-hop! No more eight-oh-eight-oh lore.
Curl sings a simpler, steady tune,
Frontends align beneath the moon.
Tiny switch, tidy burrow—neat!
Requests now land where carrots meet. 🥕✨

coderabbitai

Actionable comments posted: 3

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (2)

tests/lmcache/README.md (1)
82-93: Fix invalid JSON in curl payload (unquoted model).

The value of "model" must be a JSON string; the current example will fail.
-    "model": Qwen/Qwen3-0.6B,
+    "model": "Qwen/Qwen3-0.6B",
Port update to 8000 is correct here.
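The unquoted-model problem above is easy to confirm by running the payload through a JSON parser. The following is a minimal sketch using Python's standard `json` module; the two payload strings are trimmed, hypothetical stand-ins for the README example (only the `"model"` field is taken from the review), not the exact request body.

```python
import json

# Trimmed, hypothetical stand-ins for the README payload; only "model" comes from the review.
BROKEN = '{"model": Qwen/Qwen3-0.6B, "max_tokens": 30}'
FIXED = '{"model": "Qwen/Qwen3-0.6B", "max_tokens": 30}'


def is_valid_json(text: str) -> bool:
    """Return True if `text` parses as JSON, False otherwise."""
    try:
        json.loads(text)
    except json.JSONDecodeError:
        return False
    return True


if __name__ == "__main__":
    print("unquoted model parses:", is_valid_json(BROKEN))
    print("quoted model parses:", is_valid_json(FIXED))
```

A server that parses the request body strictly would be expected to reject the unquoted form in the same way, which is why the quoted form is the one to document.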
docs/architecture/dynamo_flow.md (1)

18-26: Update all remaining HTTP port references from 8080 to 8000.
Occurrences found in:

launch/dynamo-run/src/main.rs (USAGE: default --http-port 8080)

tests/lmcache/* scripts (assume localhost:8080)

docs/_includes/quick_start_local.rst (curl localhost:8080/v1/chat/completions)

docs/guides/dynamo_deploy/installation_guide.md (modelExpressURL=http://...:8080)

deploy/cloud/operator (cmd flags, manager_auth_proxy_patch.yaml, internal tests)

deploy/cloud/helm/platform (values.yaml, operator component, README)

deploy/metrics/prometheus.yml (frontend scrape targets, demo instructions)

container/launch_message.txt (python -m dynamo.frontend [--http-port 8080])
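A repo-wide scan like the one behind the list above can be approximated with a short script. This is a sketch under assumptions, not the tool the reviewer ran: the function name `find_port_refs`, the file-suffix filter, and the regular expression are all illustrative choices.

```python
import re
from pathlib import Path

# Match 8080 only as a standalone number, so 18080 or 80800 are not reported.
PORT_RE = re.compile(r"(?<!\d)8080(?!\d)")


def find_port_refs(root, suffixes=(".md", ".rst", ".yaml", ".yml", ".txt", ".rs")):
    """Yield (path, line_number, line) for each line under `root` that mentions 8080."""
    for path in sorted(Path(root).rglob("*")):
        if not path.is_file() or path.suffix not in suffixes:
            continue
        try:
            text = path.read_text(encoding="utf-8")
        except (UnicodeDecodeError, OSError):
            continue  # skip binary or unreadable files
        for lineno, line in enumerate(text.splitlines(), start=1):
            if PORT_RE.search(line):
                yield path, lineno, line.strip()


if __name__ == "__main__":
    for path, lineno, line in find_port_refs("."):
        print(f"{path}:{lineno}: {line}")
```

Hits are candidates for review rather than guaranteed errors: a later commit in this PR reverts dynamo-run back to 8080, so some occurrences are intentional.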

🧹 Nitpick comments (13)

docs/guides/dynamo_deploy/create_deployment.md (1)
91-91: Use consistent capitalization and command style for the Frontend launch.

Earlier in the doc you use “Frontend” (one word) and python -m .... Suggest aligning here and formatting as code.
-The front end is launched with "python3 -m dynamo.frontend [--http-port 8000] [--router-mode kv]"
+The Frontend is launched with:
+
+`python -m dynamo.frontend [--http-port 8000] [--router-mode kv]`
components/backends/vllm/deepseek-r1.md (2)
29-41: Prefer explicit scheme in curl URL.

Using http:// is clearer and copy-paste friendly across environments.
-curl localhost:8000/v1/chat/completions \
+curl http://localhost:8000/v1/chat/completions \
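One way to see why the explicit scheme is more robust: curl guesses HTTP when the scheme is missing, but generic URL parsers do not necessarily do the same. The sketch below uses Python's standard `urllib.parse` purely as an illustration of that ambiguity; it makes no claim about how curl parses URLs internally.

```python
from urllib.parse import urlsplit

bare = urlsplit("localhost:8000/v1/chat/completions")
explicit = urlsplit("http://localhost:8000/v1/chat/completions")

# Without a scheme, the text before the first colon is read as the scheme
# and no host is recovered at all.
print("bare:    ", repr(bare.scheme), repr(bare.netloc), repr(bare.path))

# With http://, host, port, and path are split as intended.
print("explicit:", repr(explicit.scheme), explicit.hostname, explicit.port, explicit.path)
```

Readers who paste the documented URL into scripts or HTTP client libraries therefore get consistent behavior only from the `http://` form.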
34-37: Clean up typos in the sample prompt.

Minor misspellings (“ests”, “familt”) in the example text; fixing improves polish.
docs/support_matrix.md (1)
88-88: Show explicit Docker port mappings to avoid confusion.

Adding a concrete mapping example helps users replace --network host safely.
-... by mapping only the necessary ports (e.g., 4222 for nats, 2379/2380 for etcd, 8000 for frontend).
+... by mapping only the necessary ports (e.g., `-p 4222:4222 -p 2379:2379 -p 2380:2380 -p 8000:8000`).
docs/guides/dynamo_run.md (2)
75-76: Use explicit scheme in curl.
-curl localhost:8000/v1/models
+curl http://localhost:8000/v1/models
80-81: Use explicit scheme in curl.
-... localhost:8000/v1/chat/completions
+... http://localhost:8000/v1/chat/completions
lib/runtime/examples/system_metrics/README.md (1)
188-206: Optional: add a one-liner to verify both endpoints are up.

Small usability boost for readers.
 Then make curl requests to the frontend (see the [main README](../../../../README.md))
+
+Quick checks:
+`curl -sSf http://localhost:8000/metrics >/dev/null && echo "Frontend metrics OK"`
+`curl -sSf http://localhost:8081/metrics >/dev/null && echo "System metrics OK"`
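The same two checks can be scripted when curl is not available. This is a minimal sketch using only the Python standard library; the helper name `endpoint_ok` and the 2-second timeout are assumptions, while the ports (8000 for the frontend, 8081 for system metrics) are the ones named in the README.

```python
import http.client
import urllib.error
import urllib.request


def endpoint_ok(url: str, timeout: float = 2.0) -> bool:
    """Return True if a GET to `url` succeeds with a 2xx status, else False."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return 200 <= resp.status < 300
    except (urllib.error.URLError, http.client.HTTPException, OSError, ValueError):
        # Covers refused connections, timeouts, 4xx/5xx responses, and malformed URLs.
        return False


if __name__ == "__main__":
    checks = {
        "Frontend metrics": "http://localhost:8000/metrics",
        "System metrics": "http://localhost:8081/metrics",
    }
    for name, url in checks.items():
        print(f"{name}: {'OK' if endpoint_ok(url) else 'unreachable'}")
```

Like `curl -f`, the helper treats HTTP error statuses as failures, so a running server that returns 404 for `/metrics` is reported as unreachable.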
tests/lmcache/README.md (2)
63-67: Add a language hint to fenced block.

Specify a language to satisfy MD040 and improve rendering of the ASCII diagram.
-```
+```text
 HTTP Request → Dynamo Ingress(8000) → Dynamo Worker → Direct Inference
 Environment: ENABLE_LMCACHE=0
70-76: Add a language hint to fenced block.

Ditto for the LMCache diagram.
-```
+```text
 HTTP Request → Dynamo Ingress(8000) → Dynamo Worker → LMCache-enabled Inference
 Environment: ENABLE_LMCACHE=1
             LMCACHE_CHUNK_SIZE=256
             LMCACHE_LOCAL_CPU=True
             LMCACHE_MAX_LOCAL_CPU_SIZE=1.0
components/backends/vllm/deploy/README.md (1)
205-208: Minor typos in example prompt text.

Non-blocking, but consider fixing to avoid distracting readers.
- ... hinting at ests that Aeloria holds a secret ...
+ ... hinting at suggests that Aeloria holds a secret ...
- ... a search for lost familt clue is hidden.
+ ... a search for lost family; the clue is hidden.
docs/components/router/README.md (1)
22-22: Minor phrasing nit.

Consider clarifying “configurable via --http-port” to be explicit.
-- Exposes the service on port 8000 (configurable)
+- Exposes the service on port 8000 (configurable via --http-port)
docs/guides/planner_benchmark/README.md (1)
79-79: Second genai-perf example also on :8000 — good.

Optional consistency: either include --service-kind openai in both examples or neither.
-genai-perf profile \
+genai-perf profile \
     --tokenizer deepseek-ai/DeepSeek-R1-Distill-Llama-8B \
     -m deepseek-ai/DeepSeek-R1-Distill-Llama-8B \
-    --endpoint-type chat \
+    --service-kind openai \
+    --endpoint-type chat \
     --url http://localhost:8000 \
     --streaming \
README.md (1)
133-133: Nit: add scheme for clarity and curl robustness.

Explicit scheme avoids proxy/env surprises.
-curl localhost:8000/v1/chat/completions   -H "Content-Type: application/json"   -d '{
+curl http://localhost:8000/v1/chat/completions -H "Content-Type: application/json" -d '{
Optional: add an HTTPS example above when TLS flags are used:
curl https://localhost:8000/v1/chat/completions -k -H "Content-Type: application/json" -d '{ ... }'

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 4581fcc and 30e26fc.

📒 Files selected for processing (16)

README.md (2 hunks)
components/backends/mocker/README.md (1 hunks)
components/backends/vllm/deepseek-r1.md (1 hunks)
components/backends/vllm/deploy/README.md (1 hunks)
components/frontend/README.md (1 hunks)
deploy/metrics/README.md (1 hunks)
docs/architecture/dynamo_flow.md (2 hunks)
docs/components/router/README.md (1 hunks)
docs/guides/dynamo_deploy/create_deployment.md (1 hunks)
docs/guides/dynamo_run.md (1 hunks)
docs/guides/metrics.md (1 hunks)
docs/guides/planner_benchmark/README.md (2 hunks)
docs/support_matrix.md (1 hunks)
examples/multimodal/README.md (6 hunks)
lib/runtime/examples/system_metrics/README.md (2 hunks)
tests/lmcache/README.md (2 hunks)

🧰 Additional context used

🧠 Learnings (1)

📓 Common learnings

Learnt from: PeaBrane
PR: ai-dynamo/dynamo#2756
File: lib/llm/src/kv_router/subscriber.rs:36-44
Timestamp: 2025-08-29T10:03:48.330Z
Learning: PeaBrane prefers to keep PRs contained in scope and is willing to defer technical improvements to future PRs when the current implementation works for the immediate use case. They acknowledge technical debt but prioritize deliverability over completeness in individual PRs.

🪛 markdownlint-cli2 (0.17.2)

components/backends/mocker/README.md

40-40: Unordered list style
Expected: dash; Actual: asterisk

(MD004, ul-style)

40-40: Unordered list indentation
Expected: 0; Actual: 2

(MD007, ul-indent)

tests/lmcache/README.md

70-70: Fenced code blocks should have a language specified

(MD040, fenced-code-language)

🪛 LanguageTool

docs/components/router/README.md

[grammar] ~20-~20: There might be a mistake here.
Context: ...e kv --http-port 8000 ``` This command: - Launches the Dynamo frontend service wit...

(QB_NEW_EN)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: Build and Test - dynamo

🔇 Additional comments (15)

docs/guides/dynamo_deploy/create_deployment.md (1)

91-93: Verify default HTTP port in dynamo.frontend CLI (docs/guides/dynamo_deploy/create_deployment.md:91–93)
Cannot locate the --http-port default in the dynamo.frontend implementation; please confirm it remains set to 8000 to keep the docs accurate.

lib/runtime/examples/system_metrics/README.md (2)

188-193: LGTM: frontend default port note matches the PR change.

205-206: LGTM: metrics example updated to port 8000.

components/backends/mocker/README.md (1)

39-41: Port change to 8000 looks correct.

The frontend start command now matches the new default port.

examples/multimodal/README.md (1)

72-101: Port updates to 8000 across all client examples look good.

All curl samples consistently target localhost:8000.

Also applies to: 148-174, 224-251, 296-323, 368-393, 456-482

components/frontend/README.md (1)

3-3: Usage line updated to port 8000 — looks good.

components/backends/vllm/deploy/README.md (2)

165-168: Port-forward example updated to 8000 — looks good.

199-213: Test request now targets port 8000 — looks good.

docs/architecture/dynamo_flow.md (2)

26-26: Port updated to 8000 — consistent with PR intent.

Reads clearly and matches the new default.

87-87: Diagram label updated to 8000 — looks good.

Mermaid node text aligns with the prose change.

deploy/metrics/README.md (2)

26-26: Metrics scrape target moved to :8000 — correct.

Topology now reflects FE on 8000 with /metrics.

119-127: Ensure prometheus.yml matches the new FE port.

If prometheus.yml still targets :8080/metrics, scrapes will fail.

Use the repo-wide scan provided in docs/architecture/dynamo_flow.md comment to confirm prometheus.yml is aligned.

docs/components/router/README.md (1)

17-18: Quick start command switched to 8000 — OK.

Command is copy-pastable and consistent.

docs/guides/planner_benchmark/README.md (1)

49-49: URL updated to localhost:8000 — good.

docs/guides/metrics.md (1)

82-82: Diagram scrape edge switched to :8000/metrics — aligned with deploy docs.

indrajit96

LGTM!

first commit

30e26fc

Signed-off-by: PeaBrane <yanrpei@gmail.com>

PeaBrane requested review from atchernych, biswapanda, hhzhang16, hutm, indrajit96, ishandhanani, julienmancuso, krishung5, mohammedabdulwahhab, nnshah1 and whoisj as code owners September 4, 2025 18:56

pull-request-size bot added the size/M label Sep 4, 2025

github-actions bot added the docs label Sep 4, 2025

coderabbitai bot reviewed Sep 4, 2025

components/frontend/README.md (resolved)

docs/guides/dynamo_deploy/create_deployment.md (resolved)

README.md (resolved)

more fixes

fa2764b

Signed-off-by: PeaBrane <yanrpei@gmail.com>

PeaBrane requested review from a team, GuanLuo, alec-flowers, grahamking, kkranen, paulhendricks, piotrm-nvidia, ptarasiewiczNV, rmccorm4, ryanolson, tanmayv25, tedzhouhk and tmonty12 as code owners September 4, 2025 19:09

copy-pr-bot bot temporarily deployed to GITLAB September 4, 2025 19:09 Inactive

copy-pr-bot bot temporarily deployed to GITLAB September 4, 2025 19:10 Inactive

revert dynamo-run back to 8080

938f87e

Signed-off-by: PeaBrane <yanrpei@gmail.com>

grahamking approved these changes Sep 4, 2025

julienmancuso approved these changes Sep 5, 2025

nnshah1 approved these changes Sep 5, 2025

indrajit96 approved these changes Sep 5, 2025

biswapanda approved these changes Sep 5, 2025

more 8080 -> 8000

2ef3a51

Signed-off-by: PeaBrane <yanrpei@gmail.com>

PeaBrane requested a review from richardhuo-nv as a code owner September 5, 2025 00:43

copy-pr-bot bot temporarily deployed to GITLAB September 5, 2025 00:43 Inactive

PeaBrane enabled auto-merge (squash) September 5, 2025 00:43

copy-pr-bot bot temporarily deployed to GITLAB September 5, 2025 00:43 Inactive

PeaBrane merged commit 1995ef9 into main Sep 5, 2025
13 of 15 checks passed

PeaBrane deleted the rupei/doc-fixes-8000 branch September 5, 2025 01:16

dillon-cullinan pushed a commit that referenced this pull request Sep 5, 2025

docs: change docs to default port 8000 (#2876)

b60a7eb

Signed-off-by: PeaBrane <yanrpei@gmail.com>

nnshah1 pushed a commit that referenced this pull request Sep 8, 2025

docs: change docs to default port 8000 (#2876)

4a6261a

Signed-off-by: PeaBrane <yanrpei@gmail.com> Signed-off-by: nnshah1 <neelays@nvidia.com>

7 participants