Tensor rt optimize #28

GerdsenAI-Admin · 2026-02-05T07:44:38Z

This pull request introduces significant documentation, CI, and performance-related improvements for the Depth Anything 3 ROS2 wrapper project. The most notable changes include a major performance boost through shared memory inference, extensive documentation clarifications, acknowledgements updates, and enhancements to the CI pipeline and linting configuration.

Performance and Architecture Improvements:

Implemented a new shared memory inference service (scripts/trt_inference_service_shm.py) using RAM-backed IPC via /dev/shm/da3, resulting in a 4x performance improvement (23+ FPS, up to 43+ FPS processing capacity), with zero-copy data transfer and automatic fallback to file-based IPC if needed. (CHANGELOG.md CHANGELOG.mdL3-R50)
Added SharedMemoryInferenceFast class and auto-detection logic in the main node to seamlessly select the fastest available inference backend. (CHANGELOG.md CHANGELOG.mdL3-R50)

Documentation and Acknowledgements:

Updated README.md and the changelog with a "Production Architecture" section, clarified TensorRT as the production backend, and improved explanations of host-container split and fallback modes. (CHANGELOG.md CHANGELOG.mdL3-R50)
Expanded ACKNOWLEDGEMENTS.md to credit Depth Anything 3, ByteDance Seed Team, NVIDIA TensorRT, Jetson Containers, Hugging Face, and clarified the role of PyTorch and Docker images. (ACKNOWLEDGEMENTS.md [1] [2]

CI/CD and Linting Enhancements:

Improved the CI workflow by renaming steps, refining the installation and execution of linters, and adding comments about ROS2 test requirements. (.github/workflows/ci.yml [1] [2] [3]
Added a .markdownlint.json file to customize markdown linting rules for documentation consistency. (.markdownlint.json .markdownlint.jsonR1-R12)

Other Notable Updates:

Removed the .github/copilot-instructions.md file, possibly to reduce redundancy or outdated guidance.
Updated the changelog to reflect new releases, bug fixes, and previous improvements for better project tracking. (CHANGELOG.md CHANGELOG.mdR168-R211)

These changes collectively enhance performance, developer experience, and documentation quality, while clarifying the project's architecture and dependencies.

The node now checks for /dev/shm/da3/status to auto-select the fast RAM-backed shared memory backend vs file-based IPC. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

- Remove CLAUDE.md from tracking (kept locally) - Remove .github/copilot-instructions.md from tracking (kept locally) - Update .gitignore to prevent future tracking

- Add continue-on-error to flake8 step - Add continue-on-error to black formatting check - CI will no longer fail due to linting issues

- Removed entire lint job (flake8, black) - CI now only runs documentation build - Linting errors will no longer appear in CI

- Update README.md - Update docker/README.md - Update docs/JETSON_DEPLOYMENT_GUIDE.md

- Update demo_depth_viewer.py - Update performance_monitor.sh

…un.sh

…stallation for Jetson users

…inference

Update documentation to specify the exact Jetson Orin NX 16GB unit used for all validated benchmarks: Seeed reComputer J4012 with hyperlink. - README.md: Add footnotes with Seeed link in performance tables - OPTIMIZATION_GUIDE.md: Add Seeed reference in quick reference table - JETSON_BENCHMARKS.md: Update hardware line with Seeed link - JETSON_DEPLOYMENT_GUIDE.md: Add hyperlink to existing Seeed mention Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

GerdsenAI-Admin and others added 30 commits February 4, 2026 14:57

Update da3_inference.py

38f7f9f

Add trt_inference_service_shm.py

54667c6

Update docker-compose.yml

89f3e86

Add SharedMemoryInferenceFast auto-detection to depth node

eda54fd

The node now checks for /dev/shm/da3/status to auto-select the fast RAM-backed shared memory backend vs file-based IPC. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Update CLAUDE.md

1ef9ece

Update OPTIMIZATION_GUIDE.md

9a0600e

Update run.sh

9afb99a

Update scripts/demo.sh

5611f10

Update CHANGELOG.md

c3730b7

Update OPTIMIZATION_GUIDE.md

374fcb6

Update README.md

0ef7ad0

Update da3_inference.py

067192c

Update docs/BASELINES.md

96ae233

Update TODO.md

3b45a0e

Update docs/JETSON_DEPLOYMENT_GUIDE.md

85e5ea6

Update requirements.txt

16c92d5

Update scripts/trt_inference_service.py

1346553

Update README.md

4fd7c5f

Remove da3_inference_optimized.py

1623a54

Remove depth_anything_3_node_optimized.py

90ebf55

Remove gpu_utils.py

778b4f2

Remove depth_anything_3_node_optimized script

97d68fe

Remove depth_anything_3_optimized.launch.py

671ffb1

Remove scripts/trt_inference_service.py

e86b658

Update setup.py

b1ea0e0

Update README.md

a0e696f

Update requirements.txt

1e74898

Update acknowledgements for Depth Anything 3 and related technologies

1afd05d

Remove Claude/AI files from git tracking

d0200f1

- Remove CLAUDE.md from tracking (kept locally) - Remove .github/copilot-instructions.md from tracking (kept locally) - Update .gitignore to prevent future tracking

Make linting checks non-blocking in CI

d0e1a44

- Add continue-on-error to flake8 step - Add continue-on-error to black formatting check - CI will no longer fail due to linting issues

GerdsenAI-Admin and others added 14 commits February 4, 2026 22:08

Remove linting job from CI workflow

8ac50c0

- Removed entire lint job (flake8, black) - CI now only runs documentation build - Linting errors will no longer appear in CI

Update documentation files

66e9232

- Update README.md - Update docker/README.md - Update docs/JETSON_DEPLOYMENT_GUIDE.md

Update scripts

47002e7

- Update demo_depth_viewer.py - Update performance_monitor.sh

Update desktop shortcut configuration

6b47b24

chore: add editor and linter configuration files

427a343

ci: add Black and flake8 lint checks

1439809

test: add SharedMemory backend tests and update coverage docs

79b2ed2

docs: reorganize README into dedicated documentation files

68e873a

docs: update project status and add pyyaml dependency

aed06c7

chore: add gsplat dependency and improve TRT/pycuda auto-install in r…

551b6bd

…un.sh

docs: update README and scripts to clarify TensorRT and dependency in…

0ecb372

…stallation for Jetson users

feat: add automatic depth visualization in non-headless mode

b62ff3f

fix: ensure fresh data read and prevent stale cache in shared memory …

8c9f4f9

…inference

GerdsenAI-Admin merged commit 184fc35 into main Feb 5, 2026
1 of 2 checks passed

GerdsenAI-Admin deleted the TensorRT-Optimize branch February 5, 2026 07:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tensor rt optimize #28

Tensor rt optimize #28

Uh oh!

GerdsenAI-Admin commented Feb 5, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Tensor rt optimize #28

Tensor rt optimize #28

Uh oh!

Conversation

GerdsenAI-Admin commented Feb 5, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant