feat(cache): add LLM metadata caching for model and provider information #1456

Pouyanpi · 2025-10-17T10:22:01Z

Extends the cache system to store and restore LLM metadata (model name and provider name) alongside cache entries. This allows cached results to maintain provenance information about which model and provider generated the original response.

Changes

- Added LLMMetadataDict and LLMCacheData TypedDict definitions for type safety
- Extended CacheEntry to include optional llm_metadata field
- Implemented extract_llm_metadata_for_cache() to capture model and provider info from context
- Implemented restore_llm_metadata_from_cache() to restore metadata when retrieving cached results
- Updated get_from_cache_and_restore_stats() to handle metadata extraction and restoration
- Added comprehensive test coverage for metadata caching functionality
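
To make the extract/restore round trip concrete, here is a minimal, self-contained sketch. It reuses the names listed above, but the field names (`model_name`, `provider_name`), the plain-dict context, and the function signatures are assumptions made for illustration; the actual code in nemoguardrails/llm/cache/utils.py may differ.

```python
from typing import Any, Dict, Optional, TypedDict


class LLMMetadataDict(TypedDict):
    # Field names are assumed for illustration, not taken from the PR diff.
    model_name: str
    provider_name: str


class CacheEntry(TypedDict, total=False):
    # Hypothetical, simplified entry: a cached result plus optional provenance.
    result: Any
    llm_metadata: Optional[LLMMetadataDict]


def extract_llm_metadata_for_cache(context: Dict[str, Any]) -> Optional[LLMMetadataDict]:
    """Capture model/provider info from a context dict (assumed shape)."""
    model = context.get("model_name")
    provider = context.get("provider_name")
    if model is None and provider is None:
        return None  # nothing to record for this entry
    return {
        "model_name": model or "unknown",
        "provider_name": provider or "unknown",
    }


def restore_llm_metadata_from_cache(entry: CacheEntry, context: Dict[str, Any]) -> None:
    """Copy cached provenance back into the context on a cache hit."""
    metadata = entry.get("llm_metadata")
    if metadata:
        context["model_name"] = metadata["model_name"]
        context["provider_name"] = metadata["provider_name"]


# Usage: store on a cache miss, restore on a later hit.
cache: Dict[str, CacheEntry] = {}
original_context = {"model_name": "example-model", "provider_name": "example-provider"}
cache["some-key"] = {
    "result": {"allowed": True},
    "llm_metadata": extract_llm_metadata_for_cache(original_context),
}

hit_context: Dict[str, Any] = {}
restore_llm_metadata_from_cache(cache["some-key"], hit_context)
```

Keeping `llm_metadata` optional means entries written before this change, or written without model information in the context, can still be read back without errors.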

Dependencies

Depends on: PR style(cache): replace pass with ellipsis in abstract methods #1455

Part of Stack

This is PR 2/5 in the NeMoGuards caching feature stack.

✅ PR 1: style(cache): replace pass with ellipsis in abstract methods #1455
✅ PR 2: LLM metadata caching (this PR)
⬜ PR 3: feat(cache): add caching support for topic safety and content safety output checks #1457
⬜ PR 4: feat(cache): add caching support for jailbreak detection #1458
⬜ PR 5: docs(examples): add nemoguards cache configuration example #1459

codecov-commenter · 2025-10-17T10:25:04Z

Codecov Report

✅ All modified and coverable lines are covered by tests.


tgasser-nv

Looks good; just a few cleanup nits in the tests to address before merging.

nemoguardrails/llm/cache/utils.py

tests/test_cache_utils.py


This was referenced Oct 17, 2025

style(cache): replace pass with ellipsis in abstract methods #1455

Merged

feat(cache): add caching support for topic safety and content safety output checks #1457

Merged

This was referenced Oct 17, 2025

feat(cache): add caching support for jailbreak detection #1458

Merged

docs(examples): add nemoguards cache configuration example #1459

Merged

Pouyanpi force-pushed the feat/cache-interface-cleanup branch from 053cd1c to cb827ae October 17, 2025 10:38

Pouyanpi force-pushed the feat/cache-llm-metadata branch from b05cac4 to e725d77 October 17, 2025 10:39

Pouyanpi added this to the v0.18.0 milestone Oct 17, 2025

Pouyanpi self-assigned this Oct 17, 2025

Base automatically changed from feat/cache-interface-cleanup to develop October 17, 2025 14:47

tgasser-nv approved these changes Oct 17, 2025


Pouyanpi added 2 commits October 19, 2025 11:52

address review comments: add _s suffix to durations, add test fixture, refactor LLMCallInfo instantiation

fd873b7

Pouyanpi force-pushed the feat/cache-llm-metadata branch from e725d77 to fd873b7 October 19, 2025 10:00

Pouyanpi merged commit 32d57f5 into develop Oct 19, 2025
7 checks passed

Pouyanpi deleted the feat/cache-llm-metadata branch October 19, 2025 10:08
