
feat: add GuardrailsEngine for llama index #1005

Closed · wants to merge 10 commits

Conversation

kaushikb11 commented Aug 11, 2024

Updated.

Implement GuardrailsEngine for LlamaIndex integration

This PR introduces the GuardrailsEngine class, which integrates Guardrails validation with LlamaIndex's query and chat engines. The GuardrailsEngine provides a unified interface for applying guardrails to both query and chat functionalities while maintaining compatibility with LlamaIndex's expected interfaces.

  1. Inherits from BaseQueryEngine for compatibility with LlamaIndex components.
  2. Supports both query and chat functionalities by wrapping either a BaseQueryEngine or a BaseChatEngine.

QueryEngine and ChatEngine Integration:

The GuardrailsEngine is designed to work seamlessly with both LlamaIndex's QueryEngine and ChatEngine.

This dual functionality allows users to apply guardrails to both one-off queries and multi-turn conversations, enhancing the safety and reliability of LLM responses in various use cases.

Example

from guardrails import Guard
from guardrails.hub import ToxicLanguage, CompetitorCheck
from guardrails.integrations.llama_index import GuardrailsEngine

from llama_index.core import Settings, SimpleDirectoryReader, VectorStoreIndex
from llama_index.embeddings.huggingface import HuggingFaceEmbedding

embed_model = HuggingFaceEmbedding(model_name="BAAI/bge-small-en-v1.5")
Settings.embed_model = embed_model

documents = SimpleDirectoryReader("data").load_data()
index = VectorStoreIndex.from_documents(documents)

guard = Guard().use(
    ToxicLanguage(on_fail="exception"),
    CompetitorCheck(competitors=["Apple", "Google", "Microsoft"], on_fail="exception")
)

# For query engine
query_engine = index.as_query_engine()
guardrails_engine = GuardrailsEngine(engine=query_engine, guard=guard)

try:
    # Query through the wrapped engine so the guard actually runs
    response = guardrails_engine.query("What are the main products of our company?")
    print(response)
except ValueError as e:
    print(f"Validation error: {str(e)}")
    
# For chat engine
chat_engine = index.as_chat_engine()
guardrails_engine = GuardrailsEngine(engine=chat_engine, guard=guard)

try:
    response = guardrails_engine.chat("Tell me about our product compared to Apple's.")
    print(response)
except ValueError as e:
    print(f"Validation error: {str(e)}")

zsimjee commented Aug 13, 2024

Interesting design! A few follow-ups:

  1. Why implement this as a query engine instead of something else?
  2. Guardrails already natively has support for input and output validation at the guard level. You can do Guard().use(Validator, on='messages') to run validation on input (see the sketch after this list).
  3. The flow here is interesting, and I wonder if you've thought of inverting it. I.e., in the current code, I assume something like this happens: input validation, then the LLM call through LlamaIndex (not Guardrails), and then output parsing. What if we passed a LlamaIndex module as the LLM call itself to an existing guard? What do we lose with that design?
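
For reference, a minimal sketch of point 2, reusing the ToxicLanguage validator from the PR description:

from guardrails import Guard
from guardrails.hub import ToxicLanguage

# Run the validator against the input messages, before any LLM call.
input_guard = Guard().use(
    ToxicLanguage(on_fail="exception"),
    on="messages",
)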

CalebCourier left a comment

I have similar questions as @zsimjee. With respect to the principle of least surprise, I would expect to see something like query_engine.query passed as the llm_api, with the existing forms of input and output validation used on a single Guard (a rough sketch follows below).

In addition to this, I have doubts about the behaviour of the current implementation, since the types don't line up. Could you add some unit and integration tests to assert the code is acting as expected?
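
For illustration, a rough sketch of that inverted shape; the exact Guard call signature varies across Guardrails versions, and the llamaindex_llm_api adapter here is hypothetical:

from guardrails import Guard
from guardrails.hub import ToxicLanguage

guard = Guard().use(ToxicLanguage(on_fail="exception"))

# Thin adapter: guard() expects a callable that returns a string, while
# query_engine.query() returns a LlamaIndex Response object.
def llamaindex_llm_api(prompt: str, **kwargs) -> str:
    return str(query_engine.query(prompt))

validated = guard(
    llamaindex_llm_api,
    prompt="What are the main products of our company?",
)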

@kaushikb11 changed the title from feat: add GuardrailsQueryEngine for llama index to feat: add GuardrailsEngine for llama index on Aug 23, 2024

kaushikb11 commented Aug 23, 2024

Why implement this as a query engine instead of something else?

  • Query engines in LlamaIndex are end-to-end pipelines that allow asking questions over data. They handle the retrieval of relevant context and passing it to the LLM along with the query.
  • By implementing GuardrailsEngine as a query engine, it integrates seamlessly with LlamaIndex's existing infrastructure, allowing users to easily wrap it on top of other query engines (see the sketch after this list).
  • Query engines in LlamaIndex are designed to work with both one-off queries and multi-turn conversations (via chat engines), making the GuardrailsEngine versatile for various use cases.
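
For example, here is a hedged sketch reusing names from the PR description (RetrieverQueryEngine is a standard LlamaIndex engine; the wiring is illustrative, not prescriptive):

from llama_index.core.query_engine import RetrieverQueryEngine

# Because GuardrailsEngine is itself a BaseQueryEngine, it can wrap any
# other query engine, e.g. one built directly from a retriever.
retriever = index.as_retriever(similarity_top_k=3)
inner_engine = RetrieverQueryEngine.from_args(retriever)

guarded_engine = GuardrailsEngine(engine=inner_engine, guard=guard)
response = guarded_engine.query("Summarize our product lineup.")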

Guardrails already natively has support for input and output validation at the guard level. You can do Guard().use(Validator, on='messages') to run validation on input.

You're correct. I removed the need to define the input and output validators separately; the current design should account for it. This will be highlighted in the documentation.

The flow here is interesting, and I wonder if you've thought of inverting it. I.e., in the current code, ...... What if we passed a LlamaIndex module as the LLM call itself to an existing guard? What do we lose with that design?

Let's consider the implications of inverting the flow:
Current flow: Input validation -> LlamaIndex LLM call -> Output parsing
Proposed flow: Pass LlamaIndex module as LLM to Guardrails Guard

Potential benefits of the inverted design:

  • It would leverage more of Guardrails' native functionality, potentially simplifying the implementation.
  • It might offer more flexibility in applying different guardrails to different LlamaIndex components.

Potential drawbacks or limitations:

  • We might lose some fine-grained control over the LlamaIndex query process, as it would be abstracted within the Guard's LLM call.
  • It could be more challenging to access and utilize LlamaIndex-specific features and metadata.
  • It gets messier to deal with the LLM API responses; we could pass a wrapper, but that doesn't seem clean from a developer's perspective.
  • The integration might be less intuitive for users already familiar with LlamaIndex's query and chat engine paradigm (very important for developer experience).

kaushikb11 commented Aug 23, 2024

Could you add some unit and integration tests to assert the code is acting as expected?

Yup, of course. It would be better for us to align on the RFC first. I will be fixing the typing issues too.
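
As a starting point, a hypothetical shape for such a test; the custom-validator imports and the stubbed response attribute are assumptions about the Guardrails version and the engine's internals:

import pytest
from unittest.mock import MagicMock

from guardrails import Guard
from guardrails.validators import FailResult, PassResult, Validator, register_validator
from guardrails.integrations.llama_index import GuardrailsEngine


@register_validator(name="test/no-competitor", data_type="string")
class NoCompetitor(Validator):
    """Toy validator that fails whenever the output mentions 'Apple'."""

    def validate(self, value, metadata):
        if "Apple" in value:
            return FailResult(error_message="Output mentions a competitor.")
        return PassResult()


def test_query_raises_on_validation_failure():
    mock_engine = MagicMock()
    # Assumes GuardrailsEngine reads the wrapped engine's string response.
    mock_engine.query.return_value = MagicMock(response="Apple makes great phones.")

    guard = Guard().use(NoCompetitor(on_fail="exception"))
    engine = GuardrailsEngine(engine=mock_engine, guard=guard)

    with pytest.raises(Exception):
        engine.query("How do we compare to competitors?")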

@kaushikb11 marked this pull request as ready for review on August 27, 2024 12:50

This PR is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 14 days.

@github-actions bot added the Stale label on Sep 29, 2024

This PR was closed because it has been stalled for 14 days with no activity.

@github-actions bot closed this on Oct 14, 2024
@CalebCourier reopened this on Nov 12, 2024
@CalebCourier

Closing in favor of branch on guardrails repo: https://github.com/guardrails-ai/guardrails/tree/feat/llama-index

@CalebCourier

See: #1160
