LangChain's XMLOutputParser vulnerable to XML Entity Expansion
Moderate severity
GitHub Reviewed
Published
Mar 26, 2024
to the GitHub Advisory Database
•
Updated Mar 27, 2024
Description
Published by the National Vulnerability Database
Mar 26, 2024
Published to the GitHub Advisory Database
Mar 26, 2024
Reviewed
Mar 26, 2024
Last updated
Mar 27, 2024
The XMLOutputParser in LangChain uses the etree module from the XML parser in the standard python library which has some XML vulnerabilities; see: https://docs.python.org/3/library/xml.html
This primarily affects users that combine an LLM (or agent) with the
XMLOutputParser
and expose the component via an endpoint on a web-service.This would allow a malicious party to attempt to manipulate the LLM to produce a malicious payload for the parser that would compromise the availability of the service.
A successful attack is predicated on:
References