Unstructured library is unable to extract CDATA from XML data

Sample XML:

`<GENERAL_INFO><TITLE><![CDATA[Mobile Apple Devices (iPhones, iPads, and Smartwatches)]]></TITLE><SUMMARY><![CDATA[<p>This article highlights the key benefits and specifications of Apple iPhones, iPads, and Smartwatches.</p>]]></SUMMARY></GENERAL_INFO>`

Code used to extract the text from the XML:
```
from unstructured.partition.html import partition_html

# _html_text is a string holding the sample XML shown above
_text = ' '.join([element.text for element in partition_html(text=_html_text)])
```


Is there a flag or function that enables extracting the content of CDATA sections?
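
For comparison, here is a minimal sketch of the behaviour I am hoping for. It uses only the Python standard library, not `unstructured`. It assumes the input is well-formed XML, with every CDATA section closed by `]]>`. The names `sample_xml` and `extract_cdata_text` are hypothetical and only for illustration. `xml.etree.ElementTree` exposes CDATA content as ordinary element text:

```python
import xml.etree.ElementTree as ET

# Sample XML from this issue, with the SUMMARY CDATA section closed by "]]>"
# (assumption: the input is well-formed XML).
sample_xml = (
    "<GENERAL_INFO>"
    "<TITLE><![CDATA[Mobile Apple Devices (iPhones, iPads, and Smartwatches)]]></TITLE>"
    "<SUMMARY><![CDATA[<p>This article highlights the key benefits and specifications "
    "of Apple iPhones, iPads, and Smartwatches.</p>]]></SUMMARY>"
    "</GENERAL_INFO>"
)


def extract_cdata_text(xml_text: str) -> dict:
    """Map each direct child tag of the root to its text content.

    The XML parser hands CDATA content over as plain character data,
    so it shows up in `child.text` without any special handling.
    """
    root = ET.fromstring(xml_text)
    return {child.tag: (child.text or "") for child in root}


fields = extract_cdata_text(sample_xml)
print(fields["TITLE"])
print(fields["SUMMARY"])
```

The `SUMMARY` value is itself an HTML fragment, so it could presumably be passed on to `partition_html(text=...)` as a second step. I have not verified that this is the recommended approach.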


Unstructured library is unable to extract CDATA from XML data #3075
