Skip to content

Specify columns [indices?] to return raw HTML on read_html()? #11921

Closed
@jeremyabramson

Description

@jeremyabramson

As per this: http://stackoverflow.com/questions/34236870/pandas-read-html-to-return-raw-html-contents-for-certain-rows-cells-etc?noredirect=1#comment56359521_34236870 Pandas returns the HTML stripped text of tables it scrapes. It might be nice to be able to specify columns for which you would return the raw HTML for post-processing. This would let you maintain references to images, grab links, etc.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Dtype ConversionsUnexpected or buggy dtype conversionsDuplicate ReportDuplicate issue or pull requestIO HTMLread_html, to_html, Styler.apply, Styler.applymap

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions