- Installation
- Basic usage
- Extensions
- Filtering links, images and iframes hosts
- Forcing HTTPS on links, images and iframes hosts
- Configuring allowed attributes
html-sanitizer requires PHP 7.1+.
You can install the library using the following command:
composer require tgalopin/html-sanitizer
The main entry point to the sanitizer is the HtmlSanitizer\Sanitizer
class. It requires
an array of configuration:
$sanitizer = HtmlSanitizer\Sanitizer::create(['extensions' => ['basic']]);
$safeHtml = $sanitizer->sanitize($untrustedHtml);
The sanitizer works using extensions. Extensions are a set of features that you can easily enable to allow specific tags in the content (read the next part to learn more about them).
Note that the sanitizer is working using a strict whitelist of allowed tags: in the previous example, the sanitizer would allow only the basic HTML5 tags (
strong
,a
,div
, etc., ).
Extensions are a way to quickly add sets of tags to the whitelist of allowed tags. There are 8 core extensions that you can enable by adding them in your configuration:
$sanitizer = HtmlSanitizer\Sanitizer::create([
'extensions' => ['basic', 'code', 'image', 'list', 'table', 'iframe', 'details', 'extra'],
]);
$safeHtml = $sanitizer->sanitize($untrustedHtml);
Here is the list of tags each extension allow:
- basic allows the insertion of basic HTML elements:
a
,b
,br
,blockquote
,div
,del
,em
,figcaption
,figure
,h1
,h2
,h3
,h4
,h5
,h6
,i
,p
,q
,small
,span
,strong
,sub
,sup
,u
- list allows the insertion of lists:
dd
,dl
,dt
,li
,ol
,ul
- table allows the insertion of tables:
table
,thead
,tbody
,tfoot
,tr
,td
,th
- image allows the insertion of images:
img
- code allows the insertion of code blocks:
pre
,code
- iframe allows the insertion of iframes:
iframe
- details allows the insertion of view/hide blocks:
details
,summary
- extra allows the insertion of the following tags:
abbr
,caption
,hr
,rp
,rt
,ruby
Note: sensible attributes are allowed by default for each tag (for instance, the
src
attribute is allowed by default on images). You can also override these allowed attributes manually if you need to.
Note: the Sanitizer does not allow relative URLs: they are always filtered out for security reasons.
The sanitizer basic, image and iframe extensions provide a feature to filter hosts, which can be useful to avoid connecting to external websites that may, for instance, track your website views.
To enable this feature, you need to configure the tags:
$sanitizer = HtmlSanitizer\Sanitizer::create([
'extensions' => ['basic', 'image', 'iframe'],
'tags' => [
'a' => [
/*
* If an array is provided, links targeting other hosts than one in this array
* will be disabled (the `href` attribute will be blank). This can be useful if you want
* to prevent links targeting external websites. Keep null to allow all hosts.
* Any allowed domain also includes its subdomains.
*
* Example:
* 'allowed_hosts' => ['trusted1.com', 'google.com'],
*/
'allowed_hosts' => null,
/*
* If true, mailto links will be accepted.
*/
'allow_mailto' => false,
/*
* If true, relative links will be accepted.
*/
'allow_relative_links' => false,
],
'img' => [
/*
* If an array is provided, images relying on other hosts than one in this array
* will be disabled (the `src` attribute will be blank). This can be useful if you want
* to prevent images contacting external websites. Keep null to allow all hosts.
* Any allowed domain also includes its subdomains.
*
* Example:
* 'allowed_hosts' => ['trusted1.com', 'google.com'],
*/
'allowed_hosts' => null,
/*
* If true, images data-uri URLs will be accepted.
*/
'allow_data_uri' => false,
/*
* If true, relative links will be accepted.
*/
'allow_relative_links' => false,
],
'iframe' => [
/*
* If an array is provided, iframes relying on other hosts than one in this array
* will be disabled (the `src` attribute will be blank). This can be useful if you want
* to prevent iframes contacting external websites.
* Any allowed domain also includes its subdomains.
*
* Example:
* 'allowed_hosts' => ['trusted1.com', 'google.com'],
*/
'allowed_hosts' => null,
/*
* If true, relative links will be accepted.
*/
'allow_relative_links' => false,
],
],
]);
The sanitizer basic, image and iframe extensions provide a feature to force HTTPs on targeted hosts.
To enable this feature, you need to configure the tags:
$sanitizer = HtmlSanitizer\Sanitizer::create([
'extensions' => ['basic', 'image', 'iframe'],
'tags' => [
'a' => [
/*
* If true, all links targets using the HTTP protocol will be rewritten to use HTTPS instead.
*/
'force_https' => false,
],
'img' => [
/*
* If true, all images URLs using the HTTP protocol will be rewritten to use HTTPS instead.
*/
'force_https' => false,
],
'iframe' => [
/*
* If true, all iframes URLs using the HTTP protocol will be rewritten to use HTTPS instead.
*/
'force_https' => false,
],
],
]);
The core extensions define sensible default allowed attributes for each tag, which mean you usually won't need to change them. However, if you want to customize which attributes are allowed on specific tags, you can use a tag-specific configuration for them.
For instance, to allow the class
attribute on the div
and img
tags, you can use the following configuration:
$sanitizer = HtmlSanitizer\Sanitizer::create([
'extensions' => ['basic'],
'tags' => [
'div' => [
'allowed_attributes' => ['class'],
],
'img' => [
'allowed_attributes' => ['src', 'alt', 'title', 'class'],
],
],
]);