Structured data allows developers and publishers to mark up content in a way that it's easy for machines to understand. Proper use of structured data enables third-parties like Google to parse reviews, event data, and contact information in meaningful ways, ensuring you're getting the most "bang" out of your publishing buck.
Fortunately, the major players in the Search game, including Google, Microsoft, Yahoo!, and Yandex) came together in the early 2010s to form Schema.org, a collaborative, community-driven standard for structured data.
With the major search engines and communities behind it, we're all marking up everything with appropriate structured data now, right?
Unfortunately, the process of implementing Schema.org in a site – especially one driven by dynamic content – is less than straightforward. WordPress, for instance, powers roughly a quarter of the web, but implementing structured data across hundreds of thousands of themes would be a nightmare.
Or, at least it would be, if it weren't for Schemify.
There are two approaches to adding structured data to a website: via the markup or JSON for Linking Data (JSON-LD).
Consider the following author information, which might appear at the bottom of a blog post:
<div class="author">
<img src="http://en.gravatar.com/avatar/88ea4e10ed968136228545d7112d82cb?s=200" alt="Steve Grunwell" />
<h2><a href="https://stevegrunwell.com" rel="author">Steve Grunwell</a></h2>
<p>Steve is the Director of Technology at Growella. When he's not working, you can find him speaking at conferences, roasting coffee, or spending time with his wife and daughter</p>
</div>
If I wanted this information to use Schema.org markup, it would look something like this:
<div class="author" itemscope itemtype="http://schema.org/Person">
<img src="http://en.gravatar.com/avatar/88ea4e10ed968136228545d7112d82cb?s=200" alt="Steve Grunwell" itemprop="image" />
<h2 itemprop="name"><a href="https://stevegrunwell.com" rel="author" itemprop="url">Steve Grunwell</a></h2>
<p itemprop="description">Steve is the Director of Technology at Growella. When he's not working, you can find him speaking at conferences, roasting coffee, or spending time with his wife and daughter</p>
</div>
While that may not seem like a lot of work, that's still a lot of extra markup being added. Even worse, a lot of that markup might normally be generated by WordPress helper functions like get_avatar()
, which means extra work to get the necessary itemprop
attribute in place.
Fortunately, the other approach for adding structured data is much more straight-forward in a theme, as it's simply a JSON-LD object:
<script type="application/ld+json">
{
"@context": "http://schema.org",
"@type": "Person",
"name": "Steve Grunwell",
"url": "https://stevegrunwell.com",
"image": "http://en.gravatar.com/avatar/88ea4e10ed968136228545d7112d82cb?s=200",
"description": "Steve is the Director of Technology at Growella. When he's not working, you can find him speaking at conferences, roasting coffee, or spending time with his wife and daughter"
}
</script>
We simply generate this JSON-LD object and append it to our document's <body>
. When Google or anyone else who supports JSON-LD structured data parses our page, they can immediately understand that Steve Grunwell is a person and determine his website, avatar, and biography.
The best part? There's no need to change the markup within our theme!
Schemify aims to automate the generation of JSON-LD objects for content within WordPress. Its flexible structure and reasonable defaults enables drop-in support for structured data, regardless of the WordPress theme.
After uploading Schemify to your WordPress instance, activate it via the Plugins screen in wp-admin.
Out of the box, Schemify has support for the "post", "page", and "attachment" post types, but you can easily enable it on your own custom post types via add_post_type_support()
:
/**
* Add Schemify support to the "book" custom post type.
*/
function mytheme_add_schemify_support() {
add_post_type_support( 'book', 'schemify' );
}
add_action( 'init', 'mytheme_add_schemify_support' );
Note: You may need to adjust the action ordering to ensure that your
add_post_type_support()
callback runs after your custom post type is registered. You may alternately choose to add "schemify" to thesupports
parameter withinregister_post_type()
as the custom post type is registered.
Out of the box, Schemify will append a JSON-LD object to the footer of your site's homepage and any single posts that have registered post type support for Schemify.
While that might be plenty for most sites, more complex sites will likely want to tweak the way Schemify works to optimize the way content is represented.
When constructing Schemify objects, it's important to remember that a single, top-level Schema can have any number of nested objects. For example, a single post may be represented with a BlogPosting
object, which has properties like "author", "publisher", and "image", which are represented by Person
, Organization
, and ImageObject
objects, respectively.
As each object is built, its values are passed through the schemify_get_properties_$schema
filter, where $schema
is the current object's declared Schema. The result is then passed through the schemify_get_properties
filter, which receives the same arguments but is applied to every schema.
The filters are each passed up to four arguments:
- (array) $data
- The collection of properties assembled for this object.
- (string) $schema
- The current schema being filtered.
- (int) $object_id
- The object ID being constructed.
- (bool) $is_main
- Is this the top-level JSON-LD schema being constructed?
Let's say you've added a custom twitter_url
user meta property to user profiles on your site, and wish to inject this information into the "sameAs" property on a Person
object:
/**
* Add a user's "twitter_url" user meta.
*
* @param array $data The collection of properties assembled for this object.
* @param string $schema The current schema being filtered.
* @param int $object_id The object ID being constructed.
*/
function add_twitter_url( $data, $schema, $object_id ) {
if ( $twitter = get_user_meta( $object_id, 'twitter_url', true ) ) {
$data['sameAs'] = $twitter;
}
return $data;
}
add_filter( 'schemify_get_properties_Person', __NAMESPACE__ . '\add_twitter_url', 10, 3 );
Using the nested nature of Schema objects, you can easily use Schemify to manipulate your data objects.
Unless told otherwise, Schemify will represent all custom post type objects using the Thing
Schema, which is often undesirable. Fortunately, there are two ways to override this: via a filter or at post-type registration (via register_post_type()
).
Before any object is built, Schemify needs to determine which Schema should be used. This value is then passed through the schemify_schema
filter, where it can be modified.
- (string) $schema
- The schema to use for this object.
- (string) $object_type
- The type of object being constructed.
- (string) $post_type
- The object's post type. If $object_type is not equal to 'post' will be an empty string.
- (int) $object_id
- The post or object ID.
For example, if you wanted to overwrite the default WebPage
schema used for the "page" post type to BlogPosting
, you could do so with the following:
/**
* Use "BlogPosting" instead of "WebPage" for single pages.
*
* @param string $schema The schema to use for this post.
* @param string $object_type The type of object being constructed.
* @param string $post_type The object's post type. If $object_type is not equal
* to 'post' this will be an empty string.
*/
function mytheme_treat_pages_as_blog_posts( $schema, $object_type, $post_type ) {
if ( 'page' === $post_type ) {
$schema = 'BlogPosting';
}
return $schema;
}
add_filter( 'schemify_schema', 'mytheme_treat_pages_as_blog_posts', 10, 3 );
You may also set the default Schema for a custom post type at the time of registration, using the schemify_schema
property:
register_post_type( 'my-custom-post-type', array(
// ...
'schemify_schema' => 'WebPage',
) );
This approach lets you skip having to write filters to override the schema type for custom post types.