Data Model

The following is a discussion about the data model used by

The data model used is very generic and derived from RDF Schema (which in turn was derived from CycL, which in turn ...).

  1. We have a set of types, arranged in a multiple inheritance hierarchy where each type may be a sub class of multiple types.
  2. We have a set of properties:
    1. each property may have one or more types as its domains. The property may be used for instances of any of these types.
    2. each property may have one or more types as its ranges. The value(s) of the property should be instances of at least one of these types.

The decision to allow multiple domains and ranges was purely pragmatic. While the computational properties of systems with a single domain and range are easier to understand, in practice, this forces the creation of a lot of artifical types, which are there purely to act as the domain/range of some properties.

Like many other systems, the schema presented here can be extended (with a few types like Type and Property and a few properties like domainIncludes and rangeIncludes) to allow for reflection, i.e., for the schema to be represented in terms of itself.

The canonical machine representation of is in RDFa:

Now-obsolete snapshots of Microdata and OWL experimental data dumps were previously published. See also TopBraid's version.

The type hierarchy presented on this site is not intended to be a 'global ontology' of the world. It only covers the types of entities for which we (Microsoft, Yahoo!, Google and Yandex), think we can provide some special treatment for, through our search engines, in the near future.


While we would like all the markup we get to follow the schema, in practice, we expect a lot of data that does not. We expect properties to be used with new types. We also expect that often, where we expect a property value of type Person, Place, Organization or some other subClassOf Thing, we will get a text string. In the spirit of "some data is better than none", we will accept this markup and do the best we can.

Mapping to RDFa Lite

Our use of Microdata maps easily into RDFa Lite. In fact, all of can be used with the RDFa Lite syntax. The RDFa Lite version of the markup looks almost isomorphic to the Microdata version. The following sample demonstrates the use of RDFa Lite to mark up a Product type example:
<div vocab="" typeof="Product">
  <img property="image" src="dell-30in-lcd.jpg" />
  <span property="name">Dell UltraSharp 30" LCD Monitor</span>

  <div property="aggregateRating"
    <span property="ratingValue">87</span>
    out of <span property="bestRating">100</span>
    based on <span property="ratingCount">24</span> user ratings

  <div property="offers" typeof="AggregateOffer">
    <span property="lowPrice">$1250</span>
    to <span property="highPrice">$1495</span>
    from <span property="offerCount">8</span> sellers

  <div property="offers" typeof="Offer">
    <a property="url" href="">
     Save A Lot Monitors - $1250</a>
  <div property="offers" typeof="Offer">
    <a property="url" href="">
     Jon Doe's Gadgets - $1350</a>

More specifically:

  1. itemprop is replaced with property.
  2. itemscope is dropped.
  3. itemtype is replaced with typeof.

In addition, the attribute value pair vocab="" is added to the body or some other enclosing tag.


Background information on mainEntityOfPage / mainEntity properties.

mainEntityOfPage "Indicates a page (or other CreativeWork) for which this thing is the main entity being described."

Many (but not all) pages have a fairly clear primary topic, some entity or thing that the page describes. For example a restaurant's home page might be primarily about that Restaurant, or an event listing page might represent a single event. The mainEntity and mainEntityOfPage properties allow you to explicitly express the relationship between the page and the primary entity.

Related properties include sameAs, about, and url.

The sameAs and url properties are both similar to mainEntityOfPage. The url property should be reserved to refer to more official or authoritative web pages, such as the item’s official website. The sameAs property also relates a thing to a page that indirectly identifies it. Whereas sameAs emphasises well known pages, the mainEntityOfPage property serves more to clarify which of several entities is the main one for that page.

mainEntityOfPage can be used for any page, including those not recognized as authoritative for that entity. For example, for a product, sameAs might refer to a page on the manufacturer’s official site with specs for the product, while mainEntityOfPage might be used on pages within various retailers’ sites giving details for the same product.

about is similar to mainEntity, with two key differences. First, about can refer to multiple entities/topics, while mainEntity should be used for only the primary one. Second, some pages have a primary entity that itself describes some other entity. For example, one web page may display a news article about a particular person. Another page may display a product review for a particular product. In these cases, mainEntity for the pages should refer to the news article or review, respectively, while about would more properly refer to the person or product.