Have questions about buying, selling or renting during COVID-19? Learn more

Zillow Research

Zillow Rent Index Methodology (2012)

Note: The Zillow Rent Index was first introduced in 2012, and this post details the original method for its calculation. In August 2019, the methodology was updated to better reflect the actual rental stock of homes and more accurately capture small market movements. Please see our new methodology, first published with July 2019 data, here.

Introduction

Similar to the Zillow Home Value Index (ZHVI), we created the Zillow Rent Index (ZRI) to track the monthly median rent in particular geographical regions. Like the ZHVI, we sought to create an index for rents that is unaffected by the mix of homes for rent at any particular time. This makes temporal comparisons of rents more valid since the index is tracking the rents for a consistent stock of inventory. It also makes it easier to compare the ZHVI and ZRI since they are based on a similar set of homes whereas traditional metrics tracking rent and sale prices are often based on markedly different sets of homes (often located in different neighborhoods), thus making comparisons less valid.

Underlying Data

Similar to the Zestimate, we estimate rents (Rent Zestimates) based on proprietary statistical and machine learning models. Within each county or state, the models observe recent rental listings and learn the relative contribution of various home attributes in predicting prevailing rents. These home attributes include physical facts about the home, prior sale transactions, tax assessment information and geographic location as well as the estimated market value of the home (Zestimate). Based on the patterns learned, these models estimate rental prices on all homes, including those not presently for rent. The purpose of the Rent Zestimate is to give consumers an indication of the fair market rent for a home, while the purpose of the ZRI is to give consumers insight into rental price trends in a way that is not biased by the mix of homes currently for rent.

Because of the availability of Zillow rental listing data used to train the models, Rent Zestimates are only available back to November 2010 and, consequently, each ZRI time series begins on this date as well. We generate the ZRI at seven geographic levels including neighborhood, ZIP code, city, congressional district, county, metropolitan area, state and the nation.

Market Segments

Within each region, we calculate the ZRI for various subsets of homes (or market segments) so as to afford greater insight into what is happening in a particular market. All market segments are shown in the table below. For more details about market segments, please see the Zillow Home Value Index methodology.

Table 1 (As of August 2016): Market Segments for Zillow Rent Index

Market Segment Number of Rent Zestimates Description
All Homes plus Multifamily 99.8 M Single family + condominium + cooperative + multifamily
All Homes 89.9 M Single family + condominium + cooperative
Single Family 79.9 M Single family only
Condo 9.9 M Condominium + cooperative only
1 Bedroom 1.7 M 1 Bedroom
2 Bedroom 11.6 M 2 Bedroom
3 Bedroom 30.2 M 3 Bedroom
4 Bedroom 12.5 M 4 Bedroom
5+Bedroom 2.9 M 5 Bedroom or more
Top Tier 28.2 M Top price tier among homes within the same metropolitan
Middle Tier 28.2 M Middle price tier among homes within the same metropolitan
Bottom Tier 28.2 M Bottom price tier among homes within the same metropolitan

 

Methodology

Using the estimated rent of every home as represented in the Rent Zestimate, the main steps in the construction of the ZRI are as follows:

  1. Calculate Raw Median Rent Zestimates
  2. Apply Simple 3-Month Moving Average
  3. Final Quality Control

Calculate Raw Median Rent Zestimates

Let t be a discrete independent time variable with a value at the end of each month. Let H(t) be an M by N matrix with each element hij(t) representing the number of homes at time t for the i-th market segment in the j-th geographical region, where M is the total number of market segments and N is the total number of unique regions having a minimum required number of Rent Zestimates. Currently, we have M=12 and N=57,022. Geographical regions include national, state, metro, county, city, ZIP code, neighborhood and congressional district. The Number of Rent Zestimates column in Table 1 above represents the number of homes in the i-th element of hij when j=’National’ and t=’Jan-2012’.

Let zij(t) be the vector of Rent Zestimates of all homes at time t having length hij(t) for i-th market segment and j-th region. The raw median Rent Zestimate, rij(t), for i-th market segment and j-th region is defined as:

rij(t)=Median(zij(t))

rij(t) is the median Rent Zestimate and is an element of the M by N matrix R(t). In order to ensure reliability and stability, we only compute rij when hij(t) is above some minimum threshold. For Jan 2012, there are a total of 391,375 unique set of regions and market segments for which the median could be computed:

Count{rij(t) ≠NA, for i=1,..M and j=1,..N} is 391,375.

 

Apply Simple Three-Month Moving Average

We apply a simple three-month moving average to R(t) to filter out noise in the data:

ZRI(t)={ R(t)+R(t-1)+ R(t-2)}/3

The resultant M by N matrix ZRI (t) is a smooth estimate of the median home value free of residual systematic error. This may not be as necessary for large regions such as the nation and states because of the large available data set, but it is applied to all levels for consistency.

Final Quality Control

The time series matrix ZRI(t) has the same dimension as H(t) which is M by N (as noted, 12 x 57,022). While this theoretically could produce more than 680,000 different time series, in practice many time series are eliminated because of data sparseness or temporal volatility. The general logic determining whether a ZRI time series for a particular combination of region and market segment will be suppressed from the publicly available data set includes the following elements:

  1. Number of Rent Zestimates < [threshold]
  2. Number of rental listings in most recent three months < [threshold]
  3. Temporal volatility measured by annualized, monthly or quarterly change > [threshold]
  4. Region has been deemed suspect based on a manual review

Applying the suppression criteria above, there are 195,258 unique deliverable ZRI time series for the report period ending Jan 2012. Table 2 below shows the count of regional time series by region level and market segment.

Table 2: Number of deliverable ZRI time series by region level and market segment (As of August 2016)

2 Bedroom1496061,3668,0153,90710,895

Market Segment National State MSA County City Neighborhood Zip
All Homes 1 50 683 1776 13,215 6,502 15,896
All Homes Plus Multifamily 1 49 682 1,764 13,217 6,501 15,892
Single Family 1 48 678 1764 13,192 6,327 15,832
1 Bedroom 1 51 454 877 1,994 1,171 3,393
3 Bedroom 1 50 635 1,519 10,358 5,255 13,282
4 Bedroom 1 49 555 1,327 7,690 3,398 10,708
5+Bedroom 1 49 455 972 3,338 1,162 5,649

 

Restatement

Unlike the ZHVI, there is no restatement of the ZRI in the routine monthly calculations because Rent Zestimates do not depend on data that arrive with some latency such as public record transactional data (such as is the case with Zestimates and the corresponding ZHVI). However, there are two situations in which restatements are unavoidable. First, when the boundaries of a geographic region change, the ZRI for the region will change as well since the set of homes underlying the ZRI is different. Second when we regenerate historical Rent Zestimates (for example, when a more accurate algorithm is developed), we also have to re-generate all historical ZRIs.

We are continuously working on improving the underlying algorithm to make Rent Zestimates more accurate. When major improvements to the algorithm are made, we will re-compute the historical Rent Zestimates for affected homes. Our purpose in doing so is to provide consumers with the best estimate of historical rents.

Data Coverage

We calculate the ZRI at the national level as the median Rent Zestimate of 84.9 million homes. The interactive map below displays the number of Rent Zestimates by county for the period ending Jan 31, 2012.


 

Some county-level ZRIs are suppressed based on the filter rules discussed in the Final Quality Control section above (although Rent Zestimates in those counties are used in computing higher-level ZRIs). The interactive map below shows counties that have a valid ZRI as of January 2012 (green) and those counties where the ZRI has been suppressed based on filter rules but individual Rent Zestimates are still available (red).


 

Zillow Rent Index Methodology (2012)