Note: The Zillow Rent Index was first introduced in 2012, and this post details the original method for its calculation. In August 2019, the methodology was updated to better reflect the actual rental stock of homes and more accurately capture small market movements. Please see our new methodology, first published with July 2019 data, here.
Introduction
We created the Zillow Rent Index (ZRI), like the Zillow Home Value Index (ZHVI), to track the monthly median rent in particular geographical regions. As with the ZHVI, we sought to create an index for rents that is unaffected by the mix of homes for rent at any particular time. This makes comparisons of rents over time more valid, since the index tracks the rents for a consistent stock of homes. It also makes it easier to compare the ZHVI and the ZRI, since they are based on a similar set of homes. Traditional metrics tracking rent and sale prices are often based on markedly different sets of homes (often located in different neighborhoods), which makes comparisons between them less valid.
Underlying Data
Similar to the Zestimate, we estimate rents (Rent Zestimates) based on proprietary statistical and machine learning models. Within each county or state, the models observe recent rental listings and learn the relative contribution of various home attributes in predicting prevailing rents. These home attributes include physical facts about the home, prior sale transactions, tax assessment information and geographic location as well as the estimated market value of the home (Zestimate). Based on the patterns learned, these models estimate rental prices on all homes, including those not presently for rent. The purpose of the Rent Zestimate is to give consumers an indication of the fair market rent for a home, while the purpose of the ZRI is to give consumers insight into rental price trends in a way that is not biased by the mix of homes currently for rent.
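To illustrate why estimating a rent for every home matters, the following minimal sketch compares a median taken over only the homes listed in a given month with a median taken over a fixed stock of homes. All home identifiers and rent values are hypothetical and chosen only for illustration; this is not the Rent Zestimate model itself.

```python
from statistics import median

# Hypothetical estimated rents for a fixed stock of six homes.
# These numbers are illustrative assumptions, not Zillow data.
estimated_rent = {"a": 1000, "b": 1100, "c": 1200,
                  "d": 2000, "e": 2100, "f": 2200}

# Homes that happen to be listed for rent in two consecutive months.
listed_month_1 = ["a", "b", "c"]  # cheaper homes are on the market
listed_month_2 = ["d", "e", "f"]  # pricier homes are on the market


def median_of_listed(listed):
    """Median rent over only the homes listed in a given month."""
    return median(estimated_rent[home] for home in listed)


def median_of_stock():
    """Median rent over every home, whether or not it is listed."""
    return median(estimated_rent.values())


# The listing-based median jumps purely because the mix of listings changed.
listed_medians = [median_of_listed(listed_month_1),
                  median_of_listed(listed_month_2)]

# The stock-based median is unchanged because no home's rent changed.
stock_medians = [median_of_stock(), median_of_stock()]
```

In this toy case no individual rent changes between the two months, yet the listing-based median nearly doubles, while the median over the full stock stays flat.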
Because the models are trained on Zillow rental listing data, Rent Zestimates are only available back to November 2010 and, consequently, each ZRI time series begins in that month as well. We generate the ZRI at eight geographic levels: neighborhood, ZIP code, city, congressional district, county, metropolitan area, state and the nation.
Market Segments
Within each region, we calculate the ZRI for various subsets of homes (or market segments) so as to afford greater insight into what is happening in a particular market. All market segments are shown in the table below. For more details about market segments, please see the Zillow Home Value Index methodology.
Table 1 (As of August 2016): Market Segments for Zillow Rent Index
Market Segment | Number of Rent Zestimates | Description |
All Homes plus Multifamily | 99.8 M | Single family + condominium + cooperative + multifamily |
All Homes | 89.9 M | Single family + condominium + cooperative |
Single Family | 79.9 M | Single family only |
Condo | 9.9 M | Condominium + cooperative only |
1 Bedroom | 1.7 M | 1 Bedroom |
2 Bedroom | 11.6 M | 2 Bedroom |
3 Bedroom | 30.2 M | 3 Bedroom |
4 Bedroom | 12.5 M | 4 Bedroom |
5+Bedroom | 2.9 M | 5 Bedroom or more |
Top Tier | 28.2 M | Top price tier among homes within the same metropolitan area |
Middle Tier | 28.2 M | Middle price tier among homes within the same metropolitan area |
Bottom Tier | 28.2 M | Bottom price tier among homes within the same metropolitan area |
Methodology
Using the estimated rent of every home as represented in the Rent Zestimate, the main steps in the construction of the ZRI are as follows:
- Calculate Raw Median Rent Zestimates
- Apply Simple Three-Month Moving Average
- Final Quality Control
Calculate Raw Median Rent Zestimates
Let t be a discrete time variable taking one value at the end of each month. Let H(t) be an M by N matrix in which each element hij(t) is the number of homes at time t in the i-th market segment and the j-th geographical region, where M is the total number of market segments and N is the total number of unique regions having the minimum required number of Rent Zestimates. Currently, M = 12 and N = 57,022. Geographical regions include national, state, metro, county, city, ZIP code, neighborhood and congressional district. The Number of Rent Zestimates column in Table 1 above gives hij(t) for each market segment i when j = 'National' and t = 'Jan-2012'.
Let zij(t) be the vector of Rent Zestimates of all homes at time t, of length hij(t), for the i-th market segment and the j-th region. The raw median Rent Zestimate, rij(t), for the i-th market segment and the j-th region is defined as:
rij(t) = Median(zij(t))
rij(t) is an element of the M by N matrix R(t). To ensure reliability and stability, we only compute rij(t) when hij(t) is above a minimum threshold; otherwise rij(t) is NA. For Jan 2012, there are a total of 391,375 unique combinations of region and market segment for which the median could be computed:
Count{rij(t) ≠ NA, for i = 1, ..., M and j = 1, ..., N} = 391,375.
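The raw median step can be sketched as follows. This is a minimal illustration, not the production implementation: the function name `raw_medians`, the input layout and the threshold value `MIN_HOMES` are all assumptions (the actual minimum is not stated here), and the sketch treats a group as computable when its count is at least the threshold. `None` stands in for NA.

```python
from collections import defaultdict
from statistics import median

# Hypothetical threshold; the actual minimum number of homes is not published here.
MIN_HOMES = 3


def raw_medians(homes, min_homes=MIN_HOMES):
    """Compute r_ij(t) for one month.

    homes: iterable of (segment, region, rent_zestimate) tuples. A home that
    belongs to several segments or regions appears once per combination.
    Returns {(segment, region): median rent, or None when too few homes}.
    """
    groups = defaultdict(list)  # z_ij(t): rents grouped by (segment, region)
    for segment, region, rent in homes:
        groups[(segment, region)].append(rent)

    result = {}
    for key, rents in groups.items():
        # len(rents) plays the role of h_ij(t).
        result[key] = median(rents) if len(rents) >= min_homes else None
    return result


# Illustrative data only.
homes = [
    ("Single Family", "98101", 1500),
    ("Single Family", "98101", 1700),
    ("Single Family", "98101", 1600),
    ("Condo", "98101", 1200),
]
r = raw_medians(homes)

# Analogue of Count{r_ij(t) != NA}: combinations with a computable median.
computable = sum(value is not None for value in r.values())
```

Here the single-family group has three homes and receives a median, while the condo group has only one home and is left as NA.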
Apply Simple Three-Month Moving Average
We apply a simple three-month moving average to R(t) to filter out noise in the data:
ZRI(t) = {R(t) + R(t-1) + R(t-2)} / 3
The resulting M by N matrix ZRI(t) is a smoothed estimate of the median rent, with less month-to-month noise than R(t). Smoothing may be less necessary for large regions such as the nation and states, where the underlying data set is large, but it is applied at all levels for consistency.
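For a single region and market segment, the moving average can be sketched as below. The function name is hypothetical, and the handling of edge cases is an assumption: this sketch returns NA (`None`) for the first two months and whenever any of the three raw medians in the window is NA, which the text above does not specify.

```python
def three_month_average(raw):
    """Apply ZRI(t) = (R(t) + R(t-1) + R(t-2)) / 3 to one time series.

    raw: list of monthly raw medians, oldest first; None marks NA.
    Returns a list of the same length with None where the average
    cannot be formed (an assumption of this sketch).
    """
    smoothed = []
    for t in range(len(raw)):
        window = raw[max(0, t - 2): t + 1]  # R(t-2), R(t-1), R(t)
        if len(window) < 3 or any(value is None for value in window):
            smoothed.append(None)
        else:
            smoothed.append(sum(window) / 3)
    return smoothed


# Illustrative raw medians for four consecutive months.
zri = three_month_average([1500, 1530, 1560, 1590])
```

With these inputs, the third month averages 1500, 1530 and 1560, and the fourth month averages 1530, 1560 and 1590.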
Final Quality Control
The time series matrix ZRI(t) has the same dimension as H(t) which is M by N (as noted, 12 x 57,022). While this theoretically could produce more than 680,000 different time series, in practice many time series are eliminated because of data sparseness or temporal volatility. The general logic determining whether a ZRI time series for a particular combination of region and market segment will be suppressed from the publicly available data set includes the following elements:
- Number of Rent Zestimates < [threshold]
- Number of rental listings in most recent three months < [threshold]
- Temporal volatility measured by annualized, monthly or quarterly change > [threshold]
- Region has been deemed suspect based on a manual review
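The suppression logic above can be sketched as a simple predicate. Every name and threshold value below is hypothetical, since the actual thresholds are not given, and the volatility check is reduced to a single largest absolute fractional change for brevity.

```python
from dataclasses import dataclass


@dataclass
class SeriesStats:
    """Summary statistics for one region and market segment (hypothetical layout)."""
    n_rent_zestimates: int
    n_recent_listings: int        # rental listings in the most recent three months
    max_abs_change: float         # largest absolute fractional change in the series
    manually_flagged: bool = False


# Hypothetical thresholds; the actual values are not published in this post.
MIN_ZESTIMATES = 100
MIN_LISTINGS = 10
MAX_CHANGE = 0.25


def is_suppressed(stats):
    """Return True if any one of the four suppression criteria is met."""
    return (
        stats.n_rent_zestimates < MIN_ZESTIMATES
        or stats.n_recent_listings < MIN_LISTINGS
        or stats.max_abs_change > MAX_CHANGE
        or stats.manually_flagged
    )


healthy = SeriesStats(n_rent_zestimates=5000, n_recent_listings=120, max_abs_change=0.04)
sparse = SeriesStats(n_rent_zestimates=40, n_recent_listings=2, max_abs_change=0.04)
volatile = SeriesStats(n_rent_zestimates=5000, n_recent_listings=120, max_abs_change=0.60)
flagged = SeriesStats(n_rent_zestimates=5000, n_recent_listings=120,
                      max_abs_change=0.04, manually_flagged=True)
```

A series is published only if it passes every check, so a region with plenty of data can still be suppressed for volatility or after manual review.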
Applying the suppression criteria above, there are 195,258 unique deliverable ZRI time series for the report period ending Jan 2012. Table 2 below shows the count of regional time series by region level and market segment.
Table 2: Number of deliverable ZRI time series by region level and market segment (As of August 2016)
Market Segment | National | State | MSA | County | City | Neighborhood | Zip |
All Homes | 1 | 50 | 683 | 1,776 | 13,215 | 6,502 | 15,896 |
All Homes Plus Multifamily | 1 | 49 | 682 | 1,764 | 13,217 | 6,501 | 15,892 |
Single Family | 1 | 48 | 678 | 1,764 | 13,192 | 6,327 | 15,832 |
1 Bedroom | 1 | 51 | 454 | 877 | 1,994 | 1,171 | 3,393 |
2 Bedroom | 1 | 49 | 606 | 1,366 | 8,015 | 3,907 | 10,895 |
3 Bedroom | 1 | 50 | 635 | 1,519 | 10,358 | 5,255 | 13,282 |
4 Bedroom | 1 | 49 | 555 | 1,327 | 7,690 | 3,398 | 10,708 |
5+Bedroom | 1 | 49 | 455 | 972 | 3,338 | 1,162 | 5,649 |
Restatement
Unlike the ZHVI, the ZRI is not restated in the routine monthly calculations, because Rent Zestimates do not depend on data that arrive with some latency (Zestimates, and therefore the ZHVI, depend on public record transactional data, which do). However, there are two situations in which restatements are unavoidable. First, when the boundaries of a geographic region change, the ZRI for the region changes as well, since the set of homes underlying the ZRI is different. Second, when we regenerate historical Rent Zestimates (for example, when a more accurate algorithm is developed), we also have to regenerate all historical ZRIs.
We are continuously working on improving the underlying algorithm to make Rent Zestimates more accurate. When major improvements to the algorithm are made, we will re-compute the historical Rent Zestimates for affected homes. Our purpose in doing so is to provide consumers with the best estimate of historical rents.
Data Coverage
We calculate the ZRI at the national level as the median Rent Zestimate of 84.9 million homes. The interactive map below displays the number of Rent Zestimates by county for the period ending Jan 31, 2012.
Some county-level ZRIs are suppressed based on the filter rules discussed in the Final Quality Control section above (although Rent Zestimates in those counties are used in computing higher-level ZRIs). The interactive map below shows counties that have a valid ZRI as of January 2012 (green) and those counties where the ZRI has been suppressed based on filter rules but individual Rent Zestimates are still available (red).