Capturing SkyTour: How structured drone capture powers immersive 3D

Zillow Media Experts, real-time feedback and AI modeling work together in the field

zillow media expert capturing drone footage with skytour
Zillow

Written by on March 16, 2026

When we set out to bring SkyTour to life, we weren’t just building a new feature. We were pioneering an industry-first capture process that relies on quality at every step. What sets SkyTour apart isn’t just the technology — it’s the people behind it.

The footage captured in the field becomes the foundation for SkyTour’s AI engine, where Structure-from-Motion and 3D Gaussian Splatting (3DGS) transform imagery into a volumetric model. That trained model is then rendered through a navigation model that keeps the home at the center of the experience, enabling buyers to explore each property interactively. Designing capture meant designing for the entire system — ensuring decisions made in the field translate seamlessly into reconstruction and the final experience.

Because 3DGS synthesizes new views from real imagery, it performs best when desired viewpoints are close to actual captured frames. While 3DGS can generate novel views, it depends on consistent, comprehensive coverage of the scene. Early testing showed that delivering a strong user experience depends on capturing the home from viewpoints that closely match how people will explore it.

Capturing the home from every critical angle

After designing the interactive viewing experience, we crafted a structured flight plan to match. The core of our methodology is a slow, steady set of orbits at multiple elevations, with camera settings set to minimize blur.

This approach effectively covers the viewing volume with high-quality source imagery.

By holding consistent flight characteristics and centering the home, we generate comprehensive coverage and sufficient diversity of viewpoints to provide the spatial constraints required for a geometrically sound splat model.

A single continuous video provides a clear, unbroken signal that allows us to precisely reconstruct the drone’s flight path. Those views are then combined into a seamless 3D space.

image showing how skytour works with zillow media experts
Flight plans are structured around a slow, steady set of orbits at multiple elevations.

Scaling this methodology from early tests to more than 300 pilots required a new mindset. Drone photography often encourages creative freedom, but SkyTour depends on structured procedural capture. We introduced training that emphasized consistency in orbit structure, velocity and framing. Once photographers saw how this framework minimized 3DGS artifacts, reduced missing coverage and improved fidelity, they embraced it.

Capture isn’t an isolated operational step. It’s the first stage of an integrated AI system, where decisions in the field directly influence model quality and user experience.

Navigating obstacles and ensuring compliance

Every property presents unique challenges. Trees, power lines, neighboring structures and environmental barriers require judgment and adaptability. Balancing a prescribed capture process with the flexibility to adapt to real-world conditions is one of the most complex aspects of operationalizing SkyTour.

Lessons from beta testing informed situational training, including obstacle avoidance, environmental adjustments and neighbor de-escalation. Our pilots must be confident in their decision-making and supported in making the right call in the field. Continuous communication, shared feedback and iterative improvement have helped refine flying techniques across the network.

Because the Zillow Media Experts team had already provided a variety of drone-based services, we began with a strong foundation in FAA compliance and flight safety. In areas where SkyTour isn’t possible due to federal or local regulations — such as airspace ceilings below 100 feet, temporary flight restrictions (TFRs) or no-fly zones — our Customer Service and Media Fulfillment teams coordinate directly with clients.

Operational discipline ensures that structured capture can scale responsibly across diverse environments.

Delivering real-time feedback in the field

Coverage and viewpoint diversity are essential for high-quality 3DGS reconstruction. Early on, photographers relied primarily on their phone’s live video viewfinder during capture. However, because final 3DGS generation is computationally intensive, they would not know whether coverage was sufficient until hours after leaving the site.

To provide more immediate insight, we developed two complementary feedback systems.

First, we use GPS and altitude data recorded during video capture. By analyzing several hundred SkyTours, we parameterized flight paths against an abstract orbital model and evaluated resulting quality to design a capture path checklist. During new captures, flight paths are parameterized and compared against this model, helping photographers stay within an optimal range during orbit.

Second, we built a preview system based on photogrammetry. By decimating captured video and generating a low-fidelity textured mesh on device, we quickly compute a representation of the home immediately after capture. Photogrammetry provides both geometry and pose data — the location and viewing angle of each image used to construct the model.

For each face in the mesh, we compute visual coverage based on the number of poses in which that face is visible. We then layer this scalar field onto the texture map, highlighting areas with insufficient coverage using a visual pattern that photographers can quickly interpret. If needed, they can perform additional passes before leaving the property.

This shift moved capture from a film-camera model to a digital-camera model, giving photographers visibility into how flight decisions translate into the final 3D scene.

Integrating human expertise at every stage

Even with structured capture and AI-driven reconstruction, human oversight remains essential. Zillow annotators review every SkyTour before delivery, identify errors, and collaborate with regional managers to improve performance. For some homes, foliage or structures prevent the drone pilot from capturing parts of the lower orbit at a high quality, and in those cases the annotation team can modify the extents of the view cone to compensate. Where environments make capture difficult or impossible, we provide reshoot and refund processes to ensure quality and client satisfaction.

From structured flight paths to real-time feedback and final review, the decisions made in the field ripple forward to shape reconstruction fidelity and enable the smooth, interactive exploration buyers experience when they launch a SkyTour.

SkyTour represents a coordinated system in which skilled professionals and AI-powered reconstruction work together across capture, production and display — expanding what it means to understand a home before ever stepping inside.

This system reflects a deliberate approach to AI by pairing machine intelligence with human expertise and production infrastructure to reshape how real estate is experienced.

Contact Our PR Team

Speak with a Zillow expert about company news or research.

Email the team

Related Articles

couple adding a for rent sign in their front yard

5 min read

Simplifying the shift from seller to landlord

person looking at 3d tours on their tablet

5 min read

How Zillow made virtual tours feel even more like being there in person

Person looking at SkyTour experience on their tablet

5 min read

Inside SkyTour’s AI engine: From drone footage to immersive 3D

illustration of mailbox

Sign up for Zillow news updates

Subscribe to receive daily emails for the latest Zillow news and announcements, product updates and more.

 

By submitting this form, you agree to receive email communication from Zillow. We respect your privacy. See our privacy policy.