LogoLogo
HomeDocumentationLoginTry for free
  • CARTO Academy
  • Working with geospatial data
    • Geospatial data: the basics
      • What is location data?
      • Types of location data
      • Changing between types of geographical support
    • Optimizing your data for spatial analysis
    • Introduction to Spatial Indexes
      • Spatial Index support in CARTO
      • Create or enrich an index
      • Work with unique Spatial Index properties
      • Scaling common geoprocessing tasks with Spatial Indexes
      • Using Spatial Indexes for analysis
        • Calculating traffic accident rates
        • Which cell phone towers serve the most people?
    • The modern geospatial analysis stack
      • Spatial data management and analytics with CARTO QGIS Plugin
      • Using data from a REST API for real-time updates
  • Building interactive maps
    • Introduction to CARTO Builder
    • Data sources & map layers
    • Widgets & SQL Parameters
    • AI Agents
    • Data visualization
      • Build a dashboard with styled point locations
      • Style qualitative data using hex color codes
      • Create an animated visualization with time series
      • Visualize administrative regions by defined zoom levels
      • Build a dashboard to understand historic weather events
      • Customize your visualization with tailored-made basemaps
      • Visualize static geometries with attributes varying over time
      • Mapping the precipitation impact of Hurricane Milton with raster data
    • Data analysis
      • Filtering multiple data sources simultaneously with SQL Parameters
      • Generate a dynamic index based on user-defined weighted variables
      • Create a dashboard with user-defined analysis using SQL Parameters
      • Analyzing multiple drive-time catchment areas dynamically
      • Extract insights from your maps with AI Agents
    • Sharing and collaborating
      • Dynamically control your maps using URL parameters
      • Embedding maps in BI platforms
    • Solving geospatial use-cases
      • Build a store performance monitoring dashboard for retail stores in the USA
      • Analyzing Airbnb ratings in Los Angeles
      • Assessing the damages of La Palma Volcano
    • CARTO Map Gallery
  • Creating workflows
    • Introduction to CARTO Workflows
    • Step-by-step tutorials
      • Creating a composite score for fire risk
      • Spatial Scoring: Measuring merchant attractiveness and performance
      • Using crime data & spatial analysis to assess home insurance risk
      • Identify the best billboards and stores for a multi-channel product launch campaign
      • Estimate the population covered by LTE cells
      • A no-code approach to optimizing OOH advertising locations
      • Optimizing site selection for EV charging stations
      • How to optimize location planning for wind turbines
      • Calculate population living around top retail locations
      • Identifying customers potentially affected by an active fire in California
      • Finding stores in areas with weather risks
      • How to run scalable routing analysis the easy way
      • Geomarketing techniques for targeting sportswear consumers
      • How to use GenAI to optimize your spatial analysis
      • Analyzing origin and destination patterns
      • Understanding accident hotspots
      • Real-Time Flood Claims Analysis
      • Train a classification model to estimate customer churn
      • Space-time anomaly detection for real-time portfolio management
      • Identify buildings in areas with a deficit of cell network antennas
    • Workflow templates
      • Data Preparation
      • Data Enrichment
      • Spatial Indexes
      • Spatial Analysis
      • Generating new spatial data
      • Statistics
      • Retail and CPG
      • Telco
      • Insurance
      • Out Of Home Advertising
      • BigQuery ML
      • Snowflake ML
  • Advanced spatial analytics
    • Introduction to the Analytics Toolbox
    • Spatial Analytics for BigQuery
      • Step-by-step tutorials
        • How to create a composite score with your spatial data
        • Space-time hotspot analysis: Identifying traffic accident hotspots
        • Spacetime hotspot classification: Understanding collision patterns
        • Time series clustering: Identifying areas with similar traffic accident patterns
        • Detecting space-time anomalous regions to improve real estate portfolio management (quick start)
        • Detecting space-time anomalous regions to improve real estate portfolio management
        • Computing the spatial autocorrelation of POIs locations in Berlin
        • Identifying amenity hotspots in Stockholm
        • Applying GWR to understand Airbnb listings prices
        • Analyzing signal coverage with line-of-sight calculation and path loss estimation
        • Generating trade areas based on drive/walk-time isolines
        • Geocoding your address data
        • Find similar locations based on their trade areas
        • Calculating market penetration in CPG with merchant universe matching
        • Measuring merchant attractiveness and performance in CPG with spatial scores
        • Segmenting CPG merchants using trade areas characteristics
        • Store cannibalization: quantifying the effect of opening new stores on your existing network
        • Find Twin Areas of top-performing stores
        • Opening a new Pizza Hut location in Honolulu
        • An H3 grid of Starbucks locations and simple cannibalization analysis
        • Data enrichment using the Data Observatory
        • New police stations based on Chicago crime location clusters
        • Interpolating elevation along a road using kriging
        • Analyzing weather stations coverage using a Voronoi diagram
        • A NYC subway connection graph using Delaunay triangulation
        • Computing US airport connections and route interpolations
        • Identifying earthquake-prone areas in the state of California
        • Bikeshare stations within a San Francisco buffer
        • Census areas in the UK within tiles of multiple resolutions
        • Creating simple tilesets
        • Creating spatial index tilesets
        • Creating aggregation tilesets
        • Using raster and vector data to calculate total rooftop PV potential in the US
        • Using the routing module
      • About Analytics Toolbox regions
    • Spatial Analytics for Snowflake
      • Step-by-step tutorials
        • How to create a composite score with your spatial data
        • Space-time hotspot analysis: Identifying traffic accident hotspots
        • Computing the spatial autocorrelation of POIs locations in Berlin
        • Identifying amenity hotspots in Stockholm
        • Applying GWR to understand Airbnb listings prices
        • Opening a new Pizza Hut location in Honolulu
        • Generating trade areas based on drive/walk-time isolines
        • Geocoding your address data
        • Creating spatial index tilesets
        • A Quadkey grid of stores locations and simple cannibalization analysis
        • Minkowski distance to perform cannibalization analysis
        • Computing US airport connections and route interpolations
        • New supplier offices based on store locations clusters
        • Analyzing store location coverage using a Voronoi diagram
        • Enrichment of catchment areas for store characterization
        • Data enrichment using the Data Observatory
    • Spatial Analytics for Redshift
      • Step-by-step tutorials
        • Generating trade areas based on drive/walk-time isolines
        • Geocoding your address data
        • Creating spatial index tilesets
Powered by GitBook
On this page
  • Step 1. Find your raster data and download it
  • Step 2. Re-project the raster to a quadbin grid
  • Step 3. Install CARTO's Raster Loader
  • Step 4. Upload raster to BigQuery
  • Step 5. Compute the PV power potential of every building in the US
  • Step 6. Calculate the total rooftop PV potential in the US

Was this helpful?

Export as PDF
  1. Advanced spatial analytics
  2. Spatial Analytics for BigQuery
  3. Step-by-step tutorials

Using raster and vector data to calculate total rooftop PV potential in the US

Last updated 12 months ago

Was this helpful?

In spatial analytics, two main data types are used: raster and vector. Combining these two data formats can provide a comprehensive and powerful solution for various analyses. A common use case for combining raster and vector data in geospatial analysis is land use and land cover mapping. The raster data, such as satellite imagery, provides a detailed view of the Earth's surface, while vector data, such as GIS polygon boundaries, provides information on administrative and political units. By overlaying these two data types, a complete picture of land use and land cover can be generated, allowing for in-depth analysis and decision making in areas such as urban planning, natural resource management, real estate, and insurance.

In this example, you will learn how to easily combine raster and vector data using the of the Analytics Toolbox. In particular, the following use case explains all the steps required to compute the total rooftop photovoltaic power (PV) potential in the United States in less than 3 minutes by combining:

  • raster data for PV power potential from the , and

  • vector data for the building boundaries from OSM publicly available in BigQuery: `bigquery-public-data.geo_openstreetmap.planet_features_multipolygons`

Step 1. Find your raster data and download it

Note that storing your data in cloud storage might be more convenient for you than managing the raster data on your local computer.

Step 2. Re-project the raster to a quadbin grid

gdalwarp ./PVOUT.tif  -of COG -co TILING_SCHEME=GoogleMapsCompatible -co COMPRESS=DEFLATE ./PVOUT.quadbin.tif

Note that this re-projection is not required but is highly recommended because the current beta version of the raster module in the Analytics Toolbox is optimized for quadbin grids and our support for generic raster is still in a very experimental phase.

Step 3. Install CARTO's Raster Loader

pip install raster-loader

Step 4. Upload raster to BigQuery

carto bigquery upload --file_path PVOUT.quadbin.tif --project <my-bigquery-project> --dataset <my-bigquery-dataset> --table PVOUT_USA --output_quadbin

A new table `<my-bigquery-project>.<my-bigquery-dataset>.PVOUT_USA` is created in BigQuery containing the quadbin raster data in a compacted format.

Step 5. Compute the PV power potential of every building in the US

  • The qualified name of the table with the raster data: <my-bigquery-project>.<my-bigquery-dataset>.PVOUT_USA

  • The qualified name of the table with the building geometries (vector data): carto-demo-data.demo_tables.osm_buildings_usa

  • The name of the output table: <my-bigquery-project>.<my-bigquery-dataset>.USA_buildings_PVOUT_enriched

CALL
    `carto-un`.carto.RASTER_ST_GETVALUE_FROM_TABLE(
        ‘<my-bigquery-project>.<my-bigquery-dataset>.PVOUT_USA’,
        ‘carto-demo-data.demo_tables.osm_buildings_usa’,
        NULL,
        ‘<my-bigquery-project>.<my-bigquery-dataset>.USA_buildings_PVOUT_enriched’
        );
CALL
    `carto-un-eu`.carto.RASTER_ST_GETVALUE_FROM_TABLE(
        ‘<my-bigquery-project>.<my-bigquery-dataset>.PVOUT_USA’,
        ‘carto-demo-data.demo_tables.osm_buildings_usa’,
        NULL,
        ‘<my-bigquery-project>.<my-bigquery-dataset>.USA_buildings_PVOUT_enriched’
    );
CALL
    carto.RASTER_ST_GETVALUE_FROM_TABLE(
        ‘<my-bigquery-project>.<my-bigquery-dataset>.PVOUT_USA’,
        ‘carto-demo-data.demo_tables.osm_buildings_usa’,
        NULL,
        ‘<my-bigquery-project>.<my-bigquery-dataset>.USA_buildings_PVOUT_enriched’
    );

Note that we made US building data publicly available through table `carto-demo-data.demo_tables.osm_buildings_usa` so users don't need to process the entire world's data.

The new table will contain, for every building boundary (geog) its corresponding PV power potential (band_1_float32).

Step 6. Calculate the total rooftop PV potential in the US

Finally, we can calculate the total rooftop PV potential in the US with a simple query using the previously enriched table:

SELECT  SUM(ST_AREA(geog)* band_1_float32) avg_daily_pv_pp_usa_buildings
FROM  `<my-bigquery-project>.<my-bigquery-dataset>.USA_buildings_PVOUT_enriched`

First, we need to find the raster containing the data of interest. We download the GIS data file that contains solar resource (GHI, DNI, DIF, GTI, OPTA), PV power potential (PVOUT) and other parameters in raster format.

Next, we will re-project the raster data to a quadbin grid. For this, we use on the previously unzipped raster file PVOUT.tif. gdalwapr is an image re-projection and warping utility. We just need to run the following:

We are now ready to upload the raster data to BigQuery using , a Python package for loading GIS raster data to standard cloud-based data warehouses that don’t natively support raster data. This package can be easily installed via pip

Once installed, we can proceed to upload it to BigQuery through :

Note that this package can be also used as a Python library that you can .

With the raster data already in BigQuery, we can now assign to every building in the US its corresponding PV power potential using the procedure. We only need to pass:

Custom options (see ).

LTAy_AvgDailyTotals
gdalwarp
CARTO’s Raster Loader
the carto command-line interface (CLI)
import and use in your Python projects
RASTER_ST_GETVALUE_FROM_TABLE
reference
raster module
Global Solar Atlas
Advanced difficulty banner