How to Incorporate Mesh Blocks into Datasets


Mesh Blocks in real estate and proptech applications

Mesh Blocks are useful for geospatial and proptech applications, providing granularity and accuracy for understanding local real estate markets, demographics and land use.

The integration of Mesh Blocks into datasets can enhance the precision and relevance of analyses within the proptech and real estate sectors.

Embedding Mesh Blocks into digital boundaries also makes those boundaries more usable in geospatial and census analyses.

We will cover the steps to incorporate mesh blocks into data sets below.

What are Mesh Blocks and how are they used in real estate?

Mesh Blocks are the smallest geographic areas defined by the Australian Bureau of Statistics (ABS). They are foundational building blocks for geospatial and proptech applications, providing granularity and accuracy for understanding local real estate markets, demographics and land use.

How to incorporate Mesh Blocks into datasets

Incorporating Mesh Blocks into datasets involves several steps to ensure seamless integration and effective utilisation of geographical information. Here’s a guide on how to incorporate Mesh Blocks into datasets:

Step 1: Data Collection

Gather relevant data that aligns with Mesh Blocks.

This may include demographic information, property values, land use details, or any other dataset that can be associated with specific geographical areas.

 

Step 2: Download Mesh Block Boundaries

Mesh Block boundary files can be downloaded from authoritative sources, such as the Australian Bureau of Statistics (ABS) or relevant statistical agencies.

For ease, The Proptech Cloud has a free comprehensive dataset Geography – Boundaries & Insights – Australia ready for access and immediate use.

Geography – Boundaries & Insights – Australia

This free dataset from The Proptech Cloud is available for seamless access from Snowflake Marketplace.

Step 3: Geospatial Data Processing

Use Geographic Information System (GIS) software or programming libraries (e.g., Python with geospatial libraries like GeoPandas) to process and manipulate the mesh block boundaries.

Tip:

Geographical boundaries can be imported using Python libraries including GeoPandas and Shapely.

Many data warehouses including Snowflake, BigQuery and PostgreSQL have in-built geospatial functionality to allow for the processing of geospatial data.

QGIS – Loading in Geospatial files in QGIS

1. From the toolbar at the top of the page click Layer > Add Layer > Add Vector Layer

2. Make sure the Source Type is set to File

3. Load in the Source Data by using the three dots button at the side of the Vector Dataset(s) toolbar


Geospatial Formats

The two most common file representations of geospatial data are Well-Known Text (WKT), a textual representation of a geometry such as a polygon, and GeoJSON, a JSON format that records the geometry type and its coordinates.

Both Python and Snowflake can parse these two formats so they can be used in geography functions.
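To make the relationship between the two formats concrete, here is a minimal sketch that converts a GeoJSON Polygon into a WKT string using only the Python standard library. The function name `polygon_geojson_to_wkt` is hypothetical, and the sketch assumes simple Polygon geometries with longitude/latitude positions; in practice a library such as Shapely would normally handle this conversion.

```python
def polygon_geojson_to_wkt(geometry: dict) -> str:
    """Convert a GeoJSON Polygon geometry (a dict) into a WKT string."""
    if geometry.get("type") != "Polygon":
        raise ValueError("this sketch only handles Polygon geometries")
    rings = []
    for ring in geometry["coordinates"]:
        # Each GeoJSON position is [longitude, latitude]; WKT separates
        # the two values with a space and positions with a comma.
        points = ", ".join(f"{pos[0]} {pos[1]}" for pos in ring)
        rings.append(f"({points})")
    return f"POLYGON ({', '.join(rings)})"


brisbane_geojson = {
    "type": "Polygon",
    "coordinates": [[
        [153.012021, -27.471741],
        [153.012021, -27.462598],
        [153.032931, -27.462598],
        [153.032931, -27.471741],
        [153.012021, -27.471741],
    ]],
}

brisbane_wkt = polygon_geojson_to_wkt(brisbane_geojson)
print(brisbane_wkt)
```

The same bounding box is used in the WKT and GeoJSON examples in this guide, so the printed string can be compared against them directly.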

WKT Format

# Example using WKT format
from shapely import wkt

# WKT string for a bounding box in Brisbane (longitude latitude pairs)
brisbane_bbox = "POLYGON ((153.012021 -27.471741, 153.012021 -27.462598, 153.032931 -27.462598, 153.032931 -27.471741, 153.012021 -27.471741))"

# Parse the WKT string into a shapely Polygon
brisbane_poly = wkt.loads(brisbane_bbox)

Python – Loading in GeoJSON

The libraries geojson and shapely need to be installed; json is part of the Python standard library.

# Example working with the GeoJSON format
import json

import geojson
from shapely.geometry import shape

geojson_example = {
    "coordinates": [[[153.01202116, -27.47174129], [153.01202116, -27.46259798], [153.03293092, -27.46259798], [153.03293092, -27.47174129], [153.01202116, -27.47174129]]],
    "type": "Polygon"
}

# Serialise the dict to a GeoJSON string
geojson_json = json.dumps(geojson_example)

# Convert to geojson.geometry.Polygon
geojson_poly = geojson.loads(geojson_json)

# Convert to a shapely Polygon
poly = shape(geojson_poly)

Snowflake

GeoJSON and WKT can also be loaded into Snowflake and converted to a geography using the following commands:

-- Converting Well-Known Text into geography format
SELECT ST_GEOGRAPHYFROMWKT('POLYGON ((153.012021 -27.471741, 153.012021 -27.462598, 153.032931 -27.462598, 153.032931 -27.471741, 153.012021 -27.471741))');

-- Converting GeoJSON to geography format
SELECT TO_GEOGRAPHY('{
"coordinates": [[[153.01202116, -27.47174129], [153.01202116, -27.46259798], [153.03293092, -27.46259798], [153.03293092, -27.47174129], [153.01202116, -27.47174129]]],
"type": "Polygon"
}');

Step 4: Data Matching

Match the dataset records with the appropriate mesh blocks based on their geographical coordinates. This involves linking each data point to the corresponding mesh block within which it is located.

Tip:

Geospatial functions which are supported in big data warehouses and Python can be used to match geospatial data.

A common way to match two geographical objects is to see if the coordinates of the two objects intersect. An example of how to do this in Python and Snowflake is shown below.
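To illustrate what linking a data point to the mesh block it falls within means in practice, here is a minimal sketch of a point-in-polygon test (ray casting) written with the standard library only. The mesh block codes `MB_A` and `MB_B`, the property records and the function names are hypothetical, and the test treats longitude and latitude as flat coordinates, which is a simplification that is reasonable only for small areas. Library and warehouse functions remain the better choice for real workloads.

```python
def point_in_ring(lon, lat, ring):
    """Ray-casting test: True if (lon, lat) falls inside the closed ring."""
    inside = False
    for (x1, y1), (x2, y2) in zip(ring, ring[1:]):
        # Only edges that straddle the point's latitude can cross a
        # horizontal ray running east from the point.
        if (y1 > lat) != (y2 > lat):
            x_cross = x1 + (lat - y1) * (x2 - x1) / (y2 - y1)
            if lon < x_cross:
                inside = not inside
    return inside


def match_point(lon, lat, blocks):
    """Return the code of the first block containing the point, else None."""
    for code, ring in blocks.items():
        if point_in_ring(lon, lat, ring):
            return code
    return None


# Hypothetical mesh blocks: two adjacent rectangles (closed rings).
mesh_blocks = {
    "MB_A": [(153.012021, -27.471741), (153.012021, -27.462598),
             (153.032931, -27.462598), (153.032931, -27.471741),
             (153.012021, -27.471741)],
    "MB_B": [(153.032931, -27.471741), (153.032931, -27.462598),
             (153.05, -27.462598), (153.05, -27.471741),
             (153.032931, -27.471741)],
}

# Hypothetical property records with coordinates.
properties = [
    {"id": 1, "lon": 153.02, "lat": -27.465},
    {"id": 2, "lon": 153.04, "lat": -27.47},
    {"id": 3, "lon": 153.10, "lat": -27.47},
]

matches = {p["id"]: match_point(p["lon"], p["lat"], mesh_blocks) for p in properties}
print(matches)
```

Record 3 lies outside both rectangles, so it is left unmatched with a value of None rather than being forced into the nearest block.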

In Python

Data matching can be done using the intersects method in the shapely library.

from shapely import wkt

# Bounding box in Brisbane
shape1 = wkt.loads("POLYGON ((153.012021 -27.471741, 153.012021 -27.462598, 153.032931 -27.462598, 153.032931 -27.471741, 153.012021 -27.471741))")

# Triangle inside the box (a polygon ring needs at least four points, with the first repeated at the end)
shape2 = wkt.loads("POLYGON ((153.012021 -27.471741, 153.022021 -27.465, 153.032931 -27.462598, 153.012021 -27.471741))")

# True when the two shapes share at least one point
shape_int = shape1.intersects(shape2)

print(shape_int)

 

In Snowflake

Data matching can be done using the ST_INTERSECTS function. One advantage of matching data geospatially in a big data warehouse such as Snowflake is that the work runs on highly scalable infrastructure, so geospatial processing completes quickly.

WITH geog_1 as (

SELECT ST_GEOGRAPHYFROMWKT('POLYGON ((153.012021 -27.471741, 153.012021 -27.462598, 153.032931 -27.462598, 153.032931 -27.471741, 153.012021 -27.471741))') as poly

),

geog_2 as (

SELECT ST_GEOGRAPHYFROMWKT('POLYGON ((153.012021 -27.471741, 153.022021 -27.465, 153.032931 -27.462598, 153.012021 -27.471741))') as poly

)

SELECT

g1.poly, g2.poly

FROM geog_1 as g1

INNER JOIN geog_2 as g2

on ST_INTERSECTS(g1.poly, g2.poly)

Step 5: Attribute Joining

If your dataset and mesh blocks data have common attributes (e.g., unique identifiers), perform attribute joins to combine information from both datasets. This allows you to enrich your dataset with additional details associated with mesh blocks.
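As a rough illustration of an attribute join, the sketch below enriches sales records with mesh block attributes using a shared identifier. The field name `mb_code`, the attribute names and all values are hypothetical; the join keeps every sales record even when no mesh block attributes are found, which mirrors a left join.

```python
# Hypothetical mesh block attributes keyed by a unique identifier.
mesh_block_attributes = {
    "MB_A": {"category": "Residential", "dwellings": 120},
    "MB_B": {"category": "Commercial", "dwellings": 4},
}

# Hypothetical dataset records carrying the same identifier.
sales = [
    {"sale_id": 101, "mb_code": "MB_A", "price": 850000},
    {"sale_id": 102, "mb_code": "MB_B", "price": 1200000},
    {"sale_id": 103, "mb_code": "MB_Z", "price": 640000},
]


def left_join(records, lookup, key):
    """Add lookup attributes to each record; keep records with no match."""
    joined = []
    for record in records:
        extra = lookup.get(record[key], {})
        joined.append({**record, **extra})
    return joined


enriched = left_join(sales, mesh_block_attributes, "mb_code")
for row in enriched:
    print(row)
```

Keeping unmatched records, rather than dropping them, makes gaps in the mesh block data visible at the quality assurance stage.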

Step 6: Quality Assurance

Verify the accuracy of the spatial integration by checking for any discrepancies or errors. Ensure that each data point is correctly associated with the corresponding mesh block.
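One simple check, sketched below under assumed inputs, is to count how many mesh blocks each record was matched to. A record with no match may sit outside the boundary files, and a record with more than one match may sit on a shared boundary. The record IDs, mesh block codes and the `check_matches` helper are hypothetical.

```python
from collections import Counter

# Hypothetical output of a spatial match: (record_id, mesh block code) pairs.
match_pairs = [(1, "MB_A"), (2, "MB_B"), (2, "MB_C"), (4, "MB_A")]

# Every record that was submitted for matching.
all_record_ids = [1, 2, 3, 4]


def check_matches(record_ids, pairs):
    """Report records matched to zero mesh blocks or to more than one."""
    counts = Counter(record_id for record_id, _ in pairs)
    unmatched = [r for r in record_ids if counts[r] == 0]
    ambiguous = [r for r in record_ids if counts[r] > 1]
    return {"unmatched": unmatched, "ambiguous": ambiguous}


report = check_matches(all_record_ids, match_pairs)
print(report)
```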

Tip:

geojson.io is a handy website that can help with visualising geojson data and ensure it is correct.

If you’re using Snowflake, the ST_ASGEOJSON function can be used to convert a geography into GeoJSON, which allows you to quickly visualise the shapes created.
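For visual checks, shapes are easier to inspect when wrapped in a GeoJSON FeatureCollection, which tools such as geojson.io accept as pasted text. The sketch below is one way to build that text with the standard library; the `mb_code` property and the `to_feature_collection` helper are hypothetical names.

```python
import json


def to_feature_collection(shapes):
    """Wrap {code: GeoJSON geometry} pairs in a GeoJSON FeatureCollection."""
    features = [
        {"type": "Feature", "properties": {"mb_code": code}, "geometry": geometry}
        for code, geometry in shapes.items()
    ]
    return {"type": "FeatureCollection", "features": features}


# Hypothetical mesh block geometry in GeoJSON form.
shapes = {
    "MB_A": {
        "type": "Polygon",
        "coordinates": [[
            [153.012021, -27.471741],
            [153.012021, -27.462598],
            [153.032931, -27.462598],
            [153.032931, -27.471741],
            [153.012021, -27.471741],
        ]],
    },
}

collection = to_feature_collection(shapes)
geojson_text = json.dumps(collection, indent=2)
print(geojson_text)  # this text can be pasted into a GeoJSON viewer
```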

Step 7: Data Analysis and Visualisation

Leverage the integrated dataset for analysis and visualisation. Explore trends, patterns, and relationships within the data at the mesh block level. Utilise geospatial tools to create maps and visual representations of the information.
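As a small example of analysis at the mesh block level, the sketch below groups matched sales by mesh block and calculates a median price for each. The records, the `mb_code` field and the prices are illustrative only.

```python
from collections import defaultdict
from statistics import median

# Hypothetical sales already matched to mesh blocks.
sales = [
    {"mb_code": "MB_A", "price": 800000},
    {"mb_code": "MB_A", "price": 900000},
    {"mb_code": "MB_A", "price": 1000000},
    {"mb_code": "MB_B", "price": 500000},
    {"mb_code": "MB_B", "price": 700000},
]


def median_price_by_mesh_block(records):
    """Group prices by mesh block code and return the median of each group."""
    grouped = defaultdict(list)
    for record in records:
        grouped[record["mb_code"]].append(record["price"])
    return {code: median(prices) for code, prices in grouped.items()}


medians = median_price_by_mesh_block(sales)
print(medians)
```

A per-mesh-block summary like this can then be joined back to the boundaries to colour a map.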

Tip:

Snowflake has the option to create a Streamlit app within the Snowflake UI, which allows data to be cleaned and processed using Python and SQL and visualised interactively through the Streamlit app.

Read our blog which demonstrates how to predict migration patterns and create forecasts using Snowpark and Streamlit.

Snowflake also integrates well with local Python development environments, so all the initial data processing and cleaning can be done through a Snowflake API, after which geography can be converted to GeoJSON or text format. Libraries such as plotly, folium and pydeck can then be used to build complex geospatial visualisations.

Step 8: Data Storage and Management

Establish a system for storing and managing the integrated dataset, ensuring that it remains up-to-date as new data becomes available.

Consider using databases or platforms that support geospatial data.

Tip:

Geospatial datasets are usually very large and complex due to the number of attributes they include, the resolution of the data and the number of records.

Cloud-based big data platforms can be an excellent option for storing geospatial data due to the low cost of storage. Many of these platforms also have spatial clustering options so that geospatial data in a similar location are grouped together, meaning queries for data in certain areas run more efficiently.

Snowflake (Enterprise Edition or Higher) also has an option to add a search optimisation to geospatial data tables to optimise the performance of queries that use geospatial functions.

Step 9: Documentation

Document the integration process, including the source of mesh block boundaries, any transformations applied, and the methods used for data matching. This documentation is essential for transparency and replicability.
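One lightweight way to capture these details is a small machine-readable record stored alongside the integrated dataset. The sketch below is an assumption about what such a record might contain; the field names, the example values and the run date are all illustrative.

```python
import json
from datetime import date


def build_provenance(boundary_source, transformations, matching_method, run_date):
    """Collect the key integration details into one serialisable record."""
    return {
        "boundary_source": boundary_source,
        "transformations": list(transformations),
        "matching_method": matching_method,
        "run_date": run_date.isoformat(),
    }


# Illustrative values only.
record = build_provenance(
    boundary_source="ABS Mesh Block boundary files",
    transformations=["converted GeoJSON to WKT"],
    matching_method="point-in-polygon intersection",
    run_date=date(2024, 1, 31),
)

print(json.dumps(record, indent=2))
```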

By following the steps above, you can effectively incorporate mesh blocks into your datasets, enabling a more detailed and location-specific analysis of the information at the mesh block level.

 


All rights are reserved, and no content may be republished or reproduced without express written permission from Data Army and The Proptech Cloud. All content provided is for informational purposes only. While we strive to ensure that the information provided here is both factual and accurate, we make no representations or warranties of any kind about the completeness, accuracy, reliability, suitability, or availability with respect to the blog or the information, products, services, or related graphics contained on the blog for any purpose.


Read more from The Proptech Cloud

What’s The Difference Between Proptech and Contech?

Proptech and Contech is driving innovation across industries. We uncover the nuances and distinctions between Proptech and Contech here.

What is a Coordinate Reference System (CRS)?

We break down the concept of Coordinate reference systems (CRS) and discuss the different types, their purpose and uses in the world of real estate.

The Property Lifecycle

Discover the extensive property lifecycle and how it connects with other sectors. This infographic highlights the key phases and processes involved.

Why Is It So Difficult To Parse Addresses?

This blog explores what address parsing is and why it presents such unique challenges. Discover the intricacies behind address parsing and why getting it right is more complicated than it first appears.

How To Set Coordinate Reference Systems (CRS) In Snowflake Using Spatial Reference Identifiers

When working with geospatial data and mapping, you often need to specify the CRS for accurate and consistent spatial referencing and calculations. We guide you how.

Australia’s Migration Trends: Where Are People Moving To?


Data consultancy, Data Army, delved into the Australia Post Movers Statistics dataset to understand where people are migrating to within Australia and predicting where they’re likely to move to next.

We explain the data visualisations created and documented in our previous blog, How to Predict Migration Patterns using Auspost Movers Statistics Data and Snowflake’s Cortex ML functions.

By analysing movements and observing trends, we’re able to gather valuable insights to inspire decisions with data-driven intelligence.

TL;DR

Australia’s migration trends reveal significant shifts in population movement, driven by economic opportunities, lifestyle choices, and government policies.

Analysis and data visualisations by Data Army reveal some interesting differences between states.

Key insights include the rising popularity of regional areas, the impact of international students, and the influence of skilled migration programs. These trends are shaping the future of Australia’s demographic landscape.

How to Interpret the Data Visualisations

The visualisations below show the net migration metric for all Australian states.

Net migration is the forecast number of people moving into an area minus the forecast number of people moving out of it.

White, grey and lighter blue colours indicate regions with lower net migration, where many people are leaving and fewer are relocating in.

Mid to darker blue colours represent regions with higher net migration, where the number of people moving in outweighs the number leaving.
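The net migration metric described above can be sketched in a few lines of Python. The article applies it to forecast values; the same arithmetic works on any counts of people moving between regions. The region labels, field names and figures below are illustrative only.

```python
from collections import Counter

# Illustrative moves between three hypothetical regions.
moves = [
    {"from_region": "A", "to_region": "B", "people": 120},
    {"from_region": "B", "to_region": "A", "people": 80},
    {"from_region": "A", "to_region": "C", "people": 50},
    {"from_region": "C", "to_region": "B", "people": 30},
]


def net_migration(records):
    """Net migration per region: people moving in minus people moving out."""
    net = Counter()
    for move in records:
        net[move["to_region"]] += move["people"]
        net[move["from_region"]] -= move["people"]
    return dict(net)


net = net_migration(moves)
print(net)
```

Region A ends with a negative value because more people leave it than arrive, so it would sit at the lighter end of the colour scale.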

The Results by State

 

New South Wales (NSW)

In NSW, prior to the pandemic, the light blue shading shows there was some movement in inner-city areas including Chatswood, the Sydney Central Business District (CBD) and areas just west of the city.

However, there was a much higher level of migration into the areas much further west of the city including Penrith and Blaxland, as well as Newcastle.

The trend of moving away from the city has further increased since the COVID pandemic in NSW, where areas very close to the city show the lowest forecasted net migration in the state.

This indicates that people are moving away from the city. Possible explanations include rising rents, or that people no longer need to live within metropolitan areas for work.

In NSW, rural areas south of the city close to Canberra such as Goulburn, and rural areas north of Newcastle such as Taree are the regions with the highest amount of forecasted net migration as shown by the visualisations below.

NSW Pre-COVID
NSW Post-COVID

Figure 1: Pre- and post-COVID migration per SA4 for New South Wales

Victoria

A similar trend can be observed in Victoria. Both pre- and post-pandemic, the areas which had the lowest net migration were the inner city Melbourne suburbs of Brunswick, North Melbourne and Fitzroy.

However, prior to the pandemic, the areas with the highest forecasted net migration were Geelong and the south coast of Melbourne.

Post-COVID, the areas with the highest forecasted migration are even further away, possibly indicating these coastal areas are now also less desirable or unaffordable.

These include rural areas such as Warragul and Taree. Greenfield suburbs just north of the city, including Sunbury, also have high levels of forecasted migration.

VIC Pre-COVID
VIC Post-COVID

Figure 2: Pre- and post-COVID migration per SA4 for Victoria

Queensland

Like Melbourne, the inner-city part of Brisbane has relatively similar net migration forecasts both pre- and post-pandemic.

The inner city areas have low levels of forecasted net migration.

Interestingly, the highest amount of forecasted migration in Queensland prior to the pandemic was on the Gold Coast, which is only approximately an hour from the Brisbane CBD.

Post-pandemic, areas further west of Brisbane, including Ipswich and Harrisville, have higher levels of forecasted migration.

This could be indicative of people from Queensland relocating, but could also suggest people from interstate or overseas moving from other locations to places west of the city.

There is also a high level of migration predicted for the Sunshine Coast post-pandemic, further highlighting the trend also observed in Sydney and Melbourne of people moving into more rural areas.

QLD Pre-COVID
QLD Post-COVID

Figure 3: Pre- and post-COVID migration per SA4 for Queensland

South Australia

Unlike NSW, Victoria and Queensland, South Australia is one of the few states where the highest forecasted pre-pandemic net migration was in an inner-city area.

However, the trend to relocate to rural areas was very high post-pandemic. Rural areas including Kangaroo Island, Murray Bridge and Clare had much higher forecasted net migration after the pandemic. This supports the trend observed in the other states.

SA Pre-COVID
SA Post-COVID

Figure 4: Pre- and post-COVID migration per SA4 for South Australia

Western Australia

Western Australia is one of the few states where the forecasted net migration into rural areas is not high.

The pre-COVID migration forecasts indicate the highest level of net migration was in the Perth City area, while post-COVID the highest net migration was just south of the city.

One possible reason for this could be that while Perth house prices and rents have been rising, they are still much lower than in Sydney or Melbourne, so it is still affordable for people to live close to the city.

Secondly, as mining is the predominant industry in Western Australia, it is possible that it is not feasible for many of these workers to move and work remotely.

WA Pre-COVID
WA Post-COVID

Figure 5: Pre- and post-COVID migration per SA4 for Western Australia

Tasmania

Tasmania is the only Australian state where forecasted net migration into the inner city is higher post-COVID than pre-COVID.

Prior to COVID, Hobart had the lowest net migration of all regions in Tasmania. However, post-COVID the amount of net migration in the CBD is higher, indicating people are moving into Hobart.

Similarly, the amount of forecasted migration into Launceston, Tasmania’s second biggest city, is higher post-COVID as compared to pre-COVID.

The reason that the same rural migration has not been seen in Tasmania, unlike other states, could be because of Tasmania’s population.

Hobart’s population is only approximately 250,000 which is smaller than rural areas that people were migrating to including the Sunshine Coast.

Thus, the high rental and accommodation costs that are evident in highly populated cities, including Sydney or Melbourne may not be evident in Tasmania.

TAS Pre-COVID
TAS Post-COVID

Figure 6: Pre- and post-COVID migration per SA4 for Tasmania

Summary of Findings

In the post-COVID period there is strong evidence of people migrating to rural areas, especially in states with larger CBDs such as New South Wales, Victoria and Queensland.

Interestingly, in these states, people seemed to be migrating to outer-city areas even prior to the pandemic.

This may suggest that there were factors encouraging people to move out of the city. This trend seems to have increased further since COVID.

Overall, there is a clear trend in the two most populated states, New South Wales and Victoria for net migration into rural areas.

These were the two states that were most affected by COVID lockdowns in Australia and have the highest house prices in the country, which may be one of the key drivers behind the high level of relocation to rural areas.

Less populated states including South Australia and Queensland have experienced a similar trend with high levels of net migration to rural areas including Kangaroo Island, Clare and the Sunshine Coast.

The only states that haven’t experienced net migration to rural areas are Western Australia and Tasmania.

A Reflection of Australia’s Housing Situation

Australia is in the midst of a housing crisis where steep house prices prevent many first-home buyers from entering the market, especially in inner-city areas.

Driven by the low supply of rentals and high post-pandemic migration, rents continue to skyrocket in many metropolitan cities.

In Australia’s most populous metropolitan areas, Sydney and Melbourne, rents rose by 10.2% and 11.1% respectively from December 2022 to December 2023.

Since 2020, the COVID pandemic has transformed the workplace, with lockdowns and government restrictions requiring some office workers to do their jobs remotely.

While there are initiatives to reverse this, some office workers continue to work remotely at least part of the week, meaning that when choosing a place to live, they may not need to prioritise being within a reasonable commuting distance from their physical office.

The combination of unaffordable rents and mortgages in inner city areas and increase in work from home trends may have contributed to many Australians migrating to outer-city and rural locations.

Strategic Insights

The findings hold significant strategic value for both the private and public sectors.

Incorporating these insights alongside additional data points, such as overseas migration into Australia, enriches the analysis, providing a more comprehensive understanding of migration patterns.

This broader perspective can enhance strategic planning and decision-making processes across various industries and governmental levels.

Examples include real estate development, investment, business expansion, transportation and infrastructure decisions, as well as urban, land use, policy or even healthcare and public services planning.

These findings can offer a foundation for both private and public sectors to adapt to changing demographic patterns in a way that maximises economic opportunities while ensuring community well-being and sustainability.

About The Analysis

Data Army used the Australia Post Movers Statistics dataset as the basis for forecasting migration patterns during and after the COVID pandemic in each Australian state.

The primary dataset used in this study is the Australia Post Movers Statistics. It contains de-identified and aggregated data on moves across Australia based on mail redirection requests from the previous 5 years.

For this exercise, data from February 2019 to January 2024 was used.

Each entry in the data includes

  • the postcode the household relocated from,
  • the postcode the household relocated to,
  • the month of relocation, and
  • the number of people that relocated.

This analysis compares the migration trends forecast for the following year when pre-pandemic data is used (Feb 2019 – Jan 2020) with forecasts based on mail redirection requests in the post-pandemic era (2022-2024).

The analysis was conducted at Statistical Area Level 4 (SA4). SA4s are regions defined by the Australian Bureau of Statistics (ABS) that clearly distinguish inner-city, outer-city and rural areas.
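Because the source data is recorded by postcode while the analysis is reported by SA4, the records need to be rolled up from one geography to the other. The sketch below shows one possible way to do that, assuming a postcode-to-SA4 lookup is available. The lookup, the postcode and SA4 labels, the field names and the figures are all hypothetical, and a real lookup may map one postcode to several SA4s.

```python
from collections import defaultdict

# Hypothetical lookup; real postcodes can span more than one SA4.
postcode_to_sa4 = {"P1": "SA4_INNER", "P2": "SA4_INNER", "P3": "SA4_RURAL"}

# Hypothetical records shaped like the entries described above.
moves = [
    {"from_postcode": "P1", "to_postcode": "P3", "month": "2023-01", "people": 40},
    {"from_postcode": "P2", "to_postcode": "P3", "month": "2023-01", "people": 10},
    {"from_postcode": "P3", "to_postcode": "P1", "month": "2023-02", "people": 5},
    {"from_postcode": "P1", "to_postcode": "P2", "month": "2023-02", "people": 7},
]


def sa4_flows(records, lookup):
    """Total people moving between SA4s, ignoring moves within one SA4."""
    flows = defaultdict(int)
    for record in records:
        origin = lookup[record["from_postcode"]]
        destination = lookup[record["to_postcode"]]
        # A move inside a single SA4 does not change that SA4's net migration.
        if origin != destination:
            flows[(origin, destination)] += record["people"]
    return dict(flows)


flows = sa4_flows(moves, postcode_to_sa4)
print(flows)
```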

For a step-by-step guide, see our blog on How to Predict Migration Patterns using Auspost Movers Statistics Data and Snowflake’s Cortex ML functions.

Australia Post Movers Statistics Data

This dataset contains five years of de-identified, aggregated information on past moves captured by Australia Post’s Mail Redirection service.

Access Australia Post mail redirect statistics now to help you develop competitive data-driven strategies.

All rights are reserved, and no content may be republished or reproduced without express written permission from The Proptech Cloud. All content provided is for informational purposes only. While we strive to ensure that the information provided here is both factual and accurate, we make no representations or warranties of any kind about the completeness, accuracy, reliability, suitability, or availability with respect to the blog or the information, products, services, or related graphics contained on the blog for any purpose.
