Skip to main content

Envisioning is an emerging technology research institute and advisory.

LinkedInInstagramGitHub

2011 — 2026

research
  • Reports
  • Newsletter
  • Methodology
  • Origins
  • Vocab
services
  • Research Sessions
  • Signals Workspace
  • Bespoke Projects
  • Use Cases
  • Signal Scanfree
  • Readinessfree
impact
  • ANBIMAFuture of Brazilian Capital Markets
  • IEEECharting the Energy Transition
  • Horizon 2045Future of Human and Planetary Security
  • WKOTechnology Scanning for Austria
audiences
  • Innovation
  • Strategy
  • Consultants
  • Foresight
  • Associations
  • Governments
resources
  • Pricing
  • Partners
  • How We Work
  • Data Visualization
  • Multi-Model Method
  • FAQ
  • Security & Privacy
about
  • Manifesto
  • Community
  • Events
  • Support
  • Contact
  • Login
ResearchServicesPricingPartnersAbout
ResearchServicesPricingPartnersAbout
  1. Home
  2. Research
  3. Cities
  4. Synthetic Data

Synthetic Data

Artificially generated datasets that mimic real urban data patterns while protecting individual privacy
Back to CitiesView interactive version

As cities become increasingly digitised, the demand for vast amounts of data to drive innovation, improve public services, and enhance urban planning has never been greater. However, the collection and use of real-world data raise significant privacy concerns, as well as ethical and logistical challenges. This is where synthetic data emerges as a critical solution. Synthetic data refers to artificially generated data that mimics the characteristics of real data without compromising personal privacy or sensitive information. It provides a pathway to maintaining data utility while sidestepping the risks associated with handling real-world datasets.

Synthetic data is generated using advanced machine learning models that analyse patterns, structures, and relationships within existing datasets. These models then produce new data points that statistically resemble the original data but do not correspond to real individuals or events. The technology ensures that the synthetic data retains the essential qualities needed for analysis and decision-making, while simultaneously eliminating the potential for privacy breaches. This is particularly valuable in urban contexts, where data derived from sensors, cameras, and other sources can be rich in detail and highly sensitive.

As urban areas continue to evolve into smart cities, relying on extensive data for functions ranging from traffic management to public safety, synthetic data offers a secure and scalable way to train AI systems, simulate urban scenarios, and conduct research without exposing real individuals to privacy risks. Furthermore, synthetic data allows for the testing of new technologies and policies in a risk-free environment, fostering innovation without unintended consequences.

In essence, synthetic data not only solves the pressing issue of data privacy but also propels the development of urban technologies forward. By enabling safer, more ethical data practices, synthetic data supports the creation of smarter, more resilient cities that can adapt to the needs of their inhabitants while safeguarding their privacy.

Technology Readiness Level
8/9Ready for Implementation
Diffusion of Innovation
3/5Early Majority
Technology Life Cycle
2/4Growth
Category
Software

Related Organizations

Replica logo
Replica

United States · Company

98%

A data platform that models the built environment and human movement patterns to help public agencies make informed decisions.

Developer
Parallel Domain logo
Parallel Domain

United States · Startup

92%

Synthetic data generation platform for autonomous systems.

Developer
Mostly AI logo
Mostly AI

Austria · Company

90%

Pioneers in AI-generated synthetic data for enterprise and insurance.

Developer
NVIDIA logo
NVIDIA

United States · Company

90%

Developing foundation models for robotics (Project GR00T) and vision-language models like VILA.

Developer
Gretel.ai logo
Gretel.ai

United States · Startup

88%

Privacy engineering platform offering synthetic data generation APIs.

Developer
Immense Simulations

United Kingdom · Startup

85%

Mobility simulation platform.

Developer
PTV Group logo
PTV Group

Germany · Company

85%

Software company for traffic planning and transport logistics.

Developer
Hazy logo
Hazy

United Kingdom · Company

80%

Synthetic data platform for enterprise.

Developer

Supporting Evidence

Paper

A Comprehensive Evaluation Framework for Synthetic Trip Data Generation in Public Transport

arXiv · Sep 16, 2025

Proposes a Representativeness-Privacy-Utility (RPU) framework to evaluate synthetic trip data for public transport. The study benchmarks twelve generation methods to address privacy and accessibility challenges in smart card data usage.

Support 92%Confidence 95%

Article

Synthetic Mobility Data: The Future of Smart Urban Planning

SocietyByte · May 2, 2025

Discusses how synthetic mobility data offers a solution for smart cities to harness movement patterns for urban planning and traffic optimization while strictly protecting individual privacy.

Support 92%Confidence 95%

Paper

Practical Applications of Synthetic Data Generation

Statistics Canada · Jul 29, 2025

Discusses the use of synthetic data generation for privacy-preserving data sharing and de-biasing in official statistics. It presents case studies and evidence of performance across various use cases.

Support 82%Confidence 90%

Article

Synthetic Data Is About To Transform Artificial Intelligence

forbes.com

Imagine if it were possible to produce infinite amounts of the world’s most valuable resource, cheaply and quickly. What dramatic economic transformations and opportunities would result?

Support 50%Confidence 80%

Article

Why Synthetic Data Is Key To Paving the Way for Smart Cities

spiceworks.com

With the increasing need for operational efficiency and process automation across the board, from small-scale projects to the magnitude of smart cities, the demands for data continue to become more exacting. Steve Harris, CEO of Mindtech, explains how achieving them with real-world data is challenging, thus launching synthetic data into the spotlight to address these shortfalls. The potential for uniting synthetic and real-world data in the pursuit of smart cities is highly promising for the future of smarter AI.

Support 50%Confidence 80%

Article

Governing Smart Cities: Use Cases for Urban Transformation | World Economy Forum

www3.weforum.org

Many smart cities still lack the basic governance and policies needed to ensure the adoption of technology in a responsible and ethical way. This matters because cities are accelerating the use of digital tools to access real-time intelligence and target interventions to save lives. But such technology brings privacy, equity, accessibility and cybersecurity risks. The G20 Global Smart Cities Alliance has helped cities leapfrog these challenges by publishing seven model policies in high-priority areas. Since developing model policies is just the first step, the alliance has also developed regional networks in Japan, South-East Asia, India and Latin America to support city leaders in implementing and tailoring these policies to suit the local context. This report profiles three pilot programmes supported by the alliance that tackle three different policy challenges in Mexico City, Tsukuba and Istanbul.

Support 50%Confidence 80%

Article

Top 10 Emerging Technologies of 2024 | World Economic Forum

weforum.org

The Top 10 Emerging Technologies report is a vital source of strategic intelligence. First published in 2011, it draws on insights from scientists, researchers and futurists to identify 10 technologies poised to significantly influence societies and economies. These emerging technologiesare disruptive, attractive to investors and researchers, and expected to achieve considerable scale within five years. This edition expands its analysis by involving over 300 experts from the Forum’s Global Future Councils and a global network of comprising over 2,000 chief editors worldwide from top institutions through Frontiers, a leading publisher of academic research.

Support 50%Confidence 80%

Article

Synthetic data use: exploring use cases to optimise data utility

link.springer.com

Synthetic data is a rapidly evolving field with growing interest from multiple industry stakeholders and European bodies. In particular, the pharmaceutical industry is starting to realise the value of synthetic data which is being utilised more prevalently as a method to optimise data utility and sharing, ultimately as an innovative response to the growing demand for improved privacy. Synthetic data is data generated by simulation, based upon and mirroring properties of an original dataset. Here, with supporting viewpoints from across the pharmaceutical industry, we set out to explore use cases for synthetic data across seven key but relatable areas for optimising data utility for improved data privacy and protection. We also discuss the various methods which can be used to produce a synthetic dataset and availability of metrics to ensure robust quality of generated synthetic datasets. Lastly, we discuss the potential merits, challenges and future direction of synthetic data within the pharmaceutical industry and the considerations for this privacy enhancing technology.

Support 50%Confidence 80%

Article

Why Synthetic Data helps with Big Data Privacy (Part 4/5)

youtube.com

Synthetic Data allows organizations to innovate with their valuable big data assets without putting their customers' privacy at risk. In the fourth part of our mini video series on Synthetic Data we will cover how exactly Synthetic Data enables privacy-friendly innovation.

Support 50%Confidence 80%

Article

What Is Synthetic Data?

blogs.nvidia.com

Synthetic data generated from computer simulations or algorithms provides an inexpensive alternative to real-world data that’s increasingly used to create accurate AI models.

Support 50%Confidence 80%

Article

Is Synthetic Data the Future of AI?

gartner.com

Synthetic data is often treated as a lower-quality substitute and used when real data is inconvenient to get, expensive or constrained by regulation. However, this reaction misses the true potential of synthetic data. Gartner estimates that by 2030, synthetic data will completely overshadow real data in AI models.

Support 50%Confidence 80%

Article

Report investigates how synthetic data can be used in government

adruk.org

In a report published today, the Behavioural Insights Team (BIT) summarised their investigation into the uses of synthetic data in government. They also discussed the different types, benefits, and challenges of creating and using synthetic data, making recommendations for ADR UK, government and stakeholders outside of government.

Support 50%Confidence 80%

Article

On synthetic data: a brief introduction for data protection law dummies

europeanlawblog.eu

Synthetic data is attracting increasing attention from technicians and legal scholars in recent years. This is especially noticeable among entities and people working on data-driven technologies, particularly in the artificial intelligence application development and testing sector, where sheer volumes of data are needed. In these circles, synthetic data has become a growing trend under the “fake it till you make it” concept by promising to alleviate existing data access and analytics challenges while respecting data protection rules. Given the rising prospects and acceptance of data synthesis, there is a need to assess the legal implications of its generation and use, the starting point being the legal qualification of synthetic data.

Support 50%Confidence 80%

Same technology in other hubs

DataTrends
DataTrends
Synthetic Data for Privacy-Preserving Analytics

Artificial datasets that mimic real data patterns without exposing individual identities

Connections

Software
Software
Generative AI

AI systems that generate optimized urban design scenarios for sustainability, efficiency, and resilience

Technology Readiness Level
7/9
Diffusion of Innovation
2/5
Technology Life Cycle
2/4
Ethics & Security
Ethics & Security
Ethical AI

AI systems designed with transparency, accountability, and fairness to align with ethical standards and regulations

Technology Readiness Level
5/9
Diffusion of Innovation
2/5
Technology Life Cycle
1/4

Book a research session

Bring this signal into a focused decision sprint with analyst-led framing and synthesis.
Research Sessions