Interpretable Future Traffic Scene Generation for Urban Digital Twins
Project Description
This project aims to build a prototype system that transforms future traffic states into interpretable scene-level representations for urban digital twins. It is ideal for final-year students who are interested in combining spatio-temporal data management, digital twin systems, and AI-based scene generation to build a practical system that helps users better understand future traffic conditions.
Ideal Candidate:
This project is ideal for students who enjoy building practical systems and want to work on an interdisciplinary problem involving traffic, simulation, and AI-based scene generation. You should be comfortable with Python programming and willing to work with structured data, simple geometric representations, and prototype system development. Experience with visualization, simulation tools, or deep learning frameworks is helpful but not required.
Skills Required:
Programming: Python, C++
Interest in traffic simulation, digital twins, or generative AI
Willingness to learn scene representation and interactive demo development
Ideal Candidate:
This project is ideal for students who enjoy building practical systems and want to work on an interdisciplinary problem involving traffic, simulation, and AI-based scene generation. You should be comfortable with Python programming and willing to work with structured data, simple geometric representations, and prototype system development. Experience with visualization, simulation tools, or deep learning frameworks is helpful but not required.
Skills Required:
Programming: Python, C++
Interest in traffic simulation, digital twins, or generative AI
Willingness to learn scene representation and interactive demo development
Supervisor
ZHOU, Xiaofang
Quota
2
Course type
UROP1000
UROP1100
UROP2100
UROP3100
UROP3200
UROP4100
Applicant's Roles
This project aims to build a demo system that shows what future traffic may look like in an urban digital twin world. For traffic systems, future conditions can already be estimated by deep learning models. However, these models usually produce summarized results, such as average traffic conditions over the next few minutes or colored congestion levels on a map (e.g., red refers to heavy congestion). Although these outputs are useful for analysis, they do not match the kind of digital twin experience we expect, where people can directly see how vehicles may move and interact in a visible future world.
To address this limitation, researchers have started to explore macroscopic traffic simulation algorithm based on future vehicle route information [1,2]. Compared with summarized prediction results, these methods can provide more detailed and interpretable future traffic states. For example, they can estimate how many vehicles will be on a road at a specific future time, how many may be waiting near an intersection, and how traffic may be distributed across a road network. However, these results are still mainly numerical or structural. They are closer to future traffic itself, but still do not directly show what the future traffic scene may actually look like in a digital twin system.
Inspired by recent AI video generation tools [3,4], they have shown strong ability in creating realistic road scenes and videos from structured inputs, such as road map, vehicles' positions, and other scene descriptions. However, these tools are mainly designed for visual generation. They are not directly connected to real traffic estimation systems or urban digital twin applications.
Based on this background, the goal of this project is to connect future traffic estimation with visual generation tools and build a complete demo pipeline for urban digital twins. Students will build on an existing traffic simulation framework that estimates future traffic conditions from vehicle future route information, and then design a presentation system that converts these estimated traffic states into inputs that video generation tools can understand. These inputs may include road layouts, approximate vehicle locations, and simple vehicle boxes, which can then be used by rendering tools or existing AI video generation tools to produce images or short videos.
Through this project, students will have the chance to learn about traffic simulation, future traffic estimation, scene design, and AI-based visual generation in an accessible and practical way. They will also gain experience in building a full system prototype and presenting it as an interactive demo. If the project goes well, there may also be an opportunity to further develop it into a demo paper for a top conference.
To address this limitation, researchers have started to explore macroscopic traffic simulation algorithm based on future vehicle route information [1,2]. Compared with summarized prediction results, these methods can provide more detailed and interpretable future traffic states. For example, they can estimate how many vehicles will be on a road at a specific future time, how many may be waiting near an intersection, and how traffic may be distributed across a road network. However, these results are still mainly numerical or structural. They are closer to future traffic itself, but still do not directly show what the future traffic scene may actually look like in a digital twin system.
Inspired by recent AI video generation tools [3,4], they have shown strong ability in creating realistic road scenes and videos from structured inputs, such as road map, vehicles' positions, and other scene descriptions. However, these tools are mainly designed for visual generation. They are not directly connected to real traffic estimation systems or urban digital twin applications.
Based on this background, the goal of this project is to connect future traffic estimation with visual generation tools and build a complete demo pipeline for urban digital twins. Students will build on an existing traffic simulation framework that estimates future traffic conditions from vehicle future route information, and then design a presentation system that converts these estimated traffic states into inputs that video generation tools can understand. These inputs may include road layouts, approximate vehicle locations, and simple vehicle boxes, which can then be used by rendering tools or existing AI video generation tools to produce images or short videos.
Through this project, students will have the chance to learn about traffic simulation, future traffic estimation, scene design, and AI-based visual generation in an accessible and practical way. They will also gain experience in building a full system prototype and presenting it as an interactive demo. If the project goes well, there may also be an opportunity to further develop it into a demo paper for a top conference.
Applicant's Learning Objectives
To learn the existing traffic prediction framework and macroscopic traffic simulation algorithm, and AI-based image/video generation tools.
To design a module that converts simulated future traffic states into structured inputs for visual generation.
To build a digital twin demo system that connects future traffic estimation with image or video generation.
To design a module that converts simulated future traffic states into structured inputs for visual generation.
To build a digital twin demo system that connects future traffic estimation with image or video generation.
Complexity of the project
Moderate