Holistic Semantic Representation for Navigational Trajectory Generation

AAAI 2025


Ji Cao1, Tongya Zheng2,3,*, Qinghong Guo1, Yu Wang1, Junshu Dai1,
Shunyu Liu4, Jie Yang1, Jie Song1, Mingli Song3,5

1Zhejiang University
2Big Graph Center, Hangzhou City University
3State Key Laboratory of Blockchain and Data Security, Zhejiang University
4Nanyang Technological Univerisity
5Hangzhou High-Tech Zone (Binjiang) Institute of Blockchain and Data Security

* denotes the corresponding author

Abstract


Trajectory generation has garnered significant attention from researchers in the field of spatio-temporal analysis, as it can generate substantial synthesized human mobility trajectories that enhance user privacy and alleviate data scarcity. However, existing trajectory generation methods often focus on improving trajectory generation quality from a singular perspective, lacking a comprehensive semantic understanding across various scales. Consequently, we are inspired to develop a HOlistic SEmantic Representation (HOSER) framework for navigational trajectory generation. Given an origin-and-destination (OD) pair and the starting time point of a latent trajectory, we first propose a Road Network Encoder to expand the receptive field of road- and zone-level semantics. Second, we design a Multi-Granularity Trajectory Encoder to integrate the spatio-temporal semantics of the generated trajectory at both the point and trajectory levels. Finally, we employ a Destination-Oriented Navigator to seamlessly integrate destination-oriented guidance. Extensive experiments on three real-world datasets demonstrate that HOSER outperforms state-of-the-art baselines by a significant margin. Moreover, the model's performance in few-shot learning and zero-shot learning scenarios further verifies the effectiveness of our holistic semantic representation.

Framework Overview


Framework Image

HOSER predicts the next spatio-temporal point based on the current state and generates the trajectory between the given OD pair through a search-based method. As illustrated above, HOSER first employs a Road Network Encoder to model the road network at different levels. Based on the road network representation, a Multi-Granularity Trajectory Encoder is proposed to extract the semantic information from the current partial trajectory. To better incorporate prior knowledge of human mobility, a Destination-Oriented Navigator is used to seamlessly integrate the current partial trajectory semantics with the destination guidance.

Heatmap Visualization of Generated and Real Trajectories


Ours trajectories heatmap in Beijing

(a) Ours

Real trajectories heatmap in Beijing

(b) Real

Beijing

Ours trajectories heatmap in Porto

(a) Ours

Real trajectories heatmap in Porto

(b) Real

Porto

Ours trajectories heatmap in San Francisco

(a) Ours

Real trajectories heatmap in San Francisco

(b) Real

San Francisco

Visualization of Generated and Real Trajectories with Identical OD Pairs


(a) Ours

(b) Real

Beijing

(a) Ours

(b) Real

Porto

(a) Ours

(b) Real

San Francisco

Citation


@inproceedings{cao2025hoser,
  title={Holistic Semantic Representation for Navigational Trajectory Generation},
  author={Cao, Ji and Zheng, Tongya and Guo, Qinghong and Wang, Yu and Dai, Junshu and Liu, Shunyu and Yang, Jie and Song, Jie and Song, Mingli},
  booktitle={AAAI Conference on Artificial Intelligence (AAAI)},
  year={2025},
}

Acknowledgments


This work is supported by the Zhejiang Province "JianBingLingYan+X" Research and Development Plan (2024C01114), Zhejiang Province High-Level Talents Special Support Program "Leading Talent of Technological Innovation of Ten-Thousands Talents Program" (No.2022R52046), the Fundamental Research Funds for the Central Universities (No.226-2024-00058), and the Scientific Research Fund of Zhejiang Provincial Education Department (Grant No.Y202457035). Also, we thank Bayou Tech (Hong Kong) Limited for providing the data used in this paper free of charge.