PersonalAlign: Hierarchical Implicit Intent Alignment for Personalized GUI Agent with Long-Term User-Centric Records
Harbin Institute of Technology, Shenzhen
*Corresponding author
Towards building more intelligent personalized agents for aligning with user's implicit intents.
⭐ Give us a star if you like it! ⭐
✨If you find this work useful for your research, please kindly cite our paper.✨
This is the official repository for PersonalAlign: Hierarchical Implicit Intent Alignment for Personalized GUI Agent with Long-Term User-Centric Records. In this work, we aim to build a more intelligent personalized GUI Agent capable of learning and matching implicit user intents from long-term interaction histories.
To this end, we constructed a new GUI benchmark, AndroidIntent. From 20k trajectories of different users over two months of mobile phone usage, we annotated 775 preference intents with 7,915 GUI actions and 215 routine intents for evaluation.
# Create and activate conda environment
conda create -n personal python=3.10 -y
conda activate personal
git clone https://github.com/JiuTian-VL/PersonalAlign.git
cd PersonalAlign
pip install -r requirements.txtFirst, please download the Fingertip 20k from Kaggle and extract it into the ./data directory.
Evaluate Agent Personalized Execution
sh scripts/execution/eval_qwen3vl.shEvaluate Agent Proactive Suggestion
sh scripts/proactive/eval_proactive.shEvaluate Agent Memory (HIM-Agent)
sh scripts/HIM_agent/eval_execution.sh
sh scripts/HIM_agent/eval_proactive.shNote: To prevent the agent from returning non-standard JSON that might interfere with API calls, we inference-first and then evaluation. Specifically, results are saved and then calculated using scripts/proactive/calculate.sh
To support PersonalAlign, agent memory should generalize stable representations to exclude one-off moments while separating preferences and routines, and continuously evolve to stay aligned with user intents. We introduce HIM-Agent, a foundational and inspirational personal agent memory that enables GUI agents to rapidly leverage long-term records as context for personalization without interfering with original execution. We construct a streaming update memory and hierarchically organize memory prototypes into Preference Intent Memory and Routine Intent Memory through the execution-based and state-based filter to enable hierarchical intent alignment.
We attempt to generalize our method into reusable skills, aiming to make it applicable beyond GUI scenarios to broader agent settings. We have implemented a preliminary skills-beta version, and the detailed design can be found in README-skills.
In migrating from the GUI setting to a more general agent scenario, we introduced several adaptations, while retaining the core components: daily-update, preference_memory, and routine_memory.
Many thanks to the contribution from FingerTip. When training models or using the dataset for personal agent, please strictly respect and protect user privacy and data security.
We sincerely apologize for the delay in open-sourcing. A server disk failure caused significant data loss, and we have tried our best to restore this repo. If PersonalAlign helps your research, please kindly consider giving us a Star🌟 to support us!
If you find this work useful for your research, please also kindly cite our paper.
@article{lyu2026personalalign,
title={PersonalAlign: Hierarchical Implicit Intent Alignment for Personalized GUI Agent with Long-Term User-Centric Records},
author={Lyu, Yibo and Chen, Gongwei and Shao, Rui and Guan, Weili and Nie, Liqiang},
journal={arXiv preprint arXiv:2601.09636},
year={2026}
}
