Xavier: Toward Better Coding Assistance in Authoring Tabular Data Wrangling Scripts

  • Yunfan Zhou ,
  • Xiwen Cai ,
  • Qiming Shi ,
  • Yanwei Huang ,
  • ,
  • Huamin Qu ,
  • Di Weng ,
  • Yingcai Wu

CHI 2025 |

Data analysts frequently employ code completion tools in writing custom scripts to tackle complex tabular data wrangling tasks. However, existing tools do not sufficiently link the data contexts such as schemas and values with the code being edited. This not only leads to poor code suggestions, but also frequent interruptions in coding processes as users need additional code to locate and understand relevant data. We introduce Xavier, a tool designed to enhance data wrangling script authoring in computational notebooks. Xavier maintains users’ awareness of data contexts while providing data-aware code suggestions. It automatically highlights the most relevant data based on the user’s code, integrates both code and data contexts for more accurate suggestions, and instantly previews data transformation results for easy verification. To evaluate the effectiveness and usability of Xavier, we conducted a user study with 16 data analysts, showing its potential to streamline data wrangling scripts authoring.