Publications
Selected Publications
KathDB: Explainable Multimodal Database Management System with Human-AI Collaboration
Guorui Xiao, Enhao Zhang, Nicole Sullivan, Will Hansen, and Magdalena Balazinska
To Appear, CIDR 2026
CENTS: A Flexible and Cost-Effective Framework for LLM-Based Table Understanding
Guorui Xiao, Done He, Jin Wang, Magdalena Balazinska
VLDB 2025
RACOON+: System for LLM-based Table Understanding with a Knowledge Graph
Linxi Wei, Guorui Xiao, Moe Kayali, Dan Suciu, and Magdalena Balazinsk
Under Review, VLDB 2026
RACOON: An LLM-based Framework for Retrieval-Augmented Column Type Annotation with a Knowledge Graph
Linxi Wei, Guorui Xiao, Magdalena Balazinska
NeurIPS 2024, Table Representation Learning
Revealing Protocol Architecture’s Design Patterns in the Volumetric DDoS Defense Design Space
Zhiyi Zhang, Guorui Xiao, Sichen Song, R. Can Aygun, Angelos Stavrou, Lixia Zhang
IEEE Communications Surveys and Tutorials 2024
Highly Efficient String Similarity Search and Join over Compressed Indexes
Guorui Xiao, Jin Wang, Chunbin Lin, Carlo Zaniolo
ICDE 2022
RaSQL: A Powerful Language and its System for Big Data Applications
Jin Wang, Guorui Xiao, Jiaqi Gu, Jiacheng Wu, Carlo Zaniolo
SIGMOD 2020
Manuscript
ReLiShare: Reliable Leaker Identification in Sensitive Dataset Sharing
RS-SQL: A Query Language for Supporting Recursive Query Processing over Data Streams