Selected Publications

KathDB: Explainable Multimodal Database Management System with Human-AI Collaboration
Guorui Xiao, Enhao Zhang, Nicole Sullivan, Will Hansen, and Magdalena Balazinska
To Appear, CIDR 2026

CENTS: A Flexible and Cost-Effective Framework for LLM-Based Table Understanding
Guorui Xiao, Done He, Jin Wang, Magdalena Balazinska
VLDB 2025

RACOON+: System for LLM-based Table Understanding with a Knowledge Graph
Linxi Wei, Guorui Xiao, Moe Kayali, Dan Suciu, and Magdalena Balazinsk
Under Review, VLDB 2026

RACOON: An LLM-based Framework for Retrieval-Augmented Column Type Annotation with a Knowledge Graph
Linxi Wei, Guorui Xiao, Magdalena Balazinska
NeurIPS 2024, Table Representation Learning

Revealing Protocol Architecture’s Design Patterns in the Volumetric DDoS Defense Design Space
Zhiyi Zhang, Guorui Xiao, Sichen Song, R. Can Aygun, Angelos Stavrou, Lixia Zhang
IEEE Communications Surveys and Tutorials 2024

Highly Efficient String Similarity Search and Join over Compressed Indexes
Guorui Xiao, Jin Wang, Chunbin Lin, Carlo Zaniolo
ICDE 2022

RaSQL: A Powerful Language and its System for Big Data Applications
Jin Wang, Guorui Xiao, Jiaqi Gu, Jiacheng Wu, Carlo Zaniolo
SIGMOD 2020

Manuscript

ReLiShare: Reliable Leaker Identification in Sensitive Dataset Sharing

RS-SQL: A Query Language for Supporting Recursive Query Processing over Data Streams