Research
I am interested in agentic data systems.
My current work focuses on studying and building agents for accurate, scalable data processing, and on designing algorithms for efficient, robust processing of unstructured data.
➤ Agentic Data Processing: Benchmarking and designing data agents for realistic, complex, data-centric tasks.
➤ Unstructured Document Analytics: Classifying and extracting hierarchical structures from unstructured yet hierarchically organized documents for intelligent analytics.
➤ Automated System Algorithm Design: Exploring general-purpose LLM-based frameworks for automated and creative algorithm design in systems.
Publications
MetaMuse: Algorithm Generation via Creative Ideation
Ruiying Ma, Chieh-Jan Mike Liang, Yanjie Gao, Francis Y. Yan
ICLR, 2026
paper / code
Querying Templatized Document Collections with Large Language Models (ZenDB)
Yiming Lin, Madelon Hulsebos, Ruiying Ma, Shreya Shankar, Sepanta Zeighami, Aditya G. Parameswaran, Eugene Wu
ICDE, 2025
paper