Ruiying Ma

I'm a first-year CS PhD student in the EPIC Data Lab at UC Berkeley, advised by Prof. Aditya Parameswaran. I received a B.Eng. in CS from Tsinghua University's Yao Class in June 2025. Prior to that, I visited UC Berkeley as an undergraduate student researcher. I was also a research intern in the Systems and Networking Research Group at Microsoft Research Asia, where I was fortunate to work with Dr. Chieh-Jan Mike Liang, Prof. Francis Y. Yan, and Yanjie Gao.

Email / Google Scholar / GitHub / LinkedIn

Research

I am interested in agentic data systems. My current work focuses on studying and building agents for accurate and scalable data processing, as well as designing algorithms that enable efficient and robust unstructured data processing.

Agentic Data Processing: Benchmarking and designing data agents for realistic, complex, data-centric tasks.
Unstructured Document Analytics: Classifying and extracting hierarchical structures from unstructured yet hierarchically organized documents for intelligent analytics.
Automated Systems Algorithm Design: Exploring general-purpose LLM-based frameworks for automated and creative algorithm design in systems.

Publications

MetaMuse: Algorithm Generation via Creative Ideation
Ruiying Ma, Chieh-Jan Mike Liang, Yanjie Gao, Francis Y. Yan
ICLR, 2026
paper / code

Querying Templatized Document Collections with Large Language Models (ZenDB)
Yiming Lin, Madelon Hulsebos, Ruiying Ma, Shreya Shankar, Sepanta Zeighami, Aditya G. Parameswaran, Eugene Wu
ICDE, 2025
paper

Source code from Jon Barron's website.