Publications
Preprints
- Cognify: The Automated Optimizer for Generative AI Workflows
Zijian He*, Reyna Abhyankar*, Vikranth Srivatsa, Yiying Zhang
Preprint, February 2024
arXiv:2502.08056
Conference Papers
Preble: Efficient Distributed Prompt Scheduling for LLM Serving
Vikranth Srivatsa*, Zijian He*, Reyna Abhyankar, Dongming Li, Yiying Zhang
International Conference on Learning Representations (ICLR), 2025
ICLR 2025InferCept: Efficient Intercept Support for Augmented Large-Language Model Inferencing
Reyna Abhyankar*, Zijian He*, Vikranth Srivatsa, Hao Zhang, Yiying Zhang
International Conference on Machine Learning (ICML), 2024
ICML 2024