Publications

Preprints

  • Cognify: The Automated Optimizer for Generative AI Workflows
    Zijian He*, Reyna Abhyankar*, Vikranth Srivatsa, Yiying Zhang
    Preprint, February 2025
    arXiv:2502.08056

Conference Papers

  • Preble: Efficient Distributed Prompt Scheduling for LLM Serving
    Vikranth Srivatsa*, Zijian He*, Reyna Abhyankar, Dongming Li, Yiying Zhang
    International Conference on Learning Representations (ICLR), 2025

  • InferCept: Efficient Intercept Support for Augmented Large Language Model Inference
    Reyna Abhyankar*, Zijian He*, Vikranth Srivatsa, Hao Zhang, Yiying Zhang
    International Conference on Machine Learning (ICML), 2024