SIGIR Workshop on Large Language Model for Evaluation in IR, LLM4Eval
Distributed and Interactive Systems

Zhang, W., Aliannejadi, M., Pei, J., Yuan, Y., Huang, J.-H., & Kanoulas, E. (2024). A comparative analysis of faithfulness metrics and humans in citation evaluation. In Proceedings of the LLM4EVAL Workshop at ACM SIGIR.