Publications

2025

  1. How Do LLM-Generated Texts Impact Term-Based Retrieval Models?
    Wei Huang, Keping Bi, Yinqiong Cai, Wei Chen, Jiafeng Guo, and Xueqi Cheng
    arXiv preprint arXiv:2508.17715, 2025
    WSDM
  2. Injecting External Knowledge into the Reasoning Process Enhances Retrieval-Augmented Generation
    Minghao Tang, Shiyu Ni, Jiafeng Guo, and Keping Bi
    arXiv preprint arXiv:2507.19333, 2025
    SIGIR-AP
  3. Distilling a Small Utility-Based Passage Selector to Enhance Retrieval-Augmented Generation
    Hengran Zhang, Keping Bi, Jiafeng Guo, Jiaming Zhang, Shuaiqiang Wang, Dawei Yin, and Xueqi Cheng
    arXiv preprint arXiv:2507.19102, 2025
    SIGIR-AP
  4. A Comparative Study of Specialized LLMs as Dense Retrievers
    Hengran Zhang, Keping Bi, and Jiafeng Guo
    arXiv preprint arXiv:2507.03958, 2025
    CCIR
  5. LifeIR at the NTCIR-18 Lifelog-6 Task
    Jiahan Chen, Da Li, and Keping Bi
    arXiv preprint arXiv:2505.20987, 2025
    NTCIR
  6. Bridging Queries and Tables through Entities in Table Retrieval
    Da Li, Keping Bi, Jiafeng Guo, and Xueqi Cheng
    arXiv preprint arXiv:2504.06551, 2025
    CIKM
  7. Tailoring Table Retrieval from a Field-aware Hybrid Matching Perspective
    Da Li, Keping Bi, Jiafeng Guo, and Xueqi Cheng
    arXiv preprint arXiv:2503.02251, 2025
    EMNLP
  8. Leveraging LLMs for Utility-Focused Annotation: Reducing Manual Effort for Retrieval and RAG
    Hengran Zhang, Minghao Tang, Keping Bi, Jiafeng Guo, Shihao Liu, Daiting Shi, Dawei Yin, and Xueqi Cheng
    arXiv preprint arXiv:2504.05220, 2025
    EMNLP
  9. Do LVLMs Know What They Know? A Systematic Study of Knowledge Boundary Perception in LVLMs
    Zhikai Ding, Shiyu Ni, and Keping Bi
    arXiv preprint arXiv:2508.19111, 2025
    EMNLP (Findings)
  10. Unbiased Learning to Rank with Query-Level Click Propensity Estimation: Beyond Pointwise Observation and Relevance
    Lulu Yu, Keping Bi, Jiafeng Guo, Shihao Liu, Dawei Yin, and Xueqi Cheng
    In Companion Proceedings of the ACM on Web Conference 2025, 2025
    WebConf (Short)
  11. CLIPure: Purification in latent space via CLIP for adversarially robust zero-shot classification
    Mingkun Zhang, Keping Bi, Wei Chen, Jiafeng Guo, and Xueqi Cheng
    arXiv preprint arXiv:2502.18176, 2025
    ICLR
  12. Towards fully exploiting LLM internal states to enhance knowledge boundary perception
    Shiyu Ni, Keping Bi, Jiafeng Guo, Lulu Yu, Baolong Bi, and Xueqi Cheng
    arXiv preprint arXiv:2502.11677, 2025
    ACL
  13. Evaluating implicit bias in large language models by attacking from a psychometric perspective
    Yuchen Wen, Keping Bi, Wei Chen, Jiafeng Guo, and Xueqi Cheng
    arXiv preprint arXiv:2406.14023, 2025
    ACL (Findings)
  14. CAME: Competitively learning a mixture-of-experts model for first-stage retrieval
    Jiafeng Guo, Yinqiong Cai, Keping Bi, Yixing Fan, Wei Chen, Ruqing Zhang, and Xueqi Cheng
    ACM Transactions on Information Systems, 2025
    TOIS

2024

  1. CausalDiff: Causality-inspired disentanglement via diffusion model for adversarial defense
    Mingkun Zhang, Keping Bi, Wei Chen, Quanrun Chen, Jiafeng Guo, and Xueqi Cheng
    Advances in Neural Information Processing Systems, 2024
    NeurIPS
  2. Are Large Language Models More Honest in Their Probabilistic or Verbalized Confidence?
    Shiyu Ni, Keping Bi, Lulu Yu, and Jiafeng Guo
    In China Conference on Information Retrieval, 2024
    CCIR
  3. LINKAGE: Listwise ranking among varied-quality references for non-factoid QA evaluation via LLMs
    Sihui Yang, Keping Bi, Wanqing Cui, Jiafeng Guo, and Xueqi Cheng
    arXiv preprint arXiv:2409.14744, 2024
    EMNLP (Findings)
  4. Reproducibility Analysis and Enhancements for Multi-aspect Dense Retriever with Aspect Learning
    Keping Bi, Xiaojie Sun, Jiafeng Guo, and Xueqi Cheng
    In European Conference on Information Retrieval, 2024
    ECIR
  5. A Multi-Granularity-Aware Aspect Learning Model for Multi-Aspect Dense Retrieval
    Xiaojie Sun, Keping Bi, Jiafeng Guo, Sihui Yang, Qishen Zhang, Zhongyi Liu, Guannan Zhang, and Xueqi Cheng
    In Proceedings of the 17th ACM International Conference on Web Search and Data Mining, 2024
    WSDM
  6. MORE: Multi-mOdal REtrieval augmented generative commonsense reasoning
    Wanqing Cui, Keping Bi, Jiafeng Guo, and Xueqi Cheng
    arXiv preprint arXiv:2402.13625, 2024
    ACL (Findings)
  7. When Do LLMs Need Retrieval Augmentation? Mitigating LLMs’ Overconfidence Helps Retrieval Augmentation
    Shiyu Ni, Keping Bi, Jiafeng Guo, and Xueqi Cheng
    arXiv preprint arXiv:2402.11457, 2024
    ACL (Findings)

2023

  1. A comparative study of training objectives for clarification facet generation
    Shiyu Ni, Keping Bi, Jiafeng Guo, and Xueqi Cheng
    In Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region, 2023
    SIGIR-AP
  2. Pre-training with aspect-content text mutual prediction for multi-aspect dense retrieval
    Xiaojie Sun, Keping Bi, Jiafeng Guo, Xinyu Ma, Yixing Fan, Hongyu Shan, Qishen Zhang, and Zhongyi Liu
    In Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023
    CIKM
  3. L2R: Lifelong learning for first-stage retrieval with backward-compatible representations
    Yinqiong Cai, Keping Bi, Yixing Fan, Jiafeng Guo, Wei Chen, and Xueqi Cheng
    In Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023
    CIKM
  4. CIR at the NTCIR-17 ULTRE-2 Task
    Lulu Yu, Keping Bi, Jiafeng Guo, and Xueqi Cheng
    arXiv preprint arXiv:2310.11852, 2023
    NTCIR
  5. Ensemble Ranking Model with Multiple Pretraining Strategies for Web Search
    Xiaojie Sun, Lulu Yu, Yiting Wang, Keping Bi, and Jiafeng Guo
    arXiv preprint arXiv:2302.09340, 2023
    WSDM
  6. Feature-enhanced network with hybrid debiasing strategies for unbiased learning to rank
    Lulu Yu, Yiting Wang, Xiaojie Sun, Keping Bi, and Jiafeng Guo
    arXiv preprint arXiv:2302.07530, 2023
    WSDM