A ML-based Query Expansion in Vertical Aggregated Web Search Using Pseudo-Relevance Feedback
Keywords:
Graph Theory, Query Expansion, Pseudo-Relevance Feedback, Machine Learning, Multimedia Web Search, Vertical Web SearchAbstract
The massive increase in multimedia content over the internet has posed a significant challenge to vertical search engines in instances where users submit short, ambiguous, or vocabulary-mismatched queries. Current query expansion techniques, including Pseudo-Relevance Feedback (PRF), show limited effectiveness in addressing these challenges, particularly in multimedia and cross-vertical search settings. The developed hybrid query expansion method described in this paper combines PRF, Machine Learning (ML), and Graph Theory to enhance the relevance of search results and enhance the semantic relevance of search results. The methodology consists of applying PRF to extend the first query with the help of relevant terms and then using a machine learning model to narrow the terms according to past search behavior and semantic trends. Also, graph-based techniques are utilized to determine semantic relationships among query terms, thereby improving contextual relevance of the query. The framework identifies semantically related terms and refines the query to improve retrieval relevance and adjusts the search, so the results are more accurate and relevant. This experiment used the TREC Web Track collection, which consists of 50 queries, 25,000 documents, and some artificially generated multimedia sources. The empirical findings show that there are great improvements in essential performance measures. Specifically, the given approach yields a 36.2% increase in MAP, 28.5% in nDCG@10, and 32.6% in P@10, as compared to BM25. In addition, a 30-user study has shown a 40% reduction in the query formulation time and a 25-30 percent increase in user effectiveness. The results indicate better accuracy of retrieval, lower query effort, and increased user satisfaction, especially for novice users involved in exploratory search activities. Additionally, issues of scalability, data privacy, and the computation cost are presented in the real-world application.
References
“Modern Information Retrieval: A Brief Overview.” Accessed: Mar. 11, 2026. [Online]. Available: https://research.google/pubs/modern-information-retrieval-a-brief-overview/
Hiteshwar Kumar Azad, Akshay Deepak, “Query expansion techniques for information retrieval: A survey,” Inf. Process. Manag., vol. 56, no. 5, pp. 1698–1735, 2019, doi: https://doi.org/10.1016/j.ipm.2019.05.009.
Arlind Kopliku, Karen Pinel-Sauvagnat, “Aggregated search: A new information retrieval paradigm,” ACM Comput. Surv., vol. 46, no. 3, 2014, [Online]. Available: https://dl.acm.org/doi/10.1145/2523817
Sanae Achsas, El Habib Nfaoui, “An Analysis Study of Vertical Selection Task in Aggregated Search,” Procedia Comput. Sci., vol. 148, pp. 171–180, 2019, doi: https://doi.org/10.1016/j.procs.2019.01.021.
FrommholzIngo, CabanacGuillaume, “Report on the 11th bibliometric-enhanced information retrieval workshop (BIR 2021),” ACM SIGIR Forum, vol. 55, no. 1, 2021, [Online]. Available: https://dl.acm.org/doi/abs/10.1145/3476415.3476426
S. Ibrihich, A. Oussous, “A Review on recent research in information retrieval,” Procedia Comput. Sci., vol. 201, pp. 777–782, 2022, doi: https://doi.org/10.1016/j.procs.2022.03.106.
Abdul Majeed, Ibtisam Rauf, “Graph Theory: A Comprehensive Survey about Graph Theory Applications in Computer Science and Social Networks,” Inventions, vol. 5, no. 1, p. 10, 2020, doi: https://doi.org/10.3390/inventions5010010.
Milad Momeni, “Visualization-Enhanced Aggregated Search Interfaces,” CHIIR 2024 - Proc. 2024 Conf. Hum. Inf. Interact. Retr., 2024, [Online]. Available: https://dl.acm.org/doi/10.1145/3627508.3638336
A. R. Khan and U. Rashid, “A Relational Aggregated Disjoint Multimedia Search Results Approach using Semantics,” 2021 Int. Conf. Artif. Intell. ICAI 2021, pp. 62–67, Apr. 2021, doi: 10.1109/ICAI52203.2021.9445229.
J. Ooi, X. Ma, H. Qin, and S. C. Liew, “A survey of query expansion, query suggestion and query refinement techniques,” 2015 4th Int. Conf. Softw. Eng. Comput. Syst. ICSECS 2015 Virtuous Softw. Solut. Big Data, pp. 112–117, Nov. 2015, doi: 10.1109/ICSECS.2015.7333094.
Wei Li, Rishi Choudhary, “RCES: Rapid Cues Exploratory Search Using Taxonomies For COVID-19,” Int. Conf. Inf. Knowl. Manag. Proc., vol. 10, 2021, [Online]. Available: https://dl.acm.org/doi/10.1145/3459637.3481990
Hiteshwar Kumar Azad, Akshay Deepak, “A new approach for query expansion using Wikipedia and WordNet,” Inf. Sci. (Ny)., 2019, [Online]. Available: https://arxiv.org/abs/1901.10197
Mohamed Reda Bouadjenek, Hakim Hacid, “Social networks and information retrieval, how are they converging? A survey, a taxonomy and an analysis of social information retrieval approaches and platforms,” Inf. Syst., vol. 56, pp. 1–18, 2016, doi: ttps://doi.org/10.1016/j.is.2015.07.008.
“(PDF) Modern Information Retrieval.” Accessed: Mar. 11, 2026. [Online]. Available: https://www.researchgate.net/publication/2352627_Modern_Information_Retrieval
D. K. Sharma, R. Pamula, and D. S. Chauhan, “Semantic approaches for query expansion,” Evol. Intell. 2021 142, vol. 14, no. 2, pp. 1101–1116, Mar. 2021, doi: 10.1007/s12065-020-00554-x.
Hiteshwar Kumar Azad, Akshay Deepak, “Improving query expansion using pseudo-relevant web knowledge for information retrieval,” Pattern Recognit. Lett., vol. 158, pp. 148–156, 2022, doi: https://doi.org/10.1016/j.patrec.2022.04.013.
A. Garba, S. Khalid, and I. Ullah, “Understanding the impact of query expansion on federated search,” Multimed. Tools Appl. 2023 834, vol. 83, no. 4, pp. 10393–10407, Jun. 2023, doi: 10.1007/S11042-023-15831-X.
Jamal Abdul Nasir, Iraklis Varlamis, “A knowledge-based semantic framework for query expansion,” Inf. Process. Manag., vol. 56, no. 5, pp. 1605–1617, 2019, doi: https://doi.org/10.1016/j.ipm.2019.04.007.
Dimitrios Markonis, Roger Schaer & Henning Müller, “Evaluating multimodal relevance feedback techniques for medical image retrieval,” Inf. Retr. J., vol. 19, pp. 100–112, 2016, [Online]. Available: https://link.springer.com/article/10.1007/s10791-015-9260-4
Dilip Kumar Sharma, Rajendra Pamula, “Query expansion – Hybrid framework using fuzzy logic and PRF,” Measurement, vol. 198, p. 111300, 2022, doi: https://doi.org/10.1016/j.measurement.2022.111300.
“(PDF) A New Query Expansion Approach for Improving Web Search Ranking.” Accessed: May 02, 2026. [Online]. Available: https://www.researchgate.net/publication/368325525_A_New_Query_Expansion_Approach_for_Improving_Web_Search_Ranking
Liang Wang, Nan Yang, Furu Wei, “Query2doc: Query Expansion with Large Language Models,” arXiv:2303.07678, 2023, [Online]. Available: https://arxiv.org/abs/2303.07678
T. Khan, U. Rashid, and A. R. Khan, “End-to-end pseudo relevance feedback based vertical web search queries recommendation,” Multimed. Tools Appl. 2024 8331, vol. 83, no. 31, pp. 75995–76033, Feb. 2024, doi: 10.1007/S11042-024-18559-4.
S. Tiwari, F. N. Al-Aswadi, and D. Gaurav, “Recent trends in knowledge graphs: theory and practice,” Soft Comput. 2021 2513, vol. 25, no. 13, pp. 8337–8355, Apr. 2021, doi: 10.1007/S00500-021-05756-8.
Shivani Choudhary, Tarun Luthra, Ashima Mittal, Rajat Singh, “A Survey of Knowledge Graph Embedding and Their Applications,” arXiv:2107.07842, 2021, [Online]. Available: https://arxiv.org/abs/2107.07842
Ciyuan Peng, Feng Xia, Mehdi Naseriparsa & Francesco Osborne, “Knowledge Graphs: Opportunities and Challenges,” Artif. Intell. Rev., vol. 56, 2023, [Online]. Available: https://link.springer.com/article/10.1007/s10462-023-10465-9
“ConvGQR: Generative Query Reformulation for Conversational Search - ACL Anthology.” Accessed: May 02, 2026. [Online]. Available: https://aclanthology.org/2023.acl-long.274/
Yunfan Gao, Yun Xiong, Meng Wang, Haofen Wang, “Modular RAG: Transforming RAG Systems into LEGO-like Reconfigurable Frameworks,” arXiv:2407.21059, 2024, [Online]. Available: https://arxiv.org/abs/2407.21059
Minghan Li, Xinxuan Lv, Junjie Zou, Tongna Chen, Chao Zhang, Suchao An, Ercong Nie, Guodong Zhou, “Query Expansion in the Age of Pre-trained and Large Language Models: A Comprehensive Survey,” arXiv:2509.07794, 2025, [Online]. Available: https://arxiv.org/abs/2509.07794
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 50sea

This work is licensed under a Creative Commons Attribution 4.0 International License.


















