Can LLMs replace traditional search engines for information retrieval?

Do LLMs actually give better answers than a Google search?

Not yet — and in some ways they are worse. A 2023 study directly compared ChatGPT against Google for patient questions about a common medical condition (benign paroxysmal positional vertigo). ChatGPT's answers were significantly harder to read: they required a 13.9 grade reading level versus Google's 10.7, and scored lower on a standard quality scale (DISCERN part 2: 17.5 vs 25.4 out of 40) [1]. That means the average person would find ChatGPT's explanations more difficult to understand than a typical Google result.

However, ChatGPT did score well on accuracy (4.19 out of 5) and currency (4.31 out of 5), meaning the information was correct and up-to-date — just harder to digest [1]. So the trade-off is: LLMs can produce accurate, current answers, but they often bury them in complex language, while search engines deliver more readable content from diverse sources.

What is the fundamental weakness of using an LLM as your only search tool?

The core problem is hallucination — LLMs confidently generate false information. A 2024 perspective paper from a leading information retrieval researcher states bluntly that 'concerns such as hallucination undermine their trustworthiness, limiting their actual utility when deployed in real-world applications, especially high-stake applications where trust is vital' [2]. Unlike a search engine that links to sources you can verify, an LLM gives you a smooth-sounding answer with no guarantee it is true.

This is why the same paper argues that 'LLMs will not be able to replace search engines' and predicts that future LLMs will need to 'learn how to use a search engine' — essentially becoming a smarter interface on top of traditional retrieval [2]. Another 2024 talk makes the same point: retrieval technology is 'more relevant than ever before, because we need information to be grounded in sources' [4]. The takeaway: LLMs are powerful at understanding and generating language, but they are unreliable fact-checkers on their own.

So what does the future actually look like?

The evidence points to a hybrid model where LLMs and search engines work together, not one replacing the other. Major search engines are already integrating AI chat into their results: Google launched Gemini, Microsoft launched Copilot (formerly Bing Chat), and Baidu launched Ernie [3]. These systems use LLMs to understand complex or conversational queries, then rely on the search engine's index to retrieve and cite real sources.

Research backs this up. A 2024 study on cross-lingual search showed that combining a multilingual retrieval system with an LLM achieved state-of-the-art results, outperforming either approach alone [5]. And a 2023 paper found that LLMs can generate accurate URLs when given a few examples — nearly 90% of those URLs led to documents containing correct answers — but the LLM still needed the search engine's database to point to [6]. The bottom line: LLMs are becoming a smarter front-end for search, not a replacement for the search engine itself.

Sources used in this answer

BPPV Information on Google Versus AI (ChatGPT)

ChatGPT's medical answers were harder to read (13.9 vs 10.7 grade level) and lower quality (DISCERN 17.5 vs 25.4) than Google's top 30 results, though they were accurate and current.

2023 · Jeffrey R Bellinger, Julian S De La Chapa, Minhie W Kwak, Gabriel A Ramos, Daniel Morrison, Bradley W Kesser · Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery

Original

Large Language Models and Future of Information Retrieval: Opportunities and Challenges

LLMs cannot replace search engines due to hallucination and trust issues; future LLMs will need to use search engines to ground their answers.

2024 · ChengXiang Zhai · SIGIR

Original

Is ChatGPT-like technology going to replace commercial search engines?

Google, Microsoft, and Baidu have all integrated AI chat into their search engines, creating hybrid systems rather than replacements.

2024 · Artur Strzelecki · Library Hi Tech News

Original

Is the Search Engine of the Future a Chatbot?

Retrieval technology is more relevant than ever because information must be grounded in sources, even as LLMs change how users interact with information.

2024 · Suzan Verberne · CIKM

Original

Steering Large Language Models for Cross-lingual Information Retrieval

Combining a multilingual retrieval system with an LLM (ASMR) achieved state-of-the-art results on cross-lingual search benchmarks, outperforming either alone.

2024 · Ping Guo, Yubing Ren, Yue Hu, Yanan Cao, Yunpeng Li, Heyan Huang · SIGIR

Original

Large Language Models are Built-in Autoregressive Search Engines

LLMs can generate accurate URLs for document retrieval with nearly 90% success when given a few examples, but still rely on the search engine's database.

2023 · Noah Ziems, Wenhao Yu, Zhihan Zhang, Meng Jiang · Findings of the Association for Computational Linguistics: ACL 2023

Original