Comparison of LLM-based benchmarks using auxiliary systems such as Knowledge Graphs (KGs), Retrieval Augmented Generation (RAG) systems and external dataset sources such documents and search engines
View DatasetDescription
This comparison includes the most representative state-of-the-art (SOTA) benchmarks up-to-date, focusing and including on each respective benchmark architecture, large language models (LLMs) aided by knowledge graph (KG), retrieval augmented generation (RAG) systems, and other external sources such as document datasets and search engines. We present the following research questions associated with this benchmark comparison:1) RQ1: Do the diverse LLM-based benchmarks increase accuracy metrics significantly by incorporating external sources such as auxiliary knowledge graphs (KGs), retrieval augmented generation (RAG) systems, external document datasets, and search engines? 2) RQ2: Do the diverse LLM-based benchmarks decrease LLM-based hallucination metrics significantly by incorporating external sources such as auxiliary knowledge graphs (KGs), retrieval augmented generation (RAG) systems, external document datasets, and search engines? 3) RQ3: Do the diverse LLM-based benchmarks increase accuracy metrics significantly by fine-tuning the respective LLMs, as compared to querying pre-trained LLMs via zero-shot prompting?
Citations (0)
No citations found
Mentions (0)
No mentions found
Metrics Over Time
Publication Details
Subfield
Artificial Intelligence
Field
Computer Science
Domain
Physical Sciences
Confidence Score
48%
Source
Scholar Data Model