Comparison of LLM-based benchmarks using auxiliary systems such as Knowledge Graphs (KGs), Retrieval Augmented Generation (RAG) systems and external dataset sources such documents and search engines

View Dataset
Carlos F. Enguix

Description

This comparison includes the most representative state-of-the-art (SOTA) benchmarks up-to-date, focusing and including on each respective benchmark architecture, large language models (LLMs) aided by knowledge graph (KG), retrieval augmented generation (RAG) systems, and other external sources such as document datasets and search engines. We present the following research questions associated with this benchmark comparison:1) RQ1: Do the diverse LLM-based benchmarks increase accuracy metrics significantly by incorporating external sources such as auxiliary knowledge graphs (KGs), retrieval augmented generation (RAG) systems, external document datasets, and search engines? 2) RQ2: Do the diverse LLM-based benchmarks decrease LLM-based hallucination metrics significantly by incorporating external sources such as auxiliary knowledge graphs (KGs), retrieval augmented generation (RAG) systems, external document datasets, and search engines? 3) RQ3: Do the diverse LLM-based benchmarks increase accuracy metrics significantly by fine-tuning the respective LLMs, as compared to querying pre-trained LLMs via zero-shot prompting?

Citations (0)

Mentions (0)

Metrics

Dataset Index

1.4

FAIR Score

58%

Citations

0

Mentions

0

Metrics Over Time

Publication Details

DOI

Publisher

Open Research Knowledge Graph

Assigned Domain

Subfield

Artificial Intelligence

Field

Computer Science

Domain

Physical Sciences

Confidence Score

48%

Source

Scholar Data Model

Keywords

Semantic Web

Normalization Factors

FT

13.46

CTw

1.00

MTw

1.00