Financial Question-answering Dataset for Slovak Language Model Evaluation

Autori

Daniel Hládek
Kristián Sopkovič
Ján Staš
Zuzana Sokolová
Matúš Pleva

DOI:

https://doi.org/10.2478/jazcas-2025-0022

Kľúčové slová:

question answering, financial domain, large language model, evaluation, Slovak language resource

Abstrakt

The limited availability of language resources for Slovak presents a significant challenge for the development and evaluation of language models. In this paper, we introduce a multiple-choice question-answering dataset specifically designed for the financial domain in Slovak. The dataset contains 1,334 questions, each with one correct answer and four incorrect ones. It is systematically organized by topic and difficulty level to facilitate structured evaluation. Using this dataset, we assess the performance of several Slovak generative language models and compare their results against a general questionanswering dataset to analyze domain-specific model capabilities. The best-performing model is a monolingual Slovak model. Furthermore, the observed performance differences between financial-domain and general question-answering tasks suggest that domainspecific language modeling requires further research.

Sťahovanie

PDF (English)

Publikované

31-03-2025

Číslo

Ročník 76 Číslo 1 (2025): Jazykovedný časopis

Rubrika

Štúdie

Licencia

Táto práca je licencovaná pod Medzinárodnou licenciou Creative Commons Attribution-NonCommercial-NoDerivatives 4.0.

Ako citovať

Financial Question-answering Dataset for Slovak Language Model Evaluation. (2025). Jazykovedný časopis, 76(1), 247-257. https://doi.org/10.2478/jazcas-2025-0022

Stiahnuť citáciu