A dataset dedicated to the training of large- language models for agronomic management practices and production in Norwegian agricultureGithubKaggle

This dataset focuses on the agricultural management practices and production in Norway, derived from the websites Nibio.no, Plantevernleksikonet.no, and nlr.no. All gathered data is in Norwegian. The data is in JSON files (RAW format) and covers topics pertinent to Norwegian agriculture, such as cro...

Full description

Saved in:
Bibliographic Details
Main Authors: Olena Bugaiova, Kristian Nikolai Jæger Hansen
Format: Article
Language:English
Published: Elsevier 2025-04-01
Series:Data in Brief
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2352340925000587
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832087563125391360
author Olena Bugaiova
Kristian Nikolai Jæger Hansen
author_facet Olena Bugaiova
Kristian Nikolai Jæger Hansen
author_sort Olena Bugaiova
collection DOAJ
description This dataset focuses on the agricultural management practices and production in Norway, derived from the websites Nibio.no, Plantevernleksikonet.no, and nlr.no. All gathered data is in Norwegian. The data is in JSON files (RAW format) and covers topics pertinent to Norwegian agriculture, such as crop rotation, soil health, plant protection and sustainable farming techniques. The data was collected by three Python scripts specially adapted to each website. The cleaned text data is valuable for training or evaluating Natural Language Processing (NLP) Models in an experimental context in Norway or adapting Large-Language Models (LLM) to the domain of Norwegian agriculture within the Norwegian language.
format Article
id doaj-art-9fe7cd2682b74105969b3a9ad6a0f451
institution Kabale University
issn 2352-3409
language English
publishDate 2025-04-01
publisher Elsevier
record_format Article
series Data in Brief
spelling doaj-art-9fe7cd2682b74105969b3a9ad6a0f4512025-02-06T05:11:57ZengElsevierData in Brief2352-34092025-04-0159111326A dataset dedicated to the training of large- language models for agronomic management practices and production in Norwegian agricultureGithubKaggleOlena Bugaiova0Kristian Nikolai Jæger Hansen1Aarhus University, Department of Agroecology, Section for Systems Analysis and Sustainability, 8830 Tjele, DenmarkCorresponding author.; Aarhus University, Department of Agroecology, Section for Systems Analysis and Sustainability, 8830 Tjele, DenmarkThis dataset focuses on the agricultural management practices and production in Norway, derived from the websites Nibio.no, Plantevernleksikonet.no, and nlr.no. All gathered data is in Norwegian. The data is in JSON files (RAW format) and covers topics pertinent to Norwegian agriculture, such as crop rotation, soil health, plant protection and sustainable farming techniques. The data was collected by three Python scripts specially adapted to each website. The cleaned text data is valuable for training or evaluating Natural Language Processing (NLP) Models in an experimental context in Norway or adapting Large-Language Models (LLM) to the domain of Norwegian agriculture within the Norwegian language.http://www.sciencedirect.com/science/article/pii/S2352340925000587FarmingMachine learningText dataDomain adaption
spellingShingle Olena Bugaiova
Kristian Nikolai Jæger Hansen
A dataset dedicated to the training of large- language models for agronomic management practices and production in Norwegian agricultureGithubKaggle
Data in Brief
Farming
Machine learning
Text data
Domain adaption
title A dataset dedicated to the training of large- language models for agronomic management practices and production in Norwegian agricultureGithubKaggle
title_full A dataset dedicated to the training of large- language models for agronomic management practices and production in Norwegian agricultureGithubKaggle
title_fullStr A dataset dedicated to the training of large- language models for agronomic management practices and production in Norwegian agricultureGithubKaggle
title_full_unstemmed A dataset dedicated to the training of large- language models for agronomic management practices and production in Norwegian agricultureGithubKaggle
title_short A dataset dedicated to the training of large- language models for agronomic management practices and production in Norwegian agricultureGithubKaggle
title_sort dataset dedicated to the training of large language models for agronomic management practices and production in norwegian agriculturegithubkaggle
topic Farming
Machine learning
Text data
Domain adaption
url http://www.sciencedirect.com/science/article/pii/S2352340925000587
work_keys_str_mv AT olenabugaiova adatasetdedicatedtothetrainingoflargelanguagemodelsforagronomicmanagementpracticesandproductioninnorwegianagriculturegithubkaggle
AT kristiannikolaijægerhansen adatasetdedicatedtothetrainingoflargelanguagemodelsforagronomicmanagementpracticesandproductioninnorwegianagriculturegithubkaggle
AT olenabugaiova datasetdedicatedtothetrainingoflargelanguagemodelsforagronomicmanagementpracticesandproductioninnorwegianagriculturegithubkaggle
AT kristiannikolaijægerhansen datasetdedicatedtothetrainingoflargelanguagemodelsforagronomicmanagementpracticesandproductioninnorwegianagriculturegithubkaggle