A dataset dedicated to the training of large-language models for agronomic management practices and production in Norwegian agriculture
This dataset focuses on the agricultural management practices and production in Norway, derived from the websites Nibio.no, Plantevernleksikonet.no, and nlr.no. All gathered data is in Norwegian. The data is in JSON files (RAW format) and covers topics pertinent to Norwegian agriculture, such as cro...
Main Authors: | Olena Bugaiova, Kristian Nikolai Jæger Hansen |
---|---|
Format: | Article |
Language: | English |
Published: | Elsevier, 2025-04-01 |
Series: | Data in Brief |
Subjects: | Farming, Machine learning, Text data, Domain adaption |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2352340925000587 |
_version_ | 1832087563125391360 |
---|---|
author | Olena Bugaiova Kristian Nikolai Jæger Hansen |
author_facet | Olena Bugaiova Kristian Nikolai Jæger Hansen |
author_sort | Olena Bugaiova |
collection | DOAJ |
description | This dataset focuses on the agricultural management practices and production in Norway, derived from the websites Nibio.no, Plantevernleksikonet.no, and nlr.no. All gathered data is in Norwegian. The data is in JSON files (RAW format) and covers topics pertinent to Norwegian agriculture, such as crop rotation, soil health, plant protection and sustainable farming techniques. The data was collected by three Python scripts specially adapted to each website. The cleaned text data is valuable for training or evaluating Natural Language Processing (NLP) Models in an experimental context in Norway or adapting Large-Language Models (LLM) to the domain of Norwegian agriculture within the Norwegian language. |
format | Article |
id | doaj-art-9fe7cd2682b74105969b3a9ad6a0f451 |
institution | Kabale University |
issn | 2352-3409 |
language | English |
publishDate | 2025-04-01 |
publisher | Elsevier |
record_format | Article |
series | Data in Brief |
spelling | doaj-art-9fe7cd2682b74105969b3a9ad6a0f451 2025-02-06T05:11:57Z eng Elsevier Data in Brief 2352-3409 2025-04-01 59 111326 A dataset dedicated to the training of large-language models for agronomic management practices and production in Norwegian agriculture Olena Bugaiova 0 Kristian Nikolai Jæger Hansen 1 Aarhus University, Department of Agroecology, Section for Systems Analysis and Sustainability, 8830 Tjele, Denmark Corresponding author.; Aarhus University, Department of Agroecology, Section for Systems Analysis and Sustainability, 8830 Tjele, Denmark This dataset focuses on the agricultural management practices and production in Norway, derived from the websites Nibio.no, Plantevernleksikonet.no, and nlr.no. All gathered data is in Norwegian. The data is in JSON files (RAW format) and covers topics pertinent to Norwegian agriculture, such as crop rotation, soil health, plant protection and sustainable farming techniques. The data was collected by three Python scripts specially adapted to each website. The cleaned text data is valuable for training or evaluating Natural Language Processing (NLP) Models in an experimental context in Norway or adapting Large-Language Models (LLM) to the domain of Norwegian agriculture within the Norwegian language. http://www.sciencedirect.com/science/article/pii/S2352340925000587 Farming Machine learning Text data Domain adaption |
spellingShingle | Olena Bugaiova Kristian Nikolai Jæger Hansen A dataset dedicated to the training of large-language models for agronomic management practices and production in Norwegian agriculture Data in Brief Farming Machine learning Text data Domain adaption |
title | A dataset dedicated to the training of large-language models for agronomic management practices and production in Norwegian agriculture |
title_full | A dataset dedicated to the training of large-language models for agronomic management practices and production in Norwegian agriculture |
title_fullStr | A dataset dedicated to the training of large-language models for agronomic management practices and production in Norwegian agriculture |
title_full_unstemmed | A dataset dedicated to the training of large-language models for agronomic management practices and production in Norwegian agriculture |
title_short | A dataset dedicated to the training of large-language models for agronomic management practices and production in Norwegian agriculture |
title_sort | dataset dedicated to the training of large language models for agronomic management practices and production in norwegian agriculturegithubkaggle |
topic | Farming Machine learning Text data Domain adaption |
url | http://www.sciencedirect.com/science/article/pii/S2352340925000587 |
work_keys_str_mv | AT olenabugaiova adatasetdedicatedtothetrainingoflargelanguagemodelsforagronomicmanagementpracticesandproductioninnorwegianagriculturegithubkaggle AT kristiannikolaijægerhansen adatasetdedicatedtothetrainingoflargelanguagemodelsforagronomicmanagementpracticesandproductioninnorwegianagriculturegithubkaggle AT olenabugaiova datasetdedicatedtothetrainingoflargelanguagemodelsforagronomicmanagementpracticesandproductioninnorwegianagriculturegithubkaggle AT kristiannikolaijægerhansen datasetdedicatedtothetrainingoflargelanguagemodelsforagronomicmanagementpracticesandproductioninnorwegianagriculturegithubkaggle |