Lielo valodas modeļu automātiska pielāgošana

Vītols, Andris

dc.contributor.advisor	Ivanovs, Maksims
dc.contributor.author	Vītols, Andris
dc.contributor.other	Latvijas Universitāte. Datorikas fakultāte
dc.date.accessioned	2024-06-20T01:04:30Z
dc.date.available	2024-06-20T01:04:30Z
dc.date.issued	2024
dc.identifier.other	102949
dc.identifier.uri	https://dspace.lu.lv/dspace/handle/7/66138
dc.description.abstract	Lielo valodas modeļu izmantošana specifisku uzdevumu veikšanai var būt nepieciešams veikt papildus pielāgošanu. Gan izmantošana, gan pielāgošana prasa ievērojami daudz datorresursu. Abi šie ierobežojumi kavē lielo valodas modeļu tālāku pētniecību, attīstību un ieviešanu. Šī bakalaura darba mērķis ir apskatīt metodes un metodoloģijas, kas varētu atvieglot lielo valodas modeļu pielāgošanas procesu ar ierobežotiem datorresursiem. Tika veikti divi eksperimenti lietojot darbā apskatīto metodoloģiju. 1. eksperimentā tika veikta Mistral-7B-Instruct-v0.2 kvantēta modeļa tālāka pielāgošana instrukciju sekošanai, 2. eksperimentā Mistral-7B-v0.1. Abos eksperimentos metrikās tika iegūtu salīdzinoši labāki rezultāti. Atslēgas vārdi: kvantēšana, lielie valodas modeļi, metodoloģija, metrikas, pielāgošana
dc.description.abstract	Using large language models for specific tasks may require the model to be fine-tuned. Both the use and tuning require a significant amount of computer resources. These limitations hinder further research, development and implementation of large language models. The purpose of this bachelor's paper is to review methods and methodologies that could ease the process of fine-tuning large language models with limited computer resources. Two experiments were conducted using the methodology discussed in this work. In experiment 1, further adaptation of the Mistral-7B-Instruct-v0.2 quantized model for instruction following was performed, in experiment 2 it was done with Mistral-7B-v0.1. In both experiments, the metrics reflected relatively better results. Key words: benchmarks, fine-tuning, large language models, methodology, quantization
dc.language.iso	lav
dc.publisher	Latvijas Universitāte
dc.rights	info:eu-repo/semantics/openAccess
dc.subject	Datorzinātne
dc.subject	kvantēšana
dc.subject	lielie valodas modeļi
dc.subject	metodoloģija
dc.subject	metrikas
dc.subject	pielāgošana
dc.title	Lielo valodas modeļu automātiska pielāgošana
dc.title.alternative	Large language model self-alignment
dc.type	info:eu-repo/semantics/bachelorThesis

Files in this item

Name:: 302-102949-Vitols_Andris_av201 ...
Size:: 1.367Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Bakalaura un maģistra darbi (EZTF) / Bachelor's and Master's theses [5488]

Show simple item record