This repo contains experiment scripts and configs that were used in TurkishBERTweet: Fast and Reliable Large Language Model for Social Media Analysis paper.
The names of the directories align with the names used in the paper. Config files contain information about the hyperparameters that we used while finetuning.
The prompts are applied beforehand, and you can find the prompt structures in the paper. If you want to use your own dataset for finetuning, you need to preprocess the data and then use the scripts.
There are a certain number of libraries that need to be installed in your virtual environment before running the scripts. You can find the requirements.txt
file in the repo.
@article{najafi2024turkishbertweet,
title={Turkishbertweet: Fast and reliable large language model for social media analysis},
author={Najafi, Ali and Varol, Onur},
journal={Expert Systems with Applications},
pages={124737},
year={2024},
publisher={Elsevier}
}