https://pan.webis.de/semeval23/pan23-web/clickbait-challenge.html
We preprocessed the train and validation datasets in this script.
Requires an OpenAI API key. We prepared our gpt baseline here. We get predictions for validation data using 2-shot.
We prepeared our TF-IDF baseline here. Both the prediction and evaluation done in this notebook.
We finetuned a LLaMA-7B model using LoRA. We also save the predictions for validation dataset in a txt file
We finetuned a Falcon-7B model using QLoRA. We quantized into 4 bits. We also save the predictions for validation dataset.
We finetuned a RoBERTa and saved the validation outputs.
We finetuned a T5 model and save a checkpoint.
We make predictions for the trained T5 model, which loads from the saved checkpoint, and save the results.
We calculate the Bleu and Bert scores in this script for all save validation outputs from all models.