ElectAI

Taxonomy

Datasets

The dataset folder contains two subfolders.

Code

The datasets and code can be accessed through the following link: Datasets

All the scripts are implemented using python.

Installation

Clone the repository from github using the link above as follows:

git clone https://github.com/LanguageTechnologyLab/ElectAI.git

Navigate to the GenOffense directory and install the required packages

cd ElectAI
pip install -r requirements.txt

Accessing Data

The files are a '.csv' files. The columns are tab separated. These file can be loaded in python using the pandas library as follows.

import pandas
pandas.read_csv(filename, sep="\t")

Claim Understanding

To evaluate an LLM for claim understanding, use the following command. You need to specify the name of the model.

python incontext.py MODEL_NAME

MODEL_NAME can be llama, falcon, mistral, flan

Authorship Attribution

To train/fine-tune a model for classification, run the following command. You must specify the name of the model. In case of Random forest, specify the features to be used

python trainer.py MODEL_NAME FEATURE_NAME

MODEL_NAME can be bert, roberta, rf

FEATURE_NAME can be tfidf, word2vec