Lux

A model for classifying text by veracity.

The baseline uses word2vec (W2V) embeddings trained on the Google News corpus for fake news classification.

Lux proposes using linguistic aspects as features.

INSTALLATION

This repository uses BERT. To set it up:

1) Clone the BERT repository inside Lux/res:

-- git clone https://github.com/google-research/bert

2) Download the pre-trained BERT model:

-- wget https://storage.googleapis.com/bert_models/2018_10_18/uncased_L-12_H-768_A-12.zip

3) Unzip the model inside the bert folder:

-- unzip uncased_L-12_H-768_A-12.zip

The extracted folder should contain the model checkpoint, bert_config.json and vocab.txt.

4) Set the environment variable BERT_BASE_DIR:

-- export BERT_BASE_DIR=/path/to/Lux/res/bert/uncased_L-12_H-768_A-12

In our case: export BERT_BASE_DIR=~/Lux/res/bert/uncased_L-12_H-768_A-12

5) Start the bert-as-service server in another session or screen tab so it can answer requests:

-- bert-serving-start -model_dir $BERT_BASE_DIR -max_seq_len 512 -mask_cls_sep
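
Before starting the server, it can help to confirm that BERT_BASE_DIR points at a complete model folder. The function below is a minimal sketch, not part of the Lux repository: the name check_bert_dir is hypothetical, and it assumes the checkpoint files in the archive start with bert_model.ckpt.

```shell
# Hypothetical helper, not part of the Lux repository.
# Usage: check_bert_dir            (checks $BERT_BASE_DIR)
#        check_bert_dir /some/dir  (checks the given folder)
check_bert_dir() {
    dir="${1:-${BERT_BASE_DIR:-}}"
    if [ -z "$dir" ] || [ ! -d "$dir" ]; then
        echo "not a directory: '$dir'" >&2
        return 1
    fi
    # Files named in the installation steps.
    for f in bert_config.json vocab.txt; do
        if [ ! -f "$dir/$f" ]; then
            echo "missing: $dir/$f" >&2
            return 1
        fi
    done
    # Assumption: checkpoint files are named bert_model.ckpt.*
    # If nothing matches, the pattern stays literal and the -e test fails.
    set -- "$dir"/bert_model.ckpt*
    if [ ! -e "$1" ]; then
        echo "missing: $dir/bert_model.ckpt*" >&2
        return 1
    fi
    echo "ok: $dir"
}
```

Running check_bert_dir with no argument after step 4 prints "ok:" followed by the folder, or names the first thing it could not find.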

Install Specificity model

1) Download DASSP.zip inside res/specificity:

-- wget https://www.dropbox.com/s/41uw7wm2bbgoff4/DASSP.zip

2) Unzip its contents:

-- unzip DASSP.zip

3) Go into the extracted folder, then download and unzip GloVe:

-- cd Domain-Agnostic-Sentence-Specificity-Prediction/
-- wget https://www.dropbox.com/s/0g880op64chjw4b/glove.840B.300d.zip
-- unzip glove.840B.300d.zip

+) Check the README.md inside the folder in case modifications are needed.

Create a virtual environment with python3 and activate it

1) Go back to the Lux root folder:

-- virtualenv envLux -p python3

-- source envLux/bin/activate
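
To confirm the activation took effect in the current shell before installing anything, the sketch below reads VIRTUAL_ENV, which the activate script exports. The function name active_env_name is hypothetical and not part of the repository.

```shell
# Hypothetical helper, not part of the Lux repository.
# The activate script exports VIRTUAL_ENV; an empty value means no
# environment is active in this shell.
active_env_name() {
    if [ -z "${VIRTUAL_ENV:-}" ]; then
        echo "no virtual environment is active" >&2
        return 1
    fi
    # Print only the folder name, e.g. envLux.
    basename "$VIRTUAL_ENV"
}
```

After the source step above, active_env_name should print envLux. If it reports that no environment is active, pip would install into the system Python instead.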

Install requirements

-- pip install -r requirements.txt

Download and extract GoogleNews-vectors-negative300.bin into data/

-- cd data/
-- wget -c "https://s3.amazonaws.com/dl4j-distribution/GoogleNews-vectors-negative300.bin.gz"
-- gunzip GoogleNews-vectors-negative300.bin.gz
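
The archive is large, and wget -c resumes a partial download rather than restarting it. Since gunzip replaces the .gz file once extraction succeeds, an integrity test between the wget and gunzip steps can catch an incomplete download early. The following is a minimal sketch. The function name verify_gz is hypothetical and not part of the repository.

```shell
# Hypothetical helper, not part of the Lux repository.
# gzip -t reads the whole archive and checks it without writing any output.
verify_gz() {
    if gzip -t "$1" 2>/dev/null; then
        echo "intact: $1"
    else
        echo "corrupt, incomplete or missing: $1" >&2
        return 1
    fi
}
```

For example, verify_gz GoogleNews-vectors-negative300.bin.gz can be run inside data/ before gunzip. If it fails, rerunning the wget -c command continues the download.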

Running

-- bash run.sh

OR

-- sudo -E python3 lux.py

If 'True' is passed as the first argument, it is assigned to force_reload, and new BERT models and new features are generated. The -E flag asks sudo to preserve the current environment, including BERT_BASE_DIR.

Papers

Please cite the published articles related to this work:

Azevedo, Lucas, et al. "LUX (Linguistic aspects Under eXamination): Discourse Analysis for Automatic Fake News Classification." Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021. 2021.

@inproceedings{azevedo2021lux,
  title={LUX (Linguistic aspects Under eXamination): Discourse Analysis for Automatic Fake News Classification},
  author={Azevedo, Lucas and d’Aquin, Mathieu and Davis, Brian and Zarrouk, Manel},
  booktitle={Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021},
  pages={41--56},
  year={2021}
}

Azevedo, Lucas, and Mohamed Moustafa. "Veritas annotator: Discovering the origin of a rumour." Proceedings of the Second Workshop on Fact Extraction and VERification (FEVER). 2019.

@inproceedings{azevedo2019veritas,
  title={Veritas annotator: Discovering the origin of a rumour},
  author={Azevedo, Lucas and Moustafa, Mohamed},
  booktitle={Proceedings of the Second Workshop on Fact Extraction and VERification (FEVER)},
  pages={90--98},
  year={2019}
}

Azevedo, Lucas. "Truth or lie: Automatically fact checking news." Companion Proceedings of the The Web Conference 2018. 2018.

@inproceedings{azevedo2018truth,
  title={Truth or lie: Automatically fact checking news},
  author={Azevedo, Lucas},
  booktitle={Companion Proceedings of the The Web Conference 2018},
  pages={807--811},
  year={2018}
}

