Verifying Cross-modal Entity Consistency in News using Vision-language Models

This repository is the official implementation of Verifying Cross-modal Entity Consistency in News using Vision-language Models published in ECIR 2025.

Please use the following citation:

@InProceedings{ECIR_Tahmasebi25,
  author    = {Sahar Tahmasebi, Eric Müller-Budack and Ralph Ewerth},
  booktitle = {European Conference on Information Retrieval, (ECIR) 2025,  Lucca, Italy, April 6-10, 2025},
  title     = {Verifying Cross-modal Entity Consistency in News using Vision-language Models},
  year      = {2025},
}

Requirements

To install requirements:

conda create -n LVLM4CEC python=3.9.18
conda install pytorch==1.11.0 torchvision==0.12.0 torchaudio==0.11.0 cudatoolkit=11.3.1 -c pytorch
pip install -r requirements.txt

Datasets

The fully annotated datasets used in this work, including images, can be found in _dataset directory.
To access the original TamperedNews and News400 datasets with their corresponding images, visit: TamperedNews & News400 Dataset
To access the original MMG dataset along with its images, visit: MMG Dataset

Evaluation

Each experiment includes a dedicated script for analyzing the generated answers. To run the pipeline and evaluate on different datasets, execute the corresponding bash script for the specific experiment and dataset using the following command:

<path_to_experiment>/run_<dataset_name>.sh <basepath_to_experiment> <generate_questions_flag> <generate_answers_flag>

For example, to run the pipeline on the news400_ent dataset without incorporating evidence images, use:

LVLM4CEC/01_without_evidence_images/run_news400.sh LVLM4CEC/01_without_evidence_images/ 1 1

All generated outputs, including model predictions and evaluation statistics, will be saved in the ./output directory. To display the results for a specific model and prompt templates, specify the model name using the --models argument and run:

python PrintResults.py --models <model_name>

Credit

This repository is built by Sahar Tahmasebi.

Contributing

Our work is licenced under the CC BY 4.0. It includes tamperedNews and news400 datasets which are licensed under [CC BY 3.0] by Eric Müller-Budack. The original datasets have been modified to align with the requirements of this project.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
LVLM4CEC		LVLM4CEC
_datasets		_datasets
models		models
.gitignore		.gitignore
README.md		README.md
printResultTable.py		printResultTable.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Verifying Cross-modal Entity Consistency in News using Vision-language Models

Requirements

Datasets

Evaluation

Credit

Contributing

About

Releases

Packages

Languages

TIBHannover/LVLM4CEC

Folders and files

Latest commit

History

Repository files navigation

Verifying Cross-modal Entity Consistency in News using Vision-language Models

Requirements

Datasets

Evaluation

Credit

Contributing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages