A Benchmark Dilemma

This repository contains the code accompanying the paper A Benchmark Dilemma.

Project Structure

art-multimodal-benchmark/
│
├── data/                                # (these files are not pushed, but created as when running the code)
│   ├── wikiart/                         # -> download WikiArt dataset from HuggingFace and save to disk as 'wikiart'
│   ├── wikiart_embeddings/              # created when running extract_embeddings.py with wikiart data
│   ├── wikidata/                        # -> download WikiData dataset from HuggingFace and save to disk as 'wikidata'
│   ├── wikidata_embeddings/             # created when running extract_embeddings.py with wikidata data
│   └── aug_embeddings/                  # created when running extract_augmented_embeddings.py with wikiart data
│
├── src/                                 # Source code
│   ├── classification_utils.py          # utils for building initial classification model
│   ├── classification_wikidata.py       # classify artists from wikidata dataset
│   ├── classify_augmentations.py        # run classification with augmented data
│   ├── custom_augmentations.py          # contains code to create custom data augmentations
│   ├── extract_embeddings.py            # extract embeddings from HuggingFace dataset with MIEB/timm models
│   ├── extract_augmented_embeddings.py  # extract embeddings from augmented images
│   ├── initial_classification.py        # code to run initial classification task on embeddings
│   ├── segmentation_task.py             # code to run tree segmentation + augmentation with SAM3
│   └── subclassification.py             # code to run subclassification with input subclassification task
│
├── out/                                 # All outputs
│   ├── classification_reports/          # classification reports of initial classification task on genre, styles and artists
│   ├── extraction_times/                # feature extraction times for wikidata embeddings
│   ├── misclassified_examples_subclassifications/ # plots of misclassification examples for the subclassification task
│   ├── subclassification_conf_matrices/ # confusion matric plots for subclassification task
│   ├── subclassification_reports/       # classification reports for subclassification task
│   ├── test_augmentation_results/       # classification reports for data augmentation classification task
│   ├── segmentations/                   # .csv scripts with segmentation results metadata
│   └── wikidata_clf_results/            # results from wikidata classification
│
├── .env                                 # contains HuggingFace token (currently empty, needs to be specified by the user)
├── all_models.txt                       # list of embedding models used for this project                    
├── extract_embeddings.sh                # run script to extract embeddings for dataset with specified list of models
├── initial_clf_task.sh                  # run script for initial classification task
├── README.md               
├── requirements.txt                     # Python dependencies
├── run_augmentations.sh                 # runs extract_augmented_embeddings.py and classify_augmentations.py to extract augmented embeddings and classify them
├── run_subclf.sh                        # run subclassification.py script with defined subclassification task
├── run_wikidata_clf.sh                  # run classification_wikidata.py scrip
├── SAM_requirements.txt                 # required packages + versions to run SAM3 + segmentation code
├── sam_setup.sh                         # download SAM3 model and install required packages with specified versions
└── setup.sh                             # set up virtual environment and install required packages

Prerequisites

First, clone the project's repository:

git clone https://github.com/centre-for-humanities-computing/art-multimodal-benchmark.git

In order to use SAM3, you need to agree to share your contact information and specify a personal HuggingFace token. See https://huggingface.co/facebook/sam3. Next, create a HuggingFace token (which allows usage of the models) and insert it in the .env file in the repo.

Data

A filtered & cleaned version of WikiArt version can be found on HuggingFace HERE.

The WikiData dataset used can be found HERE.

We recommend downloading the datasets via HuggingFace and placing it in the data folder:

import datasets
import os
import argparse 
from functools import partial

# load dataset from the hub
hf_data = datasets.load_dataset('chcaa/wikiart_benchmarking', split='train', streaming=True) # change 'wikiart' with 'wikidata' to get wikidata instead

# convert dataset to iterable generator
def gen_from_iterable_dataset(iterable_ds):
    yield from iterable_ds

# convert to dataset to be saved locally
ds = datasets.Dataset.from_generator(partial(gen_from_iterable_dataset, hf_data), features=hf_data.features)

# create datafolder
os.makedirs('data', exist_ok=True)

# save to disk
ds.save_to_disk(os.path.join('data', 'wikiart')) # change with 'wikidata' for wikidata dataset

Usage

First, clone the repo with

git clone https://github.com/centre-for-humanities-computing/art-multimodal-benchmark.git

In the main folder, set up virtual environment and install required packages with

bash setup.sh

Make sure all scripts are run from the main folder, i.e., art-multimodal-benchmark.

Extract embeddings

To run feature extraction with predefined arguments, run:

bash extract_embeddings.sh

Initial classification task:

initial_clf_task.sh

Benchmarking tasks

Classifications/benchmarking tasks with predefined arguments:

Subclassifications:

run_subclf.sh

Wikidata classification:

run_wikidata_clf.sh

Augmentations:

run_augmentations.sh

Segmentation:

# download SAM3 model
bash sam_setup.sh

# activate SAM3 env
source .venv/bin/activate

# run tree segmentation with SAM3

python3 src/segmentation_task.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A Benchmark Dilemma

Project Structure

Prerequisites

Data

Usage

Extract embeddings

Initial classification task:

Benchmarking tasks

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 59 Commits
out		out
src		src
.DS_Store		.DS_Store
.env		.env
.gitignore		.gitignore
README.md		README.md
SAM_requirements.txt		SAM_requirements.txt
all_models.txt		all_models.txt
env_to_jupyter.sh		env_to_jupyter.sh
extract_embeddings.sh		extract_embeddings.sh
initial_clf_task.sh		initial_clf_task.sh
requirements.txt		requirements.txt
run_augmentations.sh		run_augmentations.sh
run_subclf.sh		run_subclf.sh
run_wikidata_clf.sh		run_wikidata_clf.sh
sam_setup.sh		sam_setup.sh
setup.sh		setup.sh

Folders and files

Latest commit

History

Repository files navigation

A Benchmark Dilemma

Project Structure

Prerequisites

Data

Usage

Extract embeddings

Initial classification task:

Benchmarking tasks

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages