GitHub - hamlet-khachatryan/PSEUDO: PSEUDO: Computational framework for debiasing and experimental uncertainty quantification in protein structural models

PSEUDO

PSEUDO is a computational framework for debiasing and experimental uncertainty quantification in protein structural models resolved by molecular replacement.

It provides a three-stage pipeline that runs on HPC clusters via SLURM:

Stage	Command	What it does
Debias	`pseudo-debias`	Generates stochastic omit perturbation (STOMP) maps via Phenix
Quantify	`pseudo-quantify`	Separates true signal from phase bias across the map ensemble
Analyse	`pseudo-analyse`	Scores every model atom against the debiased SNR map (MUSE)

Installation

git clone https://github.com/hamlet-khachatryan/PSEUDO.git
cd PSEUDO
pip install -e ".[dev]"

PSEUDO requires the Phenix Software Suite for STOMP map calculation. Load it before submitting any SLURM jobs:

module load phenix

Minimal configuration

Create a YAML file (e.g. run.yaml) with just the required fields:

debias:
  run_name: "my_experiment"
  structure_path: "/data/target.pdb"
  reflections_path: "/data/target.mtz"

paths:
  work_dir: "/scratch/my_project"

slurm:
  partition: "cs05r"

All other parameters fall back to built-in defaults (see Configuration Reference).

Input modes

PSEUDO accepts three input modes for the debias stage:

Mode	Required fields	Behaviour
Single structure	`structure_path` + `reflections_path`	Always processed as-is
CSV screening	`screening_path` (`.csv`)	All rows always processed; must contain `PDB`/`CIF` and `MTZ` columns
SQLite screening	`screening_path` (`.sqlite`)	Diamond SoakDB format; supports outcome filtering and structure count capping

SQLite-specific options

sqlite_outcomes — comma-separated substrings matched against the RefinementOutcome column. Accepted values:

Analysis Pending, PANDDA model - minor, In Refinement,
CompChem ready, Deposition ready, Deposited, Analysed & Rejected

max_structures — cap on the number of structures processed from the SQLite file. Set to null (default) to process all matching entries.

Example config for SQLite with filtering:

debias:
  screening_path: "/data/soakdb.sqlite"
  sqlite_outcomes: "CompChem ready, Deposition ready, Deposited"  # comma-separated string
  max_structures: 50

Or via CLI:

pseudo-debias generate-params --config run.yaml \
  --sqlite_outcomes "CompChem ready, Deposited" \
  --max_structures 50

Basic usage

Three commands take a structure from raw reflections to per-atom density support scores:

pseudo-debias generate-params --config run.yaml

jid=$(sbatch --parsable submit_preprocessing.slurm) && \
sbatch --dependency=afterok:$jid submit_omission.slurm

pseudo-quantify --input_path /scratch/my_project/my_experiment
pseudo-analyse  --input_path /scratch/my_project/my_experiment

Quantification results are stored in <work_dir>/<run_name>/<crystal>/quantify_results/<k_*_cap_*>/:

File	Description
`{stem}_mean.ccp4`	STOMP$_{\mu}$ map: average over the STOMP ensemble
`{stem}_std.ccp4`	STOMP$_{\sigma}$ map: voxel-wise standard deviation map of the STOMP ensemble
`{stem}_snr.ccp4`	STOMP$_{SNR}$ map: voxel-wise signal-to-noise ratio map
`{stem}_p_values.ccp4`	voxel-wise signal probability map derived from the STOMP$_{SNR}$ map

Analysis results land in <work_dir>/<run_name>/<crystal>/analyse_results/:

File	Description
`{stem}_atoms.csv`	Per-atom MUSE scores and diagnostic flags
`{stem}_residues.csv`	Per-residue MUSEm aggregated scores
`{stem}_summary.json`	Global statistics (OPIA, counts, thresholds)
`{stem}_scored.pdb`	Structure with MUSE scores in the B-factor column

Load {stem}_scored.pdb in PyMOL and colour by B-factor to visualise density support across the model.

Screen Report

When running on a screening directory (multiple crystals), pseudo-analyse automatically generates an interactive HTML summary report at <screening_dir>/index.html.

The report provides:

Summary cards — total crystals, analysis-complete count, mean OPIA across the run
Sortable table — one row per crystal with colour-coded OPIA and ligand MUSE badges, SNR significance threshold, links to all output files, and a collapsible binding-site residue table
Coot integration — one pre-built session script per crystal that loads the scored model, refined 2FoFc map, STOMP_μ map, and STOMP_SNR map, centred on the ligand

The report can be regenerated at any time without re-running analysis:

pseudo-screen-report --input_path /scratch/results/my_screen

See the Analyse guide for full documentation.

Citation

If you use this code — including STOMP maps, the PSEUDO platform, or MUSE scores — in your research, please use the following citation (preprint available soon):

@software{khachatryan2026pseudo,
    title     = {PSEUDO: Framework for Phase Uncertainty Estimation of Protein Models},
    author    = {Khachatryan, Hamlet and Wild, Conor and von Delft, Frank},
    year      = {2026},
    url       = {https://github.com/hamlet-khachatryan/PSEUDO}
}

MUSE scores

MUSE adapts the EDIA methodology. If you use MUSE scores, please also cite:

@article{meyder2017edia,
    title   = {Estimating Electron Density Support for Individual Atoms and Molecular Fragments in X-ray Structures},
    author  = {Meyder, Agnes and Nittinger, Eva and Lange, Gudrun and Klein, Robert and Rarey, Matthias},
    journal = {Journal of Chemical Information and Modeling},
    volume  = {57},
    pages   = {2437--2447},
    year    = {2017},
    doi     = {10.1021/acs.jcim.7b00391}
}

@article{nittinger2015water,
    title   = {Evidence of Water Molecules --- {A} Statistical Evaluation of Water Molecules Based on Electron Density},
    author  = {Nittinger, Eva and Meyder, Agnes and Lange, Gudrun and Klein, Robert and Rarey, Matthias},
    journal = {Journal of Chemical Information and Modeling},
    volume  = {55},
    pages   = {771--783},
    year    = {2015},
    doi     = {10.1021/ci500662d}
}

Copyright

For enquiries, contact [email protected].

Name		Name	Last commit message	Last commit date
Latest commit History 83 Commits
.github/workflows		.github/workflows
docs		docs
src		src
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Installation

Minimal configuration

Input modes

SQLite-specific options

Basic usage

Screen Report

Further reading

Citation

MUSE scores

Copyright

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Installation

Minimal configuration

Input modes

SQLite-specific options

Basic usage

Screen Report

Further reading

Citation

MUSE scores

Copyright

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages