GitHub - groupmm/subsequenceSDTW: Accompanying code for the paper "Subsequence SDTW: Differentiable Alignment with Flexible Boundary Conditions", ICASSP 2026

Accompanying code for: Subsequence SDTW: Differentiable Alignment with Flexible Boundary Conditions

Johannes Zeitler (johannes.zeitler@audiolabs-erlangen.de)
International Audio Laboratories Erlangen
February 2026

Overview

This repository contains code to reproduce all experiments in the paper. The main notebooks are:

train_strong.ipynb: training with strongly aligned targets
train_SDTW_noMismatch.ipynb: training with standard SDTW and no boundary mismatch
train_SDTW.ipynb: training with standard SDTW and boundary mismatch
train_subSDTW.ipynb: training with subSDTW and boundary mismatch
train_subSDTW-W.ipynb: training with weighted subSDTW and boundary mismatch
eval.ipynb: compute evaluation metrics

The repository also contains the following files and folders:

data/: open-domain subset of the BPSD. This subset is not sufficient to reproduce the paper results, but it is enough to run the code end to end. Audio is tuning-corrected to A4 = 440 Hz and resampled to 16 kHz FLAC
dataset_weakLabels.py: dataset class for weakly aligned score-audio pairs from the BPSD
midi.py: some helper functions for MIDI parsing
onsets_and_frames/: PyTorch onsets-and-frames implementation from https://github.com/jongwook/onsets-and-frames
prepare_weak_targets.ipynb: pre-computes weak target representations in musical and physical time from the BPSD annotations
pretrained_model.pt: a transcriber pretrained on the MAESTRO dataset
SDTW.py: standard SDTW
subSDTW.py: subsequence SDTW without weight penalty
subSDTW_W.py: subsequence SDTW with weight penalty

Notes

To keep the repository small, we do not include the full training datasets. The MAESTRO (https://magenta.withgoogle.com/datasets/maestro) and BPSD (https://doi.org/10.5281/zenodo.10847702) datasets need to be acquired separately. For the BPSD dataset, we use an audio version that was corrected to A4 = 440 Hz tuning.

If you use this code...

please cite our paper:

Johannes Zeitler and Meinard Müller. Subsequence SDTW: Differentiable Alignment with Flexible Boundary Conditions. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Barcelona, Spain, 2026.
