Dataset

Kaldi-format dataset loading and reference-audio management.

class pathbench.dataset.Dataset(dataset_path, use_reference=False, reference_path=None, reference_type='control', reference_mapping=None)[source]

Bases: object

Handles a speech dataset in a Kaldi-style format.

get_utterances()[source]

Returns a list of utterance IDs.