Dataset
Kaldi-format dataset loading and reference-audio management.
class pathbench.dataset.Dataset(dataset_path, use_reference=False, reference_path=None, reference_type='control', reference_mapping=None)
Bases: object
Handles a speech dataset in a Kaldi-style format.
get_utterances()
Returns a list of utterance IDs.
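
To illustrate what a list of utterance IDs means for a Kaldi-style dataset, here is a minimal standalone sketch. It is not pathbench's implementation: the helper read_utterance_ids is hypothetical, and it assumes the dataset directory contains a Kaldi wav.scp file, where each line starts with an utterance ID followed by an audio path or command. Which files Dataset actually reads is not stated here.

```python
import tempfile
from pathlib import Path


def read_utterance_ids(dataset_path):
    """Hypothetical helper: return utterance IDs from <dataset_path>/wav.scp, in file order."""
    ids = []
    with open(Path(dataset_path) / "wav.scp", encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            # Kaldi wav.scp lines look like "<utterance-id> <path-or-command>";
            # the ID is the first whitespace-separated token.
            ids.append(line.split(maxsplit=1)[0])
    return ids


# Build a tiny throwaway dataset directory to exercise the helper.
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "wav.scp").write_text(
        "spk1_utt1 /audio/a.wav\nspk1_utt2 /audio/b.wav\n", encoding="utf-8"
    )
    utterance_ids = read_utterance_ids(tmp)
    print(utterance_ids)
```

With the real class, the equivalent call would presumably be Dataset(dataset_path).get_utterances(), using the constructor signature shown above.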