vak.datasets.frame_classification.frames_dataset.FramesDataset#
- class vak.datasets.frame_classification.frames_dataset.FramesDataset(dataset_path: str | Path, dataset_df: DataFrame, input_type: str, split: str, sample_ids: ndarray[Any, dtype[_ScalarType_co]], inds_in_sample: ndarray[Any, dtype[_ScalarType_co]], frame_dur: float, subset: str | None = None, item_transform: Callable | None = None)[source]#
Bases:
object
A dataset class used for neural network models with the frame classification task, where the source data consists of audio signals or spectrograms of varying lengths.
- dataset_path#
Path to directory that represents a frame classification dataset, as created by
vak.prep.prep_frame_classification_dataset()
.- Type:
- subset#
Name of subset to use. If specified, this takes precedence over split. Subsets are typically taken from the training data for use when generating a learning curve.
- Type:
str, optional
- dataset_df#
A frame classification dataset, represented as a
pandas.DataFrame
. This will be only the rows that correspond to eithersubset
orsplit
from thedataset_df
that was passed in when instantiating the class.- Type:
- frames_paths#
Paths to npy files containing frames, either spectrograms or audio signals that are input to the model.
- Type:
- frame_labels_paths#
Paths to npy files containing vectors with a label for each frame. The targets for the outputs of the model.
- Type:
- input_type#
The type of input to the neural network model. One of {‘audio’, ‘spect’}.
- Type:
- sample_ids#
Indexing vector representing which sample from the dataset every frame belongs to.
- Type:
- inds_in_sample#
Indexing vector representing which index within each sample from the dataset that every frame belongs to.
- Type:
- frame_dur#
Duration of a frame, i.e., a single sample in audio or a single timebin in a spectrogram.
- Type:
- item_transform#
Transform applied to each item \((x, y)\) returned by
FramesDataset.__getitem__()
.- Type:
callable, optional
- __init__(dataset_path: str | Path, dataset_df: DataFrame, input_type: str, split: str, sample_ids: ndarray[Any, dtype[_ScalarType_co]], inds_in_sample: ndarray[Any, dtype[_ScalarType_co]], frame_dur: float, subset: str | None = None, item_transform: Callable | None = None)[source]#
Initialize a new instance of a FramesDataset.
- Parameters:
dataset_path (pathlib.Path) – Path to directory that represents a frame classification dataset, as created by
vak.prep.prep_frame_classification_dataset()
.dataset_df (pandas.DataFrame) – A frame classification dataset, represented as a
pandas.DataFrame
.input_type (str) – The type of input to the neural network model. One of {‘audio’, ‘spect’}.
split (str) – The name of a split from the dataset, one of {‘train’, ‘val’, ‘test’}.
sample_ids (numpy.ndarray) – Indexing vector representing which sample from the dataset every frame belongs to.
inds_in_sample (numpy.ndarray) – Indexing vector representing which index within each sample from the dataset that every frame belongs to.
frame_dur (float) – Duration of a frame, i.e., a single sample in audio or a single timebin in a spectrogram.
subset (str, optional) – Name of subset to use. If specified, this takes precedence over split. Subsets are typically taken from the training data for use when generating a learning curve.
item_transform (callable, optional) – Transform applied to each item \((x, y)\) returned by
FramesDataset.__getitem__()
.
Methods
__init__
(dataset_path, dataset_df, ...[, ...])Initialize a new instance of a FramesDataset.
from_dataset_path
(dataset_path[, split, ...])Make a
FramesDataset
instance, given the path to a frame classification dataset.Attributes
duration
shape
- classmethod from_dataset_path(dataset_path: str | Path, split: str = 'val', subset: str | None = None, item_transform: Callable | None = None)[source]#
Make a
FramesDataset
instance, given the path to a frame classification dataset.- Parameters:
dataset_path (pathlib.Path) – Path to directory that represents a frame classification dataset, as created by
vak.prep.prep_frame_classification_dataset()
.split (str) – The name of a split from the dataset, one of {‘train’, ‘val’, ‘test’}.
subset (str, optional) – Name of subset to use. If specified, this takes precedence over split. Subsets are typically taken from the training data for use when generating a learning curve.
item_transform (callable, optional) – Transform applied to each item \((x, y)\) returned by
FramesDataset.__getitem__()
.
- Returns:
frames_dataset
- Return type: