preprocessing.to_model
format_xarray_for_rnn
def format_xarray_for_rnn(
ds: xr.Dataset,
read_from_variable: str = "position_processed",
keypoints: list[str] | None = None
) -> Tuple[np.ndarray, Dict[str, Any]]
Formats the xarray dataset for use VAME's RNN model:
- The x and y coordinates of the centered_reference_keypoint are excluded.
- The x coordinate of the orientation_reference_keypoint is excluded.
- The remaining data is flattened and transposed.
Parameters
- ds (
xr.Dataset
): The xarray dataset to format. - read_from_variable (
str, default="position_processed"
): The variable to read from the dataset. - keypoints (
list[str] | None, optional
): A list of keypoints to include in the output. If None, all keypoints are included. If provided, only the specified keypoints will be included in the output.
Returns
Tuple[np.ndarray, Dict[str, Any]]
: A tuple containing:
- The formatted array in the shape (n_features, n_samples)
- A dictionary with feature provenance and processing information Where n_features = 2 * n_keypoints * n_spaces - 3.