holisticai.datasets.Dataset#
- class holisticai.datasets.Dataset(_data: DataFrame | None = None, _metadata=None, **kargs)[source]#
Represents a dataset.
Parameters
- data: pd.DataFrame
The underlying data of the dataset.
- features: list[str]
The list of features in the dataset.
- num_rows: int
The number of rows in the dataset.
- random_state: np.random.RandomState
The random state used for sampling.
- map(fn, vectorized=True)[source]#
Applies a function to the dataset and returns a new dataset.
Parameters
- fn: function
The function to apply to the dataset.
- vectorized: bool
Whether to apply the function in a vectorized manner or not.