pjml.data.evaluation.split.Split

class pjml.data.evaluation.split.Split(split_type: str = 'holdout', partitions: int = 2, partition: int = 0, test_size: float = 0.3, seed: int = 0, fields: str = 'X, Y', **kwargs)[source]

Split a given Data field into a training/apply set and a testing/use set.

Developer note: new metrics can be added by following the naming pattern ‘_fun_xxxxx’, where xxxxx is the name of the new metric.

Parameters:
train_indexes

Indexes of rows to get from data objects during apply().

test_indexes

Indexes of rows to get from data objects during use().

fields

Names of the matrices to be modified.
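
To make the relationship between test_size, seed, and the train/test row indexes concrete, here is a minimal sketch of a holdout split in plain Python. It is not pjml's implementation: the helper names holdout_indexes and take_rows are hypothetical, and the shuffling strategy is an assumption chosen only for illustration.

```python
import random


def holdout_indexes(n_rows, test_size=0.3, seed=0):
    """Return (train_indexes, test_indexes) for a shuffled holdout split.

    Hypothetical helper: illustrates the idea, not pjml's actual code.
    """
    rng = random.Random(seed)          # fixed seed makes the split reproducible
    indexes = list(range(n_rows))
    rng.shuffle(indexes)
    n_test = round(n_rows * test_size)  # number of rows reserved for testing
    test_indexes = sorted(indexes[:n_test])
    train_indexes = sorted(indexes[n_test:])
    return train_indexes, test_indexes


def take_rows(matrix, indexes):
    """Select the given rows from a list-of-rows matrix (or a label list)."""
    return [matrix[i] for i in indexes]


# Toy stand-ins for the 'X' and 'Y' fields.
X = [[i, i * 2] for i in range(10)]
Y = [i % 2 for i in range(10)]

train_idx, test_idx = holdout_indexes(len(X), test_size=0.3, seed=0)
X_train, Y_train = take_rows(X, train_idx), take_rows(Y, train_idx)  # apply() side
X_test, Y_test = take_rows(X, test_idx), take_rows(Y, test_idx)      # use() side
```

The same indexes are applied to every listed field, so the rows of X and Y stay aligned after the split.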

__init__(split_type: str = 'holdout', partitions: int = 2, partition: int = 0, test_size: float = 0.3, seed: int = 0, fields: str = 'X, Y', **kwargs)[source]

Initialize self. See help(type(self)) for accurate signature.

Methods

__init__(split_type, partitions, partition, …) Initialize self.
default_config() Create a copy of the component default configuration.
disable_pretty_printing() Disable the pretty-printing.
dual_transform(train, …)
enable_pretty_printing() Enable the pretty-printing.
updated(**kwargs) Clone this component, optionally replacing given params.

Attributes

cfserialized
cfuuid UUID excluding ‘model’ and ‘enhance’ flags.
component
cs1 Convert component into a config space with a single component inside it.
enhancer
id Short UUID: first 8 chars of uuid, usually for printing purposes.
jsonable
longname
model
name
path
pretty_printing
serialized
sid Short UUID: first 6 chars of uuid, usually for printing purposes.
unwrap Subpipeline inside the first Wrap().
uuid Lazily calculated unique identifier for this dataset.
wrapped Same as unwrap(), but with the external container Wrap.
cfuuid[source]

UUID excluding ‘model’ and ‘enhance’ flags. Identifies the transformer.

cs1[source]

Convert component into a config space with a single component inside it.

classmethod default_config() → Dict[str, Any][source]

Create a copy of the component default configuration.

Returns:
dict

Copy of the component default configuration.

disable_pretty_printing()[source]

Disable the pretty-printing.

enable_pretty_printing()[source]

Enable the pretty-printing.

id[source]

Short UUID: the first 8 chars of uuid, usually for printing purposes. The first collision is expected after 12671943 combinations.

sid[source]

Short UUID: the first 6 chars of uuid, usually for printing purposes.

unwrap[source]

Subpipeline inside the first Wrap().

This assumes the pipeline contains only one Wrap. The search is performed depth-first.

Examples

>>> pipe = Pipeline(
...     File(name='iris.arff'),
...     Wrap(Std(), SVMC()),
...     Metric(function='accuracy')
... )
>>> pipe.unwrap  # -> Chain(Std(), SVMC())
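
The depth-first search mentioned above can be sketched with toy classes. Everything here is hypothetical (the Component and Wrap stand-ins and the find_first_wrap function are not pjml's classes); it only illustrates how a depth-first traversal locates the first Wrap and exposes what it contains.

```python
class Component:
    """Toy stand-in for a pipeline component (hypothetical, not pjml's class)."""

    def __init__(self, name, *children):
        self.name = name
        self.children = list(children)


class Wrap(Component):
    """Toy container marking the subpipeline of interest."""

    def __init__(self, *children):
        super().__init__("Wrap", *children)


def find_first_wrap(component):
    """Depth-first search: return the first Wrap found, or None."""
    if isinstance(component, Wrap):
        return component
    for child in component.children:
        found = find_first_wrap(child)  # fully explore this child before the next
        if found is not None:
            return found
    return None


pipe = Component(
    "Pipeline",
    Component("File"),
    Wrap(Component("Std"), Component("SVMC")),
    Component("Metric"),
)

wrap = find_first_wrap(pipe)
inner_names = [c.name for c in wrap.children]  # names of the wrapped components
```

In this sketch, returning the children corresponds to unwrap, while returning the Wrap object itself corresponds to wrapped.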
updated(**kwargs)[source]

Clone this component, optionally replacing given params.

Returns:
A ready-to-use component.
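
As a rough illustration of the clone-and-replace behavior, the sketch below uses a hypothetical ToyComponent class that stores its parameters in a dict. It is an assumption about the general pattern, not a description of how pjml implements updated().

```python
import copy


class ToyComponent:
    """Hypothetical component that keeps its parameters in a config dict."""

    def __init__(self, **config):
        self.config = dict(config)

    def updated(self, **kwargs):
        """Return a new component with the given params replaced."""
        new_config = copy.deepcopy(self.config)  # never mutate the original
        new_config.update(kwargs)
        return type(self)(**new_config)


base = ToyComponent(split_type="holdout", test_size=0.3, seed=0)
changed = base.updated(test_size=0.2)

# The original is untouched; only the requested parameter differs in the clone.
print(base.config["test_size"], changed.config["test_size"])
```

Returning a fresh object instead of modifying the existing one keeps components safe to share between pipelines.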
uuid

Lazily calculated unique identifier for this dataset.

Should be accessed directly as a class member: ‘uuid’.

Returns:
A unique identifier UUID object.
wrapped[source]

Same as unwrap(), but with the external container Wrap.