Nested Trees Classifier¶

Abstract of NestedTreesClassifier

Extracted from Ovchinnik, Otero & Freitas (2022), "Nested trees for longitudinal classification".

Longitudinal datasets contain repeated measurements of the same variables at different points in time. Longitudinal data mining algorithms aim to utilize such datasets to extract interesting knowledge and produce useful models. Many existing longitudinal classification methods either dismiss the longitudinal aspect of the data during model construction or produce complex models that are scarcely interpretable. We propose a new longitudinal classification algorithm based on decision trees, named Nested Trees. It utilizes a unique longitudinal model construction method that is fully aware of the longitudinal aspect of the predictive attributes (variables) and constructs tree nodes that make decisions based on a longitudinal attribute as a whole, considering measurements of that attribute across multiple time points. The algorithm was evaluated using 10 classification tasks based on the English Longitudinal Study of Ageing (ELSA) data.

See More In References

What are features_group and non_longitudinal_features?

Two key attributes, features_group and non_longitudinal_features, enable algorithms to interpret the temporal structure of longitudinal data.

features_group: A list of lists where each sublist contains indices of a longitudinal attribute's waves, ordered from oldest to most recent. This captures temporal dependencies.
non_longitudinal_features: A list of indices for static, non-temporal features excluded from the temporal matrix.

Proper setup of these attributes is critical for leveraging temporal patterns effectively.

See More In Temporal Dependency Guide

NestedTreesClassifier ¶

Bases: CustomClassifierMixinEstimator

Nested Trees Classifier for longitudinal data classification.

The Nested Trees Classifier enhances traditional decision tree methods with a two-level, longitudinal-aware construction: the outer tree picks splits on a whole longitudinal attribute (the group of time-specific features that represent repeated measurements of the same variable across waves) instead of on a single feature, and each outer node hosts an inner DecisionTreeClassifier from scikit-learn that partitions the data using only the measurements of that selected attribute across time. This preserves the longitudinal structure during model construction, keeps decisions interpretable (each outer node is labelled by one attribute), and naturally captures temporal patterns and dependencies.

Parameters:

Name	Type	Description	Default
`features_group`	`List[List[int]]`	Temporal matrix of feature indices for longitudinal attributes. Required for longitudinal functionality.	`None`
`non_longitudinal_features`	`List[Union[int, str]]`	Indices of static, non-temporal features. Defaults to None.	`None`
`max_outer_depth`	`int`	Maximum depth of the outer decision tree. Defaults to 3.	`3`
`max_inner_depth`	`int`	Maximum depth of inner decision trees. Defaults to 2.	`2`
`min_outer_samples`	`int`	Minimum samples required to split an outer node. Defaults to 5.	`5`
`inner_estimator_hyperparameters`	`Optional[Dict[str, Any]]`	Hyperparameters for inner decision trees. Defaults to None.	`None`
`class_weight`	`Optional[Union[dict, List[dict], str]]`	Class weights applied to every inner decision tree unless explicitly provided through `inner_estimator_hyperparameters`.	`None`
`save_nested_trees`	`bool`	If True, saves visualizations of the nested structure. Defaults to False.	`False`
`parallel`	`bool`	Enables parallel processing for fitting inner trees. Defaults to False.	`False`
`num_cpus`	`int`	Number of CPUs for parallel processing (-1 uses all available). Defaults to -1.	`-1`

Attributes:

Name	Type	Description
`root`	`Node`	Root node of the outer tree, initialized as None and set during fitting.
`classes_`	`ndarray`	Unique class labels, set during fitting.

Examples:

Basic Usage

from sklearn.metrics import accuracy_score
from scikit_longitudinal.estimators.ensemble import NestedTreesClassifier
import numpy as np
from scikit_longitudinal.data_preparation import LongitudinalDataset

# Load dataset
dataset = LongitudinalDataset('./stroke_longitudinal.csv')
dataset.load_data()
dataset.load_target(target_column="stroke_w2")
dataset.setup_features_group("elsa")
dataset.load_train_test_split(test_size=0.2, random_state=42)

clf = NestedTreesClassifier(features_group=dataset.feature_groups())
clf.fit(dataset.X_train, dataset.y_train)
y_pred = clf.predict(dataset.X_test)
print(f"Accuracy: {accuracy_score(dataset.y_test, y_pred)}")

Advanced: customising inner tree hyperparameters

# ... Similar setup as above ...

inner_params = {'criterion': 'gini', 'max_depth': 3}
clf = NestedTreesClassifier(
    features_group=features_group,
    non_longitudinal_features=non_longitudinal_features,
    inner_estimator_hyperparameters=inner_params
)
clf.fit(X, y)

# ... Similar prediction and evaluation as above ...

Source code in scikit_longitudinal/estimators/ensemble/nested_trees/nested_trees.py

class NestedTreesClassifier(CustomClassifierMixinEstimator):
    """
    Nested Trees Classifier for longitudinal data classification.

    The Nested Trees Classifier enhances traditional decision tree methods with a two-level, longitudinal-aware
    construction: the outer tree picks splits on a **whole longitudinal attribute** (the group of time-specific
    features that represent repeated measurements of the same variable across waves) instead of on a single
    feature, and each outer node hosts an inner `DecisionTreeClassifier` from scikit-learn that partitions the
    data using only the measurements of that selected attribute across time. This preserves the longitudinal
    structure during model construction, keeps decisions interpretable (each outer node is labelled by one
    attribute), and naturally captures temporal patterns and dependencies.

    Args:
        features_group (List[List[int]], optional):
            Temporal matrix of feature indices for longitudinal attributes. Required for longitudinal functionality.
        non_longitudinal_features (List[Union[int, str]], optional):
            Indices of static, non-temporal features. Defaults to None.
        max_outer_depth (int, optional):
            Maximum depth of the outer decision tree. Defaults to 3.
        max_inner_depth (int, optional):
            Maximum depth of inner decision trees. Defaults to 2.
        min_outer_samples (int, optional):
            Minimum samples required to split an outer node. Defaults to 5.
        inner_estimator_hyperparameters (Optional[Dict[str, Any]], optional):
            Hyperparameters for inner decision trees. Defaults to None.
        class_weight (Optional[Union[dict, List[dict], str]], optional):
            Class weights applied to every inner decision tree unless explicitly provided through
            `inner_estimator_hyperparameters`.
        save_nested_trees (bool, optional):
            If True, saves visualizations of the nested structure. Defaults to False.
        parallel (bool, optional):
            Enables parallel processing for fitting inner trees. Defaults to False.
        num_cpus (int, optional):
            Number of CPUs for parallel processing (-1 uses all available). Defaults to -1.

    Attributes:
        root (Node, optional):
            Root node of the outer tree, initialized as None and set during fitting.
        classes_ (np.ndarray):
            Unique class labels, set during fitting.

    Examples:
        !!! example "Basic Usage"

            ```python
            from sklearn.metrics import accuracy_score
            from scikit_longitudinal.estimators.ensemble import NestedTreesClassifier
            import numpy as np
            from scikit_longitudinal.data_preparation import LongitudinalDataset

            # Load dataset
            dataset = LongitudinalDataset('./stroke_longitudinal.csv')
            dataset.load_data()
            dataset.load_target(target_column="stroke_w2")
            dataset.setup_features_group("elsa")
            dataset.load_train_test_split(test_size=0.2, random_state=42)

            clf = NestedTreesClassifier(features_group=dataset.feature_groups())
            clf.fit(dataset.X_train, dataset.y_train)
            y_pred = clf.predict(dataset.X_test)
            print(f"Accuracy: {accuracy_score(dataset.y_test, y_pred)}")
            ```

        !!! example "Advanced: customising inner tree hyperparameters"

            ```python
            # ... Similar setup as above ...

            inner_params = {'criterion': 'gini', 'max_depth': 3}
            clf = NestedTreesClassifier(
                features_group=features_group,
                non_longitudinal_features=non_longitudinal_features,
                inner_estimator_hyperparameters=inner_params
            )
            clf.fit(X, y)

            # ... Similar prediction and evaluation as above ...
            ```
    """

    # pylint: disable=too-many-arguments,too-many-positional-arguments,invalid-name,signature-differs,no-member
    def __init__(
        self,
        features_group: List[List[int]] = None,
        non_longitudinal_features: List[Union[int, str]] = None,
        max_outer_depth: int = 3,
        max_inner_depth: int = 2,
        min_outer_samples: int = 5,
        inner_estimator_hyperparameters: Optional[Dict[str, Any]] = None,
        class_weight: Optional[Union[dict, List[dict], str]] = None,
        save_nested_trees: bool = False,
        parallel: bool = False,
        num_cpus: int = -1,
    ):
        self.features_group = features_group
        self.non_longitudinal_features = non_longitudinal_features
        self.max_outer_depth = max_outer_depth
        self.max_inner_depth = max_inner_depth
        self.min_outer_samples = min_outer_samples
        self.inner_estimator_hyperparameters = inner_estimator_hyperparameters
        self.class_weight = class_weight
        self.save_nested_trees = save_nested_trees
        self.root = None
        self.parallel = parallel
        self.num_cpus = num_cpus
        self.classes_ = None
        self._predict_proba_single_classes_ = None

        if max_outer_depth <= 0:
            raise ValueError("max_outer_depth must be greater than 0.")

        if max_inner_depth <= 0:
            raise ValueError("max_inner_depth must be greater than 0.")

        if min_outer_samples <= 0:
            raise ValueError("min_outer_samples must be greater than 0.")

    class Node:
        """A node in the outer decision tree of the Nested Trees Classifier.

        Each node in the outer tree contains an inner decision tree. Leaf nodes
        are associated with a class label, while non-leaf nodes have a decision
        criterion based on the inner decision tree.

        Args:
            is_leaf (bool):
                Determines if the node is a leaf node. If True, the node represents a class label and has access to the
                tree associated with the last step it took to construct the leaf node; otherwise, it is an internal node
                with a decision criterion based on the inner decision tree.
            tree (DecisionTreeClassifier):
                A Scikit-learn DecisionTreeClassifier instance representing the inner decision tree for this node.
            node_name (str):
                A unique name for the node, used for visualization and debugging purposes.
            group (Optional[List[int]]):
                A list of feature group indices used to split the data in this node.
                Defaults to None.

        Attributes:
            is_leaf (bool):
                Indicates whether the node is a leaf node.
            tree (DecisionTreeClassifier):
                The inner decision tree associated with this node.
            children (List[Node]):
                A list of child nodes of this node in the outer decision tree.
            children_map (Dict[str, Node]):
                A dictionary mapping the name of each child node to the corresponding Node instance.
            node_name (str):
                The unique name of this node.
            group (Optional[List[int]]):
                A list of feature group indices used to split the data in this node.

        Raises:
            ValueError:
                If tree is not provided, or if node_name is an empty string.

        Examples:
            >>> from sklearn.tree import DecisionTreeClassifier
            >>> inner_tree = DecisionTreeClassifier()
            >>> node = Node(is_leaf=False, tree=inner_tree, node_name="dummy_node")

        """

        def __init__(
            self,
            is_leaf: bool,
            tree: DecisionTreeClassifier,
            node_name: str,
            group: Optional[List[int]] = None,
        ):
            if tree is None:
                raise ValueError("tree must be provided for (non-)leaf nodes.")

            if not node_name:
                raise ValueError("node_name must be a non-empty string.")

            self.is_leaf = is_leaf
            self.tree = tree
            self.children = []
            self.children_map = {}
            self.node_name = node_name
            self.group = group

        def __str__(self):
            return self.node_name

    def _get_inner_tree_hyperparameters(self) -> Dict[str, Any]:
        """Return the hyperparameters for inner decision trees with class weights applied."""

        params = dict(self.inner_estimator_hyperparameters or {})
        if "class_weight" not in params and self.class_weight is not None:
            params["class_weight"] = self.class_weight
        return params

    @override
    def _fit(
        self, X: np.ndarray, y: np.ndarray, sample_weight: Optional[np.ndarray] = None
    ) -> "NestedTreesClassifier":
        """Fit the classifier to the training data.

        Builds the nested tree structure recursively, integrating longitudinal and non-longitudinal features.

        Args:
            X (np.ndarray): Training input samples.
            y (np.ndarray): Target class labels.

        Returns:
            NestedTreesClassifier: Fitted classifier instance.

        Raises:
            ValueError: If `features_group` has fewer than 2 groups.

        !!! tip "Tuning Advice"
            Increase `max_outer_depth` for complex datasets, but monitor for overfitting with validation data.
        """
        if self.non_longitudinal_features is not None:
            self.features_group.append(self.non_longitudinal_features)
        if self.features_group is not None:
            self.features_group = clean_padding(self.features_group)

        if not self.features_group or len(self.features_group) <= 1:
            raise ValueError("features_group must be greater than 1.")
        if self.inner_estimator_hyperparameters is None:
            self.inner_estimator_hyperparameters = {}
        if self.classes_ is None:
            self.classes_ = unique_labels(y)
        self.root = self._build_outer_tree(
            X, y, 0, "outer_root", sample_weight=sample_weight
        )
        return self

    @override
    def _predict(self, X: np.ndarray) -> np.ndarray:
        """Predict class labels for input samples.

        Traverses the nested tree structure to assign labels based on outer and inner tree decisions.

        Args:
            X (np.ndarray): Input samples.

        Returns:
            np.ndarray: Predicted class labels.

        Raises:
            ValueError: If the classifier isn’t fitted (root is None).

        !!! tip "Quick Predictions"
            After fitting, use this method to generate predictions efficiently leveraging the nested structure.
        """
        if self.root is None:
            raise ValueError("The classifier must be fitted before making predictions.")
        return np.array([self._predict_single(x) for x in X])

    @override
    def _predict_proba(self, X: np.ndarray) -> np.ndarray:
        """Predict class probabilities for input samples.

        Provides probability estimates by traversing the nested structure and aligning each leaf-local probability
        vector to the global `classes_` order learned during fitting.

        Args:
            X (np.ndarray): Input samples.

        Returns:
            np.ndarray: Predicted class probabilities.

        Raises:
            ValueError: If the classifier isn’t fitted.
        """
        if self.root is None:
            raise ValueError("The classifier must be fitted before making predictions.")

        result = []
        for x in X:
            probas = self._predict_proba_single(x)
            aligned_probas = np.zeros(len(self.classes_), dtype=float)
            leaf_classes = self._predict_proba_single_classes_
            class_to_index = {label: index for index, label in enumerate(self.classes_)}

            for local_index, label in enumerate(leaf_classes):
                aligned_probas[class_to_index[label]] = probas[0, local_index]

            result.append(aligned_probas)

        return np.array(result)

    def _build_outer_tree(
        self,
        X: np.ndarray,
        y: np.ndarray,
        depth: int,
        outer_node_name: str,
        tree: Optional[DecisionTreeClassifier] = None,
        group: Optional[List[int]] = None,
        sample_weight: Optional[np.ndarray] = None,
    ) -> "NestedTreesClassifier.Node":
        """Build the outer decision tree recursively.

        The method starts at each node, competing between all possible inner decision trees.
        The best inner decision tree is chosen to split the data based on Gini impurity.
        The best inner decision tree's leaves are then used as the children of the current
        node in the outer decision tree, creating N (number of leaf nodes of the inner decision tree)
        children outer nodes in the outer decision tree.

        The max_outer_depth parameter determines the maximum depth of the outer decision tree.
        A minimum of 2 groups is required to process an outer node. The min_outer_samples parameter sets the
        minimum number of samples required to split an outer node.

        Args:
            X (np.ndarray):
                The training input samples.
            y (np.ndarray):
                The target values (class labels).
            depth (int):
                The current depth of the outer tree being built.
            outer_node_name (str):
                A unique name for the current node in the outer decision tree.
            tree (Optional[DecisionTreeClassifier], optional):
                The inner decision tree associated with this node. Defaults to None.
            group (Optional[List[int]], optional):
                The group of features associated with this node. Defaults to None.
            sample_weight (Optional[np.ndarray], optional):
                Sample weights for the training instances. Defaults to None.

        Returns:
            NestedTreesClassifier.Node: A node in the outer decision tree.

        """
        if (
            depth == (self.max_outer_depth - 1)
            or len(self.features_group) < 2
            or len(X) < self.min_outer_samples
        ):
            return self.Node(
                is_leaf=True, tree=tree, node_name=outer_node_name, group=group
            )

        best_tree, best_split, best_group = self._find_best_tree_and_split(
            X, y, outer_node_name, sample_weight
        )

        if len(best_split) == 1:
            return self.Node(
                is_leaf=True,
                node_name=outer_node_name,
                tree=best_tree,
                group=best_group,
            )

        node = self.Node(
            is_leaf=False, tree=best_tree, node_name=outer_node_name, group=best_group
        )
        self._add_children_to_node(node, best_split, depth)
        return node

    def _find_best_tree_and_split(
        self,
        X: np.ndarray,
        y: np.ndarray,
        outer_node_name: str,
        sample_weight: Optional[np.ndarray] = None,
    ) -> Tuple[
        DecisionTreeClassifier, List[Tuple[np.ndarray, np.ndarray, int]], List[int]
    ]:
        """Find the best inner decision tree and the associated split (i.e., the competition).

        This method evaluates all possible inner decision trees and selects the one that results
        in the lowest Gini impurity. The method can be parallelised for faster computation.

        Args:
            X (np.ndarray):
                The training input samples.
            y (np.ndarray):
                The target values (class labels).
            outer_node_name (str):
                A unique name for the current node in the outer decision tree.

        Returns:
            Tuple[DecisionTreeClassifier, List[Tuple[np.ndarray, np.ndarray, int]], List[int]]:
                A tuple containing the best inner decision tree, the associated split, and the best feature group.

        """
        ray = get_ray_for_parallel(self.parallel, self.num_cpus)
        min_gini = float("inf")
        best_tree = None
        subset_X = None
        best_group = None

        inner_tree_params = self._get_inner_tree_hyperparameters()

        if self.parallel and ray is not None:
            if sample_weight is not None:
                raise ValueError(
                    "Sample weights are not supported in parallel mode. Please set parallel=False."
                )
            parallel_fit = ray.remote(_fit_inner_tree_plus_calculate_gini_ray)
            tasks = [  # pragma: no cover
                parallel_fit.remote(
                    X[:, group],
                    y,
                    i,
                    outer_node_name,
                    self.max_inner_depth,
                    inner_tree_params,
                    self.save_nested_trees,
                    group,
                )
                for i, group in enumerate(self.features_group)
            ]
            results = ray.get(tasks)  # pragma: no cover
            best_tree, _, min_gini, _, subset_X, best_group = min(
                results, key=lambda x: x[2]
            )  # pragma: no cover
        else:
            for i, group in enumerate(self.features_group):
                subset_X_temp = X[:, group]
                tree, _, gini = _fit_inner_tree_and_calculate_gini(
                    subset_X_temp,
                    y,
                    i,
                    outer_node_name,
                    self.max_inner_depth,
                    inner_tree_params,
                    self.save_nested_trees,
                    sample_weight=sample_weight,
                )

                if gini < min_gini:
                    min_gini = gini
                    best_tree = tree
                    subset_X = subset_X_temp
                    best_group = group

        best_split = self._create_split(X, subset_X, y, best_tree)
        return best_tree, best_split, best_group

    def _add_children_to_node(
        self,
        node: "NestedTreesClassifier.Node",
        best_split: List[Tuple[np.ndarray, np.ndarray, int]],
        depth: int,
    ) -> None:
        """Add children to a node in the outer decision tree based on the best split.

        Args:
            node (NestedTreesClassifier.Node):
                The node in the outer decision tree to add children to.
            best_split (List[Tuple[np.ndarray, np.ndarray, int]]):
                The best split of the data, represented as a list of tuples with (X subset, y subset, leaf number).
            depth (int):
                The current depth of the node in the outer decision tree.

        """
        for i, (subset_X, subset_y, leaf_number) in enumerate(best_split):
            child_node_name = f"outer_{node.node_name}_d{depth + 1}_g{i}_l{leaf_number}"
            child_node = self._build_outer_tree(
                subset_X, subset_y, depth + 1, child_node_name, node.tree, node.group
            )
            node.children.append(child_node)
            node.children_map[leaf_number] = child_node

    def _create_split(
        self,
        X: np.ndarray,
        subset_X: np.ndarray,
        y: np.ndarray,
        tree: DecisionTreeClassifier,
    ) -> List[Tuple[np.ndarray, np.ndarray, int]]:
        """Create a split of the data based on the leaf nodes of the decision tree.

        Args:
            X (np.ndarray):
                The original feature matrix. subset_X (np.ndarray): The feature matrix for the current group.
            y (np.ndarray):
                The target labels.
            tree (DecisionTreeClassifier):
                The decision tree used for creating the split.

        Returns:
            List[Tuple[np.ndarray, np.ndarray, int]]:
                A list of tuples representing the split data, with each tuple containing:
                    * X subset corresponding to a leaf node
                    * y subset corresponding to a leaf node
                    * Leaf number

        """
        leaves = tree.apply(subset_X)
        unique_leaves = np.unique(leaves)
        return [(X[leaves == leaf], y[leaves == leaf], leaf) for leaf in unique_leaves]

    def _predict_single(self, x: np.ndarray) -> int:
        """Predict the class label for a single input sample.

        Args:
            x (np.ndarray):
                The input sample.

        Returns:
            int: The predicted class label for the input sample.

        """
        node = self.root
        leaf_subset = None

        while not node.is_leaf:
            subset_x = x[node.group]
            next_node_leaf_number = node.tree.apply([subset_x])[0]

            node = node.children_map[next_node_leaf_number]
            leaf_subset = x[node.group]

        return node.tree.predict(leaf_subset.reshape(1, -1))[0]

    def _predict_proba_single(self, x: np.ndarray) -> np.ndarray:
        """Predict the class probabilities for a single input sample.

        Args:
            x (np.ndarray): The input sample.

        Returns:
            np.ndarray: The predicted class probabilities for the input sample.

        """
        node = self.root
        leaf_subset = None

        while not node.is_leaf:
            subset_x = x[node.group]
            next_node_leaf_number = node.tree.apply([subset_x])[0]

            node = node.children_map[next_node_leaf_number]
            leaf_subset = x[node.group]

        self._predict_proba_single_classes_ = node.tree.classes_
        return node.tree.predict_proba(leaf_subset.reshape(1, -1))

    def print_nested_tree(
        self,
        node: Optional["NestedTreesClassifier.Node"] = None,
        depth: int = 0,
        prefix: str = "",
        parent_name: str = "",
    ) -> None:
        """Print the nested tree structure for interpretation.

        Args:
            node (Optional[Node]): Starting node (defaults to root if None).
            depth (int): Current depth (defaults to 0).
            prefix (str): String to prepend to node names (defaults to "").
            parent_name (str): Parent node name (defaults to "").

        !!! tip "Debugging Aid"
            Use this to visualize the tree hierarchy and verify model construction.
            Careful, it could be very verbose for large trees.
        """
        if node is None:
            node = self.root

        node_name_parts = node.node_name.split("_")
        unique_node_name_parts = _remove_consecutive_duplicates(node_name_parts)
        node_name = "_".join(unique_node_name_parts)

        if parent_name:
            node_name = node_name.replace(f"{parent_name}_", "")

        if node.is_leaf:
            print(f"{prefix}* Leaf {depth}: {node_name}")
        else:
            print(f"{prefix}* Node {depth}: {node_name}")
            for child in node.children:
                self.print_nested_tree(child, depth + 1, f"{prefix}  ", node_name)

`fit(X, y=None, sample_weight=None)` ¶

Fit the classifier to the training data.

Validates X (and y when provided) with scikit-learn's check_X_y / check_array and then delegates to the subclass implementation in _fit. sample_weight is forwarded only when the subclass's _fit declares it.

Parameters:

Name	Type	Description	Default
`X`	`ndarray`	Training input samples of shape `(n_samples, n_features)`.	required
`y`	`ndarray`	Target class labels of shape `(n_samples,)`.	`None`
`sample_weight`	`ndarray`	Per-sample weights of shape `(n_samples,)`. Forwarded to `_fit` only when supported.	`None`

Returns:

Name	Type	Description
`CustomClassifierMixinEstimator`	`CustomClassifierMixinEstimator`	The fitted estimator (`self`).

Source code in scikit_longitudinal/templates/custom_classifier_mixin_estimator.py

@final
def fit(
    self, X: np.ndarray, y: np.ndarray = None, sample_weight: np.ndarray = None
) -> "CustomClassifierMixinEstimator":
    """Fit the classifier to the training data.

    Validates ``X`` (and ``y`` when provided) with scikit-learn's
    ``check_X_y`` / ``check_array`` and then delegates to the subclass
    implementation in ``_fit``. ``sample_weight`` is forwarded only when
    the subclass's ``_fit`` declares it.

    Args:
        X (np.ndarray):
            Training input samples of shape ``(n_samples, n_features)``.
        y (np.ndarray, optional):
            Target class labels of shape ``(n_samples,)``.
        sample_weight (np.ndarray, optional):
            Per-sample weights of shape ``(n_samples,)``. Forwarded to
            ``_fit`` only when supported.

    Returns:
        CustomClassifierMixinEstimator: The fitted estimator (``self``).
    """
    if y is None:
        return self._check_array_decorator(self._fit)(X)
    _fit_sig = inspect.signature(self._fit)
    if "sample_weight" in _fit_sig.parameters:
        return self._check_X_y_decorator(self._fit)(
            X, y, sample_weight=sample_weight
        )
    else:
        return self._check_X_y_decorator(self._fit)(X, y)

`predict(X)` ¶

Predict class labels for the input samples.

Validates X with scikit-learn's check_array and delegates to the subclass implementation in _predict.

Parameters:

Name	Type	Description	Default
`X`	`ndarray`	Input samples of shape `(n_samples, n_features)`.	required

Returns:

Type	Description
`ndarray`	np.ndarray: Predicted class labels of shape `(n_samples,)`.

Source code in scikit_longitudinal/templates/custom_classifier_mixin_estimator.py

@final
def predict(self, X: np.ndarray) -> np.ndarray:
    """Predict class labels for the input samples.

    Validates ``X`` with scikit-learn's ``check_array`` and delegates to
    the subclass implementation in ``_predict``.

    Args:
        X (np.ndarray):
            Input samples of shape ``(n_samples, n_features)``.

    Returns:
        np.ndarray: Predicted class labels of shape ``(n_samples,)``.
    """
    return self._check_array_decorator(self._predict)(X)

`predict_proba(X)` ¶

Predict class probabilities for the input samples.

Validates X with scikit-learn's check_array and delegates to the subclass implementation in _predict_proba.

Parameters:

Name	Type	Description	Default
`X`	`ndarray`	Input samples of shape `(n_samples, n_features)`.	required

Returns:

Type	Description
`ndarray`	np.ndarray: Class probabilities of shape `(n_samples, n_classes)`,
`ndarray`	with columns ordered as in `self.classes_`.

Source code in scikit_longitudinal/templates/custom_classifier_mixin_estimator.py

@final
def predict_proba(self, X: np.ndarray) -> np.ndarray:
    """Predict class probabilities for the input samples.

    Validates ``X`` with scikit-learn's ``check_array`` and delegates to
    the subclass implementation in ``_predict_proba``.

    Args:
        X (np.ndarray):
            Input samples of shape ``(n_samples, n_features)``.

    Returns:
        np.ndarray: Class probabilities of shape ``(n_samples, n_classes)``,
        with columns ordered as in ``self.classes_``.
    """
    return self._check_array_decorator(self._predict_proba)(X)

`print_nested_tree(node=None, depth=0, prefix='', parent_name='')` ¶

Print the nested tree structure for interpretation.

Parameters:

Name	Type	Description	Default
`node`	`Optional[Node]`	Starting node (defaults to root if None).	`None`
`depth`	`int`	Current depth (defaults to 0).	`0`
`prefix`	`str`	String to prepend to node names (defaults to "").	`''`
`parent_name`	`str`	Parent node name (defaults to "").	`''`

Debugging Aid

Use this to visualize the tree hierarchy and verify model construction. Careful, it could be very verbose for large trees.

Source code in scikit_longitudinal/estimators/ensemble/nested_trees/nested_trees.py

def print_nested_tree(
    self,
    node: Optional["NestedTreesClassifier.Node"] = None,
    depth: int = 0,
    prefix: str = "",
    parent_name: str = "",
) -> None:
    """Print the nested tree structure for interpretation.

    Args:
        node (Optional[Node]): Starting node (defaults to root if None).
        depth (int): Current depth (defaults to 0).
        prefix (str): String to prepend to node names (defaults to "").
        parent_name (str): Parent node name (defaults to "").

    !!! tip "Debugging Aid"
        Use this to visualize the tree hierarchy and verify model construction.
        Careful, it could be very verbose for large trees.
    """
    if node is None:
        node = self.root

    node_name_parts = node.node_name.split("_")
    unique_node_name_parts = _remove_consecutive_duplicates(node_name_parts)
    node_name = "_".join(unique_node_name_parts)

    if parent_name:
        node_name = node_name.replace(f"{parent_name}_", "")

    if node.is_leaf:
        print(f"{prefix}* Leaf {depth}: {node_name}")
    else:
        print(f"{prefix}* Node {depth}: {node_name}")
        for child in node.children:
            self.print_nested_tree(child, depth + 1, f"{prefix}  ", node_name)

Nested Trees Classifier¶

NestedTreesClassifier ¶

fit(X, y=None, sample_weight=None) ¶

predict(X) ¶

predict_proba(X) ¶

print_nested_tree(node=None, depth=0, prefix='', parent_name='') ¶

`fit(X, y=None, sample_weight=None)` ¶

`predict(X)` ¶

`predict_proba(X)` ¶

`print_nested_tree(node=None, depth=0, prefix='', parent_name='')` ¶