Skip to content

Custom bootstrapping scheme #287

@tecosaur

Description

@tecosaur

Hello,

I'm interested in applying EvoTrees to some of my work, however there's a particular constraint that I need to apply to my dataset that I can't see any easy way to incorporate into EvoTrees.

I'm working with a dataset with many "clumps" of rows, which are highly correlated with each other. When fitting a plain random forest to this data previously we adjusted for this by employing a stratified sampling technique: each round of training we randomly selected one row from each "clump". This preserves the assumption of independence during bootstrapping.

If I'm mistaken and there is a way to acomplish this with EvoTrees (even if it means reaching into internals a bit) please let me know how I might do this, otherwise this is a feature (a custom/stratified bootstrap scheme) that would be tremendously useful to have (for my use case at least :).

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions