-
Notifications
You must be signed in to change notification settings - Fork 23
Description
Hello,
I'm interested in applying EvoTrees to some of my work, however there's a particular constraint that I need to apply to my dataset that I can't see any easy way to incorporate into EvoTrees.
I'm working with a dataset with many "clumps" of rows, which are highly correlated with each other. When fitting a plain random forest to this data previously we adjusted for this by employing a stratified sampling technique: each round of training we randomly selected one row from each "clump". This preserves the assumption of independence during bootstrapping.
If I'm mistaken and there is a way to acomplish this with EvoTrees (even if it means reaching into internals a bit) please let me know how I might do this, otherwise this is a feature (a custom/stratified bootstrap scheme) that would be tremendously useful to have (for my use case at least :).