Some minor differences in random forest implementations

I've recently been comparing several random forest implementations (https://github.com/tecosaur/TreeComparison). One result of that comparison is #159, but I also have some other information that may be of interest.

For starters, here's the colour coding I use:
![image](https://user-images.githubusercontent.com/20903656/167845522-788d48cd-65f8-49d2-8261-38aa4ed15ce6.png)

Error rates mostly converge across the implementations I tested, though ranger sometimes does slightly better:
![image](https://user-images.githubusercontent.com/20903656/167838710-71e1f3eb-4bf8-4bdb-8008-7947e76bee64.png)

![image](https://user-images.githubusercontent.com/20903656/167844828-1ca81b89-2024-4088-90d6-7f362fa9f650.png)

Precision-recall and ROC curves generally look near-identical, as they should.
![image](https://user-images.githubusercontent.com/20903656/167844747-51574fcd-77c2-46a1-975b-b929a14c67b4.png)

I've also noticed some larger differences in the depth and size of the trees built. Across a number of datasets, DecisionTree.jl and randomForest produce narrower, deeper trees than ranger and sklearn.

![image](https://user-images.githubusercontent.com/20903656/167839004-f09f0ca6-44b4-4d44-8ef7-a84a685c4b4f.png)

![image](https://user-images.githubusercontent.com/20903656/167844579-2c631e5e-e45e-4067-82c1-2ce903bcf914.png)

![image](https://user-images.githubusercontent.com/20903656/167844890-0737c47b-8d3b-4e7e-abe1-1bf2362826c8.png)

![image](https://user-images.githubusercontent.com/20903656/167845107-a6276427-f35d-47b8-a23b-df44c5891422.png)
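
To make "narrower/deeper" concrete, here is a minimal sketch of how depth and leaf count can be measured on a binary tree. The `Node` class and both helper functions are hypothetical and written for illustration only. They are not taken from any of the compared libraries, and the exact metric definitions used in the plots above may differ (for example, depth counted in nodes rather than edges).

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Node:
    """Hypothetical binary tree node; a node with no children is a leaf."""
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def depth(node: Optional[Node]) -> int:
    """Number of edges on the longest root-to-leaf path (assumed definition)."""
    if node is None or (node.left is None and node.right is None):
        return 0
    return 1 + max(depth(node.left), depth(node.right))


def leaf_count(node: Optional[Node]) -> int:
    """Number of leaves, used here as a simple proxy for tree size."""
    if node is None:
        return 0
    if node.left is None and node.right is None:
        return 1
    return leaf_count(node.left) + leaf_count(node.right)


# Two trees with the same number of leaves but different shapes.
balanced = Node(Node(Node(), Node()), Node(Node(), Node()))
chain = Node(Node(), Node(Node(), Node(Node(), Node())))

print(depth(balanced), leaf_count(balanced))
print(depth(chain), leaf_count(chain))
```

Both example trees have the same number of leaves, but the chain-shaped one is deeper, which is the kind of shape difference the histograms above are comparing across implementations.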


Some minor differences in random forest implementations #160

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Some minor differences in random forest implementations #160

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions