
When trying to fit a scikit-learn DecisionTreeClassifier on my data, I am observing some weird behavior.

x[54] (a boolean feature) is used to split the 19 samples into 2 and 17 at the top-left node. Then the same feature, with the exact same condition, appears again in its True branch.

This time, both its True and False branches lead to leaf nodes.

I am using Gini as the split criterion.
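For reference, my understanding of the impurity value scikit-learn reports at a node: it is computed from the class labels alone, not from any feature (the class counts below are made up for illustration):

```python
import numpy as np

# Gini impurity of a node is 1 - sum(p_k^2) over its class proportions p_k.
counts = np.array([12, 7])        # hypothetical class counts in a 19-sample node
p = counts / counts.sum()
print(1.0 - np.sum(p ** 2))       # non-zero whenever the node holds a mix of classes
```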

My question is: since we are in the True branch, how can the same boolean feature produce any further split, or any impurity reduction, at all? After all, the new subset can only have 0s for that feature, so there should not be any possibility of a split.
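That intuition seems to hold in a tiny sanity check with made-up labels: a column that is constant within a node gives the tree nothing to split on, even though the node's own impurity is non-zero:

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier, export_text

X = np.zeros((6, 1))              # the "True branch" situation: the feature is 0 everywhere
y = np.array([0, 0, 0, 1, 1, 1])  # labels still mixed, so node impurity is non-zero
clf = DecisionTreeClassifier(criterion="gini").fit(X, y)

print(clf.tree_.impurity[0])      # 0.5: the root's Gini impurity
print(export_text(clf))           # yet the tree is a single leaf: no split on a constant column
```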

What am I missing?

[Decision tree plot ("D-Tree issue") showing x[54] used at the top-left node and again in its True branch]

  • Does the feature have missing values? I think those are now supported by the model, but not shown by the display. (Commented May 2 at 2:58)
  • Maybe you should show the code and data that produce this result. (Commented May 2 at 9:41)
  • Maybe you could ask on similar portals such as Data Science, Cross Validated, or Artificial Intelligence, or on the Kaggle forums; they may have more experience with ML. (Commented May 2 at 9:42)
  • About a reproducible setup: it's reasonably big corporate data that I can't expose directly. Let me try to trim it down without losing the bug; I suspect that's easier said than done (a rough synthetic sketch of the missing-values idea follows below). (Commented May 2 at 12:40)
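Following up on the missing-values hypothesis from the first comment, here is a minimal synthetic sketch (the data are invented, and it assumes scikit-learn >= 1.3, where DecisionTreeClassifier accepts NaN during fit) of how a boolean feature could legitimately show up twice along one branch once NaNs are involved:

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier, export_text

# One "boolean" column that also contains NaNs; the NaN rows carry mixed labels,
# so no single split can isolate them.
X = np.array([0, 0, 0, 0, 1, 1, 1, 1,
              np.nan, np.nan, np.nan, np.nan]).reshape(-1, 1)
y = np.array([0, 0, 0, 0, 1, 1, 1, 1, 0, 0, 1, 1])

clf = DecisionTreeClassifier(criterion="gini", random_state=0).fit(X, y)
print(export_text(clf, feature_names=["x54"]))
# If the splitter finds it worthwhile, x54 can be chosen again below its own split purely
# to route the NaN rows away from the finite ones, even though every finite value in that
# node is identical. As the first comment notes, the display does not show which side the
# missing values are sent to, which makes such a split look impossible.
```

A quick np.isnan(X[:, 54]).any() check on the real data would confirm or rule this out.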
