binning_table on a subset of the original sample used for fitting #232

chapmanh · 2023-02-17T16:57:31Z

Hi,
This might not be statistically appropriate to do, nonetheless I'm looking to use OptimalBinning to fit a variable to a binary target, and then look at how a subset of the data performs using identical bins to the originally fitted data.

I'm achieving this by providing a new OptimalBinning object the splits obtained in the original fitting and then 'fitting' the subset to these splits with monotonic_trend set to None to prevent shifting. I'm also interested in how the individual IVs change within each bin depending on the subset which is used.

there are three suggestions off the back of this:
one rather radical;

The most radical solution to help me would be some sort of method in binning_table which can ingest a completely different variable array and build a table as if this was the data that created the bins

and two less intense ideas!
2. In instances where the subset does not span the entire range of the original sample, the fitting errors if there are no samples in the bin. Could this default to all zeroes, much like the 'Special' bin already does?
3. Could metric='iv' be an option when using binning_table.plot? This would be really helpful as currently I can only view WoE and event rate within each bin. Thinking about this one, perhaps I just need a crash course on building my own plots from scratch!

Many thanks,
H

The text was updated successfully, but these errors were encountered:

guillermo-navas-palencia added the question Further information is requested label Feb 19, 2023

guillermo-navas-palencia self-assigned this Feb 19, 2023

alexliap mentioned this issue Nov 8, 2023

Add plot metric #281

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

binning_table on a subset of the original sample used for fitting #232

binning_table on a subset of the original sample used for fitting #232

chapmanh commented Feb 17, 2023

binning_table on a subset of the original sample used for fitting #232

binning_table on a subset of the original sample used for fitting #232

Comments

chapmanh commented Feb 17, 2023