Data Science Asked by stardust123 on January 11, 2021
I have a mortgage/credit data set that contains a list of customers (600k rows) and has a 100 columns inclusive of the customer’s general info (address, city, zipcode, etc), income, fico scores, number of current mortgages, mortgages in the past, aggregate mortgage amounts, number of bank card trades, etc. The data pertains to customers that are already good candidates to contact for issuing a credit product, however if one is to narrow down the list to 350K:
What would be the best method to rank the list to cut it down?
PS Your insights are much appreciated.
So it a little hard to know how to answer the question. Why do you need to cut the number of records down? If you want to rank them, rank them to what purpose? Ranking implies there are positives and negatives, or at least less positives.
One way you could approach this is to perform PCA. That said, I think you should consider disqualifying some fields, like the address fields.
Answered by Skiddles on January 11, 2021
Get help from others!
Recent Answers
Recent Questions
© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP