Very interesting work. If you don't mind sharing, what course/degree are you taking at university?
I'm studying for a Masters of Science - Quantitative Analysis. Think Doug Fuller. This project is for a course I'm taking in Stochastic Modeling. My next project (for a different course) is performing a regression analysis of the extended credit scoring against default rates. One thing I want to examine is whether the PD07 extra data makes a difference in defaults.
The normal status of a loan should be "current." Does a loan that runs to maturity have one count in this column and then 1 count in the FT_PIF column? In other words, for a loan that performs as we all hope, should I see 97% in the "Current" column and 3% in the "FT_PIF" column? That's about the only way I can explain the low numbers I see in the "Early PIF" column, since I looked at my stats and a few other long-term lenders and we all have about 15-20% of our loans showing as having being repaid.
This is hard to think about conceptually, because there are a couple of regularly recurring states. The FT_PIF column (I don't know why I didn't explain that earlier) stands for "full term pay in full." We have had no loans reach full term, so there should be zeros in that column. Early PIF is early pay in full. Because of regularly recurring states, a loan can run up 36 counts in the current column before transition to a "1" in the PIF column.
As I said about the difficulty in conceptualizing this, and this is something I have to talk to my prof about, is whether I defined my states properly. See, this is a "month to month" roll rate. The way you read the data is looking at that first row and column and saying "if a payment is current this month, it has a 97% chance of being current next month." But a loan can only become "FT PIF" once it has made the 36th payment. The way the math behind this model really works, if I want to differentiate between payoff states, I have to have a state for month 36.
So to directly comment on the numbers we "should" see, I'm not sure... a loan would rack up 36 counts in the current column before transitioning over to the "FT PIF" column.
Now I assume the bright green "Current" data point (91%) is what we really want our loans to show up as. After that it becomes a little bit unclear.
Actually, conceptually, the lines <15, late, 1 mo, 2 mo, 3 mo lates are the easiest to understand. They don't contain any regularly recurring states in ways that "Current" and "4+" do. Remember, a loan that is always current will count 36 times in the current column and then once in the FT PIF column. The 4+ column holds loans that have sat there for several months, and every month a single loan hangs out there, it racks up another count in the column before changing states.
I suppose the yellow diagonal line is to show the loans that deteriorate, in other words, of all the loans, about 7% will then go "<15 late."
Technically, what the chart shows is that if a loan (more technically the payment) is current one month, it has a 7% chance of going <15 the next month.
Of this 7% about 52% (or 4% of the 7%) will flip back to "current" and 46% will deteriorate further into"late." Then of the 46%, 66% will deteriorate into "1 month late," and so on? Is this interpretation correct?
No. The proper interpretation is "for a payment in state x, the odds of it being in state y next month are z%." You might ask me what the difference is, but I really don't know..
This seems to be extraordinary low rates, and I suppose it can be accounted for only if you count all the current loans for each presentation of current. So in effect you're counting state changes in the Prosper loans database? That would also then account for the 70% of "4Month late" loans remaining in this status and only 23% seemingly exit this process?
Yes, I'm counting state changes, with the understanding that some states are recurring.
If my understanding is incorrect, would you talk me (and others as uncomprehending as I), through your table in more detail?
If you want more detail, PM me your phone number and a good time. I'll call you. This is a lot to write, and sometimes not intuitively obvious to understand.