Originally published on LinkedIn, March 6, 2016
The Data – The analysis drew on 1.5 million rows across 16 files. Each file covered a single year, and each row represented a single branch.
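As a rough illustration of how yearly files like these could be stacked into one table and rolled up to the top level, here is a minimal R sketch. The file names, the column names (`branch_id`, `deposits`) and the tiny stand-in files are all assumptions made up for the example; they are not the actual data or the code used for the infographic.

```r
# Hypothetical layout: one CSV per year, one row per branch.
# File names, column names, and values are invented for illustration only.
data_dir <- file.path(tempdir(), "branch_files_demo")
dir.create(data_dir, showWarnings = FALSE)

# Write two tiny stand-in files so the sketch runs on its own.
write.csv(data.frame(branch_id = c(1, 2), deposits = c(100, 250)),
          file.path(data_dir, "branches_2014.csv"), row.names = FALSE)
write.csv(data.frame(branch_id = c(1, 2, 3), deposits = c(110, 240, 75)),
          file.path(data_dir, "branches_2015.csv"), row.names = FALSE)

# Read one file and tag every row with the year taken from the file name.
read_year <- function(path) {
  df <- read.csv(path)
  df$year <- as.integer(sub("^branches_([0-9]{4})\\.csv$", "\\1", basename(path)))
  df
}

# Stack all yearly files into a single data frame.
files <- list.files(data_dir, pattern = "^branches_[0-9]{4}\\.csv$", full.names = TRUE)
all_branches <- do.call(rbind, lapply(files, read_year))

# Highest-level aggregate: one total per year.
yearly_totals <- aggregate(deposits ~ year, data = all_branches, FUN = sum)
print(yearly_totals)
```

With 16 real files the same pattern applies unchanged; for 1.5 million rows, a faster reader than `read.csv` may be worth considering.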
The Math – The forecasts were produced with ARIMA (Autoregressive Integrated Moving Average) models, a standard choice for time series analysis. To see the full R Markdown report describing how each model was chosen click here.
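For readers who have not fit an ARIMA model before, the sketch below shows the general shape of the workflow in base R using `arima()` and `predict()` from the built-in stats package. The series is synthetic and the order (1, 1, 0) is picked arbitrarily for the example; neither reflects the data or the models chosen in the report.

```r
# Synthetic yearly series standing in for a top-level aggregate (not real data).
set.seed(42)
y <- ts(100 + cumsum(rnorm(16)), start = 2000)  # 16 yearly points, 2000-2015

# ARIMA(1,1,0): difference the series once (d = 1), then fit one
# autoregressive term (p = 1) to the differences. The order here is an
# arbitrary illustration, not a model selected for the article.
fit <- arima(y, order = c(1, 1, 0), method = "ML")

# Point forecasts and their standard errors for the next three years.
fc <- predict(fit, n.ahead = 3)
print(fc$pred)
print(fc$se)
```

In practice the order would be chosen by comparing candidate models, for example on AIC or on residual diagnostics, rather than fixed up front as it is here.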
The Tech – Everything in the infographic was laid out with R code. All of the source code and ancillary files are available in my GitHub account; here is the link.
Final words – These highlights come from the highest-level aggregates. We can gain even more insights once we are all able to drill down into lower levels. For the next article, I plan to make the data available via an interactive report.
Disclaimers: The opinions in this article do not reflect those of my employer. This analysis is meant for general knowledge purposes; the writer accepts no responsibility for any action taken by the reader or others based on these results. Please also consider any disclaimers from the data provider, in this case the FDIC, which may be found at http://www.FDIC.gov