{"id":8924,"date":"2022-02-28T00:00:00","date_gmt":"2022-02-28T00:00:00","guid":{"rendered":"https:\/\/nurs.essaybishops.com\/8924-2\/"},"modified":"2022-02-28T00:00:00","modified_gmt":"2022-02-28T00:00:00","slug":"8924-2","status":"publish","type":"post","link":"https:\/\/www.homeworkacetutors.com\/nursing\/8924-2\/","title":{"rendered":""},"content":{"rendered":"<p>UNCLASSIFIED<\/p>\n<p>1<\/p>\n<p>Paper 1047-2021<\/p>\n<p>SAS\u00ae Time Series Analysis &amp; Forecasting<\/p>\n<p>(TSAF) at the Canada Revenue Agency (CRA), with COVID impacts<\/p>\n<p>Jason A. Oliver, Senior Compliance Analyst, Canada Revenue Agency (CRA)<\/p>\n<p>ABSTRACT<\/p>\n<p>It may well be a recurring theme of this year&#8217;s SAS Global Forum that we are faced with more pressure to use flexible thinking &#8211; not just critical thinking &#8211; and when it comes to<\/p>\n<p>time series analysis and forecasting (TSAF) in SAS, it&#8217;s all about &#8220;rethinking the curve&#8221;.<\/p>\n<p>At the Canada Revenue Agency (CRA) Compliance Programs Branch (CPB), we have grappled with reliable forecasting for macro-level tax variables on a month-to-month basis, even before the COVID-19 pandemic hit. But now we face a particularly difficult<\/p>\n<p>challenge. As with many large organizations, it is not easy to foretell what the fallout may be from such a cataclysm.<\/p>\n<p>In setting up SAS to right the trajectory, we must be extra cautious about some of the fallacies in applying TSAF in this context: the lagged effect for tax revenues realized based on audits of the previous tax year, the need to differentiate average tax recovery<\/p>\n<p>per case from sum of tax recovery (month-to-month), realizing that industry sectors are not &#8220;one size fits all&#8221;, and accounting for relatively temporary effects of staffing re-<\/p>\n<p>orientation in the conversion to a virtual workplace versus the more enduring effects of business disruptions. With SAS Enterprise Miner&#8217;s abilities to continuously adjust forecasts, sub-categorize datapoints by tax office or industry sector, and apply lagged<\/p>\n<p>cross-correlation analysis, we are suitably equipped with the right tools and this can provide abstract learnings for other large organizations.<\/p>\n<p>INTRODUCTION<\/p>\n<p>The Canada Revenue Agency (CRA) is Canada\u2019s federal tax administration. As with all tax<\/p>\n<p>jurisdictions, the CRA has been challenged to keep pace with COVID-19 shocks and<\/p>\n<p>manifestations, which began in March 2020 (the last month of our fiscal year).<\/p>\n<p>Fortunately, SAS\u00ae Enterprise Miner\u2122 has been an invaluable aid in gauging these impacts.<\/p>\n<p>Enterprise Miner\u2122 includes a highly versatile set of functional nodes for configuring and<\/p>\n<p>processing time series data. It can decompose time series components such as seasonality<\/p>\n<p>and trend, show trend lines and expected forecast within configurable prediction intervals,<\/p>\n<p>and demonstrate complex correlation analyses.<\/p>\n<p>While this has been of great benefit to the CRA in gauging the trajectory of macro-variables<\/p>\n<p>related to tax revenues and auditor performance, the findings of this research paper could<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>2<\/p>\n<p>conceivably be applied in the abstract to large organizations with process-oriented<\/p>\n<p>functions, and not just to other foreign tax jurisdictions.<\/p>\n<p>Let us provide a Glossary of terms to set the stage:<\/p>\n<p>\uf0b7 TSAF: Time Series Analysis &amp; Forecasting.<\/p>\n<p>\uf0b7 TEBA: tax earned by audit, which is the amount of tax collectible that is agreed upon in the course of a taxpayer audit. It is in NPV (Net Present Value).<\/p>\n<p>\uf0b7 TAR: the tax-at-risk, which is the amount that CRA risk assessors arrive at as the precursor to auditing activity.<\/p>\n<p>\uf0b7 C\/AR ratio: the ratio of [audit] cases completed, to action requests [submitted]<\/p>\n<p>for assistance. It is a tentative measure of auditor productivity.<\/p>\n<p>\uf0b7 Integras: the tool used by CRA auditors to process cases.<\/p>\n<p>TIME SERIES FUNCTIONAL NODES &amp; SETUP<\/p>\n<p>In SAS\u00ae Enterprise Miner\u2122, you have six TSAF nodes in the \u201cTime Series\u201d ribbon; but we\u2019re<\/p>\n<p>only going to use four of them. Below is the Time Series ribbon with the functional nodes in<\/p>\n<p>question:<\/p>\n<p>Figure 1. Time Series Functional Nodes<\/p>\n<p>\uf0b7 TS Data Preparation: this node allows you to specify basic time series properties<\/p>\n<p>including interval, cycle, start\/end time, and accumulation (i.e. by total, min or max,<\/p>\n<p>mean, etc.)<\/p>\n<p>o Below, the interval is \u201cautomatic\u201d, so we specify \u201cMonth\u201d as the interval.<\/p>\n<p>o We can leave the seasonal cycle and start\/end time as \u201cDefault\u201d, as SAS\u00ae<\/p>\n<p>Enterprise Miner\u2122 will auto-determine these parts from the data.<\/p>\n<p>o In our case, the data was pre-accumulated in SAS\u00ae Enterprise Guide\u2122 row-<\/p>\n<p>by-row on a per-month basis, so we can leave Accumulation = \u201cTotal\u201d (else,<\/p>\n<p>we would have to set it \u201cAverage\u201d).<\/p>\n<p>Figure 2. TS Data Preparation node \u2013 basic properties<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>3<\/p>\n<p>\uf0b7 TS Decomposition: this node allows you to specify similar basic settings to that of<\/p>\n<p>the TS Data Prep node, but the Number of Periods can be configured, and moreover,<\/p>\n<p>you can configure which Export Components you want to display.<\/p>\n<p>o By default, it will only display \u201cTrend-Cycle\u201d component (=Yes), which is<\/p>\n<p>generally regarded as the most salient one.<\/p>\n<p>o However, in our case, we want to view ALL Components, so we would set that<\/p>\n<p>value to \u201cYes\u201d.<\/p>\n<p>Figure 3. TS Decomposition node \u2013properties<\/p>\n<p>TS Correlation: this node allows you to set up your TSA for autocorrelation analysis, or<\/p>\n<p>alternatively for CCA (Cross-correlation analysis). When you select one of those methods,<\/p>\n<p>the other one\u2019s properties will be greyed out.<\/p>\n<p>Figure 4. TS Correlation node \u2013properties<\/p>\n<p>Both the TS Correlation and TS Decomposition nodes must be preceded by a TS Data<\/p>\n<p>Preparation node (which occurs right after the source data node).<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>4<\/p>\n<p>TS Exponential Smoothing: this node allows you to conduct forecasting based on your<\/p>\n<p>known data; as such, you would connect it to a TS Data Preparation node, not directly to<\/p>\n<p>your source data node.<\/p>\n<p>\uf0b7 The interval is automatic (which will be month in the case of our pre-accumulated<\/p>\n<p>data), and the accumulation defaults to \u201cTotal\u201d (which is OK in our case, for the<\/p>\n<p>same reason).<\/p>\n<p>\uf0b7 SAS will pick what it deems to be the best forecasting method.<\/p>\n<p>\uf0b7 The default selection criterion is MSE, or Mean Squared Error.<\/p>\n<p>\uf0b7 We will see more on the Forecast lead, back, and significance level parameters<\/p>\n<p>during the forecast demonstration in this paper.<\/p>\n<p>Figure 5. TS Exponential Smoothing node \u2013properties<\/p>\n<p>For our initial workspace setup, we can scrutinize on the C\/AR (Case to Action Request)<\/p>\n<p>ratio, which as per our glossary is a tentative measure of tax auditor performance. The<\/p>\n<p>initial diagram workspace is called \u201cAggreg_Integras_27mths\u201d, which runs from January<\/p>\n<p>2018: 2024 &#8211; Write My Essay For Me | Essay Writing Service For Your Papers Online to March 2020. This is arranged this way for a reason: because it ends on the month<\/p>\n<p>of the COVID shutdown.<\/p>\n<p>Our dataset name is \u201cTSA_AGGREG_SINGLE_LINE_27MTHS\u201d.<\/p>\n<p>So, when I bring this in, I need to set all variables to Role = \u201cRejected\u201d except a) C\/AR ratio<\/p>\n<p>and b) my MONTH (Time ID) variable.<\/p>\n<p>Figure 6. Variable Role selection from data source<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>5<\/p>\n<p>You would set your variables once you bring the data source to your diagram (workspace).<\/p>\n<p>Figure 7. TS Data Source to Diagram flow<\/p>\n<p>NOTE: I do not cover the mechanics behind bringing in a data source, as the principal focus<\/p>\n<p>is on conducting TSAF in SAS\u00ae Enterprise Miner\u2122. All we need to be concerned with is that<\/p>\n<p>as Data Sources become available in the top-left menu, we can drag-and-drop them to our<\/p>\n<p>diagram workspace (which are also created by right-clicking \u2018Diagrams\u2019 in the left panel).<\/p>\n<p>In examining the TS Data Preparation node, it is fairly simple: we see the known trajectory of the C\/AR variable, simply by right-clicking the node \uf0e0 Run \uf0e0 Results.<\/p>\n<p>Figure 8. Time Series Plot, for C\/AR ratio variable<\/p>\n<p>We can see that the C\/AR ratio has fallen off as of mid-2018: 2024 &#8211; Write My Essay For Me | Essay Writing Service For Your Papers Online, and continued on a very<\/p>\n<p>gradual downward path. Which means that case auditors are completing disproportionately<\/p>\n<p>less cases to the action requests they submit for help, albeit with a seasonal factor and<\/p>\n<p>some rebounding of the trend-line in March 2020.<\/p>\n<p>So, we can scrutinize on the more specific components of the time series line by using a TS<\/p>\n<p>Decomposition node.<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>6<\/p>\n<p>DECOMPOSITION OF TIME SERIES<\/p>\n<p>In running our TS Decomposition node, and viewing the results, the first one to examine<\/p>\n<p>is the Seasonal Component Plot. When it comes to the C\/AR ratio, the seasonal index range<\/p>\n<p>is between a high of about 1.3 down to about 0.75.<\/p>\n<p>Figure 9. Seasonal Component Plot, for C\/AR ratio variable<\/p>\n<p>During the months of March and December, we see fairly high seasonality. This is normal<\/p>\n<p>for the time, since the push to complete cases is higher at the end of the CRA fiscal year<\/p>\n<p>(March), and ostensibly at the end of the calendar year, also. Auditors are completing<\/p>\n<p>proportionally more cases vs. the number of action requests they submit to the service<\/p>\n<p>desk. So it is likely that they are fulfilling cases that do not require as many interventions<\/p>\n<p>during those months. Even in March 2020, C\/AR still remained high \u2013 it was<\/p>\n<p>resilient to the initial COVID effects, due to being a ratio variable and not an absolute<\/p>\n<p>sum variable.<\/p>\n<p>In the decomposed results, we can also examine combinatory components; for instance, the<\/p>\n<p>Trend-Cycle Component Plot:<\/p>\n<p>Figure 10. Trend-Cycle Component Plot, for C\/AR ratio variable<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>7<\/p>\n<p>This tells us what we had surmised from the initial data preparation, that the series has<\/p>\n<p>been on a steadily downwards trajectory. Now when it comes to tax-related time series<\/p>\n<p>data, there is no real cycle per se; at best, it is an inherited cycle from world economy<\/p>\n<p>fluctuations. The proper definition of cycle in a TSA context is not the entity\u2019s operational<\/p>\n<p>lifecycle; rather, it refers to the boom-and-bust business cycles which are largely<\/p>\n<p>unpredictable. Ergo, we are mainly concerned about trend here.<\/p>\n<p>Now, if we substitute the Average TEBA (tax earned by audit) variable for C\/AR [using the<\/p>\n<p>Data Source node shown in figure 6 earlier], we can see what emerges in our decomposed<\/p>\n<p>time series results.<\/p>\n<p>Figure 11. Paneled Component Plots, TS Decomp. for Avg. TEBA<\/p>\n<p>This time, as per the panel graph at bottom-left, we see that our seasonality index is<\/p>\n<p>broader than that of C\/AR ratio; it goes from a high of about 1.8 to a low of ~0.7. This is<\/p>\n<p>largely attributable to the heightened pressures towards fiscal year-end to increase<\/p>\n<p>realization of TEBA, which we see in Feb.-March. At the opposite end, we see rather low<\/p>\n<p>seasonality for May, August, and November.<\/p>\n<p>For the original series plot, bottom-right, the trend continues gradually upwards with<\/p>\n<p>seasonality readily apparent. In the trend-cycle component plot, at top-left, we see that the<\/p>\n<p>trend (with cycle, such as it is) is rising steadily upwards but then reaches a virtual plateau.<\/p>\n<p>The key challenge then, has been to resolve and reconcile the expected forecast as of March<\/p>\n<p>2020 with the new COVID-19 realities.<\/p>\n<p>FORECASTING MACRO TAX VARIABLES<\/p>\n<p>AVERAGE TEBA<\/p>\n<p>We can proceed to evaluate the expected trajectory of the AVG. TEBA variable, on a<\/p>\n<p>monthly interval. Recall that this variable is pre-accumulated at data source.<\/p>\n<p>When we conduct our forecast, we use the TS Exponential Smoothing node.<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>8<\/p>\n<p>Figure 12. TS Exponential Smoothing node in the TSAF diagram<\/p>\n<p>We let SAS\u00ae pick the best forecasting method, as well as selection criterion (forecast<\/p>\n<p>measure). In this case, the latter value is the MSE [Mean Squared Error] as you can see at<\/p>\n<p>the bottom of the properties of the node.<\/p>\n<p>Figure 13. Properties of the TS Exponential Smoothing node<\/p>\n<p>For our Significance Level, we set this to 0.5; it governs the blue bracket around the<\/p>\n<p>forecast line, a.k.a. the prediction interval. So it is a confidence band of sorts. The way this<\/p>\n<p>figure works is the opposite of what some of us might know from frequentist confidence<\/p>\n<p>intervals; that is, the lower the \u201calpha\u201d value, the wider the band (prediction interval) so an<\/p>\n<p>\u201calpha\u201d of 0.01 would produce a very wide band, and an \u201calpha\u201d value = 0.99 would be<\/p>\n<p>virtually limited to just the forecast line itself. So we aim in the middle (which actually is<\/p>\n<p>closer to the outline of the trend line, as this figure is more \u201clog-like\u201d in its manifestation).<\/p>\n<p>Figure 14. TEBA_NPV_Mean: forecast line from trend<\/p>\n<p>SAS logically expects the trend will continue upwards (while maintaining seasonality, of<\/p>\n<p>course) due to \u201cseries momentum\u201d. Had we began our time series at, say, January 2016: 2024 &#8211; Do my homework &#8211; Help write my assignment online<\/p>\n<p>rather than Jan. 2018: 2024 &#8211; Write My Essay For Me | Essay Writing Service For Your Papers Online, that momentum might have been more pronounced. The clich\u00e9s of<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>9<\/p>\n<p>\u201cfuture behavior is governed by past behavior\u201d and \u201cyou can\u2019t know where you\u2019re going,<\/p>\n<p>unless you know where you\u2019ve been\u201d have never been truer. However, enter COVID-19,<\/p>\n<p>and that is a whole new wrench in the gears of the tax-auditing apparatus.<\/p>\n<p>As for the selection of \u201cBest\u201d Forecasting Method: you could try to experiment with<\/p>\n<p>different models \u2013 there are eight in all, as per fundamental TSAF science \u2013 but I can tell<\/p>\n<p>from the shape of the forecast line that it\u2019s based, appropriately, on the Additive Winters<\/p>\n<p>method1. I ascertained this by running the node with this method selected, and the<\/p>\n<p>resulting graph was identical to \u201cbest\u201d method. Unlike the Multiplicative Winters method,<\/p>\n<p>this forecast line is predicated on fairly consistent seasonal \u201cinverted V\u201d shapes in the curve.<\/p>\n<p>If those inverted V shapes became noticeable larger (or smaller), then Multiplicative Winters<\/p>\n<p>would likely be the \u201cbest\u201d method that SAS would auto-select.<\/p>\n<p>Figure 15. Available Forecasting Methods, properties of TS Exp. Smoothing node<\/p>\n<p>We see that in the resulting forecast, it predicts ahead exactly 12 months. This is the<\/p>\n<p>difference between the figures of \u201cForecast Lead\u201d and \u201cForecast Back\u201d in the properties. We<\/p>\n<p>saw on the previous page that the \u201cForecast Back\u201d = 6; this acts as our validation partition,<\/p>\n<p>using the last six months of known data (i.e. Oct. 2019: 2024 &#8211; Online Assignment Homework Writing Help Service By Expert Research Writers to March 2020). So this gets<\/p>\n<p>subtracted from the \u201cForecast Back\u201d value of 18 to arrive at 12 periods out. Ideally, you<\/p>\n<p>want your \u201cback\u201d [validation] period to be between 20-25% of your known data, which it is<\/p>\n<p>out of 27 months; even when we increase the known months to 30, it will still be 20% of<\/p>\n<p>this.<\/p>\n<p>SUM OF TEBA<\/p>\n<p>When we run a TSAF experiment on the SUM of TEBA \u2013 as opposed to its average \u2013 we<\/p>\n<p>realize a drastic difference in the scale. Because TEBA is a sum value, not a ratio (i.e.<\/p>\n<p>C\/AR, or [Average] TEBA\/case), it is simply not as resilient to sudden shocks like COVID-19<\/p>\n<p>\u2013 as we will later see when adjusting the forecast based on incremental months (April, May,<\/p>\n<p>June) of known values.<\/p>\n<p>1 The essence of the Winters method is to combine discernible trend with seasonality.<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>10<\/p>\n<p>Figure 16. TEBA SUM Forecast (post-March 2020)<\/p>\n<p>Note that the MSE selection criterion (default) graphs a trend line around the known values<\/p>\n<p>(which are represented by the red dots here). The SUM TEBA for Feb. 2020 is nearly double<\/p>\n<p>what it was for March 2020, as you can see by the relatively large separation of the red dots<\/p>\n<p>from the blue dots (on trendline) for those two months. Yet SAS\u00ae \u201cthinks\u201d that the trend<\/p>\n<p>will continue positively, as it is \u201cCOVID-agnostic\u201d.<\/p>\n<p>What may also seem shocking to the reader is that the lower limit of the prediction interval<\/p>\n<p>for April 2020 (at ~$674.5M) actually exceeds the actual value for April 2019: 2024 &#8211; Online Assignment Homework Writing Help Service By Expert Research Writers, which was<\/p>\n<p>slightly below $500 million. It is not until the fall until we see that the midpoint of actual<\/p>\n<p>2019: 2024 &#8211; Online Assignment Homework Writing Help Service By Expert Research Writers data approximates the LCL (lower confidence limit) of the forecasted band for Sept.<\/p>\n<p>2020. This is ostensibly due to the \u201cpositive momentum\u201d of the time series that I alluded to<\/p>\n<p>earlier.<\/p>\n<p>C\/AR RATIO<\/p>\n<p>Next, we switch out the SUM of TEBA for the C\/AR ratio, once again. In forecasting a<\/p>\n<p>relatively low continuous ratio variable such as C\/AR, the prediction interval can be less<\/p>\n<p>reliable. We have to examine the midpoint distribution. While the midpoint post-March<\/p>\n<p>2020 tends to be at or above the 10.0 line, this is rare for 2019: 2024 &#8211; Online Assignment Homework Writing Help Service By Expert Research Writers datapoints.<\/p>\n<p>Figure 17. C\/AR ratio Forecast<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>11<\/p>\n<p>I used the Mean Relative Abs. Error as the forecast metric (selection criterion), which I<\/p>\n<p>found to be more appropriate. Regardless, what we see in the actuals for the spring of 2020<\/p>\n<p>is a very low C\/AR ratio, telling us that case throughput has suffered as a result of the<\/p>\n<p>pandemic AND that Action Requests for help did not decline proportionally; there was still<\/p>\n<p>an apparent high need for action requests.<\/p>\n<p>FORECASTING AVG. HOURS PER CASE<\/p>\n<p>For forecasting average hours per [audit] case, I determined that the more ideal Selection<\/p>\n<p>Criterion was \u201cMedian Relative Abs. Error\u201d. No matter what Selection Criterion I used (or<\/p>\n<p>Significance Level), the prediction interval still dipped into the negative range. Sometimes,<\/p>\n<p>this is unavoidable. But then the prediction interval becomes spurious; you can\u2019t have<\/p>\n<p>negative hours. So we tend to just focus on the midpoint values in this situation.<\/p>\n<p>Figure 18. Average hours per case Forecast<\/p>\n<p>We can see that the midpoint goes very subtly upwards for the first few forecasted points<\/p>\n<p>(post-March 2020), then sharply up for summer. As it turns out, this is a fairly good<\/p>\n<p>approximation of the reality, since the Avg. Hours per case during the middle of 2020 is<\/p>\n<p>about 1.5-2.0 times that of the previous year. What is especially pronounced is that the<\/p>\n<p>Average Hours of March 2019: 2024 &#8211; Online Assignment Homework Writing Help Service By Expert Research Writers were only 6.25, whereas for March 2020, it was 35.44. This<\/p>\n<p>was predicated on an Agency policy-induced change; refer to the link and passage below:<\/p>\n<p>https:\/\/www.mondaq.com\/canada\/audit\/1030308\/cra-moves-forward-with-international-audits- despite-continued-backlog-?email_access=on In March 2020, the CRA announced that it was suspending the vast majority of audit activity for a<\/p>\n<p>minimum of four weeks, other than audits involving the very largest taxpayers. This suspension meant<\/p>\n<p>that the CRA ceased requests for information relating to existing audits, finalizing existing audits, and<\/p>\n<p>issuing reassessments. Further, deadlines for information or document requests were suspended and no<\/p>\n<p>action was required from taxpayers under audit during this time. This suspension remained in effect until<\/p>\n<p>June 2020, though audits of small and medium businesses did not resume until late fall.<\/p>\n<p>This is also arguably responsible for the \u201cpulse\u201d effect we see in actual Avg. TEBA for July<\/p>\n<p>2020, as per the monthly incremental analysis that comes next.<\/p>\n<p>https:\/\/www.mondaq.com\/canada\/audit\/1030308\/cra-moves-forward-with-international-audits-despite-continued-backlog-?email_access=on<br \/>\nhttps:\/\/www.mondaq.com\/canada\/audit\/1030308\/cra-moves-forward-with-international-audits-despite-continued-backlog-?email_access=on<br \/>\nUNCLASSIFIED<\/p>\n<p>12<\/p>\n<p>INCREMENTAL ALIGNMENT<\/p>\n<p>APRIL 2020, KNOWN VALUES<\/p>\n<p>Now when we add the month of April 2020 to our data (making it 28 mths total), we would<\/p>\n<p>expect the AVG. TEBA actuals for subsequent months to become closer to \/ within forecast<\/p>\n<p>range. As an example in the graph cross-section that follows, the forecast for September,<\/p>\n<p>October, and December 2020 becomes more within range of later-known actuals, once we<\/p>\n<p>add April 2020 data. However, the July 2020 actual (~$122,000) is still above the forecast<\/p>\n<p>band for this incremental dataset\u2019s forecast. This was likely due to the resumption of<\/p>\n<p>standard large business audit as of June 2020 (see previous page article\/passage).<\/p>\n<p>Figure 19. Revised AVG. TEBA forecast, incremental inclusion of APRIL 2020<\/p>\n<p>Again, we typically use the measure of MSE [Mean Squared Error] in gauging efficacy or<\/p>\n<p>proximity of a forecast to actual [values]. See the Appendix tables at the end of this paper<\/p>\n<p>for a breakdown of this analysis, where I illustrate monthly incremental effect on accuracy<\/p>\n<p>of the last six months of the calendar year (i.e. from July to Dec. 2020).<\/p>\n<p>MAY 2020, KNOWN VALUES<\/p>\n<p>Clearly, the addition of April wasn\u2019t enough to right the trajectory of the expanding \u201cCOVID<\/p>\n<p>window\u201d. So in continuing our analysis of monthly incremental effect, I added May 2020\u2019s<\/p>\n<p>known data and I changed the forecast significance level from 0.5 to 0.25. But it makes no<\/p>\n<p>difference: July actual is still out of forecast range. We must simply accept that July 2020<\/p>\n<p>Avg. TEBA is an irregular value (~$122K), since July 2018: 2024 &#8211; Write My Essay For Me | Essay Writing Service For Your Papers Online had Avg. TEBA =~$45K, and July<\/p>\n<p>2019: 2024 &#8211; Online Assignment Homework Writing Help Service By Expert Research Writers\u2019s Avg. TEBA was ~$57K. It is clear that this is a COVID-adjustment spike.<\/p>\n<p>Figure 20. Revised AVG. TEBA forecast, incremental inclusion of MAY 2020<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>13<\/p>\n<p>We can therefore define July 2020 as a pulse, or a one-time brief event, that caused a<\/p>\n<p>spike in the accumulated time series value for that month. This emphasis on larger<\/p>\n<p>business for audit while suspending SMB audits at the time is further substantiated by the<\/p>\n<p>fact that in July 2020, there was an average of 50.75 hrs per case completed, which is<\/p>\n<p>extremely high. For April, which had a very high Average TEBA of $185.5K, the figure was<\/p>\n<p>52.16 average hours per case.<\/p>\n<p>JUNE 2020, KNOWN VALUES<\/p>\n<p>Predictably, for the addition of June 2020, it didn\u2019t improve the forecast band to include the<\/p>\n<p>actual Avg. TEBA for July. So this strengthens the theory that July\u2019s value was a one-time<\/p>\n<p>event, or pulse, in the time series. It also strengthens the theory that Avg. TEBA was more<\/p>\n<p>resilient to initial COVID-19 transition measures (being a ratio value, in essence). To wit:<\/p>\n<p>observe below that the April-May-June line for the original forecast (left) and actual data<\/p>\n<p>points (right) is just above the $50K line, and follows the same trajectory.<\/p>\n<p>Figure 21. Comparing Q1 of FY2020-21 forecast vs. actual data points<\/p>\n<p>In taking MSE and RMSE (R is \u201croot\u201d) measurements for both the as-of-March and as-of-<\/p>\n<p>June forecasts, we only note a slight improvement (reduction) in that value. Which also<\/p>\n<p>goes to show the resilience of this variable, and the \u201cpulse\u201d nature of July\u2019s spike.<\/p>\n<p>MEASURE \/ as of MONTH MARCH 2020 JUNE 2020<\/p>\n<p>AVG. TEBA (MSE) $ 954,467,257.64 $ 888,454,004.34<\/p>\n<p>RMSE $ 30,894.45 $ 29,806.95<\/p>\n<p>Table 1. Point-in-time [R]MSE for AVG. TEBA forecast-to-actual: July to Dec. 2020<\/p>\n<p>Refer to the Appendix at the end of this paper for a more detailed month-by-month<\/p>\n<p>breakdown of these calculations.<\/p>\n<p>FALLACY: COMPARING SUM OF TEBA SHIFT TO AVG. TEBA CHANGES<\/p>\n<p>TSAF works best when you accumulate data records by average, not by sum total. If we<\/p>\n<p>tried this exercise using SUM TEBA per month, it would not turn out very well, because sum<\/p>\n<p>totals are immediately impacted by any severe transition, i.e. auditor work re-arrangements<\/p>\n<p>and temporary audit case policy due to COVID-19 fallout as of March 2020.<\/p>\n<p>Evaluating the March 2019: 2024 &#8211; Online Assignment Homework Writing Help Service By Expert Research Writers-2020 comparison in the following table, the TEBA_SUM and<\/p>\n<p>Case Count have dropped significantly in March 2020, yet the C\/AR ratio has augmented.<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>14<\/p>\n<p>Table 2. Year-over-Year March comparison, key macro-variables in TSA<\/p>\n<p>However, as the staffing situation has attempted to stabilize in the intervening months<\/p>\n<p>(April to June 2020), the C\/AR ratio has dropped dramatically. (Not shown in above table.)<\/p>\n<p>The same is true for the TEBA\/AR pattern.<\/p>\n<p>SUM OF TEBA: DRASTIC CHANGE<\/p>\n<p>We now compare the SUM TEBA forecast as of March 2020 (left image) and that of June<\/p>\n<p>2020 known data points (right image).<\/p>\n<p>Figure 22. Comparison of SUM of TEBA forecast as of March vs. as of June (2020)<\/p>\n<p>For the first image, none of the actuals of the last six months of 2020 fall in the forecast<\/p>\n<p>band. Whereas, for the second image, two of the actuals of the last six months (Oct., Nov.)<\/p>\n<p>fall in the forecast band.<\/p>\n<p>Also observe how some of the accumulated data points in the forecast are more \u201cdepressed\u201d<\/p>\n<p>in the latter graph; while there is a discernible peak, it doesn\u2019t quite have the same<\/p>\n<p>buoyancy or upwards momentum as the former graph. (We must keep in mind, though,<\/p>\n<p>that this is still using the MSE method, i.e. taking a line of best fit, where the red dots are<\/p>\n<p>the actual values.)<\/p>\n<p>So, there is little point in using the MSE to gauge efficacy of the monthly adjustment, simply<\/p>\n<p>because the values would be so huge (as opposed to those in the Avg. TEBA MSE).<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>15<\/p>\n<p>ADVERSE IMPACTS AND DELAYED EFFECTS<\/p>\n<p>LATENT EFFECTS OF SHOCKS<\/p>\n<p>We would also expect that lower Avg. TEBA wouldn\u2019t manifest until much later in the fiscal<\/p>\n<p>year 2020-21, due to most of 2020 consisting of past year audits. The graph below covers<\/p>\n<p>known Avg. TEBA trend data points right up to December 2020, the lowest point.<\/p>\n<p>Figure 23. Calendar-year-end (2020) Avg. TEBA; lowest point<\/p>\n<p>This extremely low Average TEBA of ~$32,000 per case could be a harbinger of further<\/p>\n<p>average TEBA decline, but we\u2019d have to observe the last quarter of the fiscal year \u2013 January<\/p>\n<p>to March 2020, once available \u2013 and validate that theory. (Then we might apply an<\/p>\n<p>intervention to the time series line.)<\/p>\n<p>Incidentally, when it comes to SUM of TEBA with actuals up to Dec. 2020, the forecast trend<\/p>\n<p>line for 2021 is far more credible, showing all datapoints as being well under $1 billion, and<\/p>\n<p>mostly under $500 million.<\/p>\n<p>INTERVENTIONS<\/p>\n<p>As alluded to before, a TSAF exercise may use interventions, if the extreme or irregular<\/p>\n<p>event is known in advance (or shortly thereafter). This is an adjustment to the \u201cregular\u201d<\/p>\n<p>time series, using a \u201cdummy\u201d variable for the period of observation. In this case study,<\/p>\n<p>we\u2019d recommend an intervention for the SUM of TEBA as of March 2020, and possibly for<\/p>\n<p>AVG TEBA as of Dec. 2020. Plus, we might use a \u201cpulse effect\u201d for July 2020. However,<\/p>\n<p>programming an intervention requires SAS\u00ae Studio\u2122, which is out of scope for this paper.<\/p>\n<p>Figure 24. Basic denotation of input variables (interventions) by type<\/p>\n<p>Lowest actual in 3 years; Dec. 2020 Avg. TEBA of $32,404<\/p>\n<p>A step would work best as an intervention (for March 2020 and Dec. 2020), since the trend line shift is sudden and sustained; it does not happen gradually then return to baseline.<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>16<\/p>\n<p>TS CORRELATION NODE<\/p>\n<p>AUTOCORRELATION<\/p>\n<p>When we deal with a significant seasonal and\/or trend component, we usually find a greater<\/p>\n<p>degree of autocorrelation factor (abbreviated \u201cACF\u201d). As the name suggests, this is the<\/p>\n<p>tendency of a variable to self-influence. It could also be regarded as momentum, or<\/p>\n<p>\u201cmuscle memory\u201d.<\/p>\n<p>In a similar vein, when frontline auditing teams are performing well, some of that<\/p>\n<p>momentum carries over from one period to the next, as they build \u201cmuscle memory\u201d and<\/p>\n<p>are better-equipped to deal with more trying scenarios that have [abstract] aspects in<\/p>\n<p>common with recent cases worked on. This presents opportunities for \u201cboilerplate\u201d copying<\/p>\n<p>and pasting of common findings from one case to another, adjusting for specifics, and<\/p>\n<p>accelerating average time to complete as well as garnering more average TEBA per case.<\/p>\n<p>Clearly, during the current COVID-19 climate at this writing, and the embargo of SMB case<\/p>\n<p>audit during the spring 2020 period, we can expect some of that momentum to be adversely<\/p>\n<p>impacted \u2013 since auditors were working on more complex large business cases overall. But<\/p>\n<p>first, let us examine a baseline from the years 2018: 2024 &#8211; Write My Essay For Me | Essay Writing Service For Your Papers Online-2019: 2024 &#8211; Online Assignment Homework Writing Help Service By Expert Research Writers, below:<\/p>\n<p>Figure 25. ACF Plot, three key tax-related macro-variables (2018: 2024 &#8211; Write My Essay For Me | Essay Writing Service For Your Papers Online-2019: 2024 &#8211; Online Assignment Homework Writing Help Service By Expert Research Writers)<\/p>\n<p>From the three variables plotted above, Est. TAR-AI (tax-at-risk \u2013 audit issue) has low ACF,<\/p>\n<p>TEBA has moderately high ACF, and Total [Avg. Case] Hours has very high ACF. To wit: at<\/p>\n<p>lag t=5, TEBA reaches the zero line; but Total Hours is still at ACF=0.45.<\/p>\n<p>By stark contrast, in 2020 (below), the ACF for both Avg. TEBA and Case Hours is very<\/p>\n<p>weak overall. In fact, both drop precipitously at the very outset of 2020, just prior to<\/p>\n<p>COVID-19.<\/p>\n<p>Figure 26. ACF Plot, same macro-variables, for 2020<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>17<\/p>\n<p>CCA \u2013 CROSS-CORRELATION ANALYSIS<\/p>\n<p>When we explore lagged effects between risk-related variables \u2013 in this case, TAR (tax-at-<\/p>\n<p>risk) and TEBA (tax earned by audit) \u2013 we would use a CCA plot. We are also considering<\/p>\n<p>Total Hours (on audit cases) here. The plots below are at t=3 months and t=12 months<\/p>\n<p>out, with the influencing variables on the vertical axis, and the influenced variables on the<\/p>\n<p>X-axis. The color shading is somewhat counterintuitive, whereby red means more positively<\/p>\n<p>cross-correlated, and blue means less so. Again, we set a baseline of expectations using<\/p>\n<p>tax data from 2016: 2024 &#8211; Do my homework &#8211; Help write my assignment online to 2019: 2024 &#8211; Online Assignment Homework Writing Help Service By Expert Research Writers (48 months) here.<\/p>\n<p>Figure 27. CCA Map, at time lags 3 and 12, key macro-variables<\/p>\n<p>Note the pronounced difference in CCA factor: for time lag 3, the Estimated TAR has<\/p>\n<p>virtually no effect on TEBA or Total Hours per case (because it\u2019s too close time-wise), but 12<\/p>\n<p>months out (at right) it has a very pronounced effect on total case hours, and a moderate<\/p>\n<p>effect on TEBA (~22%). Also, in the first graph for time lag 3, TEBA highly influences Total<\/p>\n<p>Hours and to a noticeable degree vice-versa too. But when we get to 12 months out, Total<\/p>\n<p>Hours has virtually no lagged effect on TEBA, and vice-versa.<\/p>\n<p>If we repeat the experiment from 2018: 2024 &#8211; Write My Essay For Me | Essay Writing Service For Your Papers Online data up to 2020 (COVID window) data, evaluating<\/p>\n<p>lagged effects of TAR on TEBA for 2020, we find a very different pattern at t=3 and t=12.<\/p>\n<p>For time lag=3, the best we get is ~3% influence; for t=12, it\u2019s absolutely nothing.<\/p>\n<p>Figure 28. CCA Map, at time lags 3 and 12, inclusive of COVID-19 period<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>18<\/p>\n<p>SUBSETTED ANALYSIS<\/p>\n<p>INDUSTRY PROFILING ANALYSIS<\/p>\n<p>Using the same data for CCA, we can subdivide our dataset by industry sector, or NAICS<\/p>\n<p>code. I can set this input to \u201cCross ID\u201d in the data source\u2019s variables list, then re-run the<\/p>\n<p>flow. From the TS Data Prep node\u2019s Results, right-click in the Time Series Plot and select<\/p>\n<p>Data Options. We\u2019ll pick a NAICS code at random. And you can see that it fell at the outset<\/p>\n<p>of COVID, and struggled to regain its footing \u2013 yet exceeding it by calendar year-end.<\/p>\n<p>Figure 29. Industry Profile (NAICS) subsetting of Avg. TEBA in TS Plot (in 2020)<\/p>\n<p>Note that when you have over 100 categorical values \u2013 as in the case of NAICS industry<\/p>\n<p>codes here \u2013 it will only allow you to select from the first 100. In my opinion and<\/p>\n<p>experience, I prefer SAS VIYA when it comes to subsetting TSA by key categories.<\/p>\n<p>BY TSO (TAX SERVICES OFFICE)<\/p>\n<p>So let us examine a subsetting TSA for an under-100 categorical set. I use the TSO, or Tax<\/p>\n<p>Service Office parameter, so again I set the Case_TSO_ID input to \u201cCross ID\u201d at the data<\/p>\n<p>source node. Then I re-run the flow and access the Results.<\/p>\n<p>Figure 30. Tax Services Office (TSO) subsetting of Avg. TEBA in TS Plot (in 2020)<\/p>\n<p>By default, this will display all TSO IDs in the Input TS Plot; so I have to right-click the plot<\/p>\n<p>area and select \u201cData Options\u201d to specify filters (WHERE TSO = 5, 18, or 40). Note that<\/p>\n<p>while all of these TSOs converge at various points, in the month of April we find a very<\/p>\n<p>strange anomaly: TSO 18 has AVG. TEBA =~ $600K, but the other two TSOs have TEBA<\/p>\n<p>just under $10,000. Yet all three of them re-converge later in 2020.<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>19<\/p>\n<p>CONCLUSION<\/p>\n<p>We have seen the power and versatility of SAS\u00ae Enterprise Miner\u2122 for conducting TSAF<\/p>\n<p>exercises. It is clear that not all macro-variables in the Canada Revenue Agency exhibit the<\/p>\n<p>same behaviors or resilience at various points in the turbulent COVID-19 period, but a good<\/p>\n<p>deal of this can be attributed to whether they were pure sum variables, or derived ratio-like<\/p>\n<p>variables. Some disruptions \u2013 prompting the insertion of intervention effects \u2013 were<\/p>\n<p>ostensibly due to policies in place to \u201ctake the edge off\u201d more vulnerable business.<\/p>\n<p>Many of us can also take away abstract learnings from this paper, even if such individuals<\/p>\n<p>are not employed in the tax sector \u2013 because in the end, it is all about maintaining a certain<\/p>\n<p>buoyancy of the macro-variables that matter most, to the extent possible \u2013 these are not<\/p>\n<p>easy times to navigate and we wish those adversely impacted the most clement journey to<\/p>\n<p>a regained prosperity.<\/p>\n<p>REFERENCES<\/p>\n<p>Sarma, Kattamuri S., PhD. Copyright \u00a9 2017. Predictive Modeling with SAS\u00ae Enterprise<\/p>\n<p>Miner\u2122: Practical Solutions for Business Applications, Third Edition. Cary, NC, USA: SAS<\/p>\n<p>Institute, Inc.<\/p>\n<p>ACKNOWLEDGMENTS<\/p>\n<p>I am grateful to my family for their encouragement on this endeavor. I am also grateful to<\/p>\n<p>the numerous staff of the CRA who were the audience in my internal presentation of this<\/p>\n<p>TSAF subject matter. I also acknowledge and admit defeat to the spell checker in insisting<\/p>\n<p>on the spelling of \u201cendeavor\u201d as it is, not like it ought to be as it is on the space shuttle.<\/p>\n<p>Which, unlike CRA time series, must be expected to follow a known trajectory.<\/p>\n<p>RECOMMENDED READING<\/p>\n<p>\uf0b7 Milh\u00f8j, Anders. Practical Time Series Analysis Using SAS\u00ae. Copyright \u00a9 2013, SAS<\/p>\n<p>Institute Inc., Cary, NC, USA.<\/p>\n<p>\uf0b7 Shumway, Robert H. and Stoffer, David S. Time Series Analysis and its Applications. 4th<\/p>\n<p>ed. \u00a9 Springer International Publishing AG, 2017, Univ. of California at Davis. Davis,<\/p>\n<p>CA, USA.<\/p>\n<p>\uf0b7 Brocklebank, John C., Dickey, David A, and Choi, Bong S. SAS\u00ae for Forecasting Time<\/p>\n<p>Series. 3rd ed. Copyright \u00a9 2018: 2024 &#8211; Write My Essay For Me | Essay Writing Service For Your Papers Online, SAS Institute Inc., Cary, NC, USA.<\/p>\n<p>\uf0b7 Svolba, Gerhard. Applying Data Science: Business Case Studies Using SAS\u00ae. Copyright<\/p>\n<p>\u00a9 2017, SAS Institute Inc., Cary, NC, USA.<\/p>\n<p>CONTACT INFORMATION<\/p>\n<p>Your comments and questions are valued and encouraged. Contact the author at:<\/p>\n<p>Jason A. Oliver, Senior Compliance Analyst &amp; Data Scientist<\/p>\n<p>Canada Revenue Agency<\/p>\n<p>Jason.oliver@cra-arc.gc.ca<\/p>\n<p>mailto:Jason.oliver@cra-arc.gc.ca<br \/>\nUNCLASSIFIED<\/p>\n<p>20<\/p>\n<p>APPENDIX: TABLES OF ACTUAL-TO-FORECAST ANALYSIS<\/p>\n<p>This contains detailed breakdowns of the incremental monthly additions of accumulated<\/p>\n<p>data to the COVID-19 observation window.<\/p>\n<p>AVERAGE TEBA<\/p>\n<p>This begins with Average TEBA, being subject to both MSE and RMSE (Mean Squared<\/p>\n<p>Error, and Root Mean Squared Error).<\/p>\n<p>At this juncture, between April and May 2020 known data, the MSE \/ RMSE actually<\/p>\n<p>regresses slightly, telling us that we might as well have gone straight to June 2020\u2019s data.<\/p>\n<p>In the end, this substantiates our earlier findings, that because Average TEBA is in essence<\/p>\n<p>a ratio variable and more resilient to initial COVID window \u2013 especially since it is predicated<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>21<\/p>\n<p>on audits of past year\u2019s tax filings \u2013 there was no real near-future benefit to forecast<\/p>\n<p>alignment based on incremental monthly additions for spring.<\/p>\n<p>C\/AR RATIO<\/p>\n<p>This, once again, is the Cases [Completed] to Action Requests [Submitted] ratio. Here I<\/p>\n<p>break down the monthly forecast measure, using MSE (no RMSE), of the last six months of<\/p>\n<p>calendar year 2020 and incrementing known months from March up to June. For March to<\/p>\n<p>May, I include the spring months not yet arrived at in each incremental forecast.<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>22<\/p>\n<p>From adding April known data, the forecast actually worsens; this is arguably due to having<\/p>\n<p>been accustomed to high C\/AR values for so long. It is not until we add MAY that it becomes<\/p>\n<p>more realistic.<\/p>\n<p>Given this extremely low MSE value, brought on by the actual 2.57 C\/AR value of May, we<\/p>\n<p>have reached the optimum point \u2013 as evidenced by adding June to known values:<\/p>\n<p>CASE HOURS<\/p>\n<p>Lastly, in speaking to Hours per [audit] case forecast, I provide a condensed analysis using<\/p>\n<p>a simplified MAE [Mean Absolute Error] criterion.<\/p>\n<p>\uf0b7 As of March 2020; forecast of April to Dec. 2020: MAE = 78.52<\/p>\n<p>\uf0b7 As of April 2020; forecast of May to Dec. 2020: MAE = 95.83<\/p>\n<p>\uf0b7 As of May 2020; forecast of June to Dec. 2020: MAE = 107.99<\/p>\n<p>\uf0b7 As of June 2020; forecast of July to Dec. 2020: MAE = 71.51<\/p>\n<p>So, all in all, this proved a very difficult variable to effectively forecast.<\/p>\n<p>Applied Sciences<br \/>\nArchitecture and Design<br \/>\nBiology<br \/>\nBusiness &amp; Finance<br \/>\nChemistry<br \/>\nComputer Science<br \/>\nGeography<br \/>\nGeology<br \/>\nEducation<br \/>\nEngineering<br \/>\nEnglish<br \/>\nEnvironmental science<br \/>\nSpanish<br \/>\nGovernment<br \/>\nHistory<br \/>\nHuman Resource Management<br \/>\nInformation Systems<br \/>\nLaw<br \/>\nLiterature<br \/>\nMathematics<br \/>\nNursing<br \/>\nPhysics<br \/>\nPolitical Science<br \/>\nPsychology<br \/>\nReading<br \/>\nScience<br \/>\nSocial Science<br \/>\nHome<br \/>\nHomework Answers<br \/>\nBlog<br \/>\nArchive<br \/>\nTags<br \/>\nReviews<br \/>\nContact<br \/>\ntwitterfacebook<\/p>\n<p>1<\/p>\n<p>Paper 1047-2021<\/p>\n<p>SAS\u00ae Time Series Analysis &amp; Forecasting<\/p>\n<p>(TSAF) at the Canada Revenue Agency (CRA), with COVID impacts<\/p>\n<p>Jason A. Oliver, Senior Compliance Analyst, Canada Revenue Agency (CRA)<\/p>\n<p>ABSTRACT<\/p>\n<p>It may well be a recurring theme of this year&#8217;s SAS Global Forum that we are faced with more pressure to use flexible thinking &#8211; not just critical thinking &#8211; and when it comes to<\/p>\n<p>time series analysis and forecasting (TSAF) in SAS, it&#8217;s all about &#8220;rethinking the curve&#8221;.<\/p>\n<p>At the Canada Revenue Agency (CRA) Compliance Programs Branch (CPB), we have grappled with reliable forecasting for macro-level tax variables on a month-to-month basis, even before the COVID-19 pandemic hit. But now we face a particularly difficult<\/p>\n<p>challenge. As with many large organizations, it is not easy to foretell what the fallout may be from such a cataclysm.<\/p>\n<p>In setting up SAS to right the trajectory, we must be extra cautious about some of the fallacies in applying TSAF in this context: the lagged effect for tax revenues realized based on audits of the previous tax year, the need to differentiate average tax recovery<\/p>\n<p>per case from sum of tax recovery (month-to-month), realizing that industry sectors are not &#8220;one size fits all&#8221;, and accounting for relatively temporary effects of staffing re-<\/p>\n<p>orientation in the conversion to a virtual workplace versus the more enduring effects of business disruptions. With SAS Enterprise Miner&#8217;s abilities to continuously adjust forecasts, sub-categorize datapoints by tax office or industry sector, and apply lagged<\/p>\n<p>cross-correlation analysis, we are suitably equipped with the right tools and this can provide abstract learnings for other large organizations.<\/p>\n<p>INTRODUCTION<\/p>\n<p>The Canada Revenue Agency (CRA) is Canada\u2019s federal tax administration. As with all tax<\/p>\n<p>jurisdictions, the CRA has been challenged to keep pace with COVID-19 shocks and<\/p>\n<p>manifestations, which began in March 2020 (the last month of our fiscal year).<\/p>\n<p>Fortunately, SAS\u00ae Enterprise Miner\u2122 has been an invaluable aid in gauging these impacts.<\/p>\n<p>Enterprise Miner\u2122 includes a highly versatile set of functional nodes for configuring and<\/p>\n<p>processing time series data. It can decompose time series components such as seasonality<\/p>\n<p>and trend, show trend lines and expected forecast within configurable prediction intervals,<\/p>\n<p>and demonstrate complex correlation analyses.<\/p>\n<p>While this has been of great benefit to the CRA in gauging the trajectory of macro-variables<\/p>\n<p>related to tax revenues and auditor performance, the findings of this research paper could<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>2<\/p>\n<p>conceivably be applied in the abstract to large organizations with process-oriented<\/p>\n<p>functions, and not just to other foreign tax jurisdictions.<\/p>\n<p>Let us provide a Glossary of terms to set the stage:<\/p>\n<p>\uf0b7 TSAF: Time Series Analysis &amp; Forecasting.<\/p>\n<p>\uf0b7 TEBA: tax earned by audit, which is the amount of tax collectible that is agreed upon in the course of a taxpayer audit. It is in NPV (Net Present Value).<\/p>\n<p>\uf0b7 TAR: the tax-at-risk, which is the amount that CRA risk assessors arrive at as the precursor to auditing activity.<\/p>\n<p>\uf0b7 C\/AR ratio: the ratio of [audit] cases completed, to action requests [submitted]<\/p>\n<p>for assistance. It is a tentative measure of auditor productivity.<\/p>\n<p>\uf0b7 Integras: the tool used by CRA auditors to process cases.<\/p>\n<p>TIME SERIES FUNCTIONAL NODES &amp; SETUP<\/p>\n<p>In SAS\u00ae Enterprise Miner\u2122, you have six TSAF nodes in the \u201cTime Series\u201d ribbon; but we\u2019re<\/p>\n<p>only going to use four of them. Below is the Time Series ribbon with the functional nodes in<\/p>\n<p>question:<\/p>\n<p>Figure 1. Time Series Functional Nodes<\/p>\n<p>\uf0b7 TS Data Preparation: this node allows you to specify basic time series properties<\/p>\n<p>including interval, cycle, start\/end time, and accumulation (i.e. by total, min or max,<\/p>\n<p>mean, etc.)<\/p>\n<p>o Below, the interval is \u201cautomatic\u201d, so we specify \u201cMonth\u201d as the interval.<\/p>\n<p>o We can leave the seasonal cycle and start\/end time as \u201cDefault\u201d, as SAS\u00ae<\/p>\n<p>Enterprise Miner\u2122 will auto-determine these parts from the data.<\/p>\n<p>o In our case, the data was pre-accumulated in SAS\u00ae Enterprise Guide\u2122 row-<\/p>\n<p>by-row on a per-month basis, so we can leave Accumulation = \u201cTotal\u201d (else,<\/p>\n<p>we would have to set it \u201cAverage\u201d).<\/p>\n<p>Figure 2. TS Data Preparation node \u2013 basic properties<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>3<\/p>\n<p>\uf0b7 TS Decomposition: this node allows you to specify similar basic settings to that of<\/p>\n<p>the TS Data Prep node, but the Number of Periods can be configured, and moreover,<\/p>\n<p>you can configure which Export Components you want to display.<\/p>\n<p>o By default, it will only display \u201cTrend-Cycle\u201d component (=Yes), which is<\/p>\n<p>generally regarded as the most salient one.<\/p>\n<p>o However, in our case, we want to view ALL Components, so we would set that<\/p>\n<p>value to \u201cYes\u201d.<\/p>\n<p>Figure 3. TS Decomposition node \u2013properties<\/p>\n<p>TS Correlation: this node allows you to set up your TSA for autocorrelation analysis, or<\/p>\n<p>alternatively for CCA (Cross-correlation analysis). When you select one of those methods,<\/p>\n<p>the other one\u2019s properties will be greyed out.<\/p>\n<p>Figure 4. TS Correlation node \u2013properties<\/p>\n<p>Both the TS Correlation and TS Decomposition nodes must be preceded by a TS Data<\/p>\n<p>Preparation node (which occurs right after the source data node).<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>4<\/p>\n<p>TS Exponential Smoothing: this node allows you to conduct forecasting based on your<\/p>\n<p>known data; as such, you would connect it to a TS Data Preparation node, not directly to<\/p>\n<p>your source data node.<\/p>\n<p>\uf0b7 The interval is automatic (which will be month in the case of our pre-accumulated<\/p>\n<p>data), and the accumulation defaults to \u201cTotal\u201d (which is OK in our case, for the<\/p>\n<p>same reason).<\/p>\n<p>\uf0b7 SAS will pick what it deems to be the best forecasting method.<\/p>\n<p>\uf0b7 The default selection criterion is MSE, or Mean Squared Error.<\/p>\n<p>\uf0b7 We will see more on the Forecast lead, back, and significance level parameters<\/p>\n<p>during the forecast demonstration in this paper.<\/p>\n<p>Figure 5. TS Exponential Smoothing node \u2013properties<\/p>\n<p>For our initial workspace setup, we can scrutinize on the C\/AR (Case to Action Request)<\/p>\n<p>ratio, which as per our glossary is a tentative measure of tax auditor performance. The<\/p>\n<p>initial diagram workspace is called \u201cAggreg_Integras_27mths\u201d, which runs from January<\/p>\n<p>2018: 2024 &#8211; Write My Essay For Me | Essay Writing Service For Your Papers Online to March 2020. This is arranged this way for a reason: because it ends on the month<\/p>\n<p>of the COVID shutdown.<\/p>\n<p>Our dataset name is \u201cTSA_AGGREG_SINGLE_LINE_27MTHS\u201d.<\/p>\n<p>So, when I bring this in, I need to set all variables to Role = \u201cRejected\u201d except a) C\/AR ratio<\/p>\n<p>and b) my MONTH (Time ID) variable.<\/p>\n<p>Figure 6. Variable Role selection from data source<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>5<\/p>\n<p>You would set your variables once you bring the data source to your diagram (workspace).<\/p>\n<p>Figure 7. TS Data Source to Diagram flow<\/p>\n<p>NOTE: I do not cover the mechanics behind bringing in a data source, as the principal focus<\/p>\n<p>is on conducting TSAF in SAS\u00ae Enterprise Miner\u2122. All we need to be concerned with is that<\/p>\n<p>as Data Sources become available in the top-left menu, we can drag-and-drop them to our<\/p>\n<p>diagram workspace (which are also created by right-clicking \u2018Diagrams\u2019 in the left panel).<\/p>\n<p>In examining the TS Data Preparation node, it is fairly simple: we see the known trajectory of the C\/AR variable, simply by right-clicking the node \uf0e0 Run \uf0e0 Results.<\/p>\n<p>Figure 8. Time Series Plot, for C\/AR ratio variable<\/p>\n<p>We can see that the C\/AR ratio has fallen off as of mid-2018: 2024 &#8211; Write My Essay For Me | Essay Writing Service For Your Papers Online, and continued on a very<\/p>\n<p>gradual downward path. Which means that case auditors are completing disproportionately<\/p>\n<p>less cases to the action requests they submit for help, albeit with a seasonal factor and<\/p>\n<p>some rebounding of the trend-line in March 2020.<\/p>\n<p>So, we can scrutinize on the more specific components of the time series line by using a TS<\/p>\n<p>Decomposition node.<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>6<\/p>\n<p>DECOMPOSITION OF TIME SERIES<\/p>\n<p>In running our TS Decomposition node, and viewing the results, the first one to examine<\/p>\n<p>is the Seasonal Component Plot. When it comes to the C\/AR ratio, the seasonal index range<\/p>\n<p>is between a high of about 1.3 down to about 0.75.<\/p>\n<p>Figure 9. Seasonal Component Plot, for C\/AR ratio variable<\/p>\n<p>During the months of March and December, we see fairly high seasonality. This is normal<\/p>\n<p>for the time, since the push to complete cases is higher at the end of the CRA fiscal year<\/p>\n<p>(March), and ostensibly at the end of the calendar year, also. Auditors are completing<\/p>\n<p>proportionally more cases vs. the number of action requests they submit to the service<\/p>\n<p>desk. So it is likely that they are fulfilling cases that do not require as many interventions<\/p>\n<p>during those months. Even in March 2020, C\/AR still remained high \u2013 it was<\/p>\n<p>resilient to the initial COVID effects, due to being a ratio variable and not an absolute<\/p>\n<p>sum variable.<\/p>\n<p>In the decomposed results, we can also examine combinatory components; for instance, the<\/p>\n<p>Trend-Cycle Component Plot:<\/p>\n<p>Figure 10. Trend-Cycle Component Plot, for C\/AR ratio variable<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>7<\/p>\n<p>This tells us what we had surmised from the initial data preparation, that the series has<\/p>\n<p>been on a steadily downwards trajectory. Now when it comes to tax-related time series<\/p>\n<p>data, there is no real cycle per se; at best, it is an inherited cycle from world economy<\/p>\n<p>fluctuations. The proper definition of cycle in a TSA context is not the entity\u2019s operational<\/p>\n<p>lifecycle; rather, it refers to the boom-and-bust business cycles which are largely<\/p>\n<p>unpredictable. Ergo, we are mainly concerned about trend here.<\/p>\n<p>Now, if we substitute the Average TEBA (tax earned by audit) variable for C\/AR [using the<\/p>\n<p>Data Source node shown in figure 6 earlier], we can see what emerges in our decomposed<\/p>\n<p>time series results.<\/p>\n<p>Figure 11. Paneled Component Plots, TS Decomp. for Avg. TEBA<\/p>\n<p>This time, as per the panel graph at bottom-left, we see that our seasonality index is<\/p>\n<p>broader than that of C\/AR ratio; it goes from a high of about 1.8 to a low of ~0.7. This is<\/p>\n<p>largely attributable to the heightened pressures towards fiscal year-end to increase<\/p>\n<p>realization of TEBA, which we see in Feb.-March. At the opposite end, we see rather low<\/p>\n<p>seasonality for May, August, and November.<\/p>\n<p>For the original series plot, bottom-right, the trend continues gradually upwards with<\/p>\n<p>seasonality readily apparent. In the trend-cycle component plot, at top-left, we see that the<\/p>\n<p>trend (with cycle, such as it is) is rising steadily upwards but then reaches a virtual plateau.<\/p>\n<p>The key challenge then, has been to resolve and reconcile the expected forecast as of March<\/p>\n<p>2020 with the new COVID-19 realities.<\/p>\n<p>FORECASTING MACRO TAX VARIABLES<\/p>\n<p>AVERAGE TEBA<\/p>\n<p>We can proceed to evaluate the expected trajectory of the AVG. TEBA variable, on a<\/p>\n<p>monthly interval. Recall that this variable is pre-accumulated at data source.<\/p>\n<p>When we conduct our forecast, we use the TS Exponential Smoothing node.<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>8<\/p>\n<p>Figure 12. TS Exponential Smoothing node in the TSAF diagram<\/p>\n<p>We let SAS\u00ae pick the best forecasting method, as well as selection criterion (forecast<\/p>\n<p>measure). In this case, the latter value is the MSE [Mean Squared Error] as you can see at<\/p>\n<p>the bottom of the properties of the node.<\/p>\n<p>Figure 13. Properties of the TS Exponential Smoothing node<\/p>\n<p>For our Significance Level, we set this to 0.5; it governs the blue bracket around the<\/p>\n<p>forecast line, a.k.a. the prediction interval. So it is a confidence band of sorts. The way this<\/p>\n<p>figure works is the opposite of what some of us might know from frequentist confidence<\/p>\n<p>intervals; that is, the lower the \u201calpha\u201d value, the wider the band (prediction interval) so an<\/p>\n<p>\u201calpha\u201d of 0.01 would produce a very wide band, and an \u201calpha\u201d value = 0.99 would be<\/p>\n<p>virtually limited to just the forecast line itself. So we aim in the middle (which actually is<\/p>\n<p>closer to the outline of the trend line, as this figure is more \u201clog-like\u201d in its manifestation).<\/p>\n<p>Figure 14. TEBA_NPV_Mean: forecast line from trend<\/p>\n<p>SAS logically expects the trend will continue upwards (while maintaining seasonality, of<\/p>\n<p>course) due to \u201cseries momentum\u201d. Had we began our time series at, say, January 2016: 2024 &#8211; Do my homework &#8211; Help write my assignment online<\/p>\n<p>rather than Jan. 2018: 2024 &#8211; Write My Essay For Me | Essay Writing Service For Your Papers Online, that momentum might have been more pronounced. The clich\u00e9s of<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>9<\/p>\n<p>\u201cfuture behavior is governed by past behavior\u201d and \u201cyou can\u2019t know where you\u2019re going,<\/p>\n<p>unless you know where you\u2019ve been\u201d have never been truer. However, enter COVID-19,<\/p>\n<p>and that is a whole new wrench in the gears of the tax-auditing apparatus.<\/p>\n<p>As for the selection of \u201cBest\u201d Forecasting Method: you could try to experiment with<\/p>\n<p>different models \u2013 there are eight in all, as per fundamental TSAF science \u2013 but I can tell<\/p>\n<p>from the shape of the forecast line that it\u2019s based, appropriately, on the Additive Winters<\/p>\n<p>method1. I ascertained this by running the node with this method selected, and the<\/p>\n<p>resulting graph was identical to \u201cbest\u201d method. Unlike the Multiplicative Winters method,<\/p>\n<p>this forecast line is predicated on fairly consistent seasonal \u201cinverted V\u201d shapes in the curve.<\/p>\n<p>If those inverted V shapes became noticeable larger (or smaller), then Multiplicative Winters<\/p>\n<p>would likely be the \u201cbest\u201d method that SAS would auto-select.<\/p>\n<p>Figure 15. Available Forecasting Methods, properties of TS Exp. Smoothing node<\/p>\n<p>We see that in the resulting forecast, it predicts ahead exactly 12 months. This is the<\/p>\n<p>difference between the figures of \u201cForecast Lead\u201d and \u201cForecast Back\u201d in the properties. We<\/p>\n<p>saw on the previous page that the \u201cForecast Back\u201d = 6; this acts as our validation partition,<\/p>\n<p>using the last six months of known data (i.e. Oct. 2019: 2024 &#8211; Online Assignment Homework Writing Help Service By Expert Research Writers to March 2020). So this gets<\/p>\n<p>subtracted from the \u201cForecast Back\u201d value of 18 to arrive at 12 periods out. Ideally, you<\/p>\n<p>want your \u201cback\u201d [validation] period to be between 20-25% of your known data, which it is<\/p>\n<p>out of 27 months; even when we increase the known months to 30, it will still be 20% of<\/p>\n<p>this.<\/p>\n<p>SUM OF TEBA<\/p>\n<p>When we run a TSAF experiment on the SUM of TEBA \u2013 as opposed to its average \u2013 we<\/p>\n<p>realize a drastic difference in the scale. Because TEBA is a sum value, not a ratio (i.e.<\/p>\n<p>C\/AR, or [Average] TEBA\/case), it is simply not as resilient to sudden shocks like COVID-19<\/p>\n<p>\u2013 as we will later see when adjusting the forecast based on incremental months (April, May,<\/p>\n<p>June) of known values.<\/p>\n<p>1 The essence of the Winters method is to combine discernible trend with seasonality.<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>10<\/p>\n<p>Figure 16. TEBA SUM Forecast (post-March 2020)<\/p>\n<p>Note that the MSE selection criterion (default) graphs a trend line around the known values<\/p>\n<p>(which are represented by the red dots here). The SUM TEBA for Feb. 2020 is nearly double<\/p>\n<p>what it was for March 2020, as you can see by the relatively large separation of the red dots<\/p>\n<p>from the blue dots (on trendline) for those two months. Yet SAS\u00ae \u201cthinks\u201d that the trend<\/p>\n<p>will continue positively, as it is \u201cCOVID-agnostic\u201d.<\/p>\n<p>What may also seem shocking to the reader is that the lower limit of the prediction interval<\/p>\n<p>for April 2020 (at ~$674.5M) actually exceeds the actual value for April 2019: 2024 &#8211; Online Assignment Homework Writing Help Service By Expert Research Writers, which was<\/p>\n<p>slightly below $500 million. It is not until the fall until we see that the midpoint of actual<\/p>\n<p>2019: 2024 &#8211; Online Assignment Homework Writing Help Service By Expert Research Writers data approximates the LCL (lower confidence limit) of the forecasted band for Sept.<\/p>\n<p>2020. This is ostensibly due to the \u201cpositive momentum\u201d of the time series that I alluded to<\/p>\n<p>earlier.<\/p>\n<p>C\/AR RATIO<\/p>\n<p>Next, we switch out the SUM of TEBA for the C\/AR ratio, once again. In forecasting a<\/p>\n<p>relatively low continuous ratio variable such as C\/AR, the prediction interval can be less<\/p>\n<p>reliable. We have to examine the midpoint distribution. While the midpoint post-March<\/p>\n<p>2020 tends to be at or above the 10.0 line, this is rare for 2019: 2024 &#8211; Online Assignment Homework Writing Help Service By Expert Research Writers datapoints.<\/p>\n<p>Figure 17. C\/AR ratio Forecast<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>11<\/p>\n<p>I used the Mean Relative Abs. Error as the forecast metric (selection criterion), which I<\/p>\n<p>found to be more appropriate. Regardless, what we see in the actuals for the spring of 2020<\/p>\n<p>is a very low C\/AR ratio, telling us that case throughput has suffered as a result of the<\/p>\n<p>pandemic AND that Action Requests for help did not decline proportionally; there was still<\/p>\n<p>an apparent high need for action requests.<\/p>\n<p>FORECASTING AVG. HOURS PER CASE<\/p>\n<p>For forecasting average hours per [audit] case, I determined that the more ideal Selection<\/p>\n<p>Criterion was \u201cMedian Relative Abs. Error\u201d. No matter what Selection Criterion I used (or<\/p>\n<p>Significance Level), the prediction interval still dipped into the negative range. Sometimes,<\/p>\n<p>this is unavoidable. But then the prediction interval becomes spurious; you can\u2019t have<\/p>\n<p>negative hours. So we tend to just focus on the midpoint values in this situation.<\/p>\n<p>Figure 18. Average hours per case Forecast<\/p>\n<p>We can see that the midpoint goes very subtly upwards for the first few forecasted points<\/p>\n<p>(post-March 2020), then sharply up for summer. As it turns out, this is a fairly good<\/p>\n<p>approximation of the reality, since the Avg. Hours per case during the middle of 2020 is<\/p>\n<p>about 1.5-2.0 times that of the previous year. What is especially pronounced is that the<\/p>\n<p>Average Hours of March 2019: 2024 &#8211; Online Assignment Homework Writing Help Service By Expert Research Writers were only 6.25, whereas for March 2020, it was 35.44. This<\/p>\n<p>was predicated on an Agency policy-induced change; refer to the link and passage below:<\/p>\n<p>https:\/\/www.mondaq.com\/canada\/audit\/1030308\/cra-moves-forward-with-international-audits- despite-continued-backlog-?email_access=on In March 2020, the CRA announced that it was suspending the vast majority of audit activity for a<\/p>\n<p>minimum of four weeks, other than audits involving the very largest taxpayers. This suspension meant<\/p>\n<p>that the CRA ceased requests for information relating to existing audits, finalizing existing audits, and<\/p>\n<p>issuing reassessments. Further, deadlines for information or document requests were suspended and no<\/p>\n<p>action was required from taxpayers under audit during this time. This suspension remained in effect until<\/p>\n<p>June 2020, though audits of small and medium businesses did not resume until late fall.<\/p>\n<p>This is also arguably responsible for the \u201cpulse\u201d effect we see in actual Avg. TEBA for July<\/p>\n<p>2020, as per the monthly incremental analysis that comes next.<\/p>\n<p>https:\/\/www.mondaq.com\/canada\/audit\/1030308\/cra-moves-forward-with-international-audits-despite-continued-backlog-?email_access=on<br \/>\nhttps:\/\/www.mondaq.com\/canada\/audit\/1030308\/cra-moves-forward-with-international-audits-despite-continued-backlog-?email_access=on<br \/>\nUNCLASSIFIED<\/p>\n<p>12<\/p>\n<p>INCREMENTAL ALIGNMENT<\/p>\n<p>APRIL 2020, KNOWN VALUES<\/p>\n<p>Now when we add the month of April 2020 to our data (making it 28 mths total), we would<\/p>\n<p>expect the AVG. TEBA actuals for subsequent months to become closer to \/ within forecast<\/p>\n<p>range. As an example in the graph cross-section that follows, the forecast for September,<\/p>\n<p>October, and December 2020 becomes more within range of later-known actuals, once we<\/p>\n<p>add April 2020 data. However, the July 2020 actual (~$122,000) is still above the forecast<\/p>\n<p>band for this incremental dataset\u2019s forecast. This was likely due to the resumption of<\/p>\n<p>standard large business audit as of June 2020 (see previous page article\/passage).<\/p>\n<p>Figure 19. Revised AVG. TEBA forecast, incremental inclusion of APRIL 2020<\/p>\n<p>Again, we typically use the measure of MSE [Mean Squared Error] in gauging efficacy or<\/p>\n<p>proximity of a forecast to actual [values]. See the Appendix tables at the end of this paper<\/p>\n<p>for a breakdown of this analysis, where I illustrate monthly incremental effect on accuracy<\/p>\n<p>of the last six months of the calendar year (i.e. from July to Dec. 2020).<\/p>\n<p>MAY 2020, KNOWN VALUES<\/p>\n<p>Clearly, the addition of April wasn\u2019t enough to right the trajectory of the expanding \u201cCOVID<\/p>\n<p>window\u201d. So in continuing our analysis of monthly incremental effect, I added May 2020\u2019s<\/p>\n<p>known data and I changed the forecast significance level from 0.5 to 0.25. But it makes no<\/p>\n<p>difference: July actual is still out of forecast range. We must simply accept that July 2020<\/p>\n<p>Avg. TEBA is an irregular value (~$122K), since July 2018: 2024 &#8211; Write My Essay For Me | Essay Writing Service For Your Papers Online had Avg. TEBA =~$45K, and July<\/p>\n<p>2019: 2024 &#8211; Online Assignment Homework Writing Help Service By Expert Research Writers\u2019s Avg. TEBA was ~$57K. It is clear that this is a COVID-adjustment spike.<\/p>\n<p>Figure 20. Revised AVG. TEBA forecast, incremental inclusion of MAY 2020<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>13<\/p>\n<p>We can therefore define July 2020 as a pulse, or a one-time brief event, that caused a<\/p>\n<p>spike in the accumulated time series value for that month. This emphasis on larger<\/p>\n<p>business for audit while suspending SMB audits at the time is further substantiated by the<\/p>\n<p>fact that in July 2020, there was an average of 50.75 hrs per case completed, which is<\/p>\n<p>extremely high. For April, which had a very high Average TEBA of $185.5K, the figure was<\/p>\n<p>52.16 average hours per case.<\/p>\n<p>JUNE 2020, KNOWN VALUES<\/p>\n<p>Predictably, for the addition of June 2020, it didn\u2019t improve the forecast band to include the<\/p>\n<p>actual Avg. TEBA for July. So this strengthens the theory that July\u2019s value was a one-time<\/p>\n<p>event, or pulse, in the time series. It also strengthens the theory that Avg. TEBA was more<\/p>\n<p>resilient to initial COVID-19 transition measures (being a ratio value, in essence). To wit:<\/p>\n<p>observe below that the April-May-June line for the original forecast (left) and actual data<\/p>\n<p>points (right) is just above the $50K line, and follows the same trajectory.<\/p>\n<p>Figure 21. Comparing Q1 of FY2020-21 forecast vs. actual data points<\/p>\n<p>In taking MSE and RMSE (R is \u201croot\u201d) measurements for both the as-of-March and as-of-<\/p>\n<p>June forecasts, we only note a slight improvement (reduction) in that value. Which also<\/p>\n<p>goes to show the resilience of this variable, and the \u201cpulse\u201d nature of July\u2019s spike.<\/p>\n<p>MEASURE \/ as of MONTH MARCH 2020 JUNE 2020<\/p>\n<p>AVG. TEBA (MSE) $ 954,467,257.64 $ 888,454,004.34<\/p>\n<p>RMSE $ 30,894.45 $ 29,806.95<\/p>\n<p>Table 1. Point-in-time [R]MSE for AVG. TEBA forecast-to-actual: July to Dec. 2020<\/p>\n<p>Refer to the Appendix at the end of this paper for a more detailed month-by-month<\/p>\n<p>breakdown of these calculations.<\/p>\n<p>FALLACY: COMPARING SUM OF TEBA SHIFT TO AVG. TEBA CHANGES<\/p>\n<p>TSAF works best when you accumulate data records by average, not by sum total. If we<\/p>\n<p>tried this exercise using SUM TEBA per month, it would not turn out very well, because sum<\/p>\n<p>totals are immediately impacted by any severe transition, i.e. auditor work re-arrangements<\/p>\n<p>and temporary audit case policy due to COVID-19 fallout as of March 2020.<\/p>\n<p>Evaluating the March 2019: 2024 &#8211; Online Assignment Homework Writing Help Service By Expert Research Writers-2020 comparison in the following table, the TEBA_SUM and<\/p>\n<p>Case Count have dropped significantly in March 2020, yet the C\/AR ratio has augmented.<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>14<\/p>\n<p>Table 2. Year-over-Year March comparison, key macro-variables in TSA<\/p>\n<p>However, as the staffing situation has attempted to stabilize in the intervening months<\/p>\n<p>(April to June 2020), the C\/AR ratio has dropped dramatically. (Not shown in above table.)<\/p>\n<p>The same is true for the TEBA\/AR pattern.<\/p>\n<p>SUM OF TEBA: DRASTIC CHANGE<\/p>\n<p>We now compare the SUM TEBA forecast as of March 2020 (left image) and that of June<\/p>\n<p>2020 known data points (right image).<\/p>\n<p>Figure 22. Comparison of SUM of TEBA forecast as of March vs. as of June (2020)<\/p>\n<p>For the first image, none of the actuals of the last six months of 2020 fall in the forecast<\/p>\n<p>band. Whereas, for the second image, two of the actuals of the last six months (Oct., Nov.)<\/p>\n<p>fall in the forecast band.<\/p>\n<p>Also observe how some of the accumulated data points in the forecast are more \u201cdepressed\u201d<\/p>\n<p>in the latter graph; while there is a discernible peak, it doesn\u2019t quite have the same<\/p>\n<p>buoyancy or upwards momentum as the former graph. (We must keep in mind, though,<\/p>\n<p>that this is still using the MSE method, i.e. taking a line of best fit, where the red dots are<\/p>\n<p>the actual values.)<\/p>\n<p>So, there is little point in using the MSE to gauge efficacy of the monthly adjustment, simply<\/p>\n<p>because the values would be so huge (as opposed to those in the Avg. TEBA MSE).<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>15<\/p>\n<p>ADVERSE IMPACTS AND DELAYED EFFECTS<\/p>\n<p>LATENT EFFECTS OF SHOCKS<\/p>\n<p>We would also expect that lower Avg. TEBA wouldn\u2019t manifest until much later in the fiscal<\/p>\n<p>year 2020-21, due to most of 2020 consisting of past year audits. The graph below covers<\/p>\n<p>known Avg. TEBA trend data points right up to December 2020, the lowest point.<\/p>\n<p>Figure 23. Calendar-year-end (2020) Avg. TEBA; lowest point<\/p>\n<p>This extremely low Average TEBA of ~$32,000 per case could be a harbinger of further<\/p>\n<p>average TEBA decline, but we\u2019d have to observe the last quarter of the fiscal year \u2013 January<\/p>\n<p>to March 2020, once available \u2013 and validate that theory. (Then we might apply an<\/p>\n<p>intervention to the time series line.)<\/p>\n<p>Incidentally, when it comes to SUM of TEBA with actuals up to Dec. 2020, the forecast trend<\/p>\n<p>line for 2021 is far more credible, showing all datapoints as being well under $1 billion, and<\/p>\n<p>mostly under $500 million.<\/p>\n<p>INTERVENTIONS<\/p>\n<p>As alluded to before, a TSAF exercise may use interventions, if the extreme or irregular<\/p>\n<p>event is known in advance (or shortly thereafter). This is an adjustment to the \u201cregular\u201d<\/p>\n<p>time series, using a \u201cdummy\u201d variable for the period of observation. In this case study,<\/p>\n<p>we\u2019d recommend an intervention for the SUM of TEBA as of March 2020, and possibly for<\/p>\n<p>AVG TEBA as of Dec. 2020. Plus, we might use a \u201cpulse effect\u201d for July 2020. However,<\/p>\n<p>programming an intervention requires SAS\u00ae Studio\u2122, which is out of scope for this paper.<\/p>\n<p>Figure 24. Basic denotation of input variables (interventions) by type<\/p>\n<p>Lowest actual in 3 years; Dec. 2020 Avg. TEBA of $32,404<\/p>\n<p>A step would work best as an intervention (for March 2020 and Dec. 2020), since the trend line shift is sudden and sustained; it does not happen gradually then return to baseline.<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>16<\/p>\n<p>TS CORRELATION NODE<\/p>\n<p>AUTOCORRELATION<\/p>\n<p>When we deal with a significant seasonal and\/or trend component, we usually find a greater<\/p>\n<p>degree of autocorrelation factor (abbreviated \u201cACF\u201d). As the name suggests, this is the<\/p>\n<p>tendency of a variable to self-influence. It could also be regarded as momentum, or<\/p>\n<p>\u201cmuscle memory\u201d.<\/p>\n<p>In a similar vein, when frontline auditing teams are performing well, some of that<\/p>\n<p>momentum carries over from one period to the next, as they build \u201cmuscle memory\u201d and<\/p>\n<p>are better-equipped to deal with more trying scenarios that have [abstract] aspects in<\/p>\n<p>common with recent cases worked on. This presents opportunities for \u201cboilerplate\u201d copying<\/p>\n<p>and pasting of common findings from one case to another, adjusting for specifics, and<\/p>\n<p>accelerating average time to complete as well as garnering more average TEBA per case.<\/p>\n<p>Clearly, during the current COVID-19 climate at this writing, and the embargo of SMB case<\/p>\n<p>audit during the spring 2020 period, we can expect some of that momentum to be adversely<\/p>\n<p>impacted \u2013 since auditors were working on more complex large business cases overall. But<\/p>\n<p>first, let us examine a baseline from the years 2018: 2024 &#8211; Write My Essay For Me | Essay Writing Service For Your Papers Online-2019: 2024 &#8211; Online Assignment Homework Writing Help Service By Expert Research Writers, below:<\/p>\n<p>Figure 25. ACF Plot, three key tax-related macro-variables (2018: 2024 &#8211; Write My Essay For Me | Essay Writing Service For Your Papers Online-2019: 2024 &#8211; Online Assignment Homework Writing Help Service By Expert Research Writers)<\/p>\n<p>From the three variables plotted above, Est. TAR-AI (tax-at-risk \u2013 audit issue) has low ACF,<\/p>\n<p>TEBA has moderately high ACF, and Total [Avg. Case] Hours has very high ACF. To wit: at<\/p>\n<p>lag t=5, TEBA reaches the zero line; but Total Hours is still at ACF=0.45.<\/p>\n<p>By stark contrast, in 2020 (below), the ACF for both Avg. TEBA and Case Hours is very<\/p>\n<p>weak overall. In fact, both drop precipitously at the very outset of 2020, just prior to<\/p>\n<p>COVID-19.<\/p>\n<p>Figure 26. ACF Plot, same macro-variables, for 2020<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>17<\/p>\n<p>CCA \u2013 CROSS-CORRELATION ANALYSIS<\/p>\n<p>When we explore lagged effects between risk-related variables \u2013 in this case, TAR (tax-at-<\/p>\n<p>risk) and TEBA (tax earned by audit) \u2013 we would use a CCA plot. We are also considering<\/p>\n<p>Total Hours (on audit cases) here. The plots below are at t=3 months and t=12 months<\/p>\n<p>out, with the influencing variables on the vertical axis, and the influenced variables on the<\/p>\n<p>X-axis. The color shading is somewhat counterintuitive, whereby red means more positively<\/p>\n<p>cross-correlated, and blue means less so. Again, we set a baseline of expectations using<\/p>\n<p>tax data from 2016: 2024 &#8211; Do my homework &#8211; Help write my assignment online to 2019: 2024 &#8211; Online Assignment Homework Writing Help Service By Expert Research Writers (48 months) here.<\/p>\n<p>Figure 27. CCA Map, at time lags 3 and 12, key macro-variables<\/p>\n<p>Note the pronounced difference in CCA factor: for time lag 3, the Estimated TAR has<\/p>\n<p>virtually no effect on TEBA or Total Hours per case (because it\u2019s too close time-wise), but 12<\/p>\n<p>months out (at right) it has a very pronounced effect on total case hours, and a moderate<\/p>\n<p>effect on TEBA (~22%). Also, in the first graph for time lag 3, TEBA highly influences Total<\/p>\n<p>Hours and to a noticeable degree vice-versa too. But when we get to 12 months out, Total<\/p>\n<p>Hours has virtually no lagged effect on TEBA, and vice-versa.<\/p>\n<p>If we repeat the experiment from 2018: 2024 &#8211; Write My Essay For Me | Essay Writing Service For Your Papers Online data up to 2020 (COVID window) data, evaluating<\/p>\n<p>lagged effects of TAR on TEBA for 2020, we find a very different pattern at t=3 and t=12.<\/p>\n<p>For time lag=3, the best we get is ~3% influence; for t=12, it\u2019s absolutely nothing.<\/p>\n<p>Figure 28. CCA Map, at time lags 3 and 12, inclusive of COVID-19 period<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>18<\/p>\n<p>SUBSETTED ANALYSIS<\/p>\n<p>INDUSTRY PROFILING ANALYSIS<\/p>\n<p>Using the same data for CCA, we can subdivide our dataset by industry sector, or NAICS<\/p>\n<p>code. I can set this input to \u201cCross ID\u201d in the data source\u2019s variables list, then re-run the<\/p>\n<p>flow. From the TS Data Prep node\u2019s Results, right-click in the Time Series Plot and select<\/p>\n<p>Data Options. We\u2019ll pick a NAICS code at random. And you can see that it fell at the outset<\/p>\n<p>of COVID, and struggled to regain its footing \u2013 yet exceeding it by calendar year-end.<\/p>\n<p>Figure 29. Industry Profile (NAICS) subsetting of Avg. TEBA in TS Plot (in 2020)<\/p>\n<p>Note that when you have over 100 categorical values \u2013 as in the case of NAICS industry<\/p>\n<p>codes here \u2013 it will only allow you to select from the first 100. In my opinion and<\/p>\n<p>experience, I prefer SAS VIYA when it comes to subsetting TSA by key categories.<\/p>\n<p>BY TSO (TAX SERVICES OFFICE)<\/p>\n<p>So let us examine a subsetting TSA for an under-100 categorical set. I use the TSO, or Tax<\/p>\n<p>Service Office parameter, so again I set the Case_TSO_ID input to \u201cCross ID\u201d at the data<\/p>\n<p>source node. Then I re-run the flow and access the Results.<\/p>\n<p>Figure 30. Tax Services Office (TSO) subsetting of Avg. TEBA in TS Plot (in 2020)<\/p>\n<p>By default, this will display all TSO IDs in the Input TS Plot; so I have to right-click the plot<\/p>\n<p>area and select \u201cData Options\u201d to specify filters (WHERE TSO = 5, 18, or 40). Note that<\/p>\n<p>while all of these TSOs converge at various points, in the month of April we find a very<\/p>\n<p>strange anomaly: TSO 18 has AVG. TEBA =~ $600K, but the other two TSOs have TEBA<\/p>\n<p>just under $10,000. Yet all three of them re-converge later in 2020.<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>19<\/p>\n<p>CONCLUSION<\/p>\n<p>We have seen the power and versatility of SAS\u00ae Enterprise Miner\u2122 for conducting TSAF<\/p>\n<p>exercises. It is clear that not all macro-variables in the Canada Revenue Agency exhibit the<\/p>\n<p>same behaviors or resilience at various points in the turbulent COVID-19 period, but a good<\/p>\n<p>deal of this can be attributed to whether they were pure sum variables, or derived ratio-like<\/p>\n<p>variables. Some disruptions \u2013 prompting the insertion of intervention effects \u2013 were<\/p>\n<p>ostensibly due to policies in place to \u201ctake the edge off\u201d more vulnerable business.<\/p>\n<p>Many of us can also take away abstract learnings from this paper, even if such individuals<\/p>\n<p>are not employed in the tax sector \u2013 because in the end, it is all about maintaining a certain<\/p>\n<p>buoyancy of the macro-variables that matter most, to the extent possible \u2013 these are not<\/p>\n<p>easy times to navigate and we wish those adversely impacted the most clement journey to<\/p>\n<p>a regained prosperity.<\/p>\n<p>REFERENCES<\/p>\n<p>Sarma, Kattamuri S., PhD. Copyright \u00a9 2017. Predictive Modeling with SAS\u00ae Enterprise<\/p>\n<p>Miner\u2122: Practical Solutions for Business Applications, Third Edition. Cary, NC, USA: SAS<\/p>\n<p>Institute, Inc.<\/p>\n<p>ACKNOWLEDGMENTS<\/p>\n<p>I am grateful to my family for their encouragement on this endeavor. I am also grateful to<\/p>\n<p>the numerous staff of the CRA who were the audience in my internal presentation of this<\/p>\n<p>TSAF subject matter. I also acknowledge and admit defeat to the spell checker in insisting<\/p>\n<p>on the spelling of \u201cendeavor\u201d as it is, not like it ought to be as it is on the space shuttle.<\/p>\n<p>Which, unlike CRA time series, must be expected to follow a known trajectory.<\/p>\n<p>RECOMMENDED READING<\/p>\n<p>\uf0b7 Milh\u00f8j, Anders. Practical Time Series Analysis Using SAS\u00ae. Copyright \u00a9 2013, SAS<\/p>\n<p>Institute Inc., Cary, NC, USA.<\/p>\n<p>\uf0b7 Shumway, Robert H. and Stoffer, David S. Time Series Analysis and its Applications. 4th<\/p>\n<p>ed. \u00a9 Springer International Publishing AG, 2017, Univ. of California at Davis. Davis,<\/p>\n<p>CA, USA.<\/p>\n<p>\uf0b7 Brocklebank, John C., Dickey, David A, and Choi, Bong S. SAS\u00ae for Forecasting Time<\/p>\n<p>Series. 3rd ed. Copyright \u00a9 2018: 2024 &#8211; Write My Essay For Me | Essay Writing Service For Your Papers Online, SAS Institute Inc., Cary, NC, USA.<\/p>\n<p>\uf0b7 Svolba, Gerhard. Applying Data Science: Business Case Studies Using SAS\u00ae. Copyright<\/p>\n<p>\u00a9 2017, SAS Institute Inc., Cary, NC, USA.<\/p>\n<p>CONTACT INFORMATION<\/p>\n<p>Your comments and questions are valued and encouraged. Contact the author at:<\/p>\n<p>Jason A. Oliver, Senior Compliance Analyst &amp; Data Scientist<\/p>\n<p>Canada Revenue Agency<\/p>\n<p>Jason.oliver@cra-arc.gc.ca<\/p>\n<p>mailto:Jason.oliver@cra-arc.gc.ca<br \/>\nUNCLASSIFIED<\/p>\n<p>20<\/p>\n<p>APPENDIX: TABLES OF ACTUAL-TO-FORECAST ANALYSIS<\/p>\n<p>This contains detailed breakdowns of the incremental monthly additions of accumulated<\/p>\n<p>data to the COVID-19 observation window.<\/p>\n<p>AVERAGE TEBA<\/p>\n<p>This begins with Average TEBA, being subject to both MSE and RMSE (Mean Squared<\/p>\n<p>Error, and Root Mean Squared Error).<\/p>\n<p>At this juncture, between April and May 2020 known data, the MSE \/ RMSE actually<\/p>\n<p>regresses slightly, telling us that we might as well have gone straight to June 2020\u2019s data.<\/p>\n<p>In the end, this substantiates our earlier findings, that because Average TEBA is in essence<\/p>\n<p>a ratio variable and more resilient to initial COVID window \u2013 especially since it is predicated<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>21<\/p>\n<p>on audits of past year\u2019s tax filings \u2013 there was no real near-future benefit to forecast<\/p>\n<p>alignment based on incremental monthly additions for spring.<\/p>\n<p>C\/AR RATIO<\/p>\n<p>This, once again, is the Cases [Completed] to Action Requests [Submitted] ratio. Here I<\/p>\n<p>break down the monthly forecast measure, using MSE (no RMSE), of the last six months of<\/p>\n<p>calendar year 2020 and incrementing known months from March up to June. For March to<\/p>\n<p>May, I include the spring months not yet arrived at in each incremental forecast.<\/p>\n<p>UNCLASSIFIED<\/p>\n<p>22<\/p>\n<p>From adding April known data, the forecast actually worsens; this is arguably due to having<\/p>\n<p>been accustomed to high C\/AR values for so long. It is not until we add MAY that it becomes<\/p>\n<p>more realistic.<\/p>\n<p>Given this extremely low MSE value, brought on by the actual 2.57 C\/AR value of May, we<\/p>\n<p>have reached the optimum point \u2013 as evidenced by adding June to known values:<\/p>\n<p>CASE HOURS<\/p>\n<p>Lastly, in speaking to Hours per [audit] case forecast, I provide a condensed analysis using<\/p>\n<p>a simplified MAE [Mean Absolute Error] criterion.<\/p>\n<p>\uf0b7 As of March 2020; forecast of April to Dec. 2020: MAE = 78.52<\/p>\n<p>\uf0b7 As of April 2020; forecast of May to Dec. 2020: MAE = 95.83<\/p>\n<p>\uf0b7 As of May 2020; forecast of June to Dec. 2020: MAE = 107.99<\/p>\n<p>\uf0b7 As of June 2020; forecast of July to Dec. 2020: MAE = 71.51<\/p>\n<p>So, all in all, this proved a very difficult variable to effectively forecast.<\/p>\n<p>Applied Sciences<br \/>\nArchitecture and Design<br \/>\nBiology<br \/>\nBusiness &amp; Finance<br \/>\nChemistry<br \/>\nComputer Science<br \/>\nGeography<br \/>\nGeology<br \/>\nEducation<br \/>\nEngineering<br \/>\nEnglish<br \/>\nEnvironmental science<br \/>\nSpanish<br \/>\nGovernment<br \/>\nHistory<br \/>\nHuman Resource Management<br \/>\nInformation Systems<br \/>\nLaw<br \/>\nLiterature<br \/>\nMathematics<br \/>\nNursing<br \/>\nPhysics<br \/>\nPolitical Science<br \/>\nPsychology<br \/>\nReading<br \/>\nScience<br \/>\nSocial Science<br \/>\nHome<br \/>\nHomework Answers<br \/>\nBlog<br \/>\nArchive<br \/>\nTags<br \/>\nReviews<br \/>\nContact<br \/>\ntwitterfacebook<\/p>\n","protected":false},"excerpt":{"rendered":"<p>UNCLASSIFIED 1 Paper 1047-2021 SAS\u00ae Time Series Analysis &amp; Forecasting (TSAF) at the Canada Revenue Agency (CRA), with COVID impacts Jason A. Oliver, Senior Compliance Analyst, Canada Revenue&hellip;<\/p>\n","protected":false},"author":3,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1348,1345,92,2503,2494,1347],"tags":[2453,2489,2487,2368,2507,2370,1462,2488,2506,2490,2504,2505],"class_list":["post-8924","post","type-post","status-publish","format-standard","hentry","category-best-nursing-assignment-writing-service","category-cheap-online-nursing-essays-writing","category-nursing","category-nursing-essay-writers","category-online-nursing-essay-writing-service","category-write-my-nursing-paper-online","tag-bsn-papers","tag-dnp-assignment","tag-health-care-essays","tag-masters-essays","tag-nurs-essays","tag-nursing-assessment","tag-nursing-assignment","tag-nursing-homework","tag-online-nursing-essays","tag-online-nursing-papers","tag-professional-nursing-essay-writing","tag-top-nursing-papers"],"_links":{"self":[{"href":"https:\/\/www.homeworkacetutors.com\/nursing\/wp-json\/wp\/v2\/posts\/8924","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.homeworkacetutors.com\/nursing\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.homeworkacetutors.com\/nursing\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.homeworkacetutors.com\/nursing\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/www.homeworkacetutors.com\/nursing\/wp-json\/wp\/v2\/comments?post=8924"}],"version-history":[{"count":0,"href":"https:\/\/www.homeworkacetutors.com\/nursing\/wp-json\/wp\/v2\/posts\/8924\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.homeworkacetutors.com\/nursing\/wp-json\/wp\/v2\/media?parent=8924"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.homeworkacetutors.com\/nursing\/wp-json\/wp\/v2\/categories?post=8924"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.homeworkacetutors.com\/nursing\/wp-json\/wp\/v2\/tags?post=8924"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}