You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@nikhilwoodruff, let's make a list of additional items to include in soi_from_puf_tmd_2021.csv (or the underlying source), so that you can add them all at once. @martinholmer you may have thoughts about things I didn't think to mention below.
The primary purpose of this is data-quality diagnosis, but it also could affect future targeting.
The basic idea is to capture concepts that we think are important for (1) tax revenue analysis, or (2) distributional analysis. How important something is depends on extent to which it is data that would play a significant role in (1) an important policy or political issue likely to arise over the next year or two, (b) an issue that is important to project sponsors, or (c) analysis that we think will be important.
We need to make a prioritized list because we don't want this work to crowd out other more-important work. I've listed items below in roughly my sense of priority order. We can defer items we don't have time for but many should be relatively easy because the mapping to IRS is straightforward.
The broad categories of items that I do not see at present - and I could be wrong so please consider this subject to discussion - are:
Filers - we want to have a filers subset of the data for 2021, because that allows us to compare our data to many IRS-published aggregates
Total # of returns with adjusted gross income; I see total # of nonzero returns for many individual income components, but not for AGI as a whole
Payroll tax
Major itemized deduction components. See the crosswalk doc. I see that you have SALT and medical-uncapped.
I think SALT may need some refinement. Key variables are e18400 and e18500.
Interest paid e19200 is large and would be useful.
Cash contribuions e19800.
Qualified business income deduction (we hit the total, of course, but we do want to see it by AGI range, eventually crossed with filing status)
We're going to want to delve into one measure of income tax liability where we are sure we can match the IRS concept with PUF concepts so that we have a definite apples-to-apples comparison. I think that is probably the IRS total income tax concept but another might do if we are 100% sure we can match PUF-IRS concepts perfectly.
We will certainly want to examine universe totals (i.e., including non-filers), as we discussed on the phone @nikhilwoodruff, but let's make that a separate issue because it requires careful examination of the appropriate control totals, how they relate to relevant tmd variables and because it is of slightly lower priority.
This discussion was converted from issue #108 on September 21, 2024 13:38.
Heading
Bold
Italic
Quote
Code
Link
Numbered list
Unordered list
Task list
Attach files
Mention
Reference
Menu
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
@nikhilwoodruff, let's make a list of additional items to include in soi_from_puf_tmd_2021.csv (or the underlying source), so that you can add them all at once. @martinholmer you may have thoughts about things I didn't think to mention below.
The primary purpose of this is data-quality diagnosis, but it also could affect future targeting.
The basic idea is to capture concepts that we think are important for (1) tax revenue analysis, or (2) distributional analysis. How important something is depends on extent to which it is data that would play a significant role in (1) an important policy or political issue likely to arise over the next year or two, (b) an issue that is important to project sponsors, or (c) analysis that we think will be important.
We need to make a prioritized list because we don't want this work to crowd out other more-important work. I've listed items below in roughly my sense of priority order. We can defer items we don't have time for but many should be relatively easy because the mapping to IRS is straightforward.
The broad categories of items that I do not see at present - and I could be wrong so please consider this subject to discussion - are:
We will certainly want to examine universe totals (i.e., including non-filers), as we discussed on the phone @nikhilwoodruff, but let's make that a separate issue because it requires careful examination of the appropriate control totals, how they relate to relevant tmd variables and because it is of slightly lower priority.
Beta Was this translation helpful? Give feedback.
All reactions