methods
aysra is a reconstruction system, not a directory.
three layers of processing
aysra takes filed IRS data and turns it into structured, comparable records. nothing is silently transformed. values are as filed or clearly marked as interpretive.
aysra operates in three distinct layers. each layer has a different level of certainty and is treated differently in the product.
structured
this layer reflects what can be read and organized directly from filings.
- filings are parsed into structured tables
- fields are standardized across forms and years
- grant records are normalized into comparable entries
these outputs are drawn directly from filings or computed from them in a defined way.
linked
this layer connects records that refer to the same entity.
- recipients are matched across filings
- filers and related entities are linked where identity is clear
- EIN relationships are resolved using consistent rules
not all matches are equally certain. aysra uses matching tiers and only promotes links when the evidence is sufficient. uncertain matches are not treated as resolved.
interpretive
this layer describes patterns across structured data.
- payout behavior
- grant concentration
- renewal and first-time giving
- narrative claims about funding behavior
these outputs summarize what the data shows. they do not introduce new facts.
interpretive outputs are marked with a kento symbol.
provenance
aysra makes a clear distinction between what is filed and what is interpreted.
- hanko marks values that come directly from filings or from defined calculations on those filings
- kento marks values that describe patterns or synthesize across multiple fields
if a value is not marked as interpretive, it can be traced back to a filing.
null is not zero
absence is treated as information.
- a blank field is not assumed to be zero
- a missing filing is not assumed to represent inactivity
- a behavior that does not appear in filings is not inferred
if a pattern cannot be observed, aysra will say so.
what aysra does not do
- it does not guess missing values
- it does not smooth across missing years
- it does not infer intent from mission language alone
- it does not present interpretation as fact
known limitations
aysra is built from public IRS filings. that makes the system traceable, but it also means the system inherits the limits of the filings themselves.
recipient names on grants are free text. the IRS does not require private foundations to report a recipient's EIN, so the same organization can appear under multiple name variants, such as "Harvard University," "Harvard Univ.," or "President and Fellows of Harvard College." aysra runs name normalization and uses a curated EIN override table to consolidate variants where the evidence is strong, but recipient matching is best-effort, not perfect.
tax-year lag is real. most organizations file months after their fiscal year closes, so the most recent tax year visible for any given filer usually trails the calendar year.
Schedule B donor names are redacted by the IRS for most filers. aysra does not have them either.
990-N postcard filers are not in the corpus. organizations with less than $50,000 in annual gross receipts generally file a short electronic postcard instead of a full Form 990, and the IRS does not release structured XML for those postcard filings.
boundary of inference
aysra's core principle is simple.
behavior is observed, not assumed.
mission language and narrative text provide context. filings determine the underlying signal.
if a statement cannot be tied to structured evidence, it is not presented as fact.
for how aysra uses AI in its interpretive layer, see the AI use policy.
AI use policy →