Incentive-Aware Learning in Stateful Environments: Internalizing Externalities in Principal-Agent MDPs and Welfare-Maximizing Diffusion Mechanisms | Synapse