AMS: Detecting Unsafe and Tampered Language Models via Activation Analysis | Synapse