Maximum-Entropy Regularized Decision Transformer with Reward Relabelling for Dynamic Recommendation | Synapse