Key points are not available for this paper at this time.
This paper presents a novel RL algorithm, S-REINFORCE, designed by leveraging two types of function approximators, namely Neural Network (NN) and Symbolic Regressor (SR), to produce numerical and symbolic policies for dynamic decision-making tasks, respectively. A symbolic policy uncovers functional relations between the underlying states and action-probabilities. Further, the symbolic policy is utilized through importance sampling (IS) to improve the rewards received during the learning process. The effectiveness of S-REINFORCE has been validated on various dynamic decision-making problems involving low and high dimensional action spaces. The results obtained clearly demonstrate that by leveraging the complementary strengths of NN and SR, S-REINFORCE generates policies that exhibit both good performance and interpretability. This makes S-REINFORCE an excellent choice for real-world applications where transparency and causality play a crucial role.
Building similarity graph...
Analyzing shared references across papers
Loading...
Rajdeep Dutta
Qincheng Wang
Ankur Singh
Nanyang Technological University
Agency for Science, Technology and Research
Institute for Infocomm Research
Building similarity graph...
Analyzing shared references across papers
Loading...
Dutta et al. (Mon,) studied this question.
www.synapsesocial.com/papers/68e7397eb6db6435876b2b3c — DOI: https://doi.org/10.1109/icassp48485.2024.10446037
Synapse has enriched 4 closely related papers on similar clinical questions. Consider them for comparative context: