April 30, 1988

Liquidity and execution costs in equity markets

Key Points

Key points are not available for this paper at this time.

Abstract

Abstract The BioCreative VII Track 5 calls for participants to tackle the multi-label classification task for automated topic annotation of COVID-19 literature. In our participation, we evaluated several deep learning models built on PubMedBERT, a pre-trained language model, with different strategies addressing the challenges of the task. Specifically, multi-instance learning was used to deal with the large variation in the lengths of the articles, and focal loss function was used to address the imbalance in the distribution of different topics. We found that the ensemble model performed the best among all the models we have tested. Test results of our submissions showed that our approach was able to achieve satisfactory performance with an F1 score of 0.9247, which is significantly better than the baseline model (F1 score: 0.8678) and the mean of all the submissions (F1 score: 0.8931).

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Joel Hasbrouck

University Foundation

Robert A. Schwartz

Baruch College

Journals

The Journal of Portfolio Management

Actions

Institutions

New York University

Samarkand Institute of Economics and Service

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Liquidity and execution costs in equity markets

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study