ABSTRACT Financial reports, including 10-K and 10-Q filings, are a primary source of textual data in business disciplines. However, extracting specific sections from these lengthy documents remains a challenge. Custom code development by each research team to parse these files leads to redundancy, inefficiency, and inconsistencies and is especially challenging for teams lacking technical expertise. We address this by offering raw textual data from MD C88; M4; M48.
Codesso et al. (Sun,) studied this question.