This article introduces COFINFAD (Colombian Fintech Financial Analytics Dataset), a single-company 12-month dataset containing comprehensive behavioral information from 48,723 customers (representing the complete active customer base) of a Colombian fintech company. The dataset spans January 4 to December 29, 2023, capturing 3,159,157 transactions alongside demographic profiles, product usage patterns, customer satisfaction metrics, and digital app engagement behaviors. Data collection involved API integration with the company’s CRM system, transaction databases, and customer surveys (6,965 respondents, 14.3% response rate), while prioritizing customer privacy through multi-layered anonymization (pseudonymization, generalization, k-anonymity verification). The dataset comprises 57 variables including transaction frequency and value, product portfolio composition (savings, credit cards, loans, investments, insurance), customer satisfaction scores, Net Promoter Score (NPS), app login frequency, feature usage patterns, and support interaction metrics—all measured in Colombian Pesos (COP). COFINFAD is distinguished by its integrated multi-dimensional design, combining transactional, demographic, satisfaction, and digital engagement data unavailable in existing publicly available fintech datasets. The dataset is openly accessible via Mendeley Data (DOI: 10.17632/mhb4zn3258.1) under CC BY 4.0 license, adhering to FAIR (Findable, Accessible, Interoperable, Reusable) data principles.
Guerrero et al. (Tue,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: