What question did this study set out to answer?

The research aims to enhance effort estimation and risk detection in software projects using machine learning techniques.

March 7, 2026Open Access

Machine Learning-Based Effort Prediction and Early Risk Detection in Software Development Projects: A Case Study

Puntos clave

The research aims to enhance effort estimation and risk detection in software projects using machine learning techniques.
Conducted a case study on 500 software development tasks with various features.
Utilized ensemble-based regression models, including Gradient Boosting and Random Forest.
Assessed model performance with Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), and R².
Transformed prediction errors into deviation-based indicators for risk detection.
Employed threshold-based classifiers to identify tasks with significant schedule overruns.
Achieved effective task duration predictions with low MAE and RMSE values.
Successfully identified moderate and severe schedule overruns using confusion matrices.
Provided insights on the distribution of high-risk tasks to aid managerial decisions.

Resumen

Accurate effort estimation and early risk detection are critical for the success of software projects, as inaccurate forecasts can lead to schedule overruns, inefficient resource allocation, and unmet requirements. This study investigates the use of machine learning techniques to support task-level effort prediction and proactive risk identification in software project management. An applied case study was conducted on a simulated dataset of 500 software development tasks, described by planning, technical, and team-related features. Two ensemble-based regression models, Gradient Boosting and Random Forest, are evaluated for predicting actual task duration. Model performance is assessed using standard metrics, including Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), and the coefficient of determination (R²). To enable early risk detection, prediction errors are transformed into deviation-based indicators, and threshold-based classifiers are employed to identify tasks with moderate (>20%) and severe (>30%) schedule overruns. Confusion matrices and classification metrics are used to evaluate the effectiveness of the proposed alerting mechanism, and the distribution of high-risk tasks across sprint quantiles is analyzed to support managerial decision-making.

Me gusta

Guardar

Ver artículo completo