A Fine-grained Data Set and Analysis of Tangling in Bug Fixing Commits
This study examines the prevalence of tangled commits in bug fixes, revealing that 66-87% of changes in production code files actually fix bugs. Using a crowdsourcing approach, we found significant noise in data due to tangling, suggesting that unvalidated data is likely very noisy and can alter research results.