|
95% of gaelic football results since 2010 can be explained by a mix of population, sport preferences, county financial resources, historical success, proximity to cities with significant career/study opportunities. A model created from these factors predicts win/loss results since 2010 with 95% accuracy when weighting games for relative importance. Worth noting that this year's championship has been only predictable to 70% accuracy mostly down to rule changes and Louth over-performing with ceiling still unknown :)
When ranking counties on how they've over-performed against this model since 2010, Monaghan is the top performing county in the country (i.e., wins more than it should), followed by Mayo, and then Kerry. Meath and Kildare have underperformed the most vs expectations which can be mostly attributed to the model not being perfectly fine-tuned for the negative impact that comes with proximity to Dublin (careers, commutes, study, other sports).
Here's a breakdown of the relative importance.
1. Code Selection & Religious Demographics Relative Contribution: 45% Context: This tier represents the primary filter on a county's gross population. The model applies a strict deduction based on competing athletic pathways. In the Republic, this tracks the dual-sport hurling drain (e.g., Cork, Galway, splitting talent down the middle) and urban soccer academies. In Northern Ireland, it applies a community background weight to reflect the available GAA player base. This explains why high-population counties like Kilkenny, Antrim, and Waterford sit near the bottom of the football standings. It shrinks their massive gross population down to their actual, football-playing talent pool.
2. Gravitational Commuter Drag Relative Contribution: 22% Context: This factor models the lifestyle fragmentation caused by commuting into major economic engines. This factor targets the "Leinster Commuter Trap." It accounts for the structural bottlenecks highlighted by the GAA National Demographics Committee-namely, that exploding suburban populations face a severe facility and time deficit. By penalizing counties within the immediate corporate orbits of Dublin and Cork, the model dramatically lowers the expected baselines for Meath and Kildare, shrinking their negative residuals. It probably doesn't do enough on this front but an r squared of 95% is pretty good.
3. Financial Preparation Capital Relative Contribution: 14% Context: Teams are amateur, but their preparation models are entirely corporate. This factor tracks annual team preparation expenditures relative to the national average. It recognizes that doubling an elite sports budget yields only a limited efficiency boost on the pitch (non-linear gains from investments into training/infrastructure). This prevents high-spending dual counties from breaking the model while properly elevating Dublin's baseline to account for its massive commercial advantage.
4. Student-Athlete Migration & Legacy Coaching Lines Combined Relative Contribution: 14% Context: These variables process population quality and institutional knowledge. This uses travel tracker metrics to penalize counties that must manage "exiled" college student training groups in Dublin or Belfast mid-week.
5. Unexplained Variance / Stochastic Error Relative Contribution: 5.0% Context: The remaining variance represents factors that cannot be captured by socio-economic data: generational talent anomalies (e.g., David Clifford emerging in a specific county), weather conditions, refereeing decisions, or short-term psychological momentum.
level (Louth) - Posts: 107 - 01/07/2026 22:31:30
2683548
Link
0
|
|
Replying To level: "95% of gaelic football results since 2010 can be explained by a mix of population, sport preferences, county financial resources, historical success, proximity to cities with significant career/study opportunities. A model created from these factors predicts win/loss results since 2010 with 95% accuracy when weighting games for relative importance. Worth noting that this year's championship has been only predictable to 70% accuracy mostly down to rule changes and Louth over-performing with ceiling still unknown :)
When ranking counties on how they've over-performed against this model since 2010, Monaghan is the top performing county in the country (i.e., wins more than it should), followed by Mayo, and then Kerry. Meath and Kildare have underperformed the most vs expectations which can be mostly attributed to the model not being perfectly fine-tuned for the negative impact that comes with proximity to Dublin (careers, commutes, study, other sports).
Here's a breakdown of the relative importance.
1. Code Selection & Religious Demographics Relative Contribution: 45% Context: This tier represents the primary filter on a county's gross population. The model applies a strict deduction based on competing athletic pathways. In the Republic, this tracks the dual-sport hurling drain (e.g., Cork, Galway, splitting talent down the middle) and urban soccer academies. In Northern Ireland, it applies a community background weight to reflect the available GAA player base. This explains why high-population counties like Kilkenny, Antrim, and Waterford sit near the bottom of the football standings. It shrinks their massive gross population down to their actual, football-playing talent pool.
2. Gravitational Commuter Drag Relative Contribution: 22% Context: This factor models the lifestyle fragmentation caused by commuting into major economic engines. This factor targets the "Leinster Commuter Trap." It accounts for the structural bottlenecks highlighted by the GAA National Demographics Committee-namely, that exploding suburban populations face a severe facility and time deficit. By penalizing counties within the immediate corporate orbits of Dublin and Cork, the model dramatically lowers the expected baselines for Meath and Kildare, shrinking their negative residuals. It probably doesn't do enough on this front but an r squared of 95% is pretty good.
3. Financial Preparation Capital Relative Contribution: 14% Context: Teams are amateur, but their preparation models are entirely corporate. This factor tracks annual team preparation expenditures relative to the national average. It recognizes that doubling an elite sports budget yields only a limited efficiency boost on the pitch (non-linear gains from investments into training/infrastructure). This prevents high-spending dual counties from breaking the model while properly elevating Dublin's baseline to account for its massive commercial advantage.
4. Student-Athlete Migration & Legacy Coaching Lines Combined Relative Contribution: 14% Context: These variables process population quality and institutional knowledge. This uses travel tracker metrics to penalize counties that must manage "exiled" college student training groups in Dublin or Belfast mid-week.
5. Unexplained Variance / Stochastic Error Relative Contribution: 5.0% Context: The remaining variance represents factors that cannot be captured by socio-economic data: generational talent anomalies (e.g., David Clifford emerging in a specific county), weather conditions, refereeing decisions, or short-term psychological momentum." 95% of gaelic football results since 2010 can be explained by a mix of population, sport preferences, county financial resources, historical success, proximity to cities with significant career/study opportunities.
No they can't.
GreenandRed (Mayo) - Posts: 8647 - 02/07/2026 12:16:34
2683641
Link
0
|
|
Replying To level: "95% of gaelic football results since 2010 can be explained by a mix of population, sport preferences, county financial resources, historical success, proximity to cities with significant career/study opportunities. A model created from these factors predicts win/loss results since 2010 with 95% accuracy when weighting games for relative importance. Worth noting that this year's championship has been only predictable to 70% accuracy mostly down to rule changes and Louth over-performing with ceiling still unknown :)
When ranking counties on how they've over-performed against this model since 2010, Monaghan is the top performing county in the country (i.e., wins more than it should), followed by Mayo, and then Kerry. Meath and Kildare have underperformed the most vs expectations which can be mostly attributed to the model not being perfectly fine-tuned for the negative impact that comes with proximity to Dublin (careers, commutes, study, other sports).
Here's a breakdown of the relative importance.
1. Code Selection & Religious Demographics Relative Contribution: 45% Context: This tier represents the primary filter on a county's gross population. The model applies a strict deduction based on competing athletic pathways. In the Republic, this tracks the dual-sport hurling drain (e.g., Cork, Galway, splitting talent down the middle) and urban soccer academies. In Northern Ireland, it applies a community background weight to reflect the available GAA player base. This explains why high-population counties like Kilkenny, Antrim, and Waterford sit near the bottom of the football standings. It shrinks their massive gross population down to their actual, football-playing talent pool.
2. Gravitational Commuter Drag Relative Contribution: 22% Context: This factor models the lifestyle fragmentation caused by commuting into major economic engines. This factor targets the "Leinster Commuter Trap." It accounts for the structural bottlenecks highlighted by the GAA National Demographics Committee-namely, that exploding suburban populations face a severe facility and time deficit. By penalizing counties within the immediate corporate orbits of Dublin and Cork, the model dramatically lowers the expected baselines for Meath and Kildare, shrinking their negative residuals. It probably doesn't do enough on this front but an r squared of 95% is pretty good.
3. Financial Preparation Capital Relative Contribution: 14% Context: Teams are amateur, but their preparation models are entirely corporate. This factor tracks annual team preparation expenditures relative to the national average. It recognizes that doubling an elite sports budget yields only a limited efficiency boost on the pitch (non-linear gains from investments into training/infrastructure). This prevents high-spending dual counties from breaking the model while properly elevating Dublin's baseline to account for its massive commercial advantage.
4. Student-Athlete Migration & Legacy Coaching Lines Combined Relative Contribution: 14% Context: These variables process population quality and institutional knowledge. This uses travel tracker metrics to penalize counties that must manage "exiled" college student training groups in Dublin or Belfast mid-week.
5. Unexplained Variance / Stochastic Error Relative Contribution: 5.0% Context: The remaining variance represents factors that cannot be captured by socio-economic data: generational talent anomalies (e.g., David Clifford emerging in a specific county), weather conditions, refereeing decisions, or short-term psychological momentum." is there an actual model or is this just a chat gpt speal?
tirawleybaron (Mayo) - Posts: 1966 - 02/07/2026 12:40:52
2683650
Link
0
|