声明:本文为本人毕业研究报告《The Exploration of Pairwise Comparison in Football Application》中的部分内容摘录与整理,仅用于学习与交流。
Introduction
Roger R. Davidson [3] introduced a scaling constant , and based on Luce’s Choice Axiom [4] and the geometric mean, Davidson established a new model which unifies the model’s denominator. Although both the Rao-Kupper model [1] and Davidson model are extensions of the Bradley-Terry model [2], there are significant differences.
In the search for Maximum Likelihood Estimators (MLEs), the existence and uniqueness of solutions were demonstrated based on Ford’s Assumption [5].
Luce’s Choice Axiom
Luce’s Choice Axiom
Given a finite set of treatments with associated worth for each treatment and , Luce’s Choice Axiom can be concisely formalized as:
where is the probability of choosing treatment from the pair , with or , under the condition that and .
Non-empty Subset Preference
Non-empty Subset Preference (Assumption)
The model assumes that in every division of treatments into two groups, at least one treatment in one group is preferred over another in the opposite group at least once. It is critical for ensuring that the likelihood function behaves properly and that a global maximum exists.
Model Formulation 4
In the context of pairwise comparison among treatments, where each treatment ’s inherent preference (or stimulus strength), denoted by , satisfies the normalisation with for all .
For scenarios without a clear preference, , is calculated as proportional to the geometric mean of individual preferences:
where acts as a scaling constant. Based on Lemma equation (above), the adapted model is:
ensuring the total probability constraint:
is the probability that is preferred over . Similarly, is the probability that is preferred over . represents the probability of a tie.
It is also important to note that the Bradley-Terry model forms a special case of both the Rao-Kupper model when the threshold parameter , and of the Davidson model when the scaling constant .
The Log-likelihood function:
where
is the total number of wins and ties for treatment .
and is the number of times treatment preferred over , and vice versa. .
is the number of times neither treatment is preferred, .
represents the total number of ties across all treatment comparisons.
is a matrix of wins .
is the number of independent responses for the comparison of treatments and , .
.
Each treatment is paired with others, and responses are independently recorded for each pairwise comparison. The total number of such comparisons is calculated as .
The maximum likelihood estimates (MLE) for the parameters is obtained by solving:
with the functions:
where
.
.
The Existence and Uniqueness of Solutions:
Following Ford’s [5] Assumption of Non-empty Subset Preference for the Bradley-Terry model, the maximisation of over the region is analyzed under a restriction on the matrix . This setup requires and sets on the boundary, allowing a uniformly continuous extension to the same region, which establishes the existence and uniqueness of the maximum.
For , explicit solutions are given by and .
For , iterative methods are needed.
Example 4
Using the model in a football tournament scenario among three teams: Team A, Team B, and Team C. Here’s how to calculate probabilities for one of the matches, Team A vs. Team B, to demonstrate the model’s application. Team strengths of A, B and C are . The scaling constant . According to the equation (above), calculated probabilities:
Team A winning:
Team B winning:
Draw:
Model Derivation 4
The geometric mean is defined as:
Thus, when , the geometric mean would be . Then we can easily have the tie equation, where is a scaling constant.
From the Lemma of Luce’s Choice Axiom, express in terms of :
The probability of a tie, , is defined as:
Given the total probability equation:
Finally, solving for :
Similarly, it is easy to obtain and .
Now, to find out the maximum likelihood estimates. The likelihood for all the observed outcomes is the product:
The log-likelihood is:
since .
By using the properties of logarithms and :
Substitute this back into the :
The contributions to each from all pairings in which is involved, either as the preferred or as the compared treatment, across all :
Similarly, it is easy find for . Thus,
Since, and .
To find the MLE, we take the partial derivatives of with respect to each parameter and , and set them to zero.
For example, under the constraint , set : we have only one pair (i.e., becomes ), simplifying the equation to:
Given that , we can substitute . Therefore, the log-likelihood becomes:
Derivative with respect to :
Derivative with respect to :
Then solving for :
where , and under the assumption that . Therefore, the MLEs are:
Conclusion
To sum up, In the Rao-Kupper model [1], the probabilities and each have denominators influenced by the opposing stimuli strengths and . On the other hand, the Davidson model [3] standardizes the denominators of and by combining the geometric mean of the two stimulus strengths . This results in the same denominator in the Davidson model, thereby simplifying its development and analysis.
However, the previous models still have shortcomings when considering real-life application scenarios, such as quantifying the defensive and offensive abilities. Maher model [6] is based on the Poisson distribution, resolves these issues and calculates the expected number of goals scored by each team.
References
[1] P. V. Rao and L. L. Kupper. Ties in paired-comparison experiments: A generalization of the Bradley-Terry model. Journal of the American Statistical Association, 62(317):194–204, Mar 1967. Available
[2] Ralph Allan Bradley. Some statistical methods in taste testing and quality evaluation. Biometrics, 9(1):22–38, 1953.
[3] Roger R. Davidson. On extending the Bradley-Terry model to accommodate ties in paired comparison experiments. Journal of the American Statistical Association, 65(329):317–328, 1970.
[5] Jr. Ford, L. R. Solution of a ranking problem from binary comparisons. The American Mathematical Monthly, 64(8):28–33, 1957. Part 2: To Lester R. Ford on His Seventieth Birthday.
[6] M. J. Maher. Modelling association football scores. Statistica Neerlandica, 36:109–118, 1982.