Let be a random vector in . Two realizations and of form a concordant pair if and have the same sign. What’s the probability of sampling a concordant pair when is bivariate normal?
For example, suppose and have zero means, unit variances, and a correlation of . The scatter plots below show 100 realizations of when . These realizations contain pairs, of which 36% are concordant when . This percentage rises to 48% when and to 71% when . Increasing makes concordance more likely because it makes larger and less noisy.
Different samples give different concordance rates due to sampling variation. We can remove this variation by deriving the concordance rate analytically. To begin, suppose has mean and covariance matrix Then is normal with mean and variance So for any two realizations and of we can write with . Now is normal, and so is standard normal and exceeds zero if and only if . Letting and be the density functions for and then gives where is the standard normal CDF, where uses the change of variables and where uses the symmetry of about . But is symmetric about , which implies and therefore The concordance rate depends on the correlation of and , but not their means or variances. It has value when because is constant. Intuitively, if and are uncorrelated then we can’t use to predict , which is equally likely to be positive or negative. Whereas if then predicts perfectly, and so and The chart below verifies that the concordance rate grows with . It also shows that Thus, for example, we have and . These values remove the sampling error from the estimates 0.36 and 0.71 obtained using the 100 realizations above.