Nonparametric Tests for Conditional Independence
(Dissertation Summary)
Liangjun Su



As David (1979) puts it, independence and conditional independence form the basis of probability theory and are equally fundamental in the theory of statistical inference. Applications of conditional independence are ubiquitous in econometrics. In a seminal paper, Granger (1980) introduces the concept of Granger non-causality at the distribution level, which is a particular case of conditional independence. Angrist (1997) shows that the problem of sample selection bias can be cast in terms of conditional independence. Further, conditional independence is often used as an identification condition in econometrics (e.g., Imbens and Newey, 2001; Lechner and Miquel, 2002).

Despite its wide use, few tests have been developed to test for conditional independence. Two exceptions are Linton and Gozalo (1997) who propose two nonparametric tests for conditional independence based on empirical distribution functions and Fernandes and Flores (1999) who employ a generalized entropy measure to test for conditional independence. Nevertheless, Monte Carlo simulations suggest that these tests have poor power against conditional dependence in a variety of data generating processes. My dissertation develops some more powerful nonparametric tests for the null hypothesis of conditional independence. Hence, I resolve a long-standing challenge in econometrics by providing asymptotic theory for powerful nonparametric tests of conditional independence.

In "A Nonparametric Hellinger Metric Test for Conditional Independence'' I propose a nonparametric test of conditional independence based on the weighted Hellinger distance between f(y|x,z) and f(y|x), where f(y|x,z) (resp. f(y|x)) is the conditional density of random variable Y given random variable (X,Z) (resp. X). Under the null hypothesis (Z does not carry extra information about Y once X is known), the distance is identically zero whereas under the alternative it is nonzero. The unnormalized test statistic takes values between 0 and 1 and serves as a first measure of the strength of conditional dependence. The normalized test statistic is asymptotically normal under regularity conditions that are standard in the literature. Due to the "curse of dimensionality'', however, the test is less satisfactory when the dimension of (X,Y,Z) is large.

To lessen the adverse effect of the dimension of (X,Y,Z) on the power of the test, I next look at an equivalent characterization of conditional distributions: conditional characteristic functions. In "Consistent Characteristic-Function-Based Test for Conditional Independence'' I explore tests based on the notion that two conditional distributions are equal if and only if their corresponding conditional characteristic functions are equal and propose a new test for conditional independence. The test is less severely subject to the adverse effect of the dimension of (X,Y,Z)  on the power than the Hellinger metric test (because the dimension of Y does not affect the convergence rate of the new test statistic), and at the same time it maintains the good properties of the Hellinger metric test: (1) the normalized test statistic is asymptotically normal under some regularity conditions; and (2) the test has power against deviations from the null. Simulation results suggest that this new test complements the Hellinger metric test when the dimension of  (X,Y,Z) is small, and it is also powerful when the dimension of (X,Y,Z) is relatively large.

"Testing Conditional Independence via Empirical Likelihood'' [Job Market Paper] I examine a class of empirical-likelihood-based tests for the null of conditional independence. Motivated by the optimality of the parametric likelihood ratio test, I look at its nonparametric analogue: the empirical likelihood test. I extend the applicability of empirical likelihood from testing a finite number of moment or conditional moment restrictions (e.g., Tripathi and Kitamura, 2002) to testing an infinite collection of conditional moment restrictions. Writing the null hypothesis in terms of conditional-distribution-based moment restrictions and employing the idea of ``smoothed'' empirical likelihood, I construct an intuitively appealing test statistic and show that it is asymptotically normal under the null. I also derive its asymptotic distribution under a sequence of local alternatives. Although this test statistic has intuitive appeal, it delivers poor power in small samples because of the discrete nature of the indicator functions used in forming the sample analogue of the moment restrictions. Thus I build on Su and White (2003) and use a class of smoother moment conditions to construct a new empirical-likelihood-based test. Then I show that in large samples both tests are weakly optimal in that they attain maximum average local power with respect to different spaces of functions for the local alternatives. Simulations suggest that the smoother-moment-conditions-based test outperforms all previous tests in small samples. I apply this latter test to a number of economic and financial time series and find that the test reveals some interesting nonlinear Granger causal relations that the traditional linear Granger causality test fails to detect. Further, the approach outlined in the paper easily extends to testing many other distribution-based hypotheses such as conditional goodness-of-fit, conditional homogeneity and conditional symmetry.