Control variates

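The control variates method is a variance reduction technique used in Monte Carlo methods. It exploits information about the error in estimating a known quantity to reduce the error in estimating an unknown one.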

Underlying principle

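Suppose the parameter to be estimated is $\mu$ , and that we have a statistic $m$ whose expected value is $\mu$ : $\mathbb {E} \left[m\right]=\mu$ , i.e. $m$ is an unbiased estimator of $\mu$ . Suppose further that we have another statistic $t$ whose expected value $\mathbb {E} \left[t\right]=\tau$ is known. Then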

m^{\star }=m+c\left(t-\tau \right)\,

is also an unbiased estimator of $\mu$ for any fixed coefficient $c$ . The variance of $m^{\star }$ is

{\textrm {Var}}\left(m^{\star }\right)={\textrm {Var}}\left(m\right)+c^{2}\,{\textrm {Var}}\left(t\right)+2c\,{\textrm {Cov}}\left(m,t\right);
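Both claims can be checked in one line each, using only the linearity of expectation and the fact that $\tau$ is a constant (so subtracting it does not change the variance):

{\begin{aligned}\mathbb {E} \left[m^{\star }\right]&=\mathbb {E} \left[m\right]+c\left(\mathbb {E} \left[t\right]-\tau \right)=\mu +c\left(\tau -\tau \right)=\mu ,\\{\textrm {Var}}\left(m^{\star }\right)&={\textrm {Var}}\left(m+ct\right)={\textrm {Var}}\left(m\right)+c^{2}\,{\textrm {Var}}\left(t\right)+2c\,{\textrm {Cov}}\left(m,t\right).\end{aligned}}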

It can be shown that the coefficient $c$ that minimizes this variance is

c^{\star }=-{\frac {{\textrm {Cov}}\left(m,t\right)}{{\textrm {Var}}\left(t\right)}};

With this choice, the variance becomes

{\begin{aligned}{\textrm {Var}}\left(m^{\star }\right)&={\textrm {Var}}\left(m\right)-{\frac {\left[{\textrm {Cov}}\left(m,t\right)\right]^{2}}{{\textrm {Var}}\left(t\right)}}\\&=\left(1-\rho _{m,t}^{2}\right){\textrm {Var}}\left(m\right);\end{aligned}}

where

\rho _{m,t}={\textrm {Corr}}\left(m,t\right)\,

is the correlation coefficient between $m$ and $t$ . The larger $\vert \rho _{m,t}\vert$ is, the greater the variance reduction.
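The optimal coefficient follows from treating the variance of $m^{\star }$ as a quadratic in $c$ . A short sketch of the argument, assuming ${\textrm {Var}}\left(t\right)>0$ so that the quadratic has a unique minimum:

{\begin{aligned}{\frac {\mathrm {d} }{\mathrm {d} c}}{\textrm {Var}}\left(m^{\star }\right)&=2c\,{\textrm {Var}}\left(t\right)+2\,{\textrm {Cov}}\left(m,t\right)=0\quad \Longrightarrow \quad c^{\star }=-{\frac {{\textrm {Cov}}\left(m,t\right)}{{\textrm {Var}}\left(t\right)}},\\{\textrm {Var}}\left(m^{\star }\right){\Big |}_{c=c^{\star }}&={\textrm {Var}}\left(m\right)+{\frac {\left[{\textrm {Cov}}\left(m,t\right)\right]^{2}}{{\textrm {Var}}\left(t\right)}}-2\,{\frac {\left[{\textrm {Cov}}\left(m,t\right)\right]^{2}}{{\textrm {Var}}\left(t\right)}}={\textrm {Var}}\left(m\right)-{\frac {\left[{\textrm {Cov}}\left(m,t\right)\right]^{2}}{{\textrm {Var}}\left(t\right)}}.\end{aligned}}

Since $\rho _{m,t}={\textrm {Cov}}\left(m,t\right)/{\sqrt {{\textrm {Var}}\left(m\right){\textrm {Var}}\left(t\right)}}$ , the subtracted term equals $\rho _{m,t}^{2}\,{\textrm {Var}}\left(m\right)$ , which gives the factor $1-\rho _{m,t}^{2}$ .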

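When ${\textrm {Cov}}\left(m,t\right)$ , ${\textrm {Var}}\left(t\right)$ or $\rho _{m,t}$ is unknown, it can be estimated from the Monte Carlo replications themselves. Because this is equivalent to solving a least-squares problem, the method is also called regression sampling.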
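The following is a minimal Python sketch of this plug-in approach, not a reference implementation. The function name `control_variate_estimate` and its arguments are illustrative assumptions: it takes paired draws of the quantity of interest and of the control variate, estimates $c^{\star }$ from the sample covariance and variance, and returns the adjusted mean.

```python
from statistics import fmean


def control_variate_estimate(m_samples, t_samples, tau):
    """Return (plain_mean, adjusted_mean, c_hat) for paired samples.

    m_samples: draws whose mean estimates the unknown mu
    t_samples: paired draws of the control variate
    tau:       known expectation of the control variate
    (Hypothetical helper written for illustration only.)
    """
    n = len(m_samples)
    if n != len(t_samples) or n < 2:
        raise ValueError("need at least two paired samples")

    m_bar = fmean(m_samples)
    t_bar = fmean(t_samples)

    # Sample covariance and sample variance, both with the n - 1 divisor.
    cov_mt = sum(
        (m - m_bar) * (t - t_bar) for m, t in zip(m_samples, t_samples)
    ) / (n - 1)
    var_t = sum((t - t_bar) ** 2 for t in t_samples) / (n - 1)
    if var_t == 0.0:
        raise ValueError("control variate has zero sample variance")

    # Plug-in estimate of c* = -Cov(m, t) / Var(t).
    c_hat = -cov_mt / var_t
    adjusted = m_bar + c_hat * (t_bar - tau)
    return m_bar, adjusted, c_hat
```

Estimating the coefficient from the same samples that are then adjusted makes the result no longer exactly unbiased, although the effect is usually small for large sample sizes; a separate pilot run for the coefficient is one way to avoid it.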

Example

Suppose we want to use the Monte Carlo method to estimate

I=\int _{0}^{1}{\frac {1}{1+x}}\,\mathrm {d} x,

that is, the expected value of

f(U)={\frac {1}{1+U}}

where $U$ is uniformly distributed on $[0,1]$ . Given $n$ samples $u_{1},\cdots ,u_{n}$ , the estimate is

I\approx {\frac {1}{n}}\sum _{i}f(u_{i});

We now introduce the control variate $g(U)=1+U$ , whose expected value is known: $\mathbb {E} \left[g\left(U\right)\right]=\int _{0}^{1}(1+x)\,\mathrm {d} x={\tfrac {3}{2}}$ . This gives the new estimator

I\approx {\frac {1}{n}}\sum _{i}f(u_{i})+c\left({\frac {1}{n}}\sum _{i}g(u_{i})-3/2\right).
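For this particular pair $f$ , $g$ the ingredients of the optimal coefficient can be worked out in closed form, which gives a reference point for simulated values. The variances below are per-sample quantities; the variance of an average over $n$ samples is smaller by a factor of $n$ .

{\begin{aligned}\mathbb {E} \left[f(U)\,g(U)\right]&=\int _{0}^{1}1\,\mathrm {d} x=1,\\{\textrm {Cov}}\left(f(U),g(U)\right)&=1-{\tfrac {3}{2}}\ln 2\approx -0.03972,\\{\textrm {Var}}\left(g(U)\right)&={\tfrac {1}{12}},\qquad c^{\star }=12\left({\tfrac {3}{2}}\ln 2-1\right)\approx 0.4766,\\{\textrm {Var}}\left(f(U)\right)&={\tfrac {1}{2}}-\left(\ln 2\right)^{2}\approx 0.01955,\\{\textrm {Var}}\left(f(U)+c^{\star }\left(g(U)-{\tfrac {3}{2}}\right)\right)&={\textrm {Var}}\left(f(U)\right)-{\frac {\left[{\textrm {Cov}}\left(f(U),g(U)\right)\right]^{2}}{{\textrm {Var}}\left(g(U)\right)}}\approx 0.00061.\end{aligned}}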

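With $n=1500$ samples and the estimated optimal coefficient $c^{\star }\approx 0.4773$ , a single Monte Carlo run gives the following estimates of the integral, whose exact value is $I=\ln 2\approx 0.69315$ :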

Method	Estimate	Variance
Classical simulation	0.69475	0.01947
Control variates	0.69295	0.00060
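The experiment can be sketched in a few lines of Python. This is an illustrative reconstruction under stated assumptions, not the code that produced the table: the function name `simulate`, the seed, and the choice to report per-sample variances are all assumptions, and the exact numbers depend on the random draws, so they will not match the table digit for digit.

```python
import random


def f(u):
    """Integrand whose mean over U(0, 1) is the target integral."""
    return 1.0 / (1.0 + u)


def g(u):
    """Control variate with known mean 3/2 over U(0, 1)."""
    return 1.0 + u


def sample_variance(xs):
    """Unbiased sample variance (n - 1 divisor)."""
    mean = sum(xs) / len(xs)
    return sum((x - mean) ** 2 for x in xs) / (len(xs) - 1)


def simulate(n=1500, seed=0):
    """Run one plain and one control-variate estimate on the same draws."""
    rng = random.Random(seed)  # the seed is an arbitrary illustrative choice
    us = [rng.random() for _ in range(n)]
    fs = [f(u) for u in us]
    gs = [g(u) for u in us]

    f_bar = sum(fs) / n
    g_bar = sum(gs) / n

    # Estimate c* = -Cov(f, g) / Var(g) from the same samples.
    cov_fg = sum((a - f_bar) * (b - g_bar) for a, b in zip(fs, gs)) / (n - 1)
    c_hat = -cov_fg / sample_variance(gs)

    # Adjust every sample by the control variate, then average.
    adjusted = [a + c_hat * (b - 1.5) for a, b in zip(fs, gs)]

    return {
        "c_hat": c_hat,
        "plain_estimate": f_bar,
        "plain_variance": sample_variance(fs),
        "cv_estimate": sum(adjusted) / n,
        "cv_variance": sample_variance(adjusted),
    }


if __name__ == "__main__":
    for name, value in simulate().items():
        print(f"{name}: {value:.5f}")
```

Using the same uniform draws for both estimators is a deliberate choice here: it makes the two rows directly comparable, since any difference then comes from the control-variate adjustment rather than from different random samples.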
