期刊
BERNOULLI
卷 21, 期 4, 页码 2308-2335出版社
INT STATISTICAL INST
DOI: 10.3150/14-BEJ645
关键词
distributed computing; heavy-tailed noise; large deviations; linear models; low-rank matrix estimation; principal component analysis; robust estimation
资金
- National Institute of Environmental Health Sciences (NIEHS) of the National Institutes of Health (NIH) [NSF DMS-0847388, NSF CCF-0808847, R01-ES-017436]
In many real-world applications, collected data are contaminated by noise with heavy-tailed distribution and might contain outliers of large magnitude. In this situation, it is necessary to apply methods which produce reliable outcomes even if the input contains corrupted measurements. We describe a general method which allows one to obtain estimators with tight concentration around the true parameter of interest taking values in a Banach space. Suggested construction relies on the fact that the geometric median of a collection of independent weakly concentrated estimators satisfies a much stronger deviation bound than each individual element in the collection. Our approach is illustrated through several examples, including sparse linear regression and low-rank matrix recovery problems.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据