KMS Of Academy of mathematics and systems sciences, CAS
| A distributed multiple sample testing for massive data | |
| Xie Xiaoyue1,2; Shi Jian1,2; Song Kai3 | |
| 2021-04-08 | |
| 发表期刊 | JOURNAL OF APPLIED STATISTICS
![]() |
| ISSN | 0266-4763 |
| 页码 | 19 |
| 摘要 | When the data are stored in a distributed manner, direct application of traditional hypothesis testing procedures is often prohibitive due to communication costs and privacy concerns. This paper mainly develops and investigates a distributed two-node Kolmogorov-Smirnov hypothesis testing scheme, implemented by the divide-and-conquer strategy. In addition, this paper also provides a distributed fraud detection and a distribution-based classification for multi-node machines based on the proposed hypothesis testing scheme. The distributed fraud detection is to detect which node stores fraud data in multi-node machines and the distribution-based classification is to determine whether the multi-node distributions differ and classify different distributions. These methods can improve the accuracy of statistical inference in a distributed storage architecture. Furthermore, this paper verifies the feasibility of the proposed methods by simulation and real example studies. |
| 关键词 | Distributed scheme hypothesis testing fraud detection classification |
| DOI | 10.1080/02664763.2021.1911967 |
| 收录类别 | SCI |
| 语种 | 英语 |
| WOS研究方向 | Mathematics |
| WOS类目 | Statistics & Probability |
| WOS记录号 | WOS:000637242100001 |
| 出版者 | TAYLOR & FRANCIS LTD |
| 引用统计 | |
| 文献类型 | 期刊论文 |
| 条目标识符 | http://ir.amss.ac.cn/handle/2S8OKBNM/58424 |
| 专题 | 中国科学院数学与系统科学研究院 |
| 通讯作者 | Shi Jian |
| 作者单位 | 1.Chinese Acad Sci, Acad Math & Syst Sci, Beijing, Peoples R China 2.Univ Chinese Acad Sci, Sch Math Sci, Beijing, Peoples R China 3.Beijing Inst Technol, Sch Management & Econ, Beijing, Peoples R China |
| 推荐引用方式 GB/T 7714 | Xie Xiaoyue,Shi Jian,Song Kai. A distributed multiple sample testing for massive data[J]. JOURNAL OF APPLIED STATISTICS,2021:19. |
| APA | Xie Xiaoyue,Shi Jian,&Song Kai.(2021).A distributed multiple sample testing for massive data.JOURNAL OF APPLIED STATISTICS,19. |
| MLA | Xie Xiaoyue,et al."A distributed multiple sample testing for massive data".JOURNAL OF APPLIED STATISTICS (2021):19. |
| 条目包含的文件 | 条目无相关文件。 | |||||
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论