Differences between revisions 1 and 3 (spanning 2 versions)
Revision 1 as of 2018-06-10 02:03:01
Size: 352
Editor: localhost
Comment:
Revision 3 as of 2018-06-10 02:08:04
Size: 485
Editor: localhost
Comment:
Deletions are marked like this. Additions are marked like this.
Line 2: Line 2:
= BigData NA =
Line 9: Line 9:
== 数据归约 ==
  比如,我們要計算年齡分段(0-100歲),分成少年,青年,中年,老年。

BigData NA

数据挖掘

  • 想要做数据挖掘,我们首先需要做以下的几个步骤。

数据集成

  • 将多个数据源或者是异构数据源的数据,在数据仓库本地进行整合。

数据清洗

  • 对冲突的属性进行删减,对噪声数据进行剔除,进行离群点检测。

数据归约

  • 比如,我們要計算年齡分段(0-100歲),分成少年,青年,中年,老年。

首頁/2018-06-10 (last edited 2018-06-10 08:41:32 by localhost)