Differences between revisions 2 and 3
Revision 2 as of 2018-06-10 02:03:19
Revision 3 as of 2018-06-10 02:08:04

BigData NA

Data Mining

  • Before doing data mining, we first need to carry out the following steps.

Data Integration

  • Consolidate data from multiple data sources, including heterogeneous ones, locally in the data warehouse.
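
A minimal sketch of the integration step in Python. The two sources, their field names (`customer_id`, `cust`, `total_spent`) and the `integrate` helper are hypothetical and used only for illustration; a real data warehouse would do this with its own ETL tooling.

```python
# Two hypothetical sources describing the same customers with different schemas.
crm_rows = [
    {"customer_id": 1, "name": "Ann", "age": 34},
    {"customer_id": 2, "name": "Bob", "age": 51},
]
billing_rows = [
    {"cust": "1", "total_spent": "120.50"},  # key and amount stored as strings
    {"cust": "3", "total_spent": "80.00"},
]


def integrate(crm, billing):
    """Merge both sources into one table keyed by an integer customer id."""
    merged = {}
    for row in crm:
        merged[row["customer_id"]] = {
            "name": row["name"],
            "age": row["age"],
            "total_spent": None,  # unknown until billing data is merged in
        }
    for row in billing:
        key = int(row["cust"])  # normalise the key type across sources
        record = merged.setdefault(
            key, {"name": None, "age": None, "total_spent": None}
        )
        record["total_spent"] = float(row["total_spent"])
    return merged


warehouse = integrate(crm_rows, billing_rows)
```

Customers that appear in only one source are kept, with the missing attributes set to `None`, so that the later cleaning step can decide what to do with them.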

Data Cleaning

  • Remove conflicting attributes, filter out noisy data, and perform outlier detection.
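
A minimal sketch of noise filtering and outlier detection, assuming a list of age readings. The valid range 0 to 100, the sample data, and the choice of the 1.5 × IQR rule are assumptions made for this example; the page does not name a specific method, and removing conflicting attributes is not shown here.

```python
import statistics


def clean(values, low=0, high=100):
    """Drop missing or out-of-range readings (treated here as noise)."""
    return [v for v in values if v is not None and low <= v <= high]


def iqr_outliers(values, k=1.5):
    """Return the values lying more than k * IQR outside the quartiles."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    spread = q3 - q1
    lower, upper = q1 - k * spread, q3 + k * spread
    return [v for v in values if v < lower or v > upper]


raw_ages = [23, 25, None, 27, 24, -5, 26, 22, 250, 28, 95]
ages = clean(raw_ages)        # removes None, -5 and 250
outliers = iqr_outliers(ages)  # flags values far from the bulk of the data
```

The two helpers are kept separate because an outlier is not necessarily an error: an age of 95 is valid but unusual in this sample, so it is flagged for review rather than silently dropped.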

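Data Reduction

  • For example, to compute age brackets (ages 0-100), we divide the ages into four groups: juvenile, youth, middle-aged, and elderly.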
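
The age example can be sketched as a simple binning function. The cut points 18, 40 and 65 are hypothetical: the page names the four groups but does not say where their boundaries lie.

```python
from bisect import bisect_right
from collections import Counter

# Hypothetical cut points (assumption, not given in the source):
# juvenile < 18 <= youth < 40 <= middle-aged < 65 <= elderly
UPPER_BOUNDS = [18, 40, 65]
LABELS = ["juvenile", "youth", "middle-aged", "elderly"]


def age_group(age):
    """Map an age in the range 0-100 to one of the four group labels."""
    if not 0 <= age <= 100:
        raise ValueError(f"age out of range: {age}")
    return LABELS[bisect_right(UPPER_BOUNDS, age)]


ages = [5, 17, 18, 33, 40, 64, 65, 100]
counts = Counter(age_group(a) for a in ages)
```

After binning, 101 possible age values are reduced to four categories, which is the kind of reduction the example describes.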

首頁/2018-06-10 (last edited 2018-06-10 08:41:32 by localhost)