thông tin biểu ghi

ISSN 1549-3636
DDC 004
Tác giả CN Vo, Ngoc Phu
Nhan đề A reformed K-Nearest neighbors algorithm for big data sets / Vo Ngoc Phu, Vo Thi Ngoc Tran
Thông tin xuất bản New York : Science Publications, 2018
Mô tả vật lý 13 p.
Tóm tắt A Data Mining Has Already Had Many Algorithms Which A K-Nearest Neighbors Algorithm, K-NN, Is A Famous Algorithm For Researchers. K-NN Is Very Effective On Small Data Sets, However It Takes A Lot Of Time To Run On Big Datasets. Today, Data Sets Often Have Millions Of Data Records, Hence, It Is Difficult To Implement K-NN On Big Data. In This Research, We Propose An Improvement To K-NN To Process Big Datasets In A Shortened Execution Time. The Reformed K-Nearest Neighbors Algorithm (R-K-NN) Can Be Implemented On Large Datasets With Millions Or Even Billions Of Data Records. R-K-NN Is Tested On A Data Set With 500,000 Records. The Execution Time Of R-K-NN Is Much Shorter Than That Of K-NN. In Addition, R-K-NN Is Implemented In A Parallel Network System With Hadoop Map (M) And Hadoop Reduce (R).
Thuật ngữ chủ đề Big Data
Từ khóa tự do Distributed System
Từ khóa tự do Algorithm
Từ khóa tự do K-Nearest Neighbors
Từ khóa tự do K-NN
Từ khóa tự do Parallel Network Environment
Từ khóa tự do Data Mining
Từ khóa tự do Cloudera
Từ khóa tự do Association Rules
Từ khóa tự do Hadoop Map
Từ khóa tự do Hadoop Reduce
Khoa Khoa Công nghệ Thông tin
Tác giả(bs) CN Vo, Thi Ngoc Tran
Nguồn trích Journal of Computer Science. Số: Vol. 14, Issue 9, P.1213-1225, ,
Địa chỉ Thư Viện Đại học Nguyễn Tất Thành
Tệp tin điện tử doi.org/10.3844/jcssp.2018.1213.1225
000 00000nam#a2200000u##4500
00119608
00212
004BA3E1B5A-E8AD-4F52-8205-F73A706A743C
005202003090053
008200302s2018 nyu eng
0091 0
022 |a1549-3636
039|a20200309005328|bphucvh|y20200302105605|zphucvh
040 |aNTT
041 |aeng
044 |anyu
082 |a004|223
100 |aVo, Ngoc Phu
245 |aA reformed K-Nearest neighbors algorithm for big data sets / |cVo Ngoc Phu, Vo Thi Ngoc Tran
260 |aNew York : |bScience Publications, |c2018
300 |a13 p.
520 |aA Data Mining Has Already Had Many Algorithms Which A K-Nearest Neighbors Algorithm, K-NN, Is A Famous Algorithm For Researchers. K-NN Is Very Effective On Small Data Sets, However It Takes A Lot Of Time To Run On Big Datasets. Today, Data Sets Often Have Millions Of Data Records, Hence, It Is Difficult To Implement K-NN On Big Data. In This Research, We Propose An Improvement To K-NN To Process Big Datasets In A Shortened Execution Time. The Reformed K-Nearest Neighbors Algorithm (R-K-NN) Can Be Implemented On Large Datasets With Millions Or Even Billions Of Data Records. R-K-NN Is Tested On A Data Set With 500,000 Records. The Execution Time Of R-K-NN Is Much Shorter Than That Of K-NN. In Addition, R-K-NN Is Implemented In A Parallel Network System With Hadoop Map (M) And Hadoop Reduce (R).
650 |aBig Data
653 |aDistributed System
653 |aAlgorithm
653 |aK-Nearest Neighbors
653 |aK-NN
653 |aParallel Network Environment
653|aData Mining
653|aCloudera
653|aAssociation Rules
653|aHadoop Map
653|aHadoop Reduce
690 |aKhoa Công nghệ Thông tin
700 |aVo, Thi Ngoc Tran
773|tJournal of Computer Science|gVol. 14, Issue 9, P.1213-1225
852 |aThư Viện Đại học Nguyễn Tất Thành
856|udoi.org/10.3844/jcssp.2018.1213.1225
890|c1|a0|b0|d2
Không tìm thấy biểu ghi nào