成人免费xxxxx在线视频软件_久久精品久久久_亚洲国产精品久久久_天天色天天色_亚洲人成一区_欧美一级欧美三级在线观看

如何立足Hadoop成功建立商務智能:七項必備訣竅

大數(shù)據(jù) Hadoop

在企業(yè)實施Hadoop技術時,其中的***用例無疑在于商務智能(簡稱BI)。根據(jù)新近發(fā)布的一項基準調(diào)查結果,我們整理出最適用于處理各類工作負載的幾款Hadoop SQL引擎。下面,我們一起來看:

1. 不存在萬試萬靈的選項

[[176156]]

No Single Best Engine

The benchmark results show that there is no one-size-fits-all general purpose engine for executing these types of queries. "Depending on raw data size, query complexity, and the target number of end-users, enterprises will find that each engine has its own 'sweet spot,'" according to the study's findings.

2. 小數(shù)據(jù)對大數(shù)據(jù)

[[176157]]

Small Vs. Big Data

The benchmark shows that Impala and Spark SQL are the stars when it comes to queries against small data sets. AtScale said that the most recent release of Hive LLAP (Live Long and Process) shows acceptable query response times on small data sets, and that Presto also shows promise for these types of queries.

3. 少對多

[[176158]]

Few Vs. Many

This metric looks at the performance when the data is hit with many queries at the same time. Presto, which AtScale included for the first time in this benchmark test, showed the best results for concurrency testing. Impala continued its strong concurrent query performance. Hive and Spark SQL registered significant improvements on this metric in the current benchmark test.

4. 復雜查詢情況

[[176159]]

Complex Queries

AtScale's Klahr warns that, while Impala and Presto do well on concurrency, the results shifted as queries became more complex. When it came to complex queries, SparkSQL started to outperform Impala, Klahr told InformationWeek. "You need to have a multi-engine strategy and a mechanism that can automatically route end-user queries to the right engine without the end-user having to think about 'Am I writing a Spark query or an Impala query?'" he said, noting that AtScale does perform that kind of automatic routing to the best engine.

5. 大規(guī)模數(shù)據(jù)集

[[176160]]

Large Data Sets

Querying big data sets generally means slower results. The fastest performing engines for these data sets were Spark SQL at less than 20 seconds, followed by Impala at less than 40 seconds. Response times for both of these engines improved significantly from the benchmark six months ago to today. Hive and Presto returned results in just over 2 minutes. Increasing the number of joins generally increased processing time, according to AtScale. Spark SQL and Impala were more likely to perform best as the number of joins increased.

6. 不同引擎各擅勝場

[[176161]]

Everybody Wins

All the engines that were evaluated registered significant performance improvements since AtScale's last benchmark test 6 months ago -- on the order of 2x to 4x, according to the company. "This is great news for those enterprises deploying BI workloads to Hadoop. We believe that a best-of-breed strategy -- best engine, best semantic Bilayer, best visualization tool -- will lead enterprises down the most successful path to BI-on-Hadoop success," the company said in its benchmark report.

7. 充分考慮開源優(yōu)勢

[[176162]]

Open Source Advances

Klahr told InformationWeek in an interview that between the first edition of the benchmark 6 months ago and today, the query performance of Hive improved by 3.5x, Spark by 2.5x, and Impala by 3x. "If I'm a buyer or an executive, these improvements are going to make me stop and question any investment on a proprietary Hadoop engine," Klahr said, because these open source tools are being improved at a rapid pace.

責任編輯:武曉燕 來源: 網(wǎng)絡大數(shù)據(jù)
相關推薦

2016-11-17 14:42:46

云企業(yè)訣竅

2022-08-01 08:48:06

數(shù)字化領導者企業(yè)

2012-02-03 10:18:52

移動商務智能方案

2022-08-01 10:41:03

人工智能認證人工智能

2020-05-25 22:39:38

機器學習物聯(lián)網(wǎng)IOT

2013-06-20 13:38:30

2011-04-13 12:56:53

計算機編程

2022-07-15 15:22:51

區(qū)塊鏈開發(fā)語言

2023-06-30 11:55:09

人工智能機器學習

2012-06-15 10:14:22

2012-06-13 10:43:39

英特爾酷睿博銳

2011-08-03 09:34:08

戴爾

2020-02-25 16:48:11

物聯(lián)網(wǎng)可穿戴設備智能眼鏡

2021-11-03 10:53:22

人工智能商業(yè)智能軟件

2022-09-14 10:31:27

網(wǎng)絡安全IT安全領導者

2020-04-06 13:52:45

數(shù)據(jù)倉庫大數(shù)據(jù)平臺Hadoop

2014-12-11 17:47:23

混合云私有云

2022-06-08 10:29:28

人工智能機器人

2019-01-08 10:26:19

人工智能 Python技術

2022-05-13 10:06:40

傳感器類型物聯(lián)網(wǎng)
點贊
收藏

51CTO技術棧公眾號

主站蜘蛛池模板: 日本三级做a全过程在线观看 | 久久美女网 | 久久精品国产免费 | 国产一区在线视频 | 国内精品久久久久久 | 久久久久久久久久久成人 | 羞羞的视频在线看 | 国产专区在线 | 成人精品鲁一区一区二区 | 欧区一欧区二欧区三免费 | 91国内精精品久久久久久婷婷 | 精品一二区 | 99精品视频一区二区三区 | 成人免费在线网 | 亚洲一区二区三区在线播放 | 日韩在线精品视频 | 精品久久久久久久人人人人传媒 | 日韩精品一区二区三区视频播放 | 在线视频一区二区 | 亚洲嫩草 | 黄视频网址 | 久久综合伊人 | 欧美性极品xxxx做受 | 在线国产中文字幕 | 亚洲国产精品久久久久婷婷老年 | 久热国产在线 | 国产97在线视频 | 丁香五月网久久综合 | 欧美日韩国产精品激情在线播放 | 黄色在线免费观看视频 | 二区视频| 欧美精品一二三区 | 亚洲精品久久久久国产 | 看毛片网站| 福利片在线| 国产在线拍偷自揄拍视频 | 国产乱码精品一区二区三区忘忧草 | 中文字幕一区二区三区乱码在线 | 伊人电影院av | 亚洲精品高清视频 | 日韩欧美黄色 |