070-773 試験問題を無料オンラインアクセス
試験コード: | 070-773 |
試験名称: | Analyzing Big Data with Microsoft R |
認定資格: | Microsoft |
無料問題数: | 40 |
更新日: | 2025-09-02 |
You are running a parallel function that uses the following R code segment. (Line numbers are included for reference only.)
0 1 cp <- 0.01 xval <- 0 maxdepth <- 5
0 2
0 3 (form, data = "segmentationDataBig", maxDepth = maxdepth, cp = cp, xval = xval, blocksPerRead = 250 You need to complete the R code. The solution must support chunking.
Which function should insert at line 02?
You need to use the ScaleR distributed processing in an Apache Hadoop environment.
Which data source should you use?
Note: This question is part of a series of questions that use the same or similar answer choices. An answer choice may be correct for more than one question in the series. Each question is independent of the other questions in this series. Information and details provided in a question apply only to that question.
You have a data source that is larger than memory.
You need to visualize the distribution of the values for a variable in the data source.
What should you use?
You have a slow Map Reduce job.
You need to optimize the job to control the number of mapper and runner tasks.
Which function should you use?
Note: This question is part of a series of questions that use the same or similar answer choices. An answer choice may be correct for more than one question in the series. Each question is independent of the other questions in this series. Information and details provided in a question apply only to that question.
You have a dataset that contains the physical characteristics of people.
You need to visualize a relationship between height and weight for a subset of observations in the dataset.
What should you use?