Big Data: Architectures And Data Analytics Exam Paper

Big Data: Architectures and Data Analytics
July 1st, 2016
Student ID ______________________________________________________________
First Name ______________________________________________________________
Last Name ______________________________________________________________
The exam is open book and lasts 2 hours.
Part I
Answer the following questions. Each question has exactly one correct answer.
1. (2 points) Consider the cache “mechanism” of Spark. Which one of the following
statements is true?
a) Caching an RDD is always useful
b) An RDD that is used only one time in an application must always be cached
c) An RDD must always be cached by using the MEMORY_ONLY storage level if
its size is larger than the maximum amount of main memory of the cluster
d) Caching an RDD that is used multiple times in an application can improve the
efficiency of the application (in terms of execution time).
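The reasoning behind question 1 can be illustrated with a toy model of lazy evaluation. This is only a sketch in plain Python, not Spark's API: the class ToyDataset, its methods, and the computations counter are hypothetical names introduced for illustration. It assumes that every action on an uncached dataset rebuilds the data from scratch, while a cached dataset is rebuilt only once.

```python
class ToyDataset:
    """Toy stand-in for a lazily evaluated dataset (hypothetical, not Spark's API)."""

    def __init__(self, compute):
        self._compute = compute   # function that rebuilds the data from scratch
        self._cached = None       # materialized data, once stored
        self._use_cache = False   # set to True by cache()
        self.computations = 0     # number of times the data was rebuilt

    def cache(self):
        # Mark the dataset so the first materialization is kept for reuse.
        self._use_cache = True
        return self

    def _materialize(self):
        if self._use_cache and self._cached is not None:
            return self._cached
        self.computations += 1
        data = self._compute()
        if self._use_cache:
            self._cached = data
        return data

    def count(self):
        return len(self._materialize())

    def total(self):
        return sum(self._materialize())


def build():
    return [x * x for x in range(1000)]


# Two actions on an uncached dataset: the data is rebuilt twice.
uncached = ToyDataset(build)
uncached.count()
uncached.total()

# Two actions on a cached dataset: the data is rebuilt once.
cached = ToyDataset(build).cache()
cached.count()
cached.total()

print(uncached.computations, cached.computations)
```

In this toy model caching only pays off when the dataset is used more than once, and it says nothing about memory limits, which a real cluster also has to respect.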
2. (2 points) Consider the HDFS file log.txt. The size of log.txt is 1024MB. Suppose
that you are using a Hadoop cluster that can potentially run up to 2048 mappers in
parallel, and suppose that you execute the word count application, based on
MapReduce, with log.txt as the input file. Suppose that Hadoop automatically sets
the number of mappers to 2 (i.e., it runs two mappers) for this execution. What is
the block size of the HDFS file system?
a) Block size: between 1024MB and 2048MB
b) Block size: between 512MB and 1023MB
c) Block size: between 256MB and 511MB
d) Block size: between 128MB and 255MB
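The arithmetic behind question 2 can be sketched as follows. The sketch assumes, as a simplification, that Hadoop starts one mapper per HDFS block of the input file, so the number of mappers is the file size divided by the block size, rounded up. The function name estimated_mappers and the sample block sizes are hypothetical and chosen only for illustration; a real cluster may apply additional split settings that this model ignores.

```python
import math


def estimated_mappers(file_size_mb: int, block_size_mb: int) -> int:
    """Estimate the number of mappers as one per HDFS block (simplifying assumption)."""
    return math.ceil(file_size_mb / block_size_mb)


FILE_SIZE_MB = 1024  # size of log.txt, taken from the question

# Try a few block sizes and print the estimated number of mappers for each.
for block_size_mb in (128, 256, 512, 1023, 1024):
    print(block_size_mb, estimated_mappers(FILE_SIZE_MB, block_size_mb))
```

Under this assumption, a 1024MB file yields exactly two mappers only when the block size is at least 512MB and smaller than 1024MB.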
