AWS Big Data Specialty Practice Test 1
1) Which of the following is a business analytics service provided by AWS?
(A) Business Objects
(B) Micro strategy
(C) Quick Sight
(D) Power Bi
[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option DExplanation
“] [/efaccordion]
2) Your application currently uses Dynamo DB as the data store. You also have a test environment where you perform load tests on your application. There is a constant need to reset the data In the Dynamo DB tables. How can this be achieved? Choose 2 answers from the options below. Each answer forms part of the solution.
A. Use the Dynamo DB export feature to copy the data before the test begins
B. Use the Dynamo DB import feature to copy the data after the test ends.
C. Use the AWS Data Pipeline to export data from a Dynamo DB table to a file in an Amazon S3 bucket before the test begins ….„
D. Use the AWS Data Pipeline to import data from a Dynamo DB table from the file in an Amazon 53 bucket after the test ends
(A) C,D
(B) A,D
(C) B,C
(D) A,B
[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option AExplanation
“] [/efaccordion]
3) You currently have data in Dynamo DB tables. You have a requirement to perform complex data analysis queries on the data stored In the Dynamo DB tables. How can this be achieved?
(A) Copy the data on AWS EMR and then perform the complex queries x
(B) Query the Dynamo DB tables, since it support complex queries.
(C) Copy the data on AWS Red shift and then perform the complex queries
(D) Copy the data on AWS Quick sight and then perform the complex queries
[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option CExplanation
“] [/efaccordion]
4) Your company maintains an e-commerce site in AWS. They want to use AWS Machine learning to see how many units of a particular product will be sold. Which machine learning model would you use for this purpose?
(A) Regression classification
(B) Simple classification
(C) Binary classification
(D) Multi class classification x
[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option AExplanation
“] [/efaccordion]
5) You currently have a Red shift Cluster defined in AWS. The data is currently unencrypted in nature. You have now decided that the cluster needs to have encrypted data. How can you achieve this? Choose 2 answers from the options given below. Each answer forms part of the solution ?
A. Unload the data from the existing. source cluster
B. Reload the data In a new, target cluster with the chosen encryption setting
C. Make a backup copy of the data from the existing source cluster and encrypt it with SSE
D. Enable the Encryption attribute of the cluster
(A) C,D
(B) A,B
(C) B,C
(D) A,D
[efaccordion id=”01″] [efitems title=”Answer ” text=”Option B“] [/efaccordion]6) You currently have web servers that put data from their log files onto Kinesis streams. In this scenario what role are the web servers playing
(A) The data stream role
(B) The consumers role
(C) The producers role
(D) The data stream role
[efaccordion id=”01″] [efitems title=”Answer ” text=”Option C“] [/efaccordion]7) Which of the following operations are available for scaling a Redshift Cluster Please select: Please select:
(A) Use the snapshot and restore operations to make a copy of an existing cluster, Then, resize the new cluster
(B) Scale the cluster in or out by changing the number of nodes
(C) Scale the cluster up or down by specifying a different node type
(D) All of the above
[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option DExplanation
“] [/efaccordion]
8) there is a requirement to perform SQL querying along with complex queries on HDFS and 53 file systems. Which of the below tools can fulfil this requirement? Please select:
(A) LPresto
(B) YARN
(C) QuidcSight
(D) Kinesis
[efaccordion id=”01″] [efitems title=”Answer ” text=”Option B“] [/efaccordion]9) Your company currently has an order processing system in AWS. There are EC2 Instances in place to pick up the orders from the application and EC2 Instances in an Auto scaling Group to process the orders. Which of the following additional components can Ideally be used to ensure that the EC2 Processing instances are correctly scaled based on demand?
(A) Use Cloud Watch metrics to understand the load capacity on the processing servers and then scale the capacity accordingly
(B) Use SQS queues to decouple the architecture. Scale the processing servers based on the queue length.
(C) Use SQS queues to decouple the architecture. Scale the processing servers based on notifications sent from the SOS queues.
(D) Use Cloud Watch metrics to understand the load capacity on the processing servers. Ensure SNS is used to scale up the servers based on notifications
[efaccordion id=”01″] [efitems title=”Answer” text=”Option B“] [/efaccordion]10) In an AWS EMR Cluster which of the following nodes is responsible for running the YARN service?
(A) Core Node
(B) Master Node
(C) Primary Node
(D) Task Node
[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option BExplanation
“] [/efaccordion]
11) A company is currently managing their data workload in Amazon Aurora. They are looking at encrypting the data at rest. Which of the following can be used for managing the encryptions keys?
(A) Client side encryption
(B) AWS CIoud HSM
(C) S3-SSE
(D) AWS KMS
[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option DExplanation
“] [/efaccordion]
12) You are planning to use the AWS EMR service to create instances which make use of the Hadoop software. But apart from the Hadoop software, you also need some custom software to be installed on these systems. Which is the best way to get the custom software Installed on instances which are launched as part of the cluster?
(A) Use Cloud watch agents to install the custom software
(B) Ensure that the EMR Cluster Instance configuration has the location In 53 for the custom software
(C) Ensure that the EMR Cluster configuration has the location In S3 for the custom software x
(D) Use the Bootstrap actions to install the custom software
[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option DExplanation
“] [/efaccordion]
13) Which of the following services isa fully-managed service that can be used to build machine learning models at any scale
(A) AWS SageMlaker
(B) AWSFargatex
(C) AWS Polly
(D) AWS Greengrass
[efaccordion id=”01″] [efitems title=”Answer” text=”Option D“] [/efaccordion]14) You enable encryption when you launch a cluster. To migrate from an unencrypted duster to an encrypted duster. you first unload your data from the existing, source duster. Then you reload the data In a new, target cluster What is the purpose of the Hadoop Encrypted Shuffle feature?
(A) The files are shuffled across nodes in a cluster
(B) The EC2 instances are shuffled across the cluster for better performance
(C) The data in transit between the nodes in a cluster is encrypted .
(D) The encryption keys used in a cluster are shuffled at regular intervals
[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option CExplanation
“] [/efaccordion]
15) Which of the following can be used by an organization for archival of data for long periods of time? Please select:
(A) Amazon Kinesis
(B) Amazon S3
(C) Amazon Glacier
(D) Amazon EMR
[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option CExplanation
“] [/efaccordion]
16) Which one of the following is not True about loT enabled devices?
(A) Maximum number of thing types in an AWS account is unlimited
(B) Message Broker provides a secure mechanism for devices and AWS loT applications to publish and receive messages from each other
(C) Number of thing types that can be associated with a thing is 1.
(D) Device Shadow is a YAML document used to store and retrieve current state information for a device
[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option DExplanation
“] [/efaccordion]
17) There is a requirement for EC2 Instances in your private subnet to access Dynamo DB tables. How can this be achieved?
(A) Attach a virtual private gateway to the VPC
(B) Convert the private subnet to a public subnet since this is the only way for the access to be achieved
(C) There is no way for instances in a private subnet to access Dynamo DB tables
(D) Use VPC endpoint
[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option DExplanation
“] [/efaccordion]
18) Which of the following is not a condition that needs to be met by the local secondary index for a Dynamo DB table Please select?
(A) The partition key is different from that of the base table
(B) The partition key is the same as that of its base table
(C) The sort key of the base table is projected into the index, where it acts as a non-key attribute
(D) The sort key consists of exactly one scalar attribute
[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option AExplanation
“] [/efaccordion]
19) You have a defined a local secondary index for a Dynamo DB table. You are then performing queries against the index, but the performance is not ideal as expected. Which of the following can reduce the performance of querying an index when using the Local Secondary Index?
(A) When querying for a projected attribute x
(B) When querying for a different sort key value
(C) When querying for a non-projected attribute
(D) When querying a partition key that is not present in the LSI
[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option CExplanation
“] [/efaccordion]
20) What is the current maximum total data read rate for a Kinesis shard
(A) 5 MB per second
(B) 1 MB per second
(C) 2 MB per second
(D) 10 MB per second
[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option CExplanation
“] [/efaccordion]