AWS Big Data Specialty Practice Test 1

1) Which of the following is a business analytics service provided by AWS?

(A)  Business Objects

(B) MicroStrategy

(C)  QuickSight

(D)  Power BI

[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option C

Explanation

Amazon QuickSight is a fast, cloud-powered business analytics service that makes it easy to build visualizations, perform ad-hoc analysis, and quickly get business insights from your data. Using this cloud-based service, you can easily connect to your data, perform advanced analysis, and create visualizations and rich dashboards that can be accessed from any browser or mobile device.

“] [/efaccordion]


2) Your application currently uses DynamoDB as the data store. You also have a test environment where you perform load tests on your application. There is a constant need to reset the data in the DynamoDB tables. How can this be achieved? Choose 2 answers from the options below. Each answer forms part of the solution.

A. Use the DynamoDB export feature to copy the data before the test begins

B. Use the DynamoDB import feature to copy the data after the test ends

C. Use AWS Data Pipeline to export data from a DynamoDB table to a file in an Amazon S3 bucket before the test begins

D. Use AWS Data Pipeline to import data into a DynamoDB table from the file in an Amazon S3 bucket after the test ends

(A)  C,D

(B)  A,D

(C)  B,C

(D)  A,B

[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option A

Explanation

You can use AWS Data Pipeline to export data from a DynamoDB table to a file in an Amazon S3 bucket. You can also use the console to import data from Amazon S3 into a DynamoDB table, in the same AWS region or in a different region.

“] [/efaccordion]

3) You currently have data in DynamoDB tables. You have a requirement to perform complex data analysis queries on the data stored in the DynamoDB tables. How can this be achieved?

(A)  Copy the data to AWS EMR and then perform the complex queries

(B)  Query the DynamoDB tables, since DynamoDB supports complex queries

(C)  Copy the data to Amazon Redshift and then perform the complex queries

(D)  Copy the data to Amazon QuickSight and then perform the complex queries

[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option C

Explanation

Amazon Redshift complements Amazon DynamoDB with advanced business intelligence capabilities and a powerful SQL-based interface. When you copy data from a DynamoDB table into Amazon Redshift, you can perform complex data analysis queries on that data, including joins with other tables in your Amazon Redshift cluster.

“] [/efaccordion]


4) Your company maintains an e-commerce site in AWS. They want to use AWS Machine learning to see how many units of a particular product will be sold. Which machine learning model would you use for this purpose?

(A)  Regression classification

(B)  Simple classification

(C)  Binary classification

(D)  Multiclass classification

[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option A

Explanation

Regression problems predict a numeric value, which is what a units-sold forecast requires. Amazon ML supports three types of ML models: binary classification, multiclass classification, and regression.

“] [/efaccordion]
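To make the idea of "predicting a numeric value" concrete, here is a minimal sketch of a regression fit in plain Python. It does not use Amazon ML; the discount and units-sold figures are made-up illustration data, and `fit_line` is a hypothetical helper that performs ordinary least squares on a single feature.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept (one feature)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Made-up history: discount percent offered and units sold at that discount.
discounts = [0, 5, 10, 15, 20]
units = [100, 120, 140, 160, 180]

slope, intercept = fit_line(discounts, units)

# The model output is a number (units sold), not a class label.
predicted = slope * 25 + intercept
print(predicted)
```

A classification model would instead answer a question such as "will this product sell out, yes or no?", which is why it does not fit the units-sold requirement.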

5) You currently have a Redshift cluster defined in AWS. The data is currently unencrypted. You have now decided that the cluster needs to have encrypted data. How can you achieve this? Choose 2 answers from the options given below. Each answer forms part of the solution.

A. Unload the data from the existing, source cluster

B. Reload the data in a new, target cluster with the chosen encryption setting

C. Make a backup copy of the data from the existing source cluster and encrypt it with SSE

D. Enable the Encryption attribute of the cluster

(A)  C,D

(B)  A,B

(C)  B,C

(D)  A,D

[efaccordion id=”01″] [efitems title=”Answer ” text=”Option B“] [/efaccordion]

6) You currently have web servers that put data from their log files onto Kinesis streams. In this scenario, what role are the web servers playing?

(A) The data stream role

(B) The consumers role

(C)  The producers role

(D)  The data stream role

[efaccordion id=”01″] [efitems title=”Answer ” text=”Option C“] [/efaccordion]
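As a rough illustration of the producer role, the sketch below shows a web server pushing one log line onto a stream. The `put_record` call with `StreamName`, `Data`, and `PartitionKey` matches the boto3 Kinesis client; everything else is an assumption for illustration: `send_log_line`, the stream name, the host name, and `FakeKinesis`, a stand-in client so the example runs without AWS credentials.

```python
import json


def send_log_line(kinesis_client, stream_name, host, line):
    """Put one log line on the stream. The caller acts as the producer."""
    payload = json.dumps({"host": host, "line": line}).encode("utf-8")
    return kinesis_client.put_record(
        StreamName=stream_name,
        Data=payload,
        PartitionKey=host,  # records from one host map to the same shard
    )


class FakeKinesis:
    """Offline stand-in for boto3.client("kinesis"); records what was sent."""

    def __init__(self):
        self.records = []

    def put_record(self, **kwargs):
        self.records.append(kwargs)
        return {"SequenceNumber": str(len(self.records))}


fake = FakeKinesis()
send_log_line(fake, "web-logs", "web-1", "GET /index.html 200")
print(fake.records[0]["PartitionKey"])
```

With a real client, the same function would be called as `send_log_line(boto3.client("kinesis"), ...)`. An application that reads those records back from the stream would be the consumer.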

7) Which of the following operations are available for scaling a Redshift cluster?

(A)  Use the snapshot and restore operations to make a copy of an existing cluster. Then, resize the new cluster

(B) Scale the cluster in or out by changing the number of nodes

(C)  Scale the cluster up or down by specifying a different node type

(D)  All of the above

[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option D

Explanation

As your data warehousing capacity and performance needs change or grow, you can resize your cluster to make the best use of the computing and storage options that Amazon Redshift provides. You can scale the cluster in or out by changing the number of nodes. Or, you can scale the cluster up or down by specifying a different node type. You can resize your cluster by using one of the following approaches: Use the resize operation with an existing cluster. Use the snapshot and restore operations to make a copy of an existing cluster. Then, resize the new cluster.

“] [/efaccordion]

8) There is a requirement to perform SQL querying along with complex queries on HDFS and S3 file systems. Which of the below tools can fulfil this requirement?

(A) Presto

(B) YARN

(C)  QuickSight

(D) Kinesis

[efaccordion id=”01″] [efitems title=”Answer ” text=”Option A“] [/efaccordion]


9) Your company currently has an order processing system in AWS. There are EC2 instances in place to pick up the orders from the application and EC2 instances in an Auto Scaling group to process the orders. Which of the following additional components can ideally be used to ensure that the EC2 processing instances are correctly scaled based on demand?

(A)  Use CloudWatch metrics to understand the load capacity on the processing servers and then scale the capacity accordingly

(B) Use SQS queues to decouple the architecture. Scale the processing servers based on the queue length.

(C)  Use SQS queues to decouple the architecture. Scale the processing servers based on notifications sent from the SQS queues.

(D)  Use CloudWatch metrics to understand the load capacity on the processing servers. Ensure SNS is used to scale up the servers based on notifications

[efaccordion id=”01″] [efitems title=”Answer” text=”Option B“] [/efaccordion]
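One way to think about "scale based on the queue length" is as a backlog-per-instance calculation. The sketch below is only an illustration of that arithmetic: `desired_instances`, the per-instance capacity, and the group size limits are assumed values, not AWS defaults. In practice the queue length would typically come from the SQS CloudWatch metric `ApproximateNumberOfMessagesVisible`.

```python
import math


def desired_instances(queue_length, msgs_per_instance, min_size=1, max_size=10):
    """Instances needed so each one handles at most msgs_per_instance messages."""
    needed = math.ceil(queue_length / msgs_per_instance)
    # Clamp to the (assumed) Auto Scaling group limits.
    return max(min_size, min(max_size, needed))


# Assume one processing instance can work through 100 queued orders.
for backlog in (0, 450, 5000):
    print(backlog, desired_instances(backlog, msgs_per_instance=100))
```

Because the queue sits between the order intake and the processors, the backlog reflects actual demand even when the processing instances themselves are idle or overloaded.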

10) In an AWS EMR Cluster which of the following nodes is responsible for running the YARN service? 

(A)  Core Node

(B)  Master Node

(C) Primary Node

(D)  Task Node

[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option B

Explanation

The master node manages the cluster and typically runs master components of distributed applications. For example, the master node runs the YARN ResourceManager service to manage resources for applications, as well as the HDFS NameNode service.

“] [/efaccordion]

11) A company is currently managing their data workload in Amazon Aurora. They are looking at encrypting the data at rest. Which of the following can be used for managing the encryption keys?

(A)  Client-side encryption

(B)  AWS CloudHSM

(C)  S3-SSE

(D) AWS KMS

[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option D

Explanation

Amazon Aurora now allows you to encrypt your databases using keys you manage through AWS Key Management Service (KMS). On a database instance running with Amazon Aurora encryption, data stored at rest in the underlying storage is encrypted, as are the automated backups, snapshots, and replicas in the same cluster. The correct answer is AWS KMS.

“] [/efaccordion]

12) You are planning to use the AWS EMR service to create instances which make use of the Hadoop software. But apart from the Hadoop software, you also need some custom software to be installed on these systems. Which is the best way to get the custom software installed on instances which are launched as part of the cluster?

(A)  Use CloudWatch agents to install the custom software

(B)  Ensure that the EMR cluster instance configuration has the location in S3 for the custom software

(C)  Ensure that the EMR cluster configuration has the location in S3 for the custom software

(D)  Use the Bootstrap actions to install the custom software

[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option D

Explanation

In addition to the standard software and applications that are available for installation on your cluster, you can use bootstrap actions to install custom software. Bootstrap actions are scripts that run on the instances when your cluster is launched, and that run on new nodes that are added to your cluster when they are created. Bootstrap actions are also useful to invoke AWS CLI commands on each node to copy objects from Amazon S3 to each node in your cluster.

“] [/efaccordion]
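For reference, a bootstrap action is described to EMR as a name plus the S3 path of a script. The sketch below builds that structure in the shape accepted by the `BootstrapActions` parameter of the boto3 EMR client's `run_job_flow` call. The helper name `bootstrap_action`, the bucket, the script path, and the arguments are all hypothetical.

```python
def bootstrap_action(name, script_s3_path, args=None):
    """Build one entry for the BootstrapActions list of run_job_flow."""
    return {
        "Name": name,
        "ScriptBootstrapAction": {
            "Path": script_s3_path,    # script stored in S3, run on each node
            "Args": list(args or []),  # arguments passed to the script
        },
    }


# Hypothetical script that installs the custom software on every node.
actions = [
    bootstrap_action(
        "Install custom tools",
        "s3://example-bucket/bootstrap/install-tools.sh",
        ["--version", "1.2.3"],
    )
]
print(actions[0]["ScriptBootstrapAction"]["Path"])
```

The list would be passed as `BootstrapActions=actions` alongside the other cluster settings when the cluster is created, so the script also runs on nodes added later.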


13) Which of the following services is a fully managed service that can be used to build machine learning models at any scale?

(A)  AWS SageMaker

(B)  AWS Fargate

(C)  AWS Polly

(D)  AWS Greengrass

[efaccordion id=”01″] [efitems title=”Answer” text=”Option A“] [/efaccordion]

14) What is the purpose of the Hadoop Encrypted Shuffle feature?

(A)  The files are shuffled across nodes in a cluster

(B)  The EC2 instances are shuffled across the cluster for better performance

(C)  The data in transit between the nodes in a cluster is encrypted

(D)  The encryption keys used in a cluster are shuffled at regular intervals

[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option C

Explanation

Shuffling is the process of transferring data from the mappers to the reducers: the system sorts the map output and transfers it to the reducers as input. In the shuffle phase, Hadoop MapReduce (MRv2) sends the output of each map task to reducers on different nodes, using HTTP by default, so this data is in transit (in flight) between nodes. The Encrypted Shuffle feature encrypts that traffic by using HTTPS.

“] [/efaccordion]

15) Which of the following can be used by an organization for archival of data for long periods of time?

(A)  Amazon Kinesis

(B) Amazon S3

(C)  Amazon Glacier

(D)  Amazon EMR

[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option C

Explanation

Amazon Glacier is a storage service optimized for infrequently used data, or cold data. The service provides durable and extremely low-cost storage with security features for data archiving and backup. With Amazon Glacier, you can store your data cost-effectively for months, years, or even decades.

“] [/efaccordion]

16) Which one of the following is not true about IoT-enabled devices?

(A)  Maximum number of thing types in an AWS account is unlimited

(B)  Message Broker provides a secure mechanism for devices and AWS IoT applications to publish and receive messages from each other

(C)  Number of thing types that can be associated with a thing is 1.

(D)  Device Shadow is a YAML document used to store and retrieve current state information for a device

[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option D

Explanation

Device Shadow is a JSON document used to store and retrieve current state information for a device.

“] [/efaccordion]
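To show what such a JSON document can look like, here is a small sketch of a shadow with `desired` and `reported` state sections, which are the sections AWS IoT shadows use. The attribute names (`power`, `brightness`) are invented, and `shadow_delta` is a simplified, hypothetical helper that handles only flat attributes; the real service computes the delta for you.

```python
import json

# Invented shadow document for a hypothetical smart lamp.
shadow_text = """
{
  "state": {
    "desired": {"power": "on", "brightness": 80},
    "reported": {"power": "on", "brightness": 40}
  }
}
"""


def shadow_delta(shadow):
    """Desired attributes whose value differs from what the device reported."""
    desired = shadow["state"].get("desired", {})
    reported = shadow["state"].get("reported", {})
    return {key: value for key, value in desired.items() if reported.get(key) != value}


shadow = json.loads(shadow_text)
delta = shadow_delta(shadow)
print(delta)
```

The device would act on the differing attributes and then report its new state, which brings the two sections back in sync.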

17) There is a requirement for EC2 instances in your private subnet to access DynamoDB tables. How can this be achieved?

(A)  Attach a virtual private gateway to the VPC

(B)  Convert the private subnet to a public subnet since this is the only way for the access to be achieved

(C)  There is no way for instances in a private subnet to access DynamoDB tables

(D)  Use a VPC endpoint

[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option D

Explanation

A VPC endpoint for DynamoDB enables Amazon EC2 instances in your VPC to use their private IP addresses to access DynamoDB with no exposure to the public Internet. Your EC2 instances do not require public IP addresses, and you do not need an Internet gateway, a NAT device, or a virtual private gateway in your VPC. You use endpoint policies to control access to DynamoDB. Traffic between your VPC and the AWS service does not leave the Amazon network.

“] [/efaccordion]
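The endpoint policy mentioned above is a JSON policy document attached to the endpoint. The sketch below assembles one that limits the endpoint to read operations on a single table. The helper `dynamodb_endpoint_policy`, the account ID, the region, and the table name are placeholders chosen for illustration.

```python
import json


def dynamodb_endpoint_policy(table_arn, actions):
    """Build an endpoint policy allowing only the given actions on one table."""
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Effect": "Allow",
                "Principal": "*",
                "Action": sorted(actions),
                "Resource": table_arn,
            }
        ],
    }


# Placeholder ARN: account 123456789012 and table "Orders" are made up.
policy = dynamodb_endpoint_policy(
    "arn:aws:dynamodb:us-east-1:123456789012:table/Orders",
    ["dynamodb:Query", "dynamodb:GetItem"],
)
print(json.dumps(policy, indent=2))
```

Requests through the endpoint still need to be allowed by the caller's own IAM permissions; the endpoint policy only narrows what can pass through the endpoint.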

18) Which of the following is not a condition that needs to be met by the local secondary index for a DynamoDB table?

(A) The partition key is different from that of the base table

(B) The partition key is the same as that of its base table

(C) The sort key of the base table is projected into the index, where it acts as a non-key attribute

(D)  The sort key consists of exactly one scalar attribute

[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option A

Explanation

Every local secondary index must meet the following conditions: The partition key is the same as that of its base table. The sort key consists of exactly one scalar attribute. The sort key of the base table is projected into the index, where it acts as a non-key attribute.

“] [/efaccordion]
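The first two conditions can be checked mechanically against key schemas written in the `AttributeName` / `KeyType` form that DynamoDB's `CreateTable` uses. The sketch below is a local illustration only: `lsi_problems` is a hypothetical helper, the table and attribute names are invented, and it does not check that the sort key attribute is scalar.

```python
def lsi_problems(table_key_schema, index_key_schema):
    """Return a list of rule violations for a proposed local secondary index."""

    def names(schema, key_type):
        return [k["AttributeName"] for k in schema if k["KeyType"] == key_type]

    problems = []
    if names(table_key_schema, "HASH") != names(index_key_schema, "HASH"):
        problems.append("partition key must be the same as the base table's")
    if len(names(index_key_schema, "RANGE")) != 1:
        problems.append("index must have exactly one sort key attribute")
    return problems


# Invented base table: partition key CustomerId, sort key OrderId.
table_keys = [
    {"AttributeName": "CustomerId", "KeyType": "HASH"},
    {"AttributeName": "OrderId", "KeyType": "RANGE"},
]
good_index = [
    {"AttributeName": "CustomerId", "KeyType": "HASH"},
    {"AttributeName": "OrderDate", "KeyType": "RANGE"},
]
bad_index = [
    {"AttributeName": "OrderDate", "KeyType": "HASH"},
    {"AttributeName": "OrderId", "KeyType": "RANGE"},
]

print(lsi_problems(table_keys, good_index))
print(lsi_problems(table_keys, bad_index))
```

An index with a different partition key, like `bad_index` here, would have to be a global secondary index instead.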


19) You have defined a local secondary index for a DynamoDB table. You are then performing queries against the index, but the performance is not as good as expected. Which of the following can reduce the performance of querying an index when using the local secondary index?

(A)  When querying for a projected attribute

(B) When querying for a different sort key value

(C) When querying for a non-projected attribute

(D)  When querying a partition key that is not present in the LSI

[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option C

Explanation

For index queries that read attributes that are not projected into the local secondary index, DynamoDB needs to fetch those attributes from the base table, in addition to reading the projected attributes from the index. These fetches occur when you include any non-projected attributes in the Select or ProjectionExpression parameters of the Query operation. Fetching causes additional latency in query responses, and it also incurs a higher provisioned throughput cost. In addition to the reads from the local secondary index described above, you are charged for read capacity units for every base table item fetched. This charge is for reading each entire item from the table, not just the requested attributes.

“] [/efaccordion]

20) What is the current maximum total data read rate for a Kinesis shard?

(A) 5 MB per second

(B)  1 MB per second

(C) 2 MB per second

(D)  10 MB per second

[efaccordion id=”01″] [efitems title=”Answer & Explaination” text=”Option C

Explanation

Each shard can support up to 5 transactions per second for reads, up to a maximum total data read rate of 2 MB per second.

“] [/efaccordion]
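Those two per-shard read limits can be turned into a rough sizing calculation. The sketch below is a simplification that assumes all consumers share the standard per-shard read throughput and that load is spread evenly across shards; `shards_for_reads` and the sample workloads are illustrative.

```python
import math

READ_MB_PER_SEC_PER_SHARD = 2      # maximum total read rate per shard
READ_CALLS_PER_SEC_PER_SHARD = 5   # maximum read transactions per shard


def shards_for_reads(read_mb_per_sec, read_calls_per_sec):
    """Smallest shard count that satisfies both read limits."""
    by_bytes = math.ceil(read_mb_per_sec / READ_MB_PER_SEC_PER_SHARD)
    by_calls = math.ceil(read_calls_per_sec / READ_CALLS_PER_SEC_PER_SHARD)
    return max(1, by_bytes, by_calls)


# 7 MB/s of reads needs 4 shards even though 10 calls/s would fit in 2.
print(shards_for_reads(7, 10))
# A light but chatty consumer set is limited by the call rate instead.
print(shards_for_reads(1, 20))
```

Whichever limit is hit first determines the shard count, so both the data volume and the polling frequency of the consumers matter.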
