Apache Druid Fresco play Questions and Answers 2022

Apache Druid

Apache Druid is a high performance real-time analytics database. Druid’s main value add is to reduce time to insight and action.

Apache Druid is designed for workflows where fast queries and ingest really matter. Druid excels at powering UIs, running operational (ad-hoc) queries, or handling high concurrency. Consider Druid as an open source alternative to data warehouses for a variety of use cases. The design documentation explains the key concepts.

Apache Druid Architecture

Templates can contain HTML and text, mixed with Handlebars expressions.
Choose the correct option:
1)True
2)False

Correct answer of above question is : -1)True

Which of the following are common application areas for Apache Druid?
Choose the correct option:
1)Clickstream analytics
2)Network flow analytics
3)All the options
4)Digital marketing analytics
5)Application performance metrics

Correct answer of above question is : -3)All the options

The __ provides unified API gateway to the Druid brokers, overlords, and coordinators.
Choose the correct option:
1)Overlord process
2)Coordinator process
3)Broker process
4)Router process

Correct answer of above question is : -4)Router process

Predict the output.
Hi {{name}}.
is used with the context variable:
var context = {
“name” : “< b>Jane Roe< /b>”
}
Choose the correct option:
1)Jane Roe without any style
2)Nothing to display
3)< b>Jane Roe< /b>
4)< b>Jane Roe< /b>

Correct answer of above question is : -3)< b>Jane Roe< /b>

Apache Druid is a __ oriented distributed data store.
Choose the correct option:
1)Graph
2)Column
3)Key-Value
4)Document

Correct answer of above question is : -2)Column

Which of the following servers are used in Druid for easy deployment?
Choose the correct option:
1)Query Server
2)All the options
3)Data Server
4)Master Server

Correct answer of above question is : -2)All the options

The __ downloads segments from Deep Storage, and responds to queries about these segments.
Choose the correct option:
1)MiddleManager process
2)Broker process
3)Historical process
4)Router process

Correct answer of above question is : -3)Historical process

What is the alternative to merge in Git?
Choose the correct option:
1)Rebase
2)Fetch
3)Pull
4)Push

Correct answer of above question is : -1)Rebase

The _ controls the flow of data to Druid.
Choose the correct option:
1)Broker process
2)Overlord process
3)Coordinator process
4)Router process

Correct answer of above question is : -2)Overlord process

A time range in Druid is considered a _.
Choose the correct option:
1)Datasource
2)Segment
3)Chunk

Correct answer of above question is : -3)Chunk

Choose the available queries in Apache Druid.
Choose the correct option:
1)All the options
2)Aggregation Queries
3)Search Queries
4)Metadata Queries

Correct answer of above question is : -1)All the options

Which of the following servers in Apache Druid are responsible for providing endpoints?
Choose the correct option:
1)Data Server
2)Query Server
3)Master Server
4)All the options

Correct answer of above question is : -2)Query Server

Select the two main parts to a Data Source Metadata query.
Choose the correct option:
1)context & queryType
2)dataSource & context
3)queryType & dataSource
4)None of the options

Correct answer of above question is : -3)queryType & dataSource

Datasources are responsible for storing data in Druid.
Choose the correct option:
1)False
2)True

Correct answer of above question is : -2)True

_____are organized into time chunks.
Choose the correct option:
1)Columns
2)Segments
3)Files
4)Folders
5)None of the options

Correct answer of above question is : -2)Segments

Select the correct statement in Apache Druid.
Choose the correct option:
1)All the options
2)Druid must have enough disk space available in deep storage and across the historical processes for the data you plan to load.
3)Deep Storage also called Shared file storage, is accessible by every Druid server.
4)To respond to queries, Historical processes do not read from deep storage.

Correct answer of above question is : -1)All the options

Which processes manage data availability on the Server?
Choose the correct option:
1)MiddleManager
2)Overlord
3)Coordinator
4)Broker

Correct answer of above question is : -3)Coordinator

In Apache Druid , Select queries do not support pagination.
Choose the correct option:
1)True
2)False

Correct answer of above question is : -2)False

Which of the following statements is not correct in relation to GroupBy v1 and v2 strategies.
Choose the correct option:
1)The default strategy for a cluster is determined by the “druid.query.groupBy.defaultStrategy” runtime property on the Broker.
2)GroupBy v1 controls resource usage using a row-based limit (maxResults) whereas GroupBy v2 uses bytes-based limits.G
3)GroupBy v2 supports using chunkPeriod to parallelize merging on the Broker whereas GroupBy v1 ignores chunkPeriod.

Correct answer of above question is : -3)GroupBy v2 supports using chunkPeriod to parallelize merging on the Broker whereas GroupBy v1 ignores chunkPeriod.

Which of the following is the default GroupBy query strategy.
Choose the correct option:
1)v3
2)v1
3)v4
4)v2

Correct answer of above question is : -4)v2

The _ manages new data ingestion.
Choose the correct option:
1)Router process
2)Overlord process
3)MiddleManager process
4)Coordinator process

Correct answer of above question is : -3)MiddleManager process

Which of the following is a datasource type in Apache Druid?
Choose the correct option:
1)Union data source
2)All the options
3)Query data source
4)Table data source

Correct answer of above question is : -2)All the options

Which of the following is the core element of a Druid ingestion Spec?
Choose the correct option:
1)Time Column
2)Datasource name
3)dataSchema
4)All the Options
5)Parser

Correct answer of above question is : -3)dataSchema

****done untill here ***

Denormalized data in Apache Druid is ingested in _ formats.
Choose the correct option:
1)TSV
2)All the Options
3)JSON
4)CSV

Correct answer of above question is : -2)All the Options

Which of the following external dependencies is used to backup and transfer data from Druid processes in the background?
Choose the correct option:
1)Zookeeper
2)Deep storage
3)Metadata storage
4)All the options

Correct answer of above question is : -2)Deep storage

Apache Druid uses _ compressed bitmap indexes to create indexes.
Choose the correct option:
1)Both Concise and Roaring
2)Concise
3)None of the Options
4)Roaring

Correct answer of above question is : -1)Both Concise and Roaring

The __ receives requests from outside customers, and transmits these requests to data servers.
Choose the correct option:
1)Coordinator process
2)Broker process
3)Overlord process
4)Router process

Correct answer of above question is : -2)Broker process

_ is responsible for allocating segments to specific servers, and ensuring that segments are well balanced across Historicals.
Choose the correct option:
1)Overlord process
2)Broker process
3)Coordinator process
4)Router process

Correct answer of above question is : -3)Coordinator process

Which of the following is used to interpret input data in Druid?
Choose the correct option:
1)Datasource name
2)Time Column
3)dataSchema
4)Parser

Correct answer of above question is : -4)Parser

Which of the following Granularity types are supported in Apache Druid?
Choose the correct option:
1)Arbitrary
2)None of the options
3)Uniform and Arbitrary
4)Uniform

Correct answer of above question is : -3)Uniform and Arbitrary

Which of the following concept in Druid optionally replaces dimensional values with new values, enabling combined functionality?
Choose the correct option:
1)Lookups
2)Both virtual columns and Lookups
3)None of the Options
4)Virtual Columns

Correct answer of above question is : -1)Lookups

Which of the following query is not included in Aggregation query?
Choose the correct option:
1)Timeseries
2)GroupBy
3)TopN
4)Time Boundary

Correct answer of above question is : -4)Time Boundary

Which of the following concept in Druid optionally replaces dimensional values with new values, enabling combined functionality?
Choose the correct option:
1)Lookups
2)Both virtual columns and Lookups
3)None of the Options
4)Virtual Columns

Correct answer of above question is : -1)Lookups

Which of the following query is not included in Aggregation query?
Choose the correct option:
1)Timeseries
2)GroupBy
3)TopN
4)Time Boundary

Correct answer of above question is : -4)Time Boundary

Which option controls the number of rows returned in each block of paginated results?
Choose the correct option:
1)threshold
2)PagingSpec
3)pagingIdentifiers

Correct answer of above question is : -1)threshold

In case of a Druid server failure, data can be recovered from _.
Choose the correct option:
1)Sea Storage
2)None of the options
3)Deep Hub
4)Sea Hub
5)Deep Storage

Correct answer of above question is : -5)Deep Storage

In Apache Druid, Scan queries do not support pagination.
Choose the correct option:
1)False
2)True

Correct answer of above question is : -2)True

Among the following options, which aggregation query is the most flexible but has the poorest performance?
Choose the correct option:
1)None of the options
2)GroupBy
3)Timeseries
4)TopN

Correct answer of above question is : -3)Timeseries

Segment IDs in Apache Druid are comprised of________.
Choose the correct option:
1)Interval start time
2)Interval end time
3)All the options
4)Segment datasource
5)Version

Correct answer of above question is : -3)All the options

For grouping and sorting over a single dimension, queries are much more optimized than __.
Choose the correct option:
1)Timeseries, TopN
2)TopN, Timeseries
3)GroupBy, TopN
4)TopN, GroupBy

Correct answer of above question is : -4)TopN, GroupBy

For ease of deployment, Druid suggests organizing processes into __ server types.
Choose the correct option:
1)Master, Query, Analytics
2)Client, Master, Zookeeper
3)Admin, Data, Master
4)Master, Query, Data
5)None of the options

Correct answer of above question is : -4)Master, Query, Data

Which of the following is an example of a SELECTOR filter?
Choose the correct option:
1)All the options
2)filter: { “type”: “regex”, “dimension”: < dimension_string>, “pattern”: < pattern_string> }
3)filter: { “type”: “selector”, “dimension”: < dimension_string>, “value”: < dimension_value_string> }
4)filter: { “type”: “columnComparison”, “dimensions”: [< dimension_a>, < dimension_b>] }

Correct answer of above question is : -3)filter: { “type”: “selector”, “dimension”: < dimension_string>, “value”: < dimension_value_string> }

Druid data is stored in “datasources”, similar to _.
Choose the correct option:
1)Table in RDBMS
2)Column in RDBMS
3)row in RDBMS
4)Database in RDBMS

Correct answer of above question is : -1)Table in RDBMS

Which of the following queries are grouped by single dimension, and are sorted according to the metric?
Choose the correct option:
1)TopN
2)GroupBy
3)Time Boundry
4)Timeseries

Correct answer of above question is : -1)TopN

TopN queries can use HAVING conditions over aggregated data.
Choose the correct option:
1)False
2)True

Correct answer of above question is : -1)False

Periodically, segments are committed and published. At this point, they are written to deep storage, become immutable.
Choose the correct option:
1)False
2)True

Correct answer of above question is : -2)True

__ is the mechanism by which the segments are published and are served by historical process.
Choose the correct option:
1)Handoff
2)Sharding
3)Indexing

Correct answer of above question is : -1)Handoff

Each segment starts off being created on a MiddleManager, and at that point the segment is _.
Choose the correct option:
1)immutable & uncommitted
2)mutable & uncommitted
3)immutable & Committed
4)mutable & committed

Correct answer of above question is : -2)mutable & uncommitted

Select the processes which store queryable data.
Choose the correct option:
1)MiddleManager
2)None of the options
3)Historical
4)Broker
5)Coordinator

Correct answer of above question is : -3)Historical

Zookeeper is used for _.
Choose the correct option:
1)leader election
2)All the Options
3)coordination and leader collection
4)backup data
5)coordination

Correct answer of above question is : -3)coordination and leader collection

About Author


After years of Technical Work, I feel like an expert when it comes to Develop wordpress website. Check out How to Create a Wordpress Website in 5 Mins, and Earn Money Online Follow me on Facebook for all the latest updates.