Apache Flume Interview Questions and Answers 2022

Apache Flume is a distributed data ingestion service for efficiently collecting and moving large amounts of streaming log data from its sources to a destination such as HDFS.

Apache Flume Questions and Answers

Question#1 Apache Flume 1.3.0 is the fourth release under the auspices of Apache of the so-called __ codeline.

a)NG

b)ND

c)NF

d)NR

Ans : NG

Question#2 How can Flume be used with HBase?

a)HBaseSink

b)AsyncHBaseSink

c)Both A and B

d)None of these

Ans : Both A and B

Question#3 What is true about Apache Flume?

a)Apache Flume is a reliable and distributed system for collecting, aggregating and moving massive quantities of log data.

b)It has a simple yet flexible architecture based on streaming data flows

c)Apache Flume is used to collect log data present in log files from web servers and aggregating it into HDFS for analysis.

d)All of the above

Ans : All of the above

Question#4 What are the types of cluster managers in Spark?

a)Standalone

b)Apache Mesos

c)YARN

d)All of the above

Ans : All of the above

Question#5 Point out the correct statement.

a)Flume is a distributed, reliable, and available service

b)Version 1.5.2 is the eighth Flume release as an Apache top-level project

c)Flume 1.5.2 is production-ready software for integration with hadoop

d)All of the mentioned

Ans : Flume is a distributed, reliable, and available service

Question#6 A Flume agent is a JVM process which has

a)3 components

b)4 components

c)5 components

d)6 components

Ans : 3 components
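
The three components are the source, the channel, and the sink. As a rough illustration only, the Python sketch below models how an event moves through them. All class and function names here are hypothetical; they are not part of Flume, which is written in Java and configured through a properties file.

```python
from collections import deque


class MemoryChannelModel:
    """Toy stand-in for a channel: a bounded FIFO buffer between source and sink."""

    def __init__(self, capacity):
        self.capacity = capacity
        self._events = deque()

    def put(self, event):
        if len(self._events) >= self.capacity:
            raise OverflowError("channel is full")
        self._events.append(event)

    def take(self):
        # Returns None when nothing is buffered.
        return self._events.popleft() if self._events else None


def run_source(lines, channel):
    """Source role: wrap incoming lines as events and put them on the channel."""
    for line in lines:
        channel.put({"headers": {}, "body": line})


def run_sink(channel, destination):
    """Sink role: drain the channel and deliver event bodies to a destination."""
    while (event := channel.take()) is not None:
        destination.append(event["body"])


channel = MemoryChannelModel(capacity=10)
delivered = []
run_source(["GET /index.html", "GET /about.html"], channel)
run_sink(channel, delivered)
print(delivered)
```

The channel sits between the other two components, so the source and the sink never call each other directly. That decoupling is why an agent can buffer events when the sink is slower than the source.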

Question#7 Which rules are important when configuring a Flume agent?

a)Every source must have at least one channel

b)Every sink must have only one channel

c)Every component must have a specific type

d)All of the above

Ans : All of the above
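
The three rules in Question#7 can be checked mechanically. The sketch below is a minimal example, assuming a single agent named a1 with a netcat source, a memory channel, and a logger sink, laid out in the usual agent.kind.name.property style. The helper functions parse_properties and check_agent are hypothetical and written for this illustration; Flume does its own validation at startup.

```python
CONFIG = """
# One agent (a1) with one source, one channel, and one sink.
a1.sources = r1
a1.channels = c1
a1.sinks = k1
a1.sources.r1.type = netcat
a1.sources.r1.bind = localhost
a1.sources.r1.port = 44444
a1.sources.r1.channels = c1
a1.channels.c1.type = memory
a1.sinks.k1.type = logger
a1.sinks.k1.channel = c1
"""


def parse_properties(text):
    """Parse 'key = value' lines, skipping blank lines and '#' comments."""
    props = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition("=")
        props[key.strip()] = value.strip()
    return props


def check_agent(props, agent):
    """Return a list of rule violations for one agent (empty list means OK)."""
    problems = []
    # Rule c: every component must have a type.
    for kind in ("sources", "channels", "sinks"):
        for name in props.get(f"{agent}.{kind}", "").split():
            if not props.get(f"{agent}.{kind}.{name}.type"):
                problems.append(f"{kind[:-1]} {name} has no type")
    # Rule a: every source must have at least one channel.
    for name in props.get(f"{agent}.sources", "").split():
        if not props.get(f"{agent}.sources.{name}.channels", "").split():
            problems.append(f"source {name} has no channel")
    # Rule b: every sink must have exactly one channel.
    for name in props.get(f"{agent}.sinks", "").split():
        if len(props.get(f"{agent}.sinks.{name}.channel", "").split()) != 1:
            problems.append(f"sink {name} must have exactly one channel")
    return problems


print(check_agent(parse_properties(CONFIG), "a1"))
```

Note the asymmetry in the property names: a source lists its channels (plural), while a sink names a single channel.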

Question#8 _ was created to allow you to flow data from a source into your Hadoop environment.

a)Impala

b)Oozie

c)Flume

d)All of the mentioned

Ans : Flume

Question#9 Which levels of reliability does Flume offer?

a)best-effort delivery

b)end-to-end delivery

c)Both A and B

d)None

Ans : Both A and B

Question#10 What are the different channel types in Flume?

a)Memory Channel

b)File Channel

c)JDBC Channel

d)All of these

Ans : All of these

Question#11 A __ is an operation on the stream that can transform the stream.

a)Decorator

b)Source

c)Sinks

d)All of the mentioned

Ans : Decorator

Question#12 Between which components does Flume carry data?

a)sources and decorator

b)sources and sinks

c)start and decorator

d)decorator and sinks

Ans : sources and sinks

Question#13 Which tools are used in Big Data?

a)Hadoop

b)Hive

c)Pig

d)All of these

Ans : All of these

Question#14 Point out the wrong statement.

a)Version 1.4.0 is the fourth Flume release as an Apache top-level project

b)Apache Flume 1.5.2 is a security and maintenance release that disables SSLv3 on all components in Flume that support SSL/TLS

c)Flume is backwards-compatible with previous versions of the Flume 1.x codeline

d)None of the mentioned

Ans : None of the mentioned

Question#15 In Flume, the gathering of data can be:

a)scheduled

b)event-driven

c)user-defined

d)Both A and B

Ans : Both A and B

Question#16 A number of __ source adapters give you the granular control to grab a specific file.

a)multimedia file

b)text file

c)image file

d)None of the above

Ans : text file

Question#17 __ is used when you want the sink to be the input source for another operation.

a)Collector Tier Event

b)Agent Tier Event

c)Basic

d)All of the above

Ans : Agent Tier Event

Question#18 _ is where you would land a flow (or possibly multiple flows joined together) into an HDFS-formatted file system.

a)Collector Tier Event

b)Agent Tier Event

c)Basic

d)All of the above

Ans : Collector Tier Event

Question#19 __ sink can be a text file, the console display, a simple HDFS path, or a null bucket where the data is simply deleted.

a)Collector Tier Event

b)Agent Tier Event

c)Basic

d)All of the above

Ans : Basic

Question#20 Flume deploys as one or more agents, each contained within its own instance of _.

a)JVM

b)Chunks

c)Channels

d)None of the above

Ans : JVM

Question#21 Flume can also be used to transport event data, including but not limited to network traffic data, data generated by social media websites, and email messages.

a)True

b)False

Ans : True
