Nptel Big Data Computing Week 6 Solutions


 

What is Distributed K-Means Iterative Clustering?

 A single-node clustering algorithm

 A clustering algorithm that uses distributed computing to improve scalability

 A supervised machine learning algorithm

 A dimensionality reduction technique


What is the primary goal of the K-Means clustering algorithm?

 Classification of data points

 Regression analysis

 Finding the nearest neighbor for each data point

 Partitioning data points into clusters based on similarity

1 point

What is the main objective of using Parallel K-Means with MapReduce for Big Data Analytics?

 To reduce the dimensionality of the data

 To classify data points into predefined categories

 To efficiently cluster large datasets in a distributed manner

 To perform regression analysis on big data

1 point

What is the primary goal of using Parallel K-Means with MapReduce in Big Data Analytics?

 To perform regression analysis

 To classify data points into predefined categories

 To efficiently handle large-scale clustering tasks

 To visualize data patterns

1 point

Which of the following tasks can be best solved using Clustering?

 Predicting the amount of rainfall based on various cues

 Training a robot to solve a maze

 Detecting fraudulent credit card transactions

 All of the mentioned

1 point

Identify the correct statement(s) in context of overfitting in decision trees:


Statement I: The idea of Pre-pruning is to stop tree induction before a fully grown tree is built, that perfectly fits the training data.


Statement II: The idea of Post-pruning is to grow a tree to its maximum size and then remove the nodes using a top-bottom approach.


 Only statement I is true

 Only statement II is true

 Both statements are true

 Both statements are false

1 point

Identify the correct statement(s) in context of machine learning approaches:


Statement I: In supervised approaches, the target that the model is predicting is unknown or unavailable. This means that you have unlabeled data.


Statement II: In unsupervised approaches the target, which is what the model is predicting, is provided. This is referred to as having labeled data because the target is labeled for every sample that you have in your data set.


 Only Statement I is true

 Only Statement II is true

 Both Statements are false

 Both Statements are true

1 point

What is the primary focus of Machine Learning?

 Accessing data from databases

 Extracting meaning from big data

 Learning from data

 Predicting future outcomes

1 point

Which of the following is an essential activity in the Machine Learning process?

 Writing code for specific tasks

 Designing graphical user interfaces

 Collecting and preprocessing data

 Creating beautiful data visualizations

1 point

Which distance measure calculates the distance along strictly horizontal and vertical paths, consisting of segments along the axes?

 Euclidean distance

 Manhattan distance

 Cosine similarity

 Minkowski distance

Post a Comment (0)
Previous Question Next Question

You might like