A) tokens
B) stems
C) terms
D) stack
Multiple Choice
A) text mining
B) tokenization
C) stemming
D) corpus
Multiple Choice
A) 100
B) 125
C) 150
D) 175
Multiple Choice
A) convert the categories to numeric representations.
B) convert the categories to binary, dummy variables.
C) combine as many categories as possible.
D) let them remain categorical.
Multiple Choice
A) Single linkage
B) Ward's method
C) Average group linkage
D) Dendrogram
Multiple Choice
A) considering only the two most dissimilar observations in the two clusters.
B) computing the average dissimilarity between every pair of observations between the two clusters.
C) considering only the two most similar observations in the two clusters.
D) considering the distance between the cluster centroids.
Multiple Choice
A) standardization.
B) weighting.
C) removing outliers.
D) randomizing.
Multiple Choice
A) unsupervised learning
B) data mining
C) McQuitty's method
D) Ward's method
Multiple Choice
A) matching coefficient.
B) Jaccard's coefficient.
C) Euclidean distance.
D) antecedent.
Multiple Choice
A) book
B) corpus
C) library
D) consequent
Multiple Choice
A) consequent
B) validation count
C) support count
D) antecedent
Multiple Choice
A) dendrogram.
B) scatter chart.
C) decile-wise lift chart.
D) cumulative lift tree.
Multiple Choice
A) The short leg
B) The long leg
C) The hypotenuse
D) Euclidean distance is not related to right triangles.
Multiple Choice
A) Single linkage
B) Complete linkage
C) Average linkage
D) Average group linkage
Multiple Choice
A) 66.21
B) 72.28
C) 75.39
D) 88.57
Multiple Choice
A) It is used to measure dissimilarity between categorical variable observations.
B) It is not affected by the scale on which variables are measured.
C) It increases with the increase in similarity between variable values.
D) It is commonly used as a method of measuring dissimilarity between quantitative observations.
Multiple Choice
A) agglomerating observations into a series of nested groups based on a measure of similarity.
B) organizing observations into distinct groups based on a measure of similarity.
C) reducing the number of variables to consider in data mining.
D) estimating the value of a continuous outcome variable.
Multiple Choice
A) most similar
B) most different
C) farthest apart
D) closest
Multiple Choice
A) objects
B) clusters
C) observations
D) Ward
Multiple Choice
A) 0.5
B) 1
C) 1.5
D) 2