NO.PZ2021083101000010
问题如下:
Rivera makes the following statements:
Statement 1 Some of the methods used in the exploration step are different for structured and unstructured data, but for both types of data, the step involves feature selection and feature engineering.
Statement 2 A major difference when developing a text-based ML model is the curation step, which involves cleansing, preprocessing, and converting the data into a structured format usable for model training.
Which of Rivera’s statements about differences in ML model building steps is correct?
选项:
A. Only Statement 1
B. Only Statement 2
C. Both Statement 1 and Statement 2
解释:
A is correct.
Statement 1 is correct because some of the methods used in the fourth step of ML model building (data/text exploration) are different for structured and unstructured data, and for both structured and unstructured data, the exploration step encompasses feature selection and feature engineering.
Statement 2 is incorrect because Rivera described the text preparation and wrangling step, not the text curation step. The data (text) curation step involves gathering relevant external text data via web services or programs that extract raw content from a source.
B and C are incorrect because Statement 2 is incorrect.
考点:Steps in a Data Analysis Project
text curation step主要有什么?