开发者:上海品职教育科技有限公司 隐私政策详情

应用版本:4.2.11(IOS)|3.2.5(安卓)APP下载

西红柿面 · 2024年06月05日

题目到底问的是哪一个动作

* 问题详情,请 查看题干

NO.PZ202304050200007001

问题如下:

Omar Khan is investigating the potential benefits of incorporating non-financial data, specifically weather and social media posts, to improve their stock selection process in the retail industry.

Dataset 1: A database from a large national weather provider that contains detailed weather data (temperature, humidity, rainfall, atmospheric pressure, etc.) at a very localized geographic level or zone recorded by GPS coordinates for the past 36 months.

In reviewing Dataset 1, Khan notices that there are many data fields included that would likely be highly irrelevant to their analysis and begins the process of selecting a subset of data fields that he believes are applicable.

Khan identifies a data field called that contains such results as drizzle, rain, freezing rain, snow, and hail. Khan believes that this could be much more valuable as a prediction variable if it is converted into binary form (1 = yes, 0 = no).


Khan’s selection of a subset of data from the weather dataset is best described as:

选项:

A.

trimming

B.

feature selection

C.

feature engineering

解释:

B is correct. The process of identifying and removing unneeded, irrelevant, or redundant features in a dataset is known as feature selection.

A is incorrect. Trimming is a process for handling outliers in a dataset by simply removing the extreme values and is also known as truncation.

C is incorrect. Feature engineering is the process of combining, consolidating, or creating new features that do not exist in the current weather dataset.

“Khan identifies a data field called that contains such results as drizzle, rain, freezing rain, snow, and hail. Khan believes that this could be much more valuable as a prediction variable if it is converted into binary form (1 = yes, 0 = no).


这不就是One Hot Encoding 独热编码的过程吗?这个就是Feature Engineering的部分呀?


还是说这道题问的只是从整体数据中拎出来和分析irrelevant的subset数据集这个动作?

1 个答案
已采纳答案

品职助教_七七 · 2024年06月06日

嗨,努力学习的PZer你好:


这道题“Khan’s selection of a subset of data from the weather dataset ”对应为:

In reviewing Dataset 1, Khan notices that there are many data fields included that would likely be highly irrelevant to their analysis and begins the process of selecting a subset of data fields that he believes are applicable. 其中题干中的“ Dataset 1”,就是题目中的“weather dataset”。

就是根据这一句在选。跟后面的内容没有关系。

----------------------------------------------
努力的时光都是限量版,加油!

  • 1

    回答
  • 0

    关注
  • 114

    浏览
相关问题