开发者:上海品职教育科技有限公司 隐私政策详情

应用版本:4.2.11(IOS)|3.2.5(安卓)APP下载

暴走的小七 · 2023年03月26日

p-hacking是什么意思

NO.PZ2021101401000006

问题如下:

Emily Yuen is a senior analyst for a consulting firm that specializes in assessing equity strategies using backtesting and simulation techniques. She is working with an assistant, Cameron Ruckey, to develop multifactor portfolio strategies based on nine factors common to the growth style of investing. To do so, Yuen and Ruckey plan to construct nine separate factor portfolios and then use them to create factor-weighted allocation portfolios.

Yuen tasks Ruckey with specifying the investment universe and determining the availability of appropriate reporting data in vendor databases. Ruckey selects a vendor database that does not provide point-in-time data, so he adjusts the database to include point-in-time constituent stocks and a reporting lag of four months.

Following Ruckey’s adjustments to the initial vendor database, backtested returns will most likely be subject to:

选项:

A.

stale data

B.

data snooping

C.

p-hacking

解释:

A is correct. A reporting lag of four months is likely to introduce stale data into the backtest because many large-capitalization companies report earnings within 30–50 days of quarter end. Although assuming four months (120 days) of reporting lag will eliminate a source of look-ahead bias, it introduces a new problem (i.e., stale data).

B and C are incorrect. Data snooping and p-hacking refer to the same problem: a flawed approach to using data to make decisions. Data snooping and p-hacking are not characteristics of data, nor can they be added to a dataset by making an adjustment.

p-hacking是什么意思

1 个答案

星星_品职助教 · 2023年03月26日

同学你好,

p-hacking是B选项数据发掘的同义词。即反复在数据集里挖掘,直到找到“规律”。

具体内容可参照讲义的如下部分。