![Data generating process](https://www.english.nina.az/wikipedia/image/aHR0cHM6Ly91cGxvYWQud2lraW1lZGlhLm9yZy93aWtpcGVkaWEvY29tbW9ucy90aHVtYi80LzQwL0Zpc2hlcl9pcmlzX3ZlcnNpY29sb3Jfc2VwYWx3aWR0aC5zdmcvMTYwMHB4LUZpc2hlcl9pcmlzX3ZlcnNpY29sb3Jfc2VwYWx3aWR0aC5zdmcucG5n.png )
This article needs additional citations for verification.(December 2024) |
In statistics and in empirical sciences, a data generating process is a process in the real world that "generates" the data one is interested in. This process encompasses the underlying mechanisms, factors, and randomness that contribute to the production of observed data. Usually, scholars do not know the real data generating model and instead rely on assumptions, approximations, or inferred models to analyze and interpret the observed data effectively. However, it is assumed that those real models have observable consequences. Those consequences are the distributions of the data in the population. Those distributors or models can be represented via mathematical functions. There are many functions of data distribution. For example, normal distribution, Bernoulli distribution, Poisson distribution, etc.
References
- Tu, Jun; Zhou, Guofu (2004). "Data-generating process uncertainty: What difference does it make in portfolio decisions?". Journal of Financial Economics. 72 (2): 385–421. doi:10.1016/j.jfineco.2003.05.003.
This article needs additional citations for verification Please help improve this article by adding citations to reliable sources Unsourced material may be challenged and removed Find sources Data generating process news newspapers books scholar JSTOR December 2024 Learn how and when to remove this message In statistics and in empirical sciences a data generating process is a process in the real world that generates the data one is interested in This process encompasses the underlying mechanisms factors and randomness that contribute to the production of observed data Usually scholars do not know the real data generating model and instead rely on assumptions approximations or inferred models to analyze and interpret the observed data effectively However it is assumed that those real models have observable consequences Those consequences are the distributions of the data in the population Those distributors or models can be represented via mathematical functions There are many functions of data distribution For example normal distribution Bernoulli distribution Poisson distribution etc ReferencesTu Jun Zhou Guofu 2004 Data generating process uncertainty What difference does it make in portfolio decisions Journal of Financial Economics 72 2 385 421 doi 10 1016 j jfineco 2003 05 003 https stats stackexchange com questions 443320 what does a data generating process dgp actually meanThis statistics related article is a stub You can help Wikipedia by expanding it vte