Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Can we use the true distribution to evaluate how well our estimation is? #5

Open
ichn-hu opened this issue Mar 16, 2019 · 2 comments
Open

Comments

@ichn-hu
Copy link
Collaborator

ichn-hu commented Mar 16, 2019

No, you should not do it!

Because you should pretend that you don't know the true distribution even if you can get to know it during your estimation. You should use the true distribution as a black box, and the only way you can touch it is to observe its output by get_data function.

In machine learning, peaking into the true distribution or the testing dataset during you estimation or training time is very wrong, and it is kind of like cheating. So please don't do that.


This is another question asked in person to me, I have to repost it here to make everybody aware of it. So in order to make our life easier, from now on please directly ask your question using this issue system!

@xlplayer
Copy link
Contributor

So can we utilize the true distribution after the estimation process to test how well our models are?
I think we can evaluate how well our estimation is after we finish the density estimation. What we can not do is using the true distribution to optimize our models.

@ichn-hu
Copy link
Collaborator Author

ichn-hu commented Mar 18, 2019

Yes, you ar more than welcomed to use the true distribution after all of your estimation.

ichn-hu pushed a commit that referenced this issue Apr 10, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants