-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Topic proposals for upcoming lessons #23
Comments
Eric: How you utilize your new high performance Dell workstation. |
Are Markhov-chain process models useful for predicting time-series data?-> Wiener Process-> Black-Scholes-Merton model https://en.wikipedia.org/wiki/Black%E2%80%93Scholes_model |
@joe-wojniak I encourage you to research Markov chains a little bit on your own and try to answer that question. Are there any scenarios in finance where you want to model discrete transitions between certain states? And the probabilities of those transitions? Can you imagine any scenarios where such a model would provide an improvement over other types of models? |
From the book "Common Stocks and Uncommon Profits" Question 7: Does the company have outstanding labor and personnel relations? Can you just web scrape for people's reviews on Glassdoor and discussion forums? |
@alexpryszlakh Great book and great question. |
Python is very good at web scraping- there are a couple of popular web
scraping libraries. BeautifulSoup and Scrapy being a couple.
I think it's a great idea, but great ideas need to be tested. We could try
it and see if we find a correlation.
…-Joe W.
On Thu, Feb 11, 2021 at 7:16 AM Chris Conlan ***@***.***> wrote:
@alexpryszlakh <https://github.com/alexpryszlakh> Great book and great
question.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#23 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AEAV4CHRMSSIMO2LRSNNQE3S6PRERANCNFSM4XJ4NBLA>
.
--
-Joe Wojniak
*CONFIDENTIALITY* NOTICE: The contents of this *email* message and any
attachments are intended solely for the addressee(s) and may contain
*confidential *and/or privileged *information* and may be legally protected
from disclosure.
|
https://www.crunchbase.com/ can be a source of data. |
Do we want to investigate whether py-polars is faster than pandas? Blog on the topic: https://medium.com/analytics-vidhya/is-pypolars-the-new-alternative-to-pandas-916400f03fd7 |
@joe-wojniak What do you think about py-polars? We can talk about lazy evaluation and query optimization, but it is very much a computer science and a database design topic. |
I was just going by the blog- it sounds like py-polars may work with larger
datasets and it seems there are claims of better performance. I'm happy to
work with it on a trial basis, but I wasn't sure about adding it to the
qttk environment while it's not proven.
…On Wed, Feb 24, 2021 at 9:51 AM Chris Conlan ***@***.***> wrote:
@joe-wojniak <https://github.com/joe-wojniak> What do you think about
py-polars? We can talk about lazy evaluation and query optimization, but it
is very much a computer science and a database design topic.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#23 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AEAV4CGKIDHITPZCESFZRBLTAUVAXANCNFSM4XJ4NBLA>
.
--
-Joe Wojniak
*CONFIDENTIALITY* NOTICE: The contents of this *email* message and any
attachments are intended solely for the addressee(s) and may contain
*confidential *and/or privileged *information* and may be legally protected
from disclosure.
|
I could start a test branch of the github, that way the main branch isn't
affected.
…On Wed, Feb 24, 2021 at 10:07 AM Joe Wojniak ***@***.***> wrote:
I was just going by the blog- it sounds like py-polars may work with
larger datasets and it seems there are claims of better performance. I'm
happy to work with it on a trial basis, but I wasn't sure about adding it
to the qttk environment while it's not proven.
On Wed, Feb 24, 2021 at 9:51 AM Chris Conlan ***@***.***>
wrote:
> @joe-wojniak <https://github.com/joe-wojniak> What do you think about
> py-polars? We can talk about lazy evaluation and query optimization, but it
> is very much a computer science and a database design topic.
>
> —
> You are receiving this because you were mentioned.
> Reply to this email directly, view it on GitHub
> <#23 (comment)>,
> or unsubscribe
> <https://github.com/notifications/unsubscribe-auth/AEAV4CGKIDHITPZCESFZRBLTAUVAXANCNFSM4XJ4NBLA>
> .
>
--
-Joe Wojniak
*CONFIDENTIALITY* NOTICE: The contents of this *email* message and any
attachments are intended solely for the addressee(s) and may contain
*confidential *and/or privileged *information* and may be legally
protected from disclosure.
--
-Joe Wojniak
*CONFIDENTIALITY* NOTICE: The contents of this *email* message and any
attachments are intended solely for the addressee(s) and may contain
*confidential *and/or privileged *information* and may be legally protected
from disclosure.
|
@joe-wojniak Let's talk about lazy evaluation and query optimization at our next lesson. I want you to understand why there aren't necessarily any intrinsic speed gains buried within it, and why this library might be slower overall. |
ok, sounds good.
…On Wed, Feb 24, 2021 at 10:39 AM Chris Conlan ***@***.***> wrote:
@joe-wojniak <https://github.com/joe-wojniak> Let's talk about lazy
evaluation and query optimization at our next lesson. I want you to
understand why there aren't necessarily any intrinsic speed gains buried
within it, and why this library might be slower overall.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#23 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AEAV4CFSTV2YDGP47YXHFUDTAU2TJANCNFSM4XJ4NBLA>
.
--
-Joe Wojniak
*CONFIDENTIALITY* NOTICE: The contents of this *email* message and any
attachments are intended solely for the addressee(s) and may contain
*confidential *and/or privileged *information* and may be legally protected
from disclosure.
|
Update @joe-wojniak Pypolars philosophy relies on this design pattern:
Whereas
Theoretically, the Pypolars method above would be just as fast as Pandas Method number 2, because of lazy evaluation. Does this provide a speedup? No, not necessarily, and likely not at all. It just changes the way you write code, and it changes the way you optimize code. Is it worth it at this point to explore Pypolars? I don't think so. I would need to see the author of Pypolars show some provable speedups that go above and beyond lazy evaluation to even consider. Further, Pypolars seems to advertise built-in parallelization. I don't like this at all. Serious engineers need explicit control of parallelization. Python in-memory parallelization sucks in general, because it requires pickling and unpickling of code, which isn't fully supported throughout the language. I can guarantee that running any complex parallel |
I thought this article on adaptive filtering was interesting. It explores an application for predicting stock price: |
Interesting post. I don't know anything about this method. It could be an interesting, albeit complex, technical feature for an ML model. Definitely couldn't work as a standalone, though. |
The text was updated successfully, but these errors were encountered: