Topic proposals for upcoming lessons #23

chrisconlan · 2021-02-08T22:59:26Z

Alex: FPGAs for quant trading (and how it relates to Gamestop)

emican · 2021-02-08T23:25:39Z

Eric: How you utilize your new high performance Dell workstation.

joe-wojniak · 2021-02-09T15:06:51Z

Are Markhov-chain process models useful for predicting time-series data?-> Wiener Process-> Black-Scholes-Merton model https://en.wikipedia.org/wiki/Black%E2%80%93Scholes_model

chrisconlan · 2021-02-09T15:13:43Z

Are Markhov-chain process models useful for predicting time-series data?

@joe-wojniak I encourage you to research Markov chains a little bit on your own and try to answer that question. Are there any scenarios in finance where you want to model discrete transitions between certain states? And the probabilities of those transitions? Can you imagine any scenarios where such a model would provide an improvement over other types of models?

alexpryszlakh · 2021-02-11T04:46:30Z

From the book "Common Stocks and Uncommon Profits"

Question 7: Does the company have outstanding labor and personnel relations?
Question 8: Does the company have outstanding executive relations?
Question 9: Does the company have depth to its management?

Can you just web scrape for people's reviews on Glassdoor and discussion forums?

chrisconlan · 2021-02-11T14:16:23Z

@alexpryszlakh Great book and great question.

joe-wojniak · 2021-02-11T14:57:06Z

Python is very good at web scraping- there are a couple of popular web scraping libraries. BeautifulSoup and Scrapy being a couple. I think it's a great idea, but great ideas need to be tested. We could try it and see if we find a correlation.

…

-Joe W.

On Thu, Feb 11, 2021 at 7:16 AM Chris Conlan ***@***.***> wrote: @alexpryszlakh <https://github.com/alexpryszlakh> Great book and great question. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#23 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AEAV4CHRMSSIMO2LRSNNQE3S6PRERANCNFSM4XJ4NBLA> .

-- -Joe Wojniak *CONFIDENTIALITY* NOTICE: The contents of this *email* message and any attachments are intended solely for the addressee(s) and may contain *confidential *and/or privileged *information* and may be legally protected from disclosure.

emican · 2021-02-11T22:04:39Z

https://www.crunchbase.com/ can be a source of data.

joe-wojniak · 2021-02-24T16:40:08Z

Do we want to investigate whether py-polars is faster than pandas?

Blog on the topic: https://medium.com/analytics-vidhya/is-pypolars-the-new-alternative-to-pandas-916400f03fd7

chrisconlan · 2021-02-24T16:51:08Z

@joe-wojniak What do you think about py-polars? We can talk about lazy evaluation and query optimization, but it is very much a computer science and a database design topic.

joe-wojniak · 2021-02-24T17:07:42Z

I was just going by the blog- it sounds like py-polars may work with larger datasets and it seems there are claims of better performance. I'm happy to work with it on a trial basis, but I wasn't sure about adding it to the qttk environment while it's not proven.

…

On Wed, Feb 24, 2021 at 9:51 AM Chris Conlan ***@***.***> wrote: @joe-wojniak <https://github.com/joe-wojniak> What do you think about py-polars? We can talk about lazy evaluation and query optimization, but it is very much a computer science and a database design topic. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#23 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AEAV4CGKIDHITPZCESFZRBLTAUVAXANCNFSM4XJ4NBLA> .

-- -Joe Wojniak *CONFIDENTIALITY* NOTICE: The contents of this *email* message and any attachments are intended solely for the addressee(s) and may contain *confidential *and/or privileged *information* and may be legally protected from disclosure.

joe-wojniak · 2021-02-24T17:18:30Z

I could start a test branch of the github, that way the main branch isn't affected.

…

On Wed, Feb 24, 2021 at 10:07 AM Joe Wojniak ***@***.***> wrote: I was just going by the blog- it sounds like py-polars may work with larger datasets and it seems there are claims of better performance. I'm happy to work with it on a trial basis, but I wasn't sure about adding it to the qttk environment while it's not proven. On Wed, Feb 24, 2021 at 9:51 AM Chris Conlan ***@***.***> wrote: > @joe-wojniak <https://github.com/joe-wojniak> What do you think about > py-polars? We can talk about lazy evaluation and query optimization, but it > is very much a computer science and a database design topic. > > — > You are receiving this because you were mentioned. > Reply to this email directly, view it on GitHub > <#23 (comment)>, > or unsubscribe > <https://github.com/notifications/unsubscribe-auth/AEAV4CGKIDHITPZCESFZRBLTAUVAXANCNFSM4XJ4NBLA> > . > -- -Joe Wojniak *CONFIDENTIALITY* NOTICE: The contents of this *email* message and any attachments are intended solely for the addressee(s) and may contain *confidential *and/or privileged *information* and may be legally protected from disclosure.

-- -Joe Wojniak *CONFIDENTIALITY* NOTICE: The contents of this *email* message and any attachments are intended solely for the addressee(s) and may contain *confidential *and/or privileged *information* and may be legally protected from disclosure.

chrisconlan · 2021-02-24T17:38:46Z

@joe-wojniak Let's talk about lazy evaluation and query optimization at our next lesson. I want you to understand why there aren't necessarily any intrinsic speed gains buried within it, and why this library might be slower overall.

joe-wojniak · 2021-02-24T18:17:39Z

ok, sounds good.

…

On Wed, Feb 24, 2021 at 10:39 AM Chris Conlan ***@***.***> wrote: @joe-wojniak <https://github.com/joe-wojniak> Let's talk about lazy evaluation and query optimization at our next lesson. I want you to understand why there aren't necessarily any intrinsic speed gains buried within it, and why this library might be slower overall. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#23 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AEAV4CFSTV2YDGP47YXHFUDTAU2TJANCNFSM4XJ4NBLA> .

-- -Joe Wojniak *CONFIDENTIALITY* NOTICE: The contents of this *email* message and any attachments are intended solely for the addressee(s) and may contain *confidential *and/or privileged *information* and may be legally protected from disclosure.

chrisconlan · 2021-02-25T12:07:51Z

Update @joe-wojniak

Pypolars philosophy relies on this design pattern:

lazy_df = lazy_df.filter(col("Rain") > (lit(120))) # Nothing happens
lazy_df = lazy_df.filter(col("Temp") > (lit(78))) # Nothing happens
lazy_df = lazy_df.filter(col("Earthquakes") > (lit(2))) # Nothing happens

# All of the above filter operations happen at once here
lazy_df.collect()

Whereas pandas would require this.

# Method 1 (slow way)
df = df[df.rain > 120] # Filter shrinks df
df = df[df.temp > 78] # Filter shrinks df
df = df[df.earthquakes > 2] # Filter shrinks df

# Method 2 (fast way)
df = df[df.rain > 120 & df.temp > 78 & df.earthquakes]

Theoretically, the Pypolars method above would be just as fast as Pandas Method number 2, because of lazy evaluation. Does this provide a speedup? No, not necessarily, and likely not at all. It just changes the way you write code, and it changes the way you optimize code. Is it worth it at this point to explore Pypolars? I don't think so. I would need to see the author of Pypolars show some provable speedups that go above and beyond lazy evaluation to even consider.

Further, Pypolars seems to advertise built-in parallelization. I don't like this at all. Serious engineers need explicit control of parallelization. Python in-memory parallelization sucks in general, because it requires pickling and unpickling of code, which isn't fully supported throughout the language. I can guarantee that running any complex parallel .apply function in Pypolars would cause unsolvable errors. I prefer to use Unix forking to parallelize my work, because it is the only thing that is sufficiently stable in Python.

joe-wojniak · 2021-03-22T05:05:06Z

I thought this article on adaptive filtering was interesting. It explores an application for predicting stock price:
https://towardsai.net/p/machine-learning/time-series-prediction-using-adaptive-filtering

chrisconlan · 2021-03-22T19:23:39Z

Interesting post. I don't know anything about this method.

It could be an interesting, albeit complex, technical feature for an ML model. Definitely couldn't work as a standalone, though.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Topic proposals for upcoming lessons #23

Topic proposals for upcoming lessons #23

chrisconlan commented Feb 8, 2021

emican commented Feb 8, 2021

joe-wojniak commented Feb 9, 2021 •

edited

Loading

chrisconlan commented Feb 9, 2021 •

edited

Loading

alexpryszlakh commented Feb 11, 2021

chrisconlan commented Feb 11, 2021

joe-wojniak commented Feb 11, 2021 via email

emican commented Feb 11, 2021

joe-wojniak commented Feb 24, 2021

chrisconlan commented Feb 24, 2021

joe-wojniak commented Feb 24, 2021 via email

joe-wojniak commented Feb 24, 2021 via email

chrisconlan commented Feb 24, 2021

joe-wojniak commented Feb 24, 2021 via email

chrisconlan commented Feb 25, 2021 •

edited

Loading

joe-wojniak commented Mar 22, 2021

chrisconlan commented Mar 22, 2021

Topic proposals for upcoming lessons #23

Topic proposals for upcoming lessons #23

Comments

chrisconlan commented Feb 8, 2021

emican commented Feb 8, 2021

joe-wojniak commented Feb 9, 2021 • edited Loading

chrisconlan commented Feb 9, 2021 • edited Loading

alexpryszlakh commented Feb 11, 2021

chrisconlan commented Feb 11, 2021

joe-wojniak commented Feb 11, 2021 via email

emican commented Feb 11, 2021

joe-wojniak commented Feb 24, 2021

chrisconlan commented Feb 24, 2021

joe-wojniak commented Feb 24, 2021 via email

joe-wojniak commented Feb 24, 2021 via email

chrisconlan commented Feb 24, 2021

joe-wojniak commented Feb 24, 2021 via email

chrisconlan commented Feb 25, 2021 • edited Loading

joe-wojniak commented Mar 22, 2021

chrisconlan commented Mar 22, 2021

joe-wojniak commented Feb 9, 2021 •

edited

Loading

chrisconlan commented Feb 9, 2021 •

edited

Loading

chrisconlan commented Feb 25, 2021 •

edited

Loading