feat(skorch): add an inherited class from skorch.NeuralNet that is compatible with PyTorch Frame #375

34j · 2024-03-11T13:06:40Z

Closes #147

@MacOS Please continue from here if it helps.
Sorry for being so loud, but this took me a whole day, so I would appreciate it very much if you could make me as a co-author if you used this code.

…et that is compatible with PyTorch Frame

for more information, see https://pre-commit.ci

codecov · 2024-03-11T13:09:46Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 93.52%. Comparing base (ee98b87) to head (aa5484d).
Report is 6 commits behind head on master.

❗ Current head aa5484d differs from pull request most recent head cb76e8d. Consider uploading reports for the commit cb76e8d to get more accurate results

Additional details and impacted files

@@           Coverage Diff           @@
##           master     #375   +/-   ##
=======================================
  Coverage   93.52%   93.52%           
=======================================
  Files         124      124           
  Lines        6456     6456           
=======================================
  Hits         6038     6038           
  Misses        418      418

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

MacOS · 2024-03-11T13:57:18Z

Sure - on both!

34j · 2024-03-16T05:21:46Z

@weihua916 Would you mind reviewing if you think this is a good way to implement it?

Also, it is strange that mypy in pre-commit does not raise errors, but mypy in CI does. I don't think there is any way to deal with this.

34j · 2024-04-03T14:54:12Z

@weihua916 @zechengz @yiweny Would appreciate your review, thank you very much in advance.

weihua916

Thank you!

Do you mind adding unit test similar to https://github.com/pyg-team/pytorch-frame/blob/master/test/gbdt/test_gbdt.py?

examples/revisiting.py

weihua916 · 2024-05-01T22:39:56Z

A kind check-in. Is there any progress here?

34j · 2024-05-02T06:28:33Z

No progress, sorry

for more information, see https://pre-commit.ci

zechengz

Left some comments. Can @weihua916 or @yiweny helps to take a look? Thanks for your contribution!

examples/revisiting.py

examples/tutorial.py

zechengz · 2024-07-05T19:33:01Z

torch_frame/utils/skorch.py

+def _patch_skorch_support_tenforframe() -> None:
+    old_to_tensor = skorch.utils.to_tensor
+
+    def to_tensor(X, device, accept_sparse=False):


Add typing information?

As well, this is not recommended as skorch does not support typing.

torch_frame/utils/skorch.py

zechengz · 2024-07-05T19:35:00Z

torch_frame/utils/skorch.py

+    def __init__(
+        self,
+        # NeuralNet parameters
+        module,


Also add the typing information here?

Since skorch does not support typing, doing this is tedious and not recommended. (On the other hand, it would be inconvenient for users if it was **args, **kwargs, so I compromised and made it this way.)

…ough it can be used

README.md

examples/revisiting.py

examples/tutorial.py

test/utils/test_skorch.py

34j · 2024-07-11T10:04:23Z

@weihua916 Removed all tutorials and added examples/sklearn_api.py instead.
The failing tests are probably due to a pandas version update and are not related to this PR.

weihua916

Left a quick question.

weihua916 · 2024-07-12T03:51:50Z

test/utils/test_skorch.py

+        # [stype.text_embedded],
+        # [stype.numerical, stype.numerical, stype.text_embedded],


we don't support these stypes?

Currently not supported at this time due to lack of time to understand how to use these dtypes.
However, since it probably only require changes in the arguments of the NeuralNet, it should have little trouble extending it in the future.

weihua916 · 2024-07-12T03:54:36Z

test/utils/test_skorch.py

+    if pass_dataset:
+        net.fit(dataset)
+        _ = net.predict(test_dataset)
+    else:
+        net.fit(X_train, y_train)
+        _ = net.predict(X_test)


why don't we take tensor frame? It's also weird to sometimes take dataset and sometimes take data frame.

The main purpose of this PR is to allow DataFrame to be DIRECTLY fitted, as shown in examples/sklearn_api.py.

Since it is unclear how to create a Dataset from a TensorFrame, and if there is a TensorFrame, there should be also a Dataset, which means there is little need to implement this, and even to use skorch as the user might be familiar with deep learning.

Instead of "sometimes take dataset and sometimes take data frame", both are tested.

hmm if dataframe is directly fed, it is unclear why we need this feature within pytorch frame.
the whole point of pytorch frame is to materialize data frame into tensor frame, to be processed by pytorch.

It may not match your purpose but my goal is to use advanced neural networks implemented in pytorch_frame in existing sklearn pipeline.
This PR allows pytorch_frame to be used on top of existing scikit-learn code without having to heavily modify the existing code. Since many people use sklearn Pipeline, especially on Kaggle, it is easy to verify performance changes by changing or assembling the estimator in other people's code to my NeuralNetPytorchFrame. I am convinced that this will be very valuable.

34j · 2024-09-16T05:25:58Z

@weihua916 Would you please reconsider merging this PR?

This change makes it easy to try out torch-frame based neural networks on code that already uses scikit-learn. (which was demonstrated in #375 (comment)).

(This PR is not intended to save training of Pytorch models that do not use scikit-learn. lightning or fastai should be used for such applications.)

Thank you in advance.

34j · 2024-09-30T03:13:50Z

@weihua916 @yiweny @akihironitta Any chance that this PR could be merged?

34j added 2 commits March 11, 2024 22:01

feat(skorch): add prototype of an inherited class from skorch.NeuralN…

71fbea0

…et that is compatible with PyTorch Frame

docs: add tutorial for the last commit

b8e8ae4

34j marked this pull request as draft March 11, 2024 13:06

[pre-commit.ci] auto fixes from pre-commit.com hooks

df8ecc4

for more information, see https://pre-commit.ci

fix: patch skorch.utils.to_tensor()

ca95b8f

style: format code

0b9426f

34j force-pushed the feat/skorch-compatible branch from 1706c96 to 0b9426f Compare March 16, 2024 03:53

34j added 6 commits March 16, 2024 13:54

feat: fix multiple issues, support sklearn-like datasets and predict()

198b749

chore(example): test with regression as well

d264488

Merge branch 'master' into feat/skorch-compatible

691f204

docs: add changelog

9cc4fe1

fix(skorch): import annotations from __future__

98aea5c

revert: revert wrong changes

0f650d8

34j marked this pull request as ready for review March 16, 2024 05:15

34j added 3 commits March 16, 2024 14:49

style(skorch): fix typing

95688e3

fix(skorch): use classes if specified

7594b44

Merge branch 'master' into feat/skorch-compatible

aa5484d

weihua916 reviewed Apr 4, 2024

View reviewed changes

examples/revisiting.py Outdated Show resolved Hide resolved

examples/revisiting.py Outdated Show resolved Hide resolved

34j and others added 6 commits May 2, 2024 16:21

Merge branch 'master' into feat/skorch-compatible

cb76e8d

Merge branch 'master' into feat/skorch-compatible

4a7598d

chore: remove comments

bacf31f

chore(skorch): add more comments

1c50a59

[pre-commit.ci] auto fixes from pre-commit.com hooks

3a90392

for more information, see https://pre-commit.ci

test: add prototype test

474caff

fix: copy dataframe before adding columns

eca1905

zechengz reviewed Jul 5, 2024

View reviewed changes

34j added 10 commits July 6, 2024 15:58

docs: add docs to _patch_skorch_support_tenforframe

33009b6

fix(skorch): wrap with functools.wraps

e624953

fix: move imports

3276903

chore: do not use NeuralNetClassifierPytorchFrame for regression alth…

18a50ee

…ough it can be used

fix(skorch): add typing only for module

d53061d

fix: support specifying module as class

a09beb2

docs: add docs

a967b0d

fix: fix dtype

bc07d7b

Merge branch 'master' into feat/skorch-compatible

aa23ca0

Merge branch 'master' into feat/skorch-compatible

e6d5dfe

34j mentioned this pull request Jul 11, 2024

sklearn-compatible interface #147

Open

weihua916 reviewed Jul 11, 2024

View reviewed changes

README.md Outdated Show resolved Hide resolved

examples/revisiting.py Outdated Show resolved Hide resolved

examples/tutorial.py Outdated Show resolved Hide resolved

test/utils/test_skorch.py Outdated Show resolved Hide resolved

34j added 6 commits July 11, 2024 18:51

test: remove comment

2fe4f69

Discard changes to examples/revisiting.py

14f0e7b

Discard changes to examples/tutorial.py

06ec88e

Discard changes to README.md

8d4d32d

fix: use args instead of kwargs to match typing

947daf1

feat: add example for sklearn api

3769f2d

34j requested review from weihua916 and zechengz July 11, 2024 10:05

weihua916 reviewed Jul 12, 2024

View reviewed changes

Merge branch 'master' into feat/skorch-compatible

05785eb

34j requested a review from weihua916 July 30, 2024 08:15

Merge branch 'master' into feat/skorch-compatible

8e46fb5

Merge branch 'master' into feat/skorch-compatible

eddecf8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(skorch): add an inherited class from skorch.NeuralNet that is compatible with PyTorch Frame #375

feat(skorch): add an inherited class from skorch.NeuralNet that is compatible with PyTorch Frame #375

34j commented Mar 11, 2024 •

edited

Loading

codecov bot commented Mar 11, 2024 •

edited

Loading

MacOS commented Mar 11, 2024

34j commented Mar 16, 2024 •

edited

Loading

34j commented Apr 3, 2024 •

edited

Loading

weihua916 left a comment

weihua916 commented May 1, 2024

34j commented May 2, 2024

zechengz left a comment

zechengz Jul 5, 2024

34j Jul 6, 2024

zechengz Jul 5, 2024

34j Jul 6, 2024 •

edited

Loading

34j commented Jul 11, 2024 •

edited

Loading

weihua916 left a comment

weihua916 Jul 12, 2024

34j Jul 12, 2024

weihua916 Jul 12, 2024

34j Jul 12, 2024 •

edited

Loading

weihua916 Jul 12, 2024

34j Jul 12, 2024 •

edited

Loading

34j commented Sep 16, 2024 •

edited

Loading

34j commented Sep 30, 2024 •

edited

Loading

		# [stype.text_embedded],
		# [stype.numerical, stype.numerical, stype.text_embedded],

feat(skorch): add an inherited class from skorch.NeuralNet that is compatible with PyTorch Frame #375

Are you sure you want to change the base?

feat(skorch): add an inherited class from skorch.NeuralNet that is compatible with PyTorch Frame #375

Conversation

34j commented Mar 11, 2024 • edited Loading

codecov bot commented Mar 11, 2024 • edited Loading

Codecov Report

MacOS commented Mar 11, 2024

34j commented Mar 16, 2024 • edited Loading

34j commented Apr 3, 2024 • edited Loading

weihua916 left a comment

Choose a reason for hiding this comment

weihua916 commented May 1, 2024

34j commented May 2, 2024

zechengz left a comment

Choose a reason for hiding this comment

zechengz Jul 5, 2024

Choose a reason for hiding this comment

34j Jul 6, 2024

Choose a reason for hiding this comment

zechengz Jul 5, 2024

Choose a reason for hiding this comment

34j Jul 6, 2024 • edited Loading

Choose a reason for hiding this comment

34j commented Jul 11, 2024 • edited Loading

weihua916 left a comment

Choose a reason for hiding this comment

weihua916 Jul 12, 2024

Choose a reason for hiding this comment

34j Jul 12, 2024

Choose a reason for hiding this comment

weihua916 Jul 12, 2024

Choose a reason for hiding this comment

34j Jul 12, 2024 • edited Loading

Choose a reason for hiding this comment

weihua916 Jul 12, 2024

Choose a reason for hiding this comment

34j Jul 12, 2024 • edited Loading

Choose a reason for hiding this comment

34j commented Sep 16, 2024 • edited Loading

34j commented Sep 30, 2024 • edited Loading

34j commented Mar 11, 2024 •

edited

Loading

codecov bot commented Mar 11, 2024 •

edited

Loading

34j commented Mar 16, 2024 •

edited

Loading

34j commented Apr 3, 2024 •

edited

Loading

34j Jul 6, 2024 •

edited

Loading

34j commented Jul 11, 2024 •

edited

Loading

34j Jul 12, 2024 •

edited

Loading

34j Jul 12, 2024 •

edited

Loading

34j commented Sep 16, 2024 •

edited

Loading

34j commented Sep 30, 2024 •

edited

Loading