You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
raiseValueError("The lengths of dataset and smiles_list are "
"different")
rng=numpy.random.RandomState(seed)
This bug will cause data split inconsistent across different models and different run, even if we explicitly specify the same seed, and the default seed 777 here is useless.
PS: I use Pycharm debug tool to validate above procedure.
The text was updated successfully, but these errors were encountered:
In this function, although there is a
seed=777
argument in the signature like this.chainer-chemistry/chainer_chemistry/datasets/molnet/molnet.py
Lines 24 to 28 in 56e83de
But it's never passed to any splitter, in the same function, the splitter is called here without
seed
argument:chainer-chemistry/chainer_chemistry/datasets/molnet/molnet.py
Lines 104 to 130 in 56e83de
Then, in the splitter (here the ScaffoldSplitter), the
seed
argument is stillNone
:chainer-chemistry/chainer_chemistry/dataset/splitters/scaffold_splitter.py
Lines 62 to 65 in 56e83de
According to the implementation, the
seed=None
means it be initialized by reading data from/dev/urandom
according to the numpy docs.chainer-chemistry/chainer_chemistry/dataset/splitters/scaffold_splitter.py
Lines 23 to 35 in 56e83de
This bug will cause data split inconsistent across different models and different run, even if we explicitly specify the same seed, and the default seed 777 here is useless.
PS: I use Pycharm debug tool to validate above procedure.
The text was updated successfully, but these errors were encountered: