Item model utils #269

drio18 · 2023-07-17T18:35:08Z

Here, we add utils for creating "item models", classes that enable navigating portal items directly as well as centralize common operations. Related snovault PR that adds navigation via requests here.

Additionally, we add a new testing_utils module to share code related to writing tests.

coveralls · 2023-07-17T18:37:33Z

Pull Request Test Coverage Report for Build 5754997444

135 of 140 (96.43%) changed or added relevant lines in 2 files are covered.
No unchanged relevant lines lost coverage.
Overall coverage increased (+0.3%) to 76.251%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
dcicutils/item_model_utils.py	120	122	98.36%
dcicutils/testing_utils.py	15	18	83.33%

Totals
Change from base Build 5590491337:	0.3%
Covered Lines:	7590
Relevant Lines:	9954

💛 - Coveralls

…into drr_item_classes

netsettler

A lot of these comments are stylistic. I apologize in advance for that, butI found this code a bit hard to read because of the chosen coding style, which doesn't match our normal style in other modules. I've tried to cite rationales for why we use the style we use. I fought with PEP8 for a long time when I came to Python but mostly feel like trying to normalize style makes it easier to both read and especially to grep.

There are a few non-syntactic comments in here, and I may have more later. I'd like to talk to you interactively about some of this stuff before taking another look.

netsettler · 2023-08-04T03:35:44Z

dcicutils/item_model_utils.py

I might suggest either model_utils or item_utils. The name doesn't have to be complex, just enough to distinguish it from other things we're likely to have. Are we likely to have any other kinds of models that we want utils for? Or any other kinds of items? I'm OK leaving it if you want to, I'll leave that to you since you'll use it. But it just seems clumsy and like unnecessarily many letters.

netsettler · 2023-08-04T03:41:59Z

dcicutils/item_model_utils.py

+def get_link_to(
+    existing_item: PortalItem,
+    link_to: LinkTo,
+    item_to_create: PortalItem,
+) -> Union[str, PortalItem]:


As a matter of personal preference, I'd like to see this indented like this:

Suggested change

def get_link_to(

existing_item: PortalItem,

link_to: LinkTo,

item_to_create: PortalItem,

) -> Union[str, PortalItem]:

def get_link_to(existing_item: PortalItem,

link_to: LinkTo,

item_to_create: PortalItem,

) -> Union[str, PortalItem]:

The reason I like this style better is:

It leaves "def" and the function name easier to pick out of the crowd.

It means that the only thing in column 0 after a def is the next definition.

netsettler · 2023-08-04T03:55:07Z

dcicutils/item_model_utils.py

+    """Create new item model from existing one for given linkTo.
+
+    LinkTos be identifiers (e.g. UUIDs) or (partially) embedded objects.
+
+    Follow rules of existing item model for fetching linkTo via
+    request. If not fetching via request, then make item model from
+    existing properties if possible.
+    """


You are referring to parameters by their type rather than their name. I get why that is. The name is somewhat encouraged to be a python name and fights the notation used in the JSON. But in the worst case this style can be confusing if there's more than one thing with the same type, and falling back on different notation in that case can be irregular. I tend to think of documentation as being program-like and it's useful to refer to things that are parameters by the names of the parameter so the reader of the doc doesn't have to guess. e.g.,

Suggested change

"""Create new item model from existing one for given linkTo.

LinkTos be identifiers (e.g. UUIDs) or (partially) embedded objects.

Follow rules of existing item model for fetching linkTo via

request. If not fetching via request, then make item model from

existing properties if possible.

"""

"""Create new item model from existing_item for given link_to.

A link_to should be an identifier (e.g. a UUID) or a (partially) embedded object.

Follow rules of existing item model for fetching linkTo via

request. If not fetching via request, then make item model from

existing properties if possible.

"""

Note also that you referred in your text to LinkTos in plural, which is awkward to start because you are pluralizing a type name and there is no such plural name, but also the argument doesn't seem to allow multiple link_to ids, just one.

I was unable to interpret what the sentence starting "Follow rules of..." even means, so I left it alone. I also don't quite get how an existing item (which is presumably an object?) can also be an item to create (which presumably can't be an object (because it doesn't exist yet). Are these item types?

I am surprised PortalItem works as a forward reference. Did you try that in Python 3.7? I thought that did not work if you're using it in a type description but maybe it does if it's a class. I don't think it works for a free-standing assignment such as you did LinkTo if it follows. Usually you have to use 'PortalItem' instead for forward references.

netsettler · 2023-08-04T03:57:56Z

pyproject.toml

@@ -1,6 +1,6 @@
 [tool.poetry]
 name = "dcicutils"
-version = "7.6.0"
+version = "7.7.0.1b1"


Generally the convention we are using is to make the beta be epsilon more than the version it is built from, not the build it is targeting. For example, if a 7.7.0 is issued while this is being developed, and you have not merged that version, this version will sort higher and so things that seek "^7.7.0" will think you have support for everything below this number and I don't think you do. Unless maybe you've merged 7.7.0 into here, in which case ignore me. :) I often add a comment saying something about what the highest merged version is. The entire tech for all this is heuristic, but I hope that makes sense.

netsettler · 2023-08-04T04:12:11Z

dcicutils/item_model_utils.py

+
+@dataclass(frozen=True)
+class NestedProperty:
+    properties: Dict[str, Any]


I'd like a blank line before this, since there are blank lines between the definitions. It just looks better, even if PEP8 might not require it.

netsettler · 2023-08-04T12:52:12Z

dcicutils/testing_utils.py

+from unittest import mock
+
+
+AN_UNLIKELY_RETURN_VALUE = "unlikely return value"


The ordinary convention for Python is to just do

AN_UNLIKELY_RETURN_VALUE = object()

because as soon as you name a string, it becomes interned and possible for someone else to find by name in a pointer-equal way. It's annoying at times. But this is why I added dcicutils.NamedObject, so that you can do something a little classier that is actually reliably unique (you could name it something less probabilistic, too):

_UNSUPPLIED = NamedObject('unsupplied')

where the token NamedObject has unique identity and prints in a more stringy way.

>>> x = NamedObject('unsupplied') >>> str(x) '<unsupplied>' >>> repr(x) '<unsupplied@101036400>' >>> x <unsupplied@101036400>

(Note that I recommend using _ in the name of this because in my long-time considered opinion it's a bad idea to export such a name so that different modules or classes use the same variable for missing/undefined/etc or else one might pass its missing argument explicitly to something else who might think no argument had been passed. The semantics gets quickly messy because it's not clear that an explicitly passed missing argument is intended to mean missing (though that's subject to intent in each such situation). Better each use of unsupplied should be a separate object in my view unless you've thought it through carefully and really mean to be creating a whole ecosystem that conspires on the notion of what a filled or missing argument is, and usually that's a distraction from work you're trying to do. So making the var non-exported/shared seems better to me.)

netsettler · 2023-08-04T12:59:52Z

dcicutils/testing_utils.py

+    else:
+        target = f"{module.__name__}.{to_patch.__qualname__}"
+    with mock.patch(target, new_callable=new_callable, **kwargs) as mocked_item:
+        if return_value != AN_UNLIKELY_RETURN_VALUE:


If you use NamedObject, you can rely on object identity and this will read nicer:

if return_value is not UNSUPPLIED: mocked_item.return_value = return_value

netsettler · 2023-08-04T13:32:39Z

dcicutils/item_model_utils.py

+    def _get_link_tos(
+        self, link_tos: Iterable[LinkTo], item_to_create: PortalItem
+    ) -> List[Union[str, PortalItem]]:


I didn't mark all of the occasions of this issue in this PR, like I did in its companion, but there's no reason for this to extend over three lines. The function header would fit on one line. If you did have to break it, it shouldn't go to column zero (as I explained in more detail elsewhere).

netsettler · 2023-08-04T14:04:35Z

test/test_item_model_utils.py

+        assert result.should_fetch_links() == portal_item.should_fetch_links()
+
+
+def get_nested_property(


I assume this is just making a sample property. I find this kind of thing hard to read because in test files, it's often hard to tell what the newly-defined functions to be tested are and what are the testing artifacts. I like to name things like this something like make_sample_tested_property so that in context of use, it's clear it's a testing artifact. Also, and it's a subtle point, but I like verbs like get, find, lookup, and make to have consistent meanings, and while we don't have a global convention for some of the subtleties I use for myself, I do think get usually connotes finding something that exists already, and here you're making something anew, which is why I'd use make as the part of the function name that's the verb, though maybe you're thinking of this as a representation of something that exists in the data? If so, I would leave it as get but I would clarify that in a doc string, which should be present in any case since this function does something non-trivial.

Note that I'm not perfect at writing doc strings myself, but Will usually beats me up about it, presumably because it's hard to look at someone else's code and figure out what they're doing and he's usually on the outside of what I'm doing trying to claw his way in, as I'm doing here. :)

On the subject of docstrings, I really think some of these tests could use them and even some inline comments. They are quite complex relatively speaking and merit such documentation.

netsettler · 2023-08-04T14:08:50Z

test/test_item_model_utils.py

+            )
+            assert len(mock_get_metadata.call_args_list) == 1
+
+    @pytest.mark.parametrize(


Not necessarily something to change, but I have to say the body of this is so small and so conditional-ridden that I found myself wondering if individual tests for each of these cases would not be clearer. It's probably fine. But just something to ponder. There is a sense in which tests not only test things but offer little snippets where people can learn things from use cases, and sometimes the tests that are heavily conditionalized like this rob someone of the ability to see what the use case would be. I'm guilty of a lot of this stuff myself, so perhaps not one to talk. It's an ongoing struggle to balance sometimes-conflicting needs.

willronchetti

Kent left some good feedback that should be considered, not sure I have much more but I do really think it's worth documenting some of what's going on here especially in the more complex unit tests

willronchetti · 2023-08-04T16:27:07Z

test/test_item_model_utils.py

+        assert result.should_fetch_links() == portal_item.should_fetch_links()
+
+
+def get_nested_property(


On the subject of docstrings, I really think some of these tests could use them and even some inline comments. They are quite complex relatively speaking and merit such documentation.

drio18 added 3 commits June 29, 2023 15:00

Make first cut at item model

707e873

Merge branch 'master' into drr_item_classes

f6492d3

Tidy up + increase test coverage

2afdb25

drio18 mentioned this pull request Jul 17, 2023

Item models 4dn-dcic/snovault#260

Open

willronchetti and others added 7 commits July 20, 2023 14:58

give a beta

c6003ec

Add module kwarg when patching

8c8a62e

Merge branch 'drr_item_classes' of https://github.com/4dn-dcic/utils …

fc39fa4

…into drr_item_classes

Bump beta version

000ec5f

Clean up for PR

cbb954b

Merge branch 'master' into drr_item_classes

882c071

Bump beta version

ed82c78

drio18 requested review from netsettler and willronchetti August 3, 2023 19:54

netsettler reviewed Aug 4, 2023

View reviewed changes

willronchetti reviewed Aug 4, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Item model utils #269

Item model utils #269

drio18 commented Jul 17, 2023 •

edited

Loading

coveralls commented Jul 17, 2023 •

edited

Loading

netsettler left a comment

netsettler Aug 4, 2023

netsettler Aug 4, 2023

netsettler Aug 4, 2023

netsettler Aug 4, 2023

netsettler Aug 4, 2023

netsettler Aug 4, 2023

netsettler Aug 4, 2023

netsettler Aug 4, 2023

netsettler Aug 4, 2023

willronchetti Aug 4, 2023

netsettler Aug 4, 2023

willronchetti left a comment

willronchetti Aug 4, 2023

		from unittest import mock


		AN_UNLIKELY_RETURN_VALUE = "unlikely return value"

		assert result.should_fetch_links() == portal_item.should_fetch_links()


		def get_nested_property(

Item model utils #269

Are you sure you want to change the base?

Item model utils #269

Conversation

drio18 commented Jul 17, 2023 • edited Loading

coveralls commented Jul 17, 2023 • edited Loading

Pull Request Test Coverage Report for Build 5754997444

💛 - Coveralls

netsettler left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

willronchetti left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

drio18 commented Jul 17, 2023 •

edited

Loading

coveralls commented Jul 17, 2023 •

edited

Loading