Missing entry in TOC file for first video stimulus in playlist mode. #16

davidt0x · 2022-02-08T19:45:14Z

This bug occurs when using video stimulus playlist entries without any timing information specified. The TOC file output after the experiment is usually missing the first video stimulus played during the experiment.

davidt0x · 2022-02-08T20:16:36Z

Ok, I think I have figured out why this TOC issue is happening with video playlists with no timing info. It happens because when the main video server thread runs it creates the VideoStimPlaylist object:

fly-vr/flyvr/video/video_server.py

Lines 1188 to 1190 in 5dac72a

    
           playlist_stim = VideoStimPlaylist(*stims, random=random, 
        
                                             paused=paused if paused is not None else getattr(options, 'paused', False), 
        
                                             play_item=getattr(options, 'play_item', None))

whose constructor calls self.play_item immediately if not paused:

fly-vr/flyvr/video/video_server.py

Lines 91 to 92 in 5dac72a

    
           elif not self._paused: 
        
               self.play_item(next(self._playlist_iter))

This play_item method in turn updates the state of the playlist (advances the internal index) and then if and only if the fly_vr shared state variables are set it signals to append to the TOC file.:

fly-vr/flyvr/video/video_server.py

Lines 154 to 159 in 5dac72a

    
           if self._flyvr_shared_state is not None: 
        
               self._flyvr_shared_state.signal_new_playlist_item(identifier, BACKEND_VIDEO, 
        
                                                                 producer_instance_n=producer_instance_n, 
        
                                                                 producer_playlist_n=self._child_playlist_n, 
        
                                                                 # and a time for replay experiments 
        
                                                                 time_ns=self._flyvr_shared_state.TIME_NS)

The problem is that the constructor of the VideoStimPlaylist sets this flyvr_shared_state variable to None by default. This gets fixed later when the next item in the playlist is invoked by the main server loop after picking up the playlist on the message queue:

fly-vr/flyvr/video/video_server.py

Line 1009 in 5dac72a

stim_or_cmd.initialize(self.mywin, self._fps, self.flyvr_shared_state)

It seems to me that the first thing todo would be remove the reference of the flyvr shared state from the VideoPlaylist entirely and put this signaling responsibility onto the server main loop. This seems like a separation of concerns issue. The code for managing the playlist sequencing shouldn't be in charge of signaling anything to the state of the rest of the program. This is how it is in the audio playlist case I believe. There could be something I am missing for why this is tricky in the video case though. Alternatively, we could just pass the flyvr_share_state through to the VideoPlaylistStim constructor. Though, I can't see any real reason this playlist code needs to know about the state of the whole system.

I will try to implement this fix locally and test on the setup.

mwangq · 2022-02-10T16:37:28Z

hey, I ran another quick test this morning using this config file: config_avd_producer_test.txt

I wanted to test two things:

whether the toc file looks the same as the time/do instructions from the config file (close but not quite... there seem to be some extra entries in daq, some lost entries in audio) 20220210_1122.toc.txt
whether the producer_playlist_n is being incremented for daq stimuli (it is not. I wanted to test this because yday I noticed the producer_playlist_n is not incremented for audio stimuli either) (also reflected in the sync info in daq.h5 and sound_server.h5)

Originally I wanted to use the toc for two purposes: to get the numerical ID (e.g. 1) for a given playlist item (e.g. v_gratingL) (as described in the flyVR sync doc ) and then also to use the toc to help me figure out which items were played when (because this is not hardcoded if I am randomly selecting and playing items, using an experiment.py).

I can almost figure out which items were played when, from a combo of producer_playlist_n and producer_instance_n in the sync data (assuming each instance_n made its way onto the toc file so I could figure out that producer_instance_n of 1 = v_gratingL). And producer_playlist_n can be roughly back-calculated by looking at the blocks of producer_instance_n. That is true if the stimulus is switching every time. It is not true, for instance, in this config file, where I have some stimuli (e.g. audio) play the same thing while I change another backend (e.g. daq).

Not sure if you prefer to open a separate issue for producer_playlist_n increment. I include the comment here because it is related to users being able to pull out when stimuli were played, which I thought the toc was also supposed to be for. and the "-1" values for producer_playlist_n show up in the daq/audio lines of the toc. I glanced at video_server.py and I see the producer_playlist_n increment there; I could probably implement it in sound_server.py but don't want to potentially break anything... (edit: I made some quick changes to sound_server.py and I think I have the producer_playlist_n incrementing now. nothing has broken yet...)

nzjrs · 2022-02-10T19:41:00Z

a small related fix is that for clarity only one experiment _start should be written to the TOC

fly-vr/flyvr/hwio/phidget.py

Line 165 in bf4718a

if CommonMessages.EXPERIMENT_START in msg:

nzjrs · 2022-02-10T19:44:49Z

hmm, for backends that output chunks (daq,audio) the produce_playlist_n should be incremented in the chunker/data_generator. A test is here that seems to be testing this

fly-vr/tests/audio/test_samplechunks.py

Line 218 in bf4718a

assert c3.producer_playlist_n == 3

nzjrs · 2022-02-10T19:45:40Z

(alternative hypothesis is that missing TOC lines are from lost IPC messages - best to increase logging and check each send per backend gets received by the TOC writer)

davidt0x · 2022-02-10T20:05:13Z

It seems to me the missing entries in TOC are only ones where the stimulus is repeating? Am I missing something? If that is the case, it is probably this line of code that is the culprit:

fly-vr/flyvr/audio/sound_server.py

Lines 398 to 413 in 5dac72a

    
           if chunk_producers_differ(self._last_chunk, chunk): 
        
               self._log.debug('chunk from new producer: %r' % chunk) 
        
               self.flyvr_shared_state.signal_new_playlist_item(chunk.producer_identifier, BACKEND_AUDIO, 
        
                                                                chunk_producer_instance_n=chunk.producer_instance_n, 
        
                                                                chunk_n=chunk.chunk_n, 
        
                                                                chunk_producer_playlist_n=chunk.producer_playlist_n, 
        
                                                                chunk_mixed_producer=chunk.mixed_producer, 
        
                                                                chunk_mixed_start_offset=chunk.mixed_start_offset, 
        
                                                                # ensure identical values to the h5 row 
        
                                                                fictrac_frame_num=row[0], 
        
                                                                daq_output_num_samples_written=row[1], 
        
                                                                daq_input_num_samples_read=row[2], 
        
                                                                sound_output_num_samples_written=row[3], 
        
                                                                video_output_num_frames=row[4], 
        
                                                                # and a time for replay experiments 
        
                                                                time_ns=row[5])

nzjrs · 2022-02-10T21:04:36Z

It seems to me the missing entries in TOC are only ones where the stimulus is repeating? Am I missing something? If that is the case, it is probably this line of code that is the culprit:

fly-vr/flyvr/audio/sound_server.py

Lines 398 to 413 in 5dac72a

if chunk_producers_differ(self._last_chunk, chunk):

self._log.debug('chunk from new producer: %r' % chunk)

self.flyvr_shared_state.signal_new_playlist_item(chunk.producer_identifier, BACKEND_AUDIO,

chunk_producer_instance_n=chunk.producer_instance_n,

chunk_n=chunk.chunk_n,

chunk_producer_playlist_n=chunk.producer_playlist_n,

chunk_mixed_producer=chunk.mixed_producer,

chunk_mixed_start_offset=chunk.mixed_start_offset,

# ensure identical values to the h5 row

fictrac_frame_num=row[0],

daq_output_num_samples_written=row[1],

daq_input_num_samples_read=row[2],

sound_output_num_samples_written=row[3],

video_output_num_frames=row[4],

# and a time for replay experiments

time_ns=row[5])

The simple sequential case seems to be well enough tested here https://github.com/murthylab/fly-vr/blob/master/tests/audio/test_samplechunks.py

but I think this is the right track. I believe I know what is going on.

the time based experiment plays items, which hits this

fly-vr/flyvr/audio/sound_server.py

Line 203 in 5dac72a

self.data_generator = self._stim_playlist.play_item(stim)

which creates a new data_generator

but I think this check

fly-vr/flyvr/audio/signal_producer.py

Line 64 in 5dac72a

return this.mixed_producer or (prev.producer_identifier,

does not catch what is a new chunk from a new data generator. I would suggest to confirm that producer_instance_n is incremented correctly resulting from the and also check producer_instance_n equality in chunk_producers_differ. however, I'm AFK from my windows dev PC - so it could be in fact that this was not the design of producer_instance_n and so chunk_producers_differ should also check the hash() of the creating data generator to handle this case

(I can test next week FWIW)

nzjrs · 2022-02-10T21:07:39Z

(I can test next week FWIW)

for the above, a hello world test would be to make a test that essentially calls play_item on a playlist for a non sequential sequence of items in the playlist, and checks the subsequent first chunks from the returned generatorS are detected as chunk_producers_differ() == True

mwangq · 2022-02-11T00:54:30Z

hmm, for backends that output chunks (daq,audio) the produce_playlist_n should be incremented in the chunker/data_generator.

strange news. I found some .toc from Dec and Jan where the (audio) chunk_producer_playlist_n incremented as expected. all of these experiments were audio backend only, using a variety of stimuli (custom matlab made stimuli and "sin" item types). I wonder if the .toc issue I showed above (where the chunk_producer_playlist_n are all -1, for both daq and audio) might have something to do with multiple backends...

nzjrs · 2022-02-11T08:32:58Z

hmm, for backends that output chunks (daq,audio) the produce_playlist_n should be incremented in the chunker/data_generator.

strange news. I found some .toc from Dec and Jan where the (audio) chunk_producer_playlist_n incremented as expected. all of these experiments were audio backend only, using a variety of stimuli (custom matlab made stimuli and "sin" item types). I wonder if the .toc issue I showed above (where the chunk_producer_playlist_n are all -1, for both daq and audio) might have something to do with multiple backends...

I would expect this to have worked if the experiment was not dynamic - i.e. didn't use the experiment time stanza. If it was simply a predefined sequential playlist that played in order, then all the chunk_producer_playlist_n and friends will all be incremented correctly

that is, as I said #16 (comment) it's probbably a function of tearing down and recreating the data_generator which confuses the chunker change detection

ref: #16 `Playlist.play_item` returns a new generator that has default values for its producer. This means that some sequences of manually played items in a playlist will not register as coming from different producers - so events describing the change in the playing stimulus item will not be emitted and thus will be missing from the TOC file

ref: #16 be consistent with the playlist case and set the producer_playlist_n field also when extracting the single `AudioStim.data_generator` via `Playlist.play_item`

nzjrs · 2022-02-11T12:49:16Z

I believe all issues here should be fixed by these 2 commits

I could only test this on Linux so the commits should be cherry picked appropriately and the branch not merged wholesale.

mwangq · 2022-02-11T18:03:29Z

hi @nzjrs, yep, the experiments I mentioned in my last comment (where producer_playlist_n incremented as expected) were all experiments that played off a playlist, not an experiment time stanza or an experiment.py.

I pulled the 3 files you edited:
flyvr/audio/signal_producer.py
flyvr/audio/stimuli.py
tests/audio/test_samplechunks.py

and re-ran the same config file as yesterday, which has audio, video, daq backends, for which the playlist_instance_n should be audio: [2, 2, 0, 1, 2], video: [2, 2, 0, 0, 1], daq: [1, 0, 0, 2, 0]. I was hoping that the toc file would match exactly the instructed time/do stanza (5 entries each from audio, video, daq = 15 toc entries), and the producer_playlist_n for each backend would increment from 0 to 4.

for the test yesterday 20220210_1122.toc.txt: the chunk_producer_instance_n in toc was correct, but the chunk_producer_playlist_n were all -1 for audio and daq. video increments fine. there was a missing audio toc entry, and an additional daq toc entry.

for the test today, 20220211_1244.toc.txt: video is fine again, but the chunk_producer_instance_n has some strange -106, -107 ... values, and the chunk_producer_playlist_n now looks like chunk_producer_instance_n

maybe instance and playlist variables were accidentally swapped somewhere
maybe there were additional files I should have pulled... ?
there are still 2 additional daq entries on the toc before other things (the other backends?) come online (can look at the fictrac_frame_num to see the triplets of toc lines indicating a do: from the config file.).

nzjrs · 2022-02-11T18:11:51Z

for the test today, 20220211_1244.toc.txt: video is fine again, but the chunk_producer_instance_n has some strange -106, -107 ... values, and the chunk_producer_playlist_n now looks like chunk_producer_instance_n

maybe instance and playlist variables were accidentally swapped somewhere

maybe there were additional files I should have pulled... ?

there are still 2 additional daq entries on the toc before other things (the other backends?) come online (can look at the fictrac_frame_num to see the triplets of toc lines indicating a do: from the config file.).

OK, this all seems to be working now.

chunk_producer_playlist_n is the index of the playlist item in the playlist and seems to be correct in the file uploaded. To be honest if you just look at the toc then you can use the identifier to map the item back to its entry in the playlist. The _n values are just because .h5 doesnt let you store strings (ideally the identifier) intelligently.

chunk_producer_instance_n being -100 and decreasing is expected. Its now code for these were called by play_item. As long as these are the same values in the .h5 then the toc fulfills its role of being able to find the start of the row for the toc entry (by the addressing tuple chunk_producer_playlist_n, chunk_producer_instance_n)

the extra toc entries are the duplicated experiment _start entries and should be easy to ignore or a quick fix for @davidt0x to solve. As I said, I wrote all this without testing as I'm away from my Windows PC.

nzjrs · 2022-02-11T18:16:44Z

(also additional duplicate toc entries called _silence are because annoyingly for technical reasons many backends always need to be playing / outputting something)

nzjrs · 2022-02-11T18:22:08Z

oh the meaning of some toc stanzas being CHUNK_producer_playlist_n vs producer_playlist_n was to indicate if the backend is chunked or not - but they otherwise should convey the same information (which is the index into the playlist definition of the item).

the chunk_producer_instance_n and producer_instance_n values don't actually have a meaning beyond they should be sufficiently unique to allow unambiguous finding of the start/correct row in the appropriate .h5 file for the toc entry.

mwangq · 2022-02-11T20:32:21Z

from the sync doc
producer_playlist_n is the "running count of the number of stimulus items the backend playlist has played" whereas the producer_instance_n is the "numerical identifier of the instance of the underlying stimulus item"

you mention "the index into the playlist definition of the item" -- pretty sure that should be producer_instance_n, not producer_playlist_n (in your last comment) then?

it sounds to me like the chunk_producer_playlist_n and chunk_producer_instance_n values should be swapped... how do i do that...

mwangq · 2022-02-11T20:35:10Z

I thought (chunk or not) producer_instance_n would indicate a numerical identifier for the playlist item (e.g. [2, 2, 0, 0, 1]), and (chunk or not) producer_playlist_n would indicate the running count of the total number of stimulus items from a backend (e.g. [0, 1, 2, 3, 4]) (or for audio/daq: -105, -106, -107, -108, -109).

I am currently writing code to identify stim info:

how stim numerical identifier (e.g. 2) maps onto stim playlist name (e.g. "grating"). this is info that is ONLY available in the toc. currently, regardless of backend, I check the "producer_instance_n" field for this value. but in the current (see previous toc), we have instance_n and playlist_n swapped for audio/daq)
the fictrac frames corresponding to a given stim: to get this, I basically run. np.diff on both producer_instance_n and producer_playlist_n (in the .h5 sync files) to find the transition points between stimuli. if stimuli aren't repeated back-to-back then these are the same. if stimuli are repeated back-to-back then that's where the running count (producer_playlist_n as I understand it) is useful! which has now been fixed for the audio/daq (except saved into the "instance_n" field)

nzjrs · 2022-02-11T20:51:54Z

_producer_instance_n is to disambiguate in any combined .h5 file the case where two playlists are identical (one item named, 'foo') but one runs on daq and one runs on audio. _producer_instance_n will be unique for these two even though the first play through per backend of these two would have the same values of 'identifier' and producer_playlist_n

conceptually (because this is all dynamic) an analagous situation is loading a new playlist in while running that contains the same named item 'foo' but different data. In this case again, producer_playlist_n and identifier would be the same, but _producer_instance_n would be different

the meat of this bug is that also, conceptually in a dynamic experiment, calling (for example) play_item(foo) 10 times in a row was not saving enough information in the toc to disambiguate these were in fact 'foo' being output multiple times (a secondary consequence of the bug was the toc line being missing, but the primary cause was not handling this dynamic case like the other 're-user of things called the same thing and we cant store strings in h5 case'.

mwangq · 2022-02-12T00:28:32Z

in general, I understand the need for both a field (identifier of who was played) and a field (running count to distinguish start times of a stim played 10 times in a row). From the sync doc I linked above, the numerical 'identifier' for a specific item is supposed to be in instance_n. and the running_count is playlist_n.

from page 6 of the sync doc

Within the same backend (daq, audio, etc), producer_instance_n in the h5 files corresponds to item from the same row (record) in the TOC file, which corresponds to the identifier of the stimulus item in the backend playlist.

That is also how it is coded up in video_server and in the toc files I've sent! in either case, I think we should be consistent, and put the identifier (2 = 'grating') in instance_n, and the running count in playlist_n (as described in your sync doc) for audio and daq backends... or else, swap the fields in video_server, and ALSO update the sync doc to reflect the swap.

regarding your other comments - that's really interesting, i currently can't think of a situation where the experimenter would give items from two different backends with the same name... also flyVR2 doesn't have combined .h5 files anyway. i get separate daq/fic/sound/video server .h5 files. and in the toc file, the backend is listed, so even though both item names are 'foo', i know they are different items. :) (but I will never name two backends' items the same thing haha! :))

regarding your second para - also didn't know "loading a new playlist while running" was an option! I thought all playlist items were preset by the playlist section....

one thing I'm aware of, that Dudi and I noticed in January, is that in using a regular playlist (no time stanza, no experiment.py) it's absolutely necessary to name each item differently. If the items are listed with the same name, even though the fields (e.g. sine frequency) are different, flyVR will play the item based on the fields from the first time that item was named. anyway, that is how I found out that playlist name items have to be unique!

nzjrs · 2022-02-12T01:25:26Z

The test is the canonical description of the meaning of the fields. If there is a disagreement between to doc and the data or the test the test is correct and I have made a mistake writing the doc.

The explanation above was to illustrate potential degenerate cases students might end up with, not to suggest they make sense or are 'supported' in any sense.

I admit understanding the meaning of these fields is complicated and challenging in the dynamic experiment case. I suggest moving in increasing complexity from a static to a dynamic experiment and identifying issues along the way and filing new bugs - because (with the exception of multiple start messages) the bug at the core of this issue has been resolved.

mwangq · 2022-02-14T18:15:37Z

I see. Thanks very much for explaining, and for your help on this topic, @nzjrs !!

davidt0x added the bug Something isn't working label Feb 8, 2022

davidt0x self-assigned this Feb 8, 2022

nzjrs added a commit that referenced this issue Feb 11, 2022

stimuli: retain provenance in playlist when play_item

5722d59

ref: #16 be consistent with the playlist case and set the producer_playlist_n field also when extracting the single `AudioStim.data_generator` via `Playlist.play_item`

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Missing entry in TOC file for first video stimulus in playlist mode. #16

Missing entry in TOC file for first video stimulus in playlist mode. #16

davidt0x commented Feb 8, 2022

davidt0x commented Feb 8, 2022

mwangq commented Feb 10, 2022 •

edited

Loading

nzjrs commented Feb 10, 2022

nzjrs commented Feb 10, 2022

nzjrs commented Feb 10, 2022

davidt0x commented Feb 10, 2022

nzjrs commented Feb 10, 2022

nzjrs commented Feb 10, 2022 •

edited

Loading

mwangq commented Feb 11, 2022

nzjrs commented Feb 11, 2022 •

edited

Loading

nzjrs commented Feb 11, 2022

mwangq commented Feb 11, 2022

nzjrs commented Feb 11, 2022 •

edited

Loading

nzjrs commented Feb 11, 2022

nzjrs commented Feb 11, 2022

mwangq commented Feb 11, 2022

mwangq commented Feb 11, 2022

nzjrs commented Feb 11, 2022

mwangq commented Feb 12, 2022

nzjrs commented Feb 12, 2022

mwangq commented Feb 14, 2022

Missing entry in TOC file for first video stimulus in playlist mode. #16

Missing entry in TOC file for first video stimulus in playlist mode. #16

Comments

davidt0x commented Feb 8, 2022

davidt0x commented Feb 8, 2022

mwangq commented Feb 10, 2022 • edited Loading

nzjrs commented Feb 10, 2022

nzjrs commented Feb 10, 2022

nzjrs commented Feb 10, 2022

davidt0x commented Feb 10, 2022

nzjrs commented Feb 10, 2022

nzjrs commented Feb 10, 2022 • edited Loading

mwangq commented Feb 11, 2022

nzjrs commented Feb 11, 2022 • edited Loading

nzjrs commented Feb 11, 2022

mwangq commented Feb 11, 2022

nzjrs commented Feb 11, 2022 • edited Loading

nzjrs commented Feb 11, 2022

nzjrs commented Feb 11, 2022

mwangq commented Feb 11, 2022

mwangq commented Feb 11, 2022

nzjrs commented Feb 11, 2022

mwangq commented Feb 12, 2022

nzjrs commented Feb 12, 2022

mwangq commented Feb 14, 2022

mwangq commented Feb 10, 2022 •

edited

Loading

nzjrs commented Feb 10, 2022 •

edited

Loading

nzjrs commented Feb 11, 2022 •

edited

Loading

nzjrs commented Feb 11, 2022 •

edited

Loading