You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Have I written custom code (as opposed to using a stock example script provided in TensorFlow.js):
Mostly stock inference code with a few small modifications.
OS Platform and Distribution: Macbook Pro 16 GB 2020 (Intel Mac) Running OS X 10.15.7
TensorFlow.js installed from: NPM
TensorFlow.js version: Reproduced with 3.9.0 and 4.21.0
Browser version: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/129.0.0.0 Safari/537.36
Tensorflow.js Converter Version:
Describe the current behavior
We're running a TF.js model in production that is a fine tuned MobileNetv1. This model works perfectly for all of our users except one, we are unable to reproduce the issue locally or detect the issue before it occurs so we could switch to CPU. This issue started about a month ago, during which time we had not updated any of our TF code or components.
Problem:
The model loads successfully using loadGraphModel
When we attempt to warm the model it goes into an infinite loop and the system becomes unresponsive.
The console reports "Unable to create WebGLTexture" but it is not clear if this happens before or after the loop.
We have also seen "Error: Failed to link vertex and fragment shaders." when trying to have them reproduce.
You can see the stackstrace for when the system going into a loop (also attached)
Describe the expected behavior
The model loads successfully using loadGraphModel
The model warms within 50-500ms
The model can be used normally.
Here is what the stackstrace looks like when the model successfully loads and warms.
Standalone code to reproduce the issue
We cannot reproduce on local systems. But are open to any ideas on how to reproduce the problem.
Other info / logs Include any logs or source code that would be helpful to
diagnose the problem. If including tracebacks, please include the full
traceback. Large logs and files should be attached.
We wrote a TF testing page to help isolate the issue, screen shots are below. These tests all pass for our dev and QA team, but running the model fails for our user.
System information
Mostly stock inference code with a few small modifications.
Macbook Pro 16 GB 2020 (Intel Mac) Running OS X 10.15.7
NPM
3.9.0
and4.21.0
Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/129.0.0.0 Safari/537.36
Describe the current behavior
We're running a TF.js model in production that is a fine tuned MobileNetv1. This model works perfectly for all of our users except one, we are unable to reproduce the issue locally or detect the issue before it occurs so we could switch to CPU. This issue started about a month ago, during which time we had not updated any of our TF code or components.
Problem:
loadGraphModel
You can see the stackstrace for when the system going into a loop (also attached)
Describe the expected behavior
loadGraphModel
Here is what the stackstrace looks like when the model successfully loads and warms.
Standalone code to reproduce the issue
We cannot reproduce on local systems. But are open to any ideas on how to reproduce the problem.
Other info / logs Include any logs or source code that would be helpful to
diagnose the problem. If including tracebacks, please include the full
traceback. Large logs and files should be attached.
tf.ENV.features:
We wrote a TF testing page to help isolate the issue, screen shots are below. These tests all pass for our dev and QA team, but running the model fails for our user.
Trace-20240930T110703.json.zip
The text was updated successfully, but these errors were encountered: