How to do batch inference with a Transformers pipeline #3614
khongtrunght asked this question in Q&A
I'm new to BentoML. When I read this example, https://github.com/bentoml/BentoML/tree/main/examples/inference_graph, I see that the models are not batched.
When I follow the Adaptive Batching tutorial and do something similar, it does not work and raises an error at runtime. Can anybody with experience help me?
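For context, here is a minimal sketch of what enabling adaptive batching for a transformers pipeline typically looks like in BentoML 1.x. It is not the asker's actual code (which was not included); the model name `text-classifier` and service name are assumptions, and the key point is the `signatures` option of `save_model`, since transformers pipelines are saved with `batchable: False` by default:

```python
import bentoml
import transformers
from bentoml.io import JSON, Text

# Save-time: mark the pipeline's __call__ as batchable so the runner's
# adaptive batcher is allowed to group concurrent requests along dim 0.
# ("text-classifier" is a hypothetical name used for this sketch.)
pipeline = transformers.pipeline("text-classification")
bentoml.transformers.save_model(
    "text-classifier",
    pipeline,
    signatures={"__call__": {"batchable": True, "batch_dim": 0}},
)

# service.py: serve the saved pipeline through a runner; async_run lets
# the adaptive batcher collect requests that arrive close together.
runner = bentoml.transformers.get("text-classifier:latest").to_runner()
svc = bentoml.Service("text_classifier_service", runners=[runner])

@svc.api(input=Text(), output=JSON())
async def classify(text: str):
    # The batchable signature expects a batch (a list) along dim 0,
    # so wrap the single input and unwrap the single result.
    results = await runner.async_run([text])
    return results[0]
```

Note that adaptive batching only takes effect when the API server receives concurrent requests; a single call still runs with batch size 1. Also, batching must be enabled at save time as above; sending batched inputs to a runner whose model was saved with the default non-batchable signature is one possible source of runtime errors like the one described.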