RuntimeError: [enforce fail at alloc_cpu.cpp:114] data. DefaultCPUAllocator: not enough memory: you tried to allocate 33507323568 bytes. #21

Godly-GM · 2024-11-02T14:39:59Z

I tried to pass the context from a 19-page PDF to the model, but I encountered this error:
RuntimeError: [enforce fail at alloc_cpu.cpp:114] data. DefaultCPUAllocator: not enough memory: you tried to allocate 33507323568 bytes.

Here, input_text is the content of the PDF.
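For scale, the number in the error message can be converted to readable units and checked against a typical attention layout. The sketch below is only arithmetic on the figure from the traceback. The 12 attention heads and float32 scores are assumptions for illustration, not something the error message states.

```python
import math

requested_bytes = 33_507_323_568  # figure taken from the error message

# Convert to GiB to compare against installed RAM.
gib = requested_bytes / 2**30
print(f"{gib:.1f} GiB")

# Assumption: the tensor holds float32 (4-byte) attention scores for 12 heads,
# i.e. heads * seq_len * seq_len * 4 bytes. Both values are illustrative guesses.
BYTES_PER_FLOAT32 = 4
ASSUMED_HEADS = 12
scores_per_head = requested_bytes // (BYTES_PER_FLOAT32 * ASSUMED_HEADS)
seq_len = math.isqrt(scores_per_head)

# Check whether the assumed layout reproduces the requested size exactly.
is_exact = ASSUMED_HEADS * seq_len * seq_len * BYTES_PER_FLOAT32 == requested_bytes
print(seq_len, is_exact)
```

Under these assumptions the request works out to a square score matrix over roughly 26 thousand tokens, which would be consistent with memory growing quadratically with the input length.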

guenthermi · 2024-11-04T11:56:35Z

It looks like your machine doesn't have enough memory to encode very long sequences of text. You could use the long late chunking method, which is implemented in the _embed_with_overlap method in our evaluation code:

late-chunking/chunked_pooling/mteb_chunked_eval.py, line 237 in db558c3

model_outputs = self._embed_with_overlap(model, model_inputs)

Use it together with a lower number of tokens (the long_late_chunking_embed_size property used in the function) to circumvent this issue.
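The general idea of encoding a long input in overlapping windows can be sketched as follows. This is a minimal illustration, not the repository's _embed_with_overlap implementation: the names window_spans, embed_in_windows, and encode are hypothetical, and encode stands in for any function that returns one output per input token.

```python
from typing import Callable, List, Sequence, Tuple


def window_spans(num_tokens: int, window_size: int, overlap: int) -> List[Tuple[int, int]]:
    """Return (start, end) token spans covering num_tokens with the given overlap."""
    if window_size <= overlap:
        raise ValueError("window_size must be larger than overlap")
    spans = []
    start = 0
    while start < num_tokens:
        end = min(start + window_size, num_tokens)
        spans.append((start, end))
        if end == num_tokens:
            break
        # Step back by `overlap` tokens so the next window sees some left context.
        start = end - overlap
    return spans


def embed_in_windows(
    tokens: Sequence,
    encode: Callable[[Sequence], List],  # hypothetical: one output per token
    window_size: int,
    overlap: int,
) -> List:
    """Encode tokens window by window and stitch the per-token outputs together."""
    outputs: List = []
    for i, (start, end) in enumerate(window_spans(len(tokens), window_size, overlap)):
        window_out = encode(tokens[start:end])
        # In later windows, the first `overlap` outputs duplicate tokens that were
        # already emitted, so they are dropped and serve only as context.
        skip = 0 if i == 0 else overlap
        outputs.extend(window_out[skip:])
    return outputs


# Toy usage with a stand-in encoder that multiplies each token id by 10.
toy_tokens = list(range(10))
toy_outputs = embed_in_windows(toy_tokens, lambda w: [t * 10 for t in w], window_size=4, overlap=1)
print(window_spans(10, 4, 1))
print(len(toy_outputs))
```

With standard self-attention, each call to the encoder only sees window_size tokens, so the peak attention memory depends on the window size rather than on the full document length. A smaller window lowers memory at the cost of less context per token.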
