-
Notifications
You must be signed in to change notification settings - Fork 29
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Added extra experiments - mainly around macro chunking (#16)
* initial experiment setup * changed len tokens * soft boundary correctly overlaps for later batches * removed incorrect soft * added loop over overlap sizes * added results * experiment for WikimQA * added narrativeQA task * added remainder of longembed datasets * typo * more tasks for soft/hard * experiments * added macro chunk experiment file * for merge * chunk size results * added benchmark files for macro chunks * removed raw results * added plotting files for results, requires running them first * renamed file * renamed file * added plt.show() * renamed to macro chunking * moved file * Update chunked_pooling/chunking.py according to comment Co-authored-by: Michael Günther <[email protected]> * updated main experiment file with long late chunking * remove redundant macro chunking file * updated default to truncation (8192) * updated error message/print statement * changed how local llm is loaded * removed comment on pip and update default model to phi --------- Co-authored-by: Michael Günther <[email protected]>
- Loading branch information
1 parent
f631894
commit 7c2bc57
Showing
5 changed files
with
432 additions
and
12 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.