Possible to do sub-sentence level extractive summarization? #41

Hellisotherpeople · 2020-12-21T02:44:09Z

After reading the documentation, it looks like the Extractive Summarization components only score sentences. While this is how the vast majority of extractive summarization papers work, some extractive summarization systems and datasets work at the word level of granularity (namely, my own work is exclusively word-level extractive summarization)

Is there some way to make TransformerSum work at the word level of granularity out of the box? When I trained extractive word-level models, I used a final token classification head for it. Maybe that can be implemented here alongside the current sentence scoring heads?

HHousen · 2020-12-22T01:12:48Z

@Hellisotherpeople Out of the box, TransformerSum only supports extractive summarization at the sentence level. It doesn't support word level granularity yet. This could be implemented into the library. However, there are no plans to integrate it yet since I'm not familiar with word-level extractive summarization. Possibly in the pooling module we could add another option that passes the token vectors through a classifier without condensing them into sentence vectors. We also may need to change the testing method to work at the word level. I will look into this sometime this week.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Possible to do sub-sentence level extractive summarization? #41

Possible to do sub-sentence level extractive summarization? #41

Hellisotherpeople commented Dec 21, 2020 •

edited

Loading

HHousen commented Dec 22, 2020

Possible to do sub-sentence level extractive summarization? #41

Possible to do sub-sentence level extractive summarization? #41

Comments

Hellisotherpeople commented Dec 21, 2020 • edited Loading

HHousen commented Dec 22, 2020

Hellisotherpeople commented Dec 21, 2020 •

edited

Loading