-
Notifications
You must be signed in to change notification settings - Fork 173
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
blurred pages not showing up #62
Comments
I also have this problem. Some pages will say "Blurred Content in Page ___" and all you see is blank white page. |
yep |
@peppeamend98 the issue here is that they use two different CDNs. One is hosted on their own server (https://pieces.studocu.com), and the other is hosted by Amazon (https://d3tvd1u91rr79.cloudfront.net). For some reason (it may be geographical nature to load resources more efficiently; Spotify and Co. use the same strategy to ensure global efficient distribution), they employ their own CDN for some documents, where the plugin works just fine, and for others, they use the Amazon one. When we try to replace the /blurred part in the image src, it gives a 403 (unauthorized access error). When a document is uploaded, it gets processed, where the text of the pdf is extracted, and only the parts (like graphics, images, and so on) are stored as images. When you now try to access the uploaded document, the image is loaded as background, and the text is laid over it. As you might have realized, no text is overlayed when the image is blurred. It is hard to circumvent this, as getting the unblurred images is only part of the problem. I am currently trying to understand how the text is loaded. However, the js code is quite messy as it uses extensive libraries (react, PerimeterX (against web-scrapping), and so on), which are loaded in chunks. |
This is the script that gets the text and this is the part of the code that loads the text (I mean, it's the part where the text is not loaded, because the pages are blurred) :
(the variable and this is where I got so far:
issues: apparently, the validation process includes reading the timestamp present in the Policy token [edit: fixed by replacing the timestamp in the decoded token) |
@KiSa04 have u made any progress? I'm getting stuck at the signature part. You can only use one signature for one page and not for another. Did you find a solution for this? |
@KXTOD I haven't looked any more into it, as I haven't had any time for it, but I doubt we will be able to crack down the signature. I'd say the best approach right now would be reverse engineering their Android App API and finding out if they use aws there as well (I'm very confident their approach is different there - it would be a mess to replicate their way of handling it on the website) |
It seems like they are migrating from their servers to aws. A generous number of documents have been migrated already, so this extension will not work anymore. |
what do i need to do?
The text was updated successfully, but these errors were encountered: