Fix IndexError in para_split_v3.py for empty line handling #916
Commits on Nov 6, 2024
-
SHA: 6b3e142
-
SHA: 54844a5
Commits on Nov 7, 2024
-
feat(model): add xycut algorithm for block sorting
- Implement xycut algorithm to sort blocks when layoutreader fails
- Add recursive_xy_cut function to perform the xycut algorithm
- Update pdf_parse_union_core_v2.py to use xycut when layoutreader fails
- Modify draw_bbox.py to handle cases where layoutreader fails to sort blocks
SHA: 7d5850e
-
Merge pull request #898 from myhloli/fix-line-over-512
feat(model): add xycut algorithm for block sorting
SHA: 2600d32
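For readers unfamiliar with the technique named in the xycut commit above, here is a minimal sketch of a recursive XY-cut ordering. It is not the project's `recursive_xy_cut`: the function names, the `(x0, y0, x1, y1)` box format, and the fallback sort are all assumptions made for illustration. The idea is to look for a blank gap in the boxes' projection on one axis, split there, and recurse on each piece along the other axis.

```python
from typing import List, Optional, Sequence, Tuple

Box = Tuple[float, float, float, float]  # assumed format: (x0, y0, x1, y1)


def split_by_gaps(indices: Sequence[int], boxes: Sequence[Box], axis: int) -> List[List[int]]:
    """Group box indices into runs whose projections on `axis` overlap.

    axis 0 projects onto x (vertical cuts), axis 1 onto y (horizontal cuts).
    """
    lo, hi = axis, axis + 2
    ordered = sorted(indices, key=lambda i: boxes[i][lo])
    groups = [[ordered[0]]]
    reach = boxes[ordered[0]][hi]  # farthest extent covered so far
    for i in ordered[1:]:
        if boxes[i][lo] >= reach:
            groups.append([i])  # blank gap found: start a new group
        else:
            groups[-1].append(i)
        reach = max(reach, boxes[i][hi])
    return groups


def xy_cut_order(boxes: Sequence[Box], indices: Optional[List[int]] = None, axis: int = 1) -> List[int]:
    """Return box indices in an approximate reading order (hypothetical helper)."""
    if indices is None:
        indices = list(range(len(boxes)))
    if len(indices) <= 1:
        return list(indices)
    groups = split_by_gaps(indices, boxes, axis)
    if len(groups) == 1:
        # No cut on this axis; try the other one.
        axis = 1 - axis
        groups = split_by_gaps(indices, boxes, axis)
        if len(groups) == 1:
            # No cut on either axis: fall back to a top-to-bottom, left-to-right sort.
            return sorted(indices, key=lambda i: (boxes[i][1], boxes[i][0]))
    order: List[int] = []
    for group in groups:
        order.extend(xy_cut_order(boxes, group, 1 - axis))
    return order


# Two columns whose rows do not line up, given in scrambled order.
page = [
    (20, 9, 30, 30),   # 0: right column, lower block
    (0, 0, 10, 12),    # 1: left column, upper block
    (20, 0, 30, 8),    # 2: right column, upper block
    (0, 14, 10, 30),   # 3: left column, lower block
]
reading_order = xy_cut_order(page)
```

In this sketch the left column is read in full before the right one, because no horizontal gap separates the rows while a vertical gap separates the columns.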
Commits on Nov 8, 2024
-
refactor(pdf_parse): adjust line count limit for layoutreader
- Decrease the maximum line count from 512 to 316 for layoutreader
SHA: 5468e56
-
refactor(pdf_parse): adjust line count threshold for layoutreader
- Lower the line count threshold from 316 to 200 to ensure compatibility
- This change aims to prevent potential issues with layoutreader's maximum line support
SHA: 5936684
-
Merge pull request #902 from myhloli/fix-line-over-512
refactor(pdf_parse): adjust line count threshold for layoutreader
SHA: 5f79453
-
xu rui committed Nov 8, 2024
SHA: 7859c73
-
xu rui committed Nov 8, 2024
SHA: 91f8cbe
-
Merge pull request #906 from icecraft/feat/add_en_docs
Feat/add en docs
SHA: 784c61a
-
xu rui committed Nov 8, 2024
SHA: aa3df5f
-
Merge pull request #907 from icecraft/feat/using_next_docs
feat: using next_docs
SHA: 9581fcd
-
feat(table): integrate RapidTable model for table recognition
- Add RapidTable model support for table recognition
- Update table model configuration and initialization
- Modify table recognition process to use RapidTable when specified
- Add RapidTable dependency to setup.py
SHA: 240fe99
-
refactor(table): update default table model to Rapid Table
- Change the default table model from TABLE_MASTER to RAPID_TABLE
SHA: e78edb1
-
Merge pull request #910 from myhloli/dev
feat(table): integrate RapidTable model for table recognition
SHA: 74fba47
-
style(gradio-app): add missing file type in upload
- Add missing '.jpg' file type to the list of allowed file types for upload
SHA: 8ea2381
-
SHA: dd8da7b
-
Merge pull request #911 from myhloli/dev
fix(gradio-app): add missing file type in upload
SHA: 8eb699e
-
refactor(magic_pdf_parse_main): optimize model data handling and JSON output
- Add orig_model_list parameter to maintain original model data
- Deep copy model_json and pipe.model_list to preserve data integrity
- Update json_md_dump function call to include orig_model_list
- Improve condition check for empty model_json
SHA: 1fc053d
-
Merge pull request #912 from myhloli/dev
refactor(magic_pdf_parse_main): optimize model data handling and JSON output
SHA: 1e37e19
-
SHA: a09d9e8
-
SHA: b912797
-
test(table): improve ppTableModel test coverage
- Update test_image2html to use unittest framework
- Add more assertions
SHA: e65ff19
-
Merge pull request #914 from myhloli/dev
test(table): improve ppTableModel test coverage
SHA: 5e0c9d2
-
feat(table): add RapidOCR support for RapidTable model
- Integrate RapidOCR with RapidTable model for table recognition
- Improve memory management for devices with <= 8GB VRAM
- Update table recognition process to use RapidOCR for RapidTable
- Add rapidocr-paddle dependency in setup.py
SHA: fe2c2c0
-
Merge pull request #915 from myhloli/dev
feat(table): add RapidOCR support for RapidTable model
SHA: 5a3872b
Commits on Nov 9, 2024
-
SHA: e75076b
Commits on Nov 11, 2024
-
SHA: 7b1984f
-
SHA: f8ac8e1