- Apr 23, 2024
-
-
KevinHuSh authored
### What problem does this PR solve? #509 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)
-
chrysanthemum-boy authored
### What problem does this PR solve? Add `.doc` file parser, using tika. ``` pip install tika ``` ``` from tika import parser from io import BytesIO def extract_text_from_doc_bytes(doc_bytes): file_like_object = BytesIO(doc_bytes) parsed = parser.from_buffer(file_like_object) return parsed["content"] ``` ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by:
chrysanthemum-boy <fannc@qq.com>
-
KevinHuSh authored
### What problem does this PR solve? ### Type of change - [x] Refactoring
-
- Apr 22, 2024
-
-
KevinHuSh authored
### What problem does this PR solve? #474 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)
-
Shaun authored
parse markdown files as txt ### What problem does this PR solve? support markdown files ### Type of change - [x] New Feature (non-breaking change which adds functionality)
-
KevinHuSh authored
### What problem does this PR solve? ### Type of change - [x] New Feature (non-breaking change which adds functionality)
-
- Apr 19, 2024
-
-
KevinHuSh authored
### What problem does this PR solve? #384 ### Type of change - [x] Performance Improvement
-
- Apr 16, 2024
-
-
KevinHuSh authored
### What problem does this PR solve? Issue link:#384 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)
-
- Apr 07, 2024
-
-
KevinHuSh authored
### What problem does this PR solve? _Briefly describe what this PR aims to solve. Include background context that will help reviewers understand the purpose of the PR._ Issue link:#[[Link the issue here](https://github.com/infiniflow/ragflow/issues/236)] ### Type of change - [x] Documentation Update
-
KevinHuSh authored
### What problem does this PR solve? _Briefly describe what this PR aims to solve. Include background context that will help reviewers understand the purpose of the PR._ Issue link:#[[Link the issue here](https://github.com/infiniflow/ragflow/issues/226)] ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)
-
- Mar 27, 2024
- Mar 22, 2024
-
-
KevinHuSh authored
* add use layout or not option * trival
-
- Mar 20, 2024
- Mar 19, 2024
-
-
KevinHuSh authored
-
- Mar 08, 2024
- Mar 05, 2024
- Mar 04, 2024
- Mar 01, 2024
-
-
KevinHuSh authored
-
- Feb 29, 2024
- Feb 23, 2024
-
-
KevinHuSh authored
* init README of deepdoc, add picture processer. * add resume parsing
-
- Feb 21, 2024
-
-
KevinHuSh authored
-
- Feb 19, 2024
-
-
KevinHuSh authored
-
- Feb 08, 2024
-
-
KevinHuSh authored
-
- Feb 05, 2024
-
-
KevinHuSh authored
-
- Feb 02, 2024
-
-
KevinHuSh authored
Add naive chunking method to RAG
-