After a quick look at the code online, it seems only plain text is supported #6

Open
spar025 opened this issue Apr 28, 2024 · 2 comments

Comments

spar025 commented Apr 28, 2024

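There is a bug in the text-splitting loop, but it is minor.
However, it seems only plain text is supported. What I am struggling with right now is tables; I have been fighting with them for four or five days. When the author has time, could you add handling for PDF and Word documents that contain complex tables?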
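
For readers hitting the same table problem, one common workaround is to flatten each table row into a "header: value" line before splitting, so every chunk still carries its column names. The sketch below is only an illustration under stated assumptions: the table has already been extracted into a list of rows by some other tool, the first row is the header, and there are no merged or nested cells. The function name `table_to_chunks` and its parameters are hypothetical and are not part of this project.

```python
from typing import List


def table_to_chunks(rows: List[List[str]], rows_per_chunk: int = 2) -> List[str]:
    """Flatten an already-extracted table into text chunks.

    Assumes rows[0] is the header row and every cell is plain text.
    Merged or nested cells are NOT handled by this sketch.
    """
    if rows_per_chunk <= 0:
        raise ValueError("rows_per_chunk must be positive")
    if not rows:
        return []

    header, body = rows[0], rows[1:]

    # Pair every cell with its column name so a row is readable on its own.
    lines = []
    for row in body:
        pairs = [f"{h.strip()}: {c.strip()}" for h, c in zip(header, row)]
        lines.append("; ".join(pairs))

    # Group a fixed number of rows per chunk; rows are never cut in half.
    return [
        "\n".join(lines[i:i + rows_per_chunk])
        for i in range(0, len(lines), rows_per_chunk)
    ]


if __name__ == "__main__":
    table = [["Name", "Price"], ["A", "1"], ["B", "2"], ["C", "3"]]
    for chunk in table_to_chunks(table, rows_per_chunk=2):
        print(chunk)
        print("---")
```

Because the header is repeated on every row, a retrieved chunk can be understood without the rest of the table. Genuinely complex tables (merged cells, multi-level headers) would need extra normalization before this step.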

@bigcyy bigcyy closed this as completed May 6, 2024
@bigcyy bigcyy reopened this May 6, 2024
Owner

bigcyy commented May 6, 2024

I'm busy with graduation at the moment, so the project has been on hold. You could take a look at the Python LangChain library.

Author

spar025 commented May 21, 2024

> I'm busy with graduation at the moment, so the project has been on hold. You could take a look at the Python LangChain library.

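Thanks for your hard work, and good luck as you head out into the working world. I have tried LangChain, and the results were not good. Splitting the text and loading it into the store are actually fairly simple, so LangChain isn't really necessary. I just don't know how other companies handle this. I'm researching it on my own, material is scarce, and it's hard going.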
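
To illustrate how small the splitting step can be without LangChain, here is a minimal sketch of fixed-size chunking with overlap. It is not this project's code: the function name `split_text` and the `chunk_size` and `overlap` parameters are hypothetical, and sizes are counted in characters for simplicity. Loading the chunks into a store is left out.

```python
from typing import List


def split_text(text: str, chunk_size: int = 200, overlap: int = 20) -> List[str]:
    """Split text into character chunks that overlap by `overlap` characters."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")

    chunks = []
    step = chunk_size - overlap  # always >= 1, so the loop is guaranteed to advance
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        # Stop once this chunk reaches the end, to avoid a trailing
        # fragment that is fully contained in the previous chunk.
        if start + chunk_size >= len(text):
            break
        start += step
    return chunks


if __name__ == "__main__":
    print(split_text("abcdefghij", chunk_size=4, overlap=1))
```

Requiring `overlap < chunk_size` keeps the step positive, which is the usual guard against a splitting loop that never terminates. A real pipeline would more likely split on sentence or paragraph boundaries than on raw character offsets.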
