Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

New Features: lm.get_URL(), lm_get_MD(), lm_get_PDF, lm_getTXT, etc... #33

Open
Gustavo-Livemix opened this issue Mar 18, 2024 · 1 comment

Comments

@Gustavo-Livemix
Copy link

It will be wonderfull if there were Helper functions for external retrieval like:

  • lm.get_URL(path) for using an URL
  • lm_get_MD(path) for using Markdown document
  • lm_get_PDF(path) for using PDF documents
  • lm_getTXT(path) for using txt documents
  • lm_get_DOC(path) for using MS Word documents
  • lm_get_JSON(path) for using JSON documents
  • etc
    Like there is for lm.get_wiki (), lm.get_weather(), lm.getdate()
    Thanks a lot and keep the great work! It's amazing!
@jncraton
Copy link
Owner

jncraton commented Mar 19, 2024

Thanks for the kind words! I'm glad that you are finding this package useful.

I can certainly see the value of simplifying retrieval from different types of documents. I'm not planning to implement this right now, but I'll consider ways to better integrate this in the future.

As you are probably aware, there are already packages such as textract that specialize in solving the problem of going from all sorts of documents to plain text. Using something like that would hopefully address your immediate needs.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants