Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(document-search): add support for images #81

Open
mhordynski opened this issue Oct 10, 2024 · 0 comments · May be fixed by #121
Open

feat(document-search): add support for images #81

mhordynski opened this issue Oct 10, 2024 · 0 comments · May be fixed by #121
Assignees
Labels
document search Changes to the document search package feature New feature or request
Milestone

Comments

@mhordynski
Copy link
Member

mhordynski commented Oct 10, 2024

Feature description

New type of element should be introduced into DocumentSearch: ImageElement.

ImageElement can be originate either from:

  • bigger document like pdf
  • be a standalonde .img / .png document

ImageElement should consist:

  • summary / description generated by multi-modal llm
  • OCR extracted text
  • image bytes

Motivation

A lot of technical documentation / manuals consist a lot of technical drawings, schemas, etc. which may be really important part of context while answering user questions

Additional context

In the future, we may try directly embedding images rather than creating descriptions with LLM: #85

@mhordynski mhordynski added the feature New feature or request label Oct 10, 2024
@mhordynski mhordynski added this to the Ragbit 0.4 milestone Oct 10, 2024
@ludwiktrammer ludwiktrammer changed the title feat(document-search): add support to images feat(document-search): add support for images Oct 10, 2024
@micpst micpst added the document search Changes to the document search package label Oct 11, 2024
@konrad-czarnota-ds konrad-czarnota-ds linked a pull request Oct 21, 2024 that will close this issue
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
document search Changes to the document search package feature New feature or request
Projects
Status: In Progress
Development

Successfully merging a pull request may close this issue.

3 participants