community: fix AzureSearch vectorstore asyncronous methods #24921

thedavgar · 2024-08-01T12:23:27Z

Description
Fix the asyncronous methods to retrieve documents from AzureSearch VectorStore. The previous changes from this commit create a similar code for the syncronous methods and the asyncronous ones but the asyncronous client return an asyncronous iterator "AsyncSearchItemPaged" as said in the issue #24740.
To solve this issue, the syncronous iterators in asyncronous methods where changed to asyncronous iterators.

@chrislrobert said in this comment that there was a still a flaw due to with blocks that close the client after each call. I removed this with blocks in the async_client following the same pattern as the sync client.

In order to close up the connections, a del method is included to gently close up clients once the vectorstore object is destroyed.

Issue: #24740 and #24064
Dependencies: No new dependencies for this change

Example notebook: I created a notebook just to test the changes work and gives the same results as the syncronous methods for vector and hybrid search. With these changes, the asyncronous methods in the retriever work as well.

Lint and test: Passes the tests and the linter

vercel · 2024-08-01T12:23:37Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

1 Skipped Deployment

Name	Status	Preview	Comments	Updated (UTC)
langchain	⬜️ Ignored (Inspect)	Visit Preview		Aug 1, 2024 3:51pm

chrislrobert · 2024-08-01T12:59:48Z

Thanks, @thedavgar, for tackling this!

From a quick look at the code, it looks like this should allow accessing search results, multiple calls, etc. Just two questions:

Is anything special needed to ensure that the async_client is eventually cleaned up (closed)?
There are still with blocks in aadd_embeddings and adelete, which leaves those as kind of one-way doors: they'll suffer the same issue where subsequent calls will fire exceptions about the client being closed. I can see focusing this fix on the actual search cases, but is there an issue somewhere to clean that up too? Shame to leave it broken.

Thanks again!

thedavgar · 2024-08-01T14:15:04Z

Thanks, @thedavgar, for tackling this!

From a quick look at the code, it looks like this should allow accessing search results, multiple calls, etc. Just two questions:

Is anything special needed to ensure that the async_client is eventually cleaned up (closed)?

There are still with blocks in aadd_embeddings and adelete, which leaves those as kind of one-way doors: they'll suffer the same issue where subsequent calls will fire exceptions about the client being closed. I can see focusing this fix on the actual search cases, but is there an issue somewhere to clean that up too? Shame to leave it broken.

Thanks again!

The best way could be to close the clients once the vectorstore instance is closed.
Thank you telling me the missing methods that still have the with block. I will try to tackle it.

libs/community/langchain_community/vectorstores/azuresearch.py

thedavgar

The async code works

chrislrobert

@thedavgar, this all looks good to me — but I have to say that I'm not an official reviewer for LangChain and don't know their standards and practices. If possible, I would ask @eyurtsev and/or @baskaryan to review, as they were involved in the original async PR #22075.

thedavgar · 2024-08-06T15:44:50Z

Hi @baskaryan,

I’m looking forward to approving this PR for the next LangChain community version. I’ve followed the steps in the contributor guide, but I’m not sure how to assign or approve the code.

Thanks in advance!

thedavgar · 2024-08-12T08:30:40Z

Hi @isahers1, the async version of AzureSearch vectorstore continues to fail. This PR solves several issues in the code. Could you help me get these changes approved to have it fixed for the next version of langchain community?

@chrislrobert

…-ai#24921) **Description** Fix the asyncronous methods to retrieve documents from AzureSearch VectorStore. The previous changes from [this commit](langchain-ai@ffe6ca9) create a similar code for the syncronous methods and the asyncronous ones but the asyncronous client return an asyncronous iterator "AsyncSearchItemPaged" as said in the issue langchain-ai#24740. To solve this issue, the syncronous iterators in asyncronous methods where changed to asyncronous iterators. @chrislrobert said in [this comment](langchain-ai#24740 (comment)) that there was a still a flaw due to `with` blocks that close the client after each call. I removed this `with` blocks in the `async_client` following the same pattern as the sync `client`. In order to close up the connections, a __del__ method is included to gently close up clients once the vectorstore object is destroyed. **Issue:** langchain-ai#24740 and langchain-ai#24064 **Dependencies:** No new dependencies for this change **Example notebook:** I created a notebook just to test the changes work and gives the same results as the syncronous methods for vector and hybrid search. With these changes, the asyncronous methods in the retriever work as well. ![image](https://github.com/user-attachments/assets/697e431b-9d7f-4d0d-b205-59d051ac2b67) **Lint and test**: Passes the tests and the linter

ND-code-ai · 2024-08-20T09:24:15Z

Amazing! For me your changes also fixed some faulty behaviour in the @search.score calculation where retrieved Doc objects would contain unexpected extremely low scores

fix async methods to retrieve documents

295ec79

dosubot bot added the size:L This PR changes 100-499 lines, ignoring generated files. label Aug 1, 2024

dosubot bot added community Related to langchain-community Ɑ: vector store Related to vector store module 🤖:bug Related to a bug, vulnerability, unexpected error with an existing feature labels Aug 1, 2024

fix linting issues in imports

4196062

thedavgar added 2 commits August 1, 2024 14:36

remove async with blocks in adelete and aadd_embeddings async methods

00a6da5

add deletion method to close gently the sync and async client

8313b48

chrislrobert reviewed Aug 1, 2024

View reviewed changes

libs/community/langchain_community/vectorstores/azuresearch.py Outdated Show resolved Hide resolved

thedavgar added 2 commits August 1, 2024 15:39

add checks of existing clients in the deletion method

1b4cd3c

add missing type notation in deletion method

e45d0f0

thedavgar commented Aug 6, 2024

View reviewed changes

thedavgar requested a review from chrislrobert August 6, 2024 08:11

chrislrobert approved these changes Aug 6, 2024

View reviewed changes

chrislrobert mentioned this pull request Aug 8, 2024

AzureSearch.avector_search_with_score() triggers "TypeError: 'AsyncSearchItemPaged' object is not iterable" when calling _results_to_documents() #24740

Open

5 tasks

isahers1 approved these changes Aug 13, 2024

View reviewed changes

dosubot bot added the lgtm PR looks good. Use to confirm that a PR is ready for merging. label Aug 13, 2024

isahers1 merged commit 9d08369 into langchain-ai:master Aug 13, 2024
43 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

community: fix AzureSearch vectorstore asyncronous methods #24921

community: fix AzureSearch vectorstore asyncronous methods #24921

thedavgar commented Aug 1, 2024 •

edited

Loading

vercel bot commented Aug 1, 2024 •

edited

Loading

chrislrobert commented Aug 1, 2024

thedavgar commented Aug 1, 2024

thedavgar left a comment

chrislrobert left a comment

thedavgar commented Aug 6, 2024

thedavgar commented Aug 12, 2024 •

edited

Loading

ND-code-ai commented Aug 20, 2024

community: fix AzureSearch vectorstore asyncronous methods #24921

community: fix AzureSearch vectorstore asyncronous methods #24921

Conversation

thedavgar commented Aug 1, 2024 • edited Loading

vercel bot commented Aug 1, 2024 • edited Loading

chrislrobert commented Aug 1, 2024

thedavgar commented Aug 1, 2024

thedavgar left a comment

Choose a reason for hiding this comment

chrislrobert left a comment

Choose a reason for hiding this comment

thedavgar commented Aug 6, 2024

thedavgar commented Aug 12, 2024 • edited Loading

ND-code-ai commented Aug 20, 2024

thedavgar commented Aug 1, 2024 •

edited

Loading

vercel bot commented Aug 1, 2024 •

edited

Loading

thedavgar commented Aug 12, 2024 •

edited

Loading