Reuse KNNVectorFieldData to reduce disk usage #1571

Open

wants to merge 1 commit into base: main
Conversation

@luyuncheng (Collaborator) commented Mar 20, 2024

Description

In some scenarios we want to reduce the disk usage and I/O throughput of the _source field. To do this, we exclude the k-NN fields from _source in the mapping, which means they are not stored there (so the k-NN fields cannot be retrieved from _source or used to rebuild it):

"mappings": { 
  "_source": { 
    "excludes": [
      "target_field1",
      "target_field2",
     ]
  }
}

So I propose using the doc_values of the vector fields instead, like:

POST some_index/_search
{
  "docvalue_fields": [
    "vector_field1",
    "vector_field2"
  ],
  "_source": false
}

Proposal

  1. Rewrite KNNVectorDVLeafFieldData to get data from doc values

I rewrote KNNVectorDVLeafFieldData so that KNN80BinaryDocValues can return the requested k-NN docvalue_fields, for example (vector_field1 is a knn_vector field):

"hits":[{"_index":"test","_id":"1","_score":1.0,"fields":{"vector_field1":["1.5","2.5"]}},{"_index":"test","_id":"2","_score":1.0,"fields":{"vector_field1":["2.5","1.5"]}}]

Optimization result (1M SIFT dataset, 1 shard):
with source stored: 1389 MB
without source stored: 1055 MB (-24%)

As a follow-up dive into the k-NN doc values fields: I think that when using the Faiss engine we could use the reconstruct_n interface to retrieve specific vectors and save the disk usage of BinaryDocValuesFormat, or, as the comments in this issue suggest, redesign a KnnVectorsFormat.

  2. Compose the vector fields back into _source

I added KNNFetchSubPhase with a processor, similar to FetchSourcePhase#FetchSubPhaseProcessor, that combines the docvalue_fields back into _source, much like synthetic-source logic. A rough sketch of the idea is shown below.
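To make the two parts concrete, here is a minimal, self-contained sketch (not the code in this PR) of the idea: read a vector back out of BinaryDocValues, then put it into the hit's source from a fetch subphase. It assumes OpenSearch's FetchSubPhase / FetchSubPhaseProcessor extension point and a plain little-endian float32 doc-values layout; the interface details, the encoding, and the hard-coded field name are illustrative assumptions and may differ from the actual KNN80BinaryDocValues format.

import org.apache.lucene.index.BinaryDocValues;
import org.apache.lucene.index.DocValues;
import org.apache.lucene.index.LeafReaderContext;
import org.apache.lucene.util.BytesRef;
import org.opensearch.search.fetch.FetchContext;
import org.opensearch.search.fetch.FetchSubPhase;
import org.opensearch.search.fetch.FetchSubPhaseProcessor;

import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.ByteOrder;
import java.util.ArrayList;
import java.util.List;

public class VectorSourceSubPhase implements FetchSubPhase {

    // Illustration only; the real subphase resolves every knn_vector field from the mapping.
    private static final String FIELD = "vector_field1";

    @Override
    public FetchSubPhaseProcessor getProcessor(FetchContext fetchContext) {
        return new FetchSubPhaseProcessor() {
            private BinaryDocValues values;

            @Override
            public void setNextReader(LeafReaderContext readerContext) throws IOException {
                // Step 1: per-segment binary doc values for the vector field.
                values = DocValues.getBinary(readerContext.reader(), FIELD);
            }

            @Override
            public void process(HitContext hitContext) throws IOException {
                if (values == null || !values.advanceExact(hitContext.docId())) {
                    return; // this document has no vector value
                }
                // Decode the stored bytes back into a float vector (layout is assumed).
                BytesRef ref = values.binaryValue();
                ByteBuffer buffer = ByteBuffer.wrap(ref.bytes, ref.offset, ref.length).order(ByteOrder.LITTLE_ENDIAN);
                List<Float> vector = new ArrayList<>(ref.length / Float.BYTES);
                while (buffer.remaining() >= Float.BYTES) {
                    vector.add(buffer.getFloat());
                }
                // Step 2: put the vector back into the parsed source map so it appears in _source.
                hitContext.sourceLookup().source().putIfAbsent(FIELD, vector);
            }
        };
    }
}

The real implementation also has to handle nested fields and only synthesize values for fields that were actually excluded from _source, which is where most of the complexity discussed later in this thread comes from.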

Issues Resolved

#1087
#1572

  • First, I made KNNVectorDVLeafFieldData return the vector doc values fields, the same way the script path does.
  • Second, I wrote a KNNFetchSubPhase class that adds a processor to the fetch phase and fills _source with the doc values fields from the first step. This is similar to synthetic source, but the values must be requested explicitly in the search body via docvalue_fields.

Check List

  • New functionality includes testing.
    • All tests pass
  • New functionality has been documented.
    • New functionality has javadoc added
  • Commits are signed as per the DCO using --signoff

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

@luyuncheng (Collaborator, Author):

@navneet1v Easy test:

  1. Create an index with
 {"mappings":{"_source": {"excludes": ["vector_field1"] }, "properties": {"vector_field1": {"type": "knn_vector", "dimension": 2 }, "vector_field2": {"type": "knn_vector", "dimension": 4 }, "number_field": {"type": "long"} } } }
  2. Write data with
{"vector_field1" : [1.5, 2.5], "vector_field2" : [1.0, 2.0, 3.0, 4.0], "number_field":10 }
  3. POST test/_search
The response does not contain vector_field1.
  4. POST test/_search
{"docvalue_fields": ["vector_field1"] }
The response contains vector_field1 in _source and in fields.

@navneet1v (Collaborator):

> @navneet1v Easy test:
>
> 1. Create an index with
> {"mappings":{"_source": {"excludes": ["vector_field1"] }, "properties": {"vector_field1": {"type": "knn_vector", "dimension": 2 }, "vector_field2": {"type": "knn_vector", "dimension": 4 }, "number_field": {"type": "long"} } } }
> 2. Write data with
> {"vector_field1" : [1.5, 2.5], "vector_field2" : [1.0, 2.0, 3.0, 4.0], "number_field":10 }
> 3. POST test/_search
> The response does not contain vector_field1.
> 4. POST test/_search
> {"docvalue_fields": ["vector_field1"] }
> The response contains vector_field1 in _source and in fields.

My question was: how are we ensuring that the KNN subphase does not run during search and runs only during re-indexing?

@luyuncheng (Collaborator, Author):

> My question was: how are we ensuring that the KNN subphase does not run during search and runs only during re-indexing?

@navneet1v Gotcha, I will run further tests such as reindex and other scenarios.

@navneet1v (Collaborator):

> My question was: how are we ensuring that the KNN subphase does not run during search and runs only during re-indexing?
>
> @navneet1v Gotcha, I will run further tests such as reindex and other scenarios.

Also, there is something called _recovery_source, which is added as a fallback to support re-indexing. If you are testing locally, I would recommend removing these lines of code:
https://github.com/opensearch-project/OpenSearch/blob/e6975e412b09a8d82675edd9a43c20f7c325c0f9/server/src/main/java/org/opensearch/index/mapper/SourceFieldMapper.java#L215-L219

to ensure that the recovery source is never created. The recovery source gets deleted after some time if indexing is happening continuously, but I have never tested this to confirm whether that really happens.

@luyuncheng (Collaborator, Author):

> My question was: how are we ensuring that the KNN subphase does not run during search and runs only during re-indexing?

@navneet1v I tested the search-source and reindex scenarios with KNNFetchSubPhase; it behaves correctly when there are no nested fields.

@luyuncheng (Collaborator, Author) commented Apr 7, 2024

> Why can't we have public BytesRef nextValue() throws IOException { return the whole string representation of the current vector?

@jmazanec15 As I see it, the synthetic source field is of type XContentType.JSON.xContent, but SortedBinaryDocValues#nextValue() returns a binary byte array (BytesRef). I am not sure whether we need to translate the binary bytes in the DocValuesFormat into a UTF-8, JSON-like byte string.

Also, I see we are trying to reconstruct the vector format with KnnVectorsFormat, so I think that in KnnVectorsFormat we could simply rewrite the doc values in JSON.xContent format.

But I am not sure which SortedBinaryDocValues#nextValue() format is better (binary bytes, or one value at a time).

Which do you think is better?
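For illustration only (this is not code from the PR), the two candidate nextValue() encodings being compared could look like this; the plain float32 layout in the binary variant is an assumption:

import org.apache.lucene.util.BytesRef;

import java.nio.ByteBuffer;
import java.util.Arrays;

final class VectorBytesRefEncodings {

    // Option 1: raw binary bytes, roughly how the doc-values format stores the vector today.
    static BytesRef asBinary(float[] vector) {
        ByteBuffer buffer = ByteBuffer.allocate(vector.length * Float.BYTES);
        for (float v : vector) {
            buffer.putFloat(v);
        }
        return new BytesRef(buffer.array());
    }

    // Option 2: a UTF-8 string such as "[1.5, 2.5]", which is what a JSON
    // (XContentType.JSON.xContent) synthetic-source consumer would expect.
    static BytesRef asJsonString(float[] vector) {
        return new BytesRef(Arrays.toString(vector));
    }
}

The trade-off is that the binary form avoids a transcoding step, while the string form can be dropped straight into a JSON _source.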

@luyuncheng (Collaborator, Author) commented Apr 7, 2024

> Also, there is something called _recovery_source, which is added as a fallback to support re-indexing.

@navneet1v I added integration tests for the search and reindex scenarios; I think synthesizing the _source field with KNNFetchSubPhase works.

@luyuncheng (Collaborator, Author):

@navneet1v @jmazanec15 I resolved all review comments and all tests pass.

@bugmakerrrrrr (Contributor) commented May 27, 2024

AFAIK, the plugin's sub fetch phases will run after the OS core engine's sub fetch phases, which include the FetchSourcePhase used for source filtering. Therefore, if I exclude a vector field (e.g. field1) in the index mappings and activate the synthetic logic introduced in this PR, I believe that when I submit a search request with source filtering ("_source": {"exclude": "field1"}), field1 will still be present in the response.

In the current OS implementation, sub fetch phases have no ordering concept like ActionFilter, so we can't arrange them. We may need to introduce the filter logic in KNNFetchSubPhase if we don't want to change the logic of the core engine.

Please let me know if I have misunderstood it.

@luyuncheng (Collaborator, Author) commented May 27, 2024

> AFAIK, the plugin's sub fetch phases will run after the OS core engine's sub fetch phases, which include the FetchSourcePhase used for source filtering. Therefore, if I exclude a vector field (e.g. field1) in the index mappings and activate the synthetic logic introduced in this PR, I believe that when I submit a search request with source filtering ("_source": {"exclude": "field1"}), field1 will still be present in the response. Please let me know if I have misunderstood it.

@bugmakerrrrrr
Synthetic source here is about reducing disk usage: in the majority of scenarios we use vectors for search, not for fetch, but we still need them for reindex. At the same time, we can reduce the I/O usage of the _source stored field. So KNNFetchSubPhase makes this work when the vector field is excluded from the mapping's _source, like the following:

"mappings":{"_source": {"excludes": ["vector_field1"] } } }

@luyuncheng requested a review from navneet1v May 29, 2024 15:52
@navneet1v (Collaborator):

> In the current OS implementation, sub fetch phases have no ordering concept like ActionFilter, so we can't arrange them. We may need to introduce the filter logic in KNNFetchSubPhase if we don't want to change the logic of the core engine.

@bugmakerrrrrr

The fetch subphases run in a defined order: first all the fetch subphases defined in OpenSearch core run, and then all the plugin phases run. The only exception to this is the inner hits fetch phase, which runs at the end. Ref: https://github.com/opensearch-project/OpenSearch/blob/52b27f47bca5b3ab52cab237542f32c307d203b4/server/src/main/java/org/opensearch/search/fetch/FetchPhase.java#L104-L107 The order of these phases cannot be changed.

@luyuncheng (Collaborator, Author):

> The only exception to this is the inner hits fetch phase, which runs at the end. Ref: https://github.com/opensearch-project/OpenSearch/blob/52b27f47bca5b3ab52cab237542f32c307d203b4/server/src/main/java/org/opensearch/search/fetch/FetchPhase.java#L104-L107 The order of these phases cannot be changed.

@navneet1v As I read the logic of the inner hits fetch phase, it is used to fill in the inner_hits field of the response:

"_source": {}
"inner_hits": {
  "<inner_hits_name>": {
    "hits": {
        "total": ...,
        "hits": [
          {
            "_type": ...,
            "_id": ...,
            "_source": ...,
             ...
          },
       ]
    }
  }
}

But KNNFetchSubPhase fills in the top-level _source field of the response. So the exception is that we cannot fill in the _source field inside inner_hits, but we can still get the parent top-level _source field.

@bugmakerrrrrr (Contributor):

> The order of these phases cannot be changed.

@navneet1v Indeed, this is the key point that I want to emphasize, and it is precisely why I suggest that we consider incorporating the filter logic that you mentioned in your comment into the KNNFetchSubPhase. Otherwise, it will cause conflicts at the API level (I requested to exclude certain fields in the response, but they appeared in the response). Or if it is too complex to implement the filter logic, we can consider it as a limitation and clearly mark it in the document.

@luyuncheng (Collaborator, Author):

> > The order of these phases cannot be changed.
>
> @navneet1v Indeed, this is the key point that I want to emphasize, and it is precisely why I suggest that we consider incorporating the filter logic that you mentioned in your comment into the KNNFetchSubPhase. Otherwise, it will cause conflicts at the API level (I requested to exclude certain fields in the response, but they appeared in the response). Or if it is too complex to implement the filter logic, we can consider it as a limitation and clearly mark it in the document.

LGTM, I like it.

@navneet1v (Collaborator):

> > The order of these phases cannot be changed.
>
> @navneet1v Indeed, this is the key point that I want to emphasize, and it is precisely why I suggest that we consider incorporating the filter logic that you mentioned in your comment into the KNNFetchSubPhase. Otherwise, it will cause conflicts at the API level (I requested to exclude certain fields in the response, but they appeared in the response). Or if it is too complex to implement the filter logic, we can consider it as a limitation and clearly mark it in the document.

@luyuncheng and @bugmakerrrrrr agreed.

@bugmakerrrrrr (Contributor) left a comment

@navneet1v @luyuncheng I've checked the filter logic in FetchSourcePhase, and I think that it's too complicated to implement in this subphase.

@navneet1v (Collaborator):

@luyuncheng can we fix up the comments so that we can merge this change?

Signed-off-by: luyuncheng <[email protected]>
@luyuncheng (Collaborator, Author):

> @luyuncheng can we fix up the comments so that we can merge this change?

@navneet1v FIXED at 2a61fcd

@jmazanec15 (Member):

Thanks @luyuncheng. I have been reviewing and I think overall it looks good. I'm still not confident about the nested portion, particularly innerProcessOneNestedField. Could you add comments or explain more about what's happening there?

Also, can we capture a list of known limitations in the issue? Somewhere we can refer to, when developing the documentation, for what can and cannot be done with this feature. From my testing, here is what I have:

  1. [nested] Passing inner_hits: {} for the query results in an exception - inner_hits will not work
  2. [nested] A nested-field partial doc that only has the vector, with source excluded (from comment)
  3. [non-nested] If source is disabled for the mapping completely (i.e. "mappings": {"_source": {"enabled": false, "recovery_source_enabled": false}}), synthetic source will not work. So, in order for synthetic source to work, the vector fields need to be excluded explicitly

Also, do we know if it works with partially constructed non-nested documents? Are there any other limitations for the non-nested case?

The functionality that will work:

  1. Basic search when the vector field is excluded from source; the vectors will be included
  2. Basic nested search; the vectors will show up
  3. Reindex of non-nested indices
  4. Reindex of nested indices

@jmazanec15 (Member):

@luyuncheng I'm going to work on this one a little bit and see if I can add it to 2.18! Will open up a new PR.

@luyuncheng (Collaborator, Author):

> @luyuncheng I'm going to work on this one a little bit and see if I can add it to 2.18! Will open up a new PR.

@jmazanec15 How about I create a new PR and rebase it on master?

@jmazanec15 (Member):

@luyuncheng I rebased and started experimenting with it here: https://github.com/jmazanec15/k-NN-1/commits/vector-synthetic-source/. Please take a look! I think before raising a new PR, it'd be best to figure out a couple of big, high-level approach questions - just so we don't end up going back and forth on revisions too much.

Currently, I have a few concerns with implementing synthetic source as a fetch subphase:

  1. The order in which fetch sub phases are executed is non-deterministic. So, if there is a feature, say highlighting, that has its own fetch sub phase, the order in which the fetch subphases are processed will determine whether the features work together or not. Thus, it will be difficult to ensure robustness.
  2. I am not sure whether this approach will work with the Field Level Security feature. Field level security will hide certain fields from users. I am not sure if this approach will circumvent that security mechanism and therefore present a vulnerability. As an initial step, we could just call out that this does not work with field level security, but blocking this explicitly may be somewhat tricky.

I was discussing with @cwperks the field level security implementation and I thought it was pretty interesting and a similar strategy might be better for our use case. They implement the onIndexModule.setReaderWrapper() (see https://github.com/opensearch-project/security/blob/main/src/main/java/org/opensearch/security/OpenSearchSecurityPlugin.java#L698). Then, when reading certain fields, they will filter them out. For instance, for source: https://github.com/opensearch-project/security/blob/main/src/main/java/org/opensearch/security/configuration/DlsFlsFilterLeafReader.java#L652-L669. Given that our use case is just the opposite (we want to add fields when they are not present), it seems like this overall approach might make sense and give us a more robust solution that is compatible with a lot of features by default.

The major issue with this, however, is that indexModule.setReaderWrapper() can only be set once (https://github.com/opensearch-project/OpenSearch/blob/main/server/src/main/java/org/opensearch/index/IndexModule.java#L443-L459). Also, in the javadoc, it says "The wrapped reader can filter out document just like delete documents etc. but must not change any term or document content." I might be misinterpreting this - but it would seem like FLS might be breaking this contract (@cwperks am I incorrect here?)

In order for this to work, I think we would need to somehow apply the synthetic source injection before the security fls wrapper does.

That being said, I am wondering if we should put an extension point in OpenSearch core that will allow fields to inject into source here in a similar manner to how FLS security is implemented.
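For readers unfamiliar with that extension point, a rough sketch of the wiring under discussion is below. Plugin#onIndexModule and IndexModule#setReaderWrapper are real OpenSearch hooks, but the exact functional signature is written from memory and may differ across versions, and the reader wrapper that would actually inject vectors into _source is left as a placeholder:

import org.apache.lucene.index.DirectoryReader;
import org.opensearch.index.IndexModule;
import org.opensearch.plugins.Plugin;

public class SyntheticVectorSourcePlugin extends Plugin {

    @Override
    public void onIndexModule(IndexModule indexModule) {
        // The reader wrapper can only be set once per index module, which is exactly
        // the conflict with the security plugin noted above.
        indexModule.setReaderWrapper(indexService -> reader -> wrapWithSyntheticVectorSource(reader));
    }

    private static DirectoryReader wrapWithSyntheticVectorSource(DirectoryReader reader) {
        // Placeholder: a real implementation would return a FilterDirectoryReader whose
        // leaves add excluded vector fields back into the _source stored field.
        return reader;
    }
}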

@cwperks (Member) commented Oct 17, 2024

> "The wrapped reader can filter out document just like delete documents etc. but must not change any term or document content." I might be misinterpreting this - but it would seem like FLS might be breaking this contract (@cwperks am I incorrect here?)

^ That does appear to be the case. I'd have to dive into the change that introduced that comment to understand the motivation. It looks like it's a change from before the fork. FLS/FieldMasking does not change the stored data, but it does modify the result. In the case of FieldMasking it masks the value returned, and with FLS it can choose to exclude fields from the result.

@luyuncheng (Collaborator, Author):

> The order in which fetch sub phases are executed is non-deterministic. So, if there is a feature, say highlighting, that has its own fetch sub phase, the order in which the fetch subphases are processed will determine whether the features work together or not. Thus, it will be difficult to ensure robustness.

@jmazanec15 As the following code shows:

https://github.com/opensearch-project/OpenSearch/blob/0419e5d8a5b5327663c09e93feb931281da7b64e/server/src/main/java/org/opensearch/search/SearchModule.java#L1060-L1073

https://github.com/opensearch-project/OpenSearch/blob/0419e5d8a5b5327663c09e93feb931281da7b64e/server/src/main/java/org/opensearch/search/fetch/FetchPhase.java#L198-L211

the highlight phase is added before the plugins' fetch subphases, so there are some limitations for plugin subphases. Maybe we can add an explicit synthetic phase after FetchSourcePhase.

@luyuncheng (Collaborator, Author):

> I am not sure whether this approach will work with the Field Level Security feature. Field level security will hide certain fields from users. I am not sure if this approach will circumvent that security mechanism and therefore present a vulnerability. As an initial step, we could just call out that this does not work with field level security, but blocking this explicitly may be somewhat tricky.

@jmazanec15 @cwperks Hey, if we wrap a SecurityFlsDlsIndexSearcherWrapper on the data node for field-level security, why not wrap a fetch phase on the coordinator node? It would do less verification work, because the final hits are fewer than all the docs the collector sees.

@jmazanec15 (Member):

@luyuncheng I had an alternative approach that I figured might let us cover more cases around fetch - for instance, other processors implementing fetch subphases. I'm curious to hear your thoughts on it.

Currently, we already have our own custom codec. What if we created our own custom StoredFieldsFormat? The format would need to be incredibly lightweight - it would implement a delegate pattern over the upstream format. Then, for the StoredFieldsReader (which implements StoredFields), we override document:

    private final BiConsumer<Integer, BytesReference> sourceConsumer;

    @Override
    public void document(int docId, StoredFieldVisitor storedFieldVisitor) throws IOException {
        delegate.document(docId, storedFieldVisitor);
        if (!(storedFieldVisitor instanceof FieldsVisitor)) {
            return;
        }
        sourceConsumer.accept(docId, ((FieldsVisitor) storedFieldVisitor).source());
    }

Then, we can configure the sourceConsumer to manipulate the source via other formats such as doc values reader or vector values reader.
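For concreteness, a self-contained sketch of what the format-level delegation could look like is below. The Lucene StoredFieldsFormat / StoredFieldsReader signatures are real, but the class names and the empty rewrite hook are illustrative only, not code from this PR:

import org.apache.lucene.codecs.StoredFieldsFormat;
import org.apache.lucene.codecs.StoredFieldsReader;
import org.apache.lucene.codecs.StoredFieldsWriter;
import org.apache.lucene.index.FieldInfos;
import org.apache.lucene.index.SegmentInfo;
import org.apache.lucene.index.StoredFieldVisitor;
import org.apache.lucene.store.Directory;
import org.apache.lucene.store.IOContext;

import java.io.IOException;

public class KNNDelegatingStoredFieldsFormat extends StoredFieldsFormat {

    private final StoredFieldsFormat delegate;

    public KNNDelegatingStoredFieldsFormat(StoredFieldsFormat delegate) {
        this.delegate = delegate;
    }

    @Override
    public StoredFieldsReader fieldsReader(Directory directory, SegmentInfo si, FieldInfos fn, IOContext context) throws IOException {
        // Wrap the delegate reader so document(...) gets a chance to rewrite _source.
        return new SourceRewritingStoredFieldsReader(delegate.fieldsReader(directory, si, fn, context));
    }

    @Override
    public StoredFieldsWriter fieldsWriter(Directory directory, SegmentInfo si, IOContext context) throws IOException {
        // The write path is untouched; only reads inject vectors back into _source.
        return delegate.fieldsWriter(directory, si, context);
    }

    private static final class SourceRewritingStoredFieldsReader extends StoredFieldsReader {
        private final StoredFieldsReader delegate;

        SourceRewritingStoredFieldsReader(StoredFieldsReader delegate) {
            this.delegate = delegate;
        }

        @Override
        public void document(int docID, StoredFieldVisitor visitor) throws IOException {
            // Same hook as the snippet above: delegate first, then (for a FieldsVisitor)
            // rewrite the _source bytes with the synthetic vector values.
            delegate.document(docID, visitor);
        }

        @Override
        public StoredFieldsReader clone() {
            return new SourceRewritingStoredFieldsReader(delegate.clone());
        }

        @Override
        public void checkIntegrity() throws IOException {
            delegate.checkIntegrity();
        }

        @Override
        public void close() throws IOException {
            delegate.close();
        }
    }
}

Whatever class ends up owning the rewrite hook would be where the sourceConsumer described above plugs in.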

Similarly, in the future, we could think about doing the same on the write side so that we can automatically disable source for vector fields by default.

This approach would allow us to:

  1. Avoid non-deterministic behavior for ordering around fetch subphases by intercepting source at a lower layer
  2. Easier support for FLS (I believe filtering happens in the visitor phase, so we would need to ensure that we are not accidentally adding it back in)
  3. Automatically disable source for vector fields without needing users to specify the excludes flag. This would let us avoid telling users to disable source in our docs, making for a smoother, more performant out-of-the-box experience.

That being said, I'm not sure about:

  1. Having dependencies across formats
  2. Whether casting to FieldsVisitor covers all functionality

Does this approach sound reasonable @navneet1v @shatejas @heemin32?

@luyuncheng (Collaborator, Author) commented Oct 22, 2024

> Currently, we already have our own custom codec. What if we created our own custom StoredFieldsFormat? The format would need to be incredibly lightweight - it would implement a delegate pattern over the upstream format. Then, for the StoredFieldsReader (which implements StoredFields), we override document:

@jmazanec15 Let me describe my understanding of how a new StoredFieldsFormat would be used:

  1. Create a new index with a mapping that excludes the vector field
  2. Mark the vector field as stored, then use the new StoredFieldsFormat to store the knn_vector
  3. Finally, load the data from the k-NN file?
PUT vector_index
{
  "mappings": {
    "_source": {
      "excludes": [
        "vector_field"
      ]
    },
    "properties": {
      "vector_field": {
        "type": "knn_vector",
        "dimension": 2,
        "store": true
      }
    }
  }
}

As in my proposal for this PR, I just want to cut the disk usage and reuse the data already in doc values; in the majority of cases we do not want to retrieve knn_vector from _source.

I like your idea, and it has the advantages you mentioned, but I do not see how a new StoredFieldsFormat would save disk usage.

@jmazanec15 (Member) commented Oct 22, 2024

> I like your idea, and it has the advantages you mentioned, but I do not see how a new StoredFieldsFormat would save disk usage.

It would operate in the same way - exclude the vector field from source in the mapping. This would then save on disk. @luyuncheng Internally, the "source" is just stored as a stored field in Lucene. So, from StoredFieldsReader, we are able to access any stored fields, including source.

Then, in the CustomStoredFieldsReader:

    @Override
    public void document(int docId, StoredFieldVisitor storedFieldVisitor) throws IOException {
        delegate.document(docId, storedFieldVisitor);
        if (!(storedFieldVisitor instanceof FieldsVisitor)) {
            return;
        }
        BytesReference originalSource = ((FieldsVisitor) storedFieldVisitor).source();
        BytesReference syntheticVector = getVectorFromDocValuesOrVectorValues(docId, field);
        putSyntheticVectorIntoSource(originalSource, syntheticVector, field);
    }

The FieldsVisitor contains the source that will be returned.

So, this would let us basically modify the source as early as possible. Thus, we would be able to support as many other features relying on source as possible.
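As a side note on the merge step itself: once the raw _source bytes have been parsed into a map (the XContent parse and re-serialize round-trip is omitted here), the hypothetical putSyntheticVectorIntoSource boils down to something as simple as this sketch:

import java.util.ArrayList;
import java.util.List;
import java.util.Map;

final class SyntheticVectorSource {

    /** Adds the vector read from doc values / vector values back into the parsed source map. */
    static void putSyntheticVectorIntoSource(Map<String, Object> source, float[] vector, String field) {
        List<Float> values = new ArrayList<>(vector.length);
        for (float v : vector) {
            values.add(v);
        }
        // Only add the vector if the field was excluded from _source; never overwrite a
        // value that the user actually stored.
        source.putIfAbsent(field, values);
    }
}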

@jmazanec15 (Member):

As an update - I validated that the FetchSubPhase approach will work with field level security on this branch: https://github.com/jmazanec15/k-NN-1/tree/vector-synthetic-source.

So, my only concern remaining with this approach is:

> The order in which fetch sub phases are executed is non-deterministic. So, if there is a feature, say highlighting, that has its own fetch sub phase, the order in which the fetch subphases are processed will determine whether the features work together or not. Thus, it will be difficult to ensure robustness.

As @luyuncheng mentioned, we do not need to worry about this for core fetch subphases, but it could be problematic for non-core subphases. Also, I'm wondering if there are any features out there that do not read the source via the fetch phase routine.

@luyuncheng (Collaborator, Author) commented Oct 23, 2024

> As an update - I validated that the FetchSubPhase approach will work with field level security on this branch: https://github.com/jmazanec15/k-NN-1/tree/vector-synthetic-source.

@jmazanec15 So, whether or not we introduce a new StoredFieldsFormat, it can work with field level security in the FetchSubPhase, AND we still need a FetchSubPhase for reindex.

Because stored_fields are returned as in the following example, they would not let us reindex from _source:

==> CREATE
PUT my-index-000001
{
  "mappings": {
    "_source": {
      "excludes": [
        "vector_field"
      ]
    },
    "properties": {
      "vector_field": {
        "type": "knn_vector",
        "dimension": 2,
        "store": true
      }
    }
  }
}
==> SEARCH
GET my-index-000001/_search
{
  "stored_fields": [ "vector_field"] 
}
==> RESPONSE
{
  "hits": {
    "hits": [
      {
        "_index": "my-index-000001",
        "_id": "1",
        "_source": {
              .....
        },
        "fields": {
          "vector_field": [ .... ]
        }
      }
    ]
  }
}

@jmazanec15 (Member):

@luyuncheng Not sure I'm following completely.

> Because stored_fields are returned as in the following example, they would not let us reindex from _source

I think there is some confusion around Stored Fields from an OpenSearch user perspective and from a Lucene perspective. The _source field is stored in Lucene as a stored field. See here:

// fieldType().name() is "_source"
context.doc().add(new StoredField(fieldType().name(), ref.bytes, ref.offset, ref.length));

So, the "_source" is fetched by calling the StoredFieldsReader.document - See this FieldVisitor.

So, if we implement our own StoredFieldsReader, we have a chance to intercept the "_source" stored field on the Lucene level. FLS does something similar here: https://github.com/opensearch-project/security/blob/main/src/main/java/org/opensearch/security/configuration/DlsFlsFilterLeafReader.java#L89.
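As a small illustration of the visitor mechanics (not code from this PR or from the security plugin): a much-simplified analogue of the FieldsVisitor mentioned above, which loads only the _source stored field, could look like this:

import org.apache.lucene.index.FieldInfo;
import org.apache.lucene.index.StoredFieldVisitor;

import java.io.IOException;

final class SourceOnlyVisitor extends StoredFieldVisitor {

    private byte[] source;

    @Override
    public Status needsField(FieldInfo fieldInfo) {
        // Only load the "_source" stored field; skip everything else.
        return "_source".equals(fieldInfo.name) ? Status.YES : Status.NO;
    }

    @Override
    public void binaryField(FieldInfo fieldInfo, byte[] value) throws IOException {
        this.source = value;
    }

    byte[] source() {
        return source;
    }
}

A reader wrapper that intercepts document(...) therefore sees the _source bytes exactly at the point where such a visitor receives them.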

So, taking the stored fields approach,

PUT my-index-000001
{
  "mappings": {
    "_source": {
      "excludes": [
        "vector_field"
      ]
    },
    "properties": {
      "vector_field": {
        "type": "knn_vector",
        "dimension": 2
      }
    }
  }
}

// This would still return vector_field in the source
GET my-index-000001/_search
{
...
}
==> RESPONSE
{
  "hits": {
    "hits": [
      {
        "_index": "my-index-000001",
        "_id": "1",
        "_source": {
              "vector_field": ..
        }
      }
    ]
  }
}
