(feat)First step towards full xml-configuration support #915

thomasht86 · 2024-09-16T12:37:54Z

I confirm that this contribution is made under the terms of the license found in the root directory of this repository's source tree and that I have the authority necessary to make this contribution on behalf of its copyright owner.

Why

Supporting tags deep in the XML tree (see #895) takes a lot of code, and with our current Jinja-template approach every new tag requires significant effort.

This PR prepares pyvespa to support any services.xml configuration, taking inspiration from the approach used by FastHTML.

How

With the approach in this PR, we only need to provide a list of valid XML tags for each configuration file, and all the classes are auto-generated.

For example, to express the following services.xml:

<?xml version="1.0" encoding="utf-8" ?>
<services version='1.0' xmlns:deploy="vespa" xmlns:preprocess="properties">

  <container id='default' version='1.0'>
    <nodes count='1'/>
    <component id='ai.vespa.examples.Centroids' bundle='billion-scale-image-search'/>
    <component id='ai.vespa.examples.DimensionReducer' bundle='billion-scale-image-search'/>
    <component id="ai.vespa.examples.BPETokenizer" bundle='billion-scale-image-search'>
      <config name="ai.vespa.examples.bpe-tokenizer">
        <contextlength>77</contextlength>
        <vocabulary>files/bpe_simple_vocab_16e6.txt.gz</vocabulary>
      </config>
    </component>
    <model-evaluation>
      <onnx>
        <models>
          <model name="text_transformer">
            <intraop-threads>1</intraop-threads>
          </model>
          <model name="vespa_innerproduct_ranker">
            <intraop-threads>1</intraop-threads>
          </model>
        </models>
      </onnx>
    </model-evaluation>
    <search>
      <chain id='default' inherits='vespa'>
        <searcher id='ai.vespa.examples.searcher.DeDupingSearcher' bundle='billion-scale-image-search'/>
        <searcher id='ai.vespa.examples.searcher.RankingSearcher' bundle='billion-scale-image-search'/>
        <searcher id="ai.vespa.examples.searcher.CLIPEmbeddingSearcher" bundle="billion-scale-image-search"/>
        <searcher id='ai.vespa.examples.searcher.SPANNSearcher' bundle='billion-scale-image-search'/>
      </chain>
    </search>
    <document-api/>
    <document-processing>
      <chain id='neighbor-assigner' inherits='indexing'>
        <documentprocessor id='ai.vespa.examples.docproc.DimensionReductionDocProc'
                           bundle='billion-scale-image-search'/>
        <documentprocessor id='ai.vespa.examples.docproc.AssignCentroidsDocProc'
                           bundle='billion-scale-image-search'/>
      </chain>
    </document-processing>
  </container>

  <content id='graph' version='1.0'>
    <min-redundancy>1</min-redundancy>
    <documents>
      <document mode='index' type='centroid'/>
      <document-processing cluster='default' chain='neighbor-assigner'/>
    </documents>
    <nodes count='1'/>
    <engine>
      <proton>
        <tuning>
          <searchnode>
            <feeding>
              <concurrency>1.0</concurrency>
            </feeding>
          </searchnode>
        </tuning>
      </proton>
    </engine>
  </content>

  <content id='if' version='1.0'>
    <min-redundancy>1</min-redundancy>
    <documents>
      <document mode='index' type='image'/>
      <document-processing cluster='default' chain='neighbor-assigner'/>
    </documents>
    <nodes count='1'/>
    <engine>
      <proton>
        <tuning>
          <searchnode>
            <requestthreads>
              <persearch>2</persearch>
            </requestthreads>
            <feeding>
              <concurrency>1.0</concurrency>
            </feeding>
            <summary>
              <io>
                <read>directio</read>
              </io>
              <store>
                <cache>
                  <maxsize-percent>5</maxsize-percent>
                  <compression>
                    <type>lz4</type>
                  </compression>
                </cache>
                <logstore>
                  <chunk>
                    <maxsize>16384</maxsize>
                    <compression>
                      <type>zstd</type>
                      <level>3</level>
                    </compression>
                  </chunk>
                </logstore>
              </store>
            </summary>
          </searchnode>
        </tuning>
      </proton>
    </engine>
  </content>
</services>

we can write this in Python:

generated_services = services(
            container(id="default", version="1.0")(
                nodes(count="1"),
                component(
                    id="ai.vespa.examples.Centroids",
                    bundle="billion-scale-image-search",
                ),
                component(
                    id="ai.vespa.examples.DimensionReducer",
                    bundle="billion-scale-image-search",
                ),
                component(
                    id="ai.vespa.examples.BPETokenizer",
                    bundle="billion-scale-image-search",
                )(
                    config(name="ai.vespa.examples.bpe-tokenizer")(
                        vt(
                            "contextlength", "77"
                        ),  # using vt as this is not a predefined tag
                        vt(
                            "vocabulary", "files/bpe_simple_vocab_16e6.txt.gz"
                        ),  # using vt as this is not a predefined tag
                    ),
                ),
                model_evaluation(
                    onnx(
                        models(
                            model(name="text_transformer")(intraop_threads("1")),
                            model(name="vespa_innerproduct_ranker")(
                                intraop_threads("1")
                            ),
                        ),
                    ),
                ),
                search(
                    chain(id="default", inherits="vespa")(
                        searcher(
                            id="ai.vespa.examples.searcher.DeDupingSearcher",
                            bundle="billion-scale-image-search",
                        ),
                        searcher(
                            id="ai.vespa.examples.searcher.RankingSearcher",
                            bundle="billion-scale-image-search",
                        ),
                        searcher(
                            id="ai.vespa.examples.searcher.CLIPEmbeddingSearcher",
                            bundle="billion-scale-image-search",
                        ),
                        searcher(
                            id="ai.vespa.examples.searcher.SPANNSearcher",
                            bundle="billion-scale-image-search",
                        ),
                    ),
                ),
                document_api(),
                document_processing(
                    chain(id="neighbor-assigner", inherits="indexing")(
                        documentprocessor(
                            id="ai.vespa.examples.docproc.DimensionReductionDocProc",
                            bundle="billion-scale-image-search",
                        ),
                        documentprocessor(
                            id="ai.vespa.examples.docproc.AssignCentroidsDocProc",
                            bundle="billion-scale-image-search",
                        ),
                    ),
                ),
            ),
            content(id="graph", version="1.0")(
                min_redundancy("1"),
                documents(
                    document(mode="index", type="centroid"),
                    document_processing(cluster="default", chain="neighbor-assigner"),
                ),
                nodes(count="1"),
                engine(
                    proton(
                        tuning(
                            searchnode(
                                feeding(concurrency("1.0")),
                            ),
                        ),
                    ),
                ),
            ),
            content(id="if", version="1.0")(
                min_redundancy("1"),
                documents(
                    document(mode="index", type="image"),
                    document_processing(cluster="default", chain="neighbor-assigner"),
                ),
                nodes(count="1"),
                engine(
                    proton(
                        tuning(
                            searchnode(
                                requestthreads(persearch("2")),
                                feeding(concurrency("1.0")),
                                summary(
                                    io(read("directio")),
                                    store(
                                        cache(
                                            maxsize_percent("5"),
                                            compression(
                                                vt_type("lz4")
                                            ),  # using vt_type to avoid shadowing Python's built-in type
                                        ),
                                        logstore(
                                            chunk(
                                                maxsize("16384"),
                                                compression(
                                                    vt_type(
                                                        "zstd"
                                                    ),  # using vt_type to avoid shadowing Python's built-in type
                                                    level("3"),
                                                ),
                                            ),
                                        ),
                                    ),
                                ),
                            ),
                        ),
                    ),
                ),
            ),
            version="1.0",
        )
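
The call syntax above, `tag(attributes)(children)`, can be produced from nothing more than a list of tag names. The following is a minimal sketch of that idea, not the implementation in this PR: the `Tag` class, the `make_tag` helper and the short `TAG_NAMES` list are hypothetical names chosen for illustration.

```python
from xml.sax.saxutils import escape, quoteattr


class Tag:
    """Hypothetical node type: a tag name, child nodes, and attributes."""

    def __init__(self, name, children=(), attrs=None):
        self.name = name
        self.children = list(children)
        self.attrs = dict(attrs or {})

    def __call__(self, *children, **attrs):
        # Calling an existing node adds children, enabling tag(attr=...)(child, ...)
        self.children.extend(children)
        self.attrs.update(attrs)
        return self

    def to_xml(self):
        # Serialize without pretty-printing; empty elements are self-closed
        attrs = "".join(f" {k}={quoteattr(str(v))}" for k, v in self.attrs.items())
        if not self.children:
            return f"<{self.name}{attrs}/>"
        body = "".join(
            c.to_xml() if isinstance(c, Tag) else escape(str(c))
            for c in self.children
        )
        return f"<{self.name}{attrs}>{body}</{self.name}>"


def make_tag(xml_name):
    """Build a constructor function for one XML tag name."""

    def constructor(*children, **attrs):
        return Tag(xml_name, children, attrs)

    # Hyphens are not valid in Python identifiers, so map them to underscores
    constructor.__name__ = xml_name.replace("-", "_")
    return constructor


# Assumed, shortened tag list; a real list would come from the schema
TAG_NAMES = ["services", "container", "nodes", "document-api", "min-redundancy"]
services, container, nodes, document_api, min_redundancy = (
    make_tag(name) for name in TAG_NAMES
)

doc = services(
    container(id="default", version="1.0")(nodes(count="1"), document_api()),
    version="1.0",
)
xml_text = doc.to_xml()
```

Because each constructor is generated from its tag name, supporting a new tag in this sketch only means adding one string to the list.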

Added unit tests that check the generated XML is equal to the original XML.
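
Comparing XML as raw strings would fail on differences that do not matter, such as quote style, attribute order or indentation. One way to compare two documents structurally, shown here as a sketch using only the standard library and not necessarily how the tests in this PR do it, is to canonicalize both sides first. The helper name `xml_equal` is hypothetical.

```python
from xml.etree.ElementTree import canonicalize


def xml_equal(a: str, b: str) -> bool:
    """Compare two XML strings after C14N canonicalization.

    strip_text=True drops whitespace around text content, so indentation
    differences between the two documents are ignored.
    """
    return canonicalize(xml_data=a, strip_text=True) == canonicalize(
        xml_data=b, strip_text=True
    )


original = """<content id='graph' version='1.0'>
  <min-redundancy>1</min-redundancy>
  <nodes count='1'/>
</content>"""

# Same structure with different quoting, attribute order and whitespace
generated = (
    '<content version="1.0" id="graph">'
    "<min-redundancy>1</min-redundancy>"
    '<nodes count="1"></nodes>'
    "</content>"
)

same = xml_equal(original, generated)
different = xml_equal(original, generated.replace(">1<", ">2<"))
```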

Notes

A separate PR will need to integrate this functionality while remaining compatible with the existing approach.
This approach can also be reused for other XML configuration files.
This PR only contains the foundation; for it to take effect, ApplicationPackage().services_to_text must be adapted to use this functionality.
An added bonus is that we can now validate any of the XML configuration files against the RELAX NG schema.
This also forms the foundation to allow us to generate pyvespa code from xml, like it is done here
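
To illustrate the last note, generating builder code from XML can be sketched as a recursive walk over the parsed tree. This is an assumption-laden illustration, not code from this PR: the function names `element_to_code` and `xml_to_code` are hypothetical, and the sketch ignores namespaces, attribute names that are not valid Python identifiers, and tag names that need special handling such as `type`.

```python
import xml.etree.ElementTree as ET


def element_to_code(elem, indent=0):
    """Render one element as a nested constructor call (hypothetical helper)."""
    pad = " " * indent
    name = elem.tag.replace("-", "_")  # hyphens are not valid in identifiers
    attrs = ", ".join(f"{k}={v!r}" for k, v in elem.attrib.items())
    children = list(elem)
    text = (elem.text or "").strip()

    if not children and not text:
        # Empty element: attributes only
        return f"{pad}{name}({attrs})"
    if children:
        inner = ",\n".join(element_to_code(c, indent + 4) for c in children)
        body = f"\n{inner},\n{pad}"
    else:
        # Leaf element with text content
        body = repr(text)
    if attrs:
        return f"{pad}{name}({attrs})({body})"
    return f"{pad}{name}({body})"


def xml_to_code(xml_text):
    return element_to_code(ET.fromstring(xml_text))


snippet = (
    "<content id='graph' version='1.0'>"
    "<min-redundancy>1</min-redundancy>"
    "<nodes count='1'/>"
    "</content>"
)
code = xml_to_code(snippet)
```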

There are many files in this PR because it includes:

Files matching ["**/services.xml", "**/validation-overrides.xml", "**/hosts.xml"] from the sample-apps repo, used as test files for the services configuration.
.rnc and .rng files from vespaengine/vespa, used for schema validation.

Only the two Python files need review.

jobergum

This is a great direction, unlocking all the features of Vespa - I assume that existing functionality works and that we can add examples of using the new syntax?

thomasht86 · 2024-09-20T07:39:37Z

Thanks. Yes - that is the goal of next iterations.

thomasht86 added 25 commits September 3, 2024 12:43

script to get sample app files

cc1492b

update pyproject

7325ce5

dynamic code proto

bd73a08

add sample app files

49faa94

Merge branch 'master' into thomasht86/dynamic-xml-creation

2397dd5

dyn upd

e50c9e9

add rnc and g files

e69bdd2

looking like something

41ffe79

fix

5de4b98

more tests

f6f0bca

passing tests

d3fcc31

Merge branch 'master' into thomasht86/dynamic-xml-creation

3fefe0e

refine structure

81402fb

simplify tests

dfff05b

update dependencies

eefd154

refine vt

ee00767

working billion scale test

c4c620c

simplify

e95938e

clean comments

5f8e776

vt

0d7ddda

Merge branch 'master' into thomasht86/dynamic-xml-creation

2a35f74

add lxml

e512c3d

move copy script

0f01023

Merge branch 'master' into thomasht86/dynamic-xml-creation

45075ab

Merge branch 'master' into thomasht86/dynamic-xml-creation

a123256

thomasht86 marked this pull request as ready for review September 17, 2024 11:32

thomasht86 requested a review from jobergum September 18, 2024 07:45

Merge branch 'master' into thomasht86/dynamic-xml-creation

312b3a7

jobergum approved these changes Sep 20, 2024

thomasht86 merged commit 9915007 into master Sep 20, 2024
44 checks passed

thomasht86 deleted the thomasht86/dynamic-xml-creation branch September 20, 2024 07:31

This was referenced Sep 23, 2024

(feat) Next iteration of all xml-support #929

Merged

(feat) Support document expiry #936

Merged
