Skip to content

Commit

Permalink
Merge pull request #4237 from vespa-engine/vekterli/reduce-to-1M-dataset
Browse files Browse the repository at this point in the history
Reduce to 1M dataset and use synthetic ID attribute
  • Loading branch information
geirst authored Oct 23, 2024
2 parents 0867e3b + 63ecef2 commit b4e1dec
Show file tree
Hide file tree
Showing 3 changed files with 10 additions and 2 deletions.
2 changes: 1 addition & 1 deletion tests/performance/mmap_vs_directio/.gitignore
Original file line number Diff line number Diff line change
@@ -1,2 +1,2 @@
enwiki-20240801-pages.6819k.jsonl.zst
enwiki-20240801-pages.*.jsonl.zst
squad2-questions.fbench.141k.txt
3 changes: 2 additions & 1 deletion tests/performance/mmap_vs_directio/mmap_vs_directio.rb
Original file line number Diff line number Diff line change
Expand Up @@ -29,7 +29,7 @@ def test_wikipedia_corpus_search_performance

query_file_name = 'squad2-questions.fbench.141k.txt'
report_io_stat_deltas do
feed_file('enwiki-20240801-pages.6819k.jsonl.zst')
feed_file('enwiki-20240801-pages.1M.jsonl.zst')
end

@search_node.trigger_flush # Shovel everything into a disk index
Expand Down Expand Up @@ -119,6 +119,7 @@ def benchmark_queries(query_file, type, clients, warmup = false, runtime = 20)
run_fbench2(@container,
node_file,
{:clients => clients,
:append_str => '&presentation.summary=minimal&hits=10',
:use_post => false,
:runtime => runtime,
:result_file => result_file},
Expand Down
7 changes: 7 additions & 0 deletions tests/performance/mmap_vs_directio/wikimedia.sd
Original file line number Diff line number Diff line change
@@ -1,6 +1,9 @@
schema wikimedia {

document wikimedia {
field id type int {
indexing: attribute | summary
}
field title type string {
indexing: index | summary
index: enable-bm25
Expand All @@ -21,4 +24,8 @@ schema wikimedia {
}
}

document-summary minimal {
summary id {}
}

}

0 comments on commit b4e1dec

Please sign in to comment.