Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[controller] log compaction #1282

Draft
wants to merge 56 commits into
base: main
Choose a base branch
from

Conversation

WhitneyDeng
Copy link

Summary

  • purpose: to get stores ready for compaction based on programmatically defined criteria
  • added API endpoints to get compaction-ready stores

How was this PR tested?

unit test: filter all stores on a cluster for compaction-ready stores

Does this PR introduce any user-facing changes?

  • No. You can skip the rest of this section.
  • Yes. Make sure to explain your proposed changes and call out the behavior change.

Whitney Deng added 10 commits October 29, 2024 11:52
…er methods VeniceParentHelixAdmin::isCompactionReady VeniceParentHelixAdmin::isLastCompactionTimeOlderThanThresholdHours
…ForCompaction & VeniceParentHelixAdmin::isLastCompactionTimeOlderThanThresholdHours JavaDocs description

- complete description of VeniceParentHelixAdmin::getStoresForCompaction
- future tense to present tense for VeniceParentHelixAdmin::isLastCompactionTimeOlderThanThresholdHours
…:filterStoresForCompaction

- extract filterStoresForCompaction() logic for testing
- write VeniceParentHelixAdminTest::testFilterStoresForCompaction
… TestVeniceParentHelixAdmin

- migrate from integration test files to unit test files
Copy link
Contributor

@mynameborat mynameborat left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good job on putting up the first PR quickly. Looks very good. I have put in some comments.

Let me know if you have questions.

Whitney Deng added 5 commits November 6, 2024 11:38
- remodularise VeniceParentHelixAdmin::isHybridStore to function var in VeniceParentHelixAdmin::isCompactionReady
- make VeniceParentHelixAdmin::isLastCompactionTimeOlderThanThresholdHours hardcoded int to class constant
Whitney Deng added 13 commits November 21, 2024 09:10
- change exception message in child controller
interface for repush: Azkaban/Airflow implementation for Linkedin internal. (potentially) Spark implementation for OSS.
- empty implementation for now
reason: since repush will be used for diff use cases (other than compaction), the term repush should be reserved for the abstracted repush action.
- instantiate in VeniceController
- this config determines the number of threads for LogCompactionService::executor
…into CompactionManager

- create CompactionManager & TestCompactionManager
- move TestCompactionManager::testFilterStoresForCompaction from TestVeniceParentHelixAdmin
…utor setup in LogCompactionService::startInner
Whitney Deng added 3 commits December 5, 2024 16:52
- create com.linkedin.venice.controller.logcompaction in tests
- change CompactionManager::filterStoresForCompaction from public to package private
@WhitneyDeng WhitneyDeng closed this Dec 6, 2024
@WhitneyDeng WhitneyDeng deleted the log-compaction branch December 6, 2024 19:25
@WhitneyDeng WhitneyDeng restored the log-compaction branch December 6, 2024 22:11
@WhitneyDeng
Copy link
Author

closed because branch renamed to whdeng/log-compaction-repush to follow branch naming convention

@WhitneyDeng WhitneyDeng reopened this Dec 6, 2024
@WhitneyDeng
Copy link
Author

reverted branch name to log-compaction for PR continuity.

Whitney Deng added 13 commits December 6, 2024 15:08
…abled feature flat

purpose: control rollout of log compaction service
- log trigger source (for now, can expand later)
…QueryParams initialisation to take explicit clusterName directly
- since triggerRepush() is beyond the original scope of log compaction, this method is parked for later addition. adding TODO to remember thought process & design decision to use byte[] + data model class rather than QueryParams for repush job details
- CompactionManager::compactStore & VeniceHelixAdmin::compactStore logs & rethrows exception from repush
- LogCompactionService logs exception & can be extended to perform other actions to handle exception

- RepushJobResponse will be passe from RepushOrchestrator::repush through CompactionManager::compactStore & VeniceHelixAdmin::compactStore to LogCompactionService

- added execution URL to RepushJobResponse, to follow repush.py's pattern
after studying information in Airflow response objects
to keep LogCompactionTask running & not bring down VeniceController
…ompactionManager if log compaction is enabled
- created TestRepushOrchestratorImpl, a dummy RepushOrchestrator implementation for testing
Whitney Deng added 6 commits December 17, 2024 11:51
- add integration test in TestHybrid::testHybridStoreLogCompaction
- add Admin::getCompactionManager for testing
- change scope of CompactionManager::isCompactionReady
…gCompaction

- setup incomplete. decided to do test in TestHybrid::testHybridStoreLogCompaction instead
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants