-
Notifications
You must be signed in to change notification settings - Fork 357
New Baseline Improvement Healthcheck #4673
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
@shrutipatel31 has exported this pull request. If you are a Meta employee, you can view the originating Diff in D89372279. |
Codecov Report❌ Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #4673 +/- ##
==========================================
+ Coverage 96.66% 96.69% +0.02%
==========================================
Files 578 580 +2
Lines 60525 60713 +188
==========================================
+ Hits 58508 58707 +199
+ Misses 2017 2006 -11 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
0e0901a to
d0fa83c
Compare
Summary: This diff introduces a new BaselineImprovementAnalysis healthcheck that evaluates whether an optimization experiment has improved over a baseline arm. Key features: - Compares best performing arms against a baseline arm for each objective metric - Supports both single-objective and multi-objective optimization Status logic: Pass: All objectives improved over baseline Warning: Some objectives improved over baseline Fail: No objectives improved over baseline This analysis - - Automatically selects baseline (explicit → status quo → first trial) or accepts user-provided baseline - Supports customization via documentation_link and no_improvement_message parameters - Returns a HealthcheckAnalysisCard with detailed per-metric breakdown The healthcheck is integrated into the ExperimentAnalysisOverview to run automatically as part of the standard analysis suite. Differential Revision: D89372279 Privacy Context Container: L1307644
Summary: This diff introduces a new BaselineImprovementAnalysis healthcheck that evaluates whether an optimization experiment has improved over a baseline arm. Key features: - Compares best performing arms against a baseline arm for each objective metric - Supports both single-objective and multi-objective optimization Status logic: Pass: All objectives improved over baseline Warning: Some objectives improved over baseline Fail: No objectives improved over baseline This analysis - - Automatically selects baseline (explicit → status quo → first trial) or accepts user-provided baseline - Supports customization via documentation_link and no_improvement_message parameters - Returns a HealthcheckAnalysisCard with detailed per-metric breakdown The healthcheck is integrated into the ExperimentAnalysisOverview to run automatically as part of the standard analysis suite. Differential Revision: D89372279 Privacy Context Container: L1307644
d0fa83c to
fba126f
Compare
Summary: This diff introduces a new BaselineImprovementAnalysis healthcheck that evaluates whether an optimization experiment has improved over a baseline arm. Key features: - Compares best performing arms against a baseline arm for each objective metric - Supports both single-objective and multi-objective optimization Status logic: Pass: All objectives improved over baseline Warning: Some objectives improved over baseline Fail: No objectives improved over baseline This analysis - - Automatically selects baseline (explicit → status quo → first trial) or accepts user-provided baseline - Supports customization via documentation_link and no_improvement_message parameters - Returns a HealthcheckAnalysisCard with detailed per-metric breakdown The healthcheck is integrated into the ExperimentAnalysisOverview to run automatically as part of the standard analysis suite. Differential Revision: D89372279 Privacy Context Container: L1307644
fba126f to
9ad5084
Compare
Summary: This diff introduces a new BaselineImprovementAnalysis healthcheck that evaluates whether an optimization experiment has improved over a baseline arm. Key features: - Compares best performing arms against a baseline arm for each objective metric - Supports both single-objective and multi-objective optimization Status logic: Pass: All objectives improved over baseline Warning: Some objectives improved over baseline Fail: No objectives improved over baseline This analysis - - Automatically selects baseline (explicit → status quo → first trial) or accepts user-provided baseline - Supports customization via documentation_link and no_improvement_message parameters - Returns a HealthcheckAnalysisCard with detailed per-metric breakdown The healthcheck is integrated into the ExperimentAnalysisOverview to run automatically as part of the standard analysis suite. Differential Revision: D89372279 Privacy Context Container: L1307644
9ad5084 to
9b1c973
Compare
Summary: This diff introduces a new BaselineImprovementAnalysis healthcheck that evaluates whether an optimization experiment has improved over a baseline arm. Key features: - Compares best performing arms against a baseline arm for each objective metric - Supports both single-objective and multi-objective optimization Status logic: Pass: All objectives improved over baseline Warning: Some objectives improved over baseline Fail: No objectives improved over baseline This analysis - - Automatically selects baseline (explicit → status quo → first trial) or accepts user-provided baseline - Supports customization via documentation_link and no_improvement_message parameters - Returns a HealthcheckAnalysisCard with detailed per-metric breakdown The healthcheck is integrated into the ExperimentAnalysisOverview to run automatically as part of the standard analysis suite. Differential Revision: D89372279 Privacy Context Container: L1307644
c509f2e to
cb4584a
Compare
Summary: This diff introduces a new BaselineImprovementAnalysis healthcheck that evaluates whether an optimization experiment has improved over a baseline arm. Key features: - Compares best performing arms against a baseline arm for each objective metric - Supports both single-objective and multi-objective optimization Status logic: Pass: All objectives improved over baseline Warning: Some objectives improved over baseline Fail: No objectives improved over baseline This analysis - - Automatically selects baseline (explicit → status quo → first trial) or accepts user-provided baseline - Supports customization via documentation_link and no_improvement_message parameters - Returns a HealthcheckAnalysisCard with detailed per-metric breakdown The healthcheck is integrated into the ExperimentAnalysisOverview to run automatically as part of the standard analysis suite. Differential Revision: D89372279 Privacy Context Container: L1307644
Summary: This diff introduces a new BaselineImprovementAnalysis healthcheck that evaluates whether an optimization experiment has improved over a baseline arm. Key features: - Compares best performing arms against a baseline arm for each objective metric - Supports both single-objective and multi-objective optimization Status logic: Pass: All objectives improved over baseline Warning: Some objectives improved over baseline Fail: No objectives improved over baseline This analysis - - Automatically selects baseline (explicit → status quo → first trial) or accepts user-provided baseline - Supports customization via documentation_link and no_improvement_message parameters - Returns a HealthcheckAnalysisCard with detailed per-metric breakdown The healthcheck is integrated into the ExperimentAnalysisOverview to run automatically as part of the standard analysis suite. Differential Revision: D89372279 Privacy Context Container: L1307644
cb4584a to
056bda8
Compare
Summary: This diff introduces a new BaselineImprovementAnalysis healthcheck that evaluates whether an optimization experiment has improved over a baseline arm. Key features: - Compares best performing arms against a baseline arm for each objective metric - Supports both single-objective and multi-objective optimization Status logic: Pass: All objectives improved over baseline Warning: Some objectives improved over baseline Fail: No objectives improved over baseline This analysis - - Automatically selects baseline (explicit → status quo → first trial) or accepts user-provided baseline - Supports customization via documentation_link and no_improvement_message parameters - Returns a HealthcheckAnalysisCard with detailed per-metric breakdown The healthcheck is integrated into the ExperimentAnalysisOverview to run automatically as part of the standard analysis suite. Differential Revision: D89372279 Privacy Context Container: L1307644
Summary: This diff introduces a new BaselineImprovementAnalysis healthcheck that evaluates whether an optimization experiment has improved over a baseline arm. Key features: - Compares best performing arms against a baseline arm for each objective metric - Supports both single-objective and multi-objective optimization Status logic: Pass: All objectives improved over baseline Warning: Some objectives improved over baseline Fail: No objectives improved over baseline This analysis - - Automatically selects baseline (explicit → status quo → first trial) or accepts user-provided baseline - Supports customization via documentation_link and no_improvement_message parameters - Returns a HealthcheckAnalysisCard with detailed per-metric breakdown The healthcheck is integrated into the ExperimentAnalysisOverview to run automatically as part of the standard analysis suite. Differential Revision: D89372279 Privacy Context Container: L1307644
056bda8 to
bb5c02a
Compare
Summary: This diff introduces a new BaselineImprovementAnalysis healthcheck that evaluates whether an optimization experiment has improved over a baseline arm. Key features: - Compares best performing arms against a baseline arm for each objective metric - Supports both single-objective and multi-objective optimization Status logic: Pass: All objectives improved over baseline Warning: Some objectives improved over baseline Fail: No objectives improved over baseline This analysis - - Automatically selects baseline (explicit → status quo → first trial) or accepts user-provided baseline - Supports customization via documentation_link and no_improvement_message parameters - Returns a HealthcheckAnalysisCard with detailed per-metric breakdown The healthcheck is integrated into the ExperimentAnalysisOverview to run automatically as part of the standard analysis suite. Differential Revision: D89372279 Privacy Context Container: L1307644
Summary: This diff introduces a new BaselineImprovementAnalysis healthcheck that evaluates whether an optimization experiment has improved over a baseline arm. Key features: - Compares best performing arms against a baseline arm for each objective metric - Supports both single-objective and multi-objective optimization Status logic: Pass: All objectives improved over baseline Warning: Some objectives improved over baseline Fail: No objectives improved over baseline This analysis - - Automatically selects baseline (explicit → status quo → first trial) or accepts user-provided baseline - Supports customization via documentation_link and no_improvement_message parameters - Returns a HealthcheckAnalysisCard with detailed per-metric breakdown The healthcheck is integrated into the ExperimentAnalysisOverview to run automatically as part of the standard analysis suite. Differential Revision: D89372279 Privacy Context Container: L1307644
Summary: This diff introduces a new BaselineImprovementAnalysis healthcheck that evaluates whether an optimization experiment has improved over a baseline arm. Key features: - Compares best performing arms against a baseline arm for each objective metric - Supports both single-objective and multi-objective optimization Status logic: Pass: All objectives improved over baseline Warning: Some objectives improved over baseline Fail: No objectives improved over baseline This analysis - - Automatically selects baseline (explicit → status quo → first trial) or accepts user-provided baseline - Supports customization via documentation_link and no_improvement_message parameters - Returns a HealthcheckAnalysisCard with detailed per-metric breakdown The healthcheck is integrated into the ExperimentAnalysisOverview to run automatically as part of the standard analysis suite. Differential Revision: D89372279 Privacy Context Container: L1307644
bb5c02a to
7d3ae07
Compare
|
This pull request has been merged in 8570a97. |
Differential Revision:
D89372279
Privacy Context Container: L1307644