Bivariate Profiles

The key function for evaluating bivariate profile boundaries is bivariate_confidenceprofiles!. The evaluated bivariate profile(s) will be contained within a BivariateConfidenceStruct that is stored in the LikelihoodModel.

LikelihoodBasedProfileWiseAnalysis.bivariate_confidenceprofiles! — Function

bivariate_confidenceprofiles!(model::LikelihoodModel, 
    θcombinations::Vector{Vector{Int}}, 
    num_points::Int; 
    <keyword arguments>)

Finds num_points profile_type boundary points at a specified confidence_level for each combination of two interest parameters using a specified method, optionally saving any found internal points. Saves these profiles by modifying model in place.

Arguments

model: a LikelihoodModel containing model information, saved profiles and predictions.
θcombinations: vector of pairs of parameters to profile, as a vector of vectors of model parameter indexes.
num_points: positive number of points to find on the boundary at the specified confidence level. Depending on the method, if a region of the user-provided bounds is inside the boundary some of these points will be on the bounds and inside the boundary. Set to at least 3 within the function as some methods need at least three points to work.

Keyword Arguments

confidence_level: a number ∈ (0.0, 1.0) for the confidence level on which to find the profile_type boundary. Default is 0.95 (95%).
dof: an integer ∈ [2, model.core.num_pars] for the degrees of freedom used to define the asymptotic threshold (LikelihoodBasedProfileWiseAnalysis.get_target_loglikelihood) which defines the boundary of the bivariate profile. For bivariate profiles that are considered individually, it should be set to 2. For profiles that are considered simultaneously, it should be set to model.core.num_pars. Default is 2. Setting it to model.core.num_pars should be reasonable when making predictions for well-identified models with <10 parameters. Note: values other than 2 and model.core.num_pars may not have a clear statistical interpretation.
profile_type: whether to use the true log-likelihood function or an ellipse approximation of the log-likelihood function centred at the MLE (with optional use of parameter bounds). Available profile types are LogLikelihood, EllipseApprox and EllipseApproxAnalytical. Default is LogLikelihood() (LogLikelihood).
method: a method of type AbstractBivariateMethod. For a list of available methods use bivariate_methods() (bivariate_methods). Default is RadialRandomMethod(5) (RadialRandomMethod).
θlb_nuisance: a vector of lower bounds on nuisance parameters, require θlb_nuisance .≤ model.core.θmle. Default is model.core.θlb.
θub_nuisance: a vector of upper bounds on nuisance parameters, require θub_nuisance .≥ model.core.θmle. Default is model.core.θub.
save_internal_points: boolean variable specifying whether to save points found inside the boundary during boundary computation. Internal points can be plotted in bivariate profile plots and will be used to generate predictions from a given bivariate profile. Default is true.
existing_profiles: Symbol ∈ [:ignore, :merge, :overwrite] specifying what to do if profiles already exist for a given θcombination, confidence_level, profile_type and method. See below for each symbol's meanings. Default is :merge.
find_zero_atol: a Real number greater than zero for the absolute tolerance of the log-likelihood function value from the target value to be used when searching for confidence intervals. Default is model.find_zero_atol.
optimizationsettings: a OptimizationSettings struct containing the optimisation settings used to find optimal values of nuisance parameters for a given pair of interest parameter values. Default is missing (will use model.core.optimizationsettings).
show_progress: boolean variable specifying whether to display progress bars on the percentage of θcombinations completed and estimated time of completion. Default is model.show_progress.
use_distributed: boolean variable specifying whether to use a normal for loop or a @distributed for loop across combinations of interest parameters. Set this variable to false if Distributed.jl is not being used. Default is true.
use_threads: boolean variable specifying, if use_distributed is false, whether to use parallelised for loops across Threads.nthreads() threads or a non-parallel for loops to find boundary points from methods where boundary points are found independently. Default is true.
- Fix1AxisMethod and RadialMLEMethod parallelise the finding point pair step and the finding the boundary from point pairs step.
- SimultaneousMethod and RadialRandomMethod do not parallelise the finding point pair step but parallelise finding the boundary from point pairs.
- IterativeBoundaryMethod parallelises finding the initial boundary but not the following boundary improvement steps.
- AnalyticalEllipseMethod does not require parallelisation.

existing_profiles meanings

:ignore means profiles that already exist will not be recomputed even if they contain fewer num_points boundary points.
:merge means profiles that already exist will be merged with profiles from the current algorithm run to reach num_points. If the existing profile already has at least num_points boundary points then that profile will not be recomputed. Otherwise, the specified method will be run starting from the difference between num_points and the number of points in the existing profile. The result of that method run will be merged with the existing profile. Predictions evaluated from the existing profile will be forgotten. To keep these predictions see extended help below.
:overwrite means profiles that already exist will be overwritten, regardless of how many points they contain. Predictions evaluated from the existing profile will be forgotten. To keep these predictions see extended help below.

Details

Using LikelihoodBasedProfileWiseAnalysis.bivariate_confidenceprofile this function calls the algorithm/method specified by method for each interest parameter combination in θcombinations (depending on the setting for existing_profiles and num_points if these profiles already exist). Nuisance parameters of each point in bivariate interest parameter space are found by maximising the log-likelihood function given by profile_type. Updates model.biv_profiles_df for each successful profile and saves their results as a BivariateConfidenceStruct in model.biv_profiles_dict, where the keys for the dictionary is the row number in model.biv_profiles_df of the corresponding profile. model.biv_profiles_df.num_points is the number of points found on the bivariate boundary (it does not include the number of saved internal points).

Extended help

Valid bounds

For methods that use points placed on parameter bounds to bracket for the confidence boundary, the bracketing method utilised via Roots.jl's find_zero will be unlikely to converge to the true confidence boundary for a given pair of interest parameters if the bounds on either parameter are +/- Inf or the log-likelihood function evaluates to +/- Inf. Bounds should be set to prevent this from occurring.

Preventing predictions from being forgotten when merging or overwriting profiles

To prevent predictions from being lost from existing profiles that would be overwritten when calling bivariate_confidenceprofiles!, existing profiles should be converted into a [CombinedBivariateMethod], prior to running new bivariate profiles. To do this use combine_bivariate_boundaries! on model with keyword argument not_evaluated_predictions set to false.

Distributed Computing Implementation

If Distributed.jl is being used and use_distributed is true, then the bivariate profiles of distinct interest parameter combinations will be computed in parallel across Distributed.nworkers() workers. If use_distributed is false and use_threads is true then for methods where finding boundary points is independent they will be computed in parallel across Threads.nthreads() threads for each pair of interest parameters.

Iteration Speed Of the Progress Meter

The time/it value is the time it takes for each new boundary point to be found (for all methods except for AnalyticalEllipseMethod). For AnalyticalEllipseMethod this is the time it takes to find all points on the boundary of the ellipse of two interest parameters.

Bivariate Profiles

Methods For Finding Boundaries

Sampling Internal Points From Boundaries

Merging Boundaries From Multiple Methods

Index