An Actuary’s Perspective: Unlocking the Power of Structured Underwriting Data
By Hezhong (Mark) Ma
Reinsurance News, December 2024
Both underwriters and actuaries bring deep expertise to experience-driven risk assessment. However, a general disconnect persists between underwriting and actuarial systems.
Underwriting evidence, impairments, and the rationale behind decisions are typically locked within legacy systems, workbenches, and rules engines—often in unstructured formats. Only a limited set of codes is passed to administration systems for use by actuarial models.
This leads to the familiar actuary refrain: “That is a brilliant idea, but we just don’t have the data.”
Advanced digitization tools strengthened by artificial intelligence (AI) are changing the situation.
This article examines several instances showcasing the power of structured underwriting data from an actuary’s perspective. It also details one partnership that demonstrates how AI can be used to extract useful data from piles of information and enable better risk management decisions.
Exhibit 1
Distribution of Classes by Total Cholesterol Count
A casual glance at the above chart might lead an underwriter to point out that cholesterol is only one of many factors affecting how underwriters assign classes. It might also lead an actuary to argue that the chart demonstrates no clear pattern between these two variables. Both are correct.
But both would also be missing the point.
For the first time in the industry’s history, insurers now possess the data and capabilities to create such charts. More importantly, they now have longitudinal data of vital signs and lab results such as weight, blood pressure, and A1C that can be linked to variables in administration systems and experience studies.
This capability allows the exploration of a wide variety of scenarios. For example:
- If the cholesterol threshold for super-preferred is lowered, how many more people would qualify?
- Which point-in-time cholesterol value, or the pattern of changes, is the most predictive of mortality?
While some companies may have captured insurance laboratory cholesterol values at the time of underwriting, virtually none would have captured every cholesterol value in clinical labs. These granular insights can refine underwriting manuals and, when combined with rules engines, produce actionable, more precise rules and models.
This machine-readable knowledge will transform underwriting processes. For instance, actuaries and underwriters can use structured data to identify cases more likely to be placed, thereby prioritizing valuable underwriting resources.
Exhibit 2
Distribution of Application Status by A1C
While this purely illustrative chart may not show a significant relationship, the integration of data such as A1C levels with the thousands of other available data points can be programmed into an underwriting process to pre-fill impairment codes in a workbench, target certain cases for post-issue audits, or build accelerated monitoring programs. The possibilities are truly endless.
Recent years have seen a surge in new underwriting data, tools, models, and innovative uses of existing information. Many carriers have questioned how protective some of these tools really are. Traditional actuarial approaches, while based on credible data sets and thorough experience studies, often miss the rich context considered by underwriters when evaluating a case.
On the other hand, the underwriting approach provides a holistic view of the application but lacks the scalability and credibility offered by data-driven methods. Successful integration of AI tools can enable protective value analysis at scale, speed, and efficiency. By submitting the same cases with different permutations of evidence, insurers can measure the exclusivity directly.
As a proof of concept, Table 1 below features a small number of cases submitted twice, once with attending physician statement (APS) and electronic health record (EHR), then again with APS and insurance labs and Rx histories. This study compared the number of medical conditions each run captured.
Table 1
Proof of Concept: Comparing Digital Data Source Combinations
Based on this very limited sample size, adding EHRs to APS unsurprisingly appears to capture more impairments than adding insurance labs and Rx histories. As underwriting files are scaled, specific evidence types can be targeted and their incremental value of capturing major impairments precisely quantified.
Experienced underwriters often are skeptical of this data-driven approach. How much does digital risk assessment compete with or supplement to decades of on-the-job expertise? Many InsurTech companies with novel products struggle to place their digital evidence alongside traditional underwriting processes to pinpoint improvements, win trust, and enable carriers to confidently move forward. By structuring natively imaged underwriting data files, companies can use established methods to validate and reconcile these new tools.
Establishing Partnerships
One example of this is RGA’s partnership with DigitalOwl, an insurance technology company that uses advanced AI to interpret and transform medical records into a comprehensive and interactive medical record summary. This partnership is part of that fundamental change in which digital tools exist alongside traditional underwriting expertise to create more informed risk management decisions.
As of this writing, RGA’s in-force analysis has included approximately 43,000 underwriting files, ten million pages, processed by DigitalOwl and returned to RGA. Although this analysis is at an early stage, the potential is undeniable.
Table 2 is a confusion matrix comparing when DigitalOwl identified Type 2 diabetes mellitus versus RGA manual impairment codes.
Table 2
AI versus Manual Identification of Type 2 Diabetes
The rows represent whether RGA identified Type II diabetes in its underwriting system, while the columns indicate whether DigitalOwl did. For example, in 5% of cases, DigitalOwl flagged an impairment that wasn't coded in RGA’s underwriting systems. While there are hypotheses for this mismatch—such as the purpose of capturing impairments in an internal system versus the purpose of APS summaries—the real breakthrough is the newfound ability to explain these differences with accuracy and confidence.
What Comes Next
The potential of structured data is only now starting to be explored. For the first time, insurance professionals can combine underwriting information with claims data, opening the door to leverage analytics for fraud detection and claims rescissions. The structured underwriting data can also assist in-force management. With the first-mention date extracted for each impairment, the percentage of the population that develops conditions such as diabetes over a period of time can be calculated. That insight enables targeting policies for term conversions or consumer engagement. It could also be instrumental to in-force analysis and reporting.
The numbers in this article are just illustrative examples. But they also represent only a fraction of 1% of our industry’s historical underwriting files. Thus, this transformation is only just beginning. The art of the possible is seeing through the current imperfections to the potential that structuring underwriting data will unlock.
At the heart of underwriting transformation is the ability to automate with confidence. Insurers should aim to leverage relevant experience to reinvent the design of rules, models, and underwriting processes. This is a blueprint for the future.
Statements of fact and opinions expressed herein are those of the individual authors and are not necessarily those of the Society of Actuaries, the newsletter editors, or the respective authors’ employers.
Hezhong (Mark) Ma, FSA, is VP and managing actuary for RGA. Mark can be reached at hma@rgare.com.