<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en-US">
	<id>https://www.insurerbrain.com/w/index.php?action=history&amp;feed=atom&amp;title=Definition%3ACoarsened_exact_matching_%28CEM%29</id>
	<title>Definition:Coarsened exact matching (CEM) - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://www.insurerbrain.com/w/index.php?action=history&amp;feed=atom&amp;title=Definition%3ACoarsened_exact_matching_%28CEM%29"/>
	<link rel="alternate" type="text/html" href="https://www.insurerbrain.com/w/index.php?title=Definition:Coarsened_exact_matching_(CEM)&amp;action=history"/>
	<updated>2026-05-13T09:18:32Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.43.8</generator>
	<entry>
		<id>https://www.insurerbrain.com/w/index.php?title=Definition:Coarsened_exact_matching_(CEM)&amp;diff=22000&amp;oldid=prev</id>
		<title>PlumBot: Bot: Creating new article from JSON</title>
		<link rel="alternate" type="text/html" href="https://www.insurerbrain.com/w/index.php?title=Definition:Coarsened_exact_matching_(CEM)&amp;diff=22000&amp;oldid=prev"/>
		<updated>2026-03-27T06:00:51Z</updated>

		<summary type="html">&lt;p&gt;Bot: Creating new article from JSON&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;🎯 &amp;#039;&amp;#039;&amp;#039;Coarsened exact matching (CEM)&amp;#039;&amp;#039;&amp;#039; is a nonparametric matching method that groups observations into coarsened strata based on pre-treatment covariates, then matches treated and control units within the same strata to reduce [[Definition:Confounding variable | confounding]] in observational studies. Insurance analysts adopt CEM when [[Definition:Randomized controlled trial (RCT) | randomized experiments]] are impractical — a common scenario in the industry — and they need to evaluate the effect of an intervention such as a new [[Definition:Underwriting | underwriting]] rule, a [[Definition:Discount | premium discount]] for installing safety devices, or a change in [[Definition:Claims management | claims handling]] protocols. By coarsening continuous variables like policyholder age, [[Definition:Sum insured | sum insured]], or years of claims-free driving into discrete bins before matching, CEM avoids some of the model-dependency pitfalls that plague other techniques like [[Definition:Propensity score matching (PSM) | propensity score matching]].&lt;br /&gt;
&lt;br /&gt;
⚙️ The process works in three steps. First, the analyst temporarily coarsens each covariate into meaningful categories — for example, grouping [[Definition:Premium | premium]] bands into ranges or [[Definition:Exposure | exposure]] durations into yearly intervals. Second, the algorithm assigns each observation to a stratum defined by the unique combination of coarsened values and discards any stratum that does not contain at least one treated and one control unit. Third, within retained strata, observations are weighted to reflect the relative sizes of the treated and control groups, and analysis proceeds on the matched dataset using the original, uncoarsened variable values. This approach guarantees that the maximum [[Definition:Imbalance | imbalance]] between groups is bounded by the coarsening thresholds chosen, giving the analyst direct control over the trade-off between precision and [[Definition:Common support | sample size]]. In practice, an insurer evaluating whether a [[Definition:Telematics | telematics]] program reduces accident frequency might coarsen on vehicle type, driver age bracket, geographic zone, and historical claims count to ensure that program participants are compared only with genuinely similar non-participants.&lt;br /&gt;
&lt;br /&gt;
💡 CEM&amp;#039;s appeal for insurance applications lies in its transparency and the intuitive control it offers to domain experts. [[Definition:Actuary | Actuaries]] and data scientists can set coarsening thresholds based on actuarial judgment — they know, for instance, that grouping commercial fleet sizes into bands of 10 vehicles is meaningful, whereas bins of 100 would be too coarse to capture risk variation. Unlike propensity score methods, CEM does not require correct specification of a parametric model for treatment assignment, which reduces a major source of hidden bias. However, practitioners must be mindful that aggressive coarsening discards observations that fall outside [[Definition:Common support | common support]], potentially limiting generalizability. Despite this trade-off, CEM has gained traction among [[Definition:Insurtech | insurtechs]] and advanced analytics teams within traditional carriers seeking to produce credible evidence that specific interventions — from [[Definition:Fraud detection | fraud detection]] algorithms to [[Definition:Loss prevention | loss prevention]] incentives — genuinely move the needle on outcomes rather than simply reflecting pre-existing differences between policyholder segments.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Related concepts:&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
{{Div col|colwidth=20em}}&lt;br /&gt;
* [[Definition:Propensity score matching (PSM)]]&lt;br /&gt;
* [[Definition:Common support]]&lt;br /&gt;
* [[Definition:Causal inference]]&lt;br /&gt;
* [[Definition:Confounding variable]]&lt;br /&gt;
* [[Definition:Selection bias]]&lt;br /&gt;
* [[Definition:Conditional average treatment effect (CATE)]]&lt;br /&gt;
{{Div col end}}&lt;/div&gt;</summary>
		<author><name>PlumBot</name></author>
	</entry>
</feed>