“Despite its simplicity and convenience, the Quetelet index is not a good indicator of body composition for individuals.”
— Ancel Keys, coiner of the term "Body Mass Index," Journal of Chronic Diseases, 1972
The Quetelet Origin — A Formula for Populations, Not Patients
Adolphe Quetelet was a Belgian mathematician and astronomer whose primary interest was not medicine. He was interested in what he called the science of human beings — a statistical project aimed at characterizing the average properties of large populations. In 1832, he published work introducing what he called the Quetelet Index: body weight in kilograms divided by height in meters squared. The formula was elegant and computation-free, requiring only a scale and a measuring tape.
Quetelet's purpose was explicit. He was studying the distribution of physical characteristics across populations — the kind of social-statistical analysis that would help him identify the properties of l'homme moyen, the average man. The index was designed to describe populations, not to assess individual health. Quetelet understood that a population-level tool could not be used to evaluate individuals without losing the statistical properties that made it meaningful in the aggregate. He said as much in his writing.
The formula sat largely dormant as a clinical tool for over a century. It was used in insurance actuarial work — Metropolitan Life's height-weight tables drew on related calculations — but it was not systematically promoted as a clinical measure of health risk. Its reentry into medicine came not through epidemiological validation but through the influence of a single researcher who borrowed the formula and renamed it.
The Keys Rebranding — r = 0.44 and a Name Change
Ancel Keys was an American physiologist best known for his Seven Countries Study on diet and cardiovascular disease. In 1972, he published a paper in the Journal of Chronic Diseases titled "Indices of Relative Weight and Obesity." Keys had collected data across multiple countries and needed a practical measure of body composition for large-scale epidemiological work. He tested several weight-height indices — the Quetelet Index, Ponderal Index, Rohrer Index, and others — and evaluated their correlation with direct measures of body fat.
His finding for the Quetelet Index: a correlation of r=0.44 with body fat percentage as measured by more accurate methods. This is a moderate statistical association. It means that BMI explains approximately 19% of the variance in actual body fat — leaving 81% explained by other factors or unexplained entirely. Keys acknowledged this clearly in the paper. He concluded that the Quetelet Index was adequate for population-level studies where individual precision was not required. He also renamed it the Body Mass Index — a more clinical-sounding designation than its population-statistics origin suggested.
Keys' paper did not recommend BMI for individual clinical diagnosis. He specifically noted that the index could not distinguish between lean mass and fat mass — a serious limitation for any clinical application. A heavily muscled athlete and a sedentary individual of equal weight and height would receive identical BMI scores despite radically different body compositions. Keys knew this. He documented it. The paper that named BMI also recorded, precisely, why BMI should not be used as an individual diagnostic tool.
Clinical Adoption — Administrative Convenience Over Measurement Validity
The transition of BMI from population-research tool to clinical diagnostic standard occurred through a combination of administrative convenience and institutional adoption rather than through evidence of clinical validity. BMI requires no equipment, no training, no laboratory analysis, and no specialized clinical judgment. A height and a weight — two numbers available from any routine medical intake form — are sufficient to compute it. For insurance companies, clinical databases, and health systems managing millions of patients, this convenience was decisive.
Insurance companies adopted BMI-based thresholds for risk classification in the 1970s and 1980s. The actuarial logic was straightforward: higher BMI correlates with higher rates of certain conditions at the population level, so it could function as a risk-stratification variable for premium pricing. The validity problem was the same one Keys had identified — the population-level correlation does not translate into reliable individual-level prediction — but insurance classification does not require individual accuracy, only statistical associations that hold in large pools.
The World Health Organization adopted BMI cutpoints in 1995: underweight below 18.5, normal between 18.5 and 25, overweight between 25 and 30, obese above 30. These cutpoints were based on limited epidemiological data — primarily from European and American populations — and were explicitly described as tentative. The WHO acknowledged that the thresholds were not validated across all ethnic populations and that the appropriate cutpoints for Asian populations might be substantially lower. These caveats did not prevent the thresholds from being adopted universally and treated as authoritative.
The 1998 Reclassification — The Largest Administrative Redefinition in Medical History
On June 17, 1998, the National Heart, Lung, and Blood Institute published new clinical guidelines under the title "Clinical Guidelines on the Identification, Evaluation, and Treatment of Overweight and Obesity in Adults." The document lowered the BMI threshold for "overweight" classification from the previous sex-specific values — 27.8 for men and 27.3 for women — to a uniform 25 for both sexes. The obesity threshold remained at 30. The guidelines were presented as harmonizing American standards with World Health Organization definitions.
The mathematical consequence was immediate: 29 million Americans were reclassified as overweight without any change in their bodies. They had not gained weight. Their health had not changed. Their bodies were identical on June 16 and June 17. Only the administrative threshold moved. Those 29 million people were, by the new definition, overweight — and therefore candidates for clinical intervention, insurance risk reclassification, and physician counseling that had not applied to them the previous day.
The June 17, 1998 NIH decision to lower the BMI overweight threshold from sex-specific values (27.8/27.3) to a uniform 25, reclassifying 29 million Americans as medically overweight without a change in any individual's body. The largest single administrative act of medical redefinition in American history. The Reclassification Event documents what happens when a threshold moves: not that bodies change, but that the administrative category that organizes clinical and economic treatment of those bodies changes — instantly, at scale, by committee decision.
The advisory panel that produced the 1998 guidelines included nine members with ties to the pharmaceutical or commercial weight-loss industry — companies that stood to benefit from the expansion of the "overweight" population that the new threshold would produce. This conflict of interest was not hidden: the guideline document itself listed the panel members' affiliations. It was, however, not treated as a reason to question the threshold selection. The threshold aligned with WHO definitions, which provided institutional legitimacy independent of whether the WHO thresholds were themselves well-validated.
The 1998 reclassification created the epidemiological "obesity epidemic" as a documented phenomenon. Before June 17, the overweight prevalence in the United States was approximately 55% of adults. After June 17, using the same body measurements, it was approximately 64%. The epidemic did not reflect a change in American bodies. It reflected a change in the administrative definition of which bodies counted as problematic.
The Correlation Record — What BMI Actually Predicts
BMI's clinical utility depends on what it actually correlates with in individual patients. The answer, across the research literature, is: moderate association with body fat percentage, moderate association with some health outcomes at the population level, and substantial misclassification of individuals. The correlation with direct body fat measurement (measured by dual-energy X-ray absorptiometry, or DEXA) is approximately r=0.44 — exactly what Keys found in 1972 — meaning BMI explains roughly 19% of the variance in measured body fat.
The practical consequence is systematic misclassification in both directions. A 2016 study by Tomiyama and colleagues in the International Journal of Obesity examined the metabolic health of over 40,000 adults using both BMI classification and direct cardiometabolic marker assessment. The findings were striking: 47% of individuals classified as "overweight" by BMI were metabolically healthy by all five standard cardiometabolic criteria. Among those classified as "obese," 29% were metabolically healthy. Conversely, 31% of individuals in the "normal weight" BMI range had at least two cardiometabolic risk factors.
The "obesity paradox" in cardiovascular and critical care medicine represents a further complication. Multiple studies have found that patients with BMI classified as "overweight" or mildly "obese" have equivalent or better clinical outcomes than those with "normal" BMI for conditions including heart failure, chronic obstructive pulmonary disease, and recovery from major surgery. The mechanism is not fully understood, but the finding is replicated across multiple patient populations and undermines the assumption that the BMI threshold at 25 identifies a health risk that begins at exactly that point.
Clinical Consequences — When the Measure Shapes the Treatment
The clinical consequences of BMI's dominance extend beyond inaccurate individual classification. Weight stigma in medical settings — documented across multiple specialties — produces systematic differences in clinical care. Patients with higher BMI receive shorter consultations, have their presenting symptoms more frequently attributed to weight irrespective of clinical indication, and are less likely to receive screening procedures recommended by guidelines for their age or risk profile. Studies of physician attitudes find that weight stigma is among the most commonly expressed biases in medical training environments.
The strongest argument for BMI retention is not that it is an accurate individual-level tool — the evidence is clear that it is not — but that it identifies real population-level associations that inform public health priorities. At the population level, higher BMI is genuinely associated with elevated rates of type 2 diabetes, hypertension, sleep apnea, and certain cancers. These associations are not artifacts of the 1998 threshold change; they appear across multiple studies using multiple methodologies. A public health system that tracked no weight-related metric would lose meaningful signal about a genuine problem.
The response this paper offers is not that weight-related health associations are false, but that BMI is an imprecise instrument for identifying them in individuals. The population-level signal is real; the individual-level instrument is unreliable. The policy question is whether an instrument that misclassifies roughly 30-47% of the individuals it classifies across different BMI categories is the right tool for individual clinical decision-making — particularly when more accurate alternatives exist.
Eating disorders represent a documented clinical downstream effect of BMI-based clinical conversations. Research on adolescent patient populations finds that physician-initiated weight discussions correlated with increased rates of disordered eating behaviors. The mechanism is not certainty, but the pathway is plausible: a clinical authority figure framing a BMI number as a health problem activates exactly the kind of outcome-focused weight preoccupation that clinical eating disorder literature identifies as a risk factor. The instrument designed to identify a health problem creates conditions for a different one.
The Alternatives — Measures That Do What BMI Claims to Do
Several alternatives to BMI demonstrate superior validity for the purposes BMI is routinely used to serve. The waist-to-height ratio — waist circumference in centimeters divided by height in centimeters — is consistently a better predictor of cardiovascular and metabolic risk than BMI across multiple populations and ethnic groups. The threshold is simple: a ratio above 0.5 (waist circumference greater than half of height) identifies elevated risk. It requires only a tape measure, takes no longer than BMI to compute, and better captures the clinically relevant distribution of adipose tissue around the abdomen.
Waist circumference alone has stronger associations with metabolic risk than BMI in most studies. The metabolic syndrome criteria — elevated fasting glucose, elevated triglycerides, reduced HDL cholesterol, elevated blood pressure, and elevated waist circumference — provide a direct measure of the health risks that BMI imperfectly proxies. These five markers require laboratory tests for four of the five, but they measure the actual biological processes that translate excess adiposity into disease rather than measuring a geometric ratio that correlates with those processes only moderately.
DEXA body composition scanning provides the gold-standard direct measurement of fat mass, lean mass, and bone density. It is more expensive and requires specialized equipment, limiting its use in routine primary care. For clinical populations where the question of body composition is clinically relevant, however, it provides the kind of individual-level accuracy that BMI's population-level design cannot deliver. The existence of accurate measurement tools, even expensive ones, demonstrates that BMI's continued use is not a measurement necessity — it is an administrative preference.
Why BMI Persists — The Economics of a Convenient Number
BMI's persistence despite documented limitations is explicable by the same logic that explains the persistence of other inadequate metrics that have become embedded in institutional infrastructure. The instrument provides a number — a single, comparable, instantly computable quantity — that organizes insurance classification, clinical documentation, research databases, public health surveillance, and pharmaceutical market sizing. Replacing it would require replacing every one of those systems simultaneously, or maintaining both systems in parallel during a transition period that no coordinating institution has incentive to initiate.
The pharmaceutical industry has a specific incentive to maintain the current BMI threshold structure. The overweight and obesity pharmaceutical market — which includes GLP-1 receptor agonists such as semaglutide and tirzepatide — is sized against the population classified as overweight or obese by current BMI thresholds. As of 2024, that market exceeds $50 billion annually and is projected to grow substantially. Raising the overweight threshold — returning it to the pre-1998 values, or replacing BMI with a more accurate metabolic risk assessment — would shrink the eligible patient population and reduce market size. The industry that most benefits from the current threshold has the greatest capacity to influence the clinical guideline processes that would be required to change it.
The 1832 formula designed by a Belgian mathematician to study population distributions is now the primary clinical instrument through which millions of Americans are categorized as medically problematic and directed toward interventions. The path from Quetelet's population statistics to a pharmaceutical eligibility criterion ran through a 1972 paper that documented BMI's limitations, a 1998 guideline revision that created 29 million new patients in a day, and the administrative inertia of systems built around the number those processes produced. The formula did not change. The body of evidence about its limitations did not disappear. What changed was the institutional interest in the number it generated.
The named condition this paper documents — The Reclassification Event — is not a historical curiosity. It is a demonstration of how clinical categories are established, by whom, and in whose interest. The 29 million people who became overweight on June 17, 1998 were not informed of the threshold change, were not given evidence about the limitations of the instrument, and were not offered an alternative framing of what the number meant. They were reclassified, and the clinical and economic systems oriented toward their bodies adjusted accordingly. The metric moved; the market followed.
Selected References
- Keys, A., et al. (1972). Indices of relative weight and obesity. Journal of Chronic Diseases, 25(6), 329–343.
- National Heart, Lung, and Blood Institute. (1998). Clinical Guidelines on the Identification, Evaluation, and Treatment of Overweight and Obesity in Adults. NIH Publication No. 98-4083.
- Tomiyama, A. J., et al. (2016). Misclassification of cardiometabolic health when using body mass index categories in NHANES 2005–2012. International Journal of Obesity, 40(5), 883–886.
- Bhaskaran, K., et al. (2018). Body-mass index and risk of 22 specific cancers: a population-based cohort study of 5·24 million UK adults. The Lancet, 384(9945), 755–765.
- Ashwell, M., Gunn, P., & Gibson, S. (2012). Waist-to-height ratio is a better screening tool than waist circumference and BMI for adult cardiometabolic risk factors: systematic review and meta-analysis. Obesity Reviews, 13(3), 275–286.
- Puhl, R. M., & Heuer, C. A. (2010). Obesity stigma: important considerations for public health. American Journal of Public Health, 100(6), 1019–1028.
- Hunger, J. M., et al. (2015). Weighing in on Stigma: Literature Review on Weight Stigma and Eating Disorders. Obesity Reviews.
- Lavie, C. J., et al. (2009). Obesity and cardiovascular disease: risk factor, paradox, and impact of weight loss. Journal of the American College of Cardiology, 53(21), 1925–1932.
- World Health Organization. (1995). Physical Status: The Use and Interpretation of Anthropometry. WHO Technical Report Series No. 854.