TraviaTechPie Review

Review Tech, Science, Finance

The Setup

Walk into the smartwatch aisle and the boxes shout the same words. ECG. Blood pressure. Sleep apnea. SpO2. HRV. Stress. Skin temperature. Recovery score. VO2 max. Some of those are clinical instruments wrapped in plastic. Some are educated guesses. And some are marketing dressed up as medicine.

Here’s the problem with most smartwatch reviews: they list everything in the Yes/No column and let you do the math. The honest math is harder. Yes, the watch has it and yes, it works the way the box implies are two different statements — and the gap between them is where bad purchase decisions live.

This is the pillar piece for our health-features deep dive. It does one thing: sorts every meaningful smartwatch health feature into three tiers based on what the medical literature, the FDA paperwork, and independent reviewers actually say. Then it shows where each major brand lands on each tier.

The thesis is simple. Smartwatch health features split into three groups:

1. Tier 1 — Clinical-grade. Genuinely validated. A doctor would not roll their eyes. 2. Tier 2 — Useful with limits. The number is not precise, but the trend is honest. Read it as a signal, not a measurement. 3. Tier 3 — Mostly marketing. Either the science isn’t there, the calibration is unreliable, or the feature is doing something less impressive than the name suggests.

A single watch usually mixes all three. The question is not “does it have everything.” The question is how much of the box is Tier 1, and is that the Tier 1 you actually need.

Let’s walk through each tier, then look at every brand on every feature.

Tier 1 — Clinical-grade (the features a doctor takes seriously)

Four features sit in this tier in 2026.

ECG (electrocardiogram). This is the strongest entry. Smartwatch ECG has FDA clearance on Apple Watch, Galaxy Watch, Pixel Watch, Garmin (Venu 4 and selected Forerunner / Fenix), and Fitbit (Charge 6, Sense 2). The clinical use case is atrial fibrillation screening — a 30-second single-lead reading from the wrist that catches the rhythm pattern. Multiple peer-reviewed studies have shown smartwatch ECG agrees with 12-lead clinical ECG on AFib detection with high sensitivity. This is the one feature where “the watch caught my arrhythmia” stories are not anecdotal — they are documented.

The catch is what ECG doesn’t do. It does not diagnose heart attacks. It does not catch every arrhythmia. It requires the user to start the reading manually (touch the bezel or crown for 30 seconds). It’s a screening tool, not a continuous monitor. But within those bounds, it works.

Sleep apnea detection. A newer entry. Samsung was first to receive FDA De Novo authorization (February 2024) — the feature works on Galaxy Watch 4 and later through the Samsung Health Monitor app. Apple followed in September 2024 with 510(k) clearance for Apple Watch Series 9, Series 10, and Ultra 2 (now also Series 11, Ultra 3, SE 3). Both track “breathing disturbances” over multiple nights and flag patterns suggesting moderate-to-severe obstructive sleep apnea. This is not the same as a clinical polysomnography (PSG) study — but as a screening trigger for go get tested, it works. Independent sleep medicine researchers have validated the underlying signal.

Garmin, Fitbit, Whoop, Huawei, Pixel Watch do not have FDA-cleared sleep apnea detection. They may track breathing disturbances internally, but they don’t make the diagnostic claim.

Heart rate monitoring (basic). Optical wrist-based heart rate is now a solved problem for resting and steady-state conditions. Every major watch agrees with chest-strap ECG to within 1–3 bpm at rest. The honest caveats: optical HR struggles during high-intensity intervals, weight training (wrist tension), and cold weather. For 95% of use cases — daily resting HR, walking, easy running — wrist HR is reliable.

This isn’t FDA-cleared in the same sense as ECG, but it’s the most thoroughly validated metric in the consumer wearable space.

Fall detection / crash detection. Apple Watch, Garmin (most models with accelerometer), Galaxy Watch, Pixel Watch. Triggers an emergency call after a hard fall if the wearer doesn’t respond. Validated through real-world incidents and emergency services data — this is the feature with the most published “the watch saved their life” cases. Works. Boring. Tier 1.

That’s it for Tier 1. Four features. Notice how many “health” claims printed on smartwatch boxes are not in this tier.

Tier 2 — Useful with limits (read it as a signal, not a number)

This is the largest tier. The features here are genuinely valuable — but only if you read them correctly. The number is approximate. The direction is honest. The trend over weeks is the real signal.

Sleep stages and sleep score. Every major smartwatch claims to break the night into Light / Deep / REM / Awake stages. The hard truth: compared to clinical PSG, smartwatch sleep staging is roughly 60–80% accurate depending on the brand and the night. Total sleep time is usually accurate within 15 minutes. Stage breakdown is much shakier. A 2023 meta-analysis found wrist wearables overestimate REM and underestimate deep sleep on average.

But here’s the catch — for individual trend tracking, the numbers are useful. If your sleep score is normally 80 and it drops to 60 for three nights, something is off, even if the absolute number isn’t lab-grade. Apple’s Sleep Score, Samsung’s Sleep Score with Sleep Coaching, Pixel Watch’s Sleep Profile (with Fitbit engine), Garmin’s Sleep Coach, and Whoop’s recovery score all work this way. Read the trend, not the single number.

SpO2 (blood oxygen). Smartwatch SpO2 is a spot measurement using red and infrared light through the wrist. Compared to a clinical pulse oximeter on the finger, wrist SpO2 is reliably off by 2–4%, with worse accuracy in darker skin tones (a documented and unresolved problem). A “94%” reading on the watch might be a real 96% — or a real 92%. As a one-shot reading, this is not useful.

What is useful: overnight SpO2 trends. If your average drops over time, or if you see frequent dips below 88%, that’s a signal worth investigating (it correlates with sleep apnea risk). The trend works; the single number does not.

HRV (heart rate variability). A measurement of the time variation between heartbeats. Low HRV correlates with stress, fatigue, illness, overtraining. Smartwatch HRV measurement is reasonably accurate when measured at consistent times (usually overnight). Garmin’s HRV Status, Whoop’s Recovery, Apple’s Vitals, Oura’s Readiness, and Pixel Watch’s Fitbit-engine HRV all use the same underlying signal.

The limitation: HRV is highly individual. A “good” HRV for one person is a “bad” HRV for another. Comparing your HRV to your neighbor’s is meaningless. Comparing your HRV today to your 30-day average — that’s the real metric.

Activity tracking (steps, calories, exercise minutes). Step counts are accurate within 5–10% for walking. They get less accurate when you’re carrying a stroller, doing yard work, or holding handlebars (no arm swing). Calorie estimates are generously described as ballpark — published validation studies find smartwatch calorie estimates can be off by 25–50%. The exercise minutes / “move ring” mechanism is more about gamification than measurement, but the gamification works for behavior change.

Honest framing: activity tracking is genuinely useful for sustaining a habit, not for precise calorie accounting.

GPS accuracy (for running, cycling, hiking). This one splits sharply. Multi-band (L1+L5) GPS is genuinely accurate — within 1–3 meters in open conditions, and (critically) it stays accurate in urban canyons and forest cover where single-band GPS fails. Apple Watch Ultra, Galaxy Watch Ultra, Pixel Watch 4, Garmin Fenix / Forerunner higher tiers, Huawei GT 6 Pro all have multi-band. Single-band GPS (most entry models, Apple Watch SE, lower Galaxy/Pixel) drifts 5–20 meters in tough conditions.

If you run in cities or forests, multi-band matters. If you only run on open tracks, single-band is fine.

Recovery and training-load scores. Garmin’s Body Battery and Training Status, Whoop’s Recovery score, Apple’s Training Load, Fitbit’s Daily Readiness all combine HRV + sleep + recent activity into a single readiness number. The science underneath (overload-recovery models from sports physiology) is real. The translation into a single “you’re 78% recovered today” number adds opinion to data — different brands give wildly different scores for the same conditions.

Treat these as one input among several, not as a verdict. If your recovery score says rest and your legs feel great, trust the legs.

Skin temperature (overnight trends). Apple Watch (Series only), Galaxy Watch, Pixel Watch, Garmin, Fitbit Sense 2 all track wrist skin temperature overnight. The absolute number is wrong (your wrist is colder than your core). The trend is informative — it tracks menstrual cycle phases, illness onset (a 0.3°C bump for two nights often precedes COVID symptoms by 24–48 hours, documented in multiple studies), and ovulation. Trend yes, fever measurement no.

That’s Tier 2. Seven features that genuinely tell you something — if you read them as trends, not as truth.

Tier 3 — Mostly marketing (where the box outruns the science)

This is the smaller, more controversial tier. These features are not necessarily useless. They’re features where the marketing claim outruns what the watch actually does. If a feature is in this tier, you should not pay extra for it — and you should not make medical decisions based on it.

“Blood pressure measurement” (without a cuff). This is the most aggressive marketing claim in the category. Samsung Galaxy Watch BP is marketed as a blood pressure measurement, but it requires monthly recalibration against a real cuff (otherwise drift makes the readings meaningless), and it does not have US FDA clearance for blood pressure measurement. Apple Watch (Series 11) introduced a “Hypertension Notification” feature in 2025, but Apple is careful to call it a notification — it analyzes vascular response over 30-day windows to flag people who might be hypertensive, with the explicit instruction to confirm with a real BP cuff. The FDA cleared the notification, not a measurement.

Translation: if you care about blood pressure, buy a $50 upper-arm cuff from Omron. It will be more accurate than any wrist-worn smartwatch BP feature in 2026, by a large margin. The watch BP claims are not a substitute. They are at best an early-screening trigger to go buy the cuff.

“Antioxidant index” / carotenoid sensing. Samsung introduced this in Galaxy Watch 7/8 using a skin-spectrum sensor that estimates carotenoid levels (a proxy for vegetable intake). Samsung itself acknowledges, in fine print, that the feature is not medically validated. It’s a wellness gimmick. Not harmful, not meaningful.

“Stress score” (as an absolute number). Garmin’s Stress Score, Galaxy’s Stress Score, Pixel Watch’s Stress Tracking. These all derive from HRV, which is in Tier 2 — so they share Tier 2’s “trend yes, number no” caveat. But marketing presents them as absolute readings (“Your stress is 73 today”), which they’re not. A 73 for one person is a 40 for another. As a daily prompt to take a breath, fine. As a measurement, no.

VO2 max (smartwatch estimate). Real VO2 max requires lab equipment — a treadmill, a mouthpiece, and a respiratory gas analyzer. Smartwatches estimate it from heart rate response during running. Published validation finds wrist-based VO2 max estimates are off by 8–15% on average versus lab measurement. Useful as a trend for trained runners (if your estimated VO2 max climbs over a season, training is working). Useless as a precise number.

Almost every major watch puts a VO2 max number in the app. Treat it as Tier 2 trend data dressed in Tier 1 clothing.

“Body composition” / segmental BIA. Some watches (Galaxy Watch advertised this prominently) attempt bioelectrical impedance through wrist contact. Wrist-only BIA is not reliable for body fat percentage — it samples too little of the body. Stand-on smart scales with two-point or four-point BIA are more accurate. Wrist BIA is well-intentioned but not where you’d get an honest body composition reading.

“Skin temperature for fever detection.” As covered in Tier 2 — overnight skin temperature trends are useful for cycle tracking and illness onset. But several brands have implied or directly suggested the feature can be used for fever screening. It cannot — your skin is several degrees off your core body temperature, and the watch is not calibrated for fever detection. The trend is Tier 2, the fever framing is Tier 3.

That’s Tier 3. Five categories of features that are either marketing-led or where the absolute number can’t be trusted.

The Master Table — every major brand on every feature

Here’s the matrix. Tier 1 = ✓✓ (clinical-grade where supported), Tier 2 = ✓ (works as trend), Tier 3 = ⚠️ (marketed but limited), — = not available.

Brands: Apple Watch Series 11, Galaxy Watch 8, Pixel Watch 4, Garmin Venu 4, Fitbit Charge 6, Whoop 5.0/MG, Huawei Watch GT 6 Pro.

FeatureTierApple Watch Series 11Galaxy Watch 8Pixel Watch 4Garmin Venu 4Fitbit Charge 6Whoop 5.0 / MGHuawei GT 6 Pro
ECG (single-lead)T1✓✓ FDA✓✓ FDA✓✓ FDA✓✓ FDA✓✓ FDA (Charge/Sense)✓✓ FDA (MG only)✓✓
Sleep apnea (FDA-cleared)T1✓✓ FDA✓✓ FDA
Heart rate (resting / steady)T1✓✓✓✓✓✓✓✓✓✓✓✓✓✓
Fall / crash detectionT1✓✓✓✓✓✓✓✓ (incident detection)
Sleep stages / sleep scoreT2✓ trend✓ trend + Sleep Coach✓ Fitbit engine✓ Sleep Coach✓ Recovery score
SpO2 (overnight trend)T2✓ trend✓ trend✓ trend✓ trend✓ trend✓ trend✓ trend
HRVT2✓ Vitals app✓ Fitbit engine✓ HRV Status✓ deep (Recovery)
Activity (steps, calories)T2✓ (no display)
GPS — multi-bandT2✓ (dual band)✓ (dual band)✓ (dual band)✓ (dual band)single-band only— (uses phone)✓ (dual band L1+L5)
Recovery / training-loadT2✓ Training Load✓ Energy Score✓ Body Battery + Training Status✓ Daily Readiness (Premium)✓✓ deep (Recovery + Strain Coach)
Skin temperature (trend)T2✓ (Series, not SE)✓ (Sense 2 only)
Blood pressureT3⚠️ Hypertension Notification only (FDA)⚠️ Measurement claim (no FDA, cuff recalibration)⚠️ overnight BP estimate (Life/MG only, beta)
Stress score (as number)T3— (Mindfulness only)⚠️⚠️⚠️ Body Battery⚠️ Daily Stress
Antioxidant / carotenoidT3⚠️ (Bioactive sensor, not validated)
VO2 max estimateT3 trend⚠️ trend⚠️ trend⚠️ trend⚠️ trend⚠️ trend⚠️ trend
Body composition (wrist BIA)T3⚠️ (BIA, limited accuracy)

A few patterns jump off this matrix.

Tier 1 is mostly settled. ECG is in every flagship-tier watch from every major brand. The two FDA-cleared sleep apnea entries (Apple and Samsung) are the actual differentiation — that single feature is one of the few real exclusive moats in 2026.

Tier 2 is where the noise is. Every brand claims every Tier 2 feature. The differences are in implementation depth, not feature presence. Garmin and Whoop are deeper on recovery/training-load. Apple and Samsung are deeper on sleep coaching. Fitbit and Pixel share the same Fitbit health engine (literally the same algorithms). Multi-band GPS is the cleanest Tier 2 hardware differentiator — and Fitbit Charge 6 and Whoop (no GPS) are the obvious gaps.

Tier 3 is where Samsung accumulates the most “wins” that aren’t really wins. Galaxy Watch 8’s longest feature list owes a lot to features in this column — antioxidant index, wrist BIA, wrist BP without FDA. They’re real features. They’re not real differentiation in the way the box implies. (Samsung’s FDA-cleared sleep apnea is the real win, sitting in Tier 1.)

The Buyer’s Question — re-framing the comparison

Now the practical reframe. When you walk into a smartwatch comparison and look at the “health features” column, the right question is not “how many.” It’s three questions in order.

1. Which Tier 1 features do I actually need?

If atrial fibrillation runs in your family — ECG matters. Pick a watch that has it (almost all do). If you snore badly or your partner complains about apnea-like pauses — FDA-cleared sleep apnea screening matters. That narrows you to Apple Watch or Galaxy Watch. If you’re elderly or live alone and a hard fall is a real risk — fall detection. Apple, Samsung, Pixel, Garmin.

If none of those apply to you specifically, the Tier 1 differentiator collapses. Every major watch covers basic heart rate. Buy on Tier 2 strength or non-health features.

2. Which Tier 2 features will I actually read?

If you’re a serious runner — multi-band GPS and recovery-score depth dominate. Garmin (Fenix, higher Forerunners) and Whoop pull ahead. See the running smartwatch pick for the model-level decision.

If you want sleep tracking as a daily habit — Apple, Samsung, and Pixel are roughly equivalent. Sleep Coach (Samsung) and Sleep Profile (Pixel/Fitbit) are the most opinionated UX layers.

If you want a single recovery number to organize your day around — Whoop and Garmin are deeper than the rest. See the Whoop deep dive for whether that’s worth the screenless trade-off.

3. Which Tier 3 features am I being charged for?

The honest test: if you removed the Tier 3 features entirely, would you still buy this watch?

For most buyers, yes. For some Galaxy Watch 8 buyers specifically — the appeal is the long feature list, which is loaded with Tier 3 entries. Recognize what you’re paying for.

If you bought the watch because the box said “blood pressure!” — buy the $50 Omron cuff anyway. If you bought it because the box said “antioxidant index!” — eat more vegetables and ignore the number. The watch is still a good watch. It’s just not actually doing those things in the way the box implied.

What’s Next in This Cluster

This pillar piece is the entry point for our health-features deep dive. Each Tier 1 and Tier 2 feature deserves its own piece — the master table here is the map, but each cell hides a deeper conversation about implementation differences, validation studies, and edge cases.

The follow-up pieces will cover ECG accuracy and the AFib detection literature in depth, sleep tracking comparisons across the four major brands, SpO2 and respiratory monitoring honestly, stress and HRV beyond the marketing number, GPS accuracy for runners (where multi-band actually matters), recovery and VO2 max as training tools, and the remaining health features — skin temperature, cycle tracking, fall detection — that don’t fit elsewhere.

If you want the model-level decisions rather than the feature theory, the existing pieces cover those: the five-part Apple vs Samsung vs Garmin comparison, the budget alternatives guide for Fitbit / Pixel / Huawei / Whoop, the smartwatch beginner’s guide, and the brand-specific buy guides for Apple, Samsung, and Garmin.

The Takeaway

The honest answer to “what smartwatch health features actually work” is shorter than the marketing list suggests.

Four features are clinical-grade — ECG, FDA-cleared sleep apnea detection (Apple and Samsung only), basic heart rate, and fall detection. These are the features a cardiologist would not roll their eyes at.

Seven features are useful as trends but not as numbers — sleep score, SpO2, HRV, activity, GPS (only multi-band), recovery/training load, and skin temperature. Read the direction, not the digit.

Five categories of features are mostly marketing — wrist-based blood pressure (buy the cuff), antioxidant index, stress as an absolute number, VO2 max as a precise reading, and wrist-based body composition. They’re not useless. They’re not what the box implies.

The math, then, is: if you remove Tier 3 from your decision, every major brand collapses into two real questions. Which Tier 1 features do I need (mostly, do I need FDA-cleared sleep apnea — Apple or Samsung) and which Tier 2 depth do I want (recovery — Whoop / Garmin; sleep coaching — Samsung; running GPS — Garmin or Apple Ultra; passive trackers — Fitbit / Pixel).

That’s a real comparison. The “Yes/No” column in most reviews is not.

### Buy on Amazon

As an Amazon Associate I earn from qualifying purchases.


Photo: Luke Chesser / Unsplash

Posted in

댓글 남기기

TraviaTechPie Review에서 더 알아보기

지금 구독하여 계속 읽고 전체 아카이브에 액세스하세요.

계속 읽기