ASTR 201: Astronomy for Science Majors (Spring 2026)

Learning Objectives

Define apparent and absolute magnitude and apply the distance modulus formula
Convert between magnitude differences and flux ratios using the Pogson relation
Construct the observer’s HR diagram (\(M_V\) vs. spectral type) and identify its major features
Construct the theorist’s HR diagram (\(\log L\) vs. \(\log T_{\text{eff}}\)) and derive lines of constant radius
Identify the main sequence, giant branch, and white dwarf sequence
Explain why the main sequence is a mass sequence
Articulate the questions the HR diagram raises — and why physics (Module 3) is needed to answer them

They expected scatter.

They found structure.

The question today is not what the HR diagram looks like — it’s why the universe prefers order over randomness.

Today’s Roadmap

The Magnitude System — the astronomer’s logarithmic brightness scale
The Observer’s HR Diagram — patterns from data alone
The Theorist’s HR Diagram — overlaying physics (lines of constant radius)
Mass: The Hidden Organizer — why the main sequence is a mass sequence
An Evolution Diagram — stars move, and the patterns demand physics (→ Module 3)

Map Builder

We’re building a map of the stars. By the end, every star has an address — and mass writes the zip code.

The Magnitude System

The astronomer’s brightness scale

Stars Span a Ridiculous Range

Betelgeuse delivers \(6 \times 10^7\!\times\) more flux than Proxima Centauri
The Sun delivers \(6 \times 10^{10}\!\times\) more flux than Betelgeuse (because it’s close)
Total range: more than \(10^{10}\) — a linear scale is useless

You need a logarithmic scale. That scale is the magnitude system — where multiplicative flux changes become additive magnitude shifts.

The Backward Scale

Brighter objects have smaller (more negative) magnitudes.

Object	Apparent magnitude \(m\)
Sun	\(-26.74\)
Sirius	\(-1.46\)
Vega	\(+0.03\)
Naked-eye limit	\({\sim}+6\)
Hubble limit	\({\sim}+31\)

Hipparchus (2nd century BCE): “first magnitude” = brightest. Modern astronomy formalized this — but kept the backward convention.

Why Logarithms? A 60-Second Refresher

The magnitude system is built on \(\log_{10}\). Three properties do all the work:

Property	Rule	Example
Product → Sum	\(\log(A \times B) = \log A + \log B\)	\(\log(100 \times 1{,}000) = 2 + 3 = 5\)
Ratio → Difference	\(\log(A / B) = \log A - \log B\)	\(\log(1{,}000 / 10) = 3 - 1 = 2\)
Power → Multiply	\(\log(A^n) = n \log A\)	\(\log(10^{3.6}) = 3.6\)

Why this matters: Magnitudes use \(\log_{10}(\text{flux ratio})\). The distance modulus uses \(\log_{10}(d)\). The HR diagram uses \(\log L\) and \(\log T\). Logarithms turn multiplicative relationships into additive ones — multiplication becomes addition.

Optional: 20-Second Log Refresher

Product: \(\log(AB)=\log A+\log B\)
Ratio: \(\log(A/B)=\log A-\log B\)
Power: \(\log(A^n)=n\log A\)
Anchor: \(\log_{10}(2)\approx 0.3\)

Logarithms in Action: Key Values

\(x\)	\(\log_{10} x\)	How to remember
\(1\)	\(0\)	\(10^0 = 1\)
\(10\)	\(1\)	Definition
\(100\)	\(2\)	\(10^2\)
\(1{,}000\)	\(3\)	\(10^3\)
\(2\)	\(0.301\)	\(\approx 0.3\) (useful!)
\(3\)	\(0.477\)	\(\approx 0.5\)
\(0.1\)	\(-1\)	\(10^{-1}\)
\(0.01\)	\(-2\)	\(10^{-2}\)

Mental math: \(\log_{10}(200) = \log(2 \times 100) = 0.3 + 2 = 2.3\). That’s how we’ll estimate distances from magnitudes.

Apparent Magnitude: The Pogson Relation

\[ m_1 - m_2 = -2.5\,\log_{10}\!\left(\frac{F_1}{F_2}\right) \tag{1}\]

\(m_1, m_2\) = apparent magnitudes of two sources
\(F_1, F_2\) = fluxes (energy per unit area per unit time)
Scaling: Each \(1~\text{mag}\) difference = factor of \(10^{0.4} \approx 2.512\) in flux
Unit check: Flux ratio is dimensionless; magnitude difference is dimensionless. \(\checkmark\)

Anchor: 5 magnitudes = exactly a factor of 100 in flux (by definition).

Magnitude–Flux Conversion: Key Numbers

Magnitude difference \(\Delta m\)	Flux ratio \(F_1/F_2\)
\(1~\text{mag}\)	\(10^{0.4} \approx 2.512\)
\(2~\text{mag}\)	\(10^{0.8} \approx 6.31\)
\(5~\text{mag}\)	\(10^{2.0} = 100\)
\(10~\text{mag}\)	\(10^{4.0} = 10{,}000\)
\(15~\text{mag}\)	\(10^{6.0} = 10^6\)

Mental math trick: \(\Delta m = 5\) → \(\times 100\). Double it: \(\Delta m = 10\) → \(\times 10{,}000\). Add 5 more: \(\Delta m = 15\) → \(\times 10^6\).

Quick Check: Flux Ratio

Star X has \(m = 3.0\) and Star Y has \(m = 8.0\). How many times brighter is Star X?

\(5\times\) — magnitude difference equals flux ratio
\(25\times\) — square of the magnitude difference
\(100\times\) — since \(\Delta m = 5.0\) and \(5~\text{mag} = 100\times\)
\(10{,}000\times\) — that would require \(\Delta m = 10\)

Absolute Magnitude: A Fair Comparison

Apparent magnitude \(m\) mixes intrinsic brightness with distance
A nearby dim star can look brighter than a distant luminous one
To compare stars fairly: place them all at the same distance

Absolute magnitude \(M\) = the apparent magnitude a star would have at a standard distance of \(10~\text{pc}\).

This measures intrinsic brightness — the star’s luminosity in the magnitude system.

Apparent vs. Absolute: Four Stars

Star	\(m\) (apparent)	Distance	\(M\) (absolute)	\(L/L_\odot\)
Sun	\(-26.74\)	\(5 \times 10^{-6}~\text{pc}\)	\(+4.83\)	\(1\)
Sirius	\(-1.46\)	\(2.64~\text{pc}\)	\(+1.42\)	\(25\)
Betelgeuse	\(+0.42\)	\(200~\text{pc}\)	\(-5.85\)	\({\sim}10^5\)
Proxima Cen	\(+11.13\)	\(1.30~\text{pc}\)	\(+15.53\)	\(0.0017\)

The Sun looks blindingly bright (\(m = -26.74\)) only because it’s absurdly close. At \(10~\text{pc}\), it would be \(M = +4.83\) — visible but unremarkable.

The Distance Modulus

\[ m - M = 5\log_{10}\!\left(\frac{d}{10\,\mathrm{pc}}\right) \tag{2}\]

\(m\) = apparent magnitude (observed)
\(M\) = absolute magnitude (intrinsic)
\(d\) = distance in parsecs
\((m - M)\) = the distance modulus
Scaling: Every \(5~\text{mag}\) increase in \((m-M)\) = \(10\times\) farther
What it is: The inverse-square law (\(F = L/4\pi d^2\)) rewritten in the astronomer’s logarithmic language

Distance Modulus: Sanity Checks

At \(d = 10~\text{pc}\): \(m - M = 5\log_{10}(10/10) = 0\) → \(m = M\). \(\checkmark\) (Definition of \(M\).)
At \(d = 100~\text{pc}\): \(m - M = 5\log_{10}(100/10) = 5\). Star appears \(5~\text{mag}\) (\(100\times\)) fainter. \(\checkmark\)
At \(d = 1{,}000~\text{pc}\): \(m - M = 10\). Star is \(10^4\!\times\) fainter. \(\checkmark\)

Pattern: Each factor of \(10\) in distance adds \(5~\text{mag}\) to the distance modulus.

Worked Example: How Far Is This Cepheid?

Problem: A Cepheid variable has \(m = 14.0\) and (from its pulsation period) \(M = -4.0\). Find \(d\).

Step 1 — Distance modulus: \(m - M = 14.0 - (-4.0) = 18.0~\text{mag}\)

Step 2 — Solve for distance:

\[\log_{10}\!\left(\frac{d}{10~\text{pc}}\right) = \frac{18.0}{5} = 3.6 \quad \Rightarrow \quad d = 10^{3.6} \times 10~\text{pc} \approx 4.0 \times 10^4~\text{pc} = 40~\text{kpc}\]

Sanity check: \(40~\text{kpc}\) is comparable to the Milky Way’s diameter — reasonable for a luminous Cepheid (\(M = -4.0\) means \(L \sim 5{,}000\,L_\odot\)). \(\checkmark\)

Quick Check: Distance Modulus

A star has apparent magnitude \(m = 5.0\) and absolute magnitude \(M = 5.0\). How far away is it?

\(1~\text{pc}\)
\(10~\text{pc}\) — by definition, \(m = M\) means the star is at \(10~\text{pc}\)
\(100~\text{pc}\)
Cannot determine without luminosity

Synthesis Check: Luminosity vs Distance

Star A is \(100\times\) more luminous than Star B, but Star A is also \(10\times\) farther away.

How do their apparent magnitudes compare?

Star A appears \(5~\text{mag}\) brighter
They appear equally bright (\(\Delta m = 0\)) — flux scales as \(L/d^2\), so \(100/(10^2)=1\)
Star A appears \(5~\text{mag}\) fainter
Cannot determine without spectral type

The Observer’s HR Diagram

Patterns from data alone

A Radical Idea: Plot Everything

By the early 1900s, astronomers had spectral types and parallax distances for thousands of stars.

Ejnar Hertzsprung (Denmark, 1911) and Henry Norris Russell (Princeton, 1913) independently asked: what happens if you plot absolute magnitude against spectral type?

Every star has different mass, age, and composition. The natural expectation: noise — a random spray of dots.

What they found instead was one of the most stunning patterns in all of science.

Cecilia Payne-Gaposchkin: Decoding the Diagram

Black-and-white photograph from circa 1890 showing seven women working at desks in a room at Harvard College Observatory, examining photographic plates and notebooks. A light-curve chart labeled 'B Aurigae Dec 1889' hangs on the wall behind them.

Credit: Harvard College Observatory / Smithsonian Institution

1925 PhD thesis (age 25): Connected the HR diagram’s axes to physics
The horizontal axis is a temperature sequence, not a composition sequence
All stars are overwhelmingly hydrogen and helium
Struve: “undoubtedly the most brilliant PhD thesis ever written in astronomy”

The Horizontal Axis: OBAFGKM Revisited

Composite image of stellar spectra arranged vertically from O6.5 at top to M5 at bottom, with wavelength increasing left to right across the visible band. Each horizontal strip shows a rainbow-colored spectrum with dark absorption lines at different positions. Star catalog identifiers (HD numbers) are labeled on the right. Three additional spectra at the bottom show an F4 metal-poor star, an M4.5 emission star, and a B1 emission star.

Credit: NOAO/AURA/NSF

From Lecture 3: the spectral sequence is a temperature sequence.

O → hot (\(> 30{,}000~\text{K}\)), blue
M → cool (\({\sim}3{,}000~\text{K}\)), red
Line strengths trace \(T\), not composition
This sequence becomes the HR diagram’s horizontal axis

The Observer’s HR Diagram

Color-magnitude diagram with absolute visual magnitude on the vertical axis (brighter at top) and B-V color index on the horizontal axis (blue-hot on left, red-cool on right). Hundreds of points form a diagonal main sequence from upper-left to lower-right, a clump of giants in the upper-right, scattered supergiants at the top, and white dwarfs in the lower-left. The Sun is marked at B-V = 0.65, M_V = 4.83. Spectral type labels O B A F G K M appear across the top.

Credit: ASTR 201 (generated)

Axes (both backwards!):

Vertical: \(M_V\) — brighter (more negative) at top
Horizontal: spectral type — hotter on the left, cooler on the right

Neither axis requires theory — this is pure measurement.

Three structures jump out immediately.

Axis Anchor (Do Not Forget)

Hotter \(\leftarrow\)

Brighter \(\uparrow\)

Every confusion about the HR diagram starts with forgetting one of these arrows.

Watch: Building an HR Diagram from Cluster Data

What to notice: a cluster’s stars do not land randomly. The main sequence, giant branch, and turnoff structure emerge from data and then demand a physical model.

Predict: Name the Structures

Look at the HR diagram. Stars are NOT randomly scattered — they cluster into distinct regions.

(a) Where do most stars fall? What shape is that region?

(b) What’s unusual about the stars in the upper right? (Cool but luminous — how?)

(c) What’s unusual about the stars in the lower left? (Hot but faint — how?)

Hint: think about Stefan-Boltzmann (\(L = 4\pi R^2 \sigma T^4\)).

The HR Diagram: Annotated View

Hertzsprung-Russell diagram with logarithmic luminosity on the vertical axis (relative to the Sun) and surface temperature on the horizontal axis decreasing from about 30000 K at left to 3000 K at right. Colored stellar points and shaded regions mark the main sequence, giants, supergiants, and white dwarfs, with the Sun labeled near luminosity 1 and temperature about 5800 K.

Credit: ESO

The labeled regions are not arbitrary — each corresponds to a physical class of stars with distinct sizes, luminosities, and evolutionary states.

If stars were arbitrary collections of gas, there would be no reason for a narrow diagonal band to exist. Something is constraining their internal structure.

The Main Sequence

A narrow diagonal band from upper-left (hot, bright) to lower-right (cool, faint)
90% of all stars fall on this band
The Sun sits roughly in the middle (G2, \(M_V = +4.83\))

This is not a coincidence. From Lecture 4: mass determines luminosity (\(L \propto M^{3.5}\)) and temperature. The main sequence is a mass sequence — but that reveal comes in Part 4.

Giants and White Dwarfs

Upper Right: Giants

Cool (\(T \sim 3{,}000\text{–}5{,}000~\text{K}\)) but luminous (\(100\text{–}10^4\,L_\odot\)).

Stefan-Boltzmann demands enormous radii: \(R \sim 10\text{–}100\,R_\odot\).

A red giant’s photosphere could extend halfway to Mercury’s orbit.

Lower Left: White Dwarfs

Hot (\(T \sim 10^4\text{–}3 \times 10^4~\text{K}\)) but faint (\({\sim}0.01\,L_\odot\)).

Stefan-Boltzmann demands tiny radii: \(R \sim 0.01\,R_\odot\) — Earth-sized.

Dead stellar cores, no longer burning fuel.

Think–Pair–Share: White Dwarf Radius

A star has \(T \sim 2.5 \times 10^4~\text{K}\) (about \(4\times\) hotter than the Sun) and \(L \sim 0.01\,L_\odot\).

Using \(L/L_\odot = (R/R_\odot)^2 (T/T_\odot)^4\), estimate \((R/R_\odot)^2\).
What familiar solar-system object has a similar radius?

30 seconds alone → 1 minute with a neighbor → share out

Luminosity Classes: Vertical Structure

Even at the same spectral type, stars can differ enormously in luminosity. Spectral line widths reveal why — they encode surface gravity.

Class	Name	Surface gravity	Lines	Example
I	Supergiant	Low \(g\)	Narrow	Betelgeuse (M1 I)
III	Giant	Moderate	—	Arcturus (K1 III)
V	Main-seq dwarf	High \(g\)	Broad	Sun (G2 V)

Full classification = spectral type + luminosity class: G2 V (Sun), M1 I (Betelgeuse).

Quick Check: Stellar Classification

A star is classified as K5 III. Which statement is correct?

It is hotter than the Sun and a main-sequence dwarf
It is cooler than the Sun and a main-sequence dwarf
It is cooler than the Sun and a giant — K is cooler than G; class III = giant
It is hotter than the Sun and a giant

Map Builder: Structures Identified

Three structures have emerged from the data.

Main sequence — 90% of stars, a diagonal band ordered by temperature
Giant branch — cool but luminous → enormous radii (\(10\text{–}100\,R_\odot\))
White dwarfs — hot but faint → tiny radii (\({\sim}0.01\,R_\odot\))
Luminosity classes add vertical structure: giants and dwarfs at the same temperature

The map has geography — but not yet physics. Let’s add it.

The Theorist’s HR Diagram

Overlaying physics on measurement

Same Patterns, Physical Axes

Observer’s diagram:

\(M_V\) vs. spectral type
Pure measurement
No theory required

Theorist’s diagram:

\(\log(L/L_\odot)\) vs. \(\log T_{\text{eff}}\)
Physical quantities
Can overlay theoretical relationships

The same patterns appear on both versions. The patterns are real features of stellar physics, not artifacts of the measurement system.

Stefan-Boltzmann → Lines of Constant Radius

From Lecture 2: \(L = 4\pi R^2 \sigma T^4\). On the HR diagram, \(L\) and \(T\) are the axes. Fix \(R\):

\[\log(L/L_\odot) = 2\log(R/R_\odot) + 4\log(T/T_\odot)\]

At fixed \(R\), this is a straight line with slope \(4\) on the \(\log L\) – \(\log T\) diagram.

Notice the steepness: at fixed radius, a \(0.1\) dex increase in \(\log T\) gives a \(0.4\) dex increase in \(\log L\).

Scaling: Double \(T\) at fixed \(R\) → \(L\) increases by \(2^4 = 16\times\) (steep!)
Shifting \(R\): Larger \(R\) moves the line up (brighter at every \(T\))
Each line represents all possible \((L, T)\) combinations for a star of that radius
These lines turn clustered points into constrained physical states

The Theorist’s HR Diagram

HR diagram with log luminosity in solar units on the vertical axis and log effective temperature on the horizontal axis (reversed, hotter on left). Dashed diagonal lines show constant stellar radii from 0.01 to 1000 solar radii. Main sequence stars form a diagonal band, giants cluster in the upper right near the 10-100 solar radii lines, and white dwarfs cluster in the lower left near the 0.01 solar radii line. The Sun is marked at log T = 3.76, log L = 0.

Credit: ASTR 201 (generated)

You can read radius directly from a star’s position. Three physical properties — \(L\), \(T\), \(R\) — encoded in a two-axis plot.

Reading Radius from Position

HR Region	Typical radius	Physical meaning
Main sequence	\(0.1\text{–}10\,R_\odot\)	Hydrogen-burning stars
Giants	\(10\text{–}100\,R_\odot\)	Evolved, expanded envelopes
Supergiants	\(100\text{–}1{,}000\,R_\odot\)	Most luminous evolved stars
White dwarfs	\({\sim}0.01\,R_\odot\)	Dead cores (Earth-sized)

Map Builder update: The map now has a size gradient — \(L\), \(T\), and \(R\) all readable from one diagram.

Size Along the Main Sequence

Artistic rendering of main-sequence stars arranged left to right by spectral type from M (small, red-orange) through K, G, F, A, B to O (large, blue-white), shown to relative scale against a black background. Each star is labeled with its spectral letter.

Credit: Wikimedia Commons (after Morgan & Keenan)

From O to M: luminosity spans \(10^{10}\), temperature spans a factor of \(14\), and radius spans a factor of \({\sim}100\). All driven by one hidden variable — mass.

Worked Example: Red Giant Radius

Problem: A red giant has \(L = 400\,L_\odot\) and \(T_{\text{eff}} = 4{,}000~\text{K}\). Find \(R/R_\odot\).

Ratio form (no constants needed!): \[\left(\frac{R}{R_\odot}\right)^2 = \frac{L/L_\odot}{(T/T_\odot)^4} = \frac{400}{(4{,}000/5{,}800)^4}\]

Temperature ratio: \(4{,}000/5{,}800 = 0.690\). Then \(0.690^4 = 0.226\).

\[\left(\frac{R}{R_\odot}\right)^2 = \frac{400}{0.226} = 1{,}770 \quad \Rightarrow \quad \frac{R}{R_\odot} = \sqrt{1{,}770} \approx 42\]

The red giant is \({\sim}42\,R_\odot\). Mercury orbits at \(0.39~\text{AU} \approx 84\,R_\odot\) — this star extends halfway there.

Quick Check: White Dwarf Radius

A white dwarf has \(T_{\text{eff}} = 2 \times 10^4~\text{K}\) and \(L = 0.01\,L_\odot\). What is its radius?

\({\sim}100\,R_\odot\) — a giant
\({\sim}1\,R_\odot\) — Sun-sized
\({\sim}0.1\,R_\odot\) — larger than a planet
\({\sim}0.01\,R_\odot\) (Earth-sized) — since \((R/R_\odot)^2 = 0.01/(3.45)^4 \approx 7 \times 10^{-5}\)

Mass — The Hidden Organizer

The variable that doesn’t appear on either axis

Predict: Mass Along the Main Sequence

From Lecture 4: more massive main-sequence stars are more luminous (\(L \propto M^{3.5}\)) and hotter.

If you labeled each main-sequence star with its mass, how would mass change along the sequence?

Mass increases randomly — no pattern
Mass increases from upper-left to lower-right
Mass increases from lower-right to upper-left — monotonically

Commit to an answer before the next slide.

The Main Sequence Is a Mass Sequence

Position	Type	\(M/M_\odot\)	\(L/L_\odot\)	\(T_{\text{eff}}\) (K)	Lifetime
Lower right	M5	\(0.1\)	\(0.001\)	\(3{,}000\)	\(> 100~\text{Gyr}\)
	M0	\(0.5\)	\(0.08\)	\(3{,}850\)	\({\sim}60~\text{Gyr}\)
Middle	G2 ☉	\(1.0\)	\(1.0\)	\(5{,}800\)	\({\sim}10~\text{Gyr}\)
	A0	\(2.5\)	\(40\)	\(9{,}900\)	\({\sim}1~\text{Gyr}\)
Upper left	B0	\(15\)	\(3 \times 10^4\)	\(3 \times 10^4\)	\({\sim}10~\text{Myr}\)
	O5	\(40\)	\(5 \times 10^5\)	\(4.2 \times 10^4\)	\({\sim}1~\text{Myr}\)

Mass increases monotonically from lower-right to upper-left. Mass — which does not appear on either axis — organizes the entire structure.

Nothing on the axes says “mass.” Yet mass organizes the entire pattern — the signature of a hidden variable.

The Mass-Luminosity Relation: Lecture 4 Callback

Log-log plot of luminosity versus mass for main-sequence stars in solar units. Points are color-coded by spectral type (blue for O/B, white for A, yellow for G, orange for K, red for M). A dashed line shows the power-law fit L proportional to M to the 3.5. The Sun is marked at (1, 1). Annotations show that 2 solar masses gives about 11 solar luminosities and 10 solar masses gives about 3000 solar luminosities.

Credit: ASTR 201 (generated)

Log-log scatter plot of stellar luminosity in solar units versus mass in solar units from Eker et al. 2018. Hundreds of gray data points form a tight diagonal band from lower-left (low mass, low luminosity) to upper-right (high mass, high luminosity). A red piecewise linear fit and blue dotted classical power-law fit overlay the data. Short vertical tick marks along the horizontal axis indicate mass boundaries between the four power-law segments.

Credit: Eker et al. 2018, MNRAS 479, 5491

\(L \propto M^{3.5}\) explains the main sequence: mass sets core temperature → nuclear burning rate → luminosity and surface temperature. The scatter is real — but the trend is tight.

For main-sequence stars, this is an empirical scaling (not an exact law): the exponent varies with mass range and stellar structure, but the monotonic trend is robust.

Spectroscopic Parallax: Distance from a Spectrum

Every main-sequence star of a given mass has essentially the same \(L\) and \(T\)
Know spectral type + luminosity class V → know \(M_V\) from the reference table
Measure apparent magnitude \(m\) → distance modulus → distance

\[d = 10^{(m - M)/5 + 1}~\text{pc}\]

Distance from a spectrum — no parallax needed. This extends the distance ladder far beyond Lecture 1’s geometric parallax.

This works cleanly for main-sequence stars; giants and white dwarfs break the one-to-one spectral-type-to-luminosity mapping.

(Misnomer: nothing to do with parallax. But the name stuck.)

What About Giants and White Dwarfs?

Giants/supergiants: Stars that have left the main sequence — exhausted core hydrogen, expanded dramatically. Position depends on mass + age + evolutionary state.
White dwarfs: Remnants of dead stars — no longer burning fuel, just cooling. Position depends on mass + cooling time.

Main sequence = where stars live. Giants = where stars age. White dwarfs = where stars end up.

Map Builder: Mass writes the addresses — but the HR diagram is not a snapshot. It’s an evolution diagram.

An Evolution Diagram

Why patterns need physics

Stars Move on the HR Diagram

The life of a Sun-like star (\(1\,M_\odot\)), traced on the HR diagram:

Birth: Collapse from gas cloud → contracts toward the main sequence (upper right → middle)
Main sequence (\({\sim}10~\text{Gyr}\)): Hydrogen burning → stable equilibrium at (G2, \(1\,L_\odot\)) for \(10\) billion years
Red giant: Core hydrogen exhausted → core contracts, envelope expands → moves to upper right (\(R \sim 100\,R_\odot\))
Death: Sheds outer layers → white dwarf remnant in lower left, then slowly cools rightward

This motion is not random — it is the response of a self-gravitating system to changing fuel. 90% of stars are on the main sequence because that’s where they spend 90% of their lives.

Mass Sets the Pace of Evolution

HR diagram with luminosity on the vertical axis and surface temperature on the horizontal axis, showing the main sequence and overlaid evolutionary tracks for stars of different masses from about 0.5 to 15 solar masses. Arrows and labels indicate approximate evolution timescales, highlighting that higher-mass stars evolve much faster than lower-mass stars.

High-mass stars peel away from the main sequence quickly; low-mass stars linger for billions of years. Same diagram, different clocks.

Even Arrival to the Main Sequence Depends on Mass

HR diagram with pre-main-sequence tracks labeled by stellar mass from about 0.5 to 15 solar masses, with arrows and timescales showing that higher-mass stars move onto the main sequence much faster than lower-mass stars.

High-mass protostars reach the main sequence quickly; low-mass stars can spend tens of millions of years contracting toward it. Mass sets the evolutionary clock before stable fusion even begins.

Mass Determines the Path

Low mass (\(0.5\,M_\odot\)):

Slow, gentle evolution
Main sequence: \({\sim}60~\text{Gyr}\)
Modest red giant
White dwarf endpoint
No low-mass star has ever died of old age

High mass (\(10\,M_\odot\)):

Fast, dramatic
Main sequence: \({\sim}20~\text{Myr}\)
Supergiant phase
Core-collapse supernova
Neutron star or black hole

Mass determines the path and the pace. (Lecture 4: \(t_{\text{MS}} \propto M^{-2.5}\))

What the HR Diagram Cannot Explain

The diagram reveals patterns — but does not explain them. Five questions for Module 3:

Why does the main sequence exist? What creates a stable equilibrium lasting billions of years?
Why does mass determine position? What physics connects core mass to surface properties?
Why do stars become giants? Why expand rather than simply turn off?
What sets the maximum white dwarf mass? (\({\sim}1.4\,M_\odot\) — the Chandrasekhar limit)
What sets the minimum mass for a star? (Below \({\sim}0.08\,M_\odot\) → no hydrogen fusion)

If the diagram were a messy scatter, classification would suffice. Because its structure is tight and lawful, measurement alone is incomplete — physics is required.

Think–Pair–Share: Cluster Turnoff

A star cluster formed all at once, \(100~\text{Myr}\) ago. All stars started on the main sequence.

Which stars have already left the main sequence? Which remain? (Use \(t_{\text{MS}} \propto M^{-2.5}\) with \(t_\odot = 10~\text{Gyr}\).)
Sketch what the cluster’s HR diagram looks like — where is the “turnoff point”?
How would the diagram differ for a younger cluster (\(10~\text{Myr}\))?

30 seconds alone → 1 minute with a neighbor → share out

The main sequence does not end because stars get old in place. It turns off because the most massive stars have already evolved away.

Observable → Model → Inference

Observable: Apparent brightness (\(m\)), color or spectral type, parallax (\(\pi\)) for thousands of stars.

Model: Pogson relation converts flux ratios to magnitudes. Distance modulus (\(m - M = 5\log(d/10~\text{pc})\)) yields absolute magnitude. Stefan-Boltzmann (\(L = 4\pi R^2 \sigma T^4\)) connects \(L\), \(T\), \(R\). Mass-luminosity (\(L \propto M^{3.5}\)) connects mass to position.

Inference: Stars cluster into a main sequence (a mass sequence), a giant branch (evolved stars), and white dwarfs (dead cores). Mass — invisible on both axes — organizes everything. The patterns demand physics (Module 3).

The Module 2 Inference Chain — Complete

Lecture	Tool	Question	What It Unlocks
1	Parallax	How far?	Distance \(d\); then \(L = 4\pi d^2 F\)
2	Color/flux + Stefan-Boltzmann	How hot? How big?	Temperature \(T\); radius \(R\)
3	Spectral lines + Doppler	What’s it made of? Moving?	Composition; velocity \(v_r\)
4	Binary orbits + Kepler III	How heavy?	Mass \(M\); mass-luminosity
5	HR diagram	*What patterns emerge?*	All properties organized

From photons alone: distance, luminosity, temperature, radius, composition, mass — organized on one diagram.

Summary: Key Takeaways

The magnitude system — logarithmic, inverted: 5 mag = \(100\times\) flux. Absolute magnitude \(M\) removes distance.
The distance modulus — \(m - M = 5\log_{10}(d/10~\text{pc})\) — the inverse-square law in logarithmic form
The observer’s HR diagram — pure measurement reveals three structures: main sequence, giants, white dwarfs
The theorist’s HR diagram — Stefan-Boltzmann overlays lines of constant radius
The main sequence is a mass sequence — organized by \(L \propto M^{3.5}\)
The HR diagram is an evolution diagram — stars move; mass determines the path
Patterns demand physics — Module 3 explains why the main sequence exists

Mass is invisible — but decisive.

The Takeaway

If you forget everything else from today, remember this:

Every star has an address — and mass writes the zip code.

The HR diagram organizes all stellar measurements into one powerful plot. Mass — invisible on both axes — controls the entire structure.

Questions?

“Why do astronomers use this backward magnitude system?” — Historical inertia from Hipparchus (2nd century BCE). Modern astronomy formalized it but kept the convention.
“Can the HR diagram tell us a star’s age?” — Not for a single star (without models). But for a cluster — yes: the turnoff point is a clock.
“What happens between the main sequence and the white dwarf sequence?” — That’s Module 3.

Next Time: Stellar Interiors

Module 3 begins: from patterns to physics.

What holds a star up against gravity? (Hydrostatic equilibrium)
What powers a star for billions of years? (Nuclear fusion)
Why does the main sequence exist? Why do stars become giants?

The HR diagram told us what. Module 3 tells us why.

Exam 1 (March 5) covers Modules 1 and 2 — everything from dimensional analysis through the HR diagram.