The Choice of Frequency and Annualization of Returns

4.11. The Choice of Frequency and Annualization of Returns#

4.11.1. 🎯 Learning Objectives#

By the end of this notebook, you will be able to:

Understand frequency choices — Why we use daily or monthly data in practice
Apply standard annualization — Convert monthly/daily statistics to annual terms
Aggregate returns with groupby — Compute exact multi-period returns without approximation
Compare methods — Know when approximation is acceptable vs. exact aggregation

4.11.3. 🛠️ Setup #

#@title 🛠️ Setup: Run this cell first (click to expand)

# Core libraries
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
%matplotlib inline

# Set consistent plot style
plt.style.use('seaborn-v0_8-whitegrid')
plt.rcParams['figure.figsize'] = [10, 6]
plt.rcParams['font.size'] = 12

# Suppress warnings for cleaner output
import warnings
warnings.filterwarnings('ignore')

print("✅ Libraries loaded successfully!")

4.11.4. Why Frequency Matters #

4.11.4.1. Data Comes at a Specific Frequency#

Financial data is always structured at a particular frequency:

Frequency	What it measures
Daily	Return from one closing price to the next
Monthly	Return from last trading day of one month to the next
Annual	Return over a calendar or fiscal year

This choice is arbitrary — transactions happen every millisecond!

4.11.4.2. Why Monthly or Daily?#

In this course (and in practice), we work with monthly or daily data:

Manageable size — Higher frequencies create massive datasets
Industry standard — Most practitioners use these frequencies
Sufficient data — Lower frequencies (annual) give too few observations

💡 Key Insight:

We analyze at monthly frequency, then annualize results. Annual numbers are easier to interpret and compare.

4.11.4.3. Load Monthly Data#

Let’s load a dataset of monthly global financial returns.

# Load monthly global financial data
url = "https://raw.githubusercontent.com/amoreira2/UG54/main/assets/data/GlobalFinMonthly.csv"
Data = pd.read_csv(url, na_values=-99)
Data['Date'] = pd.to_datetime(Data['Date'])
Data = Data.set_index(['Date'])

print(f"Data range: {Data.index.min().date()} to {Data.index.max().date()}")
print(f"Columns: {list(Data.columns)}")
Data.head()

4.11.5. Standard Annualization #

4.11.5.1. The Quick and Dirty Method#

Standard annualization formulas (from monthly data):

Statistic	Formula
Mean	\(\hat{\mu}_A = 12 \times \hat{\mu}_M\)
Variance	\(\hat{\sigma}^2_A = 12 \times \hat{\sigma}^2_M\)
Std Dev	\(\hat{\sigma}_A = \sqrt{12} \times \hat{\sigma}_M\)

These assume returns are i.i.d. (independent and identically distributed).

# Monthly statistics for market returns
mean_monthly = Data['MKT'].mean()
std_monthly = Data['MKT'].std()
var_monthly = Data['MKT'].var()

# Annualize using standard formulas
mean_annual = mean_monthly * 12
var_annual = var_monthly * 12
std_annual = std_monthly * np.sqrt(12)

print("━" * 50)
print("Market Return Statistics")
print("━" * 50)
print(f"Monthly mean:      {mean_monthly:>10.4%}")
print(f"Annualized mean:   {mean_annual:>10.2%}")
print("━" * 50)
print(f"Monthly std:       {std_monthly:>10.4%}")
print(f"Annualized std:    {std_annual:>10.2%}")
print("━" * 50)

4.11.5.2. Why Is This an Approximation?#

Annual returns compound, they don’t simply add:

\[R_A = (1+R_1)(1+R_2)\cdots(1+R_{12}) - 1\]

If returns were truly i.i.d., the exact formulas would be:

\[\mu_A = (1+\mu_M)^{12} - 1\]

\[\sigma_A^2 = [\sigma^2_M + (1+\mu_M)^2]^{12} - (1+\mu_M)^{24}\]

📌 Remember:

The standard annualization is an approximation that works well when:

Monthly returns are small (so \((1+r) \approx 1\))

You’re comparing assets at the same frequency

Always use standard annualization unless told otherwise.

4.11.5.3. Why Use the Approximation?#

Despite being technically “wrong,” we use it because:

Industry standard — Everyone uses it, so results are comparable
Good intuition — Gives correct order of magnitude
Easy t-statistics — Works well with monthly data for inference
Consistent comparisons — Fine if you don’t mix frequencies

4.11.6. Exact Aggregation with Groupby #

4.11.6.1. When You Need Exact Results#

Sometimes you want actual annual returns, not approximations.

To get exact annual returns, we must compound monthly returns:

\[R_{year} = \prod_{t \in year}(1 + R_t) - 1\]

This requires grouping data by year and multiplying gross returns.

🐍 Python Insight: groupby()

The pandas groupby() method is one of the most powerful tools for data analysis. It follows the Split → Apply → Combine pattern:
df.groupby(grouping_key).aggregate_function()
Step

Action

Example

Split

Divide data into groups

df.groupby(df.index.year)

Apply

Apply function to each group

.mean(), .sum(), .prod()

Combine

Merge results back together

Returns one row per group

We’ll use this extensively throughout the course!

4.11.6.2. The Groupby Method#

Pandas groupby lets us aggregate data by groups. Here’s the logic:

Step	Code	What it does
1	`(Data + 1)`	Convert net returns to gross returns
2	`.groupby(Data.index.year)`	Group by calendar year
3	`.prod()`	Multiply all values within each group
4	`- 1`	Convert back to net returns

# Aggregate monthly returns to annual returns (exact method)
DataYear = (Data + 1).groupby(Data.index.year).prod() - 1

print("Annual returns (first 5 years):")
DataYear.head()

4.11.6.3. Comparing Approximation vs. Exact#

# Compare the two methods
approx_mean = Data['MKT'].mean() * 12
exact_mean = DataYear['MKT'].mean()

approx_std = Data['MKT'].std() * np.sqrt(12)
exact_std = DataYear['MKT'].std()

print("━" * 50)
print("Comparison: Approximation vs. Exact Aggregation")
print("━" * 50)
print(f"Mean (approx):     {approx_mean:>10.2%}")
print(f"Mean (exact):      {exact_mean:>10.2%}")
print(f"Difference:        {abs(approx_mean - exact_mean):>10.2%}")
print("━" * 50)
print(f"Std (approx):      {approx_std:>10.2%}")
print(f"Std (exact):       {exact_std:>10.2%}")
print(f"Difference:        {abs(approx_std - exact_std):>10.2%}")
print("━" * 50)

💡 Key Insight:

The approximation and exact methods give similar results for typical returns. Use the approximation for quick analysis; use exact aggregation for final reports.

4.11.6.4. Visualizing Annual Returns#

# Plot annual market returns
fig, ax = plt.subplots(figsize=(12, 5))

colors = ['green' if x >= 0 else 'red' for x in DataYear['MKT']]
ax.bar(DataYear.index, DataYear['MKT'], color=colors, alpha=0.7)

ax.axhline(0, color='black', linewidth=0.5)
ax.axhline(DataYear['MKT'].mean(), color='blue', linestyle='--', 
           label=f"Mean = {DataYear['MKT'].mean():.1%}")

ax.set_xlabel('Year')
ax.set_ylabel('Annual Return')
ax.set_title('Market Annual Returns', fontsize=14, fontweight='bold')
ax.legend()

plt.tight_layout()
plt.show()

4.11.7. 📝 Exercises #

4.11.7.1. Exercise 1: Warm-up — Annualization Practice#

🔧 Exercise:

A stock has the following daily statistics:

Mean daily return: 0.04%

Daily standard deviation: 1.8%

Using standard annualization (252 trading days):

Compute the annualized mean return

Compute the annualized volatility

Compute the annualized Sharpe Ratio (assume rf = 0)

# Your code here
mean_daily = 0.0004  # 0.04%
std_daily = 0.018    # 1.8%

# Annualize

💡 Click to see solution

mean_daily = 0.0004  # 0.04%
std_daily = 0.018    # 1.8%

# Annualize
mean_annual = mean_daily * 252
std_annual = std_daily * np.sqrt(252)
sharpe_annual = mean_annual / std_annual

print(f"Annualized mean: {mean_annual:.2%}")
print(f"Annualized std: {std_annual:.2%}")
print(f"Annualized Sharpe: {sharpe_annual:.2f}")

4.11.7.2. Exercise 2: Extension — Aggregate to Quarterly#

🤔 Think and Code:

Instead of annual returns, compute quarterly returns:

Use Data.index.to_period('Q') to group by quarter

Compute exact quarterly returns using the compounding formula

What is the mean and std of quarterly market returns?

How do these compare to monthly mean × 3 and monthly std × √3?

# Your code here

💡 Click to see solution

# Aggregate to quarterly
DataQuarter = (Data + 1).groupby(Data.index.to_period('Q')).prod() - 1

# Exact quarterly statistics
exact_q_mean = DataQuarter['MKT'].mean()
exact_q_std = DataQuarter['MKT'].std()

# Approximation from monthly
approx_q_mean = Data['MKT'].mean() * 3
approx_q_std = Data['MKT'].std() * np.sqrt(3)

print(f"Quarterly mean (exact): {exact_q_mean:.2%}")
print(f"Quarterly mean (approx): {approx_q_mean:.2%}")
print(f"Quarterly std (exact): {exact_q_std:.2%}")
print(f"Quarterly std (approx): {approx_q_std:.2%}")

4.11.7.3. Exercise 3: Open-ended — Best and Worst Years#

🤔 Think and Code:

Using the annual returns data (DataYear):

Find the 5 best and 5 worst years for market returns

Create a bar chart showing only these 10 extreme years

Research: What major events caused the worst years?

Calculate: What fraction of years had negative returns?

# Your code here

💡 Click to see solution

# Best and worst years
best_5 = DataYear['MKT'].nlargest(5)
worst_5 = DataYear['MKT'].nsmallest(5)

print("Best 5 years:")
print(best_5)
print("\nWorst 5 years:")
print(worst_5)

# Combine for plotting
extreme_years = pd.concat([worst_5, best_5]).sort_index()

fig, ax = plt.subplots(figsize=(10, 5))
colors = ['green' if x >= 0 else 'red' for x in extreme_years]
ax.bar(extreme_years.index.astype(str), extreme_years, color=colors)
ax.set_title('Most Extreme Market Years')
ax.set_ylabel('Annual Return')
plt.xticks(rotation=45)
plt.show()

# Fraction negative
pct_negative = (DataYear['MKT'] < 0).mean()
print(f"\nFraction of negative years: {pct_negative:.1%}")

4.11.8. 🧠 Key Takeaways #

Frequency is arbitrary — We use monthly/daily for practical reasons (data size, industry standard)
Standard annualization: Mean × 12, Std × √12 (from monthly) — An approximation, but the industry standard
Exact aggregation uses groupby and compounding: \((1+R_1)(1+R_2)\cdots - 1\)
Use approximation for quick analysis; use exact when precision matters
Never mix frequencies — Don’t compare annual real estate returns to monthly stock returns using approximation

Next Notebook: We’ll explore how to access financial data through APIs — FRED, Ken French, and more.

Step	Action	Example
Split	Divide data into groups	`df.groupby(df.index.year)`
Apply	Apply function to each group	`.mean()`, `.sum()`, `.prod()`
Combine	Merge results back together	Returns one row per group