Statistical Analysis Prompt Templates
AI prompt templates for statistical analysis. Perform hypothesis testing, regression, and more.
Overview
Statistical analysis helps you move from 'I think this is true' to 'I can prove this is true.' These prompts cover the most common statistical tests and analyses, with plain-English explanations of what the results mean. You don't need to be a statistician to use them, and you'll still get statistically sound answers.
Best Practices
State your hypothesis clearly before diving into tests
Include sample size and data type (continuous, categorical, ordinal)
Mention your significance level preference (usually 0.05)
Provide context about what decision this analysis will inform
Ask for interpretation in plain English, not just p-values
Prompt Templates
1. Hypothesis Test Selector
Help me choose the right statistical test. Research question: [QUESTION] Data I have: [DESCRIBE_YOUR_DATA] Sample sizes: [N_PER_GROUP] Variable types: [CONTINUOUS/CATEGORICAL/ORDINAL] Groups: [INDEPENDENT/PAIRED] What test should I use and why?
Help me choose the right statistical test. Research question: Did our new onboarding flow improve activation rates? Data I have: Activation (yes/no) for control group and test group. Sample sizes: 2,500 per group. Variable types: Categorical (binary outcome). Groups: Independent (different users).
Use a Chi-square test for independence (or equivalently, a two-proportion z-test).
Why this test:
- Categorical outcome (activated yes/no) with two independent groups
- Large sample size (n > 30 per group), so normal approximation works
- You're comparing proportions between groups
Alternative: Fisher's exact test if any cell count drops below 5, but with 2,500 per group that's unlikely.
Python code:
```python
from scipy.stats import chi2_contingency
import numpy as np
# Create contingency table
# [[control_activated, control_not], [test_activated, test_not]]
contingency = np.array([[320, 2180], [385, 2115]])
chi2, p_value, dof, expected = chi2_contingency(contingency)
print(f"Chi-square: {chi2:.2f}, p-value: {p_value:.4f}")
```
How to interpret: If p < 0.05, the difference in activation rates is statistically significant. But also calculate the actual difference (test rate minus control rate) and a confidence interval to understand practical significance.
When you need help picking the right statistical test for your situation
- Statistical significance isn't the same as practical significance. A tiny effect can be significant with large samples
- Always report effect size alongside p-values
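To pair the chi-square test above with an effect-size estimate, you can compute the absolute lift and a Wald confidence interval for the difference in proportions directly. A minimal sketch using the example counts from the contingency table above (320/2,500 control, 385/2,500 test):

```python
import numpy as np
from scipy.stats import norm

# Counts from the example contingency table above
control_conv, control_n = 320, 2500
test_conv, test_n = 385, 2500

p_c = control_conv / control_n  # 12.8% control activation
p_t = test_conv / test_n        # 15.4% test activation
diff = p_t - p_c                # absolute lift in percentage points

# Wald 95% confidence interval for the difference in proportions
se = np.sqrt(p_t * (1 - p_t) / test_n + p_c * (1 - p_c) / control_n)
z = norm.ppf(0.975)
ci_low, ci_high = diff - z * se, diff + z * se

print(f"Lift: {diff:.3f} (95% CI: {ci_low:.3f} to {ci_high:.3f})")
```

Here the interval excludes zero, which agrees with a significant chi-square result, and the lower bound tells you the smallest lift the data plausibly supports.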
2. Regression Analysis
Help me build and interpret a regression model. Goal: [PREDICT_Y_OR_UNDERSTAND_RELATIONSHIPS] Dependent variable: [Y_VARIABLE] Independent variables: [X_VARIABLES] Data context: [BUSINESS_CONTEXT] Concerns: [MULTICOLLINEARITY/OUTLIERS/NON_LINEARITY] Tool: [PYTHON/R/EXCEL] Provide code and interpretation guidance.
Help me build and interpret a regression model. Goal: Understand what factors drive customer lifetime value. Dependent variable: LTV (continuous, in dollars). Independent variables: months_active, support_tickets, plan_tier (Basic/Pro/Enterprise), referral_source. Data context: SaaS subscription business, 5000 customers. Concerns: Plan tier might correlate with months_active. Tool: Python.
```python
import pandas as pd
import statsmodels.api as sm
from statsmodels.stats.outliers_influence import variance_inflation_factor

# Prepare data: encode categorical variables
df_model = pd.get_dummies(df, columns=['plan_tier', 'referral_source'], drop_first=True)

# Define X and y (cast dummies to float so statsmodels accepts them)
X = df_model[['months_active', 'support_tickets', 'plan_tier_Pro',
              'plan_tier_Enterprise', 'referral_source_organic',
              'referral_source_paid']].astype(float)
X = sm.add_constant(X)  # Add intercept
y = df_model['ltv']

# Fit model
model = sm.OLS(y, X).fit()
print(model.summary())

# Check multicollinearity
vif_data = pd.DataFrame()
vif_data['feature'] = X.columns
vif_data['VIF'] = [variance_inflation_factor(X.values, i) for i in range(X.shape[1])]
print(vif_data)
```
Interpretation guide:
- R-squared: what % of LTV variance is explained by these factors
- Coefficients: how much LTV changes per unit change in each variable
  - If the months_active coefficient is 45, each additional month adds $45 to LTV
  - For plan_tier_Enterprise, the coefficient shows the difference vs. Basic (the reference category)
- p-values: if < 0.05, that variable has a statistically significant relationship with LTV
- VIF: if > 5, you have multicollinearity concerns. Consider dropping one of the correlated variables

Your concern about plan_tier and months_active: check the VIF. If both are important, you might keep both but note the correlation in your interpretation.
When you want to understand relationships between variables or predict an outcome
- Check residual plots to verify assumptions (normality, homoscedasticity)
- For categorical variables, remember coefficients are relative to the dropped category
3. Sample Size Calculator
Calculate the sample size I need for this study. Test type: [A/B_TEST/SURVEY/EXPERIMENT] Baseline metric: [CURRENT_VALUE] Minimum detectable effect: [SMALLEST_CHANGE_WORTH_DETECTING] Significance level: [USUALLY_0.05] Power: [USUALLY_0.80] Any constraints on sample size or test duration?
Calculate sample size for an A/B test. Test type: A/B test on website. Baseline metric: 3.2% conversion rate. Minimum detectable effect: 10% relative improvement (so detecting a move to 3.52%). Significance level: 0.05. Power: 0.80. Constraint: We get about 10,000 visitors per week.
```python
from statsmodels.stats.power import NormalIndPower
from statsmodels.stats.proportion import proportion_effectsize
baseline = 0.032 # 3.2%
new_rate = 0.0352 # 3.52% (10% relative lift)
# Calculate effect size (Cohen's h for proportions)
effect_size = proportion_effectsize(new_rate, baseline)
# Calculate required sample size per group
analysis = NormalIndPower()
n_per_group = analysis.solve_power(
    effect_size=effect_size,
    alpha=0.05,
    power=0.80,
    alternative='two-sided'
)
print(f"Required per group: {n_per_group:.0f}")
print(f"Total required: {n_per_group * 2:.0f}")
```
Result: You need approximately 24,800 per group, or about 49,700 total visitors.
With 10,000 visitors per week split 50/50 between control and test, that's:
- 5,000 per group per week
- About 5 weeks to reach required sample size
Reality check: A 10% relative lift on 3.2% is a small absolute change (0.32 percentage points). Detecting small changes requires large samples. If you can only run for 2-3 weeks, consider either:
1. Testing a bigger change (maybe 20% relative lift)
2. Accepting lower power (higher risk of missing a real effect)
3. Using a different, more sensitive metric (one with lower variance relative to the effect you expect)
Before starting any test, to know how long to run it
- Never stop a test early just because results look significant. That inflates false positives
- If you can't get enough sample, consider testing bigger changes or more sensitive metrics
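You can also invert the calculation: given a fixed test duration, solve for the smallest lift you could detect. A sketch under the same assumptions as the example above (3.2% baseline, 10,000 visitors per week split 50/50), using the same statsmodels power tools:

```python
import numpy as np
from statsmodels.stats.power import NormalIndPower

baseline = 0.032
weeks = 3
n_per_group = weeks * 5000  # 10k visitors/week, split 50/50

# Solve for the smallest detectable effect size (Cohen's h) at this sample
h = NormalIndPower().solve_power(
    effect_size=None,
    nobs1=n_per_group,
    alpha=0.05,
    power=0.80,
    alternative='two-sided'
)

# Invert Cohen's h back to a conversion rate
detectable_rate = np.sin(np.arcsin(np.sqrt(baseline)) + h / 2) ** 2
relative_lift = detectable_rate / baseline - 1
print(f"Detectable in {weeks} weeks: {detectable_rate:.4f} "
      f"({relative_lift:.0%} relative lift)")
```

With three weeks of traffic, only a lift of roughly 18-19% relative is detectable at 80% power, which is why the advice above suggests testing a bigger change if the timeline is short.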
4. Correlation vs Causation Analysis
I found a correlation between [VARIABLE_A] and [VARIABLE_B]. Help me think through whether this could be causal. Correlation: [STRENGTH_AND_DIRECTION] Context: [BUSINESS_OR_RESEARCH_CONTEXT] Possible confounders: [WHAT_ELSE_MIGHT_EXPLAIN_THIS] Data available: [WHAT_DATA_DO_YOU_HAVE] What additional analysis would strengthen or weaken a causal claim?
I found a correlation between number of customer support tickets and churn. Customers who filed more tickets were more likely to churn. Correlation: r = 0.45 (moderate positive). Context: SaaS product, trying to reduce churn. Possible confounders: Maybe unhappy customers both file tickets AND churn, but tickets don't cause churn. Data available: Ticket timestamps, churn date, product usage data, plan tier, customer tenure.
Good instinct to question causality. Here's the analysis framework:

Possible explanations:
1. Tickets cause churn: a frustrating support experience drives people away
2. An underlying problem causes both: product issues lead to tickets AND churn
3. Reverse causation: customers planning to churn file tickets (maybe to get refunds or complain)

Analyses to run:

1. **Timeline analysis**: Do tickets come before or after signals of disengagement?
```python
# Check whether tickets precede reduced usage
df['days_ticket_before_usage_drop'] = (
    df['first_usage_drop_date'] - df['first_ticket_date']
).dt.days
```
If tickets come after usage drops, tickets are a symptom, not a cause.

2. **Control for confounders**: Run a regression with product usage metrics
```python
# Does ticket count still predict churn after controlling for usage?
import statsmodels.formula.api as smf
model = smf.logit(
    'churn ~ tickets + monthly_logins + feature_adoption + plan_tier',
    data=df
).fit()
```
If the ticket coefficient drops substantially, usage is the real driver.

3. **Segment analysis**: Check whether the correlation holds across different groups
- Does it hold for all plan tiers?
- Does it hold for new vs. established customers?
- Does it hold for different ticket types (bugs vs. how-to questions)?

4. **Natural experiment**: Look for situations where ticket volume changed for external reasons
- Did a support policy change affect ticket volume without changing the product?
- How did churn respond?

My hypothesis: tickets are likely a symptom of problems (explanation 2). The actionable insight isn't 'reduce tickets' but 'fix the issues that generate tickets.'
When you've found a correlation and need to understand if it's actionable
- The phrase 'correlation doesn't imply causation' is true, but correlations are still useful signals
- Think about what experiment you would run if you could. That clarifies what causal claim you're making
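The segment-analysis step in the example can be sketched as a per-group correlation in pandas. The column names (plan_tier, tickets, churned) are assumptions matching the scenario, and the data here is synthetic so the snippet runs on its own:

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(7)

# Synthetic stand-in: in practice these columns come from your own data
n = 3000
df = pd.DataFrame({
    'plan_tier': rng.choice(['Basic', 'Pro', 'Enterprise'], n),
    'tickets': rng.poisson(3, n),
})
# Make churn loosely track ticket volume so a correlation is visible
churn_prob = np.clip(0.05 + 0.04 * df['tickets'], 0, 0.9)
df['churned'] = rng.random(n) < churn_prob

# Point-biserial correlation between tickets and churn, per segment
rs = {
    tier: g['tickets'].corr(g['churned'].astype(float))
    for tier, g in df.groupby('plan_tier')
}
for tier, r in rs.items():
    print(f"{tier}: r = {r:.2f}")
```

If the correlation appears in every tier, a tier-specific confounder is less likely; if it vanishes in one segment, that segment is where to dig.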
Common Mistakes to Avoid
Running multiple tests and reporting only the significant ones (p-hacking). This inflates false positive rates
Confusing statistical significance with practical importance. A statistically significant result can be too small to matter
Using the wrong test for your data type (like t-tests for categorical outcomes)
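A standard guard against the multiple-testing problem in the first mistake is to adjust p-values across every test you actually ran, not just the ones that looked promising. A sketch using statsmodels' Benjamini-Hochberg correction on hypothetical p-values:

```python
from statsmodels.stats.multitest import multipletests

# Hypothetical p-values from ten metrics checked in one experiment
p_values = [0.003, 0.021, 0.048, 0.12, 0.25, 0.31, 0.44, 0.52, 0.71, 0.89]

# Benjamini-Hochberg controls the false discovery rate across all tests
reject, p_adjusted, _, _ = multipletests(p_values, alpha=0.05, method='fdr_bh')

for p, p_adj, sig in zip(p_values, p_adjusted, reject):
    print(f"raw p={p:.3f} -> adjusted p={p_adj:.3f} {'*' if sig else ''}")
```

Note that the raw p-values of 0.021 and 0.048 no longer clear the 0.05 bar after adjustment; only the 0.003 result survives.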
Related Templates
SQL Query Prompt Templates
AI prompt templates for writing SQL queries. Create SELECT, JOIN, aggregate, and complex queries.
Data Analysis Prompt Templates
AI prompt templates for data analysis. Extract insights, identify patterns, and interpret results.
Data Visualization Prompt Templates
AI prompt templates for data visualization. Create effective charts, dashboards, and visual reports.