Back to Portfolio
Performance Feedback Redesign A/B Pilot Analysis — 169 Participants | People & Culture, 2025
Data-informed design A/B methodology Performance analytics

Listen

About This Project

This analysis was developed independently to evaluate a proposed redesign of peer feedback methodology during the performance cycle. The existing format—a 5-point Likert agree/disagree scale—was producing heavily skewed data that offered little diagnostic value for development conversations or L&D resource targeting.

I designed and ran an A/B pilot with 169 participants comparing the existing scale against a dual-axis Frequency × Impact model, then analyzed results across overall averages, role type, location, and job family to identify where the new format produced meaningfully different signals.

This demonstrates: experimental learning design thinking, quantitative data analysis, iterative program development, and cross-functional insight synthesis. It was never formally a “project request”—I identified the problem, proposed the methodology, and ran the test.

The Problem

The existing peer feedback format used a 5-point agree/disagree Likert scale to evaluate core company values. The format produced limited diagnostic value:

RatingUsage
Strongly Agree85%+
Somewhat Agree~13%
Neutral / Disagree~2%

Ceiling concentration confirmed: 98%+ of ratings landed positive, making it impossible to distinguish high performers from adequate performers or to target L&D resources meaningfully.

The Hypothesis

A dual-axis Frequency × Impact rating scale (Option B) would produce more nuanced, actionable feedback data—enabling richer development conversations and a foundation for learning resource targeting.

A/B Test Design — 169 Participants

Option A (Current)Option B (Future State)
ScaleStrongly Agree → Strongly Disagree (5-pt)Frequency (4-pt) + Impact (4-pt), separately
n89 participants80 participants
Frequency labelsConsistently / Frequently / Sometimes / Rarely
Impact labelsExceptional / Effective / Developing / Limited

Option B: Results

Option B surfaced meaningful differentiation across values, job families, roles, and locations—data that could directly inform where to focus L&D resources. Scores varied by competency area, with some showing notable gaps between frequency of behavior and perceived impact—a pattern the original format made invisible.

Analysis Framework: Frequency × Impact Matrix

QuadrantPatternRecommended Intervention
High Freq / High ImpactConsistently performing with strong outcomesLeverage strengths; peer upskilling; explore leadership
High Freq / Low ImpactBehavior is frequent, but outcome is weakSkill refinement—how to perform the behavior more effectively
Low Freq / High ImpactHas capability; not applying it consistentlyHabit building—perform behavior more regularly
Low Freq / Low ImpactRarely performing, limited impactFoundational skill-building, coaching, targeted feedback

Open-Text Themes — 100 Responses Across A & B

Key finding: Strongest responses used specific situations with measurable impact. Weakest were generic praise with no actionable detail—reinforcing the need for structured behavioral frameworks in feedback prompts.

What I Built Beyond the Pilot


This Demonstrates