Data Preparation Guide

Follow these simple rules so WaterfallBridge can read your data correctly and produce accurate analysis.

← Back to Home

Why Data Format Matters

WaterfallBridge analyzes your spreadsheet automatically. To do this reliably, it needs the data in a flat, one-dimensional table — the same format used by databases, pivot tables, and BI tools. Think of it as "one column = one field, one row = one record."

1

Use a Flat (One-Dimensional) Table

Each column must contain one type of data only. Values of the same field should run vertically down the rows, not horizontally across columns. This is the "tidy data" layout that databases and analysis tools expect.

✗ Incorrect — Crosstab / Pivot Layout
ProductJanFebMar
Widget A10012090
Widget B200180210

Months are spread across columns. Each column mixes "month" and "revenue" concepts, making automated parsing impossible.

✓ Correct — Flat / Database Layout
MonthProductRevenue
JanWidget A100
FebWidget A120
MarWidget A90
JanWidget B200
FebWidget B180
MarWidget B210

One column per field, one row per record. Every cell in a column is the same data type.

Tip: If your data currently looks like a pivot table, use Excel's Unpivot feature (Power Query > Transform > Unpivot Columns) to convert it into a flat layout quickly.
2

No Merged Cells

Merged cells look tidy to humans but break machine reading. When cells are merged, many rows lose their value and the program cannot determine which data belongs to which record. Every cell must hold its own value.

✗ Incorrect — Merged Cells
RegionProductRevenue
EastWidget A500
Widget B300
Widget C450

"East" spans three rows. The program sees only one value for the first row; the other two rows appear blank.

✓ Correct — Repeat Values
RegionProductRevenue
EastWidget A500
EastWidget B300
EastWidget C450

Each row explicitly states its Region. No ambiguity, and the program reads every record correctly.

Tip: To unmerge quickly, select the merged area → Home > Merge & Center (click to unmerge) → select the blank cells → type = and point to the cell above → press Ctrl+D to fill down.
3

Formulas: Same-Row Only, Basic Arithmetic

You can use formulas within a row to derive new columns (e.g. =D3*E3 for Quantity × Price). However, do not reference other rows or other sheets. Only the basic operators + − * / ( ) are supported.

✗ Incorrect — Cross-Row / Cross-Sheet References
BCD (Revenue)
Row 3Widget A100=C3+C4
Row 4Widget B200=Sheet2!D4

=C3+C4 references another row; =Sheet2!D4 references another sheet. Both will cause errors.

✓ Correct — Same-Row Arithmetic
B (Qty)C (Price)D (Revenue)
Row 35010=B3*C3
Row 48012=B4*C4

Each formula only references cells in its own row and uses basic arithmetic. This is safe and reliable.

Allowed operators:+-*/() — no SUM(), no VLOOKUP(), no named ranges, no cross-sheet references.
4

Keep the Sheet Clean — No Stray Data

Do not place comments, totals, notes, or any other content in columns or rows that are outside the main data table. The program scans every non-empty cell; stray data will be misinterpreted as part of the dataset.

✗ Incorrect — Extra Data Outside the Table
MonthProductRevenue← Note
JanWidget A100top seller!
FebWidget A120
Grand Total220

Column D contains a personal note; the last row is a manual total. Both confuse the parser.

✓ Correct — Data Only
MonthProductRevenue
JanWidget A100
FebWidget A120

Only the defined columns and data rows remain. No extra text, no summary rows.

Tip: If you need notes, keep them on a separate sheet. The Data sheet should contain only the data table — nothing more.
5

Consistent Data Types Within Each Column

Every value in a column should be the same type — all numbers, all text, or all dates. Mixing types (e.g. numbers and text in the same column) can lead to misclassification or calculation errors.

✗ Incorrect — Mixed Types
MonthProductRevenue
JanWidget A100
FebWidget AN/A
MarWidget Asee note

"N/A" and "see note" are text in a numeric column. The program cannot calculate with these values.

✓ Correct — Uniform Types
MonthProductRevenue
JanWidget A100
FebWidget A0
MarWidget A90

Use 0 for zero revenue, or leave the cell empty. Never substitute text for a numeric value.

Tip: If a value is missing, leave the cell blank or enter 0 (for numeric fields). Avoid placeholders like "N/A", "TBD", or "-".
6

Start Data from the Designated Rows

WaterfallBridge expects the data to begin in a specific layout: Row 1 for Data Types, Row 2 for Field Names, and Row 3 onward for data records. Do not insert blank rows, title rows, or instructions above the table.

✗ Incorrect — Extra Header Rows
Sales Report Q1 2025
(all figures in USD)
MonthProductRevenue
JanWidget A100

Two extra rows above the actual header shift the data layout and confuse the program.

✓ Correct — Standard Layout
Row 1 (Type)DimensionDimensionSumY
Row 2 (Name)MonthProductRevenue
Row 3+ (Data)JanWidget A100

Row 1 = data types, Row 2 = field names, Row 3 onward = data. No extra titles or blank rows.

Tip: Use the "New Bridge File" button to generate a correctly formatted template. Then fill in your data starting from Row 3.

Quick Checklist Before You Analyze

Ready to get started? Jump back to the instructions or try the demo.

View Instructions Try Demo