Sample data

Download all tests (ZIP) — contains a tests/ folder with all CSV, spec, and expected files.

Or browse and download individual files below. Each test has a .csv (input data), a .spec (merger specification), and a .expected (expected merged output, data only).

These are basic examples. Additional, more complex examples (for example, longer pipelines and inputs with more top-level children) may be posted later on the course website.

What each test does:

TestDescriptionCSVSpecExpected
t01Sort by column 1 (text)t01.csvt01.spect01.expected
t02Filter column 2 (num) -ge 20t02.csvt02.spect02.expected
t03Unique on column 2 (num)t03.csvt03.spect03.expected
t04Two chains: sort (lines 1–3) + filter -ge 25 (lines 4–6)t04.csvt04.spect04.expected
t05Sort by column 2 (num)t05.csvt05.spect05.expected
t06Pipeline: sort | filter -ge 15t06.csvt06.spect06.expected
t07Sort -r (reverse) by column 1t07.csvt07.spect07.expected
t08Filter column 2 (num) -l 30 (less than)t08.csvt08.spect08.expected
t09Filter column 2 (num) -le 25t09.csvt09.spect09.expected
t10Unique on column 1 (text)t10.csvt10.spect10.expected
t11Pipeline: sort | filter -ge 20 (8 rows)t11.csvt11.spect11.expected
t12Sort column 2 (num) -r (reverse)t12.csvt12.spect12.expected
t13Filter column 1 (text) -g "b" (greater than)t13.csvt13.spect13.expected
t14Pipeline: sort | unique (column 2)t14.csvt14.spect14.expected
t15Two chains: sort (1–3) + filter -ge 30 (4–6)t15.csvt15.spect15.expected
t16Pipeline: filter -ge 10 | sortt16.csvt16.spect16.expected
t17Unique on column 2 (num, with duplicates)t17.csvt17.spect17.expected
t18Sub-merger test with local line numbers inside the sub-spect18.csvt18.spect18.expected
t19Sort by column 2 (date)t19.csvt19.spect19.expected
t20Pipeline: sort | filter -ge 10 | uniquet20.csvt20.spect20.expected

← Materials · ← Homework 1