Aggregates

Cascade’s query builder isn’t only for finding records — it crunches numbers too. Sum a column, group by category, filter by the aggregate result, sort the report. The same vocabulary works on MongoDB and Postgres; you don’t write driver-specific syntax for the common cases.

This page covers the whole single-chain aggregation family: simple aggregates (count/sum/avg/min/max/countDistinct), grouping (groupBy and the portable date-bucketing groupByDate), filtering grouped results (having), the driver-agnostic $agg helpers, and the typed column-expression DSL for summing computed values. Multi-stage analytics (window functions, cross-table rollups) is the Expressions guide and the Reporting recipe.

Simple aggregates — no grouping

Each of these runs one query and returns a single Promise<number>:

const total     = await Order.query().count();
const revenue   = await Order.query().sum("amount");
const avgAmount = await Order.query().avg("amount");
const cheapest  = await Order.query().min("amount");
const priciest  = await Order.query().max("amount");

They compose with .where(...) upstream — the filter applies before the aggregate:

const completedRevenue = await Order.where("status", "completed").sum("amount");

`countDistinct` and `distinct`

const uniqueCustomers = await Order.query().countDistinct("customerId"); // number
const categories = await Product.query().distinct<string>("category");   // string[]

countDistinct returns the count of distinct values; distinct returns the distinct values themselves as an array. On MongoDB, countDistinct ignores nulls by default; on Postgres it’s a plain COUNT(DISTINCT column).

Grouping with `groupBy`

The basic case

const grouped = await Order.query().groupBy("category").get();

groupBy(field) with no aggregates groups records by that field; the result shape is driver-dependent and rarely what you want on its own. Grouping earns its keep when you attach aggregates — next.

Multiple group fields

Pass an array to group by more than one column:

await Order.query().groupBy(["category", "status"]).get();

With aggregate operations — the useful form

This is the form you’ll reach for most. groupBy(fields, aggregates) takes the group field(s) plus an object mapping output names to aggregate expressions:

import { $agg } from "@warlock.js/cascade";

const stats = await Order.query()
  .groupBy("category", {
    total: $agg.sum("amount"),
    count: $agg.count(),
    avg: $agg.avg("amount"),
  })
  .get();
// each row: { category, total, count, avg }

The $agg.* helpers produce driver-agnostic aggregate expressions — the same code runs on MongoDB and Postgres. Cascade translates each to the native form ({ $sum: "$amount" } on Mongo, SUM(amount) on Postgres).

The driver-agnostic set is $agg.count(), $agg.sum(field), $agg.avg(field), $agg.min(field), $agg.max(field). These five behave identically on both drivers.

$agg.distinct, $agg.floor, $agg.first, $agg.last are MongoDB-only. On Postgres they throw at the .groupBy() call — fail-fast, with an actionable message:

Order.query().groupBy("category", { models: $agg.distinct("sku") });
// On Postgres throws:
// "$agg.distinct is MongoDB-only and not supported on a PostgreSQL groupBy.
//  Use selectRaw / havingRaw with the equivalent SQL (window function /
//  DISTINCT / FLOOR) if you need it here."

The reasons are semantic, not laziness: Mongo’s $distinct returns the array of distinct values while SQL DISTINCT is a set quantifier (not a scalar aggregate); $floor is a scalar function, not an aggregate; $first/$last need an ordering context SQL window functions require and $agg doesn’t carry. Rather than silently emit a different-meaning query, Postgres throws and points you at the escape hatch.

Driver-specific aggregates — the raw escape hatch

When you need an aggregate $agg.* doesn’t cover, pass a raw expression in the aggregates object. On Postgres that’s a SQL string:

await Order.query()
  .groupBy("category", {
    total: "SUM(amount)",
    weighted: "SUM(amount * weight)",
  })
  .get();

On MongoDB it’s an operator object ({ total: { $sum: "$amount" } }). The raw form is driver-specific by definition — it doesn’t carry across drivers, so reach for $agg.* first and drop to raw only for what it can’t express. (Passing a MongoDB operator object to a Postgres query throws a clear “not portable” error rather than emitting broken SQL.)

`groupByRaw` — non-column grouping

To group by an expression rather than a plain column (DATE(created_at), a JSON path), use groupByRaw:

await Order.query()
  .groupByRaw("DATE(created_at)")
  .get();

Driver-specific; use sparingly. The structured groupBy form carries across drivers, groupByRaw doesn’t.

Date-bucketed grouping — `groupByDate`

Time-series reports — “revenue per month”, “signups per day” — need to group rows into date buckets. Doing that portably is fiddly: Postgres wants date_trunc('month', column) and MongoDB wants $dateTrunc. groupByDate is the cross-driver shape that hides the difference:

import { $agg } from "@warlock.js/cascade";

const monthly = await Order.query()
  .groupByDate("created_at", "month", {
    revenue: $agg.sum("amount"),
    orders: $agg.count(),
  })
  .get();
// each row: { created_at, revenue, orders } — created_at is the bucket start

groupByDate(column, unit, aggregates?) buckets column to the given granularity and groups by the truncated value, exposing the bucket under the column’s own name in the result. The unit is one of "day" / "week" / "month" / "year". The optional aggregates follow the exact same rules as the two-arg groupBy (the $agg.* helpers or driver-native raw expressions).

Under the hood:

Postgres — date_trunc('<unit>', "column")
MongoDB — { $dateTrunc: { date: "$column", unit } } in the $group _id

The same call runs on both drivers. Combine it with orderBy on the bucket column for a chronological report:

await Order.query()
  .where("status", "completed")
  .groupByDate("created_at", "day", { revenue: $agg.sum("amount") })
  .orderBy("created_at", "asc")
  .get();

Expression-aware sums — `$agg.sum(expr)` and `$agg.sumRaw`

$agg.sum accepts a bare column name (the everyday case), but it also accepts a typed column expression so you can sum a computed value such as price * quantity without dropping to a raw string:

import { $agg, $expr } from "@warlock.js/cascade";

const revenue = await Order.query()
  .groupByDate("created_at", "month", {
    revenue: $agg.sum($expr.mul("price", "quantity")), // SUM(price * quantity)
  })
  .get();

The bare-column form is unchanged — $agg.sum("amount") still produces the identical payload it always did — so existing call sites keep working. Passing an expression node is purely additive.

The column-expression DSL

The expression builders are grouped under a single $expr object (mirroring $agg), so the scalar arithmetic that feeds an aggregate reads as one discoverable namespace:

import { $expr } from "@warlock.js/cascade";

Builder	Meaning
`$expr.col("price")`	a column reference (driver-quoted/escaped)
`$expr.lit(1.2)`	a numeric/boolean literal
`$expr.mul(a, b, …)`	multiply (variadic)
`$expr.add(a, b, …)`	add (variadic)
`$expr.sub(left, right)`	subtract
`$expr.div(left, right)`	divide
`$expr.raw("…")`	raw SQL fragment escape hatch

The nodes nest, and a bare string anywhere a node is expected is treated as a column reference ("price" === $expr.col("price")). So $expr.mul("price", $expr.lit(1.2)) is “price × 1.2”, and $expr.mul($expr.col("price"), $expr.col("quantity")) is “price × quantity”. Each compiles portably:

Postgres — SUM(("price" * "quantity"))
MongoDB — { $sum: { $multiply: ["$price", "$quantity"] } }

Only $expr.raw ever embeds an uninterpreted string. Everything else flows column names through the driver’s identifier-quoting path, so a user-supplied column name is never string-interpolated into SQL — reach for the typed builders first.

`$agg.sumRaw` — the raw escape hatch

When the typed builders can’t express the fragment (a vendor function, a complex parenthesized formula), $agg.sumRaw(expression) wraps a raw string and sums it. It’s equivalent to $agg.sum($expr.raw(expression)):

await Order.query()
  .groupBy("category", {
    net: $agg.sumRaw("price * quantity * (1 - discount)"),
  })
  .get();

Postgres — SUM(price * quantity * (1 - discount))
MongoDB — throws. A raw SQL fragment isn’t portable to a MongoDB pipeline, so on MongoDB use the typed $agg.sum($expr.mul(...)) form (or groupByRaw) instead.

:::caution — sumRaw is a raw string

The fragment is emitted verbatim into the generated query — never build it from untrusted input. Use the typed $agg.sum(...) / $expr.col / $expr.mul builders for anything driven by user data; sumRaw is for static, developer-authored formulas only.

:::

Filtering grouped results — `having`

where filters records before grouping. having filters groups after aggregation. The field you filter on is the alias you gave the aggregate:

const bigCategories = await Order.query()
  .groupBy("category", { total: $agg.sum("amount") })
  .having("total", ">", 1000)
  .get();

having("total", ">", 1000) keeps only groups whose summed amount exceeds 1000. "total" is the alias from the aggregates object.

:::note — Postgres aliases in HAVING just work

SQL doesn’t allow a SELECT alias in a raw HAVING clause — HAVING "total" > 1000 would fail with “column total does not exist.” Cascade rewrites .having("total", ...) into the underlying expression (HAVING SUM("amount") > $1) automatically, so you write the alias and it works on both drivers. You don’t need to know this — it’s called out only so the SQL in your logs doesn’t surprise you.

:::

`having` shapes

Same shape vocabulary as .where():

.having("total", 1000)            // equality
.having("total", ">", 1000)       // operator

A having on a grouped column (not an aggregate alias) works too and is left as a plain column filter:

await Order.query()
  .groupBy("category", { total: $agg.sum("amount") })
  .having("category", "=", "books")
  .get();

`havingRaw`

When the having condition is itself a raw expression:

await Order.query()
  .groupBy("category", { total: $agg.sum("amount") })
  .havingRaw("SUM(amount) > ?", [1000])
  .get();

The ? placeholders bind positionally — same convention as whereRaw. Use havingRaw only when the structured having can’t express the condition.

A complete report

The whole family in one chain — “top 10 categories by completed-order revenue, at least 5000 total, highest first”:

const topCategories = await Order.query()
  .where("status", "completed")          // filter rows BEFORE grouping
  .groupBy("category", {
    total: $agg.sum("amount"),
    count: $agg.count(),
  })
  .having("total", ">", 5000)            // filter groups AFTER aggregation
  .orderBy("total", "desc")
  .limit(10)
  .get();
// each row: { category, total, count }

One query, both drivers. The chain reads in execution order: filter rows, group, aggregate, filter groups, sort, limit.

Pitfalls

`.where()` vs `.having()` — put the filter in the right place

.where() filters records before grouping — it’s cheaper and can use indexes. .having() filters groups after aggregation — it can only reference aggregate results. Filter as early as possible:

// ✅ status filter before grouping (indexed, fewer rows to aggregate)
.where("status", "completed").groupBy("category", { total: $agg.sum("amount") })

// ✅ revenue threshold after grouping (it's an aggregate — nowhere else it can go)
.having("total", ">", 5000)

Putting a row-level filter in having still works but makes the database aggregate rows it’s about to discard. Putting an aggregate condition in where is impossible — the aggregate doesn’t exist yet at that stage.

Selecting non-aggregate columns in a group

SQL requires every non-aggregate selected column to appear in groupBy. MongoDB is more permissive but leaves non-grouped columns undefined. Cascade’s groupBy(fields, aggregates) shape sidesteps the whole class of error by being explicit: the first argument is what’s grouped, the second is what’s aggregated. Don’t reach for .select() alongside it expecting arbitrary columns through.

Empty results return `0`, not `null`

Verified on both drivers: count, sum, avg, min, max, and countDistinct all return 0 when no rows match — not null, not undefined.

const revenue = await Order.where("status", "nonexistent").sum("amount");
// → 0  (not null)

The footgun isn’t a stray null to guard — it’s the opposite. min() / max() / avg() returning 0 for an empty set is indistinguishable from a legitimate 0. If 0 is a meaningful value in your data and “no rows” needs different handling, gate on the count first:

const matched = await Order.where(filter).count();
const lowest = matched > 0 ? await Order.where(filter).min("amount") : null;

Going further

Window functions (running totals, ranks, partition aggregates), selectCase/selectWindow: Expressions guide
Cross-table aggregates via joins: Joins guide
Time-bucket reports (revenue per day/month) and the full reporting pattern: Reporting recipe
Every aggregate method signature: Query Builder API reference

Aggregates

Simple aggregates — no grouping

countDistinct and distinct

Grouping with groupBy

The basic case

Multiple group fields

With aggregate operations — the useful form

Driver-specific aggregates — the raw escape hatch

groupByRaw — non-column grouping

Date-bucketed grouping — groupByDate

Expression-aware sums — $agg.sum(expr) and $agg.sumRaw