Ingestion APIs
Overview
Moose Ingestion APIs are the entry point for getting data into your Moose application. They provide a fast, reliable, and type-safe way to move data from your sources into streams and tables for analytics and processing.
When to Use Ingestion APIs
Ingestion APIs are most useful when you want to implement a push-based pattern for getting data from your data sources into your streams and tables. Common use cases include:
- Instrumenting external client applications
- Receiving webhooks from third-party services
- Integrating with ETL or data pipeline tools that push data
Why Use Moose's APIs Over Your Own?
Moose's ingestion APIs are purpose-built for high-throughput data pipelines, offering key advantages over general-purpose web frameworks:
- Built-in schema validation: Ensures only valid data enters your pipeline.
- Direct connection to streams/tables: Instantly link HTTP endpoints to Moose data infrastructure to route incoming data to your streams and tables without any glue code.
- Dead Letter Queue (DLQ) support: Invalid records are automatically captured for review and recovery.
- OpenAPI auto-generation: Instantly generate client SDKs and docs for all endpoints, including example data.
- Rust-powered performance: Far higher throughput and lower latency than typical Node.js or Python APIs.
Validation
Moose validates all incoming data against your interface (TypeScript) or Pydantic model (Python). If a record fails validation, Moose can automatically route it to a Dead Letter Queue (DLQ) for later inspection and recovery.
```ts
import { IngestApi, Stream, DeadLetterQueue } from "@514labs/moose-lib";

interface ExampleModel {
  id: string;
  userId: string;
  timestamp: Date;
  properties?: {
    device?: string;
    version?: number;
  };
}

export const api = new IngestApi<ExampleModel>("your-api-route", {
  destination: new Stream<ExampleModel>("your-stream-name"),
  deadLetterQueue: new DeadLetterQueue<ExampleModel>("your-dlq-name"),
});
```
Optional fields with ClickHouse defaults in IngestPipeline
If your IngestPipeline's schema marks a field as optional but annotates a ClickHouse default, Moose treats the field differently at each layer:
- API request and Stream message: field is optional (you may omit it)
- ClickHouse table storage: field is required with a DEFAULT clause
Behavior: When the API/stream inserts into ClickHouse and the field is missing, ClickHouse sets it to the configured default value. This keeps request payloads simple while avoiding Nullable columns in storage.
Example: `field?: number & ClickHouseDefault<"18">` or, equivalently, `field?: WithDefault<number, "18">`
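For instance, a minimal sketch of this pattern; the model, field, and pipeline names here are illustrative, and the request path assumes Moose's conventional `/ingest` prefix on the local dev server:

```ts
import { IngestPipeline, Key, ClickHouseDefault } from "@514labs/moose-lib";

// Illustrative model: "age" may be omitted from ingest payloads,
// but the ClickHouse column is non-Nullable with DEFAULT 18
interface SignupEvent {
  id: Key<string>;
  email: string;
  age?: number & ClickHouseDefault<"18">;
}

const signups = new IngestPipeline<SignupEvent>("signups", {
  ingestApi: true,
  stream: true,
  table: true,
});

// POST /ingest/signups with {"id": "1", "email": "a@b.co"} succeeds;
// the stored row gets age = 18 from the column default.
```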
Accepting Arbitrary Fields
In some scenarios, you may need to accept payloads with arbitrary additional fields beyond your defined schema. This is useful when:
- You don't control the payload structure from an upstream service
- You want to gracefully accept extra fields without validation errors
Using Index Signatures (TypeScript)
TypeScript's index signatures allow you to define types that accept additional properties. When used with IngestApi or Stream, the API will accept payloads with extra fields without returning validation errors:
```ts
import { IngestApi, Stream, Key, DateTime } from "@514labs/moose-lib";

// Input type with known fields + index signature for flexibility
type UserEventInput = {
  timestamp: DateTime;
  eventName: string;
  userId: Key<string>;
  orgId?: string;
  // Index signature: accept any additional properties
  [key: string]: any;
};

const inputStream = new Stream<UserEventInput>("UserEventInput");

// IngestApi accepts payloads with extra fields without validation errors
const ingestApi = new IngestApi<UserEventInput>("user-events", {
  destination: inputStream,
});
```
How it works:
- Known fields (timestamp, eventName, etc.) are validated against their declared types
- Additional fields matching the index signature are accepted by the API (no validation error returned)
- All fields (known and extra) are passed through to streaming functions for processing
- Extra fields can be extracted in your streaming function and stored in a JSON column (see the example request below)
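For example, a client request with undeclared fields passes validation as long as the known fields are well-typed. A hedged sketch, assuming Moose's local dev server defaults (port 4000, routes under /ingest):

```ts
// Hypothetical client call; fields beyond UserEventInput are accepted
const response = await fetch("http://localhost:4000/ingest/user-events", {
  method: "POST",
  headers: { "Content-Type": "application/json" },
  body: JSON.stringify({
    timestamp: new Date().toISOString(),
    eventName: "page_view",
    userId: "user-123",
    // Not declared on UserEventInput -- matched by the index signature
    browser: "firefox",
    screenWidth: 1440,
  }),
});
console.log(response.status); // 200 when the known fields validate
```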
Extracting Extra Fields
In your streaming function, use destructuring to separate known fields from extra fields:
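The transform below assumes an output stream whose model collects the extra fields into a properties column. A sketch of that output; the UserEvent name and the Record<string, any> typing are illustrative:

```ts
import { Stream, Key, DateTime } from "@514labs/moose-lib";

// Hypothetical output model: known fields plus a catch-all properties object
type UserEvent = {
  timestamp: DateTime;
  eventName: string;
  userId: Key<string>;
  properties: Record<string, any>;
};

const outputStream = new Stream<UserEvent>("UserEvent");
```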
```ts
inputStream.addTransform(outputStream, (input) => {
  // Destructure the known fields; everything else lands in extraFields
  const { timestamp, eventName, userId, ...extraFields } = input;
  return {
    timestamp,
    eventName,
    userId,
    properties: extraFields,
  };
});
```
Creating Ingestion APIs
You can create ingestion APIs in two ways:
- High-level: Using the IngestPipeline class (recommended for most use cases)
- Low-level: Manually configuring the IngestApi component for more granular control
High-level: IngestPipeline (Recommended)
The IngestPipeline class provides a convenient way to set up ingestion endpoints, streams, and tables with a single declaration:
```ts
import { IngestPipeline } from "@514labs/moose-lib";

interface ExampleModel {
  id: string;
  name: string;
  value: number;
  timestamp: Date;
}

const examplePipeline = new IngestPipeline<ExampleModel>("example-name", {
  ingestApi: true, // Creates a REST API endpoint
  stream: true,    // Connects to a stream
  table: true,     // Creates a ClickHouse table
});
```
Low-level: Standalone IngestApi
For more granular control, you can manually configure the IngestApi component:
```ts
import { IngestApi, OlapTable, Stream } from "@514labs/moose-lib";

interface ExampleRecord {
  id: string;
  name: string;
  value: number;
  timestamp: Date;
}

// Create the ClickHouse table
const exampleTable = new OlapTable<ExampleRecord>("example-table-name");

// Create the stream with specific settings
const exampleStream = new Stream<ExampleRecord>("example-stream-name", {
  destination: exampleTable, // Connect stream to table
});

// Create the ingestion API
const exampleApi = new IngestApi<ExampleRecord>("example-api-route", {
  destination: exampleStream, // Connect API to stream
});
```
Warning:
The types of the destination Stream and Table must match the type of the IngestApi.
| Use an Ingestion Pipeline when... | Use a standalone IngestApi when... |
| --- | --- |
| You want to create a new ingestion endpoint, stream, and table | You have an existing Stream object that you want to connect to |
| You want to simplify configuration and reduce boilerplate | You want to manually configure the ingestion API |
Configuration Reference
Configuration options for both high-level and low-level ingestion APIs are provided below.