Features

Capture. Command. Connect.

From first-party data collection to server-side delivery, with governance and identity at every step.

Capture

4.2KB. That's it.

The smallest collector in the industry. While Google Tag Manager loads 80KB+ before a single vendor tag fires, Datafly.js delivers reliable event collection in just 4.2KB gzipped.

Built on modern browser APIs for guaranteed delivery, even when users navigate away. Custom filename and global variable support lets you bypass ad blockers that target known collector scripts. Per-source custom builds include only the vendor modules your site actually needs.

  • 4.2KB gzipped — smaller than most favicon files
  • sendBeacon transport for reliable delivery on page unload
  • Custom script filename and global variable to avoid ad blockers
  • Per-source custom builds with only the modules you need
  • UMD and ESM bundles for any deployment scenario
Your websiteHTML
<script src="https://data.yoursite.com/d.js"></script>
<script>
  _df.page();

  _df.track('Product Viewed', {
    product_id: 'SKU-123',
    price: 49.99
  });

  _df.identify('user-456', {
    email: 'user@example.com',
    plan: 'pro'
  });
</script>
Capture — First-Party

Your subdomain. Your cookies. Your data.

Every piece of data flows through infrastructure you own and control. No third-party domains, no shared endpoints, no data you cannot inspect.

DNS A Record

Not a CNAME. A true DNS A record pointing to your deployment. Browsers treat this as a genuine first-party connection because it is one. No CNAME cloaking that Safari can detect and restrict.

Server-Set Cookies

400-day expiry cookies set from the server, not JavaScript. Fully exempt from Safari's ITP restrictions that cap client-side cookies at 7 days. Your user identification persists as long as the cookie standard allows.

IndexedDB Storage

Vendor IDs stored in IndexedDB for resilient client-side storage that survives cookie clearing. A reliable fallback that maintains vendor attribution data across sessions.

Your Infrastructure

All data flows through infrastructure you control. No shared multi-tenant endpoints. Every byte is inspectable, auditable, and subject to your security policies.

Command — Governance

Organisation-wide data governance

A built-in governance layer runs on every event before any vendor-specific processing begins. Enforce data quality, privacy, and compliance at the platform level rather than per-integration.

Schema Validation

Validate every event against your defined schema before it enters the pipeline. Malformed events are caught immediately, not discovered downstream.

PII Handling

Four PII modes per field: hash (SHA-256), mask, strip, or pseudonymise. Applied at the org level before vendor pipelines see the data.

Consent Enforcement

Double-checked consent: validated at collection time and again at delivery time. If consent status changes between collection and delivery, delivery is blocked.

Field Standardisation

Normalise field names, data types, and values across all sources. Whether data comes from web, mobile, or server, it conforms to your schema.

Data Cleansing

Strip empty fields, trim whitespace, remove invalid characters, and enforce data type constraints. Clean data reaches every vendor without per-integration logic.

Audit Trail

Every transformation, every field change, every PII action is logged with before/after diffs. Complete traceability for compliance and debugging.

Connect — Delivery

Guaranteed delivery. Zero data loss.

Every vendor gets exactly the payload format it expects — not a generic JSON blob, but a correctly structured, authenticated, validated request. Datafly Signal handles millions of events per day with guaranteed at-least-once delivery to every destination simultaneously.

  • Up to 5 retries with exponential backoff over 24 hours — every event delivered
  • Intelligent rate limiting per vendor — no dropped events from API throttling
  • Failed events enter a review queue — inspectable and replayable, never lost
  • Simultaneous delivery to all destinations — one event, every platform, under 50ms
  • Per-event success/failure visibility in real-time — diagnose issues in seconds

Delivery Status

Live
Google Analytics 4
45msDelivered
Meta Conversions API
120msDelivered
TikTok Events API
Retrying (2/5)
Google Ads
62msDelivered

Retry Policy

Attempt 130sAttempt 22mAttempt 315mAttempt 42hAttempt 5...Review Queue
Connect — Identity

30+ vendor IDs. Every event. Automatically.

Every event is automatically enriched with every vendor identifier associated with that user — GA4 client ID, Meta pixel ID, TikTok browser ID, Trade Desk UID, click IDs, and more. All 30+ IDs are stored persistently and re-injected on every event without any configuration.

Why complete IDs matter

Vendors like Meta and Google build match tables from the signals you send. Each event that arrives with a complete set of identifiers — pixel ID, click ID, hashed email, external user ID — has a dramatically higher chance of being matched to a known user in their system. Incomplete signals mean unmatched conversions.

Persistent across browser resets

Vendor IDs are stored server-side and survive cookie clears, browser updates, and Safari's ITP restrictions. When a returning user is recognised by your first-party identifier, their full vendor ID profile is restored and sent with every event — even if their browser has been cleared.

Vendor IDs enriched on every event

Walled gardens

GA4 Client IDMeta Pixel ID (_fbp)TikTok Browser ID (_ttp)Pinterest Unauth ID

Ad platforms

Google Click ID (gclid)Meta Click ID (fbclid)TikTok Click ID (ttclid)Microsoft Click ID

Programmatic

The Trade Desk UIDCriteo GUM IDLiveRamp RampIDIndex Exchange IDXandr ID

Server-side

UID2 TokenID5 Universal IDPrebid SharedID

+ more vendor IDs enriched automatically

Capture — Builder

Per-source custom builds

Each source gets a custom collector tailored to its vendor integrations. No unused code. No bloat. Three delivery methods to fit your deployment model.

Coming Soon

Download

Download the built script and self-host on your own infrastructure. Full control over caching, CDN, and deployment timing.

CDN Origin

Serve from the cluster with your CDN as an origin cache. Automatic cache purge on rebuild. Zero-downtime updates.

Direct

Serve directly from your Datafly cluster. Simplest setup — the ingestion gateway serves the script alongside collecting events.

Command — Debugging

See every event in real-time

WebSocket-powered live event stream. Watch events flow from collection through processing to delivery. No more guessing what happened to an event between your website and the vendor API.

Inspect every stage of the pipeline: the raw input from the browser, the transformed output after governance, the vendor-specific request payload, and the vendor's response. Debug issues in seconds, not hours.

  • Live WebSocket stream with sub-second latency
  • Input, Transformed, Request, and Response inspection per event
  • Filter by event type, source, integration, or user ID
  • Compare before/after data at each processing stage
Event DebuggerLive
14:32:01.234Inputpage/products/running-shoes
14:32:01.238TransformedpageSchema validated, PII stripped, session enriched
14:32:01.242RequestGA4POST /mp/collect — page_view event
14:32:01.287ResponseGA4204 No Content — 45ms
14:32:01.243RequestMetaPOST /v18.0/{pixel}/events — PageView
14:32:01.363ResponseMeta200 OK — 120ms
Command — Management

Full control with granular permissions

Five purpose-built RBAC roles give every team member exactly the access they need. Full audit logging with before/after diffs means you always know who changed what and when. Enterprise SSO via SAML and OIDC.

Role-Based Access Control

OrgAdminFull organisation control — users, billing, all sources
SourceAdminManage assigned sources — integrations, pipelines, settings
SourceEditorEdit transformations and integration configs
SourceViewerRead-only access to source data and configurations
DataGovernanceAdminManage data governance — schema, PII rules, consent

5

RBAC Roles

100%

Audit Coverage

SSO

SAML / OIDC

See every feature in action

Book a technical walkthrough with our engineering team. We will show you exactly how Datafly Signal handles your specific vendor integrations.