What Datafly Signal does
First-party data collection, server-side delivery, governance, and identity resolution — the complete feature set, one page.
4.2KB. That's it.
The smallest collector in the industry. While Google Tag Manager loads 80KB+ before a single vendor tag fires, Datafly.js delivers reliable event collection in just 4.2KB gzipped.
Built on modern browser APIs for guaranteed delivery, even when users navigate away. Custom filename and global variable support lets you bypass ad blockers that target known collector scripts. Per-source custom builds include only the vendor modules your site actually needs.
- 4.2KB gzipped — smaller than most favicon files
- sendBeacon transport for reliable delivery on page unload
- Custom script filename and global variable to avoid ad blockers
- Per-source custom builds with only the modules you need
- UMD and ESM bundles for any deployment scenario
<script src="https://data.yoursite.com/d.js"></script>
<script>
_df.page();
_df.track('Product Viewed', {
product_id: 'SKU-123',
price: 49.99
});
_df.identify('user-456', {
email: '[email protected]',
plan: 'pro'
});
</script>Your subdomain. Your cookies. Your data.
Every piece of data flows through infrastructure you own and control. No third-party domains, no shared endpoints, no data you cannot inspect.
DNS A Record
Not a CNAME. A true DNS A record pointing to your deployment. Browsers treat this as a genuine first-party connection because it is one. No CNAME cloaking that Safari can detect and restrict.
Server-Set Cookies
400-day expiry cookies set from the server, not JavaScript. Fully exempt from Safari's ITP restrictions that cap client-side cookies at 7 days. Your user identification persists as long as the cookie standard allows.
IndexedDB Storage
Vendor IDs stored in IndexedDB for resilient client-side storage that survives cookie clearing. A reliable fallback that maintains vendor attribution data across sessions.
Your Infrastructure
All data flows through infrastructure you control. No shared multi-tenant endpoints. Every byte is inspectable, auditable, and subject to your security policies.
Organisation-wide data governance
A built-in governance layer runs on every event before any vendor-specific processing begins. Enforce data quality, privacy, and compliance at the platform level rather than per-integration.
Schema Validation
Validate every event against your defined schema before it enters the pipeline. Malformed events are caught immediately, not discovered downstream.
PII Handling
Four PII modes per field: hash (SHA-256), mask, strip, or pseudonymise. Applied at the org level before vendor pipelines see the data.
Consent Enforcement
Double-checked consent: validated at collection time and again at delivery time. If consent status changes between collection and delivery, delivery is blocked.
Field Standardisation
Normalise field names, data types, and values across all sources. Whether data comes from web, mobile, or server, it conforms to your schema.
Data Cleansing
Strip empty fields, trim whitespace, remove invalid characters, and enforce data type constraints. Clean data reaches every vendor without per-integration logic.
Audit Trail
Every transformation, every field change, every PII action is logged with before/after diffs. Complete traceability for compliance and debugging.
Event Filtering
Global and per-event delivery conditions. Don't send to Braze if email is empty, only send if marketing consent is granted. AND/OR logic groups with visual filter builder.
Guaranteed delivery. Zero data loss.
Every vendor gets exactly the payload format it expects — not a generic JSON blob, but a correctly structured, authenticated, validated request. Datafly Signal handles millions of events per day with guaranteed at-least-once delivery to every destination simultaneously.
- Up to 5 retries with exponential backoff over 24 hours — every event delivered
- Intelligent rate limiting per vendor — no dropped events from API throttling
- Failed events enter a review queue — inspectable and replayable, never lost
- Simultaneous delivery to all destinations — one event, every platform, under 50ms
- Per-event success/failure visibility in real-time — diagnose issues in seconds
Delivery Status
LiveRetry Policy
30+ vendor IDs. Every event. Automatically.
Every event is automatically enriched with every vendor identifier associated with that user — GA4 client ID, Meta pixel ID, TikTok browser ID, Trade Desk UID, click IDs, and more. All 30+ IDs are stored persistently and re-injected on every event without any configuration.
Why complete IDs matter
Vendors like Meta and Google build match tables from the signals you send. Each event that arrives with a complete set of identifiers — pixel ID, click ID, hashed email, external user ID — has a dramatically higher chance of being matched to a known user in their system. Incomplete signals mean unmatched conversions.
Persistent across browser resets
Vendor IDs are stored server-side and survive cookie clears, browser updates, and Safari's ITP restrictions. When a returning user is recognised by your first-party identifier, their full vendor ID profile is restored and sent with every event — even if their browser has been cleared.
Vendor IDs enriched on every event
Walled gardens
Ad platforms
Programmatic
Server-side
+ more vendor IDs enriched automatically
Device Recognition
Re-identify returning visitors even after they clear cookies. Server-side device recognition uses SHA-256 hashed HTTP signals — no fingerprinting scripts, no raw signals stored. Configurable confidence thresholds and TTL. Opt-in per organisation.
OIDC Cross-Domain Flow
Industry-standard authorization code flow with PKCE unifies identity across any TLD. Server-to-server backchannel exchange means the anonymous ID never appears in the URL. 50–150ms invisible redirect. Works in every browser.
Link Decoration
Automatic URL decoration when users click between your domains. AES-256-GCM encrypted tokens with 60-second TTL. Immune to Safari 17+ bounce tracking protection because the user initiates the navigation directly. Zero redirect latency.
Per-source custom builds
Each source gets a custom collector tailored to its vendor integrations. No unused code. No bloat. Three delivery methods to fit your deployment model.
Early AccessDownload
Download the built script and self-host on your own infrastructure. Full control over caching, CDN, and deployment timing.
CDN Origin
Serve from the cluster with your CDN as an origin cache. Automatic cache purge on rebuild. Zero-downtime updates.
Direct
Serve directly from your Datafly cluster. Simplest setup — the ingestion gateway serves the script alongside collecting events.
See every event in real-time
WebSocket-powered live event stream. Watch events flow from collection through processing to delivery. No more guessing what happened to an event between your website and the vendor API.
Inspect every stage of the pipeline: the raw input from the browser, the transformed output after governance, the vendor-specific request payload, and the vendor's response. Debug issues in seconds, not hours.
- Live WebSocket stream with sub-second latency
- Input, Transformed, Request, and Response inspection per event
- Filter by event type, source, integration, or user ID
- Compare before/after data at each processing stage
Full control with granular permissions
Five purpose-built RBAC roles give every team member exactly the access they need. Full audit logging with before/after diffs means you always know who changed what and when. Enterprise SSO via SAML and OIDC.
Role-Based Access Control
5
RBAC Roles
100%
Audit Coverage
SSO
SAML / OIDC
Enterprise Encryption
AES-256-GCM encryption at rest with cloud KMS envelope encryption (GCP, AWS, Azure, Vault).
Zero-downtime key rotation, mandatory encryption mode, and crypto health checks on every service.
Meets PCI-DSS, SOC 2, ISO 27001, and FCA/PRA key management requirements.
Brand workspaces and data monetisation
Register brands, control data access with field-level privacy, and let brands self-serve their own integrations. Turn your first-party data into a revenue stream.
Early AccessBrand Registry
Register and manage brand partners from a central dashboard. Each brand gets its own workspace with isolated configuration, event streams, and access controls.
Field-Level Privacy
Hash, mask, remove, or generalise PII on a per-brand basis. Each brand sees only the data you approve — full control over what fields are shared and in what form.
Brand Self-Service
Brands log in to their own workspace and manage their integrations, field mappings, and delivery settings. Reduce your operational burden while giving partners full autonomy.
Usage & Billing
Track events delivered per brand with full breakdowns by event type and integration. Export usage reports for billing reconciliation and revenue tracking.
See every feature in action
Book a technical walkthrough with our engineering team. We will show you exactly how Datafly Signal handles your specific vendor integrations.