Capture. Command. Connect.
From first-party data collection to server-side delivery, with governance and identity at every step.
4.2KB. That's it.
The smallest collector in the industry. While Google Tag Manager loads 80KB+ before a single vendor tag fires, Datafly.js delivers reliable event collection in just 4.2KB gzipped.
Built on modern browser APIs for guaranteed delivery, even when users navigate away. Custom filename and global variable support lets you bypass ad blockers that target known collector scripts. Per-source custom builds include only the vendor modules your site actually needs.
- 4.2KB gzipped — smaller than most favicon files
- sendBeacon transport for reliable delivery on page unload
- Custom script filename and global variable to avoid ad blockers
- Per-source custom builds with only the modules you need
- UMD and ESM bundles for any deployment scenario
<script src="https://data.yoursite.com/d.js"></script>
<script>
_df.page();
_df.track('Product Viewed', {
product_id: 'SKU-123',
price: 49.99
});
_df.identify('user-456', {
email: 'user@example.com',
plan: 'pro'
});
</script>Your subdomain. Your cookies. Your data.
Every piece of data flows through infrastructure you own and control. No third-party domains, no shared endpoints, no data you cannot inspect.
DNS A Record
Not a CNAME. A true DNS A record pointing to your deployment. Browsers treat this as a genuine first-party connection because it is one. No CNAME cloaking that Safari can detect and restrict.
Server-Set Cookies
400-day expiry cookies set from the server, not JavaScript. Fully exempt from Safari's ITP restrictions that cap client-side cookies at 7 days. Your user identification persists as long as the cookie standard allows.
IndexedDB Storage
Vendor IDs stored in IndexedDB for resilient client-side storage that survives cookie clearing. A reliable fallback that maintains vendor attribution data across sessions.
Your Infrastructure
All data flows through infrastructure you control. No shared multi-tenant endpoints. Every byte is inspectable, auditable, and subject to your security policies.
Organisation-wide data governance
A built-in governance layer runs on every event before any vendor-specific processing begins. Enforce data quality, privacy, and compliance at the platform level rather than per-integration.
Schema Validation
Validate every event against your defined schema before it enters the pipeline. Malformed events are caught immediately, not discovered downstream.
PII Handling
Four PII modes per field: hash (SHA-256), mask, strip, or pseudonymise. Applied at the org level before vendor pipelines see the data.
Consent Enforcement
Double-checked consent: validated at collection time and again at delivery time. If consent status changes between collection and delivery, delivery is blocked.
Field Standardisation
Normalise field names, data types, and values across all sources. Whether data comes from web, mobile, or server, it conforms to your schema.
Data Cleansing
Strip empty fields, trim whitespace, remove invalid characters, and enforce data type constraints. Clean data reaches every vendor without per-integration logic.
Audit Trail
Every transformation, every field change, every PII action is logged with before/after diffs. Complete traceability for compliance and debugging.
Guaranteed delivery. Zero data loss.
Every vendor gets exactly the payload format it expects — not a generic JSON blob, but a correctly structured, authenticated, validated request. Datafly Signal handles millions of events per day with guaranteed at-least-once delivery to every destination simultaneously.
- Up to 5 retries with exponential backoff over 24 hours — every event delivered
- Intelligent rate limiting per vendor — no dropped events from API throttling
- Failed events enter a review queue — inspectable and replayable, never lost
- Simultaneous delivery to all destinations — one event, every platform, under 50ms
- Per-event success/failure visibility in real-time — diagnose issues in seconds
Delivery Status
LiveRetry Policy
30+ vendor IDs. Every event. Automatically.
Every event is automatically enriched with every vendor identifier associated with that user — GA4 client ID, Meta pixel ID, TikTok browser ID, Trade Desk UID, click IDs, and more. All 30+ IDs are stored persistently and re-injected on every event without any configuration.
Why complete IDs matter
Vendors like Meta and Google build match tables from the signals you send. Each event that arrives with a complete set of identifiers — pixel ID, click ID, hashed email, external user ID — has a dramatically higher chance of being matched to a known user in their system. Incomplete signals mean unmatched conversions.
Persistent across browser resets
Vendor IDs are stored server-side and survive cookie clears, browser updates, and Safari's ITP restrictions. When a returning user is recognised by your first-party identifier, their full vendor ID profile is restored and sent with every event — even if their browser has been cleared.
Vendor IDs enriched on every event
Walled gardens
Ad platforms
Programmatic
Server-side
+ more vendor IDs enriched automatically
Per-source custom builds
Each source gets a custom collector tailored to its vendor integrations. No unused code. No bloat. Three delivery methods to fit your deployment model.
Coming SoonDownload
Download the built script and self-host on your own infrastructure. Full control over caching, CDN, and deployment timing.
CDN Origin
Serve from the cluster with your CDN as an origin cache. Automatic cache purge on rebuild. Zero-downtime updates.
Direct
Serve directly from your Datafly cluster. Simplest setup — the ingestion gateway serves the script alongside collecting events.
See every event in real-time
WebSocket-powered live event stream. Watch events flow from collection through processing to delivery. No more guessing what happened to an event between your website and the vendor API.
Inspect every stage of the pipeline: the raw input from the browser, the transformed output after governance, the vendor-specific request payload, and the vendor's response. Debug issues in seconds, not hours.
- Live WebSocket stream with sub-second latency
- Input, Transformed, Request, and Response inspection per event
- Filter by event type, source, integration, or user ID
- Compare before/after data at each processing stage
Full control with granular permissions
Five purpose-built RBAC roles give every team member exactly the access they need. Full audit logging with before/after diffs means you always know who changed what and when. Enterprise SSO via SAML and OIDC.
Role-Based Access Control
5
RBAC Roles
100%
Audit Coverage
SSO
SAML / OIDC
See every feature in action
Book a technical walkthrough with our engineering team. We will show you exactly how Datafly Signal handles your specific vendor integrations.