Transparency

InferLane is free at the point of use for most workloads. This page tells you exactly how that works, so you can verify that our incentives line up with yours.

Test mode — synthetic data

The marketplace currently runs in test mode. Any participant counts shown on InferLane's transparency surfaces (operators, arbiters, tasks) reflect synthetic test actors created during integration testing — not real users or real customer volume.

How we make money

We have four independent revenue legs. None of them requires charging you per request.

1. Routing markup

When you route traffic through our hosted endpoint using your own provider API key, we add a small percentage markup on the provider's cost (typically 5–10%). The markup funds the router, moderation gate, and fuel gauge you're using.

2. Provider rebates (planned)

As we route enough volume to qualify, we intend to negotiate privately-negotiated rates with model providers: we would quote near rack rate and be invoiced at the partnership rate, and the delta would be revenue. No rebate arrangements are active yet — see the rebate table below, which will populate as we sign partnership rate agreements.

3. Capacity commitments

Enterprise customers can pre-purchase a block of inference capacity for a specified period at a fixed per-token rate. This is a commercial volume commitment — not a financial instrument, not tradeable, not transferable to other parties. The difference between their committed rate and the spot rate at fulfilment is our margin (positive or negative).

4. Premium surfaces (Pro, Enterprise)

Advanced tooling (team budgets, Slack alerts, SSO, audit logs, dedicated capacity) is sold as a subscription. Routing itself is never behind a paywall.

5. Peer-network platform share

When consumers pay for inference served by peer operators on our network, operators are credited 90% of the service value in kT credits and we retain 10% as the platform share. kT credits are redeemable for inference on the network — they do not convert to cash. The Service operates in a credits-only mode. See the "What InferLane is not" box below.

What InferLane is not

Not a bank, money transmitter, or money services business
Not a broker-dealer, futures commission merchant, or securities exchange
Not a deposit-taking institution
Not a custodian of customer funds — Stripe (or equivalent licensed processor) holds funds; we record the service-credit balance
Not an issuer of securities, tokens, or cryptocurrency
Not a provider of investment, legal, tax, or financial advice

kT credits are service units redeemable for inference on the network. They are not a financial product. They have no investment character and no claim on InferLane revenue or assets. Credits do not convert to cash. The Service operates in a credits-only mode. If a cash pathway is introduced in the future, operators will need to separately opt in under new terms; existing credit balances will not be converted.

The anti-conflict guarantee

We explicitly cap the influence of rebate arrangements on routing decisions at 5% of the composite score. A provider with a bigger rebate can never beat a provider with better quality, lower cost, or lower latency. The rebate is only a tiebreaker when two candidates are within 0.5% of each other on the composite score.

This is enforced in code in our routing layer (src/lib/proxy/router-commercial.ts). The source is not public today; we'll link it here if and when the repository is opened.

Where your wallet balance lives

Prepaid balances are held by our licensed payment processor (Stripe) under their regulated payment-services arrangements. InferLane does not hold customer funds directly. We do not operate as a bank, money transmitter, money services business, securities broker, exchange, or qualified custodian. We record a service-credit balance that corresponds to the processor-held prepayment; we do not take custody of your money.

The double-entry ledger that tracks your balance is auditable and reconciled periodically (the automated nightly reconciliation job is built but not yet scheduled while we are pre-launch). Any discrepancy freezes the money layer until it's resolved.

Provider rebate arrangements

We publish the providers we have disclosed rebate arrangements with. Specific percentages are negotiated privately and fall into ranges disclosed here.

No disclosed rebate arrangements are active yet — we're pre-launch. This list will populate as we sign partnership rate agreements and as customers reach volumes that qualify for their own disclosed discounts.

Privacy tiers — what we can and can't guarantee

Different workloads have different privacy requirements. We route accordingly and are upfront about what each tier actually provides.

Cloud TEE — verifiable confidentiality

Workloads route to providers with hardware-backed Trusted Execution Environments (Azure Confidential Computing, AWS Nitro Enclaves). Attestation is cryptographically verified. Use this for PII, financial data, and compliance-sensitive workloads (HIPAA, SOC 2).

Cloud Standard — contractual privacy

Workloads route to major cloud providers (Anthropic, OpenAI, Google). Privacy is backed by their terms of service and data processing agreements, not hardware attestation. Suitable for business data that isn't regulated.

Best Effort — OS-level hardening only

Workloads may route to community or decentralized nodes. Privacy relies on OS-level protections (SIP, hardened runtime) — not hardware enclaves. There is no way to cryptographically verify that a consumer Mac is running untampered code today. This tier is appropriate for public data, non-sensitive classification, and image generation — not for PII or confidential business data.

Our routing engine selects the appropriate privacy tier automatically based on your configured policy. You can override per-request via the privacyTier parameter in the dispatch API, or set a default policy in your dashboard settings.

Subprocessors

The full list of third-party services InferLane uses to process customer data is at inferlane.dev/legal/subprocessors. We give 14-day notice of changes to customers on enterprise contracts.

Security

Responsible disclosure: security.txt
Contact: security@inferlane.dev
We maintain an internal ASVS L2 self-audit (commercial/security/asvs-l2.md). It is not yet published; request a copy via the security contact above.

Proxy latency overhead

Every request through InferLane adds routing overhead (auth, model selection, provider lookup, cost logging). Here are real measurements from April 2026 — a minimal Haiku request, from Sydney to us-east-1 (Vercel + Anthropic):

~750ms

Direct to Anthropic

~1.5s

Through InferLane proxy

Overhead is ~500–800ms, mostly Vercel serverless cold starts and the routing DB lookup. For a typical 5–30 second inference call, this is 2–10% added latency. The MCP tools (pick_model, session_cost) run locally with zero network overhead. We plan to move the routing decision to Vercel Edge Functions to bring overhead under 50ms.