Production-ready models, APIs, SDKs and the deployment artefacts behind them.

Six-to-sixteen-week engagements that take a feasible prototype to a production-grade computer-vision system: trained model, held-out evaluation, inference service, hand-off documentation.

Dynamis Labs — Production is the production-engineering pillar of Dynamis Group. Six-to-sixteen-week engagements deliver four artefacts: a trained model and weights; an evaluation harness (golden datasets, regression suites, drift detectors); an inference service (HTTP / gRPC, built on FastAPI or Tonic) with Python / TypeScript / Go SDKs; and deployment artefacts for AWS GPU, GCP, Cloudflare Workers AI, NVIDIA Jetson, Google Coral, Apple Silicon or browser WASM.

Workstreams

What ships when prototyping clears its gate.

Four artefacts, delivered together. The model is one of them — the harness, the service and the deployment story are the other three. Without those three, the model is a demo.

Trained model & weights

A task-specific dataset, a model architecture and training regime chosen to fit the data and the constraints (parameter-efficient fine-tuning where it reaches the required quality; custom architectures where the data justifies them), and a production-grade trained model. Versioned, reproducible from the training configuration, and evaluated on a held-out, domain-relevant benchmark.
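
A minimal sketch of what "versioned, reproducible from the training configuration" can look like in practice: a frozen config whose content hash becomes the version tag the weights are stored under. Every field name and value here is illustrative, not a Dynamis convention.

```python
import hashlib
import json
from dataclasses import asdict, dataclass

@dataclass(frozen=True)
class TrainingConfig:
    base_model: str = "vit-base"           # hypothetical backbone identifier
    dataset_revision: str = "2024-06-01"   # pinned dataset snapshot
    method: str = "lora"                   # parameter-efficient fine-tuning
    lora_rank: int = 16
    learning_rate: float = 3e-4
    epochs: int = 20
    seed: int = 42                         # fixed seed so reruns match

    def version(self) -> str:
        """Deterministic version tag derived from the config contents."""
        blob = json.dumps(asdict(self), sort_keys=True).encode()
        return hashlib.sha256(blob).hexdigest()[:12]

print(f"model-{TrainingConfig().version()}")  # weights stored under this tag
```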

Evaluation harness

Golden datasets, regression suites and the evaluation methodology written to a peer-reviewable standard. The artefact your team uses to keep retrained versions honest — and the document an auditor reads to understand what "the model works" means.
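A minimal sketch of the regression-suite idea, assuming a predict callable and an accuracy floor frozen at release time; the names and the 0.92 floor are illustrative, not client-specific.

```python
def regression_check(predict, golden: dict[str, str], floor: float) -> float:
    """Fail loudly when a retrained model dips below the frozen floor."""
    hits = sum(predict(path) == label for path, label in golden.items())
    accuracy = hits / len(golden)
    assert accuracy >= floor, f"accuracy {accuracy:.3f} below floor {floor:.3f}"
    return accuracy

# Toy usage: a stand-in predictor against a two-item golden set.
golden = {"img_001.png": "defect", "img_002.png": "ok"}
regression_check(lambda path: "defect" if "001" in path else "ok", golden, floor=0.92)
```

Run on every model update, a check like this is what keeps retrained versions honest.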

APIs & SDKs

A production inference service with a stable HTTP / gRPC interface, language SDKs for the stacks your team actually runs (Python, TypeScript, Go), and the operational telemetry to know when something is degrading before a user notices.
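A sketch of the shape such a service takes, using FastAPI, which the engagement summary above names; the route, payload fields and run_model stub are assumptions, not the delivered interface.

```python
from fastapi import FastAPI
from pydantic import BaseModel

app = FastAPI(title="inference-service")

class PredictRequest(BaseModel):
    image_url: str

class PredictResponse(BaseModel):
    label: str
    confidence: float
    model_version: str

def run_model(image_url: str) -> tuple[str, float]:
    return "ok", 0.99  # stand-in for the real forward pass

@app.post("/predict", response_model=PredictResponse)
def predict(req: PredictRequest) -> PredictResponse:
    label, confidence = run_model(req.image_url)
    return PredictResponse(label=label, confidence=confidence,
                           model_version="model-3f9c2a1b04de")

@app.get("/healthz")
def healthz() -> dict:
    return {"status": "ok"}  # liveness probe for the deployment layer
```

Served with `uvicorn service:app`; the language SDKs wrap exactly this surface.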

Edge + cloud deployment

Deployment artefacts for the topology that matches the workload: hyperscaler GPU inference, on-premises CUDA, or edge inference (Jetson, Coral, Apple Silicon, browser WASM). Pick the topology; we ship the deployment.
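One concrete example of an edge artefact, hedged as a sketch: exporting a toy PyTorch model to ONNX, the interchange format that ONNX Runtime consumes on Jetson, Apple Silicon and in the browser WASM build. The network and filenames are illustrative.

```python
import torch
import torch.nn as nn

# Toy stand-in for the trained network.
model = nn.Sequential(nn.Conv2d(3, 8, 3), nn.ReLU(),
                      nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(8, 2))
model.eval()

dummy = torch.randn(1, 3, 224, 224)  # example input fixes the graph shapes

torch.onnx.export(
    model, dummy, "model.onnx",
    input_names=["image"], output_names=["logits"],
    dynamic_axes={"image": {0: "batch"}},  # keep batch size variable
)
```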

The architecture decisions behind a production deployment — pipelines, deployment topology, cloud-vs-edge trade-offs — live with Dynamis Advisory — Architecture. Commercial terms (lease vs own outright) are at Licensing.

Common questions

FAQs

Here are some of our most frequently asked questions. Can't find what you're looking for? Reach out to our support team.

How long is a Production engagement?
Six to sixteen weeks for the first production release, depending on data complexity and integration scope. Variables that drive duration: dataset readiness from Prototyping, evaluation methodology depth, integration into existing systems (CRMs, ERPs, content pipelines), and deployment topology (cloud GPU is faster to ship than edge inference). Weekly checkpoints with named deliverables.
What does the evaluation harness include?
Golden dataset with held-out splits, regression suite that runs on every model update, drift detectors (input, output, performance), calibration checks, and a benchmark comparison against either prior internal models or domain-relevant public baselines. The harness is delivered as code the client team can re-run.
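To make the drift detectors concrete, a sketch of an input-drift check using a two-sample Kolmogorov–Smirnov test on one scalar image feature; the feature, sample sizes and alpha are illustrative.

```python
import numpy as np
from scipy.stats import ks_2samp

def input_drift(reference: np.ndarray, live: np.ndarray, alpha: float = 0.01) -> bool:
    """True when live inputs no longer look like the training distribution."""
    _, p_value = ks_2samp(reference, live)
    return p_value < alpha

# Toy data: mean image brightness at training time vs. in production.
reference = np.random.default_rng(0).normal(0.50, 0.10, 5000)
live = np.random.default_rng(1).normal(0.62, 0.10, 500)   # systematically brighter
print(input_drift(reference, live))  # True -> flag for investigation
```
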
Where do the architectural decisions come from?
From the Prototyping engagement that preceded Production, plus Dynamis Advisory — Architecture for the upstream data and deployment-topology decisions. Production is where decisions become a built and operated system; the decisions themselves live with Advisory. A single solution architect coordinates both, so Production never starts work the brief does not support.
What inference hardware do you target?
For cloud: AWS GPU (g5, g6 families), GCP A100 / H100, Cloudflare Workers AI for smaller models. For edge: NVIDIA Jetson (Orin, Nano), Google Coral TPU, Apple Silicon (M-series Mac mini for low-cost edge inference), and browser WASM (ONNX Runtime Web) for client-side perception. Hardware choice is workload-driven, not vendor-driven.
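The "workload-driven, not vendor-driven" point in code, as a sketch: the same exported ONNX model, with the execution provider chosen from what the host actually offers. The provider names are real ONNX Runtime identifiers; the preference order is an assumption.

```python
import onnxruntime as ort

# Preference order: NVIDIA GPU (cloud, on-premises, Jetson builds), then
# Apple Silicon, then the universal CPU fallback.
preferred = ["CUDAExecutionProvider", "CoreMLExecutionProvider", "CPUExecutionProvider"]
providers = [p for p in preferred if p in ort.get_available_providers()]

session = ort.InferenceSession("model.onnx", providers=providers)
print(session.get_providers())  # what this host ended up running on
```
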
What APIs and SDKs do you ship?
A production inference service exposed via HTTP / gRPC (FastAPI, Tonic, or whatever fits the client stack), language SDKs in Python, TypeScript and Go for the stacks teams actually run, and operational telemetry (Prometheus metrics, OpenTelemetry traces, request-level audit logs). Generated client libraries via OpenAPI / Protocol Buffers where applicable.
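A sketch of the telemetry minimum named above, using prometheus_client; the metric names and port are illustrative.

```python
import time
from prometheus_client import Counter, Histogram, start_http_server

REQUESTS = Counter("inference_requests_total", "Inference requests", ["outcome"])
LATENCY = Histogram("inference_latency_seconds", "End-to-end inference latency")

def predict_with_telemetry(predict, payload):
    """Wrap any predict callable with the counters Prometheus scrapes."""
    start = time.perf_counter()
    try:
        result = predict(payload)
        REQUESTS.labels(outcome="ok").inc()
        return result
    except Exception:
        REQUESTS.labels(outcome="error").inc()
        raise
    finally:
        LATENCY.observe(time.perf_counter() - start)

start_http_server(9100)  # exposes /metrics for the Prometheus scraper
print(predict_with_telemetry(lambda x: "ok", {"image_url": "..."}))
```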

Start a conversation

One architect, one inbox.

Bring us the situation. We’ll pair you with a solution architect and write back — no hand-offs across divisions, no sales cadence.

Get in touch