AI model inventory
Last reviewed 2026-05-18
Current inventory of approved AI providers, model families, and AI service classes used by HiringCoachAI in production. Any material new provider, model family, or AI feature class requires review under secure development lifecycle and an update to this inventory.
Large Language Models (text)
| Model family / identifiers | Vendor | Endpoint | Data in | Data out | Retention | Training opt-out |
|---|---|---|---|---|---|---|
| OpenAI GPT-family models, including configured defaults and route-specific model aliases | OpenAI (called both directly via OpenAI API and via Vercel AI Gateway) | api.openai.com/v1/chat/completions, api.openai.com/v1/responses, and Vercel AI Gateway equivalents | Prompts (resume, job descriptions, questions, coaching and analysis inputs) | Text completions | Per-request store: false where supported. No Zero Data Retention (ZDR) amendment: OpenAI's then-current standard API retention windows apply. | OpenAI's then-current standard API terms govern the no-training default; we have not signed an enterprise no-training amendment. Per-request store: false prevents generated responses from being stored as OpenAI application state for later retrieval. |
| Perplexity Sonar-family models | Perplexity | api.perplexity.ai/chat/completions | Research queries | Search-grounded text | Standard API terms; provider default retention applies. | Per provider default no-training in standard API terms. |
Preferred model aliases are defined in the central model registry and AI Gateway configuration. Model strings flow through this registry and are reviewed during quarterly AI inventory refreshes.
Speech synthesis (TTS)
| Voice / model | Vendor | Data in | Retention | Training opt-out |
|---|---|---|---|---|
| ElevenLabs voices and speech models, including user-selectable voices | ElevenLabs | Text to speak | Standard API terms; provider default retention applies. | Per provider default no-training in standard API terms. |
en-US-Neural2-* | Google Cloud Text-to-Speech | Text to speak | Standard API terms; provider default retention applies. | Per provider default no-training in standard API terms. |
Speech recognition (STT)
| Model | Vendor | Data in | Retention | Training opt-out |
|---|---|---|---|---|
nova-2 | Deepgram | Audio clip | Per-request redact=true to redact sensitive number-like entities from transcripts, such as payment cards and Social Security numbers. Provider default audio retention otherwise applies. | Per provider default no-training in standard API terms. |
Embedding / other
HiringCoachAI does not currently run a first-party embedding, reranker, or classification model in production. Any new embedding, reranker, or classification model is added here before deploy.
Endpoint configuration
| Endpoint | Base URL | Timeout | Retry policy |
|---|---|---|---|
| Vercel AI Gateway | https://ai-gateway.vercel.sh/v1 | 60 s | Up to 3 on 5xx |
| Perplexity | https://api.perplexity.ai | 30 s | 1 |
| ElevenLabs | https://api.elevenlabs.io/v1 | 30 s | 1 |
| Deepgram | https://api.deepgram.com/v1 | 30 s | 1 |
| Google TTS | texttospeech.googleapis.com | 30 s | 1 |
Version pinning
- Preferred model aliases live in a central registry consumed by the application and the AI Gateway configuration.
- OpenAI aliases route through either direct OpenAI API calls or the Vercel AI Gateway. The provider or gateway may resolve aliases to snapshots at request time. To convert an alias into a hard pin, the relevant configuration is updated with the reviewed snapshot identifier.
- A drift detector calls each provider's model-list endpoint monthly, resolves what each alias currently points to, and exits non-zero if the underlying snapshot has rolled since the last review. On drift, the responsible engineer either accepts the new snapshot (updating the central audit record with the new value and a fresh
verifiedAtdate) or pins to the prior weights (replacing the alias with the previous snapshot string). - Deepgram speech-to-text, ElevenLabs speech, and Google text-to-speech identifiers are pinned by provider model or voice name in the central registry or feature-specific configuration.
Change log
| Date | Change | Author |
|---|---|---|
| 2026-04-24 | Initial inventory | Engineering Lead |
| 2026-04-24 | Added the central model-audit record and monthly drift-detection job; documented monthly review process | Engineering Lead |
| 2026-05-12 | Reframed inventory around approved providers/model families and documented legacy route-specific model-string migration gap | Engineering Lead |
Quarterly review checklist
During each quarterly review, we verify:
- Every AI provider and material model family in production is listed.
- Every listed provider/model family has a documented retention and training-opt-out posture.
- No new models were introduced without review.
- Automated AI inventory parity and retention-flag checks pass.
- Vendor certifications and trust-center evidence remain current; see the sub-processors.