AI model inventory

Last reviewed 2026-05-18

Current inventory of approved AI providers, model families, and AI service classes used by HiringCoachAI in production. Any material new provider, model family, or AI feature class requires review under secure development lifecycle and an update to this inventory.

Large Language Models (text)

Model family / identifiers	Vendor	Endpoint	Data in	Data out	Retention	Training opt-out
OpenAI GPT-family models, including configured defaults and route-specific model aliases	OpenAI (called both directly via OpenAI API and via Vercel AI Gateway)	`api.openai.com/v1/chat/completions`, `api.openai.com/v1/responses`, and Vercel AI Gateway equivalents	Prompts (resume, job descriptions, questions, coaching and analysis inputs)	Text completions	Per-request `store: false` where supported. No Zero Data Retention (ZDR) amendment: OpenAI's then-current standard API retention windows apply.	OpenAI's then-current standard API terms govern the no-training default; we have not signed an enterprise no-training amendment. Per-request `store: false` prevents generated responses from being stored as OpenAI application state for later retrieval.
Perplexity Sonar-family models	Perplexity	`api.perplexity.ai/chat/completions`	Research queries	Search-grounded text	Standard API terms; provider default retention applies.	Per provider default no-training in standard API terms.

Preferred model aliases are defined in the central model registry and AI Gateway configuration. Model strings flow through this registry and are reviewed during quarterly AI inventory refreshes.

Speech synthesis (TTS)

Voice / model	Vendor	Data in	Retention	Training opt-out
ElevenLabs voices and speech models, including user-selectable voices	ElevenLabs	Text to speak	Standard API terms; provider default retention applies.	Per provider default no-training in standard API terms.
`en-US-Neural2-*`	Google Cloud Text-to-Speech	Text to speak	Standard API terms; provider default retention applies.	Per provider default no-training in standard API terms.

Speech recognition (STT)

Model	Vendor	Data in	Retention	Training opt-out
`nova-2`	Deepgram	Audio clip	Per-request `redact=true` to redact sensitive number-like entities from transcripts, such as payment cards and Social Security numbers. Provider default audio retention otherwise applies.	Per provider default no-training in standard API terms.

Embedding / other

HiringCoachAI does not currently run a first-party embedding, reranker, or classification model in production. Any new embedding, reranker, or classification model is added here before deploy.

Endpoint configuration

Endpoint	Base URL	Timeout	Retry policy
Vercel AI Gateway	`https://ai-gateway.vercel.sh/v1`	60 s	Up to 3 on 5xx
Perplexity	`https://api.perplexity.ai`	30 s	1
ElevenLabs	`https://api.elevenlabs.io/v1`	30 s	1
Deepgram	`https://api.deepgram.com/v1`	30 s	1
Google TTS	`texttospeech.googleapis.com`	30 s	1

Version pinning

Preferred model aliases live in a central registry consumed by the application and the AI Gateway configuration.
OpenAI aliases route through either direct OpenAI API calls or the Vercel AI Gateway. The provider or gateway may resolve aliases to snapshots at request time. To convert an alias into a hard pin, the relevant configuration is updated with the reviewed snapshot identifier.
A drift detector calls each provider's model-list endpoint monthly, resolves what each alias currently points to, and exits non-zero if the underlying snapshot has rolled since the last review. On drift, the responsible engineer either accepts the new snapshot (updating the central audit record with the new value and a fresh verifiedAt date) or pins to the prior weights (replacing the alias with the previous snapshot string).
Deepgram speech-to-text, ElevenLabs speech, and Google text-to-speech identifiers are pinned by provider model or voice name in the central registry or feature-specific configuration.

Change log

Date	Change	Author
2026-04-24	Initial inventory	Engineering Lead
2026-04-24	Added the central model-audit record and monthly drift-detection job; documented monthly review process	Engineering Lead
2026-05-12	Reframed inventory around approved providers/model families and documented legacy route-specific model-string migration gap	Engineering Lead

Quarterly review checklist

During each quarterly review, we verify:

Every AI provider and material model family in production is listed.
Every listed provider/model family has a documented retention and training-opt-out posture.
No new models were introduced without review.
Automated AI inventory parity and retention-flag checks pass.
Vendor certifications and trust-center evidence remain current; see the sub-processors.

← Back to the trust center