Spanish-English Contact Center ASR
- 2-channel
- Speaker separation
- ES + EN
- Languages
- Code-switch
- Tagged
- PII-reviewed
- Privacy
We typically reply within 24 hours
Bilingual Spanish-English contact-center conversations with frequent code-switching, captured for robust bilingual ASR.
Contact-center conversations that move between Spanish and English the way bilingual callers and agents actually speak, including mid-sentence code-switching. Speaker-separated audio and language-tagged transcripts make this usable for robust bilingual recognition.
Every session is collected from paid contributors with signed consent, transcribed by bilingual native reviewers, and screened for personal information before delivery.
Highlights
- Natural Spanish-English code-switching as it occurs in real calls
- Speaker-separated audio with language-tagged, timestamped transcripts
- Agent and customer role labels across common support scenarios
- Optional intent, entity, and disposition annotations
- Consent-cleared and PII-reviewed before delivery
Scenario coverage
Common contact-center scenarios with bilingual speakers. Coverage extends to specific industries, intents, or language balances on request.
Capture and format
Stereo WAV at 16 kHz or higher, one speaker per channel, recorded in low-noise environments. Sessions are segmented into utterances and time-aligned to language-tagged transcripts.
Annotations
Language-tagged, role-labeled, timestamped transcripts as standard, plus optional intent, entity, and disposition labels on request.
Provenance
- Paid contributors with signed consent
- Native-speaker transcription and quality review
- PII reviewed and redacted before delivery
- Per-recording audit trail and licensable usage rights
Use cases
- Bilingual and code-switching ASR training
- Language identification within conversations
- Intent and entity models for bilingual voice agents
- Contact-center analytics and QA