Collaborative Sessions: Real-Time Translation Between Languages
Imagine sitting across from someone who speaks a different language. No interpreter, no phone switching, no copy-pasting. You both just talk. dijin transcribes and translates in real-time — each person reads in their own language.
Two People, One Device, Two Languages
When two speakers use different languages on the same device, dijin activates cross-translation mode automatically. The system detects two distinct languages and enables bidirectional translation on its own.
How Cross-Translation Activates
First Speaker Talks (Japanese)
The speech engine transcribes, diarization assigns "speaker_0." System records: speaker_0 speaks Japanese. One language detected — cross-translation stays off.
Second Speaker Talks (Turkish)
The speech engine transcribes, diarization assigns "speaker_1." System records: speaker_1 speaks Turkish. Two languages detected — cross-translation activates automatically.
Bidirectional Translation Begins
Every segment from speaker_0 gets translated to Turkish, every segment from speaker_1 gets translated to Japanese. Automatic, continuous, real-time.
Two Translation Layers
dijin maintains two independent translation streams that never interfere with each other:
What You See on Screen
Speaker Memory Across Days
Works Offline
Apple Translation powers the on-device translation engine. No internet required for the core translation flow. Your audio and translations stay entirely on your device.
Online Multi-Participant Sessions
Beyond local two-person mode, dijin supports online collaborative sessions:
| Feature | How It Works |
|---|---|
| Session Creation | Host creates with join code + PIN |
| Participant Joining | From their own devices |
| Real-time Transcript | Broadcasted via WebSocket |
| Language Preference | Each participant sees their language |
| Rate Limiting | 200ms interval, 500 char cap |
Local + Online Combined
The two-person local mode and online multi-participant mode work together. Two people sharing a device can host an online session where remote participants also join — everyone sees the transcript in their own language.
Use Cases
The collaborative session infrastructure is live. The cross-translation local mode is in active development.