Help CenterExporting & FormatsJSON Format: Structured Data for Developers
💾advanced

JSON Format: Structured Data for Developers

Export complete transcription data as JSON with segments, timestamps, metadata, and speaker information for API integration.

4 min read
TranscribeNext Team
Updated Jan 15, 2025

JSON format exports the complete transcription data as structured JSON, perfect for developers building integrations, custom applications, or data analysis pipelines.

What You Get

  • Complete transcription metadata
  • Full text content
  • Segmented data with timestamps
  • Language detection information
  • Duration in seconds
  • Creation date
  • Unique transcription ID
  • Original filename

JSON Structure

The JSON export follows this structure:

{
  "id": "transcription_abc123",
  "fileName": "meeting-2025-01-15.mp3",
  "text": "Full transcription text...",
  "language": "en",
  "duration": 1825.45,
  "createdAt": "2025-01-15T10:30:00.000Z",
  "segments": [
    {
      "id": 1,
      "start": 0.0,
      "end": 5.23,
      "text": "Hello everyone and welcome..."
    },
    {
      "id": 2,
      "start": 5.23,
      "end": 9.45,
      "text": "Thank you for having me..."
    }
  ]
}

Field Descriptions

  • id - Unique transcription identifier
  • fileName - Original uploaded file name
  • text - Complete transcription as single string
  • language - Detected/selected language code (en, es, fr, etc.)
  • duration - Total audio duration in seconds (float)
  • createdAt - ISO 8601 timestamp of creation
  • segments - Array of timed text segments

Segment Object

Each segment in the segments array contains:

  • id - Sequential segment number
  • start - Start time in seconds (float)
  • end - End time in seconds (float)
  • text - Segment text content

Best Use Cases

  • API integrations and webhooks
  • Custom data processing pipelines
  • Analytics and text mining
  • Building custom subtitle generators
  • Database imports
  • Machine learning training data
  • Search indexing
  • Custom report generation

JSON data in code editor

/images/articles/json-code-example.png

Pro Tip

JSON exports are formatted with 2-space indentation for readability. Use JSON.parse() in JavaScript or json.loads() in Python to process the data.

Processing JSON Data (JavaScript)

// Read and process JSON export
const data = JSON.parse(jsonString);

// Get total duration in minutes
const minutes = Math.floor(data.duration / 60);

// Extract all text
const fullText = data.text;

// Process segments
data.segments.forEach(segment => {
  console.log(`[${segment.start}s] ${segment.text}`);
});

// Filter segments by time range
const firstMinute = data.segments.filter(
  seg => seg.start < 60
);

Processing JSON Data (Python)

import json

# Load JSON file
with open('transcription.json', 'r') as f:
    data = json.load(f)

# Access fields
print(f"File: {data['fileName']}")
print(f"Duration: {data['duration']}s")
print(f"Language: {data['language']}")

# Process segments
for segment in data['segments']:
    print(f"[{segment['start']:.2f}s] {segment['text']}")

Important

JSON files can be large for long transcriptions. A 1-hour transcription may result in a 100-200KB JSON file depending on segment density.

Tags

jsonapideveloperstructured-dataintegration