This API covers an audio track by transforming it into a new style while retaining its core melody. It incorporates Suno’s upload capability, enabling users to upload an audio file for processing. The expected result is a refreshed audio track with a new style, keeping the original melody intact.
V4_5ALL, V4, V4_5, V4_5PLUS, V5, V5_5. V5_5: Unleash Your Voice: Custom Models Tailored to Your Unique Taste — same limits as V5 where applicable.m (male), f (female)All endpoints require authentication using Bearer Token.
Add to request headers:
Authorization: Bearer YOUR_API_KEY⚠️ Note:
- Keep your API Key secure and do not share it with others
- If you suspect your API Key has been compromised, reset it immediately from the management page
The URL for uploading audio files, required regardless of whether customMode and instrumental are true or false.
Upload Audio Duration Limits: Maximum upload duration is 8 minutes
"https://storage.example.com/upload"
Enables Custom Mode for advanced audio generation settings.
true to use Custom Mode (requires style and title; prompt required if instrumental is false). The prompt will be strictly used as lyrics if instrumental is false.false for Non-custom Mode (only prompt is required). Lyrics will be auto-generated based on the prompt.true
Determines if the audio should be instrumental (no lyrics).
customMode: true):
true: Only style and title are required.false: style, title, and prompt are required (with prompt used as the exact lyrics).customMode: false): No impact on required fields (prompt only). Lyrics are auto-generated if instrumental is false.true
The AI model version to use for generation.
V5: Superior musical expression, faster generation.V5_5: Unleash Your Voice: Custom Models Tailored to Your Unique Taste. Same custom-mode prompt and style character limits as V5 (5000 / 1000).V4_5PLUS: V4.5+ is richer sound, new waysto create, max 8 min.V4_5: V4.5 is smarter prompts, fastergenerations, max 8 min.V4: V4 is improved vocal quality,max 4 min.V4_5ALL: V4.5-all is better song structure,max 8 min.V4_5ALL, V4, V4_5, V4_5PLUS, V5, V5_5 "V4_5ALL"
The URL to receive task completion notifications when upload and cover audio is complete. The callback process has three stages: text (text generation), first (first track complete), complete (all tracks complete). Note: In some cases, text and first stages may be skipped, directly returning complete.
"https://api.example.com/callback"
A description of the desired audio content.
customMode: true): Required if instrumental is false. The prompt will be strictly used as the lyrics and sung in the generated track. Character limits by model:
customMode: false): Always required. The prompt serves as the core idea, and lyrics will be automatically generated based on it (not strictly matching the input). Maximum 500 characters."A calm and relaxing piano track with soft melodies"
The music style or genre for the audio.
customMode: true). Examples: "Jazz", "Classical", "Electronic".
customMode: false): Leave empty."Classical"
The title of the generated music track.
customMode: true). Character limits by model:
customMode: false): Leave empty."Peaceful Piano Meditation"
Persona ID to apply to the generated music. Optional. Only available when Custom Mode is enabled. Use this to apply a specific persona style to your music generation.
To generate a persona ID, use the Generate Persona endpoint to create a personalized music Persona based on generated music.
"persona_123"
Persona model type to apply when using personaId. Optional.
style_persona (default): Applies style-focused persona characteristics.voice_persona: Applies voice-focused persona characteristics (only available with V5 and V5_5 models).style_persona, voice_persona "style_persona"
Music styles or traits to exclude from the generated audio.
"Heavy Metal, Upbeat Drums"
Preferred vocal gender. Optional. Allowed values: 'm' (male), 'f' (female).
m, f "m"
Style adherence weight. Optional. Range: 0-1. Two decimal places recommended.
0 <= x <= 1Must be a multiple of 0.010.65
Creativity/novelty constraint. Optional. Range: 0-1. Two decimal places recommended.
0 <= x <= 1Must be a multiple of 0.010.65
Relative weight of audio consistency versus other controls. Optional. Range: 0-1. Two decimal places recommended.
0 <= x <= 1Must be a multiple of 0.010.65
Request successful
200, 400, 401, 404, 405, 413, 429, 430, 455, 500 200
Error message when code != 200
"success"