For stereo audio files only. When enabled, transcribes left and right channels independently, then merges them. Note: This will double the transcription time.
If checked, separates the audio into its component stems (vocals, drums, etc.) before processing.
Select a profile to auto-fill settings for different instrument types.For reference only; it is recommended to test and adjust for optimal results.
When enabled, the model actively looks for and emphasizes the start of each note (the 'attack'). Recommended for percussive or clear, rhythmic music. Disable for very smooth, legato music like vocal pads.
When enabled, uses a secondary melody-detection algorithm to refine the main pitch contour. Highly recommended for most melodic content. Disable if you are transcribing non-melodic noise or complex polyphony.
When enabled, allows a single note to have multiple, continuous pitch bends within it. Essential for transcribing vocals, slides, or vibrato-heavy instruments. Disable for clean, discrete notes like a standard piano.