Kuroki Kasori AI

Kuroki Kasori AI for DiffSinger is a synthesized male Japanese voice, trained with artificial intelligence using the shallow diffusion mechanism. This voice bank can naturally replicate the singing of the original voice provider and offers exceptional versatility through its seven available vocal modes: Natural, Soft, Shy, Ghost, Open, Solid and Falsetto.
With these modes, Kuroki Kasori AI can produce soft and tender tones as well as solid and aggressive ones, covering a wide vocal range that adapts to various musical and artistic needs.
Additionally, this voice model includes an Autopitch function that enhances the natural quality of its singing.
Vocal Modes


Natural:
This is Kuroki Kasori's default vocal mode, replicating his natural and distinctive singing. His voice is sweet and soft, with delicate tones that gain strength and power in the higher notes.
It's the most developed vocal mode of the voice model, with a total dataset of 2:52:52.
Recommended tension: 0 to 30.
Voice Model Data
Dataset JP: 4:34:34
- Natural: 1:43:11
- Soft: 51:34
- Shy: 20:15
- Ghost: 12:56
- Open: 31:27
- Solid: 44:36
- Falsetto: 32:18
The voice model was trained with tension (vr), which makes it easier and more comfortable to strengthen or soften the voice in any available vocal mode.
Special Phonemes:
AP/br/bre/breath: To activate the breaths
hh: To use stronger exhalations
cl: For short pauses between notes
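As a rough illustration of how the special phonemes above might sit inside a phoneme sequence, here is a minimal Python sketch. The lookup table and the `describe_special` helper are hypothetical and only restate the list above; they are not part of DiffSinger or of any editor's API, and the ordinary phonemes in the example sequence are placeholders.

```python
# Hypothetical lookup table restating the special phonemes listed above.
# It is an illustration only, not part of DiffSinger or any editor API.
SPECIAL_PHONEMES = {
    "AP": "breath",
    "br": "breath",
    "bre": "breath",
    "breath": "breath",
    "hh": "strong exhalation",
    "cl": "short pause",
}


def describe_special(phonemes):
    """Return (phoneme, effect) pairs for the special phonemes in a sequence."""
    return [(p, SPECIAL_PHONEMES[p]) for p in phonemes if p in SPECIAL_PHONEMES]


if __name__ == "__main__":
    # Placeholder sequence: a breath, two syllables split by a short pause,
    # then a stronger exhalation at the end.
    line = ["AP", "k", "a", "cl", "t", "a", "hh"]
    for phoneme, effect in describe_special(line):
        print(f"{phoneme}: {effect}")
```

The lookup is case-sensitive because the list writes `AP` in upper case and the other phonemes in lower case.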
Updates:
- 29/01/2024: Kuroki Kasori AI for DiffSinger 1.0
- 03/03/2024: Kuroki Kasori AI for DiffSinger 1.1
- 17/03/2024: Kuroki Kasori AI for DiffSinger 1.2
- 22/05/2024: Kuroki Kasori AI for DiffSinger 1.3
- 08/07/2024: Kuroki Kasori AI for DiffSinger 2.0
- 06/02/2025: Kuroki Kasori AI for DiffSinger 2.5
Kuroki Kasori AI 1.3 (old version containing only the main "Natural" vocal mode).
Kuroki Kasori AI 2.5 (new, best option: more complete and updated voice model).