{
  "config": {},
  "definitions": {
    "subgraphs": [
      {
        "config": {},
        "extra": {
          "workflowRendererVersion": "LG"
        },
        "groups": [
          {
            "bounding": [
              170,
              -230,
              390,
              180
            ],
            "color": "#3f789e",
            "flags": {},
            "font_size": 24,
            "id": 1,
            "title": "Load model here"
          },
          {
            "bounding": [
              170,
              -30,
              390,
              280
            ],
            "color": "#3f789e",
            "flags": {},
            "font_size": 24,
            "id": 4,
            "title": "Latent"
          },
          {
            "bounding": [
              580,
              -230,
              350,
              140
            ],
            "color": "#3f789e",
            "flags": {},
            "font_size": 24,
            "id": 5,
            "title": "Adjust the vocal volume"
          },
          {
            "bounding": [
              170,
              270,
              390,
              223.60000610351562
            ],
            "color": "#3f789e",
            "flags": {},
            "font_size": 24,
            "id": 6,
            "title": "For repainting"
          }
        ],
        "id": "12b82ce4-ea55-42f8-9d19-3db74d8ca70e",
        "inputNode": {
          "bounding": [
            -220,
            150,
            161.376953125,
            120
          ],
          "id": -10
        },
        "inputs": [
          {
            "id": "afaa227b-8330-4e86-924d-a43f69949572",
            "label": "vocal volume multiplier",
            "linkIds": [
              138
            ],
            "name": "multiplier",
            "pos": [
              -78.623046875,
              170
            ],
            "type": "FLOAT"
          },
          {
            "id": "b822bfa5-91ca-46c3-a9a8-38dd21023c11",
            "label": "music tags",
            "linkIds": [
              139
            ],
            "name": "tags",
            "pos": [
              -78.623046875,
              190
            ],
            "type": "STRING"
          },
          {
            "id": "c5fe61ef-5ed0-4c23-b8dc-e4895681cfd6",
            "linkIds": [
              141
            ],
            "name": "lyrics",
            "pos": [
              -78.623046875,
              210
            ],
            "type": "STRING"
          },
          {
            "id": "7681279c-73de-4fd6-a573-1fc45b2add7f",
            "linkIds": [
              142
            ],
            "name": "seconds",
            "pos": [
              -78.623046875,
              230
            ],
            "type": "FLOAT"
          }
        ],
        "links": [
          {
            "id": 113,
            "origin_id": 51,
            "origin_slot": 0,
            "target_id": 49,
            "target_slot": 0,
            "type": "MODEL"
          },
          {
            "id": 114,
            "origin_id": 50,
            "origin_slot": 0,
            "target_id": 49,
            "target_slot": 1,
            "type": "LATENT_OPERATION"
          },
          {
            "id": 136,
            "origin_id": 64,
            "origin_slot": 0,
            "target_id": 68,
            "target_slot": 0,
            "type": "AUDIO"
          },
          {
            "id": 137,
            "origin_id": 40,
            "origin_slot": 2,
            "target_id": 68,
            "target_slot": 1,
            "type": "VAE"
          },
          {
            "id": 122,
            "origin_id": 52,
            "origin_slot": 0,
            "target_id": 18,
            "target_slot": 0,
            "type": "LATENT"
          },
          {
            "id": 83,
            "origin_id": 40,
            "origin_slot": 2,
            "target_id": 18,
            "target_slot": 1,
            "type": "VAE"
          },
          {
            "id": 108,
            "origin_id": 14,
            "origin_slot": 0,
            "target_id": 44,
            "target_slot": 0,
            "type": "CONDITIONING"
          },
          {
            "id": 115,
            "origin_id": 40,
            "origin_slot": 0,
            "target_id": 51,
            "target_slot": 0,
            "type": "MODEL"
          },
          {
            "id": 121,
            "origin_id": 49,
            "origin_slot": 0,
            "target_id": 52,
            "target_slot": 0,
            "type": "MODEL"
          },
          {
            "id": 117,
            "origin_id": 14,
            "origin_slot": 0,
            "target_id": 52,
            "target_slot": 1,
            "type": "CONDITIONING"
          },
          {
            "id": 120,
            "origin_id": 44,
            "origin_slot": 0,
            "target_id": 52,
            "target_slot": 2,
            "type": "CONDITIONING"
          },
          {
            "id": 119,
            "origin_id": 17,
            "origin_slot": 0,
            "target_id": 52,
            "target_slot": 3,
            "type": "LATENT"
          },
          {
            "id": 80,
            "origin_id": 40,
            "origin_slot": 1,
            "target_id": 14,
            "target_slot": 0,
            "type": "CLIP"
          },
          {
            "id": 126,
            "origin_id": 18,
            "origin_slot": 0,
            "target_id": -20,
            "target_slot": 0,
            "type": "AUDIO"
          },
          {
            "id": 138,
            "origin_id": -10,
            "origin_slot": 0,
            "target_id": 50,
            "target_slot": 0,
            "type": "FLOAT"
          },
          {
            "id": 139,
            "origin_id": -10,
            "origin_slot": 1,
            "target_id": 14,
            "target_slot": 1,
            "type": "STRING"
          },
          {
            "id": 141,
            "origin_id": -10,
            "origin_slot": 2,
            "target_id": 14,
            "target_slot": 2,
            "type": "STRING"
          },
          {
            "id": 142,
            "origin_id": -10,
            "origin_slot": 3,
            "target_id": 17,
            "target_slot": 0,
            "type": "FLOAT"
          }
        ],
        "name": "Text-to-Song AceStep (Simplified)",
        "nodes": [
          {
            "flags": {
              "collapsed": false
            },
            "id": 49,
            "inputs": [
              {
                "link": 113,
                "localized_name": "model",
                "name": "model",
                "type": "MODEL"
              },
              {
                "link": 114,
                "localized_name": "operation",
                "name": "operation",
                "type": "LATENT_OPERATION"
              }
            ],
            "mode": 0,
            "order": 8,
            "outputs": [
              {
                "links": [
                  121
                ],
                "localized_name": "MODEL",
                "name": "MODEL",
                "type": "MODEL"
              }
            ],
            "pos": [
              940,
              -160
            ],
            "properties": {
              "Node name for S&R": "LatentApplyOperationCFG",
              "cnr_id": "comfy-core",
              "ver": "0.3.34"
            },
            "size": [
              290,
              50
            ],
            "type": "LatentApplyOperationCFG",
            "widgets_values": []
          },
          {
            "bgcolor": "#533",
            "color": "#322",
            "flags": {},
            "id": 64,
            "inputs": [],
            "mode": 4,
            "order": 0,
            "outputs": [
              {
                "links": [
                  136
                ],
                "localized_name": "AUDIO",
                "name": "AUDIO",
                "type": "AUDIO"
              }
            ],
            "pos": [
              180,
              340
            ],
            "properties": {
              "Node name for S&R": "LoadAudio",
              "cnr_id": "comfy-core",
              "ver": "0.3.34"
            },
            "size": [
              370,
              140
            ],
            "type": "LoadAudio",
            "widgets_values": [
              "ace_step_example.flac",
              null,
              null
            ]
          },
          {
            "flags": {},
            "id": 68,
            "inputs": [
              {
                "link": 136,
                "localized_name": "audio",
                "name": "audio",
                "type": "AUDIO"
              },
              {
                "link": 137,
                "localized_name": "vae",
                "name": "vae",
                "type": "VAE"
              }
            ],
            "mode": 4,
            "order": 3,
            "outputs": [
              {
                "links": null,
                "localized_name": "LATENT",
                "name": "LATENT",
                "type": "LATENT"
              }
            ],
            "pos": [
              180,
              180
            ],
            "properties": {
              "Node name for S&R": "VAEEncodeAudio",
              "cnr_id": "comfy-core",
              "ver": "0.3.34"
            },
            "size": [
              370,
              46
            ],
            "type": "VAEEncodeAudio",
            "widgets_values": []
          },
          {
            "flags": {
              "collapsed": false
            },
            "id": 18,
            "inputs": [
              {
                "link": 122,
                "localized_name": "samples",
                "name": "samples",
                "type": "LATENT"
              },
              {
                "link": 83,
                "localized_name": "vae",
                "name": "vae",
                "type": "VAE"
              }
            ],
            "mode": 0,
            "order": 6,
            "outputs": [
              {
                "links": [
                  126
                ],
                "localized_name": "AUDIO",
                "name": "AUDIO",
                "type": "AUDIO"
              }
            ],
            "pos": [
              1080,
              270
            ],
            "properties": {
              "Node name for S&R": "VAEDecodeAudio",
              "cnr_id": "comfy-core",
              "ver": "0.3.32"
            },
            "size": [
              164.2388671875,
              46
            ],
            "type": "VAEDecodeAudio",
            "widgets_values": []
          },
          {
            "flags": {
              "collapsed": true
            },
            "id": 44,
            "inputs": [
              {
                "link": 108,
                "localized_name": "conditioning",
                "name": "conditioning",
                "type": "CONDITIONING"
              }
            ],
            "mode": 0,
            "order": 7,
            "outputs": [
              {
                "links": [
                  120
                ],
                "localized_name": "CONDITIONING",
                "name": "CONDITIONING",
                "type": "CONDITIONING"
              }
            ],
            "pos": [
              600,
              70
            ],
            "properties": {
              "Node name for S&R": "ConditioningZeroOut",
              "cnr_id": "comfy-core",
              "ver": "0.3.32"
            },
            "size": [
              204.134765625,
              26
            ],
            "type": "ConditioningZeroOut",
            "widgets_values": []
          },
          {
            "flags": {
              "collapsed": false
            },
            "id": 51,
            "inputs": [
              {
                "link": 115,
                "localized_name": "model",
                "name": "model",
                "type": "MODEL"
              }
            ],
            "mode": 0,
            "order": 2,
            "outputs": [
              {
                "links": [
                  113
                ],
                "localized_name": "MODEL",
                "name": "MODEL",
                "type": "MODEL"
              }
            ],
            "pos": [
              590,
              -40
            ],
            "properties": {
              "Node name for S&R": "ModelSamplingSD3",
              "cnr_id": "comfy-core",
              "ver": "0.3.34"
            },
            "size": [
              330,
              60
            ],
            "type": "ModelSamplingSD3",
            "widgets_values": [
              5.000000000000001
            ]
          },
          {
            "flags": {},
            "id": 50,
            "inputs": [
              {
                "link": 138,
                "localized_name": "multiplier",
                "name": "multiplier",
                "type": "FLOAT",
                "widget": {
                  "name": "multiplier"
                }
              }
            ],
            "mode": 0,
            "order": 9,
            "outputs": [
              {
                "links": [
                  114
                ],
                "localized_name": "LATENT_OPERATION",
                "name": "LATENT_OPERATION",
                "type": "LATENT_OPERATION"
              }
            ],
            "pos": [
              590,
              -160
            ],
            "properties": {
              "Node name for S&R": "LatentOperationTonemapReinhard",
              "cnr_id": "comfy-core",
              "ver": "0.3.34"
            },
            "size": [
              330,
              58
            ],
            "type": "LatentOperationTonemapReinhard",
            "widgets_values": [
              1
            ]
          },
          {
            "flags": {},
            "id": 52,
            "inputs": [
              {
                "link": 121,
                "localized_name": "model",
                "name": "model",
                "type": "MODEL"
              },
              {
                "link": 117,
                "localized_name": "positive",
                "name": "positive",
                "type": "CONDITIONING"
              },
              {
                "link": 120,
                "localized_name": "negative",
                "name": "negative",
                "type": "CONDITIONING"
              },
              {
                "link": 119,
                "localized_name": "latent_image",
                "name": "latent_image",
                "type": "LATENT"
              }
            ],
            "mode": 0,
            "order": 10,
            "outputs": [
              {
                "links": [
                  122
                ],
                "localized_name": "LATENT",
                "name": "LATENT",
                "slot_index": 0,
                "type": "LATENT"
              }
            ],
            "pos": [
              940,
              -40
            ],
            "properties": {
              "Node name for S&R": "KSampler",
              "cnr_id": "comfy-core",
              "ver": "0.3.34"
            },
            "size": [
              290,
              262
            ],
            "type": "KSampler",
            "widgets_values": [
              1089723283857463,
              "randomize",
              50,
              5,
              "euler",
              "simple",
              1
            ]
          },
          {
            "flags": {},
            "id": 17,
            "inputs": [
              {
                "link": 142,
                "localized_name": "seconds",
                "name": "seconds",
                "type": "FLOAT",
                "widget": {
                  "name": "seconds"
                }
              }
            ],
            "mode": 0,
            "order": 5,
            "outputs": [
              {
                "links": [
                  119
                ],
                "localized_name": "LATENT",
                "name": "LATENT",
                "type": "LATENT"
              }
            ],
            "pos": [
              180,
              50
            ],
            "properties": {
              "Node name for S&R": "EmptyAceStepLatentAudio",
              "cnr_id": "comfy-core",
              "ver": "0.3.32"
            },
            "size": [
              370,
              82
            ],
            "type": "EmptyAceStepLatentAudio",
            "widgets_values": [
              60,
              1
            ]
          },
          {
            "bgcolor": "#533",
            "color": "#322",
            "flags": {},
            "id": 40,
            "inputs": [],
            "mode": 0,
            "order": 1,
            "outputs": [
              {
                "links": [
                  115
                ],
                "localized_name": "MODEL",
                "name": "MODEL",
                "type": "MODEL"
              },
              {
                "links": [
                  80
                ],
                "localized_name": "CLIP",
                "name": "CLIP",
                "type": "CLIP"
              },
              {
                "links": [
                  83,
                  137
                ],
                "localized_name": "VAE",
                "name": "VAE",
                "type": "VAE"
              }
            ],
            "pos": [
              180,
              -160
            ],
            "properties": {
              "Node name for S&R": "CheckpointLoaderSimple",
              "cnr_id": "comfy-core",
              "models": [
                {
                  "directory": "checkpoints",
                  "name": "ace_step_v1_3.5b.safetensors",
                  "url": "https://huggingface.co/Comfy-Org/ACE-Step_ComfyUI_repackaged/resolve/main/all_in_one/ace_step_v1_3.5b.safetensors?download=true"
                }
              ],
              "ver": "0.3.32"
            },
            "size": [
              370,
              98
            ],
            "type": "CheckpointLoaderSimple",
            "widgets_values": [
              "ace_step_v1_3.5b.safetensors"
            ]
          },
          {
            "flags": {},
            "id": 14,
            "inputs": [
              {
                "link": 80,
                "localized_name": "clip",
                "name": "clip",
                "type": "CLIP"
              },
              {
                "link": 139,
                "localized_name": "tags",
                "name": "tags",
                "type": "STRING",
                "widget": {
                  "name": "tags"
                }
              },
              {
                "link": 141,
                "localized_name": "lyrics",
                "name": "lyrics",
                "type": "STRING",
                "widget": {
                  "name": "lyrics"
                }
              }
            ],
            "mode": 0,
            "order": 4,
            "outputs": [
              {
                "links": [
                  108,
                  117
                ],
                "localized_name": "CONDITIONING",
                "name": "CONDITIONING",
                "type": "CONDITIONING"
              }
            ],
            "pos": [
              590,
              120
            ],
            "properties": {
              "Node name for S&R": "TextEncodeAceStepAudio",
              "cnr_id": "comfy-core",
              "ver": "0.3.32"
            },
            "size": [
              340,
              500
            ],
            "type": "TextEncodeAceStepAudio",
            "widgets_values": [
              "synthwave, techno, synthpop, futuristic, electro, with liquid drum & bass drive.\nRestless, confident, dreamy mood at 128 BPM.\nAnalog bass, pulsating arps, percussive synth stabs, gated drums.\nQuick build,  then explosive drum burst, then clean fade.\nBreathy, rhythmic female vocals, minimal emotion, metallic echo.",
              "Verse\nNeon rain on my screen,\nDreams compile in silver sheen.\nNo weight, just motion,\nI’m plugged into emotion.\n\nChorus\nComfy Cloud — breathing light,\nCode and color, spark and wire.\nDrift through data, feel alive,\nIn your circuits, I arrive.",
              0.9900000000000002
            ]
          }
        ],
        "outputNode": {
          "bounding": [
            1350,
            260,
            120,
            60
          ],
          "id": -20
        },
        "outputs": [
          {
            "id": "64b3e663-3fea-4f74-a75e-77030e6a01cf",
            "linkIds": [
              126
            ],
            "localized_name": "AUDIO",
            "name": "AUDIO",
            "pos": [
              1370,
              280
            ],
            "type": "AUDIO"
          }
        ],
        "revision": 0,
        "state": {
          "lastGroupId": 7,
          "lastLinkId": 142,
          "lastNodeId": 73,
          "lastRerouteId": 0
        },
        "version": 1,
        "widgets": []
      }
    ]
  },
  "extra": {
    "VHS_KeepIntermediate": true,
    "VHS_MetadataImage": true,
    "VHS_latentpreview": false,
    "VHS_latentpreviewrate": 0,
    "ds": {
      "offset": [
        1448.8888351815401,
        826.7901973410861
      ],
      "scale": 0.44597694796838705
    },
    "frontendVersion": "1.35.7",
    "node_versions": {
      "ace-step": "06f751d65491c9077fa2bc9b06d2c6f2a90e4c56",
      "comfy-core": "0.3.34"
    },
    "workflowRendererVersion": "Vue"
  },
  "groups": [],
  "id": "88ac5dad-efd7-40bb-84fe-fbaefdee1fa9",
  "last_link_id": 138,
  "last_node_id": 77,
  "links": [
    [
      138,
      74,
      0,
      59,
      0,
      "AUDIO"
    ]
  ],
  "nodes": [
    {
      "flags": {},
      "id": 59,
      "inputs": [
        {
          "link": 138,
          "name": "audio",
          "type": "AUDIO"
        }
      ],
      "mode": 0,
      "order": 3,
      "outputs": [],
      "pos": [
        1508.0409984041914,
        -341.42319894431466
      ],
      "properties": {
        "Node name for S&R": "SaveAudioMP3",
        "cnr_id": "comfy-core",
        "ver": "0.3.34"
      },
      "size": [
        575.875,
        298.671875
      ],
      "type": "SaveAudioMP3",
      "widgets_values": [
        "audio/ComfyUI",
        "V0"
      ]
    },
    {
      "flags": {},
      "id": 74,
      "inputs": [
        {
          "label": "vocal volume multiplier",
          "link": null,
          "name": "multiplier",
          "type": "FLOAT",
          "widget": {
            "name": "multiplier"
          }
        },
        {
          "label": "music tags",
          "link": null,
          "name": "tags",
          "type": "STRING",
          "widget": {
            "name": "tags"
          }
        }
      ],
      "mode": 0,
      "order": 0,
      "outputs": [
        {
          "links": [
            138
          ],
          "name": "AUDIO",
          "type": "AUDIO"
        }
      ],
      "pos": [
        834.8349585680651,
        -342.74893463824935
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "proxyWidgets": [
          [
            "-1",
            "multiplier"
          ],
          [
            "-1",
            "tags"
          ],
          [
            "-1",
            "lyrics"
          ],
          [
            "-1",
            "seconds"
          ]
        ],
        "ver": "0.3.65"
      },
      "size": [
        639.859375,
        555.734375
      ],
      "type": "12b82ce4-ea55-42f8-9d19-3db74d8ca70e",
      "widgets_values": [
        1,
        "synthwave, techno, synthpop, futuristic, electro, with liquid drum & bass drive.\nRestless, confident, dreamy mood at 128 BPM.\nAnalog bass, pulsating arps, percussive synth stabs, gated drums.\nQuick build,  then explosive drum burst, then clean fade.\nBreathy, rhythmic female vocals, minimal emotion, metallic echo.",
        "Verse\nNeon rain on my screen,\nDreams compile in silver sheen.\nNo weight, just motion,\nI’m plugged into emotion.\n\nChorus\nComfy Cloud — breathing light,\nCode and color, spark and wire.\nDrift through data, feel alive,\nIn your circuits, I arrive.",
        60
      ]
    },
    {
      "bgcolor": "#000",
      "color": "#432",
      "flags": {},
      "id": 77,
      "inputs": [],
      "mode": 0,
      "order": 1,
      "outputs": [],
      "pos": [
        80,
        300
      ],
      "properties": {},
      "size": [
        730,
        1046
      ],
      "title": "About ACE Step and Multi-language Input",
      "type": "MarkdownNote",
      "widgets_values": [
        "Currently, multi-language support in ACE-Step V1 is implemented by uniformly converting all languages to English characters. At present, in ComfyUI, we haven't implemented the step of converting multiple languages into English. This is because if we need to implement the corresponding conversion, we have to add additional ComfyUI core dependencies, which may lead to unresolved dependency conflicts.\n\nSo, currently, if you need to input multi-language text, you have to convert it into English characters to complete this process manually. Then, at the beginning of the corresponding `lyrics`, input the abbreviation of the corresponding language code.\n\nFor example, use `[zh]` for Chinese, `[ja]` for Japanese, `[ko]` for Korean, and so on. For specific language input, please check the examples in the instructions. \n\nFor example, Chinese `[zh]`, Japanese `[ja]`, Korean `[ko]`, etc.\n\nExample:\n\n```\n[verse]\n\n[zh]wo3zou3guo4shen1ye4de5jie1dao4\n[zh]leng3feng1chui1luan4si1nian4de5piao4liang4wai4tao4\n[zh]ni3de5wei1xiao4xiang4xing1guang1hen3xuan4yao4\n[zh]zhao4liang4le5wo3gu1du2de5mei3fen1mei3miao3\n\n[chorus]\n\n[verse]​\n[ko]hamkke si-kkeuleo-un sesang-ui sodong-eul pihae​\n[ko]honja ogsang-eseo dalbich-ui eolyeompus-ileul balaboda​\n[ko]niga salang-eun lideum-i ganghan eum-ag gatdago malhaess-eo​\n[ko]han ta han tamada ma-eum-ui ondoga eolmana heojeonhanji ijge hae\n\n[bridge]\n[es]cantar mi anhelo por ti sin ocultar\n[es]como poesía y pintura, lleno de anhelo indescifrable\n[es]tu sombra es tan terca como el viento, inborrable\n[es]persiguiéndote en vuelo, brilla como cruzar una mar de nubes\n\n[chorus]\n[fr]que tu sois le vent qui souffle sur ma main\n[fr]un contact chaud comme la douce pluie printanière\n[fr]que tu sois le vent qui s'entoure de mon corps\n[fr]un amour profond qui ne s'éloignera jamais\n\n```\n\n"
      ]
    },
    {
      "bgcolor": "#000",
      "color": "#432",
      "flags": {},
      "id": 75,
      "inputs": [],
      "mode": 0,
      "order": 2,
      "outputs": [],
      "pos": [
        82.6664201947749,
        -338.65026679167113
      ],
      "properties": {},
      "size": [
        732,
        571.296875
      ],
      "title": "Note: how to use this workflow",
      "type": "MarkdownNote",
      "widgets_values": [
        "[Tutorial](https://docs.comfy.org/tutorials/audio/ace-step/ace-step-v1) \n\nThe AceStep requires `tags` and `lyrics` to generate music. `Tags` are for the music rhythm and type, and `lyrics` literally mean the words of the song.\n \n## For Comfy Cloud users\n\nIf you are using [cloud.comfy.org](https://cloud.comfy.org/):\n1. For the first run, you can just click the run button to see what happens. \n\n2. Try to updating the `tags(top one)`, and `lyrics(bottom one)` to have fun with it\n\n## For Local Uers\n\nDownload the following model and save it to the **ComfyUI/models/checkpoints** folder.\n[ace_step_v1_3.5b.safetensors](https://huggingface.co/Comfy-Org/ACE-Step_ComfyUI_repackaged/blob/main/all_in_one/ace_step_v1_3.5b.safetensors)\n\n## Report Issues\n\nIf you have any problems running this workflow, please report template-related issues via this link: [report the template issue here](https://github.com/Comfy-Org/workflow_templates/issues)"
      ]
    }
  ],
  "revision": 0,
  "version": 0.4
}