I tried Machine Learning Spleeter. (Explanation of how to use)

2019 11 09 00x28 50 News
Sponsoring

I tried Machine Learning Spleeter. (Explanation of how to use)

I'm yosi, the administrator of Chillout with Beats.

2020/04/14 postscript
When I tried it again, I couldn't get the following error message in my home environment.I investigated and tried some, but the problem is not solved at present.
Therefore, please note that you may not be able to do it even if you try this article.
Ordinal 242 was not found in the dynamic library C: \ Users \ isee \ Miniconda3 \ Library \ bin \ mkl_intel_thread.dll. 

I tried Spleeter, which has become a hot topic in the streets.

Due to the CLI (command line), the difficulty level is high.

Command line what?If you don't understand, I don't recommend it.

For the time being, please listen to the results of the demo song.

This is amazing.

original data

Extracted vocal data

Data other than vocals

Overview of Spleeter

Although not a well-known topic, the issue of source separation has been of interest to a large community of music signal researchers for decades.It starts with a simple observation.Music recordings are usually a mix of multiple individual instrument tracks (lead vocals, drums, bass, piano, etc.).The tasks for separating music sources are:Given the mix, can these individual tracks (also known as stems) be restored? This has many potential uses.Consider pre-processing for other tasks such as remixing, upmixing, active listening, educational purposes, as well as transcription.

Interestingly, our brain is very good at separating musical instruments.Just focus on one of the instruments on this track (for example, lead vocals) and you'll hear something completely different from the others.But it's not really a separation, you still hear all the other parts.In many cases it may not be possible to accurately recover the individual tracks mixed together.Therefore, the challenge is to approximate them as closely as possible.In other words, try to get as close to the original as possible without creating excessive distortion.

Over the years, many strategies have been studied by dozens of great research teams around the world.If you are interested in this fascinating journey, please read the overview of this article or read it.The pace of recent progress has taken a big leap, mainly due to advances in machine learning methods.To keep track, people are comparing algorithms in international rating campaigns.This shows that Spleeter's performance matches the performance of the proposed optimal algorithm.
In addition, Spleeter is very fast. If you are running the GPU version, you can expect to isolate 100 times faster than in real time, which makes it suitable for processing large datasets.

 

How to install Spleeter

First is the place to get information.

The main story on the official page is "Spleeter".

 
The executable file is here.

Spleeter preferences

To run Spleeter, you first need to prepare the environment.

Download and install Miniconda

Conda is probably the easiest, somini-waveIntroduce.

2019 11 08 16x38 20

I think the default installation is fine.

Install Git

You need a Git client, so download and install it.

 
Click the corresponding OS to start downloading without permission.

2019 11 08 17x18 25

I think the default installation is fine.

 

Install Spleeter

For Windows, start "Anaconda Prompt" from the start menu.

2019 11 08 17x21 37

2019 11 08 17x23 14

Once started, copy and paste the following commands line by line.

git clone https://github.com/deezer/spleeter
conda install -c conda-forge spleeter
conda env create -f spleeter /conda/spleeter-cpu.yaml
conda activate spleeter-cpu
 

Checking the operation of Spleeter using the demo sound source

 
Copy and paste the following command when the installation is complete.
cd spleeter python -m separate -h

If you can install it without any problem, the following help (only a part is described) will be displayed.

usage: spleeter separate [-h] [-a AUDIO_ADAPTER] [-p PARAMS_FILENAME]
[–Verbose] -i AUDIO_FILENAMES [AUDIO_FILENAMES…]
[-o OUTPUT_PATH] [-n {directory, filename}]
[-d MAX_DURATION] [-c {wav, mp3, ogg, m4a, wma, flac}]
[-m]
 

Export to 2stem

2stem can be divided into "vocal" and "non-vocal".
 
Check the operation using the demo file.
 
Copy and paste the following command.
 
python -m spleeter separate -i audio_example.mp3 -o audio_output

The file is stored in the "audio_output \ audio_example" folder.

I tried it twice, but the first time I only get "vocals.wav" with the vocals extracted.

If you execute the command again (use the upper key on the keyboard to go back in the history of the command), it will be output.

"Accompaniment.wav" other than vocals is also output.

Export to 4stem

4stem can be divided into "vocals", "bass", "drums", and "others".
 
Check the operation using the demo file.
spleeter separate -i audio_example.mp3 -o audio_output -p spleeter: 4stems

If successful, 4 files will be exported as below.

INFO: spleeter: File audio_output \ audio_example \ bass.wav written INFO: spleeter: File audio_output \ audio_example \ vocals.wav written INFO: spleeter: File audio_output \ audio_example \ drums.wav written INFO: spleeter: File audio_output \ audio_example \ other. wav written

You can specify how many to divide with "-p". Currently, the following 3 types are supported.

  • spleeter:2stems(ボーカル、その他)
  • spleeter:4stems(ボーカル、ベース、ドラム、その他)
  • spleeter:5stems(ボーカル、ベース、ドラム、ピアノ、その他)

Use Spleeter for your own sound source

If you want to enjoy it for yourself, I think it was okay to extract only the vocals from the sound source you ripped by yourself. (Be careful as it will be out if you upload it to the net)

I will describe only how to do it.

First, copy the file you prepared to the same level as "audio_example.mp3".

All you have to do now is change the command "audio_example.mp3" to the file name of the file you prepared.

spleeter separate -i audio_example.mp3 -o audio_output

I tried it in Japanese, but it worked.

I am surprised that it can be extracted quite beautifully.

If you get an error, it may work if you change it to alphanumericals.

I tried Machine Learning Spleeter. (Explanation of how to use) Summary

It's a great technology.

Moreover, if you have a stem, you can learn more.

This is a guess, but at present it seems that there is little learning of Japanese voice.

If you learn from a karaoke CD etc., it may be possible to extract with considerable accuracy.

Also, since this software can be used for free, there is a high possibility that GUI-included applications using this technology will come out in the future.

I suddenly thought that it would be amazing if this technology and the technology of iZotope's Dialogue Match released the other day were fused.

Sponsoring
Sponsoring
News
Chillout with Beat

Comment

  1. Fans of this site Than:

    I've researched the vocal extraction quite a bit, so I'm providing information.
    Please use it as a material when you can't think of an article.It's okay if you don't use it, but w

    I flew from the annual access ranking, but over time, the environment around Spleeter has improved considerably, and it has reached a level that can be managed if you are a little familiar with personal computers.
    Even so, if you do it properly, you can not use Docker at least, but if it is limited to Windows, a GUI application is provided.
    The name is the same as the Spleeter GUI.
    https://makenweb.com/#spleetergui
    It seems that all the necessary data is included, so it seems that it can be used immediately.
    I'm a Mac user, so I use Spleeter Web, but the GUI seems to be overwhelmingly easy to use on Windows.
    I'm sorry if it doesn't work because I haven't tried it, but it seems to work because it's the second most popular on GitHub related to Spleeter.

    Well, if you include paid, it is not a match with LALAL.AI, but if it is free, I feel that you can know it as an option.
    There are few articles in Japanese.

    • yosi Than:

      Thank you for providing information!
      I didn't know it, so I'll look it up a little.
      This is really easy.
      And uploading is troublesome online, but it's easy and good locally.
      Let me write an article!

      As you may know, AUDIOSTRIP has revived the vocal removal from Youtube with an online service.
      https://www.audiostrip.co.uk/

      It's quite convenient, but I feel like it will disappear if it spreads due to copyright issues ...

      • Fans of this site Than:

        As you may know, AUDIOSTRIP has revived the vocal removal from Youtube with an online service.

        I didn't know at all w
        It looks like a simplified version of Spleeter Web, which is published on GitHub and I also use.
        https://github.com/JeffreyCA/spleeter-web
        This one can use not only Spleeter but also Demucs, Tasnet, CrossNet-Open-Unmix (X-UMX), and it also supports import from YouTube, but you need to use Docker.
        It takes a lot of time to start up and it eats up memory.

        Since this is just a combination of youtubedl, ffmpeg and spleeter, I feel that there is no problem for personal use, but it may be suspicious in terms of copyright when it becomes open to the public like AUDIOSTRIP.

        • yosi Than:

          This one can use not only Spleeter but also Demucs, Tasnet, CrossNet-Open-Unmix (X-UMX), and it also supports import from YouTube, but you need to use Docker.

          Thank you for the information!
          Spleeter Web That's amazing.
          I think I'll try this as well.
          I've never used Docker, but it's really troublesome to eat a lot of capacity in virtual systems.

          I feel that AUDIOSTRIP disappears (or removes downloads from Youtube).
          I think it's a problem to be able to download it.

I copied the title and URL