Algorithm to draw waveform from audio

Question

I'm trying to draw a waveform from a raw audio file. I demuxed/decoded an audio file using FFmpeg and I have those informations: samples buffer, the size of the samples buffer, the duration of the audio file (in seconds), sample rate (44100, 48000, etc), sample size, sample format (uint8, int16, int32, float, double), and the raw audio data itself.

Digging on the Internet I found this algorithm (more here):

White Noise:

White Noise

The Algorithm

All you need to do is randomize every sample from –amplitude to amplitude. We don’t care about the number of channels in most cases so we just fill every sample with a new random number.

Random rnd = new Random();
short randomValue = 0;

for (int i = 0; i < numSamples; i++)
{
    randomValue = Convert.ToInt16(rnd.Next(-amplitude, amplitude));
    data.shortArray[i] = randomValue;
}

It's really good but I don't want to draw that way, but this way:

audacity

Is there any algorithm or idea of how I can be drawing using the informations that I have?

You appear to be trying to use wave form generating algorithms when you already have your wave form. So it sounds to me like you need to do nothing at all. — Galik
@Galik - What you mean with "you already have your wave form"? The only thing I have is the informations about the raw file listed above, now I'm looking for an algorithm to draw the wave form using those informations. — yayuj
Well the algorithms you linked have nothing to do with drawing the wave. They generate the wave. You generated your wave using ffmpeg to convert raw sound data. So you don't need a generator. I think maybe what you need is some kind of GUI framework that allows you to draw stuff on the screen. — Galik
@Galik - I see. I can use Qt with Canvas or OpenGL, but that is exactly the point, drawing those informations using Canvas or OpenGL. — yayuj
I think you need to pick a framework and then ask a question specifically for it because they all work a little differently. — Galik

Diljeet Diljeet · Accepted Answer · 2018-04-20T07:14:14

EXPLANATION FOR EVERYBODY I am a developer of a dj app and was searching for similar answers. So i will explain all about the music waveform you may see in any software including audacity.

There are 3 types of waveforms used to display in any music software. Namely Samples, Average and RMS.

1) Samples are the actual music points presented in a graph, could be an array of raw audio data (points you see when you zoom the waveform in audacity).

2) Average: most commonly used, suppose you are displaying 3 minute song on screen, so a single point on screen must display atleast 100ms(approx) of the song which has many raw audio points, so for displaying we calculate the average of all the points in that 100ms duration, and so on for the rest of the track (dark blue big waveform in audacity).

3) RMS: similar to average but here instead of average, root mean square of the particular duration is taken (the small light blue waveform inside the blue one is rms waveform in audacity).

Now how to calculate waveforms.

1) Samples is raw data when you decode a song using any technique you get raw samples/points. Now based on the format of points you convert them to range -1 to 1, example if format is 16-bit you divide all points by 32768(maximum range for 16 bit number) and then draw the points.

2) for average waveform - first add all points converting negative values to positive, then multiply by 2 and then take average.

//samples is the array and nb_samples is the length of array
float sum = 0;
for(int i = 0 ; i < nb_samples ; i++){
    if(samples[i] < 0)
        sum += -samples[i];
    else
        sum += samples[i];
}
float average_point = (sum * 2) / nb_samples; //average after multiplying by 2
//now draw this point

3) RMS: its simple take the root mean sqaure - so first square every sample, then take the sum and then calculate the mean and then sqaure root. I will show in programming

//samples is the array and nb_samples is the length of array
float squaredsum = 0;
for(int i = 0 ; i < nb_samples ; i++){
    squaredsum += samples[i] * samples[i]; // square and sum
}
float mean = squaredsum / nb_samples; // calculated mean
float rms_point = Math.sqrt(mean); //now calculate square root in last
//now draw this point

Note here the samples is the array of points for calculating the point/pixel for a particular duration of song. example if you want to draw 1 minute of songs data in 60 pixels so the samples array will be the array of all points in 1 second, i.e the amount of audio points to be displayed in 1 pixel.

Hope this will help someone to clarify the concepts about audio waveform.

Algorithm to draw waveform from audio

7 Answers

showwavespic

showwaves