After long time test. the test procedure is as the following steps:
- capture voice data as PCM data.
- encoded as amr-wb bit stream.
- decoded as PCM data from encoded bit stream.
- playback PCM data.
after 16 hours continue tests, there are some delay, about 800 mili seconds.
You did not specify between what two quantities you've measured the delay, but I guess you mean the relative positions of the playback and capture devices.
The speed of the devices is determined by the hardware. The playback and capture devices will run out of sync if they are not using the same clock. Are the capture and playback devices on different computers?
thank you very much. sorry, I don't write very clear. the codec loopback is test an arm platform. the position difference is between capture and playback. when I send an 'a' speech into mic, and after 800ms, the speaker will playback. after 16 houres the delay is existed. but after further test, the delay will be reset to almost less 200ms. I'm very wondering why. BTW, there are about two times snd_pcm_prepare in capture direction. what does snd_pcm_prepare actually do? I don't know the clock is sample rate or not. but the both direction are the same, 8k, 16k, etc.
is there any way to reduce the delay after long time test.
Measure the relative speed of the capture and playback devices, then resample accordingly (or just insert or drop a sample when needed).
this is a good question. in our case, the sample number is controlled by voice codec. codec will calculate the exactly sample numbers. if we know the exactly time delay from call snd_pcm_writei to hardware generate sound, we can decide how to steal sound samples. but the exactly time delay we couldn't know. BTW, both capture and playback are at the same sample rate, 8k, 16k, etc.
- If we want using salsa lib to replace the usage of alsa lib, what do we have to take care?
Plugins for sample rate/format conversion of software mixing are not available, so your application must use a sample format support by the hardware, and only one application can use the device at the same time.
so, it means if my application didn't need any particular format, like only 16bit LE, 1 channel, 16kHz and the hardware can provide such kind of format the plugins will pass the data from hardware to my application without any conversion? In other words, If my hardware can provide the format wich I want, the plugins will just use a little cpu to move data, and no conversion effort. is it or not?
Thank you very much.
Best Regards, Akio