Hello everyone,
I am currently developing an out-of-tree driver for a sound card that utilizes XDMA for sample transfers. I am currently using a kernel 6.5 with the latest xdma driver cherry-picked, but I don't think any changes since 6.5 is addressing my issue.
My initial issue pertains to the completion of DMA transfers between a start and a stop command in ALSA. If the interval is too brief, the transfer does not conclude properly, leading to distorted samples. A straightforward solution to this problem was to adequately wait for the transfer to finish upon the stop, ie. sleeping until we know the hardware is done with the transfert (the XDMA controller does not support stopping in the middle of a transfer).
To address this DMA issue, I have created a patch [1] that guarantees the completion of the DMA transfer upon the return of xdma_synchronize. This means xdma_synchronize now sleeps, but looking at other drivers around it appears expected to be able to do so.
Regarding the audio implementation, the following patch enforces the synchronization:
int playback_trigger(struct snd_pcm_substream *substream, int command) { struct my_dev *my_dev = snd_pcm_substream_chip(substream);
switch (command) { case SNDRV_PCM_TRIGGER_START: /* Synchronize on start, because the trigger stop is called from an IRQ context */ if (substream->stream == SNDRV_PCM_STREAM_PLAYBACK) dmaengine_synchronize(my_dev->playback_dma_chan); pipe_start(&my_dev->playback, substream); break; case SNDRV_PCM_TRIGGER_STOP: pipe_stop(&my_dev->playback, substream); break; default: return -EINVAL; } return 0; }
In order for a sleepable function like dmaengine_synchronize() to work in the trigger callbacks, the audio card nonatomic flag had to be set. It basically leads the sound core towards the use of a mutex instead of a spinlock.
static int probe([...]) { struct snd_pcm *pcm; [...] /* This flag is needed to be able to sleep in start/stop callbacks */ pcm->nonatomic = true; [...] snd_pcm_set_managed_buffer_all(pcm, [...]); }
This approach generally works well, but leads to "scheduling while atomic" errors. Indeed, the IRQ handler from the XDMA driver invokes a function within the sound subsystem, which subsequently acquires a mutex...
At the moment, the only solution I've found is to replace the wait_for_completion() in the XDMA driver [2] with a busy wait loop. However, this approach seems incorrect, as all other synchronization functions in other DMA drivers are sleeping, which should not cause an issue.
The problem might be related to the sound driver. Should I avoid manually using dmaengine_synchronize? How to achieve the same effect in this case? Perhaps there is a more traditional way to properly clean the stream in the sound subsystem which I overlooked?
Could someone please provide guidance on how to resolve this issue?
Thanks, Louis Chauvet
[1]: https://lore.kernel.org/all/20240327-digigram-xdma-fixes-v1-2-45f4a52c0283@b... [2]: https://elixir.bootlin.com/linux/latest/source/drivers/dma/xilinx/xdma.c#L55...