On 11/12/19 10:37 PM, Kuninori Morimoto wrote:
Hi Pierre-Louis
Thank you for your report
for_each_rtd_components(rtd, rtdcom, component) {
pr_err("plb: %s processing component\n", __func__);
if (!component)
pr_err("plb: %s component is NULL\n", __func__);
Could you perhaps add traces of which components are being accessed at each stage? We might want to go through and just add something like that in the code anyway to help figure things out.
I tried to add more traces but couldn't triangulate on a clear issue, and the traces have an Heisenbug effect.
So I switched to higher-level code analysis: it turns out that soc_dai_link_remove() routine is called from both topology and on card cleanup.
The patch 06/19 in this series essentially forces the pcm_runtimes to be freed in both cases, so possibly twice for topology-managed dailinks - or using information that's been freed already.
I 'fixed' this by adding an additional parameter to avoid doing the pcm runtime free from the topology (as was done before), and the kernel oops goes away. My tests have been running for 45mn now, when without change I get a kernel oops in less than 10-20 cycles (but still more than apparently our CI tracks, something to improve).
I pushed the code on GitHub to check if there are any negative points reported by the Intel CI, should be complete shortly: https://github.com/thesofproject/linux/pull/1469
I am not sure the suggested fix is correct, I don't fully get what the topology and card cleanups should do and how the work is split, if at all.
BTW, I guess your kernel is appling this patch, but, is it correct ?
df95a16d2a9626dcfc3f2b3671c9b91fa076c997 ("ASoC: soc-core: fix RIP warning on card removal")
Sorry morimoto-san, I am not getting the question.
Are you asking if the patch is correct?
Or are you asking if the kernel used for this test include this patch? The answer is yes, this patch ("ASoC: soc-core: fix RIP warning on card removal") was merged by Mark and the SOF tree does use it, since we follow Mark's tree.
If so, I think I could understand the issue. But, before explaining detail, I want to confirm that my solution is correct or not first.
Can you please check attached patch ?
Takashi's feedback seems to hint at problems with this patch, so will wait for further instructions here if you want me to test. Thanks!
Then, please remove above RIP warning patch. It is not clean patch, but OK to confirming, so far. I think
- no kernel Oops
- no RIP warning
Thank you for your help !! Best regards
Kuninori Morimoto
Alsa-devel mailing list Alsa-devel@alsa-project.org https://mailman.alsa-project.org/mailman/listinfo/alsa-devel