1) Программирование многоядерных DSP-процессоров TMS320C66x с использованием OpenMP https://habr.com/ru/articles/318762/
pdf:
1) Sparse Matrix-Vector Multiply on the Texas Instruments C6678 Digital Signal Processor https://pdfs.semanticscholar.org/6617/964cd7ead75d18a7b25dcc04c222abdce1f9.pdf
2) Optimising loops in c66 https://www.ti.com/lit/pdf/sprabg7
3) c66x instruction set https://www.ti.com/lit/ug/sprugh7/sprugh7.pdf