skip to main |
skip to sidebar
Weekly Report 10
Status and Accomplishment
- Tested the c6accel library. Found to have similar performance compared to mine. But was not better than opencv native library. The relative performance measured was 4209167/4577332 for 16-bit sobel. With continious memory allocation the performance was 4209167/4498566.
- Tried to remove some of the cache writeback but couldnot see any difference in performance.
- So, instead of only using DSP for the algorithm, I assigned the task between the 2 processors by dividing the data to work on . Created 2 thread one calling the ARM side and other calling the DSP side. There was a slight improve in the performance but still not better than native ARM side code. The performance achieved this time was 4209167/4467565. The half of the output image was visually dissimilar to the other half in terms of edge contrasts. I will upload this picture.
- I think only way I can gain performance is by working on both the processor. Creating 2 API for same function, one for DSP and the other for ARM.
- Worked on the application part too. Started coding for it.
Plans- Instead on only working on DSP, I am planning to use it for task offloading. Creating asynchronous API and fetching the result later.
- Look into performance and application.
Blockers
- Still not able to beat the ARM performance.
No comments:
Post a Comment