Paper: IEEE Access – DNN partitioning for inference throughput acceleration at the edge
Intro I am very excited to present this work, published in the IEEE Access journal, which presents an alternative to standard AI workload acceleration mechanisms at the edge (hardware acceleration, model compression, cloud off-loading). This work, in collaboration with Cisco,…
Read more