Optimizing BFloat16 Deployment of Tiny Transformers on Ultra-Low Power Extreme Edge SoCs
Transformers have emerged as the central backbone architecture for modern generative AI. However, most ML applications targeting low-power, low-cost SoCs (TinyML apps) do not employ Transformers as these models are thought to be challenging to quantize and deploy on small devices. This work proposes...
Saved in:
| Main Authors: | Alberto Dequino, Luca Bompani, Luca Benini, Francesco Conti |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
MDPI AG
2025-02-01
|
| Series: | Journal of Low Power Electronics and Applications |
| Subjects: | |
| Online Access: | https://www.mdpi.com/2079-9268/15/1/8 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Tiny Machine Learning and On-Device Inference: A Survey of Applications, Challenges, and Future Directions
by: Soroush Heydari, et al.
Published: (2025-05-01) -
AESware: Developing AES-enabled low-power multicore processors leveraging open RISC-V cores with a shared lightweight AES accelerator
by: Eunjin Choi, et al.
Published: (2024-12-01) -
Data-intensive service deployment based on edge computing
by: Yongmei GAO, et al.
Published: (2019-07-01) -
Data-intensive service deployment based on edge computing
by: Yongmei GAO, et al.
Published: (2019-07-01) -
Slim-sugarcane: a lightweight and high-precision method for sugarcane node detection and edge deployment in natural environments
by: Lijiao Wei, et al.
Published: (2025-07-01)