Instruction-Efficient and Parallelized AES using CUDA and PTX for Data Encryption

Raditya Hakim Daniswara, Yani Parti Astuti

Abstract


This research presents an instruction-efficient and parallelized implementation of the AES-256 encryption algorithm using NVIDIA CUDA with inline PTX to optimize instruction usage and execution performance on GPUs. Conventional AES implementation on CUDA often suffers from redundant instructions and high computational overhead, limiting encryption throughput. To address this, three implementation variants were developed: a baseline version, a parallelized version, and an inline PTX-optimized version. The proposed PTX-enhanced AES reduces instruction redundancy through vectorized memory access, byte permutation, and logical operation consolidation using PTX instructions. Instruction analysis using NVIDIA Nsight Compute and cuobjdump revealed a 66% reduction in executed instructions compared to the baseline, primarily due to optimized AES operations. Performance evaluation of 32 MB plaintext demonstrates a 16-fold improvement in throughput, increasing from 2.73 Gbps in the baseline to 43.9 Gbps in the PTX-optimized version, with cycles per byte decreasing from 2.63 to 0.164. These results confirm that integrating fine-grained PTX control within CUDA enables substantial gains in encryption efficiency while maintaining correctness and security. The proposed approach provides a scalable foundation for high-performance cryptographic operations in GPU-based computing environments.

Keywords


AES; CUDA; GPU acceleration; instruction reduction; PTX

Full Text:

PDF

References


J. Kaur, S. Lamba, and P. Saini, “Advanced Encryption Standard: Attacks and Current Research Trends,” in 2021 International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE), Mar. 2021, pp. 112–116. DOI: https://doi.org/10.1109/ICACITE51222.2021.9404716.

“Document Encryption Software Market Size, Share, Growth, and Industry Analysis, by Type (IDEA Algorithm,RSA Algorithm,AES Algorithm), by Application (Confidential Document, Meeting minutes, Technical Information, Financial Statements, Others), Regional Insights and Forecast to 2033,” 106220, Aug. 2025. [Online]. Available: https://www.marketgrowthreports.com/market-reports/document-encryption-software-market-106220

N. I. of S. and T. (NIST), M. J. Dworkin, M. S. Turan, and N. Mouha, “Advanced Encryption Standard (AES),” May 2023, DOI: https://doi.org/10.6028/NIST.FIPS.197-upd1.

D. Sarangi, “A Comparative Study of AES Encryption Modes and Hashing for Blockchain Applications,” 2024.

J. Song and S. C. Seo, “Efficient Parallel Implementation of CTR Mode of ARX-based Block Ciphers on ARMv8 Microcontrollers,” Applied Sciences, Vol. 11, No. 6, 2021, DOI: 10.3390/app11062548.

V. V. N. Akwukwuma Chete F. O. ,. Oshioluamhe, M. N. and Okpako A. E. ,., “Text Encryption using Advanced Encryption Standard (AES) Algorithm,” NJSTR, Vol. 6, No. 2, Jun. 2024, DOI: https://doi.org/10.5281/zenodo.12558923.

V. Sanz, A. Pousa, M. Naiouf, and A. De Giusti, “Performance Analysis of AES on CPU-GPU Heterogeneous Systems,” in Cloud Computing, Big Data & Emerging Topics, E. Rucci, M. Naiouf, F. Chichizola, L. De Giusti, and A. De Giusti, Eds., Cham: Springer International Publishing, 2022, pp. 31–42. DOI: https://doi.org/10.1007/978-3-031-14599-5_3.

L. A. Pérez-Sarmiento and C. Mancillas-López, “Parallel Implementation of OCB using VAES and GPUs,” The Journal of Supercomputing, Vol. 81, No. 5, p. 711, Apr. 2025, DOI: https://doi.org/10.1007/s11227-025-07179-w.

G. Nişancı, P. G. Flikkema, and T. Yalçın, “Symmetric Cryptography on RISC-V: Performance Evaluation of Standardized Algorithms,” Cryptography, Vol. 6, No. 3, p. 41, 2022, DOI: 10.3390/cryptography6030041.

T. A. Kwame Assa-Agyei Funminiyi Olajide, “Optimizing the Performance of the Advanced Encryption Standard Techniques for Secured Data Transmission,” International Journal of Computer Applications, Vol. 185, No. 21, pp. 31–36, Jul. 2023, DOI: 10.5120/ijca2023922941.

D. Guide, “Cuda C++ Programming Guide,” NVIDIA, July, 2025.

W.-K. Lee, H. J. Seo, S. C. Seo, and S. O. Hwang, “Efficient Implementation of AES-CTR and AES-ECB on GPUs with Applications for High-Speed FrodoKEM and Exhaustive Key Search,” IEEE Transactions on Circuits and Systems II: Express Briefs, Vol. 69, No. 6, pp. 2962–2966, 2022, DOI: 10.1109/TCSII.2022.3164089.

S. An and S. C. Seo, “Designing a New XTS-AES Parallel Optimization Implementation Technique for Fast File Encryption,” IEEE Access, Vol. 10, pp. 25349–25357, 2022, DOI: https://doi.org/10.1109/ACCESS.2022.3155810.

P. Hijma, S. Heldens, A. Sclocco, B. van Werkhoven, and H. E. Bal, “Optimization Techniques for GPU Programming,” ACM Comput. Surv., Vol. 55, No. 11, Mar. 2023, DOI: https://doi.org/10.1145/3570638.

O. Ozerk, C. Elgezen, A. C. Mert, E. Ozturk, and E. Savas, “Efficient Number Theoretic Transform Implementation on GPU for Homomorphic Encryption.” 2021. [Online]. Available: https://eprint.iacr.org/2021/124

H. Moussa and A. Fanfakh, “Survey of Symmetric Encryption Techniques Implemented Over GPU Platforms,” Journal of Intelligent Informatics, Networking, and Cybersecurity, Vol. 1, No. 1, p. 2, 2025.

H. T. Assafli, I. A. Hashim, and A. A. Naser, “Advanced Encryption Standard (AES) Acceleration and Analysis using Graphical Processing Unit (GPU),” Applied Nanoscience, Vol. 13, No. 2, pp. 1245–1250, Feb. 2023, DOI: https://doi.org/10.1007/s13204-021-01985-3.

V. T. Nguyen, P. H. Pham, V. T. D. Le, H. L. Pham, T. H. Vu, and T. D. Tran, “AES-RV: Hardware-Efficient RISC-V Accelerator with Low-Latency AES Instruction Extension for IoT Security,” IEICE Electronics Express, Vol. 22, No. 16, pp. 20250329–20250329, 2025, DOI: 10.1587/elex.22.20250329.

B. Marshall, G. R. Newell, D. Page, M.-J. O. Saarinen, and C. Wolf, “The Design of Scalar AES Instruction Set Extensions for RISC-V,” TCHES, Vol. 2021, No. 1, pp. 109–136, 03 2020, DOI: 10.46586/tches.v2021.i1.109-136.

M. K. Yang and J.-S. Jeong, “Optimized Hybrid Central Processing Unit–Graphics Processing Unit Workflow for Accelerating Advanced Encryption Standard Encryption: Performance Evaluation and Computational Modeling,” Applied Sciences, Vol. 15, No. 7, 2025, DOI: https://doi.org/10.3390/app15073863.




DOI: https://doi.org/10.32520/stmsi.v15i4.5931

Article Metrics

Abstract view : 8 times
PDF - 3 times

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.