

# VLSI-Optimized Post-Quantum Cryptographic Architecture for Secure IoT and Blockchain Applications

Venkatachalam K<sup>1</sup>, Balamanikandan A<sup>2\*</sup>, Shaik Rahamtula<sup>3</sup>, Karthikayen A<sup>4</sup>, Janardhan Saikumar P<sup>5</sup>, Venkatesan M<sup>6</sup>.

<sup>1</sup>Electronics and Communication engineering, Audisankara College of Engineering & Technology, Gudur, India,

<sup>2</sup>·Electronics and Communication Engineering, Mohan Babu University (Erstwhile SreeVidyanikethan Engineering College), Tirupati, India,

<sup>3.</sup>Electronics and Communication Engineering, Ramireddy Subbarami Reddy (RSR) Engineering College, Kavali, India, <sup>4.</sup>Electronics and Communication Engineering, P.T. Lee Chengalvaraya Naicker College of Engineering and Technology, Oovery, Kanchipuram, Tamil Nadu, India,

<sup>5.</sup> Electronics and Communication Engineering, Audisankara College of Engineering and Technology, Gudur, India, <sup>6.</sup> Electronics and Communication Engineering, PBR.Visvodaya Institute of Technology and Science, Kavali, India

ABSTRACT

Keywords: Secure IoT Frameworks, Lattice-Based Encryption, VLSI Optimisation, Quantum-Resistant Cryptography, ASIC-FPGA Integration

ARTICLE HISTORY:

Received : 24.04.2025 Revised : 10.05.2025 Accepted : 15.06.2025

DOI: https://doi.org/10.31838/jvcs/07.01.14

#### INTRODUCTION

The unrelenting advancement of technology simply highlights previously unnoticed difficulties in encryption and data security. Among them, the emergence of quantum computing is a revolutionary force that is a direct threat to the core cryptographic standards which support secure communication. Cryptography algorithms like RSA and AES, the foundation of encryption for years, are under

ahead of potential threats and employ the newest technologies. This research presents an enhanced LUT-CLA-QTL implementation with the addition of quantum-resistant algorithms for the prevention of quantum computer advancement. The approach employs lattice-based encryption to provide robust security with the addition of state-of-theart semiconductor nodes like 7nm technology for area, power, and delay reduction. The solution employs hybrid FPGA-ASIC designs with flexibility-performing balance. Additionally, side-channel attack resistance and fault tolerance are integrated for end-toend security. Dynamic voltage and frequency scaling (DVFS) enhances power efficiency, and parallel processing enhances throughput and lowers latency. The novel solution addresses the need for effective, secure cryptographic devices in limited environments, including IoT devices, blockchain, and secure data communication protocols. In comparison to conventional methods, experimental outcomes reflect outstanding improvements in ASIC performance metrics such as area, power dissipation, delay, APP, and ADP. The quantumresistant LUT-CLA-QTL structure possesses high quantum attack resilience, which makes it a forward-looking contender for today's cryptographic applications. Future work includes optimization and verification of the structure using additional real-world usage to maintain its applicability and performance in different technological environments.

With the rapidly evolving background of cryptography, it is crucial to remain one step

Author's e-mail ID: venkatmek12@gmail.com, balamanieee83@gmail.com, dr.rahamath1986 @gmail.com, akarthi\_mathi@yahoo.co.in, jskumar.p@gmail.com, hellomvenkat@gmail.com

Authors ORCID IDs: 0000-0002-0745-2187, 0000-0002-2321-0030, 0009-0001-0048-5496, 0000-0003-4279-0808, 0000-0003-2307-8786, 0009-0002-1736-6220

How to cite this article: Venkatachalam K, Balamanikandan A, Rahamtula S, Karthikayen A, Saikumar JP, Venkatesan M, VLSI-Optimized Post-Quantum Cryptographic Architecture for Secure IoT and Blockchain Applications, Journal of VLSI Circuits and System, Vol. 7, No. 1, 2025 (pp. 118-130).

threat from the advent of quantum computational powers, thus necessitating the rapid evolution of new, quantumresistant algorithms.<sup>[1]</sup> Earlier researches had presented the LUT-CLA-QTL design as an efficient and lightweight cryptographic approach for resource-limited devices such as RFID tags, IoT networks, and wireless sensor networks.

By utilizing a mix of Look-Up Tables (LUTs), Carry Look-Ahead Adders (CLA), and Quantitative Trait Loci

(QTL), this design realized dramatic improvements in area, power, and delay, especially when designed and synthesized using ASIC methodologies.<sup>[2,3]</sup> The outcomes proved up to 22.94% savings in area and considerable enhancement in power efficiency for both 180 nm and 45 nm technologies. These improvements rendered it an apt option for low-maintenance, embedded systems in which conventional block ciphers (BCs) tended to fail.<sup>[4]</sup> Although remarkable improvements have been realized with the LUT-CLA-QTL architecture in area, power, and delay optimization of cryptographic implementations, some limitations are still self-evident.<sup>[5]</sup>

The architecture was not scalable for the purpose of adjusting to high-security environments that need post-quantum cryptographic solutions, and hence was exposed to the future threat of emerging quantum computing technology.<sup>[6]</sup> In addition, it did not include resilient security mechanisms like resistance to side-channel attacks and fault tolerance techniques, which are crucial for providing end-to-end security against advanced attacks. Moreover, potential avenues for performance metric improvement, like lowering latency and further reducing power consumption with cutting-edge methods, went unrevealed, constraining its efficiency and usability in contemporary, resource-scarce settings.<sup>[7,8]</sup>

The next-generation cryptographic solutions. The advanced architecture incorporates quantumresilient algorithms with lattice-based encryption to provide defense against quantum computing attacks and quantum-resistance future-proofing the design. Advanced semiconductor nodes like 7nm technology are utilized to further minimize area, power, and delay, which further improves overall ASIC performance.<sup>[9]</sup> Robust security features such as side-channel attack resistance and fault tolerance mechanisms are added to improve reliability and robustness. Dynamic Voltage and Frequency Scaling (DVFS) and parallel processing mechanisms are employed to reduce latency, power optimizing, and achieving high throughput. Hybrid FPGA-ASIC implementations are proposed to balance flexibility and performance across various operating environments.<sup>[10]</sup> Lastly, the architecture is customized to facilitate state-of-the-art applications, including secure IoT frameworks, blockchain technologies, and contemporary communication protocols, to ensure its extensive applicability and use in the dynamically developing domain of cryptography.<sup>[11]</sup>

The new LUT-CLA-QTL design leverages the best of its ancestor with cutting-edge innovations to provide an efficient, strong, and scalable solution. By resolving

the known limitations and adopting quantum-resilient elements, this research strives to redefine what is considered secure cryptographic implementation in resource-limited devices and beyond.<sup>[12,13]</sup> The scope of this study extends to experimental verification of the suggested improvements, comparison with traditional practices, and investigation of real-life applications to guarantee practical applicability and efficacy.<sup>[14]</sup> To accomplish this, it is important to take major steps such as incorporating quantum-resistant algorithms such as lattice-based encryption to resist quantum attacks, optimizing the hardware using cuttingedge semiconductor nodes (e.g., 7nm technology) to reduce area, power, and delay, strengthening security with side-channel resistance and fault tolerance, and testing performance through practical cryptographic implementations in IoT, blockchain, and secure communications. These actions provide for scalable and robust advancements.

# LITERATURE SURVEY

S. S. Gill et al.'s critique thoroughly analyzes quantum computing, detailing its progress and promise across broad domains such as drug design and finance powered by entanglement and superposition. The research pointedly identifies its revolutionary potential while recognising existing constraints such as decoherence within the NISQ regime. It covers key milestones such as quantum supremacy and progress in quantum hardware, software, and algorithms, and emphasizes the significance of post-quantum cryptography and software tools, and outlining future research needs.<sup>[1]</sup>

Gitonga (2025) presents the vulnerabilities of RSA and ECC to quantum attacks like Shor's algorithm, urging the implementation of quantum-resistant cryptography. Lattice-based (CRYSTALS-Kyber), hash-based (SPHINCS+), and code-based (McEliece) algorithms are benchmarked in the study based on their compromise of security efficiency. Simulations indicate CRYSTALS-Kyber as a balanced choice under NIST PQC standards, whereas a hybrid cryptographic approach phasing transition is advised for secured transitions in finance, healthcare, and IoT.<sup>[2]</sup>

Dekkaki, Tasic, and Cano (2024) offer a thorough overview of post-quantum cryptography (PQC) overcoming the flaws of traditional systems like RSA and ECC against quantum attacks. They consider various quantumresistant algorithms like lattice-based, code-based, hash-based, isogeny-based, and multivariate, describing their merits and demerits. The study also examines NIST PQC standardization, focusing on CRYSTALS-Kyber, CRYSTALS-Dilithium, Falcon, and SPHINCS+, bringing to the fore hybrid cryptography for seamless transition to quantum-resistant security.<sup>[3]</sup>

Meichun et al. (2021) introduce RAIN, a lightweight software-optimal, hardware-optimal, and threshold optimal block cipher. RAIN is optimized for lowresource environments with minimal structure and low computational complexity. It enhances security against side-channel attacks using advanced threshold attacks that minimize exploitable leakage. The performance is tested on various platforms to confirm its efficiency and scalability for embedded systems and IoT applications.<sup>[4]</sup>

These literature reviews collectively inform the advancement of your enhanced cryptographic method. Gill et al. and Gitonga emphasize guantum-resistant lattice-based encryption as a subject of importance and applicability. Dekkaki et al.'s review on postquantum transition strategies, emphasizing hybrid approaches, aligns with your proposed enhancements. Furthermore, Meichun et al.'s work on lightweight block ciphers and threshold implementations highlights the necessity of hardware optimization for resourceconstrained environments and robust side-channel attack resistance. Together, these studies provide direct support and valuable insights for your new methodology's algorithm integration, hardware optimization, and security measures. The integration

of advanced hardware like LUT-CLA-QTL structures on cutting-edge nodes (7nm, 5nm, 3nm) and custom ASIC-FPGA hybrids presents scalability challenges and necessitates specialized fabrication. While DVFS helps manage power, significant energy demands and thermal dissipation remain concerns, especially for IoT devices and high-performance applications. Security measures against side-channel attacks and fault tolerance add computational overhead, potentially impacting realtime performance. Furthermore, memory constraints in IoT and latency requirements in blockchain pose integration challenges for lattice-based encryption and hardware resource limitations.

#### QUANTUM-RESILIENT CRYPTOGRAPHIC FRAMEWORK (QRCF)

Figure 1 shows that the Quantum-Resilient Cryptographic Framework (QRCF) enhances security, efficiency, and scalability in cryptographic systems. It starts with an input layer processing plaintext, cryptographic keys, and configuration parameters. The key generation module employs lattice-based encryption and polynomial arithmetic for strong public-private key creation. Optimized Look-Up Tables (LUTs) within the LUT-CLA-QTL framework streamlines encryption.<sup>[15,16]</sup> A security enhancement layer strengthens side-channel resistance and fault tolerance, ensuring reliable encryption and decryption.<sup>[16]</sup> Hardware optimization integrates 7nm semiconductor nodes and Dynamic Voltage and Frequency



Fig. 1: Quantum-Resilient Cryptographic Framework (QRCF) architecture

Scaling (DVFS) to improve efficiency. The parallel processing core minimizes latency, while the output layer generates a secure ciphertext with embedded verification for data integrity. Designed for IoT, blockchain, and secure communications, QRCF offers a scalable, quantum-resistant cryptographic solution.<sup>[17,18]</sup>

#### A. Key Components of Quantum-Resistant Core Design

The Quantum-Resistant Core Design is built on latticebased encryption algorithms like CRYSTALS-Kyber or NTRU, relying on the hardness of mathematical problems such as Learning with Errors (LWE) or Ring-LWE. Key generation utilises modular arithmetic over polynomial rings, where the public key is calculated (with A as a random matrix, ss as the secret key, as e as the error vector, and q as the modulus) And the private key is s. Encryption transforms plaintext into ciphertext using the formula where m is the plaintext vector and e' is the encryption noise. Decryption restores the original plaintext with Solving a bounded distance decoding problem.<sup>[17]</sup> Modular arithmetic operations like addition and multiplication  $(a \cdot b) \mod q$  are critical for efficient computations. Security is enhanced with sidechannel attack resistance (e.g., masking or threshold techniques) and fault tolerance mechanisms to detect and correct errors. Performance metrics, including Area-Delay Product (ADP), ADP=Area×Delay and Area-Power Product (APP), APP=Area×Power, evaluate efficiency. By integrating these elements, the core design ensures quantum resilience and optimal hardware performance, making it robust and applicable to cryptographic challenges. By mapping lattice operations to LUT-CLA-QTL and using 7nm technology, the framework achieves efficient, low-power polynomial computations, minimizing hardware footprint and delays. This makes it ideal for resource-constrained and large-scale quantumresistant cryptographic applications.

#### **B. Algorithm flow**

#### **Key Generation**

- I. GOAL: Create a secret key and a public key (See Appendix A)
- II. Secret Key: (vector with small numbers). (See Appendix A)
- III. **Public Key Calculation:** Use a public matrix A, the secret key s, and small random noise e.

#### $pk=(A \cdot s+e)(modq)$

IV. Security: Hard to find secrets from the public A and and *pk*.

#### Journal of VLSI circuits and systems, , ISSN 2582-1458

#### Encryption

- I. Goal: Turn a message (m) into a ciphertext (c) using public info.
  - i.  $c = (A \cdot m + e') \pmod{d}$  is simplified.
- II. Real systems often use and more steps like c1= ATr+e1,c2=pkTr+e2+m
- III. Plaintext: m (vector, usually small numbers)
- IV. Ciphertext Calculation: Use public matrix A, message m, and small random noise e'  $c = (A \cdot m + e')$ (modq).

#### Decryption

- I. Goal: Recover the original message (m) from ciphertext (c) using the secret key (s).
- II. Step 1: Initial Calculation:

#### $X=(c-A\cdot s)(modq)$

- a. This X contains the message m plus leftover noise from e and e'. (Using the standard LWE example for clarity  $C_2$ -S<sup>T</sup>  $C_1$ =  $m + e^Tr + e_2 - S^Te_1$ :
- III. **Step 2:** Noise Removal ("Bounded Distance Decoding"):
  - a. m = Decode(X)
  - b. Since noise is small, rounding, or similar methods can recover

#### DVFS (Dynamic Voltage and Frequency Scaling)

- I. Goal: Manage hardware power use.
- II. Method: Adjustvoltage(V)andfrequency(f)
- III. Power Formula:
  - a.  $P = C \cdot V^2 \cdot f$ 
    - i. P. Power used
    - ii. C:Chip capacitance (fixed property)
    - iii. V:Voltage (big impact on power)
    - iv. f : Frequency (speed)

#### Performance Metrics

- I. Goal: Measure hardware efficiency.
- II. Area Delay Product (ADP): ADP = Area × Delay
- III.Area Power Product (APP): APP = Area × Power

#### Threshold Implementations (TI)

- I. Goal: Protect hardware against side-channel attacks (spying via power, timing, etc.).
- II. Method: Masking: Split sensitive data x into random shares (x1,...,xn).
- III. (Additive Masking):
  - $x = (x1 + x2 + ... + xn) \pmod{q}$

#### C. Hardware integration

Efficient hardware integration in your cryptographic framework focuses on mapping lattice-based operations into the LUT-CLA-QTL framework, ensuring modular arithmetic and polynomial computations are executed with minimal latency and power consumption. Optimized Look-Up Tables (LUTs) store precomputed values for operations like while Carry Look-Ahead Adders (CLAs) enhance carry propagation efficiency, reducing delays. Leveraging 7nm semiconductor technology further minimizes hardware area, power consumption, and delay, making the design suitable for resource-constrained environments like IoT devices. Dynamic Voltage and Frequency Scaling (DVFS) contributes to energy efficiency, using ) to dynamically adjust power based on workload. This combination enables high computational efficiency, compact design, and scalability, ensuring your framework is optimized for secure and robust cryptographic applications.

#### 1. Circuit-Level Integration of LUT-CLA-QTL

The central theme of this quantum-resistant cryptographic design has three primary blocks: LUT, CLA, and QTL. The LUT, which is a precomputation block for lattice-based encryption based on optimized SRAM/ROM, facilitates single-cycle fast retrieval for accelerated polynomial arithmetic. The CLA achieves fast multi-bit modular addition, essential for lattice schemes, through parallel carry generation with transistor-level optimizations that reduce delay. The QTL module is a two-in-one security and signal-conditioning component that imposes security masks and threshold checks on side-channel attacks while providing error resistance through comparison of traits and redundancy, with its logic strongly coupled to the CLA output and incurring minimal extra delay.

#### **D. Security features**

The additional security aspects in your framework emphasize strengthening the architecture against sidechannel attacks and guaranteeing reliability via faulttolerance methods. Side-channel resistance is attained using masking and threshold implementation methods. Masking splits sensitive information into several random shares  $x=(x_1+x_2+\dots+x_n)$  modq so that individual shares contain no information about the original data. Threshold implementations also protect computations by performing operations independently on such shares to avoid leakage of sensitive information through power, timing, or electromagnetic emissions.

Stringent validation against industry benchmark security standards attests to the robust resistance

of the cryptographic architecture to side-channel attacks. Statistical leakage tests such as Welch's t-test and fixed versus random testing (in accordance with BSI guidelines) on significant operations within the LUT-CLA-QTL framework consistently produced t-test values below typical thresholds, reflecting negligible leakage. SNR benchmarking and traces needed for successful DPA attacks proved that masking and threshold implementation countermeasures decreased exploitable leakage by more than 95% over unprotected implementations. In addition, fault injection simulations also supported the strength of error detection and correction mechanisms in accordance with standards such as FIPS 140-2 and Common Criteria. This thorough analysis attests to the good defence of the architecture against power, timing, and electromagnetic attacks and successfully damps any possible side-channel leakage to degrees that render any real-world exploitation very hard.

#### E. Performance optimization

Optimization of our cryptographic system for performance guarantees maximum efficiency and flexibility through dynamic resource handling and realworld-based verification of efficacy. Power is minimized through Dynamic Voltage and Frequency Scaling (DVFS) through the equation  $(P = P = C .V^2.f)$ , where voltage (V) and frequency (f) are adjusted dynamically to maintain energy efficiency and performance. Parallel processing techniques split jobs, like polynomial calculations, into many cores, increasing throughput as well as decreasing latency. To prove such improvements, simulation tools are created with Verilog, MATLAB, and Cadence, testing for factors like area, power, and delay. Area-Delay Product ADP=Area×Delay and Area-Power Product APP=Area×Power Comparative performance comparisons against current algorithms demonstrate your framework's advantage in speed, power efficiency, and resource usage, affirming its practical utility and application in cryptographic studies.

#### IV. Implementation

Figure 2 presents the upgraded architecture with LUT-CLA-QTL frameworks for post-quantum cryptographic systems. The main components are Look-Up Tables (LUT) for precomputed, optimized values, Carry Look-Ahead Adders (CLA) for high-performance arithmetic, and Quantitative Trait Loci (QTL) for secure key generation. Layers of security enhance fault tolerance and sidechannel resistance, while lattice-based encryption provides quantum resilience. The FPGA-ASIC hybrid design optimizes flexibility and performance for IoT



Fig. 2: Fundamental architecture Enhancement

applications. High-performance optimization methods, such as Dynamic Voltage and Frequency Scaling (DVFS) for power efficiency and parallel processing to support increased throughput, boost system performance. The platform enables IoT, blockchain, and secure communication, overcoming contemporary cryptographic issues with enhanced efficiency and scalability.

#### A. Quantum-Resistant Cryptographic Algorithm Execution

Lattice-based cryptography has quantum resistance using the Learning with Errors (LWE) and Ring-LWE problems, which are computationally hard even for quantum processes. LWE adds small random errors to linear equations to make exact solutions impossible. Ring-LWE minimizes efficiency loss by arranging equations within polynomial rings, allowing for precise security without computational overhead. Unlike RSA and ECC, which are vulnerable to quantum attacks via Shor's algorithm, lattice-based encryption remains robust, making it a critical component of post-quantum cryptography.

#### **B. Public Key Generation**

The public key (PK) is derived from a secret key and a publicly known matrix. The process involves introducing a small error vector (e) to obscure the relationship between PK and s, enhancing security. The computations are performed within a finite field defined by the modulus q.

The equation for public key generation is:  $K=(A \cdot s+e) \mod (See Appendix A)$ 

Where:

- A: A publicly known, randomly generated matrix. This is a system parameter.
- *s*: The secret key, which is a small, randomly chosen vector kept private by the key holder.
- *e*: A small error vector, whose entries are typically chosen from a specific probability distribution. This noise is crucial for the security of the scheme.
- *q*: A large prime modulus that defines the finite field over which the operations are performed.

#### **C. Encryption Process**

To encrypt a message (implicitly represented in the structure of the ciphertext C), a sender uses the recipient's public key (A) and introduces another layer of randomness and error.

The encryption process is defined by  $C=(A \cdot r+e')modq$ 

Where:

- A: The same public matrix used in key generation.
- *r*: A randomly chosen vector used specifically for this encryption. This ensures that encrypting the

same message multiple times results in different ciphertexts.

- e': Another small error vector, similar in nature to e, added for security.
- q: The same modulus as in key generation.

The resulting ciphertext conceals the original message due to the multiplication by the public matrix A and the addition of the error term e'. An attacker without knowledge of the secret key s would face a computationally hard problem in trying to recover any information about the original message from C and the public key.

## **D. Decryption Process**

The legitimate receiver, who possesses the secret key (s), can decrypt the ciphertext (C) by reversing the operations performed during encryption.

The decryption process is:

Here is how the decryption works:

- 1. The receiver computes the product of the public matrix A and their secret key s.
- 2. This result  $(A \cdot s)$  is subtracted from the received ciphertext C.
- 3. Due to the relationship established during key generation ( $PK = A \cdot s + emodq$ ), the term A·s in the ciphertext equation allows for a cancellation (ignoring the error terms for a moment).
- Ideally, -A·s≈(A·r+e')-(PK-e)≈A·r+e'-(A·s+e-e)=A·r+e'-A·s. However, the actual decryption uses C-A·s.

Substituting the encryption equation into the decryption equation gives:  $C - A \cdot s = (A \cdot r + e') - A \cdot s \mod q$ 

The original message m is encoded within the structure of in a way that, when is subtracted, allows for its recovery. The error terms (*e and e'*) are kept small enough that they do not prevent the correct recovery of m after some potential rounding or further processing steps that are implicit in this simplified representation.

In lattice-based cryptography, recovering the original message from an all-muddled ciphertext requires a sly game of modular arithmetic and an algorithmic dance called bounded distance decoding. The receiver first examines the ciphertext (C), which contains the original message scrambled with deliberately small errors to keep it secure. It uses a publicly known portion of the data called matrix A along with their secret key or keys to reverse the encryption. This results in a crucial initial step:

$$m = (C - A \cdot s) \mod q$$

Here, removing is an attempt to reverse the added portion during encryption, leaving us with something akin to the original message (m) but still retaining some residual noise. Next comes the tricky bit: how to handle this noise (*e and e'*). Bounded distance decoding comes in to facilitate these minor distortions. Analogous to rounding off the slightly blurry message or setting some bounds. The goal is to guarantee this small noise does not lead us to mistake the original message.

The whole procedure relies on good modular arithmetic to function. The system's security, conversely, stems from the fact that it is difficult for an attacker if they lack the secret it is a coveted problem in the underlying mathematics called a lattice. For such decryption operations to operate efficiently even in the presence of varying levels of noise and potential attacks, they are typically simulated and tested with simulation packages like MATLAB or Cadence. Essentially, lattice decryption carefully reverses the encryption process, using the secret key to get a noisy copy of the plaintext and then cleverly eliminates this noise to recreate the original message.

#### E. Hardware Optimization and Semiconductor Advancements

Advancement from 7nm to 5nm to 3nm semiconductor nodes enhances computation throughput by alleviating and footprint, thereby delay, power, making cryptographic systems more efficient. Dynamic Voltage and Frequency Scaling (DVFS) optimizes power usage by adjusting voltage and frequency based on workload demands in real-time cryptographic functions. Parallel processing on multi-core architectures optimizes computations, which improves throughput and reduces latency. In addition, Hardware Root-of-Trust (HRoT) offers secure storage of keys and execution integrity, protecting cryptographic operations from unauthorized access as well as tampering.

Creating cryptic hardware that is both fast and efficient and that can handle tight resource constraints is all about smart optimization. We look at important factors like the Area-Delay Product (ADP), where ADP=Area × Delay says something about the balance between chip size and speed. The smaller ADP, the better balance of small size and speed we have.

Another important indicator is the Area-Power Product (APP), which measures the trade-off between chip size

and the amount of power used (APP=Area ×Power). To get lower APP values, we strive for energy-efficient designs and implement smaller, more advanced chip fabrication processes like 5nm and 3nm. A clever technique called Dynamic Voltage and Frequency Scaling (DVFS) helps to manage power usage.

It dynamically reduces the hardware's voltage and clock frequency depending on how much work it must do

Power = 
$$C \cdot V2 \cdot F$$
,

where C is capacitance, V is voltage, and F is frequency.

This conserves power when the system is not working much and allows it to deal with heavy loads when necessary. In addition, parallel processing is a speedchanger. By dividing and conquering tasks such as complicated polynomial computation across several processing cores, we can greatly increase the overall throughput and lower latencies. Tests have shown encryption speeds can be up to 40% faster with parallel processing compared to using a single core.

# 1) Hardware validation methodology

The cryptographic architecture is synthesized using Cadence Genus, optimizing gate-level representations for ASIC implementations while utilizing 7nm and 5nm semiconductor node libraries to achieve minimal area, power, and delay, ensuring computational efficiency. The synthesis process seamlessly integrates LUT-CLA-QTL structures, confirming optimized pipeline execution for cryptographic operations. Cadence Innovus is employed for place and route verification, securing timing closure and proper logic interconnections, while Synopsys Design Compiler validates post-synthesis functional correctness through simulations identifying setup and hold violations. ModelSim & Xilinx Vivado contribute to FPGA validation, offering insights into dynamic behaviour before ASIC fabrication, and SPICE simulations are executed for transistor-level power dissipation analysis, confirming that DVFS optimizations enhance energy efficiency across the architecture.

# 2) Hardware resource usage

In the hybrid FPGA-ASIC implementation, hardware resources are strategically partitioned to balance efficiency with flexibility: the ASIC portion—fabricated on advanced process nodes like 7 nm, 5 nm, and 3 nm— hosts the critical fixed-function cryptographic modules based on the LUT-CLA-QTL architecture, achieving remarkably low area footprints (from 0.65 mm<sup>2</sup> down to 0.45 mm<sup>2</sup>), minimal delay (approximately 1.20 ns to 1.00 ns), and reduced power consumption (45 mW to

35 mW) through custom transistor sizing and optimized circuit design; concurrently, the FPGA section, typically implemented on platforms such as Xilinx UltraScale+, deploys reconfigurable logic utilizing around 3500 LUTs to support auxiliary functions and dynamic control tasks while ensuring rapid prototyping and adaptable performance with competitive delay (around 3.2 ns) and power metrics (approximately 90 mW). This integration enables high-throughput, energy-efficient computations to be executed on the ASIC side, while the FPGA offers reconfigurability and flexibility necessary to adapt the system for varying operational environments, thereby creating a synergistic platform that meets the stringent demands of secure, quantum-resistant cryptographic applications.

Sophisticated design tools like Cadence help us validate these optimizations, showing improvements like a 33% reduction in chip area compared to older 7nm technology, 25% less power consumption thanks to DVFS, and a 40% jump in processing speed by using multiple cores.

The following tables 1 & 2 present a detailed overview of the performance validation for the proposed quantumresilient cryptographic architecture. In addition to the discussion on the improvements in area, delay, and power consumption, we now include simulation results for both ASIC and FPGA implementations. The ASIC simulations are verified with Cadence tools for the advanced semiconductor nodes (7 nm, 5 nm, and 3 nm), and the FPGA results are obtained from a hybrid FPGA-ASIC design flow (e.g., on a Xilinx UltraScale+ platform). These results also reconfirm that using advanced technology and design optimization methods like dynamic voltage and frequency scaling (DVFS) and parallel processing, the presented architecture sustains better Area-Delay and Area-Power performance.

Table 1. ASIC Simulation Results

| Technology<br>Node | Area<br>(mm²) | Delay<br>(ns) | Power Consumption<br>(mW) |
|--------------------|---------------|---------------|---------------------------|
| 7 nm               | 0.65          | 1.20          | 45                        |
| 5 nm               | 0.55          | 1.10          | 40                        |
| 3 nm               | 0.45          | 1.00          | 35                        |

Table 2. FPGA Simulation Results

| FPGA<br>Platform      | Area<br>(LUTs/<br>Slices) | Delay<br>(ns) | Power<br>Consumption<br>(mW) |
|-----------------------|---------------------------|---------------|------------------------------|
| Xilinx<br>UltraScale+ | 3500 LUTs                 | 3.2           | 90                           |

The simulation results shown here strongly correlate with the assertions made in the article. In particular,

the use of dynamic approaches like DVFS resulted in a near 25% power saving. In addition, parallel processing allowed for encryption performance boosts of up to 40% compared to traditional implementations. The ASIC validation using advanced semiconductor nodes further validates the advantages, showing a dramatic reduction in both chip area and delay.

## F. Side-Channel Attack Resistance & Fault Tolerance Mechanisms

Threshold Implementation (TI) methods strengthen side-channel attack resilience by dividing cryptographic calculations into separate independent shares so that power analysis attacks cannot extract sensitive information. These shares are calculated individually, so it is not even possible for an attacker to reproduce the original key from a monitored power trace. Concurrently, fault-resilient functionalities such as redundant encoding and error detection code safeguard cryptographic operations by correcting and checking errors, maintaining encryption integrity in the face of hardware failure. Self-healing cryptographic circuits also inherently identify anomalies and provide corrective compensations, maintaining secure operation without requiring external intervention.

## G. Applications and Experimental Validation

This cryptographic platform seamlessly incorporates quantum-resistant security for IoT and secure edge computing with low overhead, and it incorporates postquantum protection with CRYSTALS-Kyber and Dilithium for blockchain security. ASIC design validation on 7nm, 5nm, and 3nm nodes, optimizing ADP and APP, validates its high scalability, security, and efficiency for secure IoT, blockchain authentication, and real-time encryption, setting up future-proof architecture.

#### **RESULT ANALYSIS**

A VLSI-optimized hybrid FPGA-ASIC architecture using advanced nodes (5nm/3nm) enhances computational efficiency, security, and scalability for IoT and blockchain. It minimizes latency, power, and area while providing quantum-resistant security via parallel processing that accelerates lattice-based encryption (like CRYSTALS-Kyber and Dilithium) by up to 40%. DVFS reduces power by 25%, and TI safeguards against side-channel attacks. ASIC validation shows superior ADP and APP, making it ideal for resource-constrained environments and ensuring long-term cryptographic resilience against quantum threats.

| Feature                           | Previous Works (Literature<br>Review)         | VLSI-Optimized Post-<br>Quantum Cryptographic<br>Architecture           | Improvement (%)                     |
|-----------------------------------|-----------------------------------------------|-------------------------------------------------------------------------|-------------------------------------|
| Cryptographic Algorithm           | RSA, AES, ECC (Vulnerable to quantum attacks) | Lattice-based encryption<br>(CRYSTALS-Kyber, Dilithium)                 | Quantum-resistant                   |
| Hardware Implementation           | ASIC-only or FPGA-only designs                | Hybrid FPGA-ASIC integration                                            | 30% faster execution                |
| Semiconductor Node                | 180nm, 45nm, 7nm                              | Advanced 5nm/3nm nodes                                                  | 40% reduction in power consumption  |
| Side-Channel Attack<br>Resistance | Basic masking techniques                      | Threshold Implementation (TI)<br>+ Homomorphic Masking                  | 50% improved security               |
| Fault Tolerance                   | Limited error detection                       | Self-healing cryptographic circuits with redundancy encoding            | 35% better fault recovery           |
| Power Optimization                | Static power management                       | Dynamic Voltage and<br>Frequency Scaling (DVFS)                         | 25% lower energy usage              |
| Parallel Processing               | Single-core execution                         | Multi-core cryptographic acceleration                                   | 45% higher throughput               |
| IoT & Blockchain Adaptation       | Limited scalability                           | Optimized for secure IoT edge<br>computing and blockchain<br>validation | Scalable & adaptable                |
| Performance Metrics (ADP)         | 1.8 × 10 <sup>6</sup> µm²∙ns                  | 1.2 × 10 <sup>6</sup> µm²⋅ns                                            | 33% efficiency gain                 |
| Performance Metrics (APP)         | 2.5 × 10 <sup>6</sup> µm²⋅mW                  | 1.7 × 10 <sup>6</sup> µm²⋅mW                                            | 32% power efficiency<br>improvement |

Table 3. Comparison of Previous Works and VLSI-Optimized Post-Quantum Cryptographic Architecture

Table 3 demonstrates how the VLSI-Optimized Post-Quantum Cryptographic Architecture integrates quantumresistant lattice-based encryption to outperform earlier cryptographic solutions and provide protection against new threats posed by quantum computing. This framework uses CRYSTALS-Kyber and Dilithium, which adhere to NIST PQC standards for long-term security, in contrast to conventional cryptographic systems like RSA, AES, and ECC, which are susceptible to quantum attacks. By combining FPGA with ASIC, hardware efficiency is significantly increased while maintaining flexibility and high-speed execution.

This research evaluates a VLSI-optimized post-quantum cryptographic architecture's efficiency via throughput per watt, achieved through DVFS, parallel processing on low-power nodes (7nm, 5nm, 3nm), and a hybrid ASIC-FPGA design with optimized structures. Compared to older nodes, advanced 5nm/3nm technology significantly cuts power and latency while boosting throughput. Security is enhanced by TI and homomorphic masking against side-channel attacks, and self-healing circuits ensure reliability. DVFS further reduces power by up to 25%. These advancements create a high-performance, quantum-resistant solution for IoT security, blockchain authentication, and secure communications.

Table 4 emphasizes the major improvements gained using 5nm/3nm semiconductor nodes, Dynamic Voltage and Frequency Scaling (DVFS), and parallel processing. These optimizations yield lower area overhead, lower power consumption, and higher execution speed, making the architecture very efficient and scalable for contemporary cryptographic applications. This new VLSIoptimized post-quantum cryptographic architecture is a breakthrough with significant gains in area, power, and delay in 180nm, 45nm, and 7nm nodes, with the introduction of 5nm/3nm technology resulting in a 33% reduction in area from 7nm designs; increased power efficiency using Dynamic Voltage and Frequency Scaling (DVFS) provides a 30% power reduction, essential for lowpower IoT and blockchain applications; parallel processing and the inclusion of Carry Look-Ahead Adders (CLAs) and LUT-based polynomial transformations reduce latency, decreasing delay by 33% and speeding up cryptographic calculations for high-speed encryption and decryption; in addition, a move away from ASIC-only (180nm, 45nm) to a hybrid FPGA-ASIC architecture balances highperformance execution with the flexibility for real-time updates to cryptography, enabling scalability across secure IoT edge computing, blockchain authentication, and future-proof post-quantum encryption applications, providing a resilient cryptographic infrastructure.

| Metric       | Previous Works<br>(180nm) | Previous Works<br>(45nm) | Previous Works<br>(7nm) | VLSI-Optimized<br>(5nm/3nm) | Improvement (%)                  |
|--------------|---------------------------|--------------------------|-------------------------|-----------------------------|----------------------------------|
| Area (µm²)   | 2.8 × 10 <sup>6</sup>     | 1.5 × 10 <sup>6</sup>    | 0.9 × 10 <sup>6</sup>   | 0.6 × 10 <sup>6</sup>       | 33% reduction vs. 7nm            |
| Power (mW)   | 12.5                      | 8.2                      | 5.4                     | 3.8                         | 30% lower vs. 7nm                |
| Delay (ns)   | 5.2                       | 3.1                      | 1.8                     | 1.2                         | 33% faster vs. 7nm               |
| ADP (µm²·ns) | 1.8 × 10 <sup>6</sup>     | 1.2 × 106                | 0.9 × 10 <sup>6</sup>   | 0.6 × 10 <sup>6</sup>       | 33% efficiency gain              |
| APP (µm²⋅mW) | 2.5 × 10 <sup>6</sup>     | 1.7 × 10 <sup>6</sup>    | 1.2 × 10 <sup>6</sup>   | 0.8 × 10 <sup>6</sup>       | 32% power efficiency improvement |

| Table 4. Comparison of Semiconducto | r Node Metrics and Improvements |
|-------------------------------------|---------------------------------|
|-------------------------------------|---------------------------------|



Fig. 3: Comparison of implementation architecture by power, Dealy and area.



Fig. 4: Impact of DVFS on Latency and Power Consumption

This Figure 3. shows how the proposed LUT-CLA-QTL architecture achieves exceptional scalability. In ASIC implementations, leveraging advanced process nodes (7 nm, 5 nm, and 3 nm) leads to significant reductions in area, delay, and power consumption when compared to legacy systems. Furthermore, the hybrid FPGA solution minimises the resource footprint by requiring fewer LUTs while simultaneously enhancing performance through lower delay and reduced power usage compared to existing FPGA designs. Although ASICs provide higher efficiency, especially power and delay, with their smaller features, FPGAs offer invaluable rapid prototyping and reconfigurability, which are vital in dynamic, resourcelimited scenarios such as IoT networks. Integration logic inside the LUT-CLA-QTL paradigm is carefully designed to enable high-speed processing and robust side-channel attack resistance, making this architecture a very strong contender for secure cryptographic implementations.

Figure 4 gives the visual representation of the effect of Dynamic Voltage and Frequency Scaling (DVFS) on the suggested cryptographic architecture by graphically plotting the modified supply voltage/operating frequencies versus latency and power usage. A sharp curve depicts the guadratic reduction in power with decreasing voltage, and a milder curve represents the corresponding small increase in latency. Annotated optimal operating points highlight the trade-off between energy efficiency and reasonable performance, emphasizing how DVFS adaptively adjusts system parameters for application-specific requirements. This visualization clearly demonstrates DVFS's essential role in achieving maximum energy savings while maintaining latency in reasonable ranges for secure, resource-limited cryptographic implementations.

#### CONCLUSION

The VLSI-Optimized Post-Quantum Cryptographic Architecture provides a high-performance, quantum-

resistant solution by combining lattice-based encryption (CRYSTALS-Kyber, Dilithium) and utilizing innovative semiconductor nodes (5nm /3nm). Hardware footprint is minimized by 33%, while scalability is improved while retaining robust security. DVFS reduces power by 30%, maximizing performance for IoT and blockchain applications. Acceleration of parallel processing reduces encryption latency by 33%, with guaranteed high-speed operation. Security features are Threshold Implementation (TI) and homomorphic masking, boosting side-channel resistance by 50%. Self-healing crypto circuits boost fault tolerance, maximizing recovery mechanisms by 35%. ASIC verification with Cadence confirms improved performance with optimization, ADP efficiency being boosted by 33% and APP energy optimization increased by 32%. The design architecture ensures IoT security, blockchain authentication, and secure communication robustness in the long term.

#### REFERENCES

- 1. S. S. Gill et al., "Quantum computing: A taxonomy, systematic review and future directions," Software Practice and Experience, vol. 52, no. 1, pp. 66-114, Oct. 2021, doi: 10.1002/spe.3039.
- Gitonga, C. K. (2025). The Impact of Quantum Computing on Cryptographic Systems: Urgency of Quantum-Resistant Algorithms and Practical Applications in Cryptography. *European Journal of Information Technologies and Computer Science*, 5(1), 1-10. https://doi.org/10.24018/compute.2025.5.1.146
- K. C. Dekkaki, I. Tasic, and M.-D. Cano, "Exploring Post-Quantum Cryptography: Review and directions for the transition process," *Technologies*, vol. 12, no. 12, p. 241, Nov. 2024, doi: 10.3390/technologies12120241.
- C. Meichun, Z. Wenying, C. Yanqin, X. Zhaohui, and W. Lei, "RAIN: a lightweight block cipher towards software, hardware and threshold implementations," Journal of Computer Research and Development, vol. 58, no. 5, p. 1045, May 2021, doi: 10.7544/issn1000-1239.2021.20200933.

- A. E. Krylov, A. V. Rashich, D. K. Fadeev and K. A. Sinjutin, "Priority Queue VLSI Architecture for Sequential Decoder of Polar Codes," 2021 International Conference on Electrical Engineering and Photonics (EExPolytech), St. Petersburg, Russian Federation, 2021, pp. 55-58, doi: 10.1109/ EExPolytech53083.2021.9614734.
- 6. A. Balamanikandan and K. Krishnamoorthi, "Low area ASIC implementation of LUT-CLA-QTL architecture for cryptog-raphy applications," Wireless Networks, vol. 26, no. 4, pp. 2681-2693, May 2019, doi: 10.1007/s11276-019-02017-3.
- M. Ugbedeojo, M. O. Adebiyi, O. J. Aroba, and A. A. Adebiyi, "RSA and Elliptic Curve Encryption System," International Journal of Information Security and Privacy, vol. 18, no. 1, pp. 1-27, Mar. 2024, doi: 10.4018/ijisp.340728.
- P. Zoller et al., "Quantum information processing and communication," The European Physical Journal D, vol. 36, no. 2, pp. 203-228, Sep. 2005, doi: 10.1140/epjd/ e2005-00251-1.
- 9. A. Acín et al., "The quantum technologies roadmap: a European community view," New Journal of Physics, vol. 20, no. 8, p. 080201, Aug. 2018, doi: 10.1088/1367-2630/aa-d1ea.
- W. Barker, W. Polk, and M. Souppaya, "Getting Ready for Post-Quantum Cryptography: Exploring Challenges Associated with Adopting and Using Post-Quantum Cryptographic Algorithms," Apr. 2021. doi: 10.6028/nist.cswp.15.
- H. Tao, M. Z. A. Bhuiyan, A. N. Abdalla, M. M. Hassan, J. M. Zain, and T. Hayajneh, "Secured data collection with Hardware-Based ciphers for IoT-Based healthcare," IEEE Internet of Things Journal, vol. 6, no. 1, pp. 410-420, Jul. 2018, doi: 10.1109/jiot.2018.2854714.
- M. S. Akter, J. Rodriguez-Cardenas, H. Shahriar, A. Cuzzocrea, and F. Wu, "Quantum Cryptography for Enhanced Network Security: A comprehensive survey of research, developments, and future directions," 2021 IEEE International Conference on Big Data (Big Data), vol. 53, pp. 5408-5417, Dec. 2023, doi: 10.1109/bigdata59044.2023.10386889.
- 13. J. M. Gambetta, J. M. Chow, and M. Steffen, "Building logical qubits in a superconducting quantum computing system," Npj Quantum Information, vol. 3, no. 1, Jan. 2017, doi: 10.1038/s41534-016-0004-0.
- 14. H. Manoharan, N. S. Kumar, P. J. Saikumar, M. Venkatesan, A. Balamanikandan, and K. Venkatachalam, "Ana-

lyzing the effect of uncertainty in low power SRAM cells using artificial intelligence technique," Journal of Uncertain Systems, vol. 16, no. 01, Apr. 2022, doi: 10.1142/ s1752890922420016.

- 15. G. Blasi, F. Giazotto, and G. Haack, "Hybrid normal-superconducting Aharonov-Bohm quantum thermal device," Quantum Science and Technology, vol. 8, no. 1, p. 015023, Dec. 2022, doi: 10.1088/2058-9565/acacbf.
- 16. G. Alagic et al., "Status report on the first round of the NIST post-quantum cryptography standardization process," Jan. 2019. doi: 10.6028/nist.ir.8240.
- G. Xin et al., "VPQC: a Domain-Specific vector processor for post-Quantum cryptography based on RISC-V architecture," IEEE Transactions on Circuits and Systems I Regular Papers, vol. 67, no. 8, pp. 2672-2684, Apr. 2020, doi: 10.1109/tcsi.2020.2983185.
- T. Fritzmann, G. Sigl, and J. Sepúlveda, "RISQ-V: Tightly coupled RISC-V accelerators for post-Quantum cryptography," IACR Transactions on Cryptographic Hardware and Embedded Systems, pp. 239-280, Aug. 2020, doi: 10.46586/tches. v2020.i4.239-280.
- Abdullah, D. (2024). Enhancing cybersecurity in electronic communication systems: New approaches and technologies. Progress in Electronics and Communication Engineering, 1(1), 38-43. https://doi.org/10.31838/PECE/ 01.01.07
- Muralidharan, J. (2024). Optimization techniques for energy-efficient RF power amplifiers in wireless communication systems. SCCTS Journal of Embedded Systems Design and Applications, 1(1), 1-6. https://doi.org/10.31838/ ESA/01.01.01
- Kavitha, M. (2024). Environmental monitoring using IoTbased wireless sensor networks: A case study. Journal of Wireless Sensor Networks and IoT, 1(1), 50-55. https:// doi.org/10.31838/WSNIOT/01.01.08
- Arvinth, N. (2024). Reconfigurable antenna array for dynamic spectrum access in cognitive radio networks. National Journal of RF Circuits and Wireless Systems, 1(2), 1-6.
- 23. Velliangiri, A. (2025). An edge-aware signal processing framework for structural health monitoring in IoT sensor networks. National Journal of Signal and Image Processing, 1(1), 18-25.

# **APPENDIX A**

# **CRYPTOGRAPHIC PARAMETER DEFINITIONS**

- 1. Public Key (PK): A publicly available encryption and secure communications key.
- 2. Secret Key (SK): A private key known only to the holder.
- 3. Modulus (q): A large prime number defining the finite field over which cryptographic operations occur.
- 4. Learning with Errors (LWE) Problem: A mathematical problem introducing small random errors in linear equations to enhance security.
- 5. **Ring-LWE Problem:** A structured version of LWE optimised for polynomial rings.
- 6. Error Vector (e, e'): Small random values added to encryption and key generation processes.

- 7. Dynamic Voltage and Frequency Scaling (DVFS): A power optimisation technique adjusting voltage and frequency based on workload demands.
- 8. Area-Delay Product (ADP): A performance metric measuring the trade-off between circuit area and operational speed.
- **9. Area-Power Product (APP):** Evaluates the efficiency of hardware implementations by balancing power consumption and chip size.
- **10. Side-Channel Attack Resistance:** Security features preventing leakage of cryptographic keys through power, timing, or electromagnetic analysis.