Would the transformer architecture — the backbone of large language models (LLMs) like ChatGPT — still work if we replaced its neural networks with quantum unitary operations?
The answer is yes!
This repository contains two Quantum Transformer (QT) models, each in its own Jupyter notebook, that show how this works. For a detailed explanation, see the code-block descriptions in the notebooks or my blog post on this work. While early results on the Tiny Shakespeare dataset are modest, the approach is promising. The two models differ in how they replace the transformer's linear layers:
- **Interferometric QT**: replaces the classical linear layers with interferometric networks of phase shifters and beam splitters (realizing Fourier transforms); see the first sketch below.
- **Qubit-rotation QT**: replaces the classical linear layers with qubit-rotation networks built from single-qubit Ry rotations only; see the second sketch below.
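
For intuition, here is a minimal NumPy sketch of an interferometric layer: phase shifters act as per-mode phase multiplications and beam splitters as 2x2 rotations on adjacent mode pairs. This is an illustration only; the function and parameter names (`interferometric_layer`, `thetas`, `phis`) are assumptions, not the notebooks' actual API.

```python
import numpy as np

def beam_splitter(theta):
    """2x2 beam-splitter unitary mixing a pair of modes."""
    return np.array([[np.cos(theta), -np.sin(theta)],
                     [np.sin(theta),  np.cos(theta)]], dtype=complex)

def interferometric_layer(x, thetas, phis):
    """Apply phase shifters, then beam splitters on adjacent mode pairs.

    x      : complex feature vector of even length n (one entry per mode)
    thetas : n//2 beam-splitter angles
    phis   : n phase-shift angles
    """
    x = x * np.exp(1j * phis)               # per-mode phase shifters
    for i, theta in enumerate(thetas):      # couple modes (2i, 2i+1)
        x[2*i:2*i+2] = beam_splitter(theta) @ x[2*i:2*i+2]
    return x

n = 4
rng = np.random.default_rng(0)
x = rng.normal(size=n) + 1j * rng.normal(size=n)
y = interferometric_layer(x, thetas=rng.uniform(0, np.pi, n//2),
                          phis=rng.uniform(0, 2*np.pi, n))
# The layer is unitary, so the norm of the feature vector is preserved.
assert np.isclose(np.linalg.norm(x), np.linalg.norm(y))
```

Because each phase shifter and beam splitter is unitary, the whole layer is unitary, which is the key property the QT models rely on.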

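And here is an equally minimal sketch of a qubit-rotation layer: one trainable Ry rotation per qubit, assembled into the full unitary via Kronecker products. Again, the names (`ry_layer`, `thetas`) are illustrative assumptions, not the notebooks' code.

```python
import numpy as np
from functools import reduce

def ry(theta):
    """Single-qubit Ry rotation (a real-valued unitary)."""
    c, s = np.cos(theta / 2), np.sin(theta / 2)
    return np.array([[c, -s],
                     [s,  c]])

def ry_layer(state, thetas):
    """Apply Ry(theta_i) to qubit i of an n-qubit state vector.

    state  : real vector of length 2**n
    thetas : n rotation angles (the layer's trainable parameters)
    """
    U = reduce(np.kron, [ry(t) for t in thetas])  # full 2^n x 2^n unitary
    return U @ state

n = 3
rng = np.random.default_rng(1)
state = np.zeros(2**n); state[0] = 1.0        # |000> input state
out = ry_layer(state, thetas=rng.uniform(0, np.pi, n))
assert np.isclose(np.linalg.norm(out), 1.0)   # unitarity preserves the norm
```

Since Ry matrices are real-valued, this layer keeps the state's amplitudes real, which makes it particularly cheap to simulate classically.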