TranslodeP2C

Overview

TranslodeP2C is an AI-powered pseudocode-to-C++ conversion system.
Leveraging a Transformer-based seq2seq model,
it translates pseudocode descriptions into structured C++ programs.
The project includes preprocessing, vocabulary building, training,
and inference, with an interactive Streamlit UI.

Features

Transformer-based sequence-to-sequence model for code generation.
Converts pseudocode to C++ using deep learning.
Preprocessing and vocabulary management for structured learning.
Training pipeline with customizable hyperparameters.
Inference system with greedy decoding.
Streamlit-based web UI for user-friendly interactions.

Installation

Prerequisites

Ensure you have the following installed:

Python 3.8+
PyTorch
Streamlit
tqdm

Setup

Clone the repository: git clone https://github.com/absarraashid3/translodep2c.git cd translodep2c
Install dependencies: pip install -r requirements.txt
Prepare your dataset and place it in data/train/split/.

Usage

Preprocessing

Convert TSV trInaining data into paired pseudocode-code format:

 python src/preprocess.py --input_tsv "C:\Projects\GenAi\data\train\split\spoc-train-train.tsv" --output_txt "C:\Projects\GenAi\data\train_pairs.txt"

Building Vocabulary

Generate vocabulary pickle files from training pairs:

 python src/vocab.py --pairs_file "C:\Projects\GenAi\data\train_pairs.txt" --src_vocab_file "src/src_vocab.pkl" --tgt_vocab_file "src/tgt_vocab.pkl"

Training the Model

Train the Transformer model for pseudocode-to-C++ conversion:

 python src/train.py --pairs_file "C:\Projects\GenAi\data\train_pairs.txt" --src_vocab_file "src/src_vocab.pkl" --tgt_vocab_file "src/tgt_vocab.pkl" --epochs 10 --batch_size 8

Inference

Generate C++ code from input pseudocode:

 python src/infer.py --model_checkpoint transformer_seq2seq.pt --src_vocab_file "src/src_vocab.pkl" --tgt_vocab_file "src/tgt_vocab.pkl" --pseudocode "read n print factorial of n"

Web Application

Launch the Streamlit UI:

 streamlit run src/app.py

Enter pseudocode and get auto-generated C++ code!

Future Enhancements

Implement beam search decoding for better predictions.
Fine-tune with more programming languages.
Optimize the model for faster inference.

🚀 Transform pseudocode into real C++ with TranslodeP2C!

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
data		data
src		src
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

TranslodeP2C

Overview

Features

Installation

Prerequisites

Setup

Usage

Preprocessing

Building Vocabulary

Training the Model

Inference

Web Application

Future Enhancements

🚀 Transform pseudocode into real C++ with TranslodeP2C!

About

Uh oh!

Releases

Packages

Uh oh!

Languages

AbsarRaashid3/TranslodeP2C

Folders and files

Latest commit

History

Repository files navigation

TranslodeP2C

Overview

Features

Installation

Prerequisites

Setup

Usage

Preprocessing

Building Vocabulary

Training the Model

Inference

Web Application

Future Enhancements

🚀 Transform pseudocode into real C++ with TranslodeP2C!

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages