A PyTorch implementation of the Transformer model for machine translation, specifically trained for English to Japanese translation.
```
Translation-Transformer/
├── src/
│   ├── model.py              # Transformer model implementation
│   ├── train.py              # Training script
│   ├── translate.py          # Translation script
│   ├── config.py             # Configuration settings
│   ├── dataset.py            # Dataset handling
│   └── __init__.py
├── tokenizer/
│   ├── tokenizer_en.json     # English tokenizer
│   └── tokenizer_ja.json     # Japanese tokenizer
└── research/
    ├── attention_visual.ipynb  # Attention visualization
    └── trials.ipynb            # Research experiments
```
- Implements the original Transformer architecture from "Attention is All You Need"
- Supports English to Japanese translation
- Uses pre-trained tokenizers for both languages
- Includes attention visualization capabilities
- GPU acceleration support (CUDA and MPS)
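Because both CUDA and Apple MPS are supported, device selection typically falls back in order. The snippet below is a minimal sketch of that logic; the repository's actual selection code lives in the source and may differ.

```python
import torch

def get_device() -> torch.device:
    """Pick the fastest available backend: CUDA, then Apple MPS, then CPU."""
    if torch.cuda.is_available():
        return torch.device("cuda")
    if torch.backends.mps.is_available():
        return torch.device("mps")
    return torch.device("cpu")
```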
- Python 3.x
- PyTorch
- HuggingFace datasets
- Tokenizers
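An illustrative requirements.txt covering these dependencies (exact package pins are defined by the repository's own requirements.txt):

```
torch
datasets
tokenizers
```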
- Clone the repository:

```bash
git clone https://github.com/yourusername/Translation-Transformer.git
cd Translation-Transformer
```

- Install dependencies:

```bash
pip install -r requirements.txt
```

To train the model:

```bash
python src/train.py
```

The trained weights will be saved in the weights directory specified in the config.
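A checkpoint step inside train.py might look roughly like the sketch below; the config key and filename pattern are assumptions for illustration, not the repository's actual names.

```python
import torch
from pathlib import Path

def save_checkpoint(model, optimizer, epoch, config):
    """Illustrative checkpoint save; key name and filename pattern are hypothetical."""
    folder = Path(config["model_folder"])   # assumed config key for the weights directory
    folder.mkdir(parents=True, exist_ok=True)
    torch.save(
        {
            "epoch": epoch,
            "model_state_dict": model.state_dict(),
            "optimizer_state_dict": optimizer.state_dict(),
        },
        folder / f"epoch_{epoch:02d}.pt",   # hypothetical filename pattern
    )
```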
To translate a sentence:

```bash
python src/translate.py "Your English sentence here"
```

You can also pass a test-set index:
```bash
python src/translate.py 42   # translates the 42nd example from the test set
```
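Under the hood, translate.py has to decode autoregressively. Below is a minimal greedy-decoding sketch; the `encode`/`decode`/`project` methods and the `[SOS]`/`[EOS]` token names are assumptions about the model interface, not confirmed by the repository.

```python
import torch

def greedy_decode(model, src, src_mask, tokenizer_ja, max_len, device):
    """Feed the decoder its own previous outputs until [EOS] or max_len is reached."""
    sos_id = tokenizer_ja.token_to_id("[SOS]")   # assumed special-token names
    eos_id = tokenizer_ja.token_to_id("[EOS]")

    memory = model.encode(src, src_mask)              # run the encoder once
    ys = torch.tensor([[sos_id]], device=device)      # decoder input starts with [SOS]
    for _ in range(max_len - 1):
        tgt_len = ys.size(1)
        tgt_mask = torch.tril(torch.ones(tgt_len, tgt_len, device=device)).bool()
        out = model.decode(memory, src_mask, ys, tgt_mask)   # one decoder pass
        logits = model.project(out[:, -1])                   # logits for the last position
        next_id = logits.argmax(dim=-1, keepdim=True)
        ys = torch.cat([ys, next_id], dim=1)
        if next_id.item() == eos_id:
            break
    return tokenizer_ja.decode(ys.squeeze(0).tolist())
```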
The implementation includes:

- Multi-head attention mechanism
- Positional encoding
- Layer normalization
- Residual connections
- Feed-forward networks
- Encoder-Decoder architecture
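As an example of one of these components, here is a sketch of the sinusoidal positional encoding from the paper; the class and argument names are illustrative, and the repository's model.py may organize this differently.

```python
import math
import torch
import torch.nn as nn

class PositionalEncoding(nn.Module):
    """Sinusoidal positional encoding as described in 'Attention is All You Need'."""

    def __init__(self, d_model: int, seq_len: int, dropout: float) -> None:
        super().__init__()
        self.dropout = nn.Dropout(dropout)

        pe = torch.zeros(seq_len, d_model)
        position = torch.arange(0, seq_len, dtype=torch.float).unsqueeze(1)
        div_term = torch.exp(
            torch.arange(0, d_model, 2).float() * (-math.log(10000.0) / d_model)
        )
        pe[:, 0::2] = torch.sin(position * div_term)   # even dimensions
        pe[:, 1::2] = torch.cos(position * div_term)   # odd dimensions
        self.register_buffer("pe", pe.unsqueeze(0))    # shape: (1, seq_len, d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Add the non-trainable positional signal, then apply dropout.
        x = x + self.pe[:, : x.size(1)]
        return self.dropout(x)
```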
Key configuration parameters (in config.py):
- Batch size: 8
- Number of epochs: 3
- Learning rate: 0.0001
- Sequence length: 350
- Model dimension: 512
- Number of attention heads: 8
- Dropout: 0.1
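A sketch of how src/config.py might expose these values; the key names are illustrative, and only the values listed above are taken from the actual configuration.

```python
def get_config():
    # Key names are illustrative; values mirror the defaults listed above.
    return {
        "batch_size": 8,
        "num_epochs": 3,
        "lr": 1e-4,
        "seq_len": 350,
        "d_model": 512,
        "num_heads": 8,
        "dropout": 0.1,
        "lang_src": "en",
        "lang_tgt": "ja",
        "model_folder": "weights",   # assumed key for the checkpoint directory
    }
```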
The research directory contains:
- Attention visualization notebooks
- Experimental trials and results
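The visualization notebook is not reproduced here, but plotting one head's attention weights generally reduces to a heatmap. A minimal sketch, assuming a (target_len x source_len) weight matrix and token lists:

```python
import matplotlib.pyplot as plt

def plot_attention(attn, src_tokens, tgt_tokens):
    """Heatmap of one attention head: rows are target tokens, columns are source tokens."""
    fig, ax = plt.subplots(figsize=(8, 8))
    ax.imshow(attn, cmap="viridis")
    ax.set_xticks(range(len(src_tokens)))
    ax.set_xticklabels(src_tokens, rotation=90)
    ax.set_yticks(range(len(tgt_tokens)))
    ax.set_yticklabels(tgt_tokens)
    fig.tight_layout()
    plt.show()
```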
- Based on the original Transformer paper: "Attention is All You Need" by Vaswani et al.
- Uses the Helsinki-NLP/opus-100 dataset for training
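For reference, the dataset can be pulled with HuggingFace datasets; this is a minimal sketch assuming the "en-ja" configuration is the one used, while the repository's dataset.py defines the actual split handling.

```python
from datasets import load_dataset

# Assumes the "en-ja" language pair of opus-100.
dataset = load_dataset("Helsinki-NLP/opus-100", "en-ja")
print(dataset["train"][0])   # {'translation': {'en': '...', 'ja': '...'}}
```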