Implementation of the Transformer neural network architecture, inspired by the original "Attention Is All You Need" paper from Google (Vaswani et al., 2017).
- Modularity: The project is organized into separate modules for key components such as activation functions, attention mechanisms, linear layers, and loss functions, which improves code reusability and maintainability.
- Tokenization and Vectorization: The tokenizer and vectorizer modules preprocess text data and convert it into numerical representations suitable for input to the Transformer, enabling efficient handling of natural language data.
- Enhanced Regularization: The regularization module provides dropout, which improves the model's generalization, reduces overfitting, and makes training more robust.
- Positional Encoding: The preprocessing module includes positional encoding, which injects token-order information into the input sequences so the Transformer can model sequential data (see the sketch after this list).
- Attention Mechanisms: The attention module implements the attention mechanism that lets the model focus on the most relevant parts of the input, the key ingredient in the Transformer's ability to handle long-range dependencies (a minimal sketch also follows this list).
- Custom Linear Layers: The linear module offers customizable linear layers, the fundamental building blocks that project data between representations throughout the network.
- C Extensions for Performance: C extensions accelerate selected operations, optimizing critical computations and boosting the efficiency of the neural network.
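For reference, the sinusoidal positional encoding described in the paper can be sketched in plain PyTorch as follows; this is a minimal stand-alone illustration, not necessarily the exact interface exposed by the preprocessing module:

import math
import torch

def sinusoidal_positional_encoding(seq_len, d_model):
    # PE(pos, 2i)   = sin(pos / 10000^(2i / d_model))
    # PE(pos, 2i+1) = cos(pos / 10000^(2i / d_model))
    position = torch.arange(seq_len, dtype=torch.float32).unsqueeze(1)        # (seq_len, 1)
    div_term = torch.exp(
        torch.arange(0, d_model, 2, dtype=torch.float32) * (-math.log(10000.0) / d_model)
    )                                                                         # (d_model / 2,)
    pe = torch.zeros(seq_len, d_model)
    pe[:, 0::2] = torch.sin(position * div_term)   # even dimensions
    pe[:, 1::2] = torch.cos(position * div_term)   # odd dimensions (d_model assumed even)
    return pe                                      # added element-wise to the token embeddings

# pe = sinusoidal_positional_encoding(50, 512)     # shape (50, 512)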
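Likewise, the scaled dot-product attention at the core of the attention module can be illustrated with the standard formulation from the paper; the function name and signature below are illustrative only:

import math
import torch

def scaled_dot_product_attention(query, key, value, mask=None):
    # Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V
    d_k = query.size(-1)
    scores = query @ key.transpose(-2, -1) / math.sqrt(d_k)    # (..., seq_q, seq_k)
    if mask is not None:
        scores = scores.masked_fill(mask == 0, float("-inf"))  # hide disallowed positions
    weights = torch.softmax(scores, dim=-1)                     # attention distribution over keys
    return weights @ value, weights

# q = k = v = torch.randn(2, 5, 64)                  # batch of 2 sequences, 5 tokens, 64-dim heads
# out, attn = scaled_dot_product_attention(q, k, v)  # out: (2, 5, 64), attn: (2, 5, 5)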
The modules can also be combined to build custom neural network architectures, as in the recurrent network example below.
Recurrent Neural Network
import torch
import torch.nn as nn

from src.tokenizer import Tokenizer
from src.activation import ReLU, Softmax
from src.vectorizer import BuildVectors
from src.linear import Linear
from src.loss import Loss
from src.preprocessing import Embedding


class RNN(nn.Module):
    def __init__(self, vocab_size, embedding_dim, hidden_dim, output_dim):
        super(RNN, self).__init__()
        self.embedding = Embedding(vocab_size, embedding_dim)            # token ids -> dense vectors
        self.rnn = nn.RNN(embedding_dim, hidden_dim, batch_first=True)   # recurrent layer
        self.fc = Linear(hidden_dim, output_dim)                         # project hidden states to output size
        self.activation = ReLU()

    def forward(self, text):  # named forward so nn.Module's __call__ dispatches to it
        embedded = self.embedding(text)        # (batch, seq_len, embedding_dim)
        output, hidden = self.rnn(embedded)    # output: (batch, seq_len, hidden_dim)
        output = self.activation(output)
        output = self.fc(output)               # (batch, seq_len, output_dim)
        return output
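A rough usage sketch, assuming the custom Embedding and Linear modules follow the usual PyTorch calling convention (the sizes below are placeholders):

# Hypothetical sizes for illustration; the real vocabulary size comes from the tokenizer.
model = RNN(vocab_size=10000, embedding_dim=128, hidden_dim=256, output_dim=2)
tokens = torch.randint(0, 10000, (1, 20))   # one sequence of 20 token ids
logits = model(tokens)                       # dispatches to forward(); expected shape (1, 20, 2)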