NNOF: Neural Network Optimization Framework

NNOF is a C++ project that implements and optimizes neural network operations on both CPU and GPU. Its primary goal is to compare the performance of baseline CPU operations against SIMD-optimized CPU operations, and of CPU implementations against GPU implementations of basic neural network components.
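
To make the baseline-versus-SIMD comparison concrete, here is a minimal sketch of element-wise addition in both styles. The function names (`add_scalar`, `add_avx`, `add_vectors`) are hypothetical and are not taken from the NNOF sources. The AVX path is only compiled when the compiler defines `__AVX__` (for example with `-mavx` on GCC or Clang); otherwise the function falls back to the scalar loop.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

#if defined(__AVX__)
#include <immintrin.h>  // AVX intrinsics, available when AVX is enabled at compile time
#endif

// Baseline: one float per loop iteration.
void add_scalar(const float* a, const float* b, float* out, std::size_t n) {
    for (std::size_t i = 0; i < n; ++i) {
        out[i] = a[i] + b[i];
    }
}

// SIMD variant: eight floats per iteration when AVX is enabled.
void add_avx(const float* a, const float* b, float* out, std::size_t n) {
    std::size_t i = 0;
#if defined(__AVX__)
    for (; i + 8 <= n; i += 8) {
        __m256 va = _mm256_loadu_ps(a + i);  // unaligned load of 8 floats
        __m256 vb = _mm256_loadu_ps(b + i);
        _mm256_storeu_ps(out + i, _mm256_add_ps(va, vb));
    }
#endif
    // Scalar tail for leftover elements (and full fallback without AVX).
    for (; i < n; ++i) {
        out[i] = a[i] + b[i];
    }
}

// Hypothetical convenience wrapper: element-wise sum of two equally sized vectors.
std::vector<float> add_vectors(const std::vector<float>& a,
                               const std::vector<float>& b,
                               bool use_simd) {
    assert(a.size() == b.size());
    std::vector<float> out(a.size());
    if (use_simd) {
        add_avx(a.data(), b.data(), out.data(), out.size());
    } else {
        add_scalar(a.data(), b.data(), out.data(), out.size());
    }
    return out;
}
```

Both paths should produce identical results for addition, so calling `add_vectors(a, b, false)` and `add_vectors(a, b, true)` on the same inputs is a simple correctness check to run before timing either one.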

Project Overview

This project implements:

A Tensor class for handling multi-dimensional data
CPU implementations of basic operations (addition, matrix multiplication)
SIMD-optimized versions of the above operations using AVX instructions
A FullyConnectedLayer class with both CPU and GPU forward pass, using OpenCL for GPU
A benchmarking system that compares baseline against optimized CPU performance, and CPU against GPU performance, in terms of latency, throughput, and memory usage
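
As an illustration of the latency and throughput measurements mentioned above, the following is a minimal timing harness built on `std::chrono::steady_clock`. The names `BenchmarkResult` and `run_benchmark` are assumptions made for this sketch and may not match the interface in benchmark.h. Memory usage is left out because measuring it is platform-specific.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>
#include <functional>

// Hypothetical result record for one benchmarked operation.
struct BenchmarkResult {
    double mean_latency_ms;   // average wall-clock time per run, in milliseconds
    double throughput_per_s;  // work items processed per second
};

// Runs `op` once as a warm-up, then `iterations` timed runs.
// `items_per_run` is the number of work items (e.g. tensor elements) each run handles.
BenchmarkResult run_benchmark(const std::function<void()>& op,
                              std::size_t iterations,
                              std::size_t items_per_run) {
    assert(iterations > 0);
    op();  // warm-up run, excluded from timing

    const auto start = std::chrono::steady_clock::now();
    for (std::size_t i = 0; i < iterations; ++i) {
        op();
    }
    const auto end = std::chrono::steady_clock::now();

    const double total_s = std::chrono::duration<double>(end - start).count();
    BenchmarkResult result;
    result.mean_latency_ms = total_s * 1000.0 / static_cast<double>(iterations);
    // Guard against a zero elapsed time on very fast operations.
    result.throughput_per_s =
        total_s > 0.0
            ? static_cast<double>(iterations) * static_cast<double>(items_per_run) / total_s
            : 0.0;
    return result;
}
```

A baseline and an optimized implementation can then be compared by passing each as `op` with the same `iterations` and `items_per_run`. The operation should write to memory that is read afterwards, otherwise an optimizing compiler may remove the work being timed.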

Key Components

tensor.h/cpp: Defines the Tensor class for data representation
ops_cpu.cpp: CPU implementations of neural network operations
ops_opencl.cpp: GPU (OpenCL) implementations of neural network operations
fully_connected_layer.h/cpp: Implementation of a fully connected neural network layer
gpu_operations.h/cpp: Wrapper for OpenCL operations
benchmark.h/cpp: Benchmarking utilities
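
To show how a tensor type and a fully connected layer fit together, here is a minimal CPU-only sketch. `Tensor2D` and `fully_connected_forward` are hypothetical names for this example. The actual Tensor and FullyConnectedLayer classes in this repository may have a different interface, and the OpenCL path is not shown.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical minimal tensor: row-major 2-D float storage.
class Tensor2D {
public:
    Tensor2D(std::size_t rows, std::size_t cols, float fill = 0.0f)
        : rows_(rows), cols_(cols), data_(rows * cols, fill) {}

    std::size_t rows() const { return rows_; }
    std::size_t cols() const { return cols_; }

    float& at(std::size_t r, std::size_t c) { return data_[r * cols_ + c]; }
    float at(std::size_t r, std::size_t c) const { return data_[r * cols_ + c]; }

private:
    std::size_t rows_;
    std::size_t cols_;
    std::vector<float> data_;
};

// Forward pass y = x * W + b, with
//   x: (batch x in_features), w: (in_features x out_features), b: (1 x out_features).
Tensor2D fully_connected_forward(const Tensor2D& x, const Tensor2D& w, const Tensor2D& b) {
    assert(x.cols() == w.rows());
    assert(b.rows() == 1 && b.cols() == w.cols());

    Tensor2D y(x.rows(), w.cols());
    for (std::size_t i = 0; i < x.rows(); ++i) {
        for (std::size_t j = 0; j < w.cols(); ++j) {
            float acc = b.at(0, j);  // start from the bias term
            for (std::size_t k = 0; k < x.cols(); ++k) {
                acc += x.at(i, k) * w.at(k, j);
            }
            y.at(i, j) = acc;
        }
    }
    return y;
}
```

The triple loop is the naive matrix multiplication. It is the kind of baseline that a SIMD or GPU implementation would be measured against.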


fahadhamdan1/NNOF
