Welcome to the first lesson of "Building and Applying Your Neural Network Library", the fourth and final course in our "Neural Networks from Scratch using R" path!
Throughout our journey so far, we've built a solid foundation in neural network principles. In the first course, we explored the fundamentals of neural networks, including perceptrons and the theory behind them. In the second course, we implemented forward propagation and activation functions. Most recently, in our third course, we mastered backpropagation and stochastic gradient descent, culminating in training a neural network on the diabetes dataset.
Now that we understand the core algorithms and mathematics, we're ready for the final stage: transforming our code into a proper, reusable neural network library. In this course, we'll take all the code we've produced in previous courses and restructure it into a more organized, modular framework — similar in spirit to popular libraries like torch or tensorflow, but built from scratch by us!
Our first task is to modularize the core components we've already built: dense layers and activation functions. By the end of this lesson, you'll have created a well-structured modular library that separates concerns and makes your neural network code more maintainable and extensible.
Before we dive into implementation details, let's talk about why we're restructuring our code at all. So far, we've focused primarily on understanding the algorithms that power neural networks: the math, the theory, and the implementation of key concepts. While this understanding is crucial, there's another dimension to building effective ML systems: software engineering.
Software engineering principles are vital when building machine learning systems for several key reasons:
- Maintainability: As models grow in complexity, well-structured code becomes easier to debug and update.
- Reusability: Modular components can be reused across different projects.
- Testability: Isolated components with clear interfaces are easier to test.
- Collaboration: Well-organized code enables multiple people to work on different parts simultaneously.
- Extensibility: Adding new features becomes simpler when code is properly modularized.
In industry, ML practitioners rarely write monolithic scripts. Instead, they organize code into modular libraries with clearly defined responsibilities and a well-defined structure. This is the approach we'll take as we build our neural network library.
Our library will be called neuralnets, and we'll structure it using a modular approach with separate subdirectories for different components. This structure separates concerns: activation functions live in their own module, layer implementations in another, and so on. As we continue through this course, we'll expand this structure to include losses, optimizers, and model classes.
Let's take a look at the complete directory structure we'll be building throughout this course:
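One way to lay it out is shown below. The `functions.R` and `dense.R` file names match the files discussed later in this lesson; the `losses/`, `models/`, and `optimizers/` subdirectories are filled in during later lessons.

```
neuralnets/
├── init.R              # sources all submodules
├── main.R              # demonstration script
├── activations/
│   ├── init.R
│   └── functions.R
├── layers/
│   ├── init.R
│   └── dense.R
├── losses/
│   └── init.R
├── models/
│   └── init.R
└── optimizers/
    └── init.R
```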
This structure organizes each component in its own subdirectory, with a clear separation of concerns:
- init.R: The main package initialization file that sources all submodules
- main.R: A demonstration script showing how to use the library
- activations/: Contains activation function implementations and their initialization
- layers/: Contains layer implementations (dense layers, etc.)
- losses/: Contains loss function implementations
- models/: Contains high-level model classes
- optimizers/: Contains optimizer implementations
Each subdirectory contains an init.R file that handles the initialization of that specific component, making the overall structure clean and maintainable.
In this lesson, we'll focus on implementing the activation functions and dense layer components, along with the main package initialization. The other components will be added as we progress through the course, building up our complete neural network library step by step.
Let's begin by understanding how our modular structure works. Unlike traditional R packages, we're using a module-based approach where each component lives in its own subdirectory with an initialization file.
The key to this structure is the main init.R file in the root directory, which acts as the central hub that brings all components together:
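A minimal sketch of what that hub can look like; the modules we haven't built yet are left commented out until later lessons add them:

```r
# neuralnets/init.R -- central hub: sources every submodule's init.R
source("neuralnets/activations/init.R")
source("neuralnets/layers/init.R")

# Added in later lessons:
# source("neuralnets/losses/init.R")
# source("neuralnets/models/init.R")
# source("neuralnets/optimizers/init.R")
```

Note that the paths are relative to the working directory, so this file is meant to be sourced from the directory that contains the `neuralnets/` folder.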
This approach gives us several advantages:
- Clear organization: Each component has its own directory
- Easy maintenance: Changes to one component don't affect others
- Scalability: Adding new components is straightforward
- Familiar pattern: Similar to how Python modules work
Now that we've set up our module structure, let's understand some important concepts that will help you work more effectively with this approach.
- Module initialization: Each subdirectory has an `init.R` file that sources the relevant implementation files and sets up the module's public interface. This is similar to Python's `__init__.py` files.
- Source path management: When we use `source()`, R executes the specified file and makes its contents available in the current environment. The path is relative to the working directory, so `source("neuralnets/activations/init.R")` loads the activation module.
- Component availability: After sourcing a module's initialization file, all functions and classes defined in that module become available in the current environment. This is how we can use `DenseLayer$new()` or `sigmoid()` directly after sourcing the main `init.R` file.
Now let's examine our activation functions implementation. The activation functions live in neuralnets/activations/functions.R:
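Here is one way these functions might be written. Everything is vectorized, so each function works element-wise on vectors and matrices alike:

```r
# neuralnets/activations/functions.R
# Activation functions and their derivatives, applied element-wise.

sigmoid <- function(x) {
  1 / (1 + exp(-x))
}

# Derivative of sigmoid, expressed via its own output: s * (1 - s)
sigmoid_derivative <- function(x) {
  s <- sigmoid(x)
  s * (1 - s)
}

relu <- function(x) {
  pmax(x, 0)
}

# Derivative of ReLU: 1 where x > 0, 0 elsewhere
relu_derivative <- function(x) {
  ifelse(x > 0, 1, 0)
}

linear <- function(x) {
  x
}

# Derivative of the identity is 1 everywhere; x * 0 + 1 preserves
# the shape of the input (vector or matrix)
linear_derivative <- function(x) {
  x * 0 + 1
}
```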
These functions implement three common activation functions and their derivatives:
- Sigmoid: A smooth, S-shaped function that maps any input to a value between 0 and 1.
- ReLU (rectified linear unit): Returns the input if positive; otherwise, returns 0.
- Linear: Simply returns the input unchanged (used in regression tasks).
The activation module's initialization file (neuralnets/activations/init.R) sources these functions and defines which ones are available:
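A sketch of that initialization file. The `ACTIVATIONS` registry is an illustrative addition, not something the rest of the lesson depends on; it simply pairs each activation with its derivative so later components can look them up by name:

```r
# neuralnets/activations/init.R -- initializes the activation module
source("neuralnets/activations/functions.R")

# Optional registry pairing each activation with its derivative
ACTIVATIONS <- list(
  sigmoid = list(fn = sigmoid, deriv = sigmoid_derivative),
  relu    = list(fn = relu,    deriv = relu_derivative),
  linear  = list(fn = linear,  deriv = linear_derivative)
)
```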
Now let's examine our DenseLayer class implementation in neuralnets/layers/dense.R:
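One possible implementation is sketched below, using base R's Reference Classes so that `DenseLayer$new()` works without external packages (the same calling style could be built with the R6 package instead). The constructor arguments `n_inputs` and `n_neurons` and the small-random-weight initialization are illustrative choices:

```r
# neuralnets/layers/dense.R
# A fully connected (dense) layer.

DenseLayer <- setRefClass(
  "DenseLayer",
  fields = list(
    weights = "matrix",   # n_inputs x n_neurons
    biases  = "numeric",  # one bias per neuron
    input   = "matrix",   # cached input, needed for backpropagation
    output  = "matrix"    # cached output of the last forward pass
  ),
  methods = list(
    initialize = function(n_inputs = 1, n_neurons = 1, ...) {
      callSuper(...)
      # Small random weights break symmetry; biases start at zero
      weights <<- matrix(rnorm(n_inputs * n_neurons, sd = 0.1),
                         nrow = n_inputs, ncol = n_neurons)
      biases <<- rep(0, n_neurons)
    },
    forward = function(inputs) {
      input <<- inputs  # cache for the backward pass
      # Affine transform: one row per sample, one column per neuron;
      # sweep() adds the bias vector to every row
      output <<- sweep(inputs %*% weights, 2, biases, "+")
      output
    }
  )
)
```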
Next, let's see the modular structure in action. The main demonstration script (neuralnets/main.R) shows how to load and use our library:
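A sketch of what the first part of `main.R` might contain. The constructor argument names `n_inputs` and `n_neurons` and the printed summary are illustrative:

```r
# neuralnets/main.R -- demonstration of the neuralnets library
source("neuralnets/init.R")

set.seed(42)  # reproducible random weight initialization

# A tiny network: 2 inputs -> 4 hidden neurons -> 1 output
layer1 <- DenseLayer$new(n_inputs = 2, n_neurons = 4)
layer2 <- DenseLayer$new(n_inputs = 4, n_neurons = 1)

cat("Layer 1 weights:", paste(dim(layer1$weights), collapse = " x "), "\n")
cat("Layer 2 weights:", paste(dim(layer2$weights), collapse = " x "), "\n")
cat("sigmoid(0) =", sigmoid(0), "\n")
```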
By sourcing the main init.R file, all components become available: we can create layers with DenseLayer$new() and call every activation function directly. Running the script confirms that the layers and activation functions are loaded and usable.
The demonstration script continues by performing a forward pass through the network:
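A sketch of that forward pass, continuing the demo script above; the sample input `X` and the choice of ReLU for the hidden layer and sigmoid for the output are illustrative:

```r
# Continuing neuralnets/main.R: a forward pass through the two layers.
# Assumes layer1, layer2, relu, and sigmoid are already defined.

X <- matrix(c(0, 0,
              0, 1,
              1, 0,
              1, 1),
            nrow = 4, byrow = TRUE)

hidden      <- relu(layer1$forward(X))          # 4 x 4 hidden activations
predictions <- sigmoid(layer2$forward(hidden))  # 4 x 1 outputs in (0, 1)

cat("Predictions:\n")
print(predictions)
```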
The forward pass produces a matrix of predictions, one row per input sample, each squashed into (0, 1) by the sigmoid; the exact values depend on the random weight initialization.
To use our neural network library, you simply need to source the main initialization file:
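Assuming the working directory is the one containing the `neuralnets/` folder, this is the entire setup:

```r
source("neuralnets/init.R")
```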
This single line makes all components available:
- The `DenseLayer` class for creating layers
- All activation functions: `sigmoid`, `relu`, `linear`
- All derivative functions: `sigmoid_derivative`, `relu_derivative`, `linear_derivative`
The modular structure provides several benefits:
- Easy to extend: Add new components by creating new subdirectories
- Clean interface: Users only need to source one file
- Maintainable: Each component is isolated and self-contained
- Flexible: Components can be loaded individually if needed
Congratulations! You've successfully created a well-structured modular neural network library. By organizing our code into distinct modules with clear responsibilities and a unified initialization system, we've built a foundation that's both professional and extensible. This modular design provides the structure we need for the rest of this course, where we'll continue expanding our library by adding components for loss functions, optimizers, and a high-level model class.
The journey from understanding neural network principles to building a complete, well-structured library mirrors the path many practitioners take in the field. As you progress through this course, you'll not only deepen your understanding of neural networks but also develop valuable software engineering skills that are essential for real-world machine learning applications. Understanding how to properly structure code, manage dependencies, and create clean APIs will serve you well in any data science or machine learning project.
Now, it's time to get ready for some practice. Happy coding!
