A multi-layer perceptron written in C.
The entire program is a single C file. It accepts CSV files as data and takes all the important hyperparameters as command-line arguments. To compile and use the program:
```
git clone 'https://github.com/DarkAdin/CCeptron.git'
cd CCeptron
make
```
This will compile the program with the appropriate flags. Set `DEBUG = 1` in the Makefile if you wish to use it with a debugger.
Then, use it as follows:
```
./CCeptron file.csv input_size hidden_size hidden_size2 hidden_size3 output_size epochs learning_rate annealing_rate saved_model
```
This will train the network on the data and save all weights and biases in the model file. If that file already exists, the program skips training and predicts on the data directly.
The network comes with three hidden layers and one output layer by default. The three hidden layers use GELU as the activation function and the output layer uses sigmoid.
The image above gives an idea of such a network, with:
- m parameters
- h hidden neurons in the first hidden layer
- j hidden neurons in the second hidden layer
- k hidden neurons in the third hidden layer
- o output neurons in the output layer
Every one of those parameters can be set from the command line; the choice of activation functions can only be changed in the code itself.
By all means, modify the code to suit your needs. I have added aliases for the activation and error functions, so now it's way easier to choose and call them. Look into the code.
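For reference, here is a minimal sketch of the two activation functions mentioned above, using the common tanh approximation of GELU. The names and exact definitions are illustrative and may differ from those in CCeptron's source:

```c
#include <math.h>

/* Hedged sketch of the activations described above; CCeptron's own
   implementations may differ. */
double gelu (double x) {
    /* tanh approximation of GELU; 0.7978845608 = sqrt(2/pi) */
    return 0.5 * x * (1.0 + tanh(0.7978845608 * (x + 0.044715 * x * x * x)));
}

double sigmoid (double x) {
    return 1.0 / (1.0 + exp(-x));
}
```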
After applying the weights and bias in each neuron, the activation function produces an output, which is fed into the next layer until the output layer is reached. The output of the output layer is compared against the real value present in the training data, and the loss function is computed from that comparison. From the loss (its derivative, to be more specific) the network computes all gradients, which are used during back propagation to update all weights and biases before the next forward propagation cycle.
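To illustrate that cycle, here is a self-contained sketch of one training step for a single sigmoid neuron with squared error. It is not CCeptron's actual code; all names and the single-neuron setup are hypothetical:

```c
#include <math.h>

static double sigmoid (double x) { return 1.0 / (1.0 + exp(-x)); }

/* One forward pass plus a gradient descent update for a single neuron.
   Returns the squared error for this sample. */
double train_step (double *w, double *b, const double *x, int n,
                   double target, double lr) {
    /* Forward propagation: weighted sum plus bias, then activation */
    double z = *b;
    for (int i = 0; i < n; i++) z += w[i] * x[i];
    double out = sigmoid(z);

    /* Loss and its derivative through the sigmoid */
    double err   = 0.5 * (out - target) * (out - target);
    double delta = (out - target) * out * (1.0 - out);

    /* Back propagation: update weights and bias before the next cycle */
    for (int i = 0; i < n; i++) w[i] -= lr * delta * x[i];
    *b -= lr * delta;

    return err;
}
```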
Adding to or modifying any aspect of the network should be easy enough, whether you want to add or remove hidden layers or try different activation functions.
Once all training epochs have passed, the network tests itself on the training data. This, of course, should be done on a separate testing dataset.
The random number generator is seeded with the current time and the current process ID.
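A sketch of how such seeding is commonly done; the exact combination used by CCeptron may differ:

```c
#include <stdlib.h>
#include <time.h>
#include <unistd.h>

/* Seed the RNG with the current time and the process ID.
   Illustrative only; CCeptron may combine them differently. */
static void seed_rng (void) {
    srand((unsigned) time(NULL) ^ (unsigned) getpid());
}
```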
Remember to shuffle your training data appropriately.
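If you shuffle in your own preprocessing code, a Fisher-Yates shuffle over row indices is a simple option. This is an illustrative helper, not part of CCeptron:

```c
#include <stdlib.h>

/* Shuffle an array of row indices in place (Fisher-Yates). */
void shuffle_rows (int *idx, int n) {
    for (int i = n - 1; i > 0; i--) {
        int j = rand() % (i + 1);
        int tmp = idx[i];
        idx[i] = idx[j];
        idx[j] = tmp;
    }
}
```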
The learning rate is updated every epoch by the annealing rate. The network is trained with plain gradient descent.
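A sketch of what that update might look like, assuming multiplicative annealing; variable names are illustrative, not taken from CCeptron's source:

```c
/* Illustrative training loop: after every epoch the learning rate is
   scaled by the annealing rate (e.g. 0.001 * 0.999 after epoch one). */
void train (int epochs, double learning_rate, double annealing_rate) {
    for (int epoch = 0; epoch < epochs; epoch++) {
        /* ... one full pass of forward and back propagation here,
               using the current learning_rate ... */
        learning_rate *= annealing_rate;
    }
}
```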
The back propagation function returns the current error, which is printed every 20 epochs.
The following example row comes from the iris dataset (encoded accordingly):
```
0.645569,0.795454,0.202898,0.08,1.00
```
Which consists of
```
sepal_length,sepal_width,petal_length,petal_width,class
```
If we wanted to train on this data, we should specify 4 as the input size and 1 as the output size (4 parameters and 1 class).
The class is encoded as a number as well. If we have 3 different species in the iris dataset, one possible encoding is 0.0, 0.5 and 1.0. A data point should have at most 6 decimal places.
Another way of encoding classes could be using more than one digit:
```
0.645569,0.795454,0.202898,0.08,1,0,0
```
Where the last three digits signify the class. Thus, if we have three classes, they could be encoded as
```
...,1,0,0 # First class
...,0,1,0 # Second class
...,0,0,1 # Third class
```
In other words, we would have
Sepal length | Sepal width | Petal length | Petal width | Class 1 | Class 2 | Class 3 |
---|---|---|---|---|---|---|
0.65 | 0.79 | 0.20 | 0.08 | 1 | 0 | 0 |
As an example,
```
./CCeptron iris.csv 4 20 20 20 3 2000 0.001 0.999 iris_output
```
would train a network of 4 input neurons (since we have 4 parameters), followed by three hidden layers with 20 neurons each, and finally an output layer with 3 neurons (since we have 3 classes), for 2000 epochs, with a learning rate of 0.001 and an annealing rate of 0.999, saving the resulting model to iris_output.
Remember to normalize your data!
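One simple way of doing so, matching the per-column maximum division used for the iris example below; this is a hypothetical helper, not part of CCeptron:

```c
/* Divide every value in each column by that column's maximum, so every
   feature ends up in the [0, 1] range. Row-major layout is an
   assumption made for this sketch. */
void normalize_columns (double *data, int rows, int cols) {
    for (int c = 0; c < cols; c++) {
        double max = 0.0;
        for (int r = 0; r < rows; r++)
            if (data[r * cols + c] > max) max = data[r * cols + c];
        if (max > 0.0)
            for (int r = 0; r < rows; r++)
                data[r * cols + c] /= max;
    }
}
```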
The error of every epoch is saved in a file called `savederrors` by default, so it can easily be plotted with your favorite plotting software after every training session.
In the image above, we are training the network on the normalized iris dataset (dividing each value by the maximum value of that parameter) and encoding the classes in a 1,0,0 manner. Thus, we train the network on 4 parameters, with 3 hidden layers of 20 neurons each, and an output layer of 3 neurons (since we have 3 classes), for 2000 epochs, with a learning rate of 0.001 and an annealing rate of 0.999, saving the weights, biases and architecture in a file called flowers. As can be seen, the error towards the final epochs is minimal.
After training, the network performs one forward propagation pass to produce its predictions in the testing phase. A fraction of the resulting output can be seen above.
In the image above the error function through the epochs can be seen for the normalized iris dataset with the following invocation:
```
./CCeptron iris.data 4 30 30 30 3 2000 0.001 0.999 flowers
```
CCeptron is a minimalistic approach to a simple neural network concept, the perceptron. As such, it is not a fully capable neural network. But it should be easily modifiable to suit general needs and make predictions on small-to-medium complexity data. Check out these amazing machine learning projects in C which heavily inspired CCeptron:
- tinn: tiny neural network written in C, with one hidden layer.
- darknet: one of the biggest machine learning projects in C; it works with CUDA, heavily influenced the computer vision field, and is capable of almost anything.
- genann: minimal feedforward neural network library written in ANSI C.
- kann: very complete project for constructing small to medium neural networks.
CCeptron is CPU only and, as of yet, does not support multi-threading. As such, it will not be a good option for very large or very complex data.
These things will be added in the future:
- Save all weights and biases of the network to make future predictions
- Automatically split data into training and test partitions
- Return testing performance