DEV Community

Cover image for Learn Python by building investment AI for fintech - Lesson4: Recurrent Neural Network (RNN)
Duomly
Duomly

Posted on • Updated on • Originally published at blog.duomly.com

Learn Python by building investment AI for fintech - Lesson4: Recurrent Neural Network (RNN)

This article was originally published at:
https://www.blog.duomly.com/python-course-with-building-a-fintech-investment-ai-lesson-4-recurrent-neural-network-rnn


Intro

In today's episode of the Python course with AI, we will learn how to make predictions with AI, and we will predict the prices.

Last episodes you can find here:

Lesson 3 - How to create and train AI:

Python course with building a fintech investment AI – Lesson 3: How to build and train AI

Lesson 2 - How to find financial data and how to use Pandas:

Python course with building a fintech investment AI – Lesson 2: Pandas and getting financial data

Lesson 1 - Prepare the AI project:

Python course with building a fintech investment AI – Lesson 1: Start the project

In the fourth lesson of the AI course, we will use the AI that we created in the previous episode.

We will predict some first stock prices and will compare with the real ones, to have an idea if our AI is smart.

Let's start!

If you prefer video, here is the youtube version:

Prepare data for prediction

As the first step, we should create a function that will prepare data for the prediction.

Create a function named "preparePredictData" in the file ai.py, below function "trainModel".

Next, pass "stockPricesLastMonth", and "stockPricesTest" as params.

def preparePredictData(stockPricesLastMonth, stockPricesTest):

Take all data

As the second step of preparing data, we should concatenate all the data that we have into the one data frame.

We can use the pandas "concat" method for that.

Next, we need to create the variable named "predictInputs".

To this variable, we should assign all the items from the position that we will get by calculation, up to the end of the data frame.

The calculation is the length of the total data, minus length of the lastMonth data, minus 120(6 months).

  dataset_total = pd.concat((stockPricesTest['Open'], stockPricesLastMonth['Open']), axis = 0)
  predictInputs = dataset_total[len(dataset_total) - len(stockPricesLastMonth) - 120:].values

Reshape the data

In the third step of the data preparation, we should reshape our data by using "reshape" method.

That will give us the formatted list of the "predictInputs", and our data will be ready to be scaled later.

predictInputs = predictInputs.reshape(-1,1)

Scale data to fit this one from training and prepare testPredicts

The fourth step should be to take 20 items from the "predictInputs", and append them into the array named "testPredicts".

predictInputs = scaler.transform(predictInputs)
testPredicts = []

for i in range(120, 140):
    testPredicts.append(predictInputs[i-120:i, 0])

Make 3d array and reshape

As the last step of the data preparation, we need to do 3d array from those items and reshape them.

Next, we need to return "testPredicts" variable.

testPredicts = np.array(testPredicts)
testPredicts = np.reshape(testPredicts, (testPredicts.shape[0], testPredicts.shape[1], 1))

return testPredicts

Make prediction

Super, we are done with data preparation now!

We can create the logic that will predict the prices for us.

After the prediction, we cannot forget to reinverse the data to the values that will be human-readable.

def getPrediction(dataToPredict):
  nextPrices = model.predict(dataToPredict)
  nextPrices = scaler.inverse_transform(nextPrices)
  return nextPrices

Create a graph to compare with real prices

If we have created the logic that can predict the prices, we would like to know if the AI is close to reality.

To do that, we predict the prices, for the last month, and we will compare them with the real ones.

We need to create a matplotlib graph.

Let's start by creating a function named "createGraph".

def createGraph(nextPrices, stockPricesLastMonth):

Separate real prices

Next, we need to separate the "Open" prices from all of the last month's data.

The "Open" column is the second one in the table.

So we should use "iloc" method, and take all the values from that column.

Let's take a look at the example below.

real = stockPricesLastMonth.iloc[:, 1:2].values

Create a graph

In this step, we can create a matplotlib graph.

We will use green color for the real prices, and orange color for the predicted ones.

plt.plot(real, color = 'green', label = 'Real prices')
plt.plot(nextPrices, color = 'orange', label = 'Prices from ai')
plt.ylabel('Prices')
plt.xlabel('Timestamp')
plt.legend()
plt.show()

Predict

As we did in the previous lesson, we need to pack all of the logic into one function.

We will use the function called "predict" and put all of the functions that we built today.

def predict():
  stockPricesLastMonth, stockPricesTest = splitData(prices)
  dataToPredict = preparePredictData(stockPricesLastMonth, stockPricesTest)
  predicted = getPrediction(dataToPredict)
  createGraph(predicted, stockPricesLastMonth)

Start prediction

Finally, we can run the prediction!

I'm super excited we came into this moment, and finally, we will take a look at how smart our AI is.

Let's call the function named "predict".

predict()

Conclusion

Congratulations!

Now, your project can not only get data, train AI but also predict the prices and show the graphs.

It's very important because you can compare prices with the real ones before you predict the future.

That will give you a chance to tune up the AI to get the best results.

Code repository for the Python course Lesson 4 is here:

https://github.com/Duomly/python-ai-investment-fintech/tree/Python-AI-course-Lesson-4

Stay with us for the next episodes, where we will build the next awesome and interesting things together.

And don't forget to share the info about the AI course with your friends!

Programming courses online

Thanks for reading,

Radek from Duomly

Top comments (0)