Forex Trading with R : Part 2

In the previous post the first steps were given for building the basis for trading forex. Now it is time to build the actual classifiers that  can give us future buy / hold / sell signals.

Assuming that everything is in working order and the instructions given in  the previous post were followed we can start building these classifiers.

First let's train a Neural Network. The following command trains a Neural Network and then applies the trained model on our test data and outputs the predictions for buy/sell/hold signals :

set.seed(134)
nn <- nnet(class~.,traindata, size = 3, rang = 0.1,decay = 0.001, maxit = 3000,trace="F")
table(actual=testdata$class,predicted=predict(nn,newdata=testdata,type="class"))

Note that a seed number was used.  You should either try different seed numbers (so that network weights are re-initialized) or omit the set.seed() directive. You should also experiment with other Neural Net parameters such as the number of iterations (maxit), the learning decay (decay), etc.


The confusion matrix shows us the necessary information for calculating TP, FP, TN,FN rates for each class (ie for each signal type).


Similarly we can train and test a Random Forest :


 rf.model<-randomForest(class~.,data=traindata,nodesize=40,importance=FALSE,mtry=3,ntree=100)
table(actual=testdata$class,predicted=predict(rf.model,newdata=testdata,type="class"))



Now let's train an SVM for our data. We can issue the following command : 

###train SVM
sv<-svm(class~.,traindata,gamma=0.01,cost=5,kernel="radial")

To see how the classifier did on the test set, we enter :

table(actual=testdata$class,predicted=predict(sv,newdata=testdata,type="class"))


Next we can try to optimize parameters of the SVM classifier as follows :


#find optimal values of Gamma and Cost for an RBF- SVM classifier
tuned <- tune(svm, class~., data = traindata,ranges = list(gamma = c(0.0001,0.001,0.05,0.1,0.2,0.3), cost = c(1,5,10,20,50,100,120,130)),tunecontrol = tune.control(sampling = "cross"),cross=10)


tuned


The first command uses 10-fold cross validation to identify the best gamma and cost parameters among some predetermined values. We then issue the command tuned to see which combination of parameters  gives us the lowest classification error. Knowing these parameters we can then use these parameters to train an SVM classifier and see how this model performs (as was shown previously).

Be aware of the following key points :


  1. Three sets of data should be used : Training, Test and Validation. The Validation set should not be a part of the optimization (=finding the best algorithm parameters) process.
  2. Make sure that you create classifiers for several time periods. Test the performance of any classifier according to the percentage of available data you use for training / testing / validation and the number of periods you use for the sliding window.
  3. Make also sure that once you have chosen your model, you use a correct way to test your system by simulating buy / hold / sell signals and taking under consideration all associated trading costs.


4 Responses to "Forex Trading with R : Part 2"

Powell Says :
February 26, 2011 2:37 AM

Thank you so much, I have learned a lot from these two articles.

rayton64 Says :
January 2, 2012 7:41 PM

Hi Themos,
many thanks to your great work and the remarkable book recommendation. One of the few books thats also helps non-mathematicians like I.
But have you run the MC-simulation ?
I've tried a simplified version with a smaller time window but get an exception (using Rx64 2.14.0/1).

Best regards

Themos Kalafatis Says :
January 3, 2012 10:56 AM

@rayton64,

You are welcome, the book mentioned is one of the best books i've read so far about R. Monte Carlo Methods are on my "TODO List" so i will post an example as soon as i set things up. Regarding your exception, have a look to R forums. If that leads you nowhere send me an e-mail to see if i can help.

rayton64 Says :
January 3, 2012 9:10 PM

Themos,
again thanks for your fast reply. The problems with the MC-simulation only occur with the GSPC-TS from the book, the example from the manual runs well. Another problem is to cut the TS, otherwise the PC will run out of memory. So if you know a special forum with topics to the DMwR-package or the book let me know.
I'm very grateful for every hint.

Best regards

Post a Comment