Show simple item record

Files in this item

Thumbnail

Item metadata

dc.contributor.advisorWeir, Michael
dc.contributor.authorFernandes, Antonio Ramires
dc.coverage.spatial258 p.en_US
dc.date.accessioned2018-05-22T14:56:45Z
dc.date.available2018-05-22T14:56:45Z
dc.date.issued1997
dc.identifier.urihttps://hdl.handle.net/10023/13468
dc.description.abstractThe issue of robust training is tackled for fixed multilayer feedforward architectures. Several researchers have proved the theoretical capabilities of Multilayer Feedforward networks but in practice the robust convergence of standard methods like standard backpropagation, conjugate gradient descent and Quasi-Newton methods may be poor for various problems. It is suggested that the common assumptions about the overall surface shape break down when many individual component surfaces are combined and robustness suffers accordingly. A new method to train Multilayer Feedforward networks is presented in which no particular shape is assumed for the surface and where an attempt is made to optimally combine the individual components of a solution for the overall solution. The method is based on computing Tangent Hyperplanes to the non-linear solution manifolds. At the core of the method is a mechanism to minimise the sum of squared errors and as such its use is not limited to Neural Networks. The set of tests performed for Neural Networks show that the method is very robust regarding convergence of training and has a powerful ability to find good directions in weight space. Generalisation is also a very important issue in Neural Networks and elsewhere. Neural Networks are expected to provide sensible outputs for unseen inputs. A framework for hyperplane based classifiers is presented for improving average generalisation. The framework attempts to establish a trained boundary so that there is an optimal overall spacing from the boundary to training points closest to this boundary. The framework is shown to provide results consistent with the theoretical expectations.en_US
dc.language.isoenen_US
dc.publisherUniversity of St Andrews
dc.subject.lccQA76.87F4
dc.subject.lcshNeural networks (Computer science).en
dc.titleRobustness and generalisation : tangent hyperplanes and classification treesen_US
dc.typeThesisen_US
dc.contributor.sponsorJunta Nacional de Investigação Científica e Tecnológica (Portugal).en_US
dc.type.qualificationlevelDoctoralen_US
dc.type.qualificationnamePhD Doctor of Philosophyen_US
dc.publisher.institutionThe University of St Andrewsen_US


This item appears in the following Collection(s)

Show simple item record