2.2.Data measurement setupThe model designed using neural networks has to be trained using some real data
obtained Array Fig.1.Photograph of the experimental setup.
from the above described system.The training procedure is necessary for the neural network to learn the model it is trying to predict.The data is collected from the experimental setup shown in Fig.1.This data set is obtained by measuring the steady-state and transient responses of the axial piston pump shown in Fig.2.The experimental investigation is carried out on a testbed,shown in Fig.1and the hydraulic circuit diagram shown in Fig.4.The suction and discharge lines of the test pump are connected directly to the suction and high ¯ow meter ports (24)and (25),respectively.The test pump (16)is driven by means of a high power hydraulic motor of controllable speed (13).The hydraulic circuit operates as follows:the oil ¯ows from reservoir (1)to the inlet port of the booster pump (4).The pressure relief valve (7)is used to protect the booster pump circuit against over pressure.The discharge of the booster pump passes through a non-return valve (6)to the suction and supply lines of the main pump (3).The pressure relief valve (8)is used to protect the main pump circuit against over pressure.The discharge of the main pump passes through the direction control valves (9)and (10)used to control the direction of the main pump discharge to the main drive motor circuit (12).The ¯ow rate of the test pump is indicated on a digital ¯ow meter (20).The speed of the test pump driven shaft is measured via a tachometer (15)and can be controlled by changing the speed of the electrical motor (5).The temperature of the working oil is maintained at 50258C during the operation.During the steady-state measurements,the variation of the supply pressure P is adjusted by means of the control valve (29)while the shut-o valve (30)is fully closed.The pressure gage (23)measures the oil pressure in the suction line and a digital pressure gage (21)indicates the oil pressure in the discharge line.The pressure relief valve (28)is used to protect the test
pump
Fig.2.Schematic diagram of the bent-axis piston pump.
M.A.Karkoub et al./Mechanism and Machine Theory 34(1999)1211±1226
1214
Fig.3.Schematic diagram of the control unit of the piston pump.
Table 1Pump parameters
Parameter
Description Value A c Large side area of control piston 0.000531m 2A p Piston area 0.000531m 2A pp Area of control element (38)0.0000246m 2A s Small side area of control piston 0.0000785m 2V Volume of pump delivery line 2.6Â10À3m 3V 1Volume of the ®rst control cavity 8.2Â10À6m 3V 2Volume of the second control cavity 1.7Â10À7m 3V 3Volume of the third control cavity 1.6Â10À5m 3a min Minimum cylinder inclination angle 48a max Maximum cylinder inclination angle 238
M.A.Karkoub et al./Mechanism and Machine Theory 34(1999)1211±12261215
circuit against overloading.During the transient measurements,valve (30)is fully open and valve (29)is fully closed.
2.3.Measurement of the steady-state response of the pump
The experimental determination of the steady-state performance of the studied pump is carried out by measuring the pump discharge ¯ow Q p at di erent values of the supply pressure P .The test pump parameters are presented in Table 1.The supply pressure P is controlled by the throttle valve (29)and measured by the digital pressure gage (21).The corresponding pump discharge Q p is measured by the digital ¯ow meter (20).Measurements were carried out for di erent pump speeds,550,800and 1000rpm at the same preset pressure.The pump discharge ¯ow Q p was also measured at di erent values of the preset pressure.The measured values are shown in Figs.7and
8.Fig.4.Schematic diagram of the hydraulic system.
M.A.Karkoub et al./Mechanism and Machine Theory 34(1999)1211±1226
1216
2.4.Measurement of the transient response of the pump
The experimental determination of the transient response of the studied pump,shown in Fig. 2,is carried out by measuring the operating pressures in di erent control cavities.Three electrical pressure transducers are mounted in di erent positions of the pump house which are connected directly to the control cavities of volume V1,V2and V3,as shown in Fig.3.Another pressure transducer is mounted at the pump exit line of volume V to measure the supply pressure P.The transducers are of piezoresistive type and can measure pressures between0.1 and400bar.The input voltage to each transducer is in the range of10±30V;however,the output voltage is between0and5V.These transducers are used to measure the control pressures P1,P2,P3and P(see Fig.3).A time record of the pressure signals is sampled using a PC and a data acquisition board.The board has12bit successive approximation converter with a12m s conversion time giving a maximum throughput rate of70kHz.A®xed loading ori®ce(18)and direction control valve,DCV,(17)are mounted on the pump exit line(see Figs.1and4).These are used to introduce rapid changes in the pump exit line pressure P. When the solenoid of the DCV is energized,the valve closes rapidly and the pump discharge Q p is forced to¯ow through the loading ori®ce.The testbed is arranged such that the current to the solenoid triggers the data acquisition system,thus picking up the transient variation of the pressures P1,P2,P3and P.These measurements are carried out at di erent pump speeds: 550,800and1000rpm.The measured values are shown in Figs.9±11.
3.Neural networks
In this paper,a computational tool,known as neural networks,is used to predict the behavior of bent-axis piston pump.These networks are nothing but a number of interconnected elements known as neurons.These neurons or processing elements are well
Fig.5.Schematic representation of a single neuron.selected linear or nonlinear functions that process any applied input to a known output.The input to the neurons is a weighted sum of the external inputs or the outputs of the neurons immediately preceding it.A small weight applied to the output of the neuron means that the following neuron does not process the input.In this manner,a speci®c circuit will be established for each pattern or input.This type of reasoning or circuitry makes neural networks capable of capturing nonlinearities often undetected by common modeling techniques.
The output of a speci®c neuron is a function of three main factors:the weighted input,the bias of the neuron,and the transfer function(see Fig.5).The output of any neuron is given by:
a f x b
where
x
w i I i
The transfer function f can be selected from a set of readily available functions.The function chosen for our task is the sigmoidal function:
f x
1 1 eÀx
This function is known to give good results,especially if the outputs are known for the given inputs.
Any network is usually divided into subnetworks or commonly referred to as layers.Each
Fig.6.Schematic representation of a multi-layer,feedforward,neural network.network contains two essential layers which are the input and output layers and one or more hidden layers if the task requires that.Fig.6shows a typical feedforward architecture of a neural network.The output of the output layer is a result of a combined e ect of all the neurons in the network.
3.1.Training
Designing a neural network requires at least four major steps:(1)selecting the number of layers;(2)selecting the number of neurons;(3)selecting the types of transfer functions;and(4) selecting a training data set that captures the behavior of the system.The training process is time consuming and very critical for the success of the network.Several techniques are used to do the training;among these is backpropagation with momentum.The weight of every input to every neuron is updated consecutively starting from the output layer and working backwards.During this process an objective function,usually the sum squared error,is minimized.Several optimization techniques are used in the literature including the Powell's and the Levenberg±Marquardt algorithms.The latter technique is used in this paper.This technique switches between the famous gradient descent(see the Appendix)and the Gauss±Newton algorithms.The so-called update rule in the Levenberg±Marquardt method is given by:
D W À
c T c a I
ÁÀ1
c T E 1
where c is matrix of derivatives of errors to each weight,a is a scalar,and E is the error vector.
3.2.Data selection
One of the factors that make the training process very time consuming is the size and quality of the training data.If the data does not cover all the details about the behavior of the system, the optimization procedure might not converge to the expected answer.However,obtaining a data set that describes all aspects of the system is not an easy task.Moreover,the training data can be redundant,i.e.several patterns convey the same information.Therefore,the training time will be increased to process the same information over and over again.If that is the case, it is suggested that the Karhunen±Loeve decomposition be used.
Suppose that we have a data set f(x,t)with M input vectors.De®ne the average of these vectors X:
"X 1
M M
i 1
X i
let
X
i X iÀ
"X
The covariance matrix is de®ned as follows:C ij 1
M
X
iÁ
X
j
The eigenvalues and eigenvectors of the covariance matrix can be obtained by singular values decomposition.Let f k i be the i th component of the k th eigenvector.The projection of the vectors onto the eigenspace is given by:
c k
i 1
M f k i X i
De®ne the energy of the set E as follows:
E
M
i 1
l i
where l i is the eigenvalue corresponding to the i th eigenvector.The energy of each input vector can be de®ned as follows:
E k l k E
Now,calculate the coordinates a i as follows:
a i
XÁc i c iÁc i
If the energy of the input vector is very small then the point will not have additional information about the system.Suppose that there are N energetic input vectors arranged from the most to least energetic,the original data set can be approximated as follows:
f x,t 9
N
i 1
a i c i
3.3.Cross-validation
The cross-validation process is needed to ensure that the right number of neurons is used in the network.Over-®tting is very common when using neural networks because one wants to make the error as small as possible.However,this may result in higher order®tting than is actually needed.Therefore,a mechanism to ensure over®tting does not happen should be devised.The cross-validation process involves the prediction of certain patterns not used in the training process.If the prediction is reasonable,the network is retained;otherwise,it is rejected.The search for the best network continues by increasing or decreasing the number of neurons until a satisfactory one is obtained.
4.Results
An experimental setup was built for a bent-axis piston pump.The purpose of the setup is to study the pump and minimize its power losses at high pressures.Our initial work is to derive a model to predict the dynamic behavior for the current design.There are several modeling schemes that can be used to derive a theoretical model for the pump.However,most of these schemes are simplistic and may not capture all aspects of the dynamic behavior of the pump such as nonlinearities.For that reason,neural networks are used here to attempt to capture most of the dynamic phenomena describing the system.Several neural networks were designed for the pump system following the procedure described in the previous section using a Pentium PC and Matlab.1The network that best describes the pump system has two hidden layers with ®ve neurons in each of the hidden layers.Most of the other designs were rejected because they failed in the cross-validation process.The neural network modeling technique predicted the steady-state behavior of the pump accurately for several pump speeds and pressure settings.The error between the experimental and the theoretical values does not exceed 2%(see Figs.7and 8).The pressure P was predicted accurately especially after time t =0.2s.At the start,the data was very noisy which can cause problems in most common ®tting techniques.This type of problem can be avoided in neural networks by using the proper type of transfer functions and ensuring that over-®tting does not occur.Despite the noisy nature of the training data,the designed neural network predicted the pressure in an acceptable manner.Figs.10and 12show a good agreement between the neural network prediction and
the Fig.7.Steady-state ¯ow rate for di erent set points and a pump speed of 1000rpm:experimental (Â)and NN prediction (dash-dotted)for set pressure of 75bar,experimental (w )and NN prediction (solid)for set pressure of 125bar,experimental (+)and NN prediction (dashed)for set pressure of 160bar.
1Matlab is software packaged and marketed by the Mathworks Inc.
experimental data.At the onset of the experiment;i.e.time less than 0.25s,the data looks very noisy and very hard to ®t.The designed network approximated the data in that region without ®tting the noise.Similarly,Fig.9shows how well the neural network model predicts the pressure P 2.The errors between the predicted and the experimental values are less than 7%.This is a
good Fig.8.Steady-state ¯ow rate for di erent pump speeds and set pressure of 75bar:experimental (Â)and NN prediction (dash-dotted)for 1000rpm,experimental (+)and NN prediction (dashed)for 800rpm,and experimental (w )and NN prediction (solid)for 550
rpm.
Fig.9.Prediction of the pressure P (ori®ce diameter=2.5mm):experimental (w )and NN prediction (solid)for pump speed of 1000rpm,experimental (+)and NN prediction for pump speed of 800rpm.
pump speed of1000rpm,experimental(+)and NN prediction(dotted)for pump speed of550rpm.
pump speed of1000rpm,experimental(+)and NN prediction(dotted)for pump speed of800rpm.
indication that the neural network modeling technique is a viable tool in modeling complicated systems such as the bent-axis pump.
5.Conclusion
A modeling technique,neural networks,has been used to predict the steady-state and dynamic behavior of a bent-axis piston pump.Experimental data was collected from an experimental setup to train the network.The resultant model predicted the pressures accurately.Therefore,neural networks has a lot of potential in modeling complicated systems such as the bent-axis piston pump.
Appendix.The gradient descent method
The gradient descent method is commonly used to update the weights during the training phase.Let us assume that the network is composed of n layers and the input vector to the network has m components.The output from neuron x in layer k is calculated as follows:a xk F k
2 N k À1i 1
w ixk a i k À1 b xk 3x 1,2,F F F ,N k and k 1,2,F F F ,n 1 where for k =
Fig.12.Prediction of the pressure P 3(pump speed=800rpm and ori®ce diameter=2.5mm):experimental (w )and NN prediction (solid).
N 0 m and a i 0 I i 2 The output of the output layer can be obtained by setting k=n in Eq.(1).The sum squared error (sse )of the network is given by:
sse P p 1 N n j 1À
d j Àa jn Á2p 3
where d j and a jn are the desired and actual outputs from neuron j in the output layer n ,respectively.The subscript p represents a speci®c input vector.The goal of the training procedure is to obtain a suitable set of weights that leads to a minimum sse .The gradient descent of layer k is given by: d n Àk p D F n Àk p ÀW T n Àk Áp d n Àk p k 1,2,F F F ,n À1
4 where T represents the transpose of the matrix.The neuron gradient matrix for an input vector p is given by: D F k p diag d F k 1k ,d F k 2k ,F F F ,d F k N k k
5 and the corresponding weight matrix is given by: W k p P T T R w 11w 12ÁÁÁw 1N k w 21w 22ÁÁÁw 2N k **ÁÁÁ*w N k 1w N k 2ÁÁÁw N k N k Q U U S p
6
The values of the gradient descent for the output layer is given by:
d p n D F n p z n p
7 where
z n p  d 1Àa 1n , d 2Àa 2n ,F F F ,Àd N n Àa N n n ÁÃT p 8 Following the procedure described above,we can calculate the gradient descents of all the layers one by one starting from the output layer.When all the gradient descents for all the layers are calculated,the adjustments for the weights are calculated using the following update rule:ÀD w ijk Áp Z Àd jk a ik Áp 9 The constant Z is known as the learning rate.The actual weights can be updated as follows: w ijk p w ijk p À1
ÀD w ijk Áp 10
References
[1]A.Aoyama,F.J.Francis,V.Venkatasubramanian,Control-a ne neural network approach for nonminimum-
phase nonlinear process control,Journal of Process Control6(1996)17±26.
[2]K.A.Edge,J.Darling,The pumping dynamics of swash plate piston pumps,Journal of Dynamic Systems,
Measurement,and Control111(19)307±312.
[3]R.Gharbi,M.Karkoub,A.Elkamel,An arti®cial neural network for the prediction of immiscible¯ood per-
formance,Energy and Fuels9(1995)4±900.
[4]R.M.Harris,K.A.Edge,D.G.Tilley,The suction dynamics of positive displacement axial piston pumps,ASME
Journal of Dynamic System,Measurements and Control116(1994)281±287.
[5]P.Kaliafetis,T.H.Costopoulos,Modelling and simulation of an axial piston variable displacement pump with
pressure control,Mechanism and Machine Theory30(1995)599±612.
[6]M.Karkoub,A.Elkamel,Modeling pressure distribution in a rectangular gas bearing using neural networks,
Tribology International30(2)(1997)139±150.
[7]I.Kiyoshi,N.Masakasu,Study of the operating moment of a swash plate type axial piston pump:®rst report,
The Journal of Fluid Control22(1994)7±29.
[8]C.J.Li,L.Yan,N.W.Chbat,Powell's method applied to learning neural network control of three unknown
dynamic systems,Optimal Control Applications and Methods16(1995)251±262.
[9]M.N.Panda,D.E.Zaucha,G.Perez,A.K.Chopra,Application of neural networks to modeling¯uid contacts
in Prudhoe Bay,in:SPE70th Annual Technical Conference and Exhibition,Dallas,TX,22±25October,1995.