ContentslistsavailableatScienceDirect
Energy & Buildings
journalhomepage:www.elsevier.com/locate/enbuild
Parameter estimation for externally simulated thermal network models R
O.M. Brastein
∗, B. Lie , R. Sharma , N.-O. Skeie
Department of Electrical Engineering, Information Technology and Cybernetics, University of South-Eastern Norway, Porsgrunn N-3918, Norway
a rt i c l e i nf o
Article history:
Received 17 November 2018 Revised 14 February 2019 Accepted 11 March 2019 Available online 11 March 2019 Keywords:
Grey-box models
Stochastic differential equations Parameter estimation Profile likelihood Thermal network models Unscented Kalman filter Ensemble Kalman filter
a b s t ra c t
Obtainingaccuratedynamicmodelsofbuildingthermalbehaviourrequiresastatisticallysolidfoundation forestimatingunknownparameters.Thisisespeciallyimportantforthermalnetworkgrey-boxmodels, sincealltheirparametersnormallyneed tobeestimatedfromdata.Oneattractivesolution istomax- imisethelikelihoodfunction,undertheassumptionofGaussiandistributedresiduals.Thistechniquewas developedpreviouslyandimplementedintheContinuousTimeStochasticModellingframework,where an ExtendedKalman Filter isused tocomputeresiduals and theircovariances. Themain result ofthis paperisasimilarmethodappliedtoathermal networkgrey-boxmodelofabuilding,simulatedasan electriccircuitinanexternaltool.Themodelisdescribedasalistofinterconnectedcomponentswithout derivingexplicitequations.Sincethismodelimplementationisnotdifferentiable,analternativeKalman filterformulationisneeded.TheUnscentedandEnsembleKalmanFiltersaredesignedtohandlenon-linear modelswithoutusingJacobians,andcanthereforealsobeusedwithmodelsinanon-differentiableform.
BothKalmanfilterimplementationsaretestedand comparedwithrespecttoestimationaccuracy and computation time. The ProfileLikelihood methodis used toanalyse structural and practicalparameter identifiability.Thismethodisextendedtocomputetwo-dimensionalprofiles,whichcanalsobeusedto analyseparameterinterdependencebyprovidinginsightintotheparameterspacetopology.
© 2019TheAuthors.PublishedbyElsevierB.V.
ThisisanopenaccessarticleundertheCCBY-NC-NDlicense.
(http://creativecommons.org/licenses/by-nc-nd/4.0/)
1. Introduction 1.1.Background
The heating and cooling of buildings consumes a significant partof the world’s total energy production.While new building materials andtechniquesmay reduce theenergy consumption of buildings,therenewalrateofbuildingsislow[1].Hence,itisim- portanttostudymethodsthatcanalsoreduceenergyconsumption inexistingbuildings.
Building Energy Management Systems (BEMS) utilising ad- vancedmodel-based control methods [2]to forecast the temper- aturevariations of a building in order to predict an optimal se- quenceofcontrol inputsis apromisingmethodforthereduction ofenergyconsumption.Since themodel’spredictionaccuracy di- rectlyinfluences theefficiencyofsuch methods,itisimportantto
R This research did not receive any specific grant from funding agencies in the public, commercial or not-for-profit sectors.
∗ Corresponding author.
E-mail address: [email protected] (O.M. Brastein).
develop accurate models of building thermal behaviour. In addi- tiontodescribingthetimeevolutionofthesystemstatesandout- puts,agoodmodelmustaccommodatedescriptionsofbothmea- surementnoiseandprocessnoise[3,4].Thisrequiresastatistically solidframeworkforestimatingunknownparameters[5].
Thermalnetworkmodels areoftenused tomodelthethermal behaviourofbuildings[1,6–8].Implemented asResistor–Capacitor equivalent circuits, these models offer an intuitive model design based on a cognitive understanding of the thermal physics in- volved. Since, typically, all parameters of such models must be identifiedfromdata,itisimportanttoinvestigateparameteriden- tifiability prior to assuming physical interpretation of the esti- matedparametervalues[8].
1.2. Previouswork
1.2.1. Modellingofdynamicsystems
Modelsaresometimes classifiedbasedonthelevelofphysical insight used in their derivation. If the model is mechanistic, i.e., based purely on physical equations, it is classified as white-box. Such models excel at describing non-linear state transitions and https://doi.org/10.1016/j.enbuild.2019.03.018
0378-7788/© 2019 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license. ( http://creativecommons.org/licenses/by-nc-nd/4.0/ )
measurements. Theyalsotendto generalisewell betweensimilar systems[5,9].Analternativeapproach istheuseofsystemidenti- fication (SID) methods [3,4,10–12], wherea predeterminedmodel structure with unknown coefficientsis calibratedusing measure- ments ofthe system inputs andoutputs. This results in a black- boxmodelinwhichnopriorphysicalinsightisused,exceptinthe choice of input and output measurements, sample time, andthe approximate model complexity. These models tend to have bet- ter predictionaccuracy, butlesscapability togeneralise[5,9].SID methodstendtoprovidebetterstatisticsonthemodeluncertainty, whicharetypicallycomputedduringthecalibrationprocess[3–5]. A third,intermediate, possibility isthe grey-box model,which is based on a simplified modelstructure constructed using naive physicalknowledgeofthesystem.Modelparametersarecalibrated frommeasurements ofthe system, similarlyto black-boxmodels.
Grey-boxmodels are oftentreated ina stochastic framework [5]. It couldbe arguedthat mostwhite-boxmodelsincludesome ap- proximationsand/orneedcalibrationofcertainparameters.Hence, they can benefitfrom theapplication ofstochastic grey-boxcali- brationmethods.Thisapproachhasindeedbeenclaimedasanat- uralframeworkformodellingdynamicsystemsingeneral[13].
1.2.2. TheCTSMframework
Estimation of parameters is essentially an optimisation prob- lem, whichrequires a well-definedobjective function.Severalal- ternatives are used in the literature, such as the deterministic simulation error approach [1]. A statistically solid alternative for stochastic grey-boxmodels is found in [5,14],which is based on maximisingthelikelihoodfunctionevaluatedby computingresid- uals in a Kalman Filter. This method has been previously devel- oped in a number ofpublications [5,14–16] andimplemented in theContinuousTimeStochasticModelling(CTSM)framework[15]. In CTSM, the residuals needed to evaluate the likelihood func- tionarecomputedusinganExtendedKalmanFilter(EKF)withsub- samplingof thestate transition equationsto improveresponse to non-linearmodels[5,15].TheEKFisbasedonlinearisingthestate transitionsand/ormeasurementequations,whichrequiresthatthe modelequationsaredifferentiable[17–19].
1.2.3. Identifiability
Since thermalnetwork buildingmodels are partially basedon physical knowledge, it is often suggested that the parameters can be assigneda physical interpretation[1,5,6]. Thisassumption should,however,beverifiedinthecontextofparameteridentifia- bility[3,20].Itiswellknownthat modelscancontainparameters thatarestructurallynon-identifiable[3,20].Further,lackofproper excitationofthesystemduringdataacquisitionmayleadtopracti- cal non-identifiability[3,8,20–22].Whilethemodelstructuremay be designedsuch thattheparameters areintendedtohavea spe- cificphysicalmeaning,itisnotcertainthattheestimatedparame- terssupportthisassumption.Agoodtoolforidentifiabilityanalysis istheprofilelikelihoodmethod[8,21,22].
1.3. Overviewofpaper
In this paper, a resistor-capacitor equivalent thermal network modelofabuildingisexpressedasalistofinterconnectedelectri- cal components.Themodelissimulatedinan externaltool with- out derivingexplicitmodelequations,hencethemodelcannot be differentiated. This is motivated by the need to simplify experi- mentationwithdifferentmodelstructuresinawaythatcouldpo- tentiallybeautomated.Theparameterestimationmethodfromthe CTSM framework is adapted to non-differentiable models, which requires an alternative to the EKF for computing residuals. Both the Unscented Kalman Filter (UKF) [18] and Ensemble Kalman Fil- ter(EnKF)[23]arecomparedandconsideredfortheestimationof
residuals. The explicitmodel equations are alsoderived on stan- dardlinearform,andusedwithastandardKalmanFilterasabase- line for comparison.Observe that while the model used here is linear,themethodisnotrestrictedtolinearmodels;theexternally simulatedstatetransitionscouldwellbenon-linear.
Aprofilelikelihoodapproachisused[22]toanalyseparameter identifiability.The methodisextended tocreatetwo-dimensional profiles in the form oftopological heat maps. These 2D plots are computedforallcombinationsofparameters.Inadditiontodiag- nosingtheidentifiabilityofthe parameters,theseplots allowde- tectionofparameterinterdependence.
Thepaperis organisedasfollows.Thetheoretical basisis dis- cussedinSection2.Themodel,externalsimulatorandexperimen- talset-upispresentedinSection3,andtheresultsarepresented anddiscussedinSection4.
2. Theoreticalbasis
2.1. Stochasticmodelparameterestimation
Estimationofparametersforaknownmodelstructure[17]can bedefinedassolvingtheoptimisationproblem:
θ
ˆ=argminθ g
( θ
;M,K,A)
(1)s.t.
θ
∈Here, M is a predetermined model structure, which is parametrised by
θ
∈,where ⊆Rnθ isa set of feasible values forthemodelparametersthat forminequality constraintsforthe optimisation problem in Eq. (1). K represents the experimental conditions,includinga setofmeasurements ofsysteminputsand outputs. These measurements are used to evaluate the objective functiongwhenθ
isvariedoverthefeasiblesetbyanumerical optimisationalgorithmA.Inthesequel,thealgorithmConstrained Optimisation By Linear Approximation (COBYLA)[24] is used. This algorithmisgradientfree,henceidealforsolving Eq.(1).COBYLA alsosupportsinequality constraintswhichcanbe usedto impose thelimitsofthefeasibleregionontheparameterestimates.SincethemodelstructureMisarepresentationofasystemS, itis oftenassumed that S∈M() andthat consequently there existsa trueparametervector
θ
∗ suchthat M(θ
∗)=S.However, this is rarely the case, especially for simplified grey-box models based on a naive physical understanding of the system S. Typi- cally,theestimateθ
ˆdependsontheamountofdynamicinforma- tionin K,the choice ofobjectivefunction g,andto some extent ontheoptimisationalgorithmA.Hence,itisnecessarytoanalyse theidentifiabilityoftheestimatedparameters.Thistopicisfurther discussedinSection2.4.Next, define the continuous time input ut∈Rnu and output yt∈Rny,andthecorrespondingorderedsequencesofdiscretetime measurementsukandyktakenfromthesystemS:
y[N]=[y0,y1,...,yN] (2) u[N]=[u0,u1,...,uN] (3) Here, the integer subscripts k=0,1,...,N denote the discrete time samplinginstants, andthesubscript enclosed in [·] is used toindicateanorderedsequence.
A grey-box model can be expressed as a continuous time stochastic differential equation (SDE) with a discrete time mea- surementequation;adoptingthenotationof[5]:
dxt=f
(
xt,ut,t,θ )
dt+σ (
ut,t,θ )
dω
t (4) yk=h(
xk,uk,tk,θ )
+ek (5) where t∈R is the time variable and xt∈Rnx is the continuous time state vector. The first and second terms in the state transi- tionequation, giveninEq.(4),are commonlycalledthedriftanddiffusionterm,respectively[5,25].Thediffusiontermexpressesthe processnoiseasthefunction
σ
multipliedwiththedifferentialofa standardWienerprocessω
t.Thediscretetime measurementequa- tionisgiveninEq.(5).2.2.Maximumlikelihood
This section givesa summary ofthe theoretical basis adopted from the CTSM framework [5,14,15]. The objective function g in Eq.(1)can bederived from thelikelihood function,which isde- fined asthe probability of observing the measurement sequence y[N] when
θ
andMareknown,i.e.:L
θ
;y[N],M=p
y[N]
| θ
,M(6) In the sequel, the model structure M is implicitly assumed knownandomittedfromthecondition.Byapplicationoftherule P(A∩B)=P(A
|
B)P(B)[25],Eq.(6)canbeexpandedsuchthat:L
θ
;y[N] = N
k=1
p
yk
|
y[k−1],θ
p
(
y0| θ )
(7)The diffusion term in Eq. (4), which is assumed to be addi- tiveand independentof thestate x, is driven by aWiener pro- cesswhosedifferentialisGaussiandistributed[5].Hence,itisrea- sonabletoassumethattheconditionalprobabilitiesinEq.(7)can
beapproximatedbyGaussiandistributions[5,15].Thisassumption canbecheckedduringmodelvalidationbytestingtheresidualsfor normality [3,5].The likelihoodcan then beexpressed asamulti- variateGaussiandistribution[5],
L
θ
;y[N]=
⎛
⎝
Nk=1
exp −12
kTEk−1|k−1
k
det
Ek|k−1
√ 2π
ny⎞
⎠
p(
y0| θ )
(8)AKalmanFiltermaybeusedtoestimatethequantities ˆ
yk|k−1=E
yk
|
y[k−1],θ
(9)
k=yk−yˆk|k−1 (10)
Ek|k−1=E
k
kT (11) IntheCTSMframework,anEKFisused.InSection2.3thealterna- tiveuseofUKFandEnKFisdiscussed.
Eq.(8)canfurther be simplifiedby takingthenegativeofthe logarithm;definingtheloglikelihoodfunction(
θ
;y[N]):
θ
;y[N] =−lnL
θ
;y[N] (12)Thesolutiontotheoptimisationproblemisnotaffectedsince argmax
θ∈L
θ
;y[N] =argmin θ∈θ
;y(N)(13)
Table 1
Comparing equations for UKF (left) and EnKF (right).
Definitions and initialisation
ζm(0)=λ+λnx ζc(0)=λ+λnx+
1−α2+β
ζm(i)=ζc(i)=2(λ+1nx), i∈{1,...,2nx} λ=α2(nx+κ)−nx
w(ki) ∼N(w¯k,Wk), i∈{1,...,np} v(ki) ∼N(v¯k,Vk), i∈{1,...,np} x0(i|)0 ∼N(x¯0,X0), i∈{1,...,np}
ˆ
x0|0=E[x0]=x¯0
X0|0=V x0−xˆ0|0
=X0
ˆ
x0|0=n1pnp i=1x(0i|)0 X0|0=np1−1np
i=1 x(0i|)0−xˆ0|0 (...)T
State propagation x{k2−1nx|+1k−1}=ς
xˆk−1|k−1,Xk−1|k−1
x(ki|)k−1=f xk(i−1) |k−1,uk−1,w¯k
i∈{0,...,2nx} ˆ
xk|k−1=2nx i=0ζm(i)x(ki|)k−1
a)Xk|k−1=2nx
i=0ζc(i) x(ki|)k−1−xˆk|k−1
(...)T+Wk
xik|k−1=f x(ki−1)|k−1,uk−1,w(ki−1)
i∈{1,...,np} xˆk|k−1=n1pnp
i=1xk(i|)k−1
b)Xk|k−1=np1−1np
i=1 xk(i|)k−1−xˆk|k−1
(...)T
Measurement estimate x{k2|kn−1x+1}=ς
ˆ
xk|k−1,Xk|k−1 y(ki|)k−1=h xk(i|)k−1,uk−1,v¯k
i∈{0,...,2nx} ˆ
yk|k−1=2nx i=0ζm(i)yk(i|)k−1
y(ki|)k−1=h x(ki|)k−1,uk−1,v(ki−1)
i∈{1,...,np} ˆ
yk|k−1=n1pnp i=1yˆ(ki|)k−1
Innovation and cross covariance Zk|k−1=2nx
i=0ζc(i) x(ki|)k−1−xˆk|k−1 y(ki|)k−1−yˆk|k−1T a)Ek|k−1=2nx
i=0ζc(i) y(ki|)k−1−yˆk|k−1
(...)T+Vk
Zk|k−1=np1−1np
i=1 x(ki|)k−1−xˆk|k−1 yk(i|)k−1−yˆk|k−1T
Ek|k−1=np1−1np
i=1 y(ki|)k−1−yˆk|k−1 (...)T
K k= Z k|k−1E k−1|k−1 K k= Z k|k−1E k−1|k−1 Aposteriori update c)
k|k−1=yk−yˆk|k−1 ˆ
xk|k=xˆk|k−1+Kkk|k−1
Xk|k=Xk|k−1−KkEk|k−1KkT
xk(i|)k=x(ki|)k−1+Kk(yk−y(ki|)k−1) i∈{1,...,np}
b)xˆk|k=n1p
np i=npx(ki|)k
b)Xk|k=np1−1np
i=1(x(ki|)k−xˆk|k)(...)T
a) Assuming affine noise. (See Remark 3).
b) Can be omitted (See Remark 5).
c) Mathematically equivalent but not interchangable (See Remark 6).
Finally,byconditioningonknowingy0,andeliminatingthescaling constants12 from(
θ
;θ
;y[N]),theobjectivefunctionfromEq.(1)is givenas:g
( θ
;M,K)
= Nk=1
kTEk−|1k−1
k+ln
det
Ek|k−1
(14)wheretheconstanttermc=N·ny·ln(2
π
)isdropped.2.3. AlternativeKFformulations
The popularity of the Kalman Filter has led to a number of adaptions. The Extended Kalman Filter (EKF) is perhaps the most commonsuchadaptionandisusedin[5].Inthesequel,twoother wellknownKFvariationsareoutlined;theUnscentedKalmanFilter (UKF)[18]andtheEnsembleKalmanFilter(EnKF)[23].Inaddition tobetterapproximationsfornon-linearmodels,UKFandEnKFdis- pensewiththecomputationofJacobiansandthereforedonotre- quirethemodeltobedifferentiable[18].Bothfiltersarelistedand comparedinTable1.
Given the SDE forthe state transition asin Eq. (4), the time evolutionoftheprobabilitydensityfunction(pdf)ofthestate,p(x, t), is described by the Fokker–Planck equation [23], also known as the Kolmogorov forward equation [5]. The multi-dimensional Fokker–Planckequation[25]canbeexpressedas
∂
p(
x,t)
∂
t +
i
∂ ∂
xi(
fi(
xt,ut,t,θ )
p(
x,t) )
= 1 2
i,j
∂
2∂
xi∂
xjp
(
x,t) σ
Wσ
Ti j (15)
wherefiistheithcomponentofthestatetransitionmodel.
In the EKF, the linearised model is used to approximate the first moments ofthispdf [23] by aTaylor seriesexpansion trun- catedafterthefirstterm[17,19].InbothUKFandEnKF,theFokker–
Planckequationisinsteadsolvedbyapproximatingthesolutionto Eq. (15) using a set of state realisations. The key difference be- tween the UKF andEnKF is in how that set is constructed. The UKF draws its state realisation set,called sigmapoints, usingthe unscentedtransform(UT).TheUT ofan expectedstate x¯withco- varianceXdeterministicallycomputesasetofsigmapointsx{N}=
x(i): i=0,1,...,N,where theshorthand{·}superscriptin- dicates a set and a superscript (·) denotes a member. For con- venience of notation, a UT operator
ς
(x¯,X) that returns a set of N=2nx+1sigmapointsisdefinedasx(0)=xˆ (16)
x(i)=xˆ+
(
nx+λ )
Xi,i∈
{
1,...,nx}
(17)x(nx+i)=xˆ−
(
nx+λ )
Xi,i∈
{
1,...,nx}
(18)Thesquare rootisoftenimplementedusingaCholeskydecompo- sition, and the subscript i denotes the i-th column [17,18]. Note that there are different versions ofthe UT [3,19],where the one presented in Eqs. (16)–(18) is used in the sequel. For a Gaus- sian random variable (GRV), the UT is known to approximate the pdf p(x, t) to third order accuracy, and to the second or- der for non-Gaussian random variables [17]. The introduction of
λ
=α
2(nx+κ
)−nx inEqs.(16)–(18)givesasetoftuningparam- eters that can improveapproximations ofhigher order moments [17–19].In contrast to the deterministic UT, the EnKF represents the state pdf using a Monte Carlo (MC) samplingmethod [17,18,23]. Thepdf isapproximatedasp(x,t)=dNnp,wheredNisthenumber ofstaterealisationsinsomesmallunitvolumeandnp isthetotal
numberofrealisations[23].Thesetofrealisations,i.e.,theensem- ble,is initially drawn at random using the mean andcovariance oftheinitial state.Subsequently,eachrealisationispropagatedas a distinct trajectory, thus making the EnKF equivalentto usinga MarkovChain MonteCarlo (MCMC)method tosolve the Fokker–
Planckequation[23].
2.3.1. RemarkstoTable1
Remark 1. Initialisation for both filters is equivalent if np is
“large”, since the computedensemble values based on MC sam- plingconvergetotheexpectationvaluesx¯0andX0.
Remark 2. In the UKF, the sigma transform is applied twice to computethesigmapointsforbothaprioriandaposterioristateand covarianceestimates.IntheEnKF,the realisationsaredrawn only intheinitialisation,andsubsequentlypropagatedindependently.
Remark 3. The process noise wk∼N(w¯k,Wk) andmeasurement noise
v
k∼N(v
¯k,Vk)entertheUKFandEnKFindifferentways.The model in Eqs. (4) and(5) assumes affine noise, hence the noise covariancesare added to therespective propagationequations in theUKF.Fornon-affinenoise,thereareotheradaptionsoftheUKF, e.g.,estimatingnoise byaugmenting thestate vector,that canbe used[18].IntheEnKF,arandomnumbergenerator(RNG)isused todrawinstances ofthenoise whichissubsequentlyused inthe statetransitionandmeasurementequationsforpropagationofthe ensemble.Remark4. If
ζ
m(i)= n1p andζ
c(i)=np1−1 intheUKFformulation,the correspondingequationsforestimatingmeanandcovariancefrom therealisationsetwouldbeidenticaltoEnKF(exceptfortheiter- ationindex)whennpislargeandλ
=0↔α
=1,κ
=0.Remark5. InordertoshowthesimilarityofUKFandEnKF,both filtersare formulated withexpressionsforcomputing aprioriand aposterioricovarianceforthestate estimate.Observethat forthe UKF these are needed in order to compute new sets of sigma points,whileintheEnKFthiscomputationcanbeomitted.Indeed, afundamentaladvantageoftheEnKFisthatitdoesnotrequireex- plicitcomputationoftheaprioriandaposterioristateestimateco- variance matrices, butrather propagatesthem asapproximations intheensemble.ThisisanadvantageoftheEnKFformodelswith ahighnumberofstates.
Remark 6. The EnKF aposteriori update of state realisations and covariance can be shown to be equivalent to the corresponding aposterioriupdateinthe UKF.However, since EnKFtreatsthe set ofrealisationasindependentstatetrajectories,theensemblemust beupdated fromaprioritoaposterioristateestimates.Hence, the two formulations are not interchangeable, despite being mathe- maticallyequivalent.
Remark 7. UKF hasthree hyperparameters,
α
,κ
andβ
; defaulttuningsare suggested for standard noise models in the UKF lit- erature.The EnKF hasonly one hyper parameter: thenumber of realisationsnp.
2.4.Profilelikelihood
Parameterestimatesareoftenreportedasapointintheparam- eterspace,orasaconfidenceinterval[26]withsomestatedcon- fidence
α
.Analternativesolutionistopresentthedistributionof theparametersoverthe feasiblerange.Sincetheestimationof parametersisbasedonthelikelihoodfunctioninEq.(6),oneattrac- tivechoiceforcreatingparameterdistributionsistheprofilelikeli- hood (PL)method presented in [8,21,22]. Thisapproach was also suggestedbytheauthorsofCTSM[27,28].ThePLmethodexplores the parameter space by optimising the parameters in two steps,ratherthansimultaneouslyasinEq.(1).Forsimplicityofnotation, thedependenceon y[N] is omittedfromthe loglikelihoodfunc- tion(
θ
;y[N])inthesequel.TheprofilelikelihoodPL(θ
i)isdefined astheminimumloglikelihoodforθ
iwhentheremainingparam- etersarefreelyoptimised[22,29]:PL
( θ
i)
=minθj =i
g
θ
j =i;M,K,θ
i (19)Values of
θ
i must be chosen prior to optimising the remain- ingθ
j=i[22]. Astraightforwardsolution,iftheobjectivefunction gis well behaved within the constraintsof , isto use a brute forceapproach withan evensampling ofθ
i.Alternatively, atwo- sidedgradientdecentalgorithm,usingafreelyoptimisedparame- tervectorasastartingpoint,canbeapplied[22,30].Theresulting likelihooddistributioncan beplottedasafunctionofθ
i andsub- sequentlyanalysed according to the definitions of structural and practicalidentifiability forlikelihood-based confidenceintervals [8]. Unliketheasymptotic confidenceinterval, which isbased onthe curvatureofthelikelihoodfunctionbycomputationoftheHessian [8,22],thelikelihood-basedconfidenceintervaliscomputedbyap- plying a threshold to the likelihood function to compute a confi- denceregion[22,29].Letθ
:( θ )
−θ
ˆ<
α ,
α=
χ
2( α
,ndf)
(20)where
θ
ˆisafreelyestimated,presumedoptimal,parametervector, andthe threshold α is theα
percentile oftheχ
2-distribution with ndf degrees of freedom. It follows from Wilks’ theorem [31]thatthelogarithmofthelikelihoodratioteststatistic2ln
( )
=2ln⎛
⎝
L( θ )
Lθ
ˆ⎞
⎠
=( θ )
−( θ
ˆ)
(21)can be used to compare two models. The difference in log like- lihood(
θ
)−θ
ˆis asymptotically
χ
2-distributed [22,32], with ndf equal tothe difference inthe numberoffree parameters be- tweenθ
andθ
ˆ. Hence, thePL methoduses aχ
2 threshold withndf=1. This formof confidence interval allows interpretation of structuraland practical identifiability by inspection of the upper andlower confidence boundaries [22]. If(
θ
) is lower than thethreshold in both directions, i.e., the interval at the stated con- fidence level is unbounded (± ∞), the parameter is classified as structurallynon-identifiable [22].If(
θ
) isboundedin onedirec-tion,thisindicatespracticalnon-identifiability[22,29].Profilelike- lihoodplotsareinterpretedsimilarly.Iftheplotislowerthan the confidencethresholdinbothdirectionsoronlyone,thisindicates structuralorpracticalnon-identifiability,respectively.
2.4.1. Two-dimensionalprofilelikelihood
The PL method essentially projects the nθ dimensional space ontothe singleparameter
θ
i, byfreely estimatingtheremain- ingparameters. Hence,ifparameters are notindependent,the PL method tends to overestimate the width of the likelihood-based confidenceinterval.Asteptowardsremedyingthisissueistomod- ifythePLmethodtoholdouttwoparametersratherthanone,i.e., PL2θ
i,θ
j =min θk=i,jg
θ
k=i,j;M,K,θ
i,θ
j (22)Thisresultsina two-dimensionaldistributionwhich canbeanal- ysed in a similar wayto the one-dimensional PL[22], using the definition in Eq. (20). The PL2 results are plotted as topological surfaces[22].Thisprojectstheparameterspaceontotheplane of
θ
i andθ
j.Inadditionto diagnosingidentifiabilityissues,these plotscanbeusedtodiagnoseparameterinterdependence.Observe that sinceθ
ˆ has nθ free parameters while thePL2 estimate hasnθ−2,thisgivesndf=2forthecomputationofα fromthe
χ
2-distributioninEq.(20).
Applying a confidence threshold to the PL2 method produces confidence regions in the (
θ
i,θ
j) plane, rather than intervals in a single parameter. Based on confidence thresholds computed from theχ
2 distribution, a similar interpretation of these two- dimensional topologies can be applied to diagnose identifiability by requiringthat the regionis boundedin alldirections. Ifthere isan unbounded equipotential valleywitha log likelihoodbelow theα threshold,theparameterisstructurallynon-identifiable.If theintervalorregionisunboundedonlyinone direction,thisin- dicatesapractically non-identifiableparameter.Examples oftwo- dimensionalPLplotsare giveninSection4.Ifparameter interde- pendence is observed, re-parametrisationof the modelsuch that theinterdependencyisresolved,maybeadvisableinordertoob- tainamodelwithtighterconfidenceboundsontheestimatedpa- rameters.2.4.2. Interpretationofwideconfidenceregions
Itcan beargued thata wide confidenceregionisindicativeof anidentifiabilityissueeveniftheregionisbounded.Iftherangeof acceptableparametervaluesislarge,theinterpretationoftheesti- matedparameters asbeingdeterminedbythephysicalproperties ofthesystem,i.e.,S∈M()→M
θ
ˆS,isquestionable.
Onepossiblecauseofwideconfidenceboundsontheestimated parametersisthepresenceofnuisanceparameters,i.e.,parameters whosevalueisinsignificantforthemodelestimates.
2.4.3. Effectofconstrainedparameters
Observe that solving the two-step optimisation problem in Eq.(19)subjectedtotheconstraint
θ
∈imposesarestrictionon theidentifiedprofilePL(θ
i).Thisconstraintmayskewtheresults, since the remaining parametersθ
i=j are only considered within theregion.Ifparametersarenotindependent,theprofileofone parametermaybeinfluencedbytheconstraintsofanother.Inthe PL2method,the effectofconstrained optimisationof parameters is easierto diagnose, since dependent parameters can be identi- fiedfromthetopologyplots.2.5. Modelvalidation
The CTSM methodrequires evaluationof theresiduals to ver- ify that the assumption of Gaussian distributed residuals is jus- tified [5,15]. In the CTSM literature, the autocorrelation function (ACF)isusedtotestfornormalityofresidualsinthetime-domain, whileacumulativeperiodogram(CP)isusedinthefrequencydo- main[5,8,15].Therearealsoanumberofalternativetestsfornor- mality that can be applied, such as thezero-crossings test orthe Kolmogorov–Smirnovtest[3].
3. Casestudymodelandsimulation 3.1. Model
Athermalnetwork modelofabuildingcan be expressedasa resistor-capacitor(RC) circuit. Thesemodels are basedon a naive physical understanding of temperature variations in the building structure, which entailssimplifications that necessarily introduce modelling errors. The result is a simplified, lumped parameter model, which should be treated in the framework of grey-box modelling, and hence formulated as stochastic differential equa- tions(SDE)asinEq.(4)[5].
Fig. 1showsan exampleof a candidateRC model which was developed to approximate the thermal behaviour of the experi- mental building discussed in Section 3.2, partially based on the
Fig. 1. The R3C2 thermal network model of an experimental building can be ex- pressed as a resistor–capacitor equivalent circuit containing three resistors and two capacitors.
Fig. 2. Calibration data for the R3C2 model. The model outputs T b (red) and T w(blue) are plotted together with the outdoor temperature input T ∞(green). The input power Q is plotted separately. (For interpretation of the references to colour ˙ in this figure legend, the reader is referred to the web version of this article.)
Table 2
Nominal parameter values and min/max limits for resis- tances [K/W] and capacitances [J/K].
R b R w R g C b C w
θ0 0.100 0.100 0.250 1200 k 1200 k θmin 0.030 0.030 0.075 360 k 360 k θmax 0.170 0.170 0.425 2040 k 2040 k
R4C2 model presented in [1]. The model has two outputs: the room temperature Tb andthe wall surface temperatureTw, and twoinputs:theconsumedpowerbyanelectricheatingelementQ˙ andtheoutsidetemperatureT∞.Fivecomponentsformthemodel structure: the thermal resistance betweenroom airand wall Rb, thebuildingenvelopeRw,andthethermalresistanceofwindows anddoorsRg.Thetwo capacitancesCb andCwrepresentthether- malcapacitanceofthebuildinginteriorandenvelope,respectively.
Anominalparametervector
θ
0,listedinTable2,isusedastheini- tialvalueforparameterestimation.Additionally,thefeasiblevalues regionislimitedbyθ
minandθ
max,whicharechosenas0.3×θ
0and1.7×
θ
0, respectively.3.2. Calibrationdata
The calibration data used for parameter estimation was ob- tainedfromanexperimentalbuildinglocatedatCampusPorsgrunn of the University of South–Eastern Norway (USN). The data was collected by multiple data acquisition systems,each producing a separatedatasubset,andcombinedintoaconsistentdatasetinthe preprocessingstep.Thedatawasfirstfilteredtoremovenoiseand
Fig. 3. Illustration of Kalman Filter (KF) with externally simulated (SIM) state prop- agation.
subsequentlyresampledintoauniformtemporalscale.Inorderto maintainmeasurementuncertaintyafter preprocessing,arandom noisecomponentofcovariance0.1wasadded tothetemperature measurements.TheresultingdataispresentedinFig.2.
3.3.RCSimulator
Thechoiceofmodelstructureforathermalnetworkmodel,i.e., theRCcircuit,usuallyinvolvessignificantexperimentation[1,7,16]. Tosimplify,andpossiblyautomate,theprocessoffindingappropri- atemodelstructures,itisusefultosimulatesuch modelswithout requiringexplicitmodelequations.Sincethethermalnetworksare modelledasRCcircuits,itisnaturaltolooktotheelectronicsfield wherecircuitsareoftensimulatedusingtoolssuchasSPICE[33].A circuitsimulatorcanbeusedtopropagatethestate,hencereplac- ingthedrifttermofEq.(4),asillustratedinFig.3.Usingthisset- upwiththeparameter estimationmethodinSection 2.2requires a KF implementation that can handle non-differentiable models, suchasUKFandEnKF.
A simple circuit simulator is constructed, named RCSimulator forreferenceinthesequel. Circuit simulatorstypically define the circuit model asa list ofinterconnected components,which can be takendirectlyfromthe schematicinFig.1.By convention,all componentshave two terminalsnamed in and out. Each node is assignedaninteger indexwhich isusedtoconfigure theconnec- tionsofthecomponentsasacircuit.Forexample,lettingnodeTb haveindex 1and Tw index2,the component Rb would havein- put/outputassignment(1,2).Foreachnodeinthecircuit,Kirchoff’s nodecurrentlawisusedtobalancetheflowinandoutofthenode [34]. The system of node equations can be written in difference form:
Axk+Amxk−1+Buk=0 (23)
Thecontributionsfromallcomponentsaresummedtogether,such that rows i in A, Am, and B constitute the balance equation for node i. Eq. (23) is solved for xk at each time-step in or- der to propagate the state. The only dynamicelement is the ca- pacitor,which is implemented using an implicit Eulerdiscretisa- tion,
dxdt
tk≈xk−xtk−1, by contributingto both the A andAm ma- trices.Voltagesources areimplementedasconstraintsonthedif- ferencebetweenthestatesofthetwoconnectednodes.Themea- surementEq.(5)canbe implementedasmeasuring thepotential betweenselectednodesintheRCcircuit.
The simulation scheme,andin particular the discretisation of the capacitive elements, could be extended with more accurate approximationssuch asthe Runge–Kutta 4th order (RK4) scheme [35].It isalso possibleto introduce non-linearcomponents,such asvariableresistors. Observethatwhilethetest casemodelused hereislinear,themethodofestimatingresidualswithUKForEnKF forexternallysimulatedmodelshasnosuchrestriction.
3.4.Discretetimelinearmodel
Forcomparison,themodelisalsoexpressedinastandardlinear statespaceform
dx
dt =Axt+But+Gwt (24)