• No results found

Neural networks for parameter estimation in microstructural MRI: application to a diffusion-relaxation model of white matter

N/A
N/A
Protected

Academic year: 2022

Share "Neural networks for parameter estimation in microstructural MRI: application to a diffusion-relaxation model of white matter"

Copied!
12
0
0

Laster.... (Se fulltekst nå)

Fulltekst

(1)

ContentslistsavailableatScienceDirect

NeuroImage

journalhomepage:www.elsevier.com/locate/neuroimage

Neural networks for parameter estimation in microstructural MRI:

Application to a diffusion-relaxation model of white matter

João P. de Almeida Martins

a,b,1,

, Markus Nilsson

a,1

, Björn Lampinen

c

, Marco Palombo

d

, Peter T. While

b,e

, Carl-Fredrik Westin

f,g

, Filip Szczepankiewicz

a,f,g

aDepartment of Clinical Sciences, Radiology, Lund University, Lund, Sweden

bDepartment of Radiology and Nuclear Medicine, St. Olav’s University Hospital, Trondheim, Norway

cDepartment of Clinical Sciences, Medical Radiation Physics, Lund University, Lund, Sweden

dCentre for Medical Image Computing and Department of Computer Science, University College London, London, United Kingdom

eDepartment of Circulation and Medical Imaging, NTNU-Norwegian University of Science and Technology, Trondheim, Norway

fRadiology, Brigham and Women’s Hospital, Boston, MA, United States

gHarvard Medical School, Boston, MA, United States

a b s t r a c t

Specificfeaturesofwhitemattermicrostructurecanbeinvestigatedbyusingbiophysicalmodelstointerpretrelaxation-diffusionMRIbraindata.Althoughmore intricatemodelshavethepotentialtorevealmoredetailsofthetissue,theyalsoincurtime-consumingparameterestimationthatmayconvergetoinaccurate solutionsduetoaprevalenceoflocalminimainadegeneratefittinglandscape.Machine-learningfittingalgorithmshavebeenproposedtoacceleratetheparameter estimationandincreasetherobustnessoftheattainedestimates.Sofar,learning-basedfittingapproacheshavebeenrestrictedtomicrostructuralmodelswitha reducednumberofindependentmodelparameterswheredensesetsoftrainingdataareeasytogenerate.Moreover,thedegreetowhichmachinelearningcan alleviatethedegeneracyproblemispoorlyunderstood.Forconventionalleast-squaressolvers,ithasbeenshownthatdegeneracycanbeavoidedbyacquisitionwith optimizedrelaxation-diffusion-correlationprotocolsthatincludetensor-valueddiffusionencoding.Whethermachine-learningtechniquescanoffsettheseacquisition requirementsremainstobetested.Inthiswork,weemployartificialneuralnetworkstovastlyacceleratetheparameterestimationforarecentlyintroduced relaxation-diffusionmodelofwhitemattermicrostructure.Wealsodevelopstrategiesforassessingtheaccuracyandsensitivityoffunctionfittingnetworksanduse thosestrategiestoexploretheimpactoftheacquisitionprotocol.Thedevelopedlearning-basedfittingpipelinesweretestedonrelaxation-diffusiondataacquired withoptimalandsub-optimalacquisitionprotocols.Networkstrainedwithanoptimizedprotocolwereobservedtoprovideaccurateparameterestimateswithinshort computationaltimes.Comparingneuralnetworksandleast-squaressolvers,wefoundtheperformanceoftheformertobelessaffectedbysub-optimalprotocols;

however,modelfittingnetworkswerestillsusceptibletodegeneracyissuesandtheirusecouldnotfullyreplaceacarefuldesignoftheacquisitionprotocol.

1. Introduction

Microstructure imaginguses compartment modelling of diffusion MRI (dMRI) data with the aim to map specific tissue quantities (Alexanderetal.,2019;Nilssonetal.,2013;Novikovetal.,2019).A centralgoalin microstructureimaginghasbeentoestimate thevol- umefractionsof differentmicrostructuralcomponentssuchas axons (Lampinen et al., 2020, 2019; Veraart etal., 2018). Estimating vol- umefractionsratherthansignalfractionsischallenging,however,be- causeitrequiresthesimultaneousestimationofbothdiffusionandre- laxationpropertiesofthedifferentmodelcompartments.Thiskindof inverseproblemissensitivetodegeneracyissues(Jelescuetal.,2016; Lampinenetal.,2019),inwhichmultiplesetsofmodelparameterscan describetheacquireddataequallywell.Parameterestimationcanalso becomputationallyslow,preventingreal-timemapping.Apotentialso- lutionistoemploymachinelearningtoacceleratetheparameteresti-

Correspondingauthorat:DepartmentofClinicalSciences,Radiology,LundUniversity,Lund,Sweden.

E-mailaddress:[email protected](J.P.deAlmeidaMartins).

1 Theseauthorscontributedequallytothiswork.

mationprocess(Golkovetal.,2016).However,thecurrentliterature lackssystematicassessmentsoftheadvantagesanddrawbacksofthis approach, whichissurprisingconsideringtheexponentialincreasein interestforsuchmethods.

Artificialneuralnetworks (ANNs)andothermachinelearningap- proaches have been applied previously toaccelerate the estimation of microstructure parametersfrom dMRIdata(Barbieri etal., 2020; Bertleff etal.,2017;Golkovetal.,2016;Grussuetal.,2020;Gyorietal., 2019;Hilletal.,2021;Kaandorpetal.,2021;Nedjati-Gilanietal.,2017; Palomboetal.,2020;Reisertetal.,2017).Forexample,arandomfor- estregressorhasbeenusedtofitacompartmentmodelforwhitemat- ter(WM)microstructureinthepresenceofwaterexchange (Nedjati- Gilanietal.,2017)andtofittheSANDImodelforgreymatterproperties (Palomboetal.,2020).Reisertetal.(2017)appliedmachinelearning toaBayesianestimationapproachwhichdramaticallyacceleratedthe fitting of two-andthree-compartmentmodels.Barbieri etal.(2020)

https://doi.org/10.1016/j.neuroimage.2021.118601.

Received30March2021;Receivedinrevisedform26August2021;Accepted18September2021 Availableonline22September2021.

1053-8119/© 2021TheAuthors.PublishedbyElsevierInc.ThisisanopenaccessarticleundertheCCBY-NC-NDlicense (http://creativecommons.org/licenses/by-nc-nd/4.0/)

(2)

appliedANNstotheintra-voxelincoherentmotionmodel.Animpor- tantopenquestion,however,iswhatimpactthetrainingstrategyhas onthefittingperformance.Thisisparticularlyrelevantwhenappliedto non-linearmulti-compartmentmodelswithmanyindependentmodel parameters,whichwehererefertolooselyas‘high-dimensionalmod- els’.Thegenerationoftrainingdatascalespoorlywiththenumberof modelparameters,assamplingeachcombinationofpmodelparame- tersinmstepsrequiresmp samples. Aspincreases,itisunavoidable thatafinitesetofsamplesbecomessparseinthep-dimensionalmodel parameterspace,riskingselectionbias.Here,weinvestigatetheimpact ofdifferentsamplingpatternswithinthisspaceontheperformanceof theneuralnetwork.

Apartfromacceleratingmodelfitting,neuralnetworksmayinprin- ciplealsoreducetherequirementsontheimagingprotocolbylearning priorsfromtrainingexamples(Golkovetal.,2016).Forexample,neu- ralnetworkshavebeenusedtolearnamappingbetweenfully-sampled andsub-sampleddatasets,whichcaninturnbeusedtostabilisemodel fittingperformanceagainstsubstantialdegreesofdatadown-sampling (Alexanderetal.,2017;Tianetal.,2020).However,wedonotexpect machinelearningapproachestocompletelyalleviatedegeneracyissues.

Indeed,forcaseswheretheacquisitionprotocoldoesnotprovidesuf- ficientinformationtoresolvebetweendifferentparametervalues,the learning-basedestimateswillsimplyequalthemeanofthemodelpa- rameterdistributionusedfortraining(Reisertetal.,2017).

Theaimsofthisstudyweretocomparetrainingstrategies,topro- pose tools toevaluate the performanceof modelfitting neuralnet- works,andtotesttowhatdegreeneuralnetworkscansolveproblems withdegeneracy.Asatestbed,weuseahigh-dimensionalrelaxation- diffusionmicrostructuremodelofWM(Lampinenetal.,2020, 2019; Veraartetal.,2018).Forthismodel,parameterestimationisenabled bystate-of-the-artimagingprotocolsfeaturingso-calledb-tensorencod- ing(Topgaard,2017; Westin et al., 2016) combined with diffusion- relaxationcorrelations(deAlmeida Martinsetal., 2020;deAlmeida MartinsandTopgaard,2018;Lampinenetal.,2019).Weinvestigated theabilityofneuralnetworkstospeedupmodelfitting,andexplored theextenttowhichtheycanoffsettherequirementsontheacquisition protocol.

2. Theory

Whitemattermicrostructurecanbemodelledbymultiplecompart- mentswithdifferentmicrostructuralpropertiesbutacommonorien- tationdistribution (Alexanderetal., 2019; Novikovetal., 2019).In thisdescription,themeasuredsignalistheconvolutionbetweenanori- entationdistributionfunction(ODF)P(𝒏̂)andamicrostructuralkernel K(𝒖̂𝒏̂)

𝑆(𝒖̂)=

𝒏|=1𝑃(𝒏̂)𝐾(𝒖̂𝒏̂)d𝒏̂, (1) where𝒏̂and𝒖̂areunitvectorsdefiningthesymmetryaxesoftheODF andofthediffusionencodingprocess,respectively.Notethatthemi- crostructuralkerneldepends on therelativeanglebetween 𝒏̂ and𝒖̂, cos𝛽=𝒖̂𝒏̂.Inthiswork,weassignaneffectivetransverserelaxation timeT2andanapparentmicroscopicdiffusiontensorDtoeachcom- ponent,anduseexponentiallydecayingfunctionstomodeltheeffect ofthesemicrostructuralpropertiesontherelaxation-diffusion-weighted signal(Veraartetal.,2018).Undertheseassumptions,themicrostruc- turekerneliswrittenasaweightedsumofexponentials

𝐾(𝒖̂𝒏̂)=𝑆0

𝐽 𝑗=1

𝑓𝑗exp(

𝐁(𝒖̂)∶𝐃𝑗(𝒏̂)) exp

(

𝜏E

𝑇2;𝑗

)

, (2)

correspondingtoamixtureofJcomponentswithsignalfractionfj,trans- verserelaxationtimeT2;j,anddiffusiontensorDj.Thecolon“:” denotes theFrobeniusinnerproduct,B:D=∑

𝑖

𝑗𝐵𝑖𝑗𝐷𝑖𝑗.InformationaboutT2;j andDjisencodedintothesignalbytheechotime𝜏Eanddiffusionen-

codingtensorB(𝒖̂),respectively,bothofwhichareexperimentalvari- ables.Tosimplifythemodel,weonlyconsideraxisymmetricB(𝒖̂)and additionallyassumethatthecomponent-wiseDjareaxisymmetric.

TheconvolutionexpressedinEq.(1)canbesimplifiedbyfactorizing bothP(𝒏̂)andK(𝒖̂𝒏̂)intheirsphericalharmoniccoefficientsplmand klm,respectively:

𝑃(𝒏̂)=∑

𝑙

𝑚 𝑝𝑙𝑚𝑌𝑙𝑚(𝒏̂), (3)

and 𝐾(𝒖̂𝒏̂)=∑

𝑙

𝑘𝑙0𝑌𝑙0(𝒖̂𝒏̂), (4)

whereYlmarethesphericalharmonicsbasisfunctions 𝑌𝑙𝑚,Φ)=

√ 2𝑙+1

4𝜋 (𝑙𝑚)!

(𝑙+𝑚)!𝐿𝑚𝑙(cosΘ)exp(𝑖𝑚Φ), (5) withthe𝐿𝑚𝑙(x)termdenotingtheassociatedLegendrepolynomials.The summationsinEqs.(3)arecarriedoutfororderl=0,1,2,…,andde- greem=−l,−l+1,…,l.InEq.(4),wehavetakentheaxialsymmetryof themicrostructuralkernelK(𝒖̂𝒏̂)intoaccount(Lampinenetal.,2020; Novikovetal.,2018).Symmetryaroundthepolaraxisimplieskl’m=0 foreitherm’≠0oroddl’.Takentogether,thismeansthatthekl’mco- efficientsarereducedtotheir0thdegreetermskl0(typicallywrittenas kl)andonlyeven-orderedsphericalharmonicterms(l’=0,2,…)pro- videnon-trivialcontributions.Usingthesphericalharmonicsaddition theorem,Eq.(4)canberewrittenas

𝐾(𝒖̂𝒏̂)=∑

𝑙

𝑘𝑙0 𝑙

𝑚=−𝑙

𝑌𝑙𝑚(𝒖̂)̄𝑌𝑙𝑚(𝒏̂)

√ 4𝜋

2𝑙+1. (6)

InsertingEqs.(3)and(6)intoEq.(1)andmakinguse oftheor- thonormalityofthesphericalharmonicsbasisfinallyyields(Driscolland Healy,1994;Healyetal.,1998)

𝑆(𝒖̂)=∑

𝑙

𝑚 𝑘𝑙0𝑝𝑙𝑚𝑌𝑙𝑚(𝒖̂)

√ 4𝜋

2𝑙+1, (7)

where𝒖̂canbeparameterizedbythepolarandazimuthalangles,𝜃and 𝜙,describingtheorientationofB,𝒖̂ ≡(sin𝜃cos𝜙,sin𝜃sin𝜙,cos𝜃).

Thesphericalharmoniccoefficientsofthemicrostructurekernel(kl0) andtheODF(plm)areestimatedastheinnerproductsbetweenagiven sphericalharmonicsbasisfunctionYlmandeitherK(𝒖̂𝒏̂)orP(𝒏̂).Due totheorthonormalityofthesphericalharmonicsbasis,theinnerprod- uctsaregivenbymultiplicationwiththecomplexconjugatesoftheYlm, followedbyintegrationsovertheunitsphere.Forthemicrostructural kernel,thisprocedureresultsin(Lampinenetal.,2020)

𝑘𝑙0𝑘𝑙=𝑆0

𝐽 𝑗=1

𝑓𝑗

4𝜋(2𝑙+1)I𝑙𝑗exp(

𝑏𝐷I;𝑗(

1−𝑏Δ𝐷Δ;𝑗)) exp

(

𝜏E

𝑇2;𝑗

) ,

(8) where bis the conventional b-value andbΔ denotesthe normalized anisotropyofthediffusionencodingtensorB(Erikssonetal.,2015).The isotropicdiffusivityandthenormalizeddiffusionanisotropy(DIandDΔ) arerelatedtotheaxialandradialdiffusivities(D||andD)ofthediffu- siontensoraccordingtoDI=(D||+2D)/3andDΔ=(D||D)/3DI (Conturoetal.,1996);initsprincipalaxis,agivenDcanthusberep- resentedby adiagonalmatrixparametrized asdiag(DI (1 −DΔ), DI (1−DΔ),DI (1+2DΔ)).TheIljfactorsareafunctionoftheregular Legendrepolynomials,Ll,anddefinedas

𝐼𝑙𝑗=

1 0

exp(

𝛼𝑗𝑥2)

𝐿𝑙(𝑥)d𝑥, (9)

with𝛼j=3bDI;jbΔDΔ;j.

DifferentdiffusionMRImodelsfeaturedifferent numbersofcom- ponentsandimposedifferentconstraintsonthecomponentproperties.

Hereweconsideratwo-compartmentmodel(J=2)comprisinga“stick”

(3)

component(S)withDΔ;S=1anda“zeppelin” (Z)componentwithDΔ;Z

≠1.Truncatingthesphericalharmonicsummationatthesecondorder (lmax=2)thenyieldsthesignalaccordingto

𝑆(𝒆,𝒎)=𝑆0

[ 𝑓Sexp(

𝑏𝐷I;S

(1−𝑏Δ

))

× (

I0;S+4𝜋I2;S

𝑚 𝑝2𝑚𝑌2𝑚(𝜃,𝜙) )

exp (

𝜏E

𝑇2;S

)

+( 1−𝑓S

)exp(

𝑏𝐷I;Z

(1−𝑏Δ𝐷Δ;Z

))

× (

I0;Z+4𝜋I2;Z

𝑚 𝑝2𝑚𝑌2𝑚(𝜃,𝜙) )

exp (

𝜏E

𝑇2;Z

)]

, (10)

wherem∈{−2,−1,0,1,2}.ThederivationofEq.(10)usesthe𝑝00= 𝑌00=1∕√

4𝜋ODFnormalization(Lampinenetal.,2020;Novikovetal., 2018).Thevectors eandmcapturethe experiment-related parame- ters,e=(𝜏E,b,bΔ,𝜃,𝜙),andscalarmodelparameters,m=(fS,DI;S, DI;Z, DΔ;Z, T2;S, T2;Z, p20, Re(p21), Im(p21), Re(p22), Im(p22)), where Re(𝑝𝑙𝑚)=(𝑝𝑙𝑚+(−1)𝑚𝑝𝑙𝑚)∕2andIm(𝑝𝑙𝑚)=(𝑝𝑙𝑚−(−1)𝑚𝑝𝑙𝑚)∕2𝑖denote therealandimaginaryparts oftheplm coefficients,respectively. We refertothemodelexpressedbyEq.(10)astheStandardModelwithRe- laxation(SMR).Thisnameischosentomarkitsdescendancefromthe

“standardmodel” ofWMmicrostructure(Novikovetal.,2019)andto emphasizethefactthatitaccountsforcompartment-specificT2times.

TheSMRmodelparameterscanbedeterminedbyfittingEq.(10)di- rectlytotheacquiredsignals(Lampinenetal.,2020).Analternative strategyistofittosomerepresentationofthesignal,suchasthespher- icalharmonicscoefficients.Veraartetal.(2018)usedamodelfitting frameworkthateffectivelyreducesthedimensionalityoftheparameter spacebymeansofperformingarotationallyinvariantfactorizationof thevoxel-wiseODFs(Novikovetal.,2018;Reisertetal.,2017).Theini- tialstepofsuchframeworkconsistsinprojectingthemeasuredsignal ontoasphericalharmonicbasis

𝑆(𝒖̂)=∑

𝑙

𝑚 𝑆𝑙𝑚𝑌𝑙𝑚(𝒖̂). (11)

TheSlmcoefficientsaresubsequentlyconvertedtorotationalinvariants Sl,andfittedtothecorresponding rotationallyinvarianttermsofthe P(𝒖̂)⊗K(𝒈̂𝒖̂)convolution

𝑆𝑙=𝑝𝑙𝑘𝑙, (12)

whereklisthe0thdegreetermofthemicrostructuralkernelasdefined byEq.(8).Therotationallyinvariantcoefficients,Slandpl,arecom- putedfrom(Novikovetal.,2018)

𝑥𝑙=

√ 4𝜋

𝑚||𝑥𝑙𝑚||2

(2𝑙+1) , (13)

wherexlmarethesphericalharmonicscoefficients,andxlSlorxlpl.Atsufficientlylowb-values,signalprojectionswithl>2havesmall contributionstothemeasuredsignalJespersenetal.,2007)andthesum inEq.(11)istypicallytruncatedatthesecondorderterm(l=2).Thefit- tingframeworksummarizedbyEqs.(11)–((13)iscommonlyreferredto asthe“RotInv” approachduetoitsuseofrotationalinvariants.Thel=2 RotInvapproachcondensesthefivep2m,m∈{−2,−1,0,1,2}parameters oftheSMRmodelontoasinglep2invariantcapturingtheorientation coherenceofthesub-voxeldiffusiondomains,thusreducingthedimen- sionalityofthefittingproblembyfourparameters.

3. Methods

3.1. Neuralnetworkarchitectureandtraining

Inthiswork,weconstructedfeedforwardneuralnetworksinMAT- LABR2020b(The MathWorks,Inc.), andusedthemtofitvectorsof scalar parameters, m = (fS, DI;S, DI;Z, DΔ;Z, T2;S, T2;Z, p20, Re(p21),

Im(p21),Re(p22),Im(p22)), tosets ofmeasurementsS(𝜏E, B).Weex- ploredvariousnetworkdesignswithdifferentnumbersofhiddennodes and/orlayersbeforedecidingontwofinalnetworkarchitectures:anar- tificialneuralnetworkfeaturing3fullyconnectedhiddenlayerswitha decreasingnumberofnodes(180,80,and55)andadeeper/widerneu- ralnetworkfeaturing4fullyconnectedhiddenlayerswith250nodes each. Allhidden layerswere activated byhyperbolic tangent (tanh) functionsandthedeeper/widernetworkalsofeaturedbatchnormaliza- tionlayersbetweenthefullyconnectedinnerlayersandtheirrespective tanhactivations.Todistinguishthenetworks,werefertothemasthe shallowerneuralnetwork(SNN)anddeeperneuralnetwork(DNN),re- spectively.BothSNNandDNNcompriseanoutputlayerwith11nodes corresponding totheparameters in m.The inputcompriseda given number(E) ofsignalsamplesacquiredwithapre-definedrelaxation- diffusionencodingprotocol.Weconsideredthreedifferentacquisition protocols;withE=164,E=242,andE=270samples(𝜏E,B).Indepen- dentnetworksweretrainedforeachprotocol,meaningthat3SNNsand 3DNNswereevaluated.ToremovetheinfluenceofS0fromthefitting problem,wenormalizedtheinputvectortothemediansignalacquired atthelowestb-valueandshortestecho-time.

Supervisednetworktrainingwasperformedusingameansquared errorloss

MSE=‖𝒎targ𝒎net22, (14)

wheremtargistheground-truthtargetvector,mnetisthecorresponding networkoutputvector,and

|| ⋅ ||2 denotesthe Euclidean norm. The mtarg parameters were rescaledbetween0and1usingamin-maxnormalizationstrategybe- forebeingsuppliedtothenetworks.Thenetworksweretrainedwithsets ofvoxelswithrandomlygeneratedmodelparametersandnoisysignal samplesS(𝜏E,B),asdetailedinSection3.2.TheSNNsweretrainedwith abatchsizeof0.5⋅106andascaledconjugategradientoptimiser.The DNNsweretrainedinamini-batchfashionusingatotalof5⋅106training sets,amini-batchsizeof50⋅103,andanAdamoptimiser.Throughout, trainingdatawasdividedsuchthat75%oftheoriginaldatawasused toupdatetheweightsandbiasesand25%wasusedforcross-validation.

Overfittingwasaddressedbyanearlystoppingmethodthatterminated trainingfollowinganincreaseoftheMSEofthevalidationdatafor5 (SNN)or20(DNN)consecutiveepochs.

NetworkGPUtrainingtookapproximately83minfortheSNNsand 74minfortheDNNsontwoparallelNVIDIAGeForceRTX2080SUPER, eachwith8GBofmemory.Bothgraphiccardswereinstalledonahigh- endconsumer-gradedesktopcomputerwith32GBmemoryandan8- coreInteli9–9900k3.6GHzCPUwith2threadspercore.

3.2. Generatingtrainingdata

Westudiedtheimpactoftrainingdatagenerationstrategiesonthe networkperformance,includingtrainingbasedonuniformlysampled andrealbraindata.Trainingparametervectorswerecreatedbytwo strategies:

- munifwassyntheticallyconstructedbyrandomsamplingofuncorre- lateduniformdistributionswithintheboundsdescribedinTable1; - mbrainwasconstructedfrominvivobraindatabyrandomlysampling parametervectorsestimatedfromaNLLSfitofEq.(10).Thisdataset containsparametercorrelationsfoundinatypicalbraindatasetfrom ahealthyadult.

Thembrainvectorscomprisethesolutionsofanonlinearleast-squares (NLLS)fitofEq.(10)toinvivosignaldata,referredtoasmfit,together withanadditionalparametersetmmut,consistingofrandommutations ofthefittedsolutions,givenby

𝒎mut=𝑿◦𝒎f it, (15)

where‘◦’denotestheelement-wise(Hadamard)product,andXis an 11-dimensionalvectorofnormallydistributednumbers.Eachelement

(4)

Table1

SMRparameterbounds.ThediffusivityboundswereenforcedbylimitingD||;S, D||;ZandD⊥;Z tothe[0.2,4.0]𝜇m2/msinterval.ForT2;S andT2;Z,thelower boundremovestheinfluenceoftheassumedlyfully-attenuatedmyelinwater, andthelargeupperboundofT2;Zenablesittocaptureeffectsofincreasedvalues inwhitematterlesions(Lampinenetal.,2019)aswellaspossiblecontamination withcerebrospinalfluidwhichisexpectedtohavealargerinfluenceonthemore isotropiczeppelincompartment(Lampinenetal.,2020).

Bounds f S D I;S[μm 2/ms] D I;Z[μm 2/ms] D Δ;Z T 2;S[ms] T 2;Z[ms]

Minimum 0 0.07 0.2 0.46 30 30

Maximum 1 1.33 4.0 0.86 300 1000

ofXisanindependentandidenticallydistributedrandomvariablesam- pledfromanormaldistributionwithmean1andstandarddeviation 0.3.ThestandarddeviationofXwaschosenfollowingbriefinsilicoex- perimentswhichrevealedthatvirtuallyindistinguishabletraining/test results areobtained forstandarddeviations withinthe[0.2, 0.5]in- terval,providedall othertraining/network parametersarekeptcon- stant.Thenumberofmfitvectorswaskeptconstant(𝑛f it≈1.5⋅105),and thetotalnumberofmutatedvectorswasdefinedas𝑛mut=𝑛brain𝑛f it. Theintroductionofmutatedparametersisadataaugmentationtech- nique,designedtosimultaneouslycompensatefortherelativelownum- berofmfitvectorsandexpandthe(fS, DI;S,DI;Z,DΔ;Z,T2;S, T2;Z,p20, Re(p21), Im(p21), Re(p22), Im(p22)) domain of the mbrain parameter targets.

Thetrainingvectors,mtrain,werecombinationsofmbrainandmunif parametervectors.Usingagiventotalnumberofvectors(𝑛tot)andvary- ingnumberofmbrainparameters(𝑛brain),wemodulatedthefractionof invivobraindata,𝑓brain=𝑛brain𝑛tot,between0and1instepsof0.05.

TheSNNtrainingsetscontainedatotalof𝑛tot=5⋅105parametervec- tors,whiletheDNNtrainingsetscontained𝑛tot=5⋅106.Fig.S1inthe SupportingInformationshowsthedistributionofmfit,mmut,andmunif parametersthatcomposeatypical𝑛tot=5⋅105SNNtrainingdataset.

Signaldataweregeneratedfrommtrain usingEq.(10)andone of threedifferent(𝜏E,B)acquisitionprotocols:

- Theoptimizedprotocolcomprisestensor-valuedencodingwithfull relaxation-diffusion-correlationoptimizedforminimalSMRparam- etervariance(Lampinenetal.,2020)

- The unoptimized protocol comprises tensor-valued encoding with relaxation-diffusion-correlations restricted to low b-values (Lampinen et al., 2019). This protocol was an early attempt to design a diffusion-relaxation protocol with b-tensor encoding. It preceded the optimized protocol and was configured to fit into an available timeslot by following heuristics without a formal performanceoptimization,andwaslaterfoundtoyielddegenerate resultsinwhitematter(Lampinenetal.,2020).

- TheLTE-onlyprotocolcomprisesdiffusion-relaxationoptimizedfor minimalSMRparametervariancebutincludesonlylinearb-tensor encoding(bΔ =1)(Lampinenetal.,2020).Justastheunoptimized protocolithasbeenfoundtoyielddegeneratesolutions inwhite matter.

Additionaldetailsonthevariousprotocolscanbefoundintheirre- spectivereferencesandinTableS1oftheSupportingInformation.We emphasizethatalltrainingdatausedinthisstudywasgeneratedusing theSMRforwardmodel,Eq.(10),ratherthanusingrawinvivobrain data.

Noise was sampled from the Rice distribution andadded tothe ground-truthsyntheticsignals.Because relaxation-diffusionMRI data comprisesvoxelswithdifferentsignal-to-noiseratio(SNR),theampli- tudeoftheSNRatS0=S(𝜏E=0,B=0)wasuniformlyvariedinthe intervalSNR∈[80,160].Consideringtherelaxation-diffusionproper- tiesoftypicalhealthyWM(T2≈70ms,DI≈0.9𝜇m2/ms),thischoice resultsin SNR∈[30,60]atthepointof maximalsignalof theopti- mizedprotocol(𝜏E=63ms,b=0.1ms/𝜇m2),SNRamplitudesthatare

consistentwithtensor-valueddMRImeasurementsoftheinvivobrain (Szczepankiewiczetal.,2019a).Finally,networksweretrainedusing mtrainvectorsastargetsandtheircorrespondinginsiliconoisysignalsas inputs.

3.3. Networkevaluation

Tofindtheoptimalfractionofmunifandmbrainparameters(adjusted bythefbrainparameter),wetrainedSNNswithvaryingvaluesoffbrain, deployedthemoninsilicodatageneratedfromanunseensubject,and comparedthevariousnetworksintermsofaccuracyoftheresultingpa- rameterestimates.Networkaccuracywasassessedvianormalizedroot- mean-squarederrors(NRMSE)andlinearcorrelationswithground-truth valuesintermsofthePearsoncorrelationcoefficient(𝜌).TheNRMSE captures theabsoluteagreementbetweenthetargetground-truthpa- rametersandtheircorrespondingnetworkestimates,whereas𝜌captures thelineartarget-to-estimatecorrelationstrength.Thefbrainoptimization processisdiscussedinmoredetailinsectionS3oftheSupportingInfor- mation.Briefly,thefbrainhyper-parametercontrolsatrade-off between accuracytoWM-relevantparametersandnetworkgeneralizability,and wefoundfbrain=0.5toprovideanoptimalbalancebetweenaccuracy andgeneralizability.Fromthispointonward,weconcentrateonnet- workstrainedwithfbrain=0.5datasetsandevaluatethemin further detailusingcorrelationplots.

Theaccuracyperformanceofanfbrain=0.5SNN,anfbrain=0.5DNN, andastandardNLLSsolverwerecomparedonthebasisofNRMSEsand Pearsoncorrelationcoefficients.Thecomparisonwasperformedusing twodistinctinsilicodatasets:onebasedonmfitvectorsfromWMand deepGMdata(mfit;WM-like),andanotherbasedonmunifvectors.Each datasetcomprisedatotalof10⋅103parametervectorsandtheirrespec- tiveinsilicosignals.Theground-truthsyntheticsignalswerecorrupted withRiciandistributednoiseandtheSNRattheS0pointwassampled uniformlyfromthe[80,160]range.

The effects of different acquisition protocols on network perfor- mance wereevaluatedintermsofNRMSEandsensitivitytoparame- terchanges.Thelatterwasgaugedbymodulatingthenon-orientational parametersofanSMRsolution(fS,DI;S,DI;Z,DΔ;Z,T2;S,T2;Z)oneata timeby10%andmeasuringtheresponseinallparameters.Theorig- inalparametersetwasbasedoninvivodatafromthecoronaradiata wherefS=0.45,DI;S=0.58𝜇m2/ms,DI;Z=1.36𝜇m2/ms,DΔ;Z=0.44, T2;S =69ms,T2;Z=60ms(Lampinenetal.,2020).Subsequently,in silicodatasetsweregeneratedforeachofthe6modulateddatasets,Rice noisewasaddedwithSNR=160atS0,andparameterestimationwas performedwithprotocol-specificnetworks.

ToinvestigateifthereducedparameterspaceofRotInvfittingim- pactstheperformanceofANN-basedfitting,wetrainedanSNNusing rotationallyinvariantinsilicodatasetsandthesameoptimalfbrainvalue foundfortheSMRnetworks.RotInvtrainingvectors,mtrain;RI,weregen- eratedfromthemtrainvectors(Section3.2),usingEq.(13)toconvert thefullSMRparameterstoRotInvparameters(fS,DI;S,DI;Z,DΔ;Z,T2;S, T2;Z,p2).TheRotInvinsilicosignaldatawasgeneratedinfoursteps:

(1) signalswerecalculatedusing mtrainandEq.(10);(2) noisewas added totheinsilico signaldatawitha SNR∈ [80,160]at S0;(3) SlmcomponentswereestimatedbyprojectingthenoisyS(𝜏E,B)signals toa sphericalharmonicsbasis; and(4)Sl,l={0,2} signalswerecalcu- latedfromSlmusingEq.(13).AswiththefullSMRmodel,trainingwas performedusingmtrain;RI astargetsandtheircorrespondingsynthetic noisysignalsasANNinputs.

TrainedSMR(RotInv)networks weretestedon previouslyunseen munif (munif;RI) andmbrain (mbrain;RI) synthetic datasets atan SNR∈ [80,160]atS0.Performancewascomparedintermsoftheirrespec- tivetarget-estimatecorrelations.Allnetworksweretrained/testedina leave-one-outfashionwherethetrainingandtestingmbrain (mbrain;RI) datasets were generated using in vivo data from different subjects (Section3.5).

(5)

3.4. Invivodataacquisition

Weanalyseddatafromthreeadultvolunteerspreviouslyreported in (Lampinen et al., 2020). The study was approved by the re- gional ethical review board in Lund and written informed consent was obtained from all volunteers prior to scanning. Measurements wereperformedonaMAGNETOMPrisma3Tsystem(SiemensHealth- care,Erlangen, Germany) using a prototype spin-echo EPI sequence that facilitates user-defined gradient waveformsfor diffusionencod- ing(Szczepankiewiczet al., 2019a).Datawere collectedusing echo timesbetween 63and130ms, repetitiontimeof3.4s,voxelsizeof 2.5mm3,40slices,matrix-sizeof88×88,in-planeandthroughplane accelerationfactorof 2×2(GRAPPA), partial-Fourierof 3/4,band- width=1775Hz/pixel,and“strong” fatsaturation.Diffusionencoding wasperformedwithgradientwaveformsoptimizedtomaximizetheen- codingstrengthperunittimeandtosuppressconcomitantfieldeffects (Sjölund etal.,2015; Szczepankiewiczetal., 2019b). Atotalof 270 combinationsof𝜏EandBwereused,accordingtotheoptimizedproto- colinTable S1oftheSupportingInformation.Totalacquisitiontime was15min.

3.5. Invivodataprocessingandparameterestimation

Priortoanalysis,allinvivodatawerecorrectedforeddy-currents andsubjectmotionusingElastiX(Kleinetal.,2009)withextrapolated targetvolumes(Nilssonetal.,2015).Susceptibility-inducedgeometric distortionswerecorrectedusingtheTOPUPtoolinFMRIBsoftwareli- brary(FSL)(Smithetal.,2004).Gibbsringingartefactcorrectionwas performedaccordingtothemethoddescribedin(Kellneretal.,2016).

Tosuppresstheinfluenceofnoise,wefiltereddatawitha3DGaussian kernelwithastandarddeviationof 0.45times thevoxeldimensions (Lampinenetal.,2020).

TheSMRmodelparameterswereestimatedfromavoxel-by-voxel NLLSfitofEq.(10)tothepost-processeddata.Thefittingprocesswas performed with the multidimensionaldMRI toolbox (https://github.

com/markus-nilsson/md-dmri)(Nilssonetal.,2018),withMATLAB’s built-inlsqcurvefitfunction.Toremoveoutliers,modelfittingwasper- formedtwiceineachvoxelandtheresultwithlowestresidualwasre- tained(Lampinenetal.,2020).Theinitialguessesweresampleduni- formlyfromtheparameterboundsinTable1.Theresultingestimates werestoredandusedtocomputeinsilicosignaldatafollowingtheproce- duredetailedinSection3.2.NLLSfittingofasingleinvivobraindataset tookapproximately8h(approximately5.5spervoxel)ontheCPUde- scribedinSection3.1.Thecomputationswerecarriedoutusingparallel computingandmulti-threading.

Finally,previouslytrainednetworkswereusedtoestimatethepa- rametersfrom Eq.(10) frominvivo data,which tookapproximately 2and20sforthewholebrainusingtheSNNandDNN,respectively.

Trainingwasperformedoninsilicomtrain datawithanoptimalfbrain fraction.Thetrainingprocessfollowedaleave-one-outscheme,where thenetworksweretrainedonsyntheticdatageneratedfromtwosub- jectsbeforebeingdeployed/testedonathird,previouslyunseen,sub- ject.Neuralnetworkfittingprovidedvoxel-wiseparametermapsthat werecomparedtotheonesobtainedfromaconventionalNLLSfitting approach.

4. Results

4.1. Neuralnetworkparameterestimates

SNN-basedparameterestimationwasapproximately104timesfaster thanNLLSfitting on thesame computer,andyieldedparametersin goodagreementwiththeground-truthtargetsandpreservedcontrast betweenregions characterizedbydistinct(T2, D) properties(Fig.1).

Forexample,theestimatedfSandp2arehighinWMregionsgenerally andhighestinorientationallycoherentWMregionssuchasthecorpus

callosum,similartothein-silicoground-truth.However,areducedcon- trastwasobservedintheT2;Zmaps,wherethedistinctionbetweenWM (darker) andcorticalGM(brighter)regionsis moreprominentin the ground-truthmap.TheT2;Zestimatesarealsocharacterizedbyconsid- erabledifferencesbetween ground-truthandestimatedparametersin thelongT2 regionssuchasthelateralventricles. Thelargestoverall discrepancybetweenestimatedandground-truthparameterswasfound forDΔ;Z,likelybecausethesignalisinsensitivetoitwhen|DΔ;Z|<0.5 (Erikssonetal.,2015).UsinganANNtrainedonsyntheticdatatodi- rectlyfitinvivoexperimentaldataresultedinnoisiermaps.Neverthe- less,itpreservedananatomicallyplausiblecontrast.Giventhestrong correlationsbetweeninsilicoground-truthmapsandnetworkestimates, thenoisierappearanceoftheinvivoparametermapsislikelybecause theSMRmodelcannotaccuratelyrepresenttheunderlyinginvivodata.

InvivoSNNparameterestimatesfromWMregionsofinterestsaredis- playedinTableS2oftheSupportingInformation,wheretheyareaddi- tionallycomparedtoNLLSestimates.

Fig. 2 shows that SNN-based estimates correlated well with the ground-truthparametertargets,withmostparametersyieldinglinear correlationcoefficientsclosetoorabove0.9.Thereferencedfigurefo- cusesontheperformanceofanetworktrainedwithinsilicoS(𝜏E,B) datageneratedwiththeoptimizedprotocolandanevenmixofrandom andWM-likesamples(fbrain=0.5),anddistinguishesbetweenperfor- manceonparametersobtainedbyuniformrandomsampling(lightblue points)andparametersderivedfrominvivonon-corticalbraindata(dark bluepoints).Redpointscorrespondtoparametervectorsderivedfrom lowcomponent-specificsignalfractions,asdescribedinthefigurecap- tion.PoorperformanceisobservedforlowDΔ;Zvalues,wherethenet- workyieldsDΔ;Z≈0.3regardlessoftheunderlyingground-truth.This canbeattributedtoanintrinsicdifficultyindistinguishingbetweenthe diffusion-weightedsignalsofcomponentswith|DΔ;Z|<0.5components (Erikssonetal.,2015).Moreover,apoortarget-to-estimatecorrespon- dencewasseenforT2;Z-timeswherethesewerelongerthanthemaximal echotime.

Theparametermapsestimatedfromthedeepernetworkareingood agreementwiththeirrespectiveground-truthtargets(Figs.S3andS4 oftheSupportingInformationcorrespondtoFigs.1and2).InFig.S4, weobservedthatDNN-basedfittingresultedinslightlystrongercorre- lationsbetweennetworkestimatesandground-truthparametertargets.

AlthoughinvivomapsfromDNNandSNNaresimilar,differencescan be foundinDΔ;ZandT2;Z; theDNNproducesanoisierDΔ;Zandthe T2;ZmaphasahighercontrastbetweenWMandcorticalGM.Bothof thesefeaturesarelikelyartefactual,andsuggestthattheDNNismore susceptibletodifferencesbetweentheSMRsignalpredictionsandthe measuredinvivodata.

Theerrorsandprediction-targetcorrelationsoftheANN-basedesti- matesarecompiledinTable2,wheretheyarealsocomparedtoacon- ventionalNLLSsolver.TheNLLS,SNN,andDNNapproachesallhavea comparableaccuracyforinsilicodatasetsdesignedtocaptureWM(T2, D)properties.Bycontrast,thefunction-fittingnetworksareobservedto bemoreaccuratethantheNLLSapproachforsyntheticmunifparameter vectors.

4.2. Effectofacquisitionprotocolonnetworkaccuracyandsensitivity

Inthissection,wefocusontherelationshipbetweenacquisitionpro- tocoldesignandnetworkperformance.Fig.3showsthatANN-based fittingcouldpartlybutnotcompletelyeliminatetheknownfitdegener- acyintheunoptimizedandLTE-onlyprotocols:ANNsbasedontheopti- mizedprotocolprovidelowerestimationerrors(NRMSE)thantheANNs basedontheothertwoprotocols.Comparingthesetwoprotocols,we notethattheunoptimizedprotocolyieldsrelativelymoreaccurateesti- matesofDI;S,andT2;Z,whiletheLTE-onlyprotocolyieldsmoreaccu- rateestimatesoffS,DI;Z,DΔ;Z,T2;S,andp2.Fig.3alsoshowsthatthe performanceofbothSNNandDNNislessaffectedbysub-optimalac- quisitionprotocolsthanthetraditionalNLLSapproach(Fig.3).Forthe

(6)

Fig.1.Deployingtrainednetworksonpreviouslyunseeninsilicoandinvivodataprovidesanatomicallyplausibleparametermapsinunder10s(includingdata managementtimes).Thefirstandsecondcolumnscomparetheground-truthtargetsandnetworkpredictions,respectively,oftheinsilicodataset.Differencemaps areshowninthethirdcolumn.Parametermapsobtainedfromapplyingatrainednetworkoninvivobraindataaredisplayedinthefourthcolumn.

NLLSapproach,theuseoftheunoptimizedorLTE-onlyprotocolsleadsto aconsiderableincreaseoftheestimationerrors,whileonlyaslightin- creaseofNRMSEisobservedfortheDNNorSNNapproaches.Thissug- geststhatANNsmaypartlyalleviateparameterestimationdifficulties causedbyaprotocolthatisinadequateinrelationtothemicrostructure model.

Fig.4shows thesensitivityof thevariousprotocolstoparameter changes.Networkstrainedondatageneratedwiththeoptimizedproto- colaresensitivetoallparameters,butslightlyunderestimatethemag- nitudeofthechange,particularlyinDΔ;Z.Theparameter-specificmod-

ulationsdidnothaveamajoreffectontheestimationoftheremaining unmodulatedparameters.Anexceptionwasfoundwhentheunderly- ingT2;Zisincreasedby10%,whichresultsina3%overestimationof theunchangedDI;S.Comparedtotheoptimizedprotocol,theunoptimized andLTE-onlyprotocolsexhibitalowersensitivitytothesmallparame- termodulationsandappeartobeunresponsivetochangesinDΔ;Z(both protocols)andDI;S(LTE-only).Inadditiontolowersensitivity,theunop- timizedprotocolalsoresultedinlessaccurateestimationsoftheunmod- ulatedparameters,witha10%modulationoffSleadingtoanerroneous 7%increaseinT2;S.

(7)

Fig.2. Scatterplotsofground-truthparametersvs.neuralnetworkpredictions.Lightbluepointsshowresultswhenthenetworkisdeployedonuniformlydistributed randommodelparameters.Thedarkbluepointscorrespondtoaninsilicodatasetderivedfromanonlinearleast-squaredfittomeasuredbraindatawherevoxels withinCSFandcorticalGMwereexcludedbymaskingoutregionswheremicroscopicanisotropy(Lasič etal.,2014),𝜇FA,islowerthan0.6.Theredpointscorrespond toregionswherepooraccuracyisexpected,i.e.,wherethesignalfractionoftherelevantcomponent(“stick” or“zeppelin” dependingontheparameter)accounts forlessthan15%ofthetotalsignalor,forthep2map,parametervectorswherethe“zeppelin” componentaccountsformorethan85%ofthetotalsignalfraction and|DΔ;Z|<0.4.TheinnerlegendsshowthePearsoncorrelationcoefficients(𝜌)ofthebluepoints.Forinterpretationofthereferencestocolorinthisfigurelegend, thereaderisreferredtothewebversionofthisarticle.

Table2

AccuracyperformanceofDNN-,SNN-andNLLS-basedfittingapproaches.Performanceisevaluated onsyntheticdatasimulatedfromtwodifferentsets:uniformlysampledrandomparameters(munif), andparametersderivedfromleast-squaredmodelfittingtoinvivoWManddeepGMdata(mfit;WM-like).

Metric Dataset Fitting method Fitting time [s] f S D I;S D I;Z D Δ;Z T 2;S T 2;Z

NRMSE m fit;WM-like NLLS 967 0.07 0.07 0.04 0.1 0.08 0.01

SNN 0.2 0.07 0.08 0.03 0.1 0.07 0.02

DNN 0.3 0.07 0.08 0.03 0.09 0.07 0.02

m unif NLLS 2201 0.08 0.21 0.18 0.21 0.16 0.21

SNN 0.1 0.05 0.11 0.10 0.15 0.12 0.14

DNN 0.5 0.05 0.11 0.10 0.14 0.11 0.14

𝜌 m fit;WM-like NLLS 967 0.90 0.87 0.91 0.78 0.78 0.98

SNN 0.2 0.90 0.84 0.93 0.70 0.76 0.96

DNN 0.3 0.91 0.85 0.92 0.73 0.77 0.95

m unif NLLS 2201 0.97 0.79 0.78 0.71 0.85 0.77

SNN 0.1 0.98 0.93 0.94 0.86 0.92 0.87

DNN 0.5 0.99 0.93 0.94 0.88 0.92 0.88

4.3. Neuralnetworkfittingofrotationallyinvariantmicrostructural features

Fig.5AshowsthattrainingaSNNwithrotationalinvariantsresults in slightlystrongercorrelationbetween targetandestimatedparam- eters(comparewiththescatterplotsof Fig.2).Wenotea consider- ableimprovementinaccuracyatlowDΔ;Zvalues,wheretheconstant DΔ;Z≈0.3behaviourobservedforthefullSMRmodel(seeFig.2) is nolongerpresent.ApplyingtheRotInv networktoanunseeninvivo Sl={0,2} datasetresultsinparametermapswithanatomicallyplausible contrast(seeFig.5B).ConsistentwiththebetterDΔ;Zaccuracyperfor- manceoftheRotInvapproach,wenotethattheRotInvDΔ;Zinvivomap hasasmootherappearanceandbetterdemarcatescortical/non-cortical parenchymathanitsnon-rotationallyinvariantSMRcounterpart(com- parethefourthcolumnofFigs.1with5B).

Interestingly, in vivo maps smoother than the ones displayed in Fig.5BcanbeattainedfromanANNthatwastrainedonunreasonably noisyinsilicodata.Fig.6displaystheinvivoparametermapsobtained

fromaRotInvnetworktrainedwithSNR∈[20,40]atS0,whichis4 timeslowerthanthatusedinFig.5.Theresultingmapshaveasmooth appearanceandexhibitanatomicallyplausiblecontrast.Forexample, regions withhighfS correspondtoWMregions,thelateralventricles arecharacterizedbylowfSandhighDI;Zvalues,anddarker/brighter DΔ;Zregions demarcatecortical/non-corticalparenchyma. Whileitis temptingtofavourtheseductively‘robust’mapsofFig.6overthenois- iermapsofFig.5B,wenotethatthelow-SNRRotInvnetworkresults inweakcorrelationsbetweentargetandestimatedparameters(compare thescatterplotsofFig.6withthoseofFig.5B).Forexample,SNN-based estimatesofDΔ;Zmayyieldasmoothmapthatappearsrobust,buta closerinspectionrevealsthattheDΔ;ZestimatesinWManddeepGMre- gionsareequaltothemeanofthetargetDΔ;Zdistributionandconstitute anexceedinglyinaccurateestimateoftheunderlyingground-truth.The tendencyfornetworkstoreturnthemeanofthetrainingparameterdis- tributionhasbeenreportedinstudiesoftheRotInvmodel(Reisertetal., 2017) and the behaviour was explained in detail by Coelho et al.

(2021).

(8)

Fig.3. OptimizedacquisitionprotocolsresultinANN-andNLLS-basedparameterestimateswithsmallererrors.Thebarplotsindicatethenormalizedroot-mean- squarederrors(NRMSE)betweenground-truthandpredictedparameters,forlearning-based(DNNandSNN)andNLLSfittingapproaches,andforinsilicodatasets generatedwithdifferentacquisitionprotocols.Theleftmostplotscorrespondtoatensor-valued(𝜏E,B)protocoloptimizedforminimalparametervariance,the optimizedprotocol(Lampinenetal.,2020);themiddleplotscorrespondtoasub-optimaltensor-valued(𝜏E,B)protocolwhererelaxation-diffusioncorrelationsare exclusivelyestablishedatlowb-values,theunoptimizedprotocol(Lampinenetal.,2019);therightmostplotsshowtheresultsfora(𝜏E,B)protocoloptimizedfor minimalparametervariancewhenlimitedtolineardiffusionencoding(bΔ=1),theLTE-onlyprotocol(Lampinenetal.,2020).PanelAshowsnetworkperformance onparameterssampledfromauniformdistribution,andpanelBshowstheperformanceoninsilicodatabasedonleast-squaresfittingresultstoinvivonon-cortical braintissuedata.

Fig.4.Sensitivityofacquisitionprotocolsto10%parametermodulations.Thematricesdisplaytherelationbetweenaninducedparameterchangeandtheobserved response.Whenasingleparameteronthey-axisismodulatedby10%,theresponsecanbereadinallotherparametersalongthex-axis.Anidealnetworkwould reportadiagonalmatrixwiththevalue10%onthediagonal,andzerootherwise.Theoptimizedprotocolappearssensitiveinallparameters,whereastheunoptimized protocollackssensitivityDΔ;ZandtheLTE-onlyprotocolslackssensitivitytobothDΔ;ZandDI;S.

5. Discussionandconclusions

ReplacingtraditionalNLLSsolverswithfunction-fittingneuralnet- works enables vastly faster parameter estimationwhen using high- dimensionalmicrostructuralmodels.Onaconsumer-gradedesktopcom- puter, the fitting time was reduced from hours (NNLS) to seconds (ANN).Naturally,theNNLSfittingtimes,basedontherelativelyslow trust-region-reflectivealgorithm, can be improvedby linearizing the fittingproblem(Daduccietal.,2015) orbyusingGPU-basedsolvers (Harmset al.,2017).However, whilesuchprocedures haveenabled wholebrainfittingofnon-linearmodelswithinminutes(Daduccietal., 2015;Harmsetal.,2017),westillexpecttheseconds-longforwardpass

of anANN toprovidea competitivechoicein termsof computation time.

TheANN-basedestimateswereobservedtobeingoodagreement with syntheticdatathat mimicked healthyWMaswell asdatathat spanned the entire space of allowed model parameters. When de- ployed on unseeninvivo brain data, neuralnetworks provide maps thatareconsistentwithknownbrainanatomyandpreservecontrastbe- tweenregionswithdifferentrelaxation-diffusionproperties.Ourfind- ings are encouraging and in line with recent advanced dMRI mod- ellingstudiesthatusemachinelearningtechniquesforparameteres- timation (Barbieri et al., 2020; Bertleff et al., 2017; Golkov et al., 2016; Grussu et al., 2020; Gyori et al., 2019; Hill et al., 2021;

(9)

Fig.5. Neuralnetworkfittingofarotationallyinvariant(RotInv)modelresultsinstrongtarget-estimatecorrelationsandplausiblemaps.(A)Correlationsbetween network-basedparameterestimatesandground-truthparametertargets.TheestimateswereobtainedfromaRotInvnetworktrainedusingafractionoffbrain=0.5 betweenrotationallyinvariantmbrainandmuniftrainingparametervectors.Thecolour-codingandlegendsfollowthesameconventionasFig.2.(B)Mapsofmi- crostructuraldiffusionparameters–fS,DI;S,DI;ZandDΔ;Z– obtainedfromfittingaRotInvnetworktorotationallyinvariantinvivobraindata.

Kaandorpetal.,2021;Nedjati-Gilanietal.,2017;Palomboetal.,2020; Reisertetal.,2017).Acombinationoferrormetrics,correlationanal- ysis,andsensitivitymatriceswasfoundtoprovideausefulsetoftools forquantitativelyassessingparameter-specificaccuracy/sensitivityand foridentifyingthelimitationsoflearning-basedapproaches.Thesetools facilitate a survey of the performance across all dimensions of the SMRmodel,forexample,revealingthatDΔ;Zwasconsistentlylessac- curatethanotherparameters, asexpectedfrompreviousstudiesthat haveemphasizedthatitisdifficulttoestimate(Erikssonetal.,2015; Lampinenetal.,2020,2019).Bycontrast,visualinspection ofANN- basedparametermapswasfoundtoprovidelimitedinsightonthegen- eral performanceof thenetworks. Indeed, smoothandanatomically plausiblemapscanbeachievedevenwithpoornetworkperformance anddatawithlow SNR.Thisisa commonanddeceptivepitfall that hasstrongimplicationsfortheevaluationofperformanceinmachine learningapproaches(Reisertetal.,2017).

Wefoundnoevidencethatvoxel-wiseANN-basedparameterestima- tioncanfullyalleviatethedegeneratefittinglandscapetypicallypresent whenworkingwithbiophysicalmodelsindMRI(Jelescuetal.,2016)or replaceanexhaustivesamplingofallrelevantexperimentaldimensions (Coelhoetal.,2019;Lampinenetal.,2020).Fig.3showsworseperfor- manceintermsofparameterestimationerrors(NRMSE)forthetwopro- tocolswithknowndegeneracyproblems(Lampinenetal.,2020).Simi- larly,Fig.4showsthatonlytheoptimizedprotocolcanfaithfullyrecover parameter-specificchangeswhiletheothertwocannot.Theseareboth

signsofunresolveddegeneracies.Indeed,wecannotexpectgoodperfor- manceforLTE-onlyandunoptimizedprotocolsbecausetheseprotocols canyieldvirtuallyidenticalsignalvectorsfordifferentmodelparam- eters;theinverseproblemhasmanysolutions(Lampinenetal.,2020, 2019).Nevertheless,ANN-basedfittingshowedanadvantagecompared withthetraditionalNLLSapproach,asityieldedlowerestimationerrors inthedegeneratecases(unoptimizedandLTE-onlyprotocols;Fig.3).Our interpretationisthattheNLLSapproachreturnsoneoutofthemanyso- lutions,whereastheANN-basedestimatetendstoanaverageacrossthe manysolutions.

The11-dimensionalparameterspaceoftheSMRmodelisdifficult tosampledenselyandthuspresentsachallengewhendesigningtrain- ingdatasetsthatarerepresentativeofthevastfittinglandscape.Inthis work,weaddressedthischallengebyconstructingtrainingdatabased oninvivohealthybraindata(mbrain)andmorenaïveparametervec- torsrandomlysampledfromtheentiremodelparameterspace(munif).

Networkstrainedexclusivelywithmbrainvectorsdisplayedthebestac- curacy intermsof expectedWMproperties,buttheirdomain ofva- lidityisrestrictedtotherelativelysmallspacespannedbymbrainsolu- tions.Thisraisesquestionsabouttheirgeneralizability,i.e.,theirperfor- manceincaseswhereatypicalmicroscopictissuestructuresarepresent (Alexanderetal.,2019).Tofindagoodtrade-off betweenaccuracyand generalizabilityweoptimizedthefractionofinvivo-basedtrainingdata (fbrain).However,weexpectthatmoreworkisneededtodefineatruly optimalstrategyfornetworktraining.

(10)

Fig.6. Traininganeuralnetworkwithaninsufficientdatasetmayresultinplausiblemapsbutpoortarget-estimatecorrelations.(A)Experimentalparametermaps obtainedfromfittingaRotInvSNNthatwastrainedonunreasonablynoisydata(SNRatS0inthe[20,40]range).Themapswereobtainedbydeployingthenetwork torotationallyinvariantinvivobraindata.(B)Correlationsbetweennetwork-basedparameterestimatesandground-truthparametertargets.Thecolour-codingand legendsfollowthesameconventionasFig.2.

Inthisstudy,wefocusedonfullyconnectednetworksthatfollowthe designofmultilayerperceptrons(MLPs),atraditionalANNclassthat iswell-suitedfor regressionproblems(Cybenko,1989; Horniketal., 1989).AlternativesorcomplementstothefullyconnectedANNarchi- tectureshouldalsobeexploredinfutureworks.Promisingavenuesin- cludetheuseofdropout(GalandGhahramani,2016;Tannoetal.,2021) ordeepensemblestrategies(Lakshminarayananetal.,2016;Qinetal., 2021)asameanstoderiveuncertaintymetrics,theuseofrolled-outnet- workstructuresinspiredbynon-learning-basediterativefittingframe- works(Ye,2017),theuseofauto-encoders(Zucchellietal.,2021),or theuseofdenoisingnetworks(Fadnavisetal.,2020;Wangetal.,2019) tominimize theamount ofnoisepresentinthedatathatissupplied tothefunction-fittingANN.Whilefundamentallydifferentnetworkar- chitecturesmayconsiderablyboosttheperformanceoftheANN-based fittingapproach, themodestdifferences foundbetweentheSNNand DNNdesignssuggeststhatsimplyincreasingthewidthand/ordepthof thefullyconnectedANNarchitectureisnotapromisingavenue.Despite thepotentialforimprovement,wenotethattheplotsinFig.2constitute animprovementoversimilartarget-estimatecorrelationplotsreported in(Reisertetal.,2017),wheresupervisedlearningbasedonpolynomial regressorswasusedtofitathree-compartmentdiffusionmodel,andare equivalenttothecorrelationplotsreportedin morerecentworkson learning-basedfittingofdiffusion(Gyorietal.,2019;Palomboetal., 2020)anddiffusion-relaxationMRImodels(Grussuetal.,2020).

ThefullyconnectedANNsweconsideredherearenotinvariantto samplerotations.TheinputtotheANNsisavectorofEsignalsamples measuredatapre-definedsetof bothdirectional(𝜃,𝜙) androtation- allyinvariant(𝜏E,b, bΔ)experimental points.Theorderingin which theEmeasurementsareprovidediskeptfixedandsampleswithsimi- larmicrostructuralkernelsK(𝒖̂𝒏̂)butdifferentorientationswillresult indistinctivenetworkinputvectors.Thisplacesaburdenonthetrain- ingdatageneration,whichhastospanasufficientsetofpossibletissue orientations.UsingtheRotInvformulationrendersthenetworkinvari- anttosamplerotations,whichconsiderablyreducesthedimensional- ityoftheparameterspacethathastoberepresentedintrainingdata.

Thehighertrainingefficiencylikelyexplainsits slightlyhigheraccu- racyperformancerelativetothefullSMRnetworks.Alternativestothe RotInvformulationpresentedinthisstudyincludeaframeworkbased onadifferentsetofrotationallyinvariantfeaturesofthedMRIsignal (Zucchellietal.,2021)orfittingthefullSMRmodelwithequivariant networkarchitectures(Cohenetal.,2018;Thomasetal.,2018).

Apotentiallimitationofthepresentstudyisthefocusonasingle multi-compartmentmodeloftissuemicrostructurewhoserangeofap- plicationismostlylimitedtoWManddeepGMtissues.Applications forcorticalGMshouldthereforeconsidermodelstailoredtotheappro- priatemicrostructure(Palomboetal.,2020).Ourdecisiontofocuson asinglemodelfollowsfrompreviousdMRIliteraturewhich haspre- sentedthe“StandardModel” oftissuemicrostructure– fromwhichour SMRmodeldescends– asanoverarchingsignalmodelthatencompasses severalotherWMmodelsasparticularcases(Novikovetal.,2019).Fur- thermore,the“StandardModel” hasbeenusedtorevealgeneraldegen- eracyproblemsinmicrostructureparameterestimation(Novikovetal., 2018). Giventhegenerality of ourmodelandtheprevalence of de- generaciesinadvanceddMRImodelling,weexpectthedegradationof performancewithlessoptimalprotocolstoalsobefoundinalternative multi-compartmentmodelsorwhenusingdifferentlearning-basedfit- tingalgorithms(e.g.:polynomialReisertetal.2017orrandomforest Nedjati-Gilanietal.2017,Palomboetal.2020regressors).However, futureworkisneededtofullycharacterizethegeneralrelationshipbe- tweenmachinelearningapproachesanddegeneratefittinglandscapes.

Inconclusion,functionfittingneuralnetworkscanbeusedtovastly accelerateparameterestimationwithhigh-dimensionalmicrostructural MRI models. Theaccuracyof ANN-based estimates was observed to degradelesswithsub-optimalprotocolsthantraditionalNLLSfitting.

However, theperformance of functionfitting networks was stillob- servedtoprimarilydependontheamountofinformationsampledbythe underlyingmeasurements,andwefoundnoevidencethatANN-based approachescan offsettheneedfor arichsetofdata. Therefore,ma- chinelearningmethodology inMRI microstructuremodellingshould bematchedwithcomprehensivedataacquisition.Thisworkpresentsa

Referanser

RELATERTE DOKUMENTER

There had been an innovative report prepared by Lord Dawson in 1920 for the Minister of Health’s Consultative Council on Medical and Allied Services, in which he used his

Although, particularly early in the 1920s, the cleanliness of the Cana- dian milk supply was uneven, public health professionals, the dairy indus- try, and the Federal Department

In April 2016, Ukraine’s President Petro Poroshenko, summing up the war experience thus far, said that the volunteer battalions had taken part in approximately 600 military

This report documents the experiences and lessons from the deployment of operational analysts to Afghanistan with the Norwegian Armed Forces, with regard to the concept, the main

Based on the above-mentioned tensions, a recommendation for further research is to examine whether young people who have participated in the TP influence their parents and peers in

The increasing complexity of peace operations and the growing willingness of international actors to assume extended responsibil- ity for the rule of law in often highly

Overall, the SAB considered 60 chemicals that included: (a) 14 declared as RCAs since entry into force of the Convention; (b) chemicals identied as potential RCAs from a list of

An abstract characterisation of reduction operators Intuitively a reduction operation, in the sense intended in the present paper, is an operation that can be applied to inter-