Branda, FrancescoFortino, GiancarloTalia, Domenico2025-11-282023-06-28http://hdl.handle.net/10955/5683Università della Calabria. Corso di laurea in Ingegneria Informatica, Modellistica, Elettronica e Sistemistica (DIMES). Dottorato di ricerca in Information and Communication Technologies (ICT). Ciclo XXXVInthelastyears,thecapacitytoproduceandcollectdatahasincreasedexpo- nentially.Thehugeamountofdatagenerated,commonlyreferredtoasBigData, thespeedatwhichitisproduced,anditsheterogeneityintermsofformatrepresent a challengetocurrentstorage,processing,andanalysiscapabilities.Thisscenario requiresthedesignandimplementationofnewarchitecturesandanalyticalplatform solutionsthatmustprocessBigDatatoextractcomplexpredictiveanddescriptive models.Today,high-performancecomputing(HPC)infrastructuressuchashighly parallelclusters,supercomputers,andcloudscanbeusedforprocessingandanalyz- ingmassivesourcesofreal-worlddatainvariousfields,includinggenomicsequencing andmedicalresearch,frauddetection,andweatherforecasting.Followingthesepre- liminaryobservations,thegoalofthisthesisistwofold.First,themainchallengesto besolvedforimplementinginnovativedataanalysisapplicationsonHPCsystemsare investigated.Inparticular,themainkeyresearchtopicsaddressedinclude:(i)stud- iesofsoftwaresystemsforBigDatastoring,processing,andanalysis;(ii)methods, techniques,andprototypesdesignedandusedtoimplementBigDatasolutionson massivedatasourcesrequiringtheuseofhigh-performancecomputingsystems;and (iii)designandprogrammingissuesforBigDataanalysisinExascalesystems,which willrepresentthenextcomputingstep.Second,severalinnovativeapplicationsand usecasesofBigDataanalyticsthatcanbeimplementedinlarge-scaleparallelsys- temsareproposed.Theseresearchcontributionsprovidenewinsightsandsolutions forextractingusefulknowledgefromlargevolumesofdata,describingmethodsand mechanismstosupportusers,practitioners,andscientistsworkingintheareaofBig Datainthedesignandexecutionofdataanalysistechniquesindifferentapplication domains.enbig data analysishigh-performance computing (HPC)social data miningmachine learninginfectious diseases modellingBig Data Analysis: Methodologies, Frameworks and Real-World ApplicationsThesis