LEADER 01230nam0 22003011i 450 001 SUN0016588 005 20151218092708.305 010 $a06-320-5477-8 100 $a20040525d2000 |0engc50 ba 101 $aeng 102 $aGB 105 $a|||| ||||| 200 1 $aEcological methods$fT. Richard E. Southwood, P. A. Henderson 205 $a3. ed 210 $aOxford$cBlackwell$dc2000 215 $aXV, 575 p.$cill.$d25 cm. 620 $aGB$dOxford$3SUNL000020 676 $a577$cEcologia$v22 700 1$aSouthwood$b, T. Richard E.$3SUNV012408$0729403 701 1$aHenderson$b, P. A.$3SUNV012409$0442436 712 $aBlackwell$3SUNV000122$4650 790 1$aSouthwood, T. R. E.$zSouthwood, T. Richard E.$3SUNV065632 801 $aIT$bSOL$c20181109$gRICA 912 $aSUN0016588 950 $aUFFICIO DI BIBLIOTECA DEL DIPARTIMENTO DI SCIENZE E TECNOLOGIE AMBIENTALI BIOLOGICHE E FARMACEUTICHE$d17 CONS Ed90 $e17 FSA1827 995 $aUFFICIO DI BIBLIOTECA DEL DIPARTIMENTO DI SCIENZE E TECNOLOGIE AMBIENTALI BIOLOGICHE E FARMACEUTICHE$bIT-CE0101$gFSA$h1827$kCONS Ed90$oc$qa 996 $aEcological methods$91429659 997 $aUNICAMPANIA LEADER 04056nam 22005655 450 001 9910768172903321 005 20240312140702.0 010 $a981-9976-57-X 024 7 $a10.1007/978-981-99-7657-7 035 $a(MiAaPQ)EBC30979404 035 $a(Au-PeEL)EBL30979404 035 $a(CKB)29126986800041 035 $a(DE-He213)978-981-99-7657-7 035 $a(EXLCZ)9929126986800041 100 $a20231129d2024 u| 0 101 0 $aeng 135 $aurcnu|||||||| 181 $ctxt$2rdacontent 182 $cc$2rdamedia 183 $acr$2rdacarrier 200 10$aDirty Data Processing for Machine Learning /$fby Zhixin Qi, Hongzhi Wang, Zejiao Dong 205 $a1st ed. 2024. 210 1$aSingapore :$cSpringer Nature Singapore :$cImprint: Springer,$d2024. 215 $a1 online resource (141 pages) 311 08$aPrint version: Qi, Zhixin Dirty Data Processing for Machine Learning Singapore : Springer Singapore Pte. Limited,c2024 9789819976560 327 $aChapter 1. Introduction -- Chapter 2. Impacts of Dirty Data on Classification and Clustering Models -- Chapter 3. Dirty-Data Impacts on Regression Models -- Chapter 4. Incomplete Data Classification with View-Based Decision Tree -- Chapter 5. Density-Based Clustering for Incomplete Data -- Chapter 6. Feature Selection on Inconsistent Data -- Chapter 7. Cost-Sensitive Decision Tree Induction on Dirty Data. 330 $aIn both the database and machine learning communities, data quality has become a serious issue which cannot be ignored. In this context, we refer to data with quality problems as ?dirty data.? Clearly, for a given data mining or machine learning task, dirty data in both training and test datasets can affect the accuracy of results. Accordingly, this book analyzes the impacts of dirty data and explores effective methods for dirty data processing. Although existing data cleaning methods improve data quality dramatically, the cleaning costs are still high. If we knew how dirty data affected the accuracy of machine learning models, we could clean data selectively according to the accuracy requirements instead of cleaning all dirty data, which entails substantial costs. However, no book to date has studied the impacts of dirty data on machine learning models in terms of data quality. Filling precisely this gap, the book is intended for a broad audience ranging from researchers inthe database and machine learning communities to industry practitioners. Readers will find valuable takeaway suggestions on: model selection and data cleaning; incomplete data classification with view-based decision trees; density-based clustering for incomplete data; the feature selection method, which reduces the time costs and guarantees the accuracy of machine learning models; and cost-sensitive decision tree induction approaches under different scenarios. Further, the book opens many promising avenues for the further study of dirty data processing, such as data cleaning on demand, constructing a model to predict dirty-data impacts, and integrating data quality issues into other machine learning models. Readers will be introduced to state-of-the-art dirty data processing techniques, and the latest research advances, while also finding new inspirations in this field. 606 $aArtificial intelligence$xData processing 606 $aData mining 606 $aBig data 606 $aData Science 606 $aData Mining and Knowledge Discovery 606 $aBig Data 615 0$aArtificial intelligence$xData processing. 615 0$aData mining. 615 0$aBig data. 615 14$aData Science. 615 24$aData Mining and Knowledge Discovery. 615 24$aBig Data. 676 $a005.7 700 $aQi$b Zhixin$01453413 701 $aWang$b Hongzhi$0654187 701 $aDong$b Zejiao$01453414 801 0$bMiAaPQ 801 1$bMiAaPQ 801 2$bMiAaPQ 906 $aBOOK 912 $a9910768172903321 996 $aDirty Data Processing for Machine Learning$93656032 997 $aUNINA