This textbook explains SQL within the contextof data science and introduces the different parts of SQL as they are needed for the tasks usually carried out during data analysis. Using the framework of the data life cycle, it focuses on the steps that are very often given the short shift in traditional textbooks, like data loading, cleaning and pre-processing.
The book is organized as follows. Chapter 1 describes the data life cycle, i.e. the sequence of stages from data acquisition to archiving, that data goes through as it is prepared and then actually analyzed, together with the different activities that take place at each stage. Chapter 2 gets into databases proper, explaining how relational databases organize data. Non-traditional data, like XML and text, are also covered. Chapter 3 introduces SQL queries, but unlike traditional textbooks, queries and their parts are described around typical data analysis tasks like data exploration, cleaning and transformation. Chapter 4 introduces some basic techniques for data analysis and shows how SQL can be used for some simple analyses without too much complication. Chapter 5 introduces additional SQL constructs that are important in a variety of situations and thus completes the coverage of SQL queries. Lastly, chapter 6 briefly explains how to use SQL from within R and from within Python programs. It focuses on how these languages can interact with a database, and how what has been learned about SQL can be leveraged to make life easier when using R or Python. All chapters contain a lot of examples and exercises on the way, and readers are encouraged to install the two open-source database systems (MySQL and Postgres) that are used throughout the book in order to practice and work on the exercises, because simply reading the book is much less useful than actually usingit.
This book is for anyone interested in data science and/or databases. It just demands a bit of computer fluency, but no specific background on databases or data analysis. All concepts are introduced intuitively and with a minimum of specialized jargon. After going through this book, readers should be able to profitably learn more about data mining, machine learning, and database management from more advanced textbooks and courses.
This textbook explains SQL within the contextof data science and introduces the different parts of SQL as they are needed for the tasks usually carried out during data analysis. Using the framework of the data life cycle, it focuses on the steps that are very often given the short shift in traditional textbooks, like data loading, cleaning and pre-processing.
The book is organized as follows. Chapter 1 describes the data life cycle, i.e. the sequence of stages from data acquisition to archiving, that data goes through as it is prepared and then actually analyzed, together with the different activities that take place at each stage. Chapter 2 gets into databases proper, explaining how relational databases organize data. Non-traditional data, like XML and text, are also covered. Chapter 3 introduces SQL queries, but unlike traditional textbooks, queries and their parts are described around typical data analysis tasks like data exploration, cleaning and transformation. Chapter 4 introduces some basic techniques for data analysis and shows how SQL can be used for some simple analyses without too much complication. Chapter 5 introduces additional SQL constructs that are important in a variety of situations and thus completes the coverage of SQL queries. Lastly, chapter 6 briefly explains how to use SQL from within R and from within Python programs. It focuses on how these languages can interact with a database, and how what has been learned about SQL can be leveraged to make life easier when using R or Python. All chapters contain a lot of examples and exercises on the way, and readers are encouraged to install the two open-source database systems (MySQL and Postgres) that are used throughout the book in order to practice and work on the exercises, because simply reading the book is much less useful than actually usingit.
This book is for anyone interested in data science and/or databases. It just demands a bit of computer fluency, but no specific background on databases or data analysis. All concepts are introduced intuitively and with a minimum of specialized jargon. After going through this book, readers should be able to profitably learn more about data mining, machine learning, and database management from more advanced textbooks and courses.
Antonio Badia is Associate Professor in the Department of Computer Science and Engineering at the University of Louisville, KY, USA. He has taught both introductory and advanced college database courses for more than 20 years, and created and taught a course on data management and analysis for non-computer science students. His research on database systems has been funded by NSF and others, and produced more than 50 publications in conferences and technical journals.
Đăng ký nhận thông tin qua email để nhận được hàng triệu ưu đãi từ Muathuoctot.com
Tạp chí sức khỏe làm đẹp, Kem chống nắng nào tốt nhất hiện nay Thuoc giam can an toan hiện nay, thuoc collagen, thuoc Dong trung ha thao , thuoc giam can LIC, thuoc shark cartilage thuoc collagen youtheory dau ca omega 3 tot nhat, dong trung ha thao aloha cua my, kem tri seo hieu qua, C ollagen shiseido enriched, và collagen shiseido dạng viên , Collagen de happy ngăn chặn quá trình lão hóa, mua hang tren thuoc virility pills vp-rx tri roi loan cuong duong, vitamin e 400, dieu tri bang thuoc fucoidan, kem chống nhăn vùng mắt, dịch vụ giao hang nhanh nội thành, crest 3d white, fine pure collagen, nên mua collagen shiseido ở đâu, làm sáng mắt, dịch vụ cho thue kho lẻ tại tphcm, thực phẩm tăng cường sinh lý nam, thuoc prenatal bổ sung dinh dưỡng, kem đánh răng crest 3d white, hỗ trợ điều trị tim mạch, thuốc trắng da hiệu quả giúp phục hồi da. thuốc mọc tóc biotin
Mua hàng cực dễ dàng và nhanh chóng
Nhận hàng triệu khuyến mãi
Đăng kí tích điểm hoàn tiền lên đến 10%
Chính sách vận chuyển , giao hàng nhanh chóng
Mua hàng cực dễ dàng và nhanh chóng
Nhận hàng triệu khuyến mãi
Đăng kí tích điểm hoàn tiền lên đến 10%
Chính sách vận chuyển , giao hàng nhanh chóng
Mua hàng cực dễ dàng và nhanh chóng
Nhận hàng triệu khuyến mãi
Đăng kí tích điểm hoàn tiền lên đến 10%
Chính sách vận chuyển , giao hàng nhanh chóng