| Course Objectives: |
To teach the basic subjects of statistics, which is the first and an important step of data science, to be applied with the help of a mathematical background and a data analysis language. |
| Course Content: |
The relationship between data, information and knowledge, structural, semi-structural and nonstructured data, data sources, things to be considered in data collection, universe, sample, distribution concepts and its effects on the data set, R language and its detailed use, basic statistical concepts, confidence intervals, correlations and types, Chi-square, t and F distributions, hypothesis tests, Chi-square based significance tests, simple and multiple regression, mathematical backgrounds and applications in R language |
| Week |
Subject |
Related Preparation |
| 1) |
Differences, similarities and methods in intersection sets between the concepts of data science, data mining, machine learning and artificial intelligence. Data, information, knowledge concepts. |
|
| 2) |
Scope of data analysis, steps, tools used in the process. Introduction to R language (setting up R, community and Cran structure, advantages and disadvantages of RStudio, package and mirror selection, basic menus) |
|
| 3) |
Basic operations with R language (calling default datasets in R, understanding the summary and data structure of a called data set, viewing certain rows and columns from the data) |
|
| 4) |
R language and basic concepts (variables, operators, vectors, matrices, arrays, tables, basic loop and condition structures) |
|
| 5) |
Basic operations with R language (creating own data set, printing / saving, recall, finding and recalling data set from internet, simple level graphics) |
|
| 6) |
Data manipulation (data manipulation with default functions in R) |
|
| 7) |
Data manipulation (data manipulation with commands in R packets - Tidyverse / Dplyr) |
|
| 8) |
Basic descriptive statistics, hypothesis creation and confidence interval concepts, their application with R language |
|
| 9) |
Correlation concept, its types and application with R language |
|
| 10) |
Chi-square, t and F distributions, hypothesis tests, Chi-square based significance tests, their application with R language |
|
| 11) |
Chi-square, t and F distributions, hypothesis tests, Chi-square based significance tests, their application with R language |
|
| 12) |
The Concept of Regression, simple regression and multiple regression, their application with the R language |
|
| 13) |
The Concept of Regression, simple regression and multiple regression, their application with the R language |
|
| 14) |
LDA and EDA, R language applications |
|
| 15) |
LDA and EDA, R language applications |
|
| 16) |
Final Exam Week |
|
| Course Notes / Textbooks: |
Illowsky, B.; Dean, S. (2018). Introductory Statistics, Rice University OpenStax, ISBN: 978-1-947172-05-0. https://d3bxy9euw4e147.cloudfront.net/oscms-prodcms/media/documents/IntroductoryStatistics-OP_i6tAI7e.pdf
Büyüköztürk, Ş.; Çokluk, Ö.; Köklü, N. (2018). Sosyal Bilimler İçin İstatistik, Pegem Akademi, ISBN: 9789756802335.
|
| References: |
Özdemir, Ş. (2018). R Dili İle Veri Ön İşlemeden Model Seçimine Kadar Makine Öğrenmesi Süreci. BTK Akademi Eğitim Videoları. https://www.btkakademi.gov.tr/portal/course/r-dili-ile-veri-on-islemeden-model-secimine-kadar-makine-ogrenmesi-sureci-2305#!/about
Introduction to R, Datacamp.
R Tutorial for Beginners: Learn R Programming Language, Guru99. https://www.guru99.com/r-tutorial.html
|
| |
Program Outcomes |
Level of Contribution |
| 1) |
It has a wide range of interdisciplinary approaches to management information systems, primarily business and computer engineering. |
3 |
| 2) |
Comprehends the management information systems in terms of technical, organizational and managerial aspects and uses the current programming language by knowing the logic of programming. |
3 |
| 3) |
Uses different information technologies and systems for understanding and solving various business problems. |
2 |
| 4) |
Interpret the data, concepts and ideas in the field of management information systems with scientific and technological methods. |
1 |
| 5) |
Analyze the needs for an information system and analyze the processes of analysis, design and implementation of the database. |
1 |
| 6) |
Gains technical and managerial contributions to IT projects and takes responsibility. |
2 |
| 7) |
Solve complex business and informatics problems by using various statistical techniques and numerical methods and make analyzes using statistical programs effectively. |
2 |
| 8) |
Uses a foreign language at the B1 General Level in terms of European Language Portfolio criteria according to the level of education. |
1 |
| 9) |
Develops teamwork, negotiation, leadership and entrepreneurship skills. |
1 |
| 10) |
Has universal ethical values, social responsibility awareness and sufficient legal knowledge. |
1 |
| 11) |
Develops positive attitudes related to lifelong learning and identifies individual learning needs and carries out studies to correct them. |
1 |
| 12) |
Students will be able to communicate their ideas and solutions both written and orally, and present and publish them on both national and international platforms. |
1 |
| 13) |
It uses information and communication technologies together with computer software at the advanced level of European Computer Driving License required by the field. |
1 |