Master of Science in Data Science
October 4, 2021 2023-01-31 9:46Master of Science in Data Science
The Master of Science (M.Sc.) in data science program is designed to provide students with a high level of employability in the field of data analysis, the broader field of computing and applied mathematics. Graduates will be able to demonstrate the skills and knowledge required for further advancement in the public and government sectors. Career opportunities also exist in the private sector where graduates will be able to demonstrate an understanding of statistical analysis and automated decision-making processes equipping them with extremely valuable knowledge to business, industry, and IT firms.
The M.Sc. degree is awarded upon the successful completion of a minimum of 30 credit hours. The Master of Science (M.Sc.) in data science is broken down into four (4) semesters having 30 CH course (8 courses of which 3 courses are offered each Semester + 1 Master Thesis of 6 CH).
- Sustainable Learning Vision
- Working on engineering projects with the industry
- Ministry of Higher Education Accreditation
- State of the art labs and equipment
- Engagement of Students in Professional Activities
- Support for graduates to pursue their postgraduate studies
MSDS Program Goals?
- Presents the students with techniques to manage complex data sets.
- Presents the students with data scientist skills to model and analyze data using algorithms and data driven computing programs.
- Presents the students with data scientist skills to formulate and present analytics solutions that is appropriate to the stakeholders.
MSDS Program Learning Outcomes
- PLO 1. Apply computing algorithms and techniques to manage large-scale, and complex data set.
- PLO 2. Identify and evaluate the opportunities, needs, ethics, social responsibility, bias and limitations of the data and algorithms to provide professional data science solutions.
- PLO 3. Formulate, integrate and design solutions and methodologies using data analytics driven programs and research methods to propose a data analytics solution or research problem.
- PLO 4. Interpret and communicate effectively as managers in multidisciplinary teams through demonstrating effective professional oral & writing skills for data analytics and making persuasive presentations at a managerial level.
The study plan for a Full-time student (2 years [4 semesters] study plan)
Semester | Courses Code | Courses Title | Semester | Courses Code | Courses Title |
1 | DS601 | Fundamentals of Data Science | 2 | DS621 | Research Methods in Data Science |
DS602 | Statistics & Probability in Data Science | DS620 | Data Visualization & Data Representation Techniques | ||
DS603 | Advanced Database Queries and Data Warehouse in Data Science | DS604 | Big Data Management using Hadoop | ||
Semester | Courses Code | Courses Title | Semester | Courses Code | Courses Title |
3 | DSXXX | Electives 1 | 4 | DS629 | Data Science Thesis (Continued from Semester 3) |
DSXXX | Electives 2 | ||||
DS629 | Data Science Thesis |
The study plan for a part-time student (4 years [8 semester] study plan)
Semester | Courses Code | Courses Title | Semester | Courses Code | Courses Title |
1 | DS601 | Fundamentals of Data Science | 2 | DS603 | Advanced Database Queries and Data Warehouse in Data Science |
DS602 | Statistics & Probability in Data Science | DS621 | Research Methods in Data Science | ||
Semester | Courses Code | Courses Title | Semester | Courses Code | Courses Title |
3 | DS620 | Data Visualization & Data Representation Techniques | 4 | DSXXX | Electives 1 |
DS604 | Big Data Management using Hadoop | DSXXX | Electives 2 | ||
Semester | Courses Code | Courses Title | |||
5-8 | DS629 | Data Science Thesis |
Course Description
DS 601 Fundamentals of Data Science
This course introduces the principles of data science field as well as review common functionalities of toolkits that are suitable for a data scientist portfolio, with emphasis on data sets cleaning, processing, merging, manipulating, as well as statistical concepts and test for significance in data. In particular, this course focuses on how to read in datasets into data structures, to query and index these structures, to merge multiple data structures, to summarize data into tables, to group data into logical categories, and to manipulate dates. The coverage also includes essentials of machine learning topics such as regression, classification and clustering as well as the importance of storytelling and data visualizations in data science process. This course is an intensive practice course using real-world datasets and variety of statistical methods for both data cleaning activities and compute statistical metrics for data analysis.
DS 602 Statistics and Probability in Data Science
The course introduces the foundations of probability and statistics necessary for modeling and undertaking statistical decisions. The course combines both the mathematical theory and practices of applying this theory to actual data. Coverage includes probability and probability distributions, descriptive statistics random variables, dependence, statistical test and confidence intervals, correlation, regression, entropy and experimental design.
DS 603 Big Data Management Using Hadoop
This course introduces the fundamentals of managing big data and the corresponding standard big data platform that is suitable for addressing substantive needs for handling the different significant amount of data. This course includes an introduction to Hadoop, Hadoop components for data organization, storage, retrieval, and analysis and processing data using Hadoop and querying big data. This course will include topics such as large-scale data analysis, data storage systems, data representations, semi-structured data models. This course will involve hands- on practices using Hadoop to handle real-world datasets.
DS 604 Advanced Database Queries and Data Warehouse in Data Science
This course provides review concepts and practices as applied to database and data warehouse. Coverage includes of developing advanced database queries, distributed databases and performance issues in database system. Moreover, the course introduces state-of-the-art on a set of topics warehouses and dimensional data modeling.
DS 620 Data Visualization & Data Representation Techniques
This course focuses on data visualization techniques for effective communication of data science results through visual analytics dashboards and storytelling. Students will gain a thorough understanding of information visualization design, principles, guidelines, and evaluation criteria. The course covers best practices for creating basic diagrams and charts, statistical charts and tables, and selecting appropriate methods for specific problems. They will learn how to make data-driven decisions, apply interactive techniques to visualizations, and perform data transformations. The course also includes hands-on experience in creating interactive visualizations and dashboards using Python and leading tools such as Power BI, Tableau, and QlikView.
DS 621 Research Methods in Data Science
The purpose of the course is to provide an opportunity for the students to prepare their thesis or research project proposal, excellent writing style and compelling research strategies research, analyze ethical issues that may arise when working datasets. Formulate problem statement, conduct the literature review, identify the type of research method strategy, and plan the overall structure of the study, prepare the project data sets. Also, you will learn about issues of reproducibility, and how to set up your data science product such that it is reproducible.
DS 623 Machine Learning in Data Science
This course introduces machine learning techniques and methods. The coverage includes applied machine learning for the data scientist, the issue of the dimensionality of data, the task of data clustering, clusters evaluation, supervised approaches for building predictive models, data generalization (e.g., cross-validation and over-fitting). This course coverage includes advanced techniques, such as developing ensembles methods, and predictive models’ practical limitations.
DS 624 Text mining in Data Science
This course introduces text mining and manipulation techniques. Course coverage includes working with text toolkits, the text structure for both human and machines, text manipulation needs, regular expressions, text cleaning, preparing text for machine algorithms processing, natural language processing techniques, classifications, and topic and similarity detection in documents. This course involves hands-on practices using real-world documents.
DS 625 Social Network Analysis in Data Science
This course involves hands-on practices using real-world documents. The course introduces the process of modeling the social structures as networks, analyze the connectivity of social networks, measure the importance of nodes with the social network such as centrality and closeness, network evolution over time, a model of network generation and link problem prediction.
DS 626 Management Data Science
The course applies the tools and techniques of management science to engineering problems. It covers linear programming, sensitivity analysis, waiting line, decision analysis, forecasting. The course makes use of available optimization software and spreadsheets to solve practical problems through case studies.
DS 628 Ethics in Data Science
This course introduces ethics, policies and legal issues that face computer professionals and data scientist while working with information systems and datasets. This course will discuss Intellectual Property, privacy debates, laws and professional ethics governing these issues while working with computer systems, as well as the course, will examine the related cyber security issues. Specific case studies and assignments will be used to illustrate the discussed issues.
DS 631 Data Science Thesis
The purpose of this thesis is to provide an opportunity to analyze large-scale real-world problems according to the scientific methods. The student demonstrates an ability to utilize and apply various data science opportunity and techniques to design, implement, test, and evaluate a complex problem, deriving insights from data and sharing ideas with other stakeholders. The focus of the thesis depends on the selected elective courses and therefore corresponds Data science product outcome.
- All students must have a valid English Proficiency Score IELTS 6, TOFEL 550 or EmSAT 1400.
- A minimum cumulative GPA of 2.5 on a 4.0 scale or its established equivalent.
- Completion of a recognized baccalaureate degree in a discipline appropriate for the IT degree. A Higher Diploma is not equivalent to a baccalaureate degree and does not qualify an applicant for admission to the MSDS program.
- A valid conditional English Proficiency Score EmSAT 1250/1375, IELTS 5.5 & TOFEL 530.
- The applicant may apply for student visa only after getting admitted to the program and enroll as a Full Time student.
- Student Visa is valid only for one year.
- A student visa permits the student to work only as a part time after getting the needful approvals from the concerned authorities in the UAE.
- The approval or rejection of the visa is under General Directorate of Residency and Foreigners Affairs in Dubai. The visa process has no timing frame as it may take from two days up to two months, so we strongly recommend to submit your documents at least 2 months before the start of the term.
The Graduate Programs Office accepts applications throughout the year for the MSDS program.
Thank you for your interest in the University of Dubai. Please fill out this preliminary application form and one of our admission officers will be in contact with you shortly. If you have any problems or questions completing the form, please contact us.