CAB330 Data and Web Analytics


To view more information for this unit, select Unit Outline from the list below. Please note the teaching period for which the Unit Outline is relevant.


Unit Outline: Semester 2 2024, Gardens Point, Internal

Unit code:CAB330
Credit points:12
Pre-requisite:CAB220 or DSB100 or CAB230 or IAB207
Equivalent:INB342
Assumed Knowledge:

Familiarity with the following IT concepts at the introductory level: Elementary Statistics; Basic Database Concepts; Finding library resources; and Issues involved in aligning business technology and information systems are assumed knowledge.

Coordinator:Richi Nayak | r.nayak@qut.edu.au
Disclaimer - Offer of some units is subject to viability, and information in these Unit Outlines is subject to change prior to commencement of the teaching period.

Overview

Data analytics has become a popular way to support decision-making by turning an organization's large collection of data into useful knowledge about their customers and business processes. Data analytics has direct applications in several fields such as social networks, business processes, search-engines, e-commerce, digital libraries, bioinformatics and web information systems. This unit provide fundamental knowledge and skills of data analytics to help with data-driven decision making. You will learn the different types of data mining techniques to apply classification, clustering and association mining. You will learn how the processing can be applied to text and web usage data. This is an introductory unit and the knowledge and skills developed in this unit are relevant to all IT professionals. It builds on CAB220 - Fundamentals of Data Science which introduces the basic concepts of data manipulation.

Learning Outcomes

On successful completion of this unit you will be able to:

  1. Analyse the effectiveness of data and Web analytics methods and tools when applied to real-world problems;
  2. Plan and manage your mining projects effectively from the start and avoid pitfalls in data preparation, modelling, and results interpretation;
  3. Identify appropriate problems for data/Web mining and Integrate data/Web analytics solutions into business and technical infrastructures of organizations; and
  4. Work collaboratively in small groups in order to maximise efficiency of managerial decisions related to data analytics projects.
  5. Communicate clearly in written, oral and visual formats to specialist and non-specialist audiences.

Content

The following topics will be covered.

  • Introduction to Data Mining and Knowledge Discovery
  • The knowledge discovery process and methodology;
  • Data preparation for knowledge discovery
  • Classification and prediction
  • Clustering
  • Link Analysis
  • Text Mining
  • Web Mining

Learning Approaches

This subject will be delivered through the following means:

  • Pre-recorded lectures which provide the theoretical basis of the subject;
  • Practicals (2 hours) which allow you to apply theory to practical (industry data-driven) problems using available software tools and implementation exercises.
  • Interactive Q&A session (1 hour) weekly


The learning process will be focused on real-world scenarios. Emphasis will be placed on theoretical work, laboratory exercises and case studies. The review exercises will be designed to reinforce key concepts and to assist in the completion of assessments. Problem handling assessments will be drawn from typical industry applications and real world data sources. You are also encouraged to use data from your field of interest.

Feedback on Learning and Assessment

You can obtain feedback on your progress throughout the unit through asking the teaching staff for advice and assistance during lectures and practical sessions. You are encouraged to contact the lecturer personally for seeking feedback. The assessments will be marked according to a criteria sheet and returned to you within two weeks of submission.

Assessment

Overview

The assessments in this unit are designed for you to demonstrate a critical understanding of the data and web analytics concepts acquired during the lectures, as well as the application of these concepts in real-world application settings acquired during practicals. The quizzes will allow you to demonstrate your understanding of the methods and challenges associated with data and web analytics. Assessment criteria will be made available to you at the introduction of each assessment.

Unit Grading Scheme

7- point scale

Assessment Tasks

Assessment: Case Study

Predictive Data Analytics
Case Study 1 includes mining meaningful information from the underlying data after applying predictive mining techniques.

This is an assignment for the purposes of an extension.

Weight: 25
Individual/Group: Group
Due (indicative): Mid Semester
Related Unit learning outcomes: 1, 2, 3, 4, 5

Assessment: Project (applied)

Descritive Data Mining
Application of clustreing and link analysis on entrprise, document and web data.

This is an assignment for the purposes of an extension.

Weight: 25
Individual/Group: Group
Due (indicative): Late Semester
Related Unit learning outcomes: 1, 2, 3, 4, 5

Assessment: Examination

This will assess your learning from the entire semester. This exam will consist of MCQ and short and long form  questions.

Weight: 50
Individual/Group: Individual
Due (indicative): Central Examination Period
Central exam duration: 2:40 - No perusal
Related Unit learning outcomes: 1, 2, 3

Academic Integrity

Students are expected to engage in learning and assessment at QUT with honesty, transparency and fairness. Maintaining academic integrity means upholding these principles and demonstrating valuable professional capabilities based on ethical foundations.

Failure to maintain academic integrity can take many forms. It includes cheating in examinations, plagiarism, self-plagiarism, collusion, and submitting an assessment item completed by another person (e.g. contract cheating). It can also include providing your assessment to another entity, such as to a person or website.

You are encouraged to make use of QUT’s learning support services, resources and tools to assure the academic integrity of your assessment. This includes the use of text matching software that may be available to assist with self-assessing your academic integrity as part of the assessment submission process.

Further details of QUT’s approach to academic integrity are outlined in the Academic integrity policy and the Student Code of Conduct. Breaching QUT’s Academic integrity policy is regarded as student misconduct and can lead to the imposition of penalties ranging from a grade reduction to exclusion from QUT.

Resources


Followings will also be used in addition to the text book.

  • Lecture notes on Canvas.
  • Various selected papers from the literature (provided via Canvas).

    You are strongly encouraged to read recommended references and articles pertaining to this unit.

    No extraordinary charges or costs are associated with the requirements of this unit.

Resource Materials

Prescribed text(s)

Author: J. Han and M. Kamber, Title: Data Mining Concepts and Techniques, Morgan Kaufmann, 2012

This book is available as an e-book in the library. This book mainly contains the material covered in lectures from week 1 to week 8. Sufficient materials will be provided to you via handouts or online links for the lectures from week 9 to week 13.

Risk Assessment Statement

There are no out of the ordinary risks associated with this unit. It is your responsibility to familiarise yourself with the Health and Safety policies and procedures applicable within campus areas and laboratories.

Course Learning Outcomes

This unit is designed to support your development of the following course/study area learning outcomes.

DS01 Bachelor of Data Science

  1. Communicate effectively in a variety of modes, to expert and non-expert audiences, including in a professional context.
    Relates to: ULO5