CAB430 Data and Information Integration


To view more information for this unit, select Unit Outline from the list below. Please note the teaching period for which the Unit Outline is relevant.


Unit Outline: Semester 1 2024, Gardens Point, Internal

Unit code:CAB430
Credit points:12
Pre-requisite:(CAB220 or DSB100 or CAB201 or ITD121) or Admission to EN60
Assumed Knowledge:

CAB201 Programming Principles or equivalent.

Coordinator:Yue Xu | yue.xu@qut.edu.au
Disclaimer - Offer of some units is subject to viability, and information in these Unit Outlines is subject to change prior to commencement of the teaching period.

Overview

With the rapid growth of data and digital repositories, there is an increasing awareness of benefits of data warehousing and mining techniques for Business Intelligence. Data warehousing represents an ideal vision of maintaining a central digital repository of all organizational data that can be smartly used through data mining tools to maximize business profits. Data warehousing is recognized by the IT industry as a dominant technique for applications of databases in the future.

This unit discusses the concepts, architectures and methods of data warehousing and mining techniques, e.g., data warehouse architecture and schema, data cubes and OLAP (on-line analytical processing), ETL (Data Extraction, Transformation and Loading) process, data quality, association analysis and classification. It also focuses on the topics and techniques that are most promising for building and analyzing multidimensional data for efficiently organizing data warehouses and mining tools.

Learning Outcomes

On successful completion of this unit you will be able to:

  1. Explain data warehouse concepts and processes of ETL (Data Extraction, Transformation and Loading) ) and OLAP (on-line analytical processing)
  2. Explain the concepts and algorithms of pattern and association mining, data clustering and classification
  3. Use Microsoft tools to build data warehouses from multiple datasets
  4. Apply data mining algorithms to analyse the data in data warehouses to answer users' queries for decision making
  5. Work in a self-reliant and independent way including the ability to manage time and prioritise activities to achieve deadlines.
  6. Write a professional report to justify the approach taken to build the data warehouse and data analysis

Content

The following topics will be covered:

  • Introduction to SQL server
  • Data warehouse architecture and schema
  • OLAP and Data cubes
  • ETL and data quality
  • Introduction to SQL data mining extension
  • Pattern and sequence mining
  • Association analysis
  • Data clustering
  • Data classification

Learning Approaches

This unit is available for you to study in either on-campus or online mode. You can expect to spend on average between 10 - 15 hours per week involved in preparing for and attending scheduled classes, preparing and completing assessment tasks as well as independent study and consolidation of your learning. 

Each week pre-recorded lectures will be available to introduce the core material and provide the theoretical basis of the subject. Weekly practicals held on-campus or online, contain some exercises relevant to lectures. You should study the lecture material, work out the exercises and check the answers against the solutions, to verify your understanding of the material. To clarify anything, check with the lecturer or tutors either by email or in class. This learning process requires your weekly commitment.

Practicals commence in Week 2. The practicals will reinforce the practical and theory presented in the pre-recorded lecture. The practicals are designed with a focus on development of practical skills. You may use the practical class to ask questions about your assignments.

Feedback on Learning and Assessment

  • You will receive feedback on completed and marked assignments via individual comments on assignment scripts.
  • The teaching staff will be available in person at specified times or via email to answer questions.
  • You can ask the teaching staff for advice and assistance during lectures and practical sessions.
  • For the Final Exam you are referred to the Faculty's formal Rules, Policy and Procedures.

Assessment

Overview

Appropriate assessment criteria will be made available to students at the introduction of each assignment.

Unit Grading Scheme

7- point scale

Assessment Tasks

Assessment: Project (applied)

Data warehousing.

This assignment is eligible for the 48-hour late submission period and assignment extensions.

Weight: 30
Individual/Group: Individual
Due (indicative): Week 7
Related Unit learning outcomes: 3, 5, 6

Assessment: Project (applied)

SQL Data mining

This assignment is eligible for the 48-hour late submission period and assignment extensions.

Weight: 30
Individual/Group: Group
Due (indicative): Week 13
Related Unit learning outcomes: 4, 5, 6

Assessment: Timed Online Assessment

Three (3) hour duration 

Testing Weeks 1-13 Lectures & Practicals.

Weight: 40
Individual/Group: Individual
Due (indicative): Central Examination Period
Exam Period
Related Unit learning outcomes: 1, 2

Academic Integrity

Students are expected to engage in learning and assessment at QUT with honesty, transparency and fairness. Maintaining academic integrity means upholding these principles and demonstrating valuable professional capabilities based on ethical foundations.

Failure to maintain academic integrity can take many forms. It includes cheating in examinations, plagiarism, self-plagiarism, collusion, and submitting an assessment item completed by another person (e.g. contract cheating). It can also include providing your assessment to another entity, such as to a person or website.

You are encouraged to make use of QUT’s learning support services, resources and tools to assure the academic integrity of your assessment. This includes the use of text matching software that may be available to assist with self-assessing your academic integrity as part of the assessment submission process.

Further details of QUT’s approach to academic integrity are outlined in the Academic integrity policy and the Student Code of Conduct. Breaching QUT’s Academic integrity policy is regarded as student misconduct and can lead to the imposition of penalties ranging from a grade reduction to exclusion from QUT.

Resources

No extraordinary charges or costs are associated with the requirements for this unit.

Resource Materials

Recommended text(s)

J. Han and M. Kamber, Data Mining Concepts and Techniques, Morgan Kaufmann, 2011, 3nd edition (QUT Library e-book).

MacLennan, Jamie; Tang, ZhaoHui; and Crivat, Bogdan (2009), Data Mining with Microsoft SQL Server 2008. Wiley Publishing, Inc., (ISBN: 987-470-27774-4) (QUT Library E-book).

Silvers, Fon, Building and maintaining a data warehouse, 2008, Auerbach , (ISBN: 9781420064629), (QUT Library E-book).

Reference book(s)

Chris Leiter et al., Beginning Microsoft SQL Server 2008 Administration, Wrox, Hoboken. Chapter 16, (QUT Library E book)

G. Shmueli, N. R. Patel and P. C. Bruce, Data mining for business intelligence, John Wiley & Sons, 2007.

MacLennan, Jamie; Tang, ZhaoHui; and Crivat, Bogdan (2009), Data Mining with Microsoft SQL Server 2008. Wiley Publishing, Inc., (ISBN: 987-470-27774-4) (QUT Library E-book).

Malinowski, Elzbieta and Zimanyi, Esteban, Advanced data warehouse design, Springer, 2008. Chapter 2 (QUT Library E-book)

R. Elmasri and S. B. Navathe, Fundamentals of Database Systems, 5th Edition, Pearson, 2007.

Other

Microsoft Data Mining Extensions (DMX) online reference

Risk Assessment Statement

There are no out of the ordinary risks associated with this unit.

Course Learning Outcomes

This unit is designed to support your development of the following course/study area learning outcomes.

EN60 Graduate Certificate in Communication for Engineering

  1. Demonstrate and apply specialised knowledge and technical skills in at least one Engineering discipline.
    Relates to: Project (applied), Project (applied), Timed Online Assessment
  2. Critically investigate real world engineering issues and solve complex problems drawing on specialised creative skills, analysis, evaluation and synthesis of discipline knowledge, theory and practice.
    Relates to: Project (applied), Timed Online Assessment
  3. Employ effective written and oral professional communication skills across social, cultural and discipline domains.
    Relates to: Project (applied), Project (applied)
  4. Exercise responsibility and accountability in applying knowledge and skills for own learning and effective practice including working independently, ethically and collaboratively.
    Relates to: Project (applied)