IFN509 Data Exploration and Mining


To view more information for this unit, select Unit Outline from the list below. Please note the teaching period for which the Unit Outline is relevant.


Unit Outline: Semester 1 2024, Gardens Point, Internal

Unit code:IFN509
Credit points:12
Pre-requisite:(Students in IN17 to apply for requisite waiver) or (IFN554 or IFQ554 (Concurrent pre-requisite) or enrolment in IN15 or IQ15 or IFN555 or IFQ555 (Concurrent pre-requisite)) or (192cps in EV08 or enrolment in IV58). Note: IFN554, IFQ554, IFN555 and IFQ555 can be enrolled in the same teaching period as IFN509.
Equivalent:IFQ509
Assumed Knowledge:

Basic statistics functions

Anti-requisite:INN342, INN343
Coordinator:Richi Nayak | r.nayak@qut.edu.au
Disclaimer - Offer of some units is subject to viability, and information in these Unit Outlines is subject to change prior to commencement of the teaching period.

Overview

This fundamental data science unit addresses the core concepts, techniques and practices of data exploration and mining. In the information age, with astronomical amounts of data produced and made available every minute, data exploration and mining becomes necessary for individuals and organisations who need to make decisions. With the advancements in data storage technology and the need for automation, data analytics skills are now essential. Data analytics methods enable users to manage, interpret, understand, process and analyse the data to find useful insight. This unit will introduce you to a wide range of data analytics methods and theories to manipulate, model and analyze data. 

This is an introductory unit and the knowledge and skills developed in this unit are relevant to both computer science and non-computer science majors.

Learning Outcomes

On successful completion of this unit you will be able to:

  1. Analyse data driven scenarios and scope data analytics questions.
  2. Choose and apply exploratory data analysis methods to understand the data and enhance the data quality.
  3. Apply appropriate basic data mining operations and methods to model the underlying data problem.
  4. Create reproducible data analysis pipelines.
  5. Work independently or in a team to implement a data analytics project.
  6. Communicate professionally in written and visual formats the findings data analysis project to specialist and non-specialist audiences.

Content

Data analytics is a multi-disciplinary field that includes methods from statistics, mathematics, machine learning, data mining, artificial intelligence and computer science, to find useful insights from datasets. In this unit you will learn the exploratory data analysis methods to cleanse, manipulate and represent data to find simple insights and interpretations from large data repositories. You will learn how to investigate and prevent issues around data quality. You will develop a strong understanding of abstract data representations, that is, how to extract and select features from a range of data types (including text, web and social media). You will learn the fundamental data mining operations and methods to model the underlying data problem using the pre-processed datasets.

Learning Approaches

This subject will be delivered through the following means:

  • Lectures (2 hours), which provide the theoretical basis of the subject.
  • Tutorials (1 hours, which provide an opportunity to work in groups on case studies to investigate and develop data science architectures.

  • Computer tutorial (1 hour), which allow you to apply theory to practical (industry data-driven) problems using available software tools.

    The learning process will be focused on real-world scenarios. Emphasis will be placed on theoretical work, laboratory exercises and case studies. The exercises will be designed to reinforce key concepts and to assist in the completion of assessments. Problem handling assessments will be drawn from typical industry applications and data sources.

Feedback on Learning and Assessment

You can obtain feedback on your progress throughout the unit through asking the teaching staff for advice and assistance during lectures and practical sessions.

The assessments will be marked according to a criteria sheet and returned to you within two weeks of submission.

You can request private consultation with teaching staff.

Assessment

Overview

This unit includes two summative assessment items during the semester and a final examination. Summative assessments are designed to give students a hands-on approach to deal with various issues that arise in a data driven analysis. Students will be provided with real-world case studies for analysing data quality and for conducting data pre-processing, data exploration and data mining. Students will learn how to choose a particular method that is appropriate to the problem at hand, and you will be given similar tasks in tutorial and practicals that will help them to successfully complete the assessment items. More details of each assessment task are given below

Unit Grading Scheme

7- point scale

Assessment Tasks

Assessment: Problem Solving Task

Develop solutions by applying data manipulation strategies and techniques to solve the given problems.

This assignment is eligible for the 48-hour late submission period and assignment extensions.

 

Weight: 20
Length: 5 weeks
Individual/Group: Individual and group
Due (indicative): Week 8
Related Unit learning outcomes: 2, 4, 5, 6

Assessment: Project (applied)

Design and Implement a data mining project based on analysis of a data analysis scenario and development of data analytics workflow, including interpretation and communication of results along with critical reflection regarding the applied methods.

This assignment is eligible for the 48-hour late submission period and assignment extensions.

 

Weight: 40
Length: 5 weeks
Individual/Group: Individual and group
Due (indicative): Week 13
Related Unit learning outcomes: 1, 3, 4, 5, 6

Assessment: On campus (in-person) Exam

Quiz / short answer questions and more advanced problems.

Weight: 40
Individual/Group: Individual
Due (indicative): Central Examination Period
Central exam duration: 2:40 - Including 10 minute perusal
Related Unit learning outcomes: 1, 2, 3

Academic Integrity

Students are expected to engage in learning and assessment at QUT with honesty, transparency and fairness. Maintaining academic integrity means upholding these principles and demonstrating valuable professional capabilities based on ethical foundations.

Failure to maintain academic integrity can take many forms. It includes cheating in examinations, plagiarism, self-plagiarism, collusion, and submitting an assessment item completed by another person (e.g. contract cheating). It can also include providing your assessment to another entity, such as to a person or website.

You are encouraged to make use of QUT’s learning support services, resources and tools to assure the academic integrity of your assessment. This includes the use of text matching software that may be available to assist with self-assessing your academic integrity as part of the assessment submission process.

Further details of QUT’s approach to academic integrity are outlined in the Academic integrity policy and the Student Code of Conduct. Breaching QUT’s Academic integrity policy is regarded as student misconduct and can lead to the imposition of penalties ranging from a grade reduction to exclusion from QUT.

Resources

No extraordinary charges or costs are associated with the requirements for this unit. Where applicable, online resources will be provided to the student as additional materials.


Resource Materials

Recommended text(s)

J. Han, M. Kamber and J. Pei, Data Mining: Concepts and Techniques, 3rd edition, 2012

Risk Assessment Statement

There are no unusual health or safety risks associated with this unit.

Course Learning Outcomes

This unit is designed to support your development of the following course/study area learning outcomes.

EN72 Master of Advanced Robotics and Artificial Intelligence

  1. Demonstrate and apply advanced and specialist discipline knowledge, concepts and practices in Advanced Robotics and AI and Data Analytics domains
    Relates to: On campus (in-person) Exam
  2. Critically analyse, evaluate and apply appropriate methods to problems to achieve research-informed solutions in Advanced Robotics and AI and Data Analytics domains
    Relates to: Problem Solving Task, On campus (in-person) Exam
  3. Apply systematic approaches to plan, design, execute and manage projects in Advanced Robotics and AI and Data Analytics domains
    Relates to: Project (applied)
  4. Communicate complex information effectively and succinctly in oral and written form for diverse purposes and audiences
    Relates to: Project (applied)
  5. Work independently and collaboratively demonstrating ethical and socially responsible practice
    Relates to: Project (applied)

EN75 Master of Sustainable Infrastructure with Data Analytics

  1. Demonstrate and apply advanced and specialist discipline knowledge, concepts, methods and practices as they relate to contemporary practice in Sustainable Infrastructure and Data Analytics domains
    Relates to: On campus (in-person) Exam
  2. Analyse and evaluate problems in Sustainable Infrastructure and Data Analytics domains using technical approaches informed by contemporary practice and leading-edge research to achieve evidence based, innovative, critically informed solutions and outcomes
    Relates to: Problem Solving Task, On campus (in-person) Exam
  3. Apply innovative, systematic approaches to plan, design, deliver and manage projects in Sustainable Infrastructure and Data Analytics domains in a way that assures sustainable outcomes and strategic objectives over their whole lifecycle
    Relates to: Project (applied)
  4. Effectively communicate problems in Sustainable Infrastructure and Data Analytics domains, related complex data and information, and solutions in contemporary professional formats for diverse purposes and audiences
    Relates to: Project (applied)
  5. Demonstrate ethically and socially responsible practice, recognising the importance of personal accountability, reflective practice, risk-informed judgements, and leadership
    Relates to: Project (applied)

EN76 Master of Renewable Energy with Data Analytics

  1. Demonstrate and apply advanced and specialist discipline knowledge, concepts, methods and practices as they relate to contemporary practice in Renewable Energy and Data Analytics domains
    Relates to: On campus (in-person) Exam
  2. Analyse and evaluate problems in Renewable Energy and Data Analytics domains using technical approaches informed by contemporary practice and leading-edge research to achieve evidence based, innovative, critically informed solutions and outcomes
    Relates to: Problem Solving Task, On campus (in-person) Exam
  3. Apply innovative, systematic approaches to plan, design, deliver and manage projects in Renewable Energy and Data Analytics domains in a way that assures sustainable outcomes and strategic objectives over their whole lifecycle
    Relates to: Project (applied)
  4. Effectively communicate problems in Renewable Energy and Data Analytics domains, related complex data and information, and solutions in contemporary professional formats for diverse purposes and audiences
    Relates to: Project (applied)
  5. Demonstrate ethically and socially responsible practice, recognising the importance of personal accountability, reflective practice, risk-informed judgements, and leadership
    Relates to: Project (applied)

EN77 Master of Advanced Manufacturing with Data Analytics

  1. Demonstrate and apply advanced and specialist theory-based discipline knowledge, concepts, methods and practices as they relate to contemporary practice in Advanced Manufacturing and Data Analytics domains
    Relates to: On campus (in-person) Exam
  2. Employ advanced specialist technical skills, analysis approaches, design, and data to the solution of problems in Advanced Manufacturing and Data Analytics domains, critically evaluating solutions and practice-informed performance to deliver whole of life requirements and strategic objectives
    Relates to: Problem Solving Task, Project (applied), On campus (in-person) Exam
  3. Implement professional communication and collaborative skills while engaging with stakeholders, exchanging ideas, and presenting complex information to specialist and non-specialist audiences in Advanced Manufacturing and Data Analytics domains
    Relates to: Project (applied)
  4. Demonstrate ethical and socially responsible practice, recognising the importance of personal accountability, and reflective practice, risk-informed judgements, and leadership
    Relates to: Project (applied)

LV41 Bachelor of Biomedical Science

  1. Critically review, analyse and synthesise foundational knowledge in a broad range of biomedical discipline areas and in depth theoretical, technical and practical knowledge in specialised discipline areas.
    Relates to: Problem Solving Task, Project (applied), On campus (in-person) Exam
  2. Demonstrate the cognitive skills required to find solutions to scientific problems.
    Relates to: Problem Solving Task, Project (applied), On campus (in-person) Exam
  3. Contribute effectively to biomedical projects, either as an individual or as a member of a team, by demonstrating professional behaviour and participating in continuous learning.
    Relates to: Problem Solving Task, On campus (in-person) Exam
  4. Apply knowledge and skills to rapidly source, critically analyse and communicate biomedical science information using appropriate technologies.
    Relates to: Project (applied)