AI Technologies

The Australian Artificial Intelligence Institute is at the forefront of developing the full range of new AI technologies.

Advanced AI Tech

The Australian Artificial Intelligence Institute has developed many pioneering AI technologies, including:

and much more.

Read about some of AAII's new and emerging AI tech innovations:

RFNN: Robust Fuzzy Neural Network With an Adaptive Inference Engine

SSD: Multidomain Adaptation With Sample and Source Distillation

DCA: Dynamic Classifier Alignment for Unsupervised Multi-Source Domain Adaptation

MSCLDA: Multi-source contribution learning for domain adaptation

SAGN: Multi-stream concept drift self-adaptation using graph neural network

MCIMO: Multiclass Classification With Fuzzy-Feature Observations: Theory and Algorithms

LIR-eGB: Evolving Gradient Boost - A Pruning Scheme Based on Loss Improvement Ratio for Learning under Concept Drift

SummAttacker: Improving the Robustness of Summarization Systems with Dual Augmentation

CoTASP: Continual Task Allocation in Meta-Policy Network via Sparse Prompting

FPF and k-FPF: Does Continual Learning Equally Forget All Parameters?

PFedRec: Dual Personalization on Federated Recommendation

Prompt Federated Learning for Weather Forecasting: Toward Foundation Models on Meteorological Data

UnifieR: A Unified Retriever for Large-Scale Retrieval

CHBias: Bias Evaluation and Mitigation of Chinese Conversational Language Models

MorAL: Stay Moral and Explore - Learn to Behave Morally in Text-based Games

Robust H∞ Pinning Synchronization for Multiweighted Coupled Reaction–Diffusion Neural Networks

Suboptimal Leader-to-Coordination Control for Nonlinear Systems With Switching Topologies: A Learning-Based Method

Approximate Optimal Control for Nonlinear Systems with Periodic Event-Triggered Mechanism

AutoGMap: Learning to Map Large-Scale Sparse Graphs on Memristive Crossbars

Designing Efficient Bit-Level Sparsity-Tolerant Memristive Networks

Robust Gaussian Process Regression With Input Uncertainty: A PAC-Bayes Perspective

Disentangling Stochastic PDE Dynamics for Unsupervised Video Prediction

An Extremely Simple Algorithm for Source Domain Reconstruction

Is Out-of-Distribution Detection Learnable?

Attention-Bridging TS Fuzzy Rules for Universal Multi-Domain Adaptation without Source Data

Meta OOD Learning for Continuously Adaptive OOD Detection

Source-Free Multi-Domain Adaptation with Fuzzy Rule-based Deep Neural Networks

Bibliometric analysis of parasite vaccine research from 1990 to 2019

Machine learning for administrative health records: A systematic review of techniques and applications

A state-of-the-art methodology for high-throughput in silico vaccine discovery against protozoan parasites and exemplified with discovered candidates for Toxoplasma gondii

A review of the machine learning datasets in mammography, their adherence to the FAIR principles and the outlook for the future

Data driven science for clinically actionable knowledge in diseases

Inferring Actual Treatment Pathways from Patient Records

Objective: Treatment pathways are step-by-step plans outlining the recommended medical care for specific diseases; they get revised when different treatments are found to improve patient outcomes. Examining health records is an important part of this revision process, but inferring patients’ actual treatments from health data is challenging due to complex event-coding schemes and the absence of pathway-related annotations. The objective of this study is to develop a method for inferring actual treatment steps for a particular patient group from administrative health records — a common form of tabular healthcare data — and address several technique- and methodology-based gaps in treatment pathway-inference research.

Methods: We introduce Defrag, a method for examining health records to infer the real-world treatment steps for a particular patient group. Defrag learns the semantic and temporal meaning of healthcare event sequences, allowing it to reliably infer treatment steps from complex healthcare data. To our knowledge, Defrag is the first pathway-inference method to utilise a neural network (NN), an approach made possible by a novel, self-supervised learning objective. We also developed a testing and validation framework for pathway inference, which we use to characterise and evaluate Defrag’s pathway inference ability, establish benchmarks, and compare against baselines.

Results: We demonstrate Defrag’s effectiveness by identifying best-practice pathway fragments for breast cancer, lung cancer, and melanoma in public healthcare records. Additionally, we use synthetic data experiments to demonstrate the characteristics of the Defrag inference method, and to compare Defrag to several baselines, where it significantly outperforms non-NN-based methods.

Conclusions: Defrag offers an innovative and effective approach for inferring treatment pathways from complex health data. Defrag significantly outperforms several existing pathway-inference methods, but computationally-derived treatment pathways are still difficult to compare against clinical guidelines. Furthermore, the open-source code for Defrag and the testing framework are provided to encourage further research in this area.

AAII investigator: Paul Kennedy

AAII research lab: Biomedical Data Science Lab

Publication details: A. Caruana, M. Bandara, K. Musial, D. Catchpoole, Paul J. Kennedy, Inferring Actual Treatment Pathways from Patient Records, Journal of Biomedical Informatics. 148 (2023) 104554.

Bibliometric analysis of parasite vaccine research from 1990 to 2019

Machine learning for administrative health records: A systematic review of techniques and applications

Machine learning provides many powerful and effective techniques for analysing heterogeneous electronic health records (EHR). Administrative Health Records (AHR) are a subset of EHR collected for administrative purposes, and the use of machine learning on AHRs is a growing subfield of EHR analytics. Existing reviews of EHR analytics emphasise that the data-modality of the EHR limits the breadth of suitable machine learning techniques, and pursuable healthcare applications. Despite emphasising the importance of data modality, the literature fails to analyse which techniques and applications are relevant to AHRs. AHRs contain uniquely well-structured, categorically encoded records which are distinct from other data-modalities captured by EHRs, and they can provide valuable information pertaining to how patients interact with the healthcare system.

This paper systematically reviews AHR-based research, analysing 70 relevant studies and spanning multiple databases. We identify and analyse which machine learning techniques are applied to AHRs and which health informatics applications are pursued in AHR-based research. We also analyse how these techniques are applied in pursuit of each application, and identify the limitations of these approaches. We find that while AHR-based studies are disconnected from each other, the use of AHRs in health informatics research is substantial and accelerating. Our synthesis of these studies highlights the utility of AHRs for pursuing increasingly complex and diverse research objectives despite a number of pervading data- and technique-based limitations. Finally, through our findings, we propose a set of future research directions that can enhance the utility of AHR data and machine learning techniques for health informatics research.

AAII investigator: Paul Kennedy

AAII research lab: Biomedical Data Science Lab

Publication details: A. Caruana, M. Bandara, K. Musial, D. Catchpoole, Paul J. Kennedy, Machine learning for administrative health records: A systematic review of techniques and applications. Artificial Intelligence In Medicine (2023), Volume 144, October 2023, 102642. (DOI: https://doi.org/10.1016/j.artmed.2023.102642).

A state-of-the-art methodology for high-throughput in silico vaccine discovery against protozoan parasites and exemplified with discovered candidates for Toxoplasma gondii

A review of the machine learning datasets in mammography, their adherence to the FAIR principles and the outlook for the future

Ventral and Dorsal Stream EEG Channels: Key Features for EEG-Based Object Recognition and Identification (HAI Centre)

Object recognition and object identification are multifaceted cognitive operations that require various brain regions to synthesize and process information. Prior research has evidenced the activity of both visual and temporal cortices during these tasks. Notwithstanding their similarities, object recognition and identification are recognized as separate brain functions. Drawing from the two-stream hypothesis, our investigation aims to understand whether the channels within the ventral and dorsal streams contain pertinent information for effective model learning regarding object recognition and identification tasks. By utilizing the data we collected during the object recognition and identification experiment, we scrutinized EEGNet models, trained using channels that replicate the two-stream hypothesis pathways, against a model trained using all available channels. The outcomes reveal that the model trained solely using the temporal region delivered a high accuracy level in classifying four distinct object categories. Specifically, the object recognition and object identification models achieved an accuracy of 89% and 85%, respectively. By incorporating the channels that mimic the ventral stream, the model’s accuracy was further improved, with the object recognition model and object identification model achieving an accuracy of 95% and 94%, respectively. Further- more, the Grad-CAM result of the trained models revealed a significant contribution from the ventral and dorsal stream channels toward the training of the EEGNet model. The aim of our study is to pinpoint the optimal channel configuration that provides a swift and accurate brain--computer interface system for object recognition and identification.

Investigators: Daniel Leong, Thomas (Tien-Thong) Do, CT Lin.

All authors are with GrapheneX-UTS Human-centric Artificial Intelligence Centre (HAI) and Australian Artificial Intelligence Institute (AAII).

Publication details: Leong D, Do T, Lin CT. Ventral and Dorsal Stream EEG Channels: Key Features for EEG-Based Object Recognition and Identification. IEEE Trans Neural Syst Rehabil Eng. (DOI: 10.1109/TNSRE.2023.3339698).

research introduction

Fine-Grained Distillation for Long Document Retrieval

Causal Reinforcement Learning: A Survey

Structured Federated Learning through Clustered Additive Modeling

Is heterogeneity notorious? Taming heterogeneity to handle test-time shift in federated learning

False Correlation Reduction for Offline Reinforcement Learning

Human-Guided Moral Decision Making in Text-based Games

CITB: A Benchmark for Continual Instruction Tuning

How Do Large Language Models Capture the Ever-changing World Knowledge? A Review of Recent Advances

Turn-Level Active Learning for Dialogue State Tracking

Toward Autonomous Distributed Clustering

Enhanced Adjacency-constrained Hierarchical Clustering using Fine-grained Pseudo Labels

Hierarchical clustering is able to provide partitions of different granularity levels. However, most existing hierarchical clustering techniques perform clustering in the original feature space of the data, which may suffer from overlap, sparseness, or other undesirable characteristics, resulting in noncompetitive performance. In the field of deep clustering, learning representations using pseudo labels has recently become a research hotspot. Yet most existing approaches employ coarse-grained pseudo labels, which may contain noise or incorrect labels. Hence, the learned feature space does not produce a competitive model. In this paper, we introduce the idea of fine-grained labels of supervised learning into unsupervised clustering, giving rise to the enhanced adjacency-constrained hierarchical clustering (ECHC) model. The full framework comprises four steps. One, adjacency-constrained hierarchical clustering (CHC) is used to produce relatively pure fine-grained pseudo labels. Two, those fine-grained pseudo labels are used to train a shallow multilayer perceptron to generate good representations. Three, the corresponding representation of each sample in the learned space is used to construct a similarity matrix. Four, CHC is used to generate the final partition based on the similarity matrix. The experimental results show that the proposed ECHC framework not only outperforms 14 shallow clustering methods on eight real-world datasets but also surpasses current state-of-the-art deep clustering models on six real-world datasets. In addition, on five real-world datasets, ECHC achieves comparable results to supervised algorithms.

AAII Investigators: Jie Yang, CT Lin

AAII research lab: Computational Intelligence and Brain-Computer Interface Lab (CIBCI Lab)

Funding source: Australian Research Council (ARC) under discovery grant DP210101093 and discovery grant DP220100803

Publication details: IEEE Transactions on Emerging Topics in Computational Intelligence

Authors are also with the Human-centric Artificial Intelligence Centre (HAI)

Online Boosting Adaptive Learning under Concept Drift for Multistream Classification

Deep Reinforcement Learning in Nonstationary Environments With Unknown Change Points

Domain Adaptation with Interval-valued Observations: Theory and Algorithms

Unsupervised Domain Adaptation (UDA) focuses on enhancing the model performance on an unlabeled target domain by leveraging knowledge from a source domain. The source and target domains usually share different distributions. Existing UDA research primarily concentrates on image data characterized by crisp-valued features. However, interval-valued data, where all the observations’ features are described by intervals, is also a common type of data in real-world scenarios. For instance, measurement instruments are unable to provide exact numerical outcomes, instead employing intervals to describe their results. Hence, this paper focuses on the highly challenging context known as domain adaptation with interval-valued observations. In this environment, the objective is to improve classification accuracy within an unlabeled target domain by capitalizing on knowledge gleaned from a labeled source domain, where both domains exclusively feature interval-valued observations. To address this, we first establish an upper bound on the risk in the interval-valued target domain, underpinning our analysis with rigorous theoretical insights. Subsequently, guided by our theoretical analysis, a new model based on Takagi-Sugeno Fuzzy rules and a Self-supervised Pseudo-labeling strategy (SP-TSF) is developed to address the proposed problem. Takagi-Sugeno fuzzy rules are harnessed to handle the inherent uncertainty intrinsic to interval-valued data, while a pseudo-labeling strategy is developed to augment distribution alignment between the source and target domains, each characterized by interval-valued observations. Extensive experiments on both synthetic and realworld datasets verify the rationality of our theoretical analysis and the efficacy of the proposed model.

AAII investigators: Guangzhi Ma; Jie Lu; Feng Liu; Zhen Fang; Guangquan Zhang

AAII research lab: Decision Systems and e-Service Intelligence Lab (DeSI Lab)

Funding source: Australian Research Council (ARC) under Laureate project FL190100149

Publication details: IEEE Transactions on Fuzzy Systems

Advanced AI Tech

RFNN: Robust Fuzzy Neural Network With an Adaptive Inference Engine

SSD: Multidomain Adaptation With Sample and Source Distillation

DCA: Dynamic Classifier Alignment for Unsupervised Multi-Source Domain Adaptation

MSCLDA: Multi-source contribution learning for domain adaptation

SAGN: Multi-stream concept drift self-adaptation using graph neural network

MCIMO: Multiclass Classification With Fuzzy-Feature Observations: Theory and Algorithms

LIR-eGB: Evolving Gradient Boost - A Pruning Scheme Based on Loss Improvement Ratio for Learning under Concept Drift

SummAttacker: Improving the Robustness of Summarization Systems with Dual Augmentation

CoTASP: Continual Task Allocation in Meta-Policy Network via Sparse Prompting

FPF and k-FPF: Does Continual Learning Equally Forget All Parameters?

PFedRec: Dual Personalization on Federated Recommendation

Prompt Federated Learning for Weather Forecasting: Toward Foundation Models on Meteorological Data

UnifieR: A Unified Retriever for Large-Scale Retrieval

CHBias: Bias Evaluation and Mitigation of Chinese Conversational Language Models

MorAL: Stay Moral and Explore - Learn to Behave Morally in Text-based Games

Robust H∞ Pinning Synchronization for Multiweighted Coupled Reaction–Diffusion Neural Networks

Suboptimal Leader-to-Coordination Control for Nonlinear Systems With Switching Topologies: A Learning-Based Method

Approximate Optimal Control for Nonlinear Systems with Periodic Event-Triggered Mechanism

AutoGMap: Learning to Map Large-Scale Sparse Graphs on Memristive Crossbars

Designing Efficient Bit-Level Sparsity-Tolerant Memristive Networks

Robust Gaussian Process Regression With Input Uncertainty: A PAC-Bayes Perspective

Disentangling Stochastic PDE Dynamics for Unsupervised Video Prediction

An Extremely Simple Algorithm for Source Domain Reconstruction

Is Out-of-Distribution Detection Learnable?

Attention-Bridging TS Fuzzy Rules for Universal Multi-Domain Adaptation without Source Data

Meta OOD Learning for Continuously Adaptive OOD Detection

Source-Free Multi-Domain Adaptation with Fuzzy Rule-based Deep Neural Networks

Bibliometric analysis of parasite vaccine research from 1990 to 2019

Machine learning for administrative health records: A systematic review of techniques and applications

A state-of-the-art methodology for high-throughput in silico vaccine discovery against protozoan parasites and exemplified with discovered candidates for Toxoplasma gondii

A review of the machine learning datasets in mammography, their adherence to the FAIR principles and the outlook for the future

Data driven science for clinically actionable knowledge in diseases

Inferring Actual Treatment Pathways from Patient Records

Bibliometric analysis of parasite vaccine research from 1990 to 2019

Machine learning for administrative health records: A systematic review of techniques and applications

A state-of-the-art methodology for high-throughput in silico vaccine discovery against protozoan parasites and exemplified with discovered candidates for Toxoplasma gondii

A review of the machine learning datasets in mammography, their adherence to the FAIR principles and the outlook for the future

Ventral and Dorsal Stream EEG Channels: Key Features for EEG-Based Object Recognition and Identification (HAI Centre)

Fine-Grained Distillation for Long Document Retrieval

Causal Reinforcement Learning: A Survey

Structured Federated Learning through Clustered Additive Modeling

Is heterogeneity notorious? Taming heterogeneity to handle test-time shift in federated learning

False Correlation Reduction for Offline Reinforcement Learning

Human-Guided Moral Decision Making in Text-based Games

CITB: A Benchmark for Continual Instruction Tuning

How Do Large Language Models Capture the Ever-changing World Knowledge? A Review of Recent Advances

Turn-Level Active Learning for Dialogue State Tracking

Toward Autonomous Distributed Clustering

Enhanced Adjacency-constrained Hierarchical Clustering using Fine-grained Pseudo Labels

Online Boosting Adaptive Learning under Concept Drift for Multistream Classification

Deep Reinforcement Learning in Nonstationary Environments With Unknown Change Points

Domain Adaptation with Interval-valued Observations: Theory and Algorithms

Multi-source Domain Adaptation with Interval-Valued Target Data via Fuzzy Neural Networks