Data Mining to improve biosecurity risk profiling

Project ID: 1301A
CEBRA Project Leader: Andrew Robinson
DA Sponsor: Raelene Vivian
DA Project Leader: Greg Hood
DA Division: Border Compliance
MPI Project Manager: Christine Reed
Collaborators: ABARES

This project will conduct a series of case studies to test and demonstrate the value of data mining for risk profiling, and will determine how to incorporate these techniques in operational practices. It will:

  1. Draw together the different profiling requirements across the department and investigate which techniques are appropriate for which pathways
  2. Search for commonalities and clear differences in the approaches required.
  3. Develop repeatable analysis algorithms to enable statistical profiling for each case study.
  4. Apply these techniques to the different pathways and data sources in the department in a series of case studies.
  5. Determine how the insights from profiling can be communicated to stakeholders and thereby improve compliance. Develop and then simplify data extraction, preprocessing and transformation techniques so that they can be incorporated into DA IT systems and business practices.

The suite of case studies include:

  1. Spatial referencing of postal addresses (geocoding) to augment compliance data with census data from the Australian Bureau of Statistics (Chris Woodland)
  2. Generalised Pattern Analysis—analyse traveller-related data to identify better risk indicators for risk-profiling passengers (Kathleen Quan)
  3. Analysing patterns of import broker activity to identify and predict non-compliance (Stephen Richardson)
  4. Transforming AIMS data to define import units for risk-return analysis.
  5. Estimating compliance when evidence is scarce or incomplete (Nianjun Liu)
  6. Broad patterns of compliance in imported cargo.
  7. Predicting the frequency of hitchhiker pests based on season, cargo-type and other risk factors (Jamie Brown)

1301A Report

Research program

Data Mining

Not currently logged in: Login