Internship—Shine Solutions Pty Ltd

DAPP - Data Analysis Preparation Pipeline

19 May 2023

This position is offered through the ANU Computing Internship courses (COMP3820 / COMP4820 / COMP8830).

Semester 2, 2023 applications open on Tuesday 23 May and close on Monday 29 May.


Shine Solutions Pty Ltd

We are technology consultants and software developers specialising in Digital Platforms & Apps, AWS Cloud Solutions, and Cloud Managed Services.

Project: DAPP - Data Analysis Preparation Pipeline

It is estimated that the world creates 1000 petabytes of data each day. The insights gained from analysing that data help shape our everyday lives. To gain these insights, the data must first be prepared for analysis.

This can be tricky, because the delivery method, format, schema, and many other aspects of the data can vary from one source to the next, even when the sources cover the same topic. These structural differences make it hard to run a single analysis process across all sources. There is also demand for the resulting insights to be available in real time.

To meet these requirements, data from the different input sources needs to be transformed into a unified structure that the analysis can run against, and the transformation needs to be fast, repeatable and scalable.

Develop a data pipeline to prepare employment profile data. It needs to:

  • Extract employment profile data from different sources and transform it into a standardised data schema.

  • Design a standardised data store, and load the transformed data into it.

  • Run from start to finish, as data is delivered.

  • Be hosted and operated in the cloud.

At a minimum, the demonstration must cover two different data sources.
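The extract, transform and load steps listed above can be sketched as follows. This is only an illustrative sketch, not the expected solution: the `EmploymentProfile` fields, the two source layouts (one CSV, one nested JSON), the function names and the sample records are all hypothetical, and an in-memory list stands in for the real cloud-hosted data store.

```python
import csv
import io
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class EmploymentProfile:
    """Hypothetical unified schema; the real fields are a design decision."""
    full_name: str
    employer: str
    job_title: str
    start_year: int


def from_csv_source(text: str) -> list[EmploymentProfile]:
    """Transform an assumed CSV feed with columns name, company, role, started."""
    rows = csv.DictReader(io.StringIO(text))
    return [
        EmploymentProfile(
            full_name=row["name"].strip(),
            employer=row["company"].strip(),
            job_title=row["role"].strip(),
            start_year=int(row["started"]),
        )
        for row in rows
    ]


def from_json_source(text: str) -> list[EmploymentProfile]:
    """Transform an assumed JSON feed with nested person/position objects."""
    records = json.loads(text)
    return [
        EmploymentProfile(
            full_name=f'{r["person"]["first"]} {r["person"]["last"]}',
            employer=r["position"]["org"],
            job_title=r["position"]["title"],
            # Assumed ISO date string such as "2021-03-01"; keep the year only.
            start_year=int(r["position"]["from"][:4]),
        )
        for r in records
    ]


def load(store: list[EmploymentProfile], batch: list[EmploymentProfile]) -> None:
    """Stand-in for writing to the real data store (here, an in-memory list)."""
    store.extend(batch)


# Made-up sample records, one per source, purely for illustration.
CSV_SAMPLE = "name,company,role,started\nAlex Example,Example Co,Engineer,2020\n"
JSON_SAMPLE = json.dumps([
    {
        "person": {"first": "Sam", "last": "Sample"},
        "position": {"org": "Sample Ltd", "title": "Analyst", "from": "2021-03-01"},
    }
])

store: list[EmploymentProfile] = []
load(store, from_csv_source(CSV_SAMPLE))
load(store, from_json_source(JSON_SAMPLE))
```

Keeping one transform function per source means a new source only needs a new function that returns `EmploymentProfile` records; the load step and anything downstream stay unchanged.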

Performing analysis on the data is not a requirement, but can be added as a stretch task.

Required technical skills

  • Amazon Web Services (AWS), or a desire to learn.

  • TypeScript, Java, Python or Ruby for the transformations.

  • CloudFormation or Terraform for Infrastructure as Code definitions, or a desire to learn.

Required/preferred professional and other skills

Eagerness to learn and a willingness to ask questions.

Delivery Mode

Hybrid (the project can be undertaken in person or remotely)

Student location

The student must be located in Canberra.

Project’s Special Requirements/Conditions

The intern must be an Australian citizen.

Type of internship

The intern/s will be engaged as casual employees.

How to apply

Eligible students are invited to apply for the Computing Internship courses COMP3820, COMP4820 or COMP8830. Eligibility details for COMP3820 / COMP4820 / COMP8830 and further information about the Computing Internship can be found on the Computing Internship page.

Eligible students can apply through the Computing Internship application form, which will be available via the Computing Internship page from Tuesday 23 May 2023 to Monday 29 May 2023.

You can nominate multiple preferred Internship projects and host organisations through the one online application form.

Your eligibility, and whether you have room in your degree to undertake COMP3820/COMP4820/COMP8830, will be assessed at the time of application. If you do not meet the eligibility criteria or do not have room in your degree for COMP3820/COMP4820/COMP8830, your application will not progress.

Your application will require you to upload the following documents:

  • an updated copy of your resume, and
  • an Expression of Interest (limit 350 words) for each organisation you wish to apply to (for organisations with multiple projects, submit only one Expression of Interest but state clearly which project/s you wish to be considered for).

You are on Aboriginal land.

The Australian National University acknowledges, celebrates and pays our respects to the Ngunnawal and Ngambri people of the Canberra region and to all First Nations Australians on whose traditional lands we meet and work, and whose cultures are among the oldest continuing cultures in human history.
