- The data obtained for 26 May 2024 to 17 August 2024 inclusive covers a 12-week period containing an entry for every product captured by Coles and Woolworths Trends for the respective supermarket website each day. The data was scraped from each website's site-map, so products with an active URL were captured.
- The product's price and availability was collected. Any product that was available for less than 70 days was omitted from the analysis.
- Four extreme outliers — where the price moved more than 20 times in the given period — were omitted from the analysis.
- Online prices are understood to be largely the same as in-store prices, with differences depending on location and perishable items. Online-only specials have been included in the analysis, which represent a very small proportion of the total products analysed.
- Both the Coles and Woolworths data had between 2-5 days missing for various products. It was assumed the products affected were unavailable on that day.
- Cigarettes were removed from the category analysis as there were only three products represented.
- The recorded price change may not have happened on that exact day listed, but on a day or two earlier due to the timing of the data collection.
- Not every line of the dataset was validated, but it was cross-checked against open-source supermarket price platform HotPrices.org for its available data. The method was validated after the fact with a manual spot check of more than 200 products across four weeks after the analysed period. The platforms Coles Trends and Woolworths Trends are also public facing and have been open to scrutiny since July.
Posted , updated