The table below provides a preview of the full dataset, which contains over a million prices. We recommend that you download and work with the data in a coding environment. This data should be used in conjuction with the Item Lookup and Premise Lookup tables.
0 viewsΒ·0 downloads
Prices are collected and verified by groundstaff on a daily basis, with over 2 million prices collected every month.
This data is collected for the purpose of price surveillance, and is excellent for high-frequency analysis of specific items in specific locations. Inflation surveillance requires a different approach, in particular to ensure proper representativeness. Inflation analysis should be conducted using DOSM's CPI data.
β
The table below provides a preview of the full dataset, which contains over a million prices. We recommend that you download and work with the data in a coding environment. This data should be used in conjuction with the Item Lookup and Premise Lookup tables.
Name in Dataset | Variable | Definition |
---|---|---|
date (Date) | Date | The date in YYYY-MM-DD format |
premise_code (Categorical) | Premise Code | Integer representing the premise, to be mapped using the Premise Lookup Table. The lookup table will give you the premise name, address, district, and state. |
item_code (Categorical) | Item Code | Integer representing the item, to be mapped using the Item Lookup Table. The lookup table will give you the item name, unit of measurement, and categorisation. |
price (Float) | Price | Price in RM |
01 Nov 2022, 12:00
N/A
This data is made open under the Creative Commons Attribution 4.0 International License (CC BY 4.0). A copy of the license is available Here.
Full Dataset (CSV)
Recommended for individuals seeking an Excel-friendly format.
0
Full Dataset (Parquet)
Recommended for data scientists seeking to work with data via code.
0
Connect directly to the data with Python.
# If not already installed, do: pip install pandas fastparquet
import pandas as pd
URL_DATA = 'https://storage.data.gov.my/pricecatcher/pricecatcher_2022-10.parquet'
df = pd.read_parquet(URL_DATA)
if 'date' in df.columns: df['date'] = pd.to_datetime(df['date'])
print(df)
This data catalog is not available through OpenAPI as the nature of the data makes it unsuitable for API access. For the full dataset, please use the provided download link as shown in the above section.
Β© 2024 Public Sector Open Data