Upload Data to S3 Bucket

The next step is to unzip the downloaded zip file.

Please give attention to the following in the preprocessed data:

  • The file structure. A more in-depth explanation of file structure can be found in Custom Data Set.
  • Naming conventions of the files
  • The different schemas. The schemas have been included below for your convenience.

MovieLens Schemas:

Item Meta Data (item_meta_data.json)
{
    "name": "item_meta_data",
    "namespace": "com.xineoh.recommender.schema",
    "fields": [
        {
            "name": "item_id",
            "type": "string",
            "label": "movieId"
        },
        {
            "name": "meta_data_1",
            "type": "string",
            "categorical": "false",
            "label": "title",
            "recommender_field": 0
        },
        {
            "name": "meta_data_2",
            "type": "string",
            "categorical": "true",
            "label": "genres",
            "recommender_field": 0
        }
    ],
    "version": "1.0"
}
Interaction Data (user_item_interaction.json)
{
    "name": "user_item_interaction",
    "namespace": "com.xineoh.recommender.schema",
    "fields": [
        {
            "name": "user_id",
            "type": "string"
        },
        {
            "name": "item_id",
            "type": "string"
        },
        {
            "name": "event_type",
            "type": "string",
            "measurement": "ordinal"
        },
        {
            "name": "event_value",
            "type": "float"
        },
        {
            "name": "event_date",
            "type": "timestamp"
        }
    ],
    "version": "1.0"
}
User Meta Data (user_meta_data.json)
{
    "name": "user_meta_data",
    "namespace": "com.xineoh.recommender.schema",
    "fields": [
        {
            "name": "user_id",
            "type": "string",
            "label": "userId"
        }
    ],
    "version": "1.0"
}

Uploaded Data:

Once you have unzipped the data, you will need to upload it to the S3 bucket you created at this guide's start.

Once uploaded, your S3 bucket should have the following files:

  • item_meta_data
    • item_meta_data_ml_25m_xineoh.csv
  • user_item_interaction
    • user_item_interactions_ml_25m.csv
  • user_meta_data
    • user_meta_data_ml_25m.csv
  • schemas
    • item_meta_data.json
    • user_item_interaction.json
    • user_meta_data.json