For your convenience, we have preprocessed MovieLens data that you can download and use. We have taken the 25M-instance dataset and performed the necessary data cleanup, schema transformation and file structuring. This will allow you to simply upload the preprocessed data to your S3 bucket and proceed with the next steps in this guide.
The preprocessed data can be downloaded here.
If you want to go through the processing steps yourself, you can get the unprocessed Movielens data here: Movielens 25M Dataset.