Retrieve images using a text query

Implementation of OneEncoder using a single layer on UP for a lightweight demo. Only the COCO train dataset is used in this example (3000 images).
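
As a rough illustration of the retrieval step, the sketch below ranks a set of image embeddings against one text embedding by cosine similarity. It is not the OneEncoder code: the function name `retrieve_top_k`, the embedding size of 64, and the random vectors standing in for encoder outputs are all assumptions made for this example. In the real demo the embeddings would come from the trained model.

```python
import numpy as np


def retrieve_top_k(text_embedding, image_embeddings, k=5):
    """Return (indices, scores) of the k images most similar to the text.

    Hypothetical helper for illustration; similarity is plain cosine similarity.
    """
    # L2-normalize so that the dot product equals cosine similarity.
    text = text_embedding / np.linalg.norm(text_embedding)
    images = image_embeddings / np.linalg.norm(
        image_embeddings, axis=1, keepdims=True
    )
    scores = images @ text                # one score per image
    k = min(k, len(scores))
    top = np.argsort(-scores)[:k]         # highest similarity first
    return top, scores[top]


# Toy stand-ins for encoder outputs (assumption: 3000 images, 64-dim vectors).
rng = np.random.default_rng(0)
image_embeddings = rng.normal(size=(3000, 64))
# Pretend the text query embeds very close to image 42.
text_embedding = image_embeddings[42] + 0.01 * rng.normal(size=64)

indices, scores = retrieve_top_k(text_embedding, image_embeddings, k=3)
print(indices, scores)
```

With only 3000 images, a brute-force similarity over all embeddings is cheap, so no approximate nearest-neighbor index is needed for a demo of this size.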