The challenges in teaching machines to see like humans

An AI system trained on a large number of different website templates can cluster them and display a distinct web page layout for each user group. It sounds simple, but how do we get machines to perceive visuals and layout patterns the way humans do? A few challenges have to be overcome first.