My robot would be sitting at a table, and it would look down at the objects on that table. It would be able identify the objects on the table: salt, pepper, a can of beer, a knife, a plate a spoon, and carrots on the plate. It would also know the coordinates of each object in 3D space of each object. Knowing that it would reach out and pick up the beer. I think Tensorflow can do some of this already. Microsoft Cognitive vision gives you the objects it sees, but not the locations.
Discover more robots
Robot56's Just Another Omnibot
Omnibot Tomy restoration: camera and radar servo working, EZ-B Bluetooth connection troubleshooting, awaiting motor...
Cliffordkoperski's HAL THE ROBOT THAT WALKS AND TALKS
5.5ft humanoid walks, talks-56 servos on 3 EZB v4 boards via WiFi, two batteries, onboard PC; Synthiam ARC scripting...
Nomad's Adventurebot Cherry
Pink & cherry AdventureBot - complete original unit repainted; cherry camera, light-pink dome, painted wheels and...
