Toward a machine learning model that can reason about everyday actions

Summary: Researchers train a model to reach human-level performance at recognizing abstract concepts in video.

Kim Martineau | MIT Quest for Intelligence | Publication Date: August 31, 2020

The ability to reason abstractly about events as they unfold is a defining feature of human intelligence. We know instinctively that crying and writing are means of communicating, and that a panda falling from a tree and a plane landing are variations on descending.

Organizing the world into abstract categories does not come easily to computers, but in recent years researchers have inched closer by training machine learning models on words and images infused with structural information about the world, and how objects, animals, and actions relate. In a new study at the European Conference on Computer Vision this month, researchers unveiled a hybrid language-vision model that can compare and contrast a set of dynamic events captured on video to tease out the high-level concepts connecting them.

Their model did as well as or better than humans at two types of visual reasoning tasks — picking the video that conceptually best completes the set, and picking the video that doesn’t fit. Shown videos of a dog barking and a man howling beside his dog, for example, the model completed the set by picking the crying baby from a set of five videos. Researchers replicated their results on two datasets for training AI systems in action recognition: MIT’s Multi-Moments in Time and DeepMind’s Kinetics.
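As a rough illustration of how these two reasoning tasks might be scored, the sketch below assumes a hypothetical embed(video) function that maps each clip to a concept-level vector; the study's actual hybrid language-vision model is not reproduced here. Set completion picks the candidate closest to the reference clips, and the odd-one-out task picks the clip least similar to the rest.

import numpy as np

def cosine(a, b):
    # Cosine similarity between two embedding vectors.
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def complete_set(reference_embs, candidate_embs):
    # Pick the candidate closest to the mean of the reference clips,
    # i.e. the video that conceptually best completes the set.
    centroid = np.mean(reference_embs, axis=0)
    scores = [cosine(centroid, c) for c in candidate_embs]
    return int(np.argmax(scores))

def odd_one_out(embs):
    # Pick the clip least similar, on average, to the others,
    # i.e. the one that does not share the set's concept.
    avg_sim = [np.mean([cosine(v, w) for j, w in enumerate(embs) if j != i])
               for i, v in enumerate(embs)]
    return int(np.argmin(avg_sim))

In the barking-dog example above, complete_set would return the index of the crying-baby clip among the five candidates, provided the assumed embeddings actually capture the shared "communicating" concept.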

......

Reposted from: MIT News
