paper reading 12/30/2015~12/31/2015

CVPR13的oral：Cumulative Attribute Space for Age and Crowd Density Estimation

视频地址：http://techtalks.tv/talks/cumulative-attribute-space-for-age-and-crowd-density-estimation/58656/

slide地址：http://www.slideserve.com/quasar/cumulative-attribute-space-for-age-and-crowd-density-estimation

这两天就在看这个工作了，文章主要是introduces a novel cumulative attribute concept for learning a regression model when only sparse and imbalanced data are available。选择的两个命题human age and people-count are strongly correlated and neighbouring values have closer similarities than those further apart。主要看Crowd Density Estimation的部分。

这些日子读文章感觉crowd counting by regression的解决方案都偏数学，废话，regression！这篇文章不是单纯的把feature映射到output，而是先regression得到一个多输出的结果，再做一次regression得到最终的单output。框架图是这样的，说实话，单看是看不懂的！

简单来说，就是测试时先由X得到a（中间结果），再由a得到y。训练的时候，由标好的数据先由y得到a，再学习x到a的变换关系。而惯常的方法是直接去学习x到y的转换关系。

实际上，20个人和21个人区别不是很大，为了利用输出临近值这种相关性，文章中在训练时，有

谓之Cumulative Attribute。其实我觉得这块可以再发掘一下，20个人和21个人的关系，6个人和21个人的关系，其实还是蛮不相同的。

然后就是学习x到a，文章中采用Joint Attribute Learning，estimate the mappings of all m attributes simultaneously by learning a multi-output regression function的方法，数学很重要~就是解下面这个问题：

paper中多次提到他们BMVC2012的一个工作：Feature Mining for Localised Crowd Counting，多输出的中间结果也是利用这个工作完成的。一搜索发现自己早就下载好了一份，元旦这两天好好的读一下。

其他的部分就没什么特别的。文章的实验做得很扎实啊，好像各个对比都很扎实，这还是同时有两个命题的情况下啊！