Good list! I think it's important to note that this article is (intentionally) focused on modern CNN architectures, and not "deep learning" in general.
I'd also add in the following "technique" articles: Geoff Hinton et al.'s dropout paper and Loffe and Szegedy's Batch Normalization paper. I don't think there's been enough time for the dust to settle, but I'm excited about the possibilities Stochastic Depth could offer, too.