* A shorter version of this paper appeared at the workshop on
`Critiquing and correcting trends in machine learning` at NeurIPS 2018 Access Paper or Ask Questions
* Accepted for IEEE Transactions on Audio, Speech and Language
Processing, Special Issue on Sound Scene and Event Analysis Access Paper or Ask Questions