* There is a mistake common to all the main proofs. In summary, what we
find are saddle points or global maxima of the respective loss functions and
not the global minima. We apologize for this
* Will be presented at the workshop "Analyzing and interpreting neural
networks for NLP", collocated with the EMNLP 2018 conference in Brussels