Yoshua Bengio - Publications

Affiliations: 
Université de Montréal, Montréal, Canada 
Area:
Computer Science

194 high-probability publications. We are testing a new system for linking publications to authors. You can help! If you notice any inaccuracies, please sign in and mark papers as correct or incorrect matches. If you identify any major omissions or other inaccuracies in the publication list, please let us know.

Year Citation  Score
2016 Havaei M, Davy A, Warde-Farley D, Biard A, Courville A, Bengio Y, Pal C, Jodoin PM, Larochelle H. Brain tumor segmentation with Deep Neural Networks. Medical Image Analysis. 35: 18-31. PMID 27310171 DOI: 10.1016/j.media.2016.05.004  1
2016 Bengio Y. Machines Who Learn. Scientific American. 314: 46-51. PMID 27196842 DOI: 10.1038/scientificamerican0616-46  1
2016 Haykin S, Wright S, Bengio Y. Big data: Theoretical aspects [Scanning the Issue] Proceedings of the Ieee. 104: 8-10. DOI: 10.1109/JPROC.2015.2507658  1
2016 Bahdanau D, Chorowski J, Serdyuk D, Brakel P, Bengio Y. End-to-end attention-based large vocabulary speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 2016: 4945-4949. DOI: 10.1109/ICASSP.2016.7472618  1
2016 Laurent C, Pereyra G, Brakel P, Zhang Y, Bengio Y. Batch normalized recurrent neural networks Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 2016: 2657-2661. DOI: 10.1109/ICASSP.2016.7472159  1
2016 Gülçehre Ç, Bengio Y. Knowledge matters: Importance of prior information for optimization Journal of Machine Learning Research. 17.  1
2015 LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. 521: 436-44. PMID 26017442 DOI: 10.1038/nature14539  1
2015 Goodfellow IJ, Erhan D, Luc Carrier P, Courville A, Mirza M, Hamner B, Cukierski W, Tang Y, Thaler D, Lee DH, Zhou Y, Ramaiah C, Feng F, Li R, Wang X, ... ... Bengio Y, et al. Challenges in representation learning: a report on three machine learning contests. Neural Networks : the Official Journal of the International Neural Network Society. 64: 59-63. PMID 25613956 DOI: 10.1016/j.neunet.2014.09.005  1
2015 Bengio Y, Lee H. Editorial introduction to the Neural Networks special issue on Deep Learning of Representations. Neural Networks : the Official Journal of the International Neural Network Society. 64: 1-3. PMID 25595998 DOI: 10.1016/j.neunet.2014.12.006  1
2015 Sordoni A, Bengio Y, Vahabi H, Lioma C, Simonsen JG, Nie JY. A hierarchical recurrent encoder-decoder for generative context-aware query suggestion International Conference On Information and Knowledge Management, Proceedings. 19: 553-562. DOI: 10.1145/2806416.2806493  1
2015 Cho K, Courville A, Bengio Y. Describing Multimedia Content Using Attention-Based Encoder-Decoder Networks Ieee Transactions On Multimedia. 17: 1875-1886. DOI: 10.1109/TMM.2015.2477044  1
2015 Mesnil G, Dauphin Y, Yao K, Bengio Y, Deng L, Hakkani-Tur D, He X, Heck L, Tur G, Yu D, Zweig G. Using recurrent neural networks for slot filling in spoken language understanding Ieee Transactions On Audio, Speech and Language Processing. 23: 530-539. DOI: 10.1109/TASLP.2014.2383614  1
2015 Kahou SE, Bouthillier X, Lamblin P, Gulcehre C, Michalski V, Konda K, Jean S, Froumenty P, Dauphin Y, Boulanger-Lewandowski N, Chandias Ferrari R, Mirza M, Warde-Farley D, Courville A, Vincent P, ... ... Bengio Y, et al. EmoNets: Multimodal deep learning approaches for emotion recognition in video Journal On Multimodal User Interfaces. DOI: 10.1007/s12193-015-0195-2  1
2015 Mesnil G, Rifai S, Bordes A, Glorot X, Bengio Y, Vincent P. Unsupervised learning of semantics of object detections for scene categorization Advances in Intelligent Systems and Computing. 318: 209-224. DOI: 10.1007/978-3-319-12610-4_13  1
2015 Alain G, Bengio Y. What regularized auto-encoders learn from the data-generating distribution Journal of Machine Learning Research. 15: 3563-3593.  1
2015 Gouws S, Bengio Y, Corrado G. BilBOWA: Fast bilingual distributed representations without word alignments 32nd International Conference On Machine Learning, Icml 2015. 1: 748-756.  1
2015 Jean S, Cho K, Memisevic R, Bengio Y. On using very large target vocabulary for neural machine translation Acl-Ijcnlp 2015 - 53rd Annual Meeting of the Association For Computational Linguistics and the 7th International Joint Conference On Natural Language Processing of the Asian Federation of Natural Language Processing, Proceedings of the Conference. 1: 1-10.  1
2015 Dauphin YN, De Vries H, Bengio Y. Equilibrated adaptive learning rates for non-convex optimization Advances in Neural Information Processing Systems. 2015: 1504-1512.  1
2015 Courbariaux M, Bengio Y, David JP. Binaryconnect: Training deep neural networks with binary weights during propagations Advances in Neural Information Processing Systems. 2015: 3123-3131.  1
2015 Chorowski J, Bahdanau D, Serdyuk D, Cho K, Bengio Y. Attention-based models for speech recognition Advances in Neural Information Processing Systems. 2015: 577-585.  1
2015 Chung J, Gulcehre C, Cho K, Bengio Y. Gated feedback recurrent neural networks 32nd International Conference On Machine Learning, Icml 2015. 3: 2067-2075.  1
2015 De Brébisson A, Simon É, Auvolat A, Vincent P, Bengio Y. Artificial neural networks applied to taxi destination prediction Ceur Workshop Proceedings. 1526.  1
2015 Chung J, Kastner K, Dinh L, Goel K, Courville A, Bengio Y. A recurrent latent variable model for sequential data Advances in Neural Information Processing Systems. 2015: 2980-2988.  1
2015 Xu K, Ba JL, Kiros R, Cho K, Courville A, Salakhutdinov R, Zemel RS, Bengio Y. Show, attend and tell: Neural image caption generation with visual attention 32nd International Conference On Machine Learning, Icml 2015. 3: 2048-2057.  1
2014 Courville A, Desjardins G, Bergstra J, Bengio Y. The Spike-and-Slab RBM and Extensions to Discrete and Sparse Data Distributions. Ieee Transactions On Pattern Analysis and Machine Intelligence. 36: 1874-87. PMID 26352238 DOI: 10.1109/TPAMI.2013.238  1
2014 Rivest F, Kalaska JF, Bengio Y. Conditioning and time representation in long short-term memory networks. Biological Cybernetics. 108: 23-48. PMID 24258005 DOI: 10.1007/s00422-013-0575-1  1
2014 Bordes A, Glorot X, Weston J, Bengio Y. A semantic matching energy function for learning with multi-relational data: Application to word-sense disambiguation Machine Learning. 94: 233-259. DOI: 10.1007/s10994-013-5363-6  1
2014 Mesnil G, Bordes A, Weston J, Chechik G, Bengio Y. Learning semantic representations of objects and their parts Machine Learning. 94: 281-301. DOI: 10.1007/s10994-013-5336-9  1
2014 Gulcehre C, Cho K, Pascanu R, Bengio Y. Learned-norm pooling for deep feedforward and recurrent neural networks Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 8724: 530-546. DOI: 10.1007/978-3-662-44848-9_34  1
2014 Yao L, Ozair S, Cho K, Bengio Y. On the equivalence between deep NADE and generative stochastic networks Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 8726: 322-336. DOI: 10.1007/978-3-662-44845-8_21  1
2014 Bengio Y. Evolving culture versus local minima Studies in Computational Intelligence. 557: 109-138. DOI: 10.1007/978-3-642-55337-0_3  1
2014 Raiko T, Yao L, Cho K, Bengio Y. Iterative neural autoregressive distribution estimator (NADE-k) Advances in Neural Information Processing Systems. 1: 325-333.  1
2014 Yosinski J, Clune J, Bengio Y, Lipson H. How transferable are features in deep neural networks? Advances in Neural Information Processing Systems. 4: 3320-3328.  1
2014 Sordoni A, Bengio Y, Nie JY. Learning concept embeddings for query expansion by quantum entropy minimization Proceedings of the National Conference On Artificial Intelligence. 2: 1586-1592.  1
2014 Chen M, Weinberger K, Sha F, Bengio Y. Marginalized denoising auto-encoders for nonlinear representations 31st International Conference On Machine Learning, Icml 2014. 4: 3342-3350.  1
2014 Bengio Y, Thibodeau-Laufer É, Alain G, Yosinski J. Deep generative stochastic networks trainable by backprop 31st International Conference On Machine Learning, Icml 2014. 2: 1470-1485.  1
2014 Montúfar G, Pascanu R, Cho K, Bengio Y. On the number of linear regions of deep neural networks Advances in Neural Information Processing Systems. 4: 2924-2932.  1
2014 Dumoulin V, Goodfellow IJ, Courville A, Bengio Y. On the challenges of physical implementations of RBMs Proceedings of the National Conference On Artificial Intelligence. 2: 1199-1205.  1
2014 Dauphin YN, Pascanu R, Gulcehre C, Cho K, Ganguli S, Bengio Y. Identifying and attacking the saddle point problem in high-dimensional non-convex optimization Advances in Neural Information Processing Systems. 4: 2933-2941.  1
2014 Goodfellow IJ, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y. Generative adversarial nets Advances in Neural Information Processing Systems. 3: 2672-2680.  1
2013 Courville A, Desjardins G, Bergstra J, Bengio Y. The Spike-and-Slab RBM and Extensions to Discrete and Sparse Data Distributions. Ieee Transactions On Pattern Analysis and Machine Intelligence. PMID 24323880  1
2013 Goodfellow IJ, Courville A, Bengio Y. Scaling up spike-and-slab models for unsupervised feature learning. Ieee Transactions On Pattern Analysis and Machine Intelligence. 35: 1902-14. PMID 23787343 DOI: 10.1109/TPAMI.2012.273  1
2013 Bengio Y, Courville A, Vincent P. Representation learning: a review and new perspectives. Ieee Transactions On Pattern Analysis and Machine Intelligence. 35: 1798-828. PMID 23787338 DOI: 10.1109/TPAMI.2013.50  1
2013 Bengio Y, Courville A, Vincent P. Representation Learning: A Review and New Perspectives. Ieee Transactions On Pattern Analysis and Machine Intelligence. PMID 23459267  1
2013 Kahou SE, Pal C, Bouthillier X, Froumenty P, Gülçehre C, Memisevic R, Vincent P, Courville A, Bengio Y, Ferrari RC, Mirza M, Jean S, Carrier PL, Dauphin Y, Boulanger-Lewandowski N, et al. Combining modality specific deep neural networks for emotion recognition in video Icmi 2013 - Proceedings of the 2013 Acm International Conference On Multimodal Interaction. 543-550. DOI: 10.1145/2522848.2531745  1
2013 Sordoni A, Nie JY, Bengio Y. Modeling term dependencies with quantum language models for IR Sigir 2013 - Proceedings of the 36th International Acm Sigir Conference On Research and Development in Information Retrieval. 653-662. DOI: 10.1145/2484028.2484098  1
2013 Martinez HP, Bengio Y, Yannakakis G. Learning deep physiological models of affect Ieee Computational Intelligence Magazine. 8: 20-33. DOI: 10.1109/MCI.2013.2247823  1
2013 Bengio Y, Boulanger-Lewandowski N, Pascanu R. Advances in optimizing recurrent networks Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 8624-8628. DOI: 10.1109/ICASSP.2013.6639349  1
2013 Boulanger-Lewandowski N, Bengio Y, Vincent P. High-dimensional sequence transduction Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 3178-3182. DOI: 10.1109/ICASSP.2013.6638244  1
2013 Laufer E, Ferrari RC, Yao L, Delalleau O, Bengio Y. Stacked calibration of off-policy policy evaluation for video game matchmaking Ieee Conference On Computatonal Intelligence and Games, Cig. DOI: 10.1109/CIG.2013.6633642  1
2013 Goodfellow IJ, Erhan D, Carrier PL, Courville A, Mirza M, Hamner B, Cukierski W, Tang Y, Thaler D, Lee DH, Zhou Y, Ramaiah C, Feng F, Li R, Wang X, ... ... Bengio Y, et al. Challenges in representation learning: A report on three machine learning contests Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 8228: 117-124. DOI: 10.1007/978-3-642-42051-1_16  1
2013 Bengio Y. Deep learning of representations: Looking forward Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 7978: 1-37. DOI: 10.1007/978-3-642-39593-2_1  1
2013 Bengio Y, Courville A. Deep Learning of Representations Intelligent Systems Reference Library. 49: 1-28. DOI: 10.1007/978-3-642-36657-4_1  1
2013 Dauphin YN, Bengio Y. Stochastic ratio matching of RBMs for sparse high-dimensional inputs Advances in Neural Information Processing Systems 1
2013 Bengio Y, Yao L, Alain G, Vincent P. Generalized denoising auto-encoders as generative models Advances in Neural Information Processing Systems 1
2013 Pascanu R, Mikolov T, Bengio Y. On the difficulty of training recurrent neural networks 30th International Conference On Machine Learning, Icml 2013. 2347-2355.  1
2013 Bengio Y, Mesnil G, Dauphin Y, Rifai S. Better mixing via deep representations 30th International Conference On Machine Learning, Icml 2013. 552-560.  1
2013 Mesnil G, He X, Deng L, Bengio Y. Investigation of recurrent-neural-network architectures and learning methods for spoken language understanding Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 3771-3775.  1
2013 Luo H, Carrier PL, Courville A, Bengio Y. Texture modeling with convolutional spike-and-slab RBMs and deep extensions Journal of Machine Learning Research. 31: 415-423.  1
2013 Goodfellow IJ, Mirza M, Courville A, Bengio Y. Multi-prediction deep Boltzmann machines Advances in Neural Information Processing Systems 1
2013 Goodfellow IJ, Warde-Farley D, Mirza M, Courville A, Bengio Y. Maxout networks 30th International Conference On Machine Learning, Icml 2013. 2356-2364.  1
2013 Mesnil G, Rifai S, Bordes A, Glorot X, Bengio Y, Vincent P. Unsupervised and transfer learning under uncertainty: From object detections to scene categorization Icpram 2013 - Proceedings of the 2nd International Conference On Pattern Recognition Applications and Methods. 345-354.  1
2012 Bengio Y, Chapados N, Delalleau O, Larochelle H, Saint-Mleux X, Hudon C, Louradour J. Detonation classification from acoustic signature with the Restricted Boltzmann Machine Computational Intelligence. 28: 261-288. DOI: 10.1111/j.1467-8640.2012.00419.x  1
2012 Delalleau O, Contal E, Thibodeau-Laufer E, Ferrari RC, Bengio Y, Zhang F. Beyond skill rating: Advanced matchmaking in ghost recon online Ieee Transactions On Computational Intelligence and Ai in Games. 4: 167-177. DOI: 10.1109/TCIAIG.2012.2188833  1
2012 Bengio Y. Practical recommendations for gradient-based training of deep architectures Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 7700: 437-478. DOI: 10.1007/978-3-642-35289-8-26  1
2012 Rifai S, Bengio Y, Courville A, Vincent P, Mirza M. Disentangling factors of variation for facial expression recognition Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 7577: 808-822. DOI: 10.1007/978-3-642-33783-3_58  1
2012 Bergstra J, Bengio Y. Random search for hyper-parameter optimization Journal of Machine Learning Research. 13: 281-305.  1
2012 Boulanger-Lewandowski N, Bengio Y, Vincent P. Modeling temporal dependencies in high-dimensional sequences: Application to polyphonic music generation and transcription Proceedings of the 29th International Conference On Machine Learning, Icml 2012. 2: 1159-1166.  1
2012 Boulanger-Lewandowski N, Bengio Y, Vincent P. Discriminative non-negative matrix factorization for multiple pitch estimation Proceedings of the 13th International Society For Music Information Retrieval Conference, Ismir 2012. 205-210.  1
2012 Goodfellow IJ, Courville A, Bengio Y. Large-scale feature learning with spike-and-slab sparse coding Proceedings of the 29th International Conference On Machine Learning, Icml 2012. 2: 1439-1446.  1
2012 Hamel P, Bengio Y, Eck D. Building musically-relevant audio features through multiple timescale representations Proceedings of the 13th International Society For Music Information Retrieval Conference, Ismir 2012. 553-558.  1
2012 Rifai S, Bengio Y, Dauphin YN, Vincent P. A generative process for sampling contractive auto-encoders Proceedings of the 29th International Conference On Machine Learning, Icml 2012. 2: 1855-1862.  1
2012 Bordes A, Glorot X, Weston J, Bengio Y. Joint learning of words and meaning representations for open-text semantic parsing Journal of Machine Learning Research. 22: 127-135.  1
2012 Larochelle H, Mandel M, Pascanu R, Bengio Y. Learning algorithms for the classification restricted Boltzmann machine Journal of Machine Learning Research. 13: 643-669.  1
2011 Bergstra J, Bengio Y, Louradour J. Suitability of V1 energy models for object classification. Neural Computation. 23: 774-90. PMID 21162668 DOI: 10.1162/NECO_a_00084  1
2011 Breuleux O, Bengio Y, Vincent P. Quickly generating representative samples from an RBM-derived process Neural Computation. 23: 2058-2073. DOI: 10.1162/NECO_a_00158  1
2011 Mandel MI, Pascanu R, Eck D, Bengio Y, Aiello LM, Schifanella R, Menczer F. Contextual tag inference Acm Transactions On Multimedia Computing, Communications and Applications. 7. DOI: 10.1145/2037676.2037689  1
2011 Bengio Y, Delalleau O. On the expressive power of deep architectures Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 6925: 18-36. DOI: 10.1007/978-3-642-24412-4_3  1
2011 Rifai S, Mesnil G, Vincent P, Muller X, Bengio Y, Dauphin Y, Glorot X. Higher order contractive auto-encoder Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 6912: 645-660. DOI: 10.1007/978-3-642-23783-6_41  1
2011 Bengio Y. Discussion of "the neural autoregressive distribution estimator", Journal of Machine Learning Research. 15: 38-39.  1
2011 Delalleau O, Bengio Y. Shallow vs. deep sum-product networks Advances in Neural Information Processing Systems 24: 25th Annual Conference On Neural Information Processing Systems 2011, Nips 2011 1
2011 Bordes A, Weston J, Collobert R, Bengio Y. Learning structured embeddings of knowledge bases Proceedings of the National Conference On Artificial Intelligence. 1: 301-306.  1
2011 Dauphin YN, Glorot X, Bengio Y. Large-scale learning of embeddings with reconstruction sampling Proceedings of the 28th International Conference On Machine Learning, Icml 2011. 945-952.  1
2011 Desjardins G, Courville A, Bengio Y. On tracking the partition function Advances in Neural Information Processing Systems 24: 25th Annual Conference On Neural Information Processing Systems 2011, Nips 2011 1
2011 Glorot X, Bordes A, Bengio Y. Domain adaptation for large-scale sentiment classification: A deep learning approach Proceedings of the 28th International Conference On Machine Learning, Icml 2011. 513-520.  1
2011 Courville A, Bergstra J, Bengio Y. A spike and slab Restricted Boltzmann Machine Journal of Machine Learning Research. 15: 233-241.  1
2011 Courville A, Bergstra J, Bengio Y. Unsupervised models of images by spike-and-slab RBMs Proceedings of the 28th International Conference On Machine Learning, Icml 2011. 1145-1152.  1
2011 Glorot X, Bordes A, Bengio Y. Deep sparse rectifier neural networks Journal of Machine Learning Research. 15: 315-323.  1
2011 Bergstra J, Bardenet R, Bengio Y, Kégl B. Algorithms for hyper-parameter optimization Advances in Neural Information Processing Systems 24: 25th Annual Conference On Neural Information Processing Systems 2011, Nips 2011 1
2011 Hamel P, Lemieux S, Bengio Y, Eck D. Temporal pooling and multiscale learning for automatic annotation and ranking of music audio Proceedings of the 12th International Society For Music Information Retrieval Conference, Ismir 2011. 729-734.  1
2011 Rifai S, Dauphin YN, Vincent P, Bengio Y, Muller X. The manifold tangent classifier Advances in Neural Information Processing Systems 24: 25th Annual Conference On Neural Information Processing Systems 2011, Nips 2011 1
2011 Rifai S, Vincent P, Muller X, Glorot X, Bengio Y. Contractive auto-encoders: Explicit invariance during feature extraction Proceedings of the 28th International Conference On Machine Learning, Icml 2011. 833-840.  1
2011 Bengio Y, Bastien F, Bergeron A, Boulanger-Lewandowski N, Breuel T, Chherawala Y, Cisse M, Côté M, Erhan D, Eustache J, Glorot X, Muller X, Lebeuf SP, Pascanu R, Rifai S, et al. Deep learners benefit more from out-of-distribution examples Journal of Machine Learning Research. 15: 164-172.  1
2010 Larochelle H, Bengio Y, Turian J. Tractable multivariate binary density estimation and the restricted Boltzmann forest. Neural Computation. 22: 2285-307. PMID 20569177 DOI: 10.1162/NECO_a_00014  1
2010 Rivest F, Kalaska JF, Bengio Y. Alternative time representation in dopamine models. Journal of Computational Neuroscience. 28: 107-30. PMID 19847635 DOI: 10.1007/s10827-009-0191-1  1
2010 Le Roux N, Bengio Y. Deep belief networks are compact universal approximators Neural Computation. 22: 2192-2207. DOI: 10.1162/neco.2010.08-09-1081  1
2010 Bengio Y, Delalleau O, Simard C. Decision trees do not generalize to new variations Computational Intelligence. 26: 449-467. DOI: 10.1111/j.1467-8640.2010.00366.x  1
2010 Glorot X, Bengio Y. Understanding the difficulty of training deep feedforward neural networks Journal of Machine Learning Research. 9: 249-256.  1
2010 Turian J, Ratinov L, Bengio Y. Word representations: A simple and general method for semi-supervised learning Acl 2010 - 48th Annual Meeting of the Association For Computational Linguistics, Proceedings of the Conference. 384-394.  1
2010 Mandel MI, Eck D, Bengio Y. Learning tags that vary within a song Proceedings of the 11th International Society For Music Information Retrieval Conference, Ismir 2010. 399-404.  1
2010 Desjardins G, Courville A, Bengio Y, Vincent P, Delalleau O. Parallel tempering for training of restricted Boltzmann Machines Journal of Machine Learning Research. 9: 145-152.  1
2010 Erhan D, Courville A, Bengio Y, Vincent P. Why does unsupervised pre-training help deep learning? Journal of Machine Learning Research. 9: 201-208.  1
2010 Erhan D, Courville A, Bengio Y, Vincent P. Why does unsupervised pre-training help deep learning? Journal of Machine Learning Research. 9: 201-208.  1
2010 Vincent P, Larochelle H, Lajoie I, Bengio Y, Manzagol PA. Stacked denoising autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion Journal of Machine Learning Research. 11: 3371-3408.  1
2009 Carreau J, Bengio Y. A hybrid pareto mixture for conditional asymmetric fat-tailed distributions. Ieee Transactions On Neural Networks / a Publication of the Ieee Neural Networks Council. 20: 1087-101. PMID 19473936 DOI: 10.1109/TNN.2009.2016339  1
2009 Bengio Y, Delalleau O. Justifying and generalizing contrastive divergence. Neural Computation. 21: 1601-21. PMID 19018704 DOI: 10.1162/neco.2008.11-07-647  1
2009 Bengio Y. Learning deep architectures for AI Foundations and Trends in Machine Learning. 2: 1-27. DOI: 10.1561/2200000006  1
2009 Bengio Y, Louradour J, Collobert R, Weston J. Curriculum learning Acm International Conference Proceeding Series. 382. DOI: 10.1145/1553374.1553380  1
2009 Carreau J, Bengio Y. A hybrid Pareto model for asymmetric fat-tailed data: The univariate case Extremes. 12: 53-76. DOI: 10.1007/s10687-008-0068-0  1
2009 Bergstra J, Bengio Y. Slow, decorrelated features for pretraining complex cell-like networks Advances in Neural Information Processing Systems 22 - Proceedings of the 2009 Conference. 99-107.  1
2009 Chapados N, Bengio Y. Augmented functional time series representation and forecasting with Gaussian processes Advances in Neural Information Processing Systems 20 - Proceedings of the 2007 Conference 1
2009 Koller D, Schuurmans D, Bengio Y, Bottou L. Preface Advances in Neural Information Processing Systems 21 - Proceedings of the 2008 Conference 1
2009 Erhan D, Manzagol PA, Bengio Y, Bengio S, Vincent P. The difficulty of training deep architectures and the effect of unsupervised pre-training Journal of Machine Learning Research. 5: 153-160.  1
2009 Le Roux N, Manzagol PA, Bengio Y. Topmoumoute online natural gradient algorithm Advances in Neural Information Processing Systems 20 - Proceedings of the 2007 Conference 1
2009 Courville AC, Eck D, Bengio Y. An infinite factor model hierarchy via a noisy-or mechanism Advances in Neural Information Processing Systems 22 - Proceedings of the 2009 Conference. 405-413.  1
2009 Bengio Y, Shuurmans D, Lafferty J, Williams C, Culotta A. Preface Advances in Neural Information Processing Systems 22 - Proceedings of the 2009 Conference. xxi-xxiii.  1
2009 Larochelle H, Bengio Y, Louradour J, Lamblin P. Exploring strategies for training deep neural networks Journal of Machine Learning Research. 10: 1-40.  1
2009 Dugas C, Bengio Y, Bélisle F, Nadeau C, Garcia R. Incorporating functional knowledge in neural networks Journal of Machine Learning Research. 10: 1239-1262.  1
2009 Le Roux N, Bengio Y, Lamblin P, Joliveau M, Kégl B. Learning the 2-D topology of images Advances in Neural Information Processing Systems 20 - Proceedings of the 2007 Conference 1
2008 Bengio Y, Senécal JS. Adaptive importance sampling to accelerate training of a neural probabilistic language model Ieee Transactions On Neural Networks. 19: 713-722. PMID 18390314 DOI: 10.1109/TNN.2007.912312  1
2008 Le Roux N, Bengio Y. Representational power of restricted boltzmann machines and deep belief networks. Neural Computation. 20: 1631-49. PMID 18254699 DOI: 10.1162/neco.2008.04-07-510  1
2008 Larochelle H, Bengio Y. Classification using discriminative restricted boltzmann machines Proceedings of the 25th International Conference On Machine Learning. 536-543.  1
2008 Larochelle H, Erhan D, Bengio Y. Zero-data learning of new tasks Proceedings of the National Conference On Artificial Intelligence. 2: 646-651.  1
2008 Vincent P, Larochelle H, Bengio Y, Manzagol PA. Extracting and composing robust features with denoising autoencoders Proceedings of the 25th International Conference On Machine Learning. 1096-1103.  1
2007 Bengio Y. On the challenge of learning complex functions. Progress in Brain Research. 165: 521-34. PMID 17925268 DOI: 10.1016/S0079-6123(06)65033-4  1
2007 Larochelle H, Erhan D, Courville A, Bergstra J, Bengio Y. An empirical evaluation of deep architectures on problems with many factors of variation Acm International Conference Proceeding Series. 227: 473-480. DOI: 10.1145/1273496.1273556  1
2007 Carreau J, Bengio Y. A hybrid Pareto model for conditional density estimation of asymmetric fat-tail data Journal of Machine Learning Research. 2: 51-58.  1
2007 Chapados N, Bengio Y. Noisy K best-paths for approximate dynamic programming with application to portfolio optimization Journal of Computers. 2: 12-19.  1
2007 Le Roux N, Bengio Y. Continuous neural networks Journal of Machine Learning Research. 2: 404-411.  1
2007 Bengio Y, Lamblin P, Popovici D, Larochelle H. Greedy layer-wise training of deep networks Advances in Neural Information Processing Systems. 153-160.  1
2006 Bengio Y, Monperrus M, Larochelle H. Nonlocal estimation of manifold structure. Neural Computation. 18: 2509-28. PMID 16907635 DOI: 10.1162/neco.2006.18.10.2509  1
2006 Erhan D, L'heureux PJ, Yue SY, Bengio Y. Collaborative filtering on a family of biological targets. Journal of Chemical Information and Modeling. 46: 626-35. PMID 16562992 DOI: 10.1021/ci050367t  1
2006 Chapados N, Bengio Y. The K best-paths approach to approximate dynamic programming with application to portfolio optimization Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4013: 491-502. DOI: 10.1007/11766247_42  1
2006 Bengio Y, Schwenk H, Senécal JS, Morin F, Gauvain JL. Neural probabilistic language models Studies in Fuzziness and Soft Computing. 194: 137-186. DOI: 10.1007/10985687_6  1
2006 Bengio Y, Delalleau O, Le Roux N, Paiement JF, Vincent P, Ouimet M. Spectral dimensionality reduction Studies in Fuzziness and Soft Computing. 207: 519-550.  1
2005 Zaccaro MC, Lee HB, Pattarawarapan M, Xia Z, Caron A, L'Heureux PJ, Bengio Y, Burgess K, Saragovi HU. Selective small molecule peptidomimetic ligands of TrkC and TrkA receptors afford discrete or complete neurotrophic activities. Chemistry & Biology. 12: 1015-28. PMID 16183026 DOI: 10.1016/j.chembiol.2005.06.015  1
2005 Bengio Y, Grandvalet Y. Bias in estimating the variance of K-fold cross-validation Statistical Modeling and Analysis For Complex Data Problems. 75-95. DOI: 10.1007/0-387-24555-3_5  1
2005 Grandvalet Y, Bengio Y. Semi-supervised learning by entropy minimization Advances in Neural Information Processing Systems 1
2005 Morin F, Bengio Y. Hierarchical probabilistic neural network language model Aistats 2005 - Proceedings of the 10th International Workshop On Artificial Intelligence and Statistics. 246-252.  1
2005 Ouimet M, Bengio Y. Greedy spectral embedding Aistats 2005 - Proceedings of the 10th International Workshop On Artificial Intelligence and Statistics. 253-260.  1
2005 Bengio Y, Monperrus M. Non-local manifold tangent learning Advances in Neural Information Processing Systems 1
2005 Delalleau O, Bengio Y, Le Roux N. Efficient non-parametric function induction in semi-supervised learning Aistats 2005 - Proceedings of the 10th International Workshop On Artificial Intelligence and Statistics. 96-103.  1
2005 Bengio Y, Larochelle H, Vincent P. Non-local manifold parzen windows Advances in Neural Information Processing Systems. 115-122.  1
2005 Bengio Y, Delalleau O, Le Roux N. The curse of highly variable functions for local kernel machines Advances in Neural Information Processing Systems. 107-114.  1
2005 Rivest F, Bengio Y, Kalaska J. Brain inspired reinforcement learning Advances in Neural Information Processing Systems 1
2005 Bengio Y, Le Roux N, Vincent P, Delalleau O, Marcotte P. Convex neural networks Advances in Neural Information Processing Systems. 123-130.  1
2004 L'Heureux PJ, Carreau J, Bengio Y, Delalleau O, Yue SY. Locally linear embedding for dimensionality reduction in QSAR. Journal of Computer-Aided Molecular Design. 18: 475-82. PMID 15729847 DOI: 10.1007/s10822-004-5319-9  1
2004 Bengio Y, Delalleau O, Le Roux N, Paiement JF, Vincent P, Ouimet M. Learning eigenfunctions links spectral embedding and kernel PCA. Neural Computation. 16: 2197-219. PMID 15333211 DOI: 10.1162/0899766041732396  1
2004 Bengio Y, Grandvalet Y. No unbiased estimator of the variance of K-fold cross-validation Journal of Machine Learning Research. 5: 1089-1105.  1
2004 Bengio Y, Paiement JF, Vincent P, Delalleau O, Le Roux N, Ouimet M. Out-of-sample extensions for LLE, Isomap, MDS, Eigenmaps, and spectral clustering Advances in Neural Information Processing Systems 1
2003 Ghosn J, Bengio Y. Bias learning, knowledge sharing. Ieee Transactions On Neural Networks / a Publication of the Ieee Neural Networks Council. 14: 748-65. PMID 18238057 DOI: 10.1109/TNN.2003.810608  1
2003 Nadeau C, Bengio Y. Inference for the generalization error Machine Learning. 52: 239-281. DOI: 10.1023/A:1024068626366  1
2003 Vincent P, Bengio Y. Manifold parzen windows Advances in Neural Information Processing Systems 1
2003 Bengio Y, Chapados N. Extensions to metric-based model selection Journal of Machine Learning Research. 3: 1209-1227.  1
2002 Takeuchi I, Bengio Y, Kanamori T. Robust regression with asymmetric heavy-tail noise distributions. Neural Computation. 14: 2469-96. PMID 12396571 DOI: 10.1162/08997660260293300  1
2002 Collobert R, Bengio S, Bengio Y. A parallel mixture of SVMs for very large scale problems. Neural Computation. 14: 1105-14. PMID 11972909 DOI: 10.1162/089976602753633402  1
2002 Bengio Y, Chapados N. Metric-based model selection for time-series forecasting Neural Networks For Signal Processing - Proceedings of the Ieee Workshop. 2002: 13-22. DOI: 10.1109/NNSP.2002.1030013  1
2002 Vincent P, Bengio Y. Kernel matching pursuit Machine Learning. 48: 165-187. DOI: 10.1023/A:1013955821559  1
2002 Chapelle O, Vapnik V, Bengio Y. Model selection for small sample regression Machine Learning. 48: 9-23. DOI: 10.1023/A:1013943418833  1
2002 Bengio Y, Schuurmans D. Guest introduction: Special issue on new methods for model selection and model combination Machine Learning. 48: 5-7. DOI: 10.1023/A:1013921901994  1
2002 Collobert R, Bengio Y, Bengio S. Scaling large learning problems with hard parallel mixtures Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2388: 8-23. DOI: 10.1007/3-540-45665-1_2  1
2002 Vincent P, Bengio Y. K-local hyperplane and convex distance nearest neighbor algorithms Advances in Neural Information Processing Systems 1
2002 Chapados N, Bengio Y, Vincent P, Ghosn J, Dugas C, Takeuchi I, Meng L. Estimating car insurance premia: A case study in high-dimensional data inference Advances in Neural Information Processing Systems 1
2001 Bengio Y, Ducharme R, Vincent P. A neural probabilistic language model Advances in Neural Information Processing Systems. DOI: 10.1162/153244303322533223  1
2001 Chapados N, Bengio Y. Cost functions and model combination for VaR-based asset allocation using neural networks Ieee Transactions On Neural Networks. 12: 890-906. DOI: 10.1109/72.935098  1
2001 Bengio Y, Lauzon VP, Ducharme R. Experiments on the application of IOHMMs to model financial returns series Ieee Transactions On Neural Networks. 12: 113-123. DOI: 10.1109/72.896800  1
2001 Chapados N, Bengio Y. Input decay: Simple and effective soft variable selection Proceedings of the International Joint Conference On Neural Networks. 2: 1233-1237.  1
2001 Dugas C, Bengio Y, Bélisle F, Nadeau C, Garcia R. Incorporating second-order functional knowledge for better option pricing Advances in Neural Information Processing Systems 1
2000 Bengio Y. Gradient-based optimization of hyperparameters Neural Computation. 12: 1889-1900. PMID 10953243  1
2000 Schwenk H, Bengio Y. Boosting neural networks Neural Computation. 12: 1869-1887. PMID 10953242  1
2000 Bengio S, Bengio Y. Taking on the curse of dimensionality in joint distributions using neural networks Ieee Transactions On Neural Networks. 11: 550-557. DOI: 10.1109/72.846725  1
2000 Bengio Y, Bengio S. Modeling high-dimensional discrete data with multi-layer neural networks Advances in Neural Information Processing Systems. 400-406.  1
1999 Bengio S, Bengio Y, Robert J, Bélanger G. Stochastic learning of strategic equilibria for auctions Neural Computation. 11: 1199-1209.  1
1999 LeCun Y, Haffner P, Bottou L, Bengio Y. Object recognition with gradient-based learning Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1681: 319-345.  1
1998 Bonneville M, Meunier J, Bengio Y, Soucy JP. Support vector machines for improving the classification of brain PET images Proceedings of Spie - the International Society For Optical Engineering. 3338: 264-273. DOI: 10.1117/12.310900  1
1998 LeCun Y, Bottou L, Bengio Y, Haffner P. Gradient-based learning applied to document recognition Proceedings of the Ieee. 86: 2278-2323. DOI: 10.1109/5.726791  1
1998 Schwenk H, Bengio Y. Training methods for adaptive boosting of neural networks Advances in Neural Information Processing Systems. 647-650.  1
1998 Bengio Y, Berigio S, Isabelle JF, Singer Y. Shared context probabilistic transducers Advances in Neural Information Processing Systems. 409-415.  1
1998 Bengio Y, Gingras F, Goulard B, Lina JM, Scott K. Gaussian mixture densities for classification of nuclear power plant data Computers and Artificial Intelligence. 17: 189-209.  1
1998 Bottou L, Haffner P, Howard PG, Simard P, Bengio Y, LeCun Y. High quality document image compression with "DjVu" Journal of Electronic Imaging. 7: 410-425.  1
1997 Bengio Y. Using a financial training criterion rather than a prediction criterion International Journal of Neural Systems. 8: 433-443. PMID 9730019  1
1997 Schwenk H, Bengio Y. Adaboosting neural networks: Application to on-line character recognition Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1327: 967-972.  1
1997 Ghosn J, Bengio Y. Multi-task learning for stock selection Advances in Neural Information Processing Systems. 946-952.  1
1996 Bengio Y, Frasconi P. Input-output HMM's for sequence processing Ieee Transactions On Neural Networks. 7: 1231-1249. DOI: 10.1109/72.536317  1
1995 Bengio Y, LeCun Y, Nohl C, Burges C. LeRec: a NN/HMM hybrid for on-line handwriting recognition. Neural Computation. 7: 1289-303. PMID 7584903  1
1995 Bengio S, Bengio Y, Cloutier J. On the search for new learning rules for ANNs Neural Processing Letters. 2: 26-30. DOI: 10.1007/BF02279935  1
1994 Bengio Y, Simard P, Frasconi P. Learning Long-Term Dependencies with Gradient Descent is Difficult Ieee Transactions On Neural Networks. 5: 157-166. DOI: 10.1109/72.279181  1
1993 Bengio Y, Frasconi P, Simard P. The problem of learning long-term dependencies in recurrent networks Ieee International Conference On Neural Networks - Conference Proceedings. 1993: 1183-1188. DOI: 10.1109/ICNN.1993.298725  1
1992 Bengio Y, De Mori R, Flammia G, Kompe R. Global Optimization of a Neural Network-Hidden Markov Model Hybrid Ieee Transactions On Neural Networks. 3: 252-259. DOI: 10.1109/72.125866  1
1992 Bengio Y, De Mori R, Gori M. Learning the dynamic nature of speech with back-propagation for sequences Pattern Recognition Letters. 13: 375-385. DOI: 10.1016/0167-8655(92)90035-X  1
1992 Bengio Y, De Mori R, Flammia G, Kompe R. Phonetically motivated acoustic parameters for continuous speech recognition using artificial neural networks Speech Communication. 11: 261-271. DOI: 10.1016/0167-6393(92)90020-8  1
1990 Bengio Y, Pouliot Y. Efficient recognition of immunoglobulin domains from amino acid sequences using a neural network Bioinformatics. 6: 319-324. PMID 2257492 DOI: 10.1093/bioinformatics/6.4.319  1
1990 Cosi P, Bengio Y, De Mori R. Phonetically-based multi-layered neural networks for vowel classification Speech Communication. 9: 15-29. DOI: 10.1016/0167-6393(90)90041-7  1
1989 Bengio Y, Cardin R, De Mori R, Merlo E. Programmable execution of multi-layered networks for automatic speech recognition Communications of the Acm. 32: 195-199. DOI: 10.1145/63342.63345  1
Show low-probability matches.