ICML-2011 Workshop on Unsupervised and Transfer learning
Abstract
We organized a data mining challenge in \unsupervised and transfer learning" (the UTL challenge) followed by a workshop of the same name at the ICML 2011 conference in Bellevue, Washington. This introduction presents the highlights of the outstanding con- tributions that were made, which are regrouped in this issue of JMLR W&CP. Novel methodologies emerged to capitalize on large volumes of unlabeled data from tasks related (but di�erent) from a target task, including a method to learn data kernels (similarity measures) and new deep architectures for feature learning.
References (65)
- A. Acharya et al. Transfer learning with cluster ensembles. In ICML 2011 Unsupervised and Transfer Learning Workshop. JMLR W&CP, this volume, 2012.
- F. Aiolli. Transfer learning by kernel meta-learning. In ICML 2011 Unsupervised and Transfer Learning Workshop. JMLR W&CP, this volume, 2012.
- G. M. Allenby and P. E. Rossi. Marketing models of consumer heterogeneity. Journal of Econometrics, 89:57-78, 1999.
- G. M. Allenby and P. E. Rossi. Learning multiple tasks with kernel methods. Journal of Machine Learning Research, 6:615-637, 2005.
- A. Arnold, R. Nallapati, and W. W. Cohen. A comparative study of methods for transduc- tive transfer learning. In In ICDM Workshop on Mining and Management of Biological Data, 2007.
- N. Arora, G. M. Allenby, and J. Ginter. A hierarchical bayes model of primary and secondary demand. Marketing Science, 1998.
- B. Bakker and T. Heskes. Task clustering and gating for bayesian multi-task learning. Journal of Machine Learning Research, 4:83-99, 2003.
- P. Baldi. Autoencoders, unsupervised learning, and deep architectures. In ICML 2011 Unsupervised and Transfer Learning Workshop. JMLR W&CP, this volume, 2012.
- J. Baxter. Theoretical models of learning to learn. Learning to Learn, pages 71-94, 1997.
- S. Ben-David and R. Schuller. Exploiting task relatedness for multiple task learning. In Proceedings of Computational Learning Theory (COLT), pages 185-192, 2003.
- Y. Bengio. Learning deep architectures for AI. Foundations and Trends in Machine Learn- ing, 2(1):1-127, 2009. doi: 10.1561/2200000006. Also published as a book. Now Publish- ers, 2009.
- Y. Bengio. Deep learning of representations for unsupervised and transfer learning. In ICML 2011 Unsupervised and Transfer Learning Workshop. JMLR W&CP, this volume, 2012.
- Y. Bengio, J.-F. Paiement, P. Vincent, O. Delalleau, Nicolas Le Roux, and Marie Ouimet. Out-of-sample extensions for lle, isomap, mds, eigenmaps, and spectral clustering. In NIPS03, 2003.
- A. Blum and T. Mitchell. Combining labeled and unlabeled data with co-training. In COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory, pages 92-100. Morgan Kaufmann Publishers, 1998.
- L. Breiman and J. H. Friedman. Predicting multivariate responses in multiple linear regres- sion. Royal Statistical Society Series B, 1998.
- J. Bromley, I. Guyon, Y. LeCun, E. Säckinger, and R. Shah. Signature verification using a "siamese" time delay neural network. In In NIPS Proc, 1994.
- J. Buhmann et al. Information theoretic model selection for pattern analysis. In ICML 2011 Unsupervised and Transfer Learning Workshop. JMLR W&CP, this volume, 2012.
- R. Caruana. Multitask learning. Machine Learning, 28(1):41-75, 1997.
- S.-C. Chin and W. Nick Street. Divide and transfer: an exploration of segmented trans- fer to detect wikipedia vandalism. In ICML 2011 Unsupervised and Transfer Learning Workshop. JMLR W&CP, this volume, 2012.
- D. Erhan, Y. Bengio, A. Courville, P.-A. Manzagol, P. Vincent, and S. Bengio. Why does unsupervised pre-training help deep learning? JMLR, 11:625-660, 2010.
- Z. Ghahramani. Unsupervised Learning, volume 3176, pages 72-112. Springer-Verlag, Berlin, 2004.
- A. Globerson and N. Tishby. Sufficient dimensionality reduction. J. Mach. Learn. Res., 3:1307-1331, March 2003. ISSN 1532-4435. URL http://portal.acm.org/citation. cfm?id=944919.944975.
- W. Greene. Econometric Analysis, 5th Edition. Prentice Hall, 2002.
- S. M. Gutstein. Transfer Learning Techniques for Deep Neural Nets. PhD thesis, The University of Texas at El Paso, 2010.
- I. Guyon, G. Dror, V. Lemaire, D. Silver, G. Taylor, and D. W. Aha. Analysis of the ijcnn 2011 utl challenge. Neural Networks, In press 2011a.
- I. Guyon, G. Dror, V. Lemaire, G. Taylor, and D. W. Aha. Unsupervised and transfer learning challenge. In The 2011 International Join Conference on Neural Networks, pages 793-800, July 2011b.
- K. Hayashi et al. Self-measuring similarity for multi-task gaussian process. In ICML 2011 Unsupervised and Transfer Learning Workshop. JMLR W&CP, this volume, 2012.
- T. Heskes. Empirical bayes for learning to learn. In P. Langley, editor, Proceedings of the International Conference on Machine Learning (ICML'00), pages 367-374, 2000.
- G. E. Hinton and T. J. Sejnowski. Unsupervised learning: foundations of neural com- putation. Computational neuroscience. MIT Press, 1999. ISBN 9780262581684. URL http://books.google.ca/books?id=yj04Y0lje4cC.
- A. K. Jain, M. N. Murty, and P. J. Flynn. Data clustering: A review, 1999.
- T. Jebara. Multi-task feature and kernel selection for svms. In Proceedings of the Interna- tional Conference on Machine Learning (ICML 04), pages 185-192, 2004.
- B. Kulis. Icml tutorial on metric learning, 2010. URL http://www.cs.berkeley.edu/ ~kulis/icml2010_tutorial.htm.
- N. Lawrence. Probabilistic non-linear principal component analysis with gaussian process latent variable models. JMLR, 6:1783-1816, 2005.
- Y.-A. Le Borgne. Supervised dimensionality reduction in the unsupervised and transfer learning 2011 competition, 2011.
- G. Lee, L. Stoolman, and C. Scott. Transfer learning for auto-gating of flow cytometry data. In ICML 2011 Unsupervised and Transfer Learning Workshop. JMLR W&CP, this volume, 2012.
- H. Lee, A. Battle, R.t Raina, and A. Y. Ng. Efficient sparse coding algorithms. In B. Schölkopf, J. Platt, and T. Hoffman, editors, Advances in Neural Information Pro- cessing Systems 19, pages 801-808. MIT Press, Cambridge, MA, 2007.
- C. Liu, J. Xie, Hui Xiong, and Y. Ge. Stochastic unsupervised learning on unlabeled data. In ICML 2011 Unsupervised and Transfer Learning Workshop. JMLR W&CP, this volume, 2012.
- L. Pratt (Editor). Reuse of neural networks through transfer. Connection Science, 8(2), 1996.
- U. Luxburg. A tutorial on spectral clustering. Statistics and Computing, 17:395-416, December 2007. ISSN 0960-3174. doi: 10.1007/s11222-007-9033-z. URL http://portal. acm.org/citation.cfm?id=1288822.1288832.
- G. Mesnil et al. Unsupervised and transfer learning challenge: a deep learning approach. In ICML 2011 Unsupervised and Transfer Learning Workshop. JMLR W&CP, this volume, 2012.
- T. M. Mitchell. The need for biases in learning generalizations. Readings in Machine Learning, pages 184-191, 1980. ed. Jude W. Shavlik and Thomas G. Dietterich.
- A. Niculescu-Mizil and R. Caruana. Inductive transfer for bayesian network structure learn- ing. Journal of Machine Learning Research -Proceedings Track, 2:339-346, 2007.
- A. Niculescu-Mizil and R. Caruana. Inductive transfer for bayesian network structure learn- ing. In ICML 2011 Unsupervised and Transfer Learning Workshop. JMLR W&CP, this volume, 2012.
- V. Nikulin and T.-H. Huang. Unsupervised dimensionality reduction via gradient-based matrix factorization with two adaptive learning rates. In ICML 2011 Unsupervised and Transfer Learning Workshop. JMLR W&CP, this volume, 2012. NIPS1995 Workshop. Learning to learn. http://plato.acadiau.ca/courses/comp/dsilver/ NIPS95 LTL/transfer.workshop.1995.html, 1995.
- NIPS2005 Workshop. Inductive transfer -10 years later. http://iitrl.acadiau.ca/itws05/, 2005.
- S. J. Pan and Q. Yang. A survey on transfer learning. IEEE Transactions on Knoweledge and Data Engineering, 22(10):1345-1359, October 2010.
- L. Pratt and S. Thrun (Editors). Transfer in inductive systems. Machine Learning, 28(1), 1997.
- R. Raina, A. Battle, H. Lee, B. Packer, and A. Y. Ng. Self-taught learning: Transfer learning from unlabeled data. In Proceedings of the Twenty-fourth International Conference on Machine Learning, 2007a.
- R. Raina, A. Battle, H. Lee, B. Packer, and A. Y. Ng. Self-taught learning: transfer learning from unlabeled data. In Proceedings of the 24th international conference on Machine learning, ICML '07, pages 759-766, New York, NY, USA, 2007b. ACM. ISBN 978-1-59593-793-3. doi: 10.1145/1273496.1273592. URL http://doi.acm.org/10.1145/ 1273496.1273592.
- J. Ramon, K. Driessens, and T. Croonenborghs. Transfer learning in reinforcement learning problems through partial policy recycling. Proc. 18th European Conf. Machine Learning (ECML 2007), pages 699-707, 2007.
- M. Saeed. Use of representations in high dimensional spaces for unsupervised and transfer learning challenge, 2011.
- R. Salakhutdinov, J. Tenenbaum, and A. Torralba. One-shot learning with a hierarchi- cal nonparametric bayesian model. In ICML 2011 Unsupervised and Transfer Learning Workshop. JMLR W&CP, this volume, 2012.
- D. L. Silver and K. P. Bennett. Special issue on inductive transfer. Machine Learning, 73, 2008.
- D. L. Silver and R. E. Mercer. The parallel transfer of task knowledge using dynamic learning rates based on a measure of relatedness. Connection Science Special Issue: Transfer in Inductive Systems, 8(2):277-294, 1996.
- D. L. Silver, R. Poirier, and D. Currie. Inductive transfer with context-sensitive neural networks. Machine Learning, 73:323-336, 2008.
- A. J. Smola, S. Mika, B. Schölkopf, and R. C. Williamson. Regularized principal manifolds. JMLR, 1:179-209, 2001.
- M.E. Taylor and P. Stone. Cross-domain transfer for reinforcement learning. Proc. 24th International Conf. Machine Learning (ICML 2007), pages 879-886, 2007.
- S. Thrun and L. Y. Pratt (Editors). Learning To Learn. Kluwer Academc Publisher, Boston, MA, 1997.
- P. E. Utgoff. Machine Learning of Inductive Bias. Kluwer Academc Publisher, Boston, MA, 1986.
- U. von Luxburg, R. C. Williamson, and I. Guyon. Clustering: Science or art? In ICML 2011 Unsupervised and Transfer Learning Workshop. JMLR W&CP, this volume, 2012.
- C. Widmer and G. Rätsch. Multitask learning in computational biology. In ICML 2011 Unsupervised and Transfer Learning Workshop. JMLR W&CP, this volume, 2012.
- A. Wilson, A. Fern, and P. Tadepalli. Transfer learning in sequential decision problems: A hierarchical bayesian approach. In ICML 2011 Unsupervised and Transfer Learning Workshop. JMLR W&CP, this volume, 2012.
- X.-M. WU, A. Man-Cho So, Z. Li, and S.-Y. R. Li. Fast graph laplacian regularized kernel learning via semidefinite quadratic linear programming. In Y. Bengio, D. Schuurmans, J. Lafferty, C. K. I. Williams, and A. Culotta, editors, Advances in Neural Information Processing Systems 22, pages 1964-1972. Curran Associates, Inc., 2009.
- Z. Xu et al. Rapid feature learning with stacked linear denoisers, 2011.
- A. Zellner. An efficient method for estimating seemingly unrelated regression equations and tests for aggregation bias. Journal of the American Statistical Association, 57:348-368, 1962.