A connectionist implementation of identical elements
Abstract
In training two networks on tasks that are different on the sur-face, but similar, or even isomorphic, at a higher level of de-scription, similarities between the network solutions are plau-sible if not expected. Such similarities tend to become evi-dent when two networks with shared weights are trained on similar tasks. After training, the shared weights were used as part of a third network that was trained on a third task similar to the first two. This "head start" results in significantly shorter training times than a network that starts with random weights. Shared hidden unit response profiles were analyzed across networks trained on structurally analogous tasks to re-veal parallel, but nonidentical features.
References (14)
- Caruana, R. (1997) Multitask learning. Machine Learning, 28:41-75.
- Cleeremans, A., D. Servan-Schreiber, and J.L. McClelland. (1989) Finite state automata and simple recurrent net- works. Neural Computation, 1:372--381.
- Damasio, A. (1989) Time-locked multiregional retroactiva- tion: A systems level proposal for the neural substrates of recall and recognition. Cognition 33:25-62.
- Dienes, Z., Altman G. T. M., and Gao, S.-J. (1999) Mapping across domains without feedback: A neural network model of transfer of implicit knowledge, Cognitive Sci- ence 12:53-82.
- Elman, J. L. (1990). Finding structure in time. Cognitive Science, 14:179--211.
- Gentner, D., (1983) Structure-mapping: A theoretical framework for analogy, Cognitive Science 7:155-170.
- Halford, G., Wilson, W., Guo, J., Gayler, R., Wiles, J., Stewart, J. (1994). Connectionist implications for process- ing capacity limitations in analogies. In: K. Holyoak & J. Barnden (eds.) Advances in connectionist and neural computation theory, vol. 2, Analogical Connections, pp. 363--415. Norwood, NJ: Ablex.
- Holyoak, K. & Thagard, P. (1989) Analogical mapping by constrant satisfaction. Cognitive Science 13, 295-355.
- Hubel, D. H., & Wiesel, T. (1962). Receptive fields, binocu- lar interaction, and functional architecture in the cat's vis- ual cortex. J. Physiol. 160, 106-154.
- Hummel, J. & Holyoak, K. (1997) Distributed representa- tions of structure: A theory of analogical access and map- ping. Psychological Review, 104, 427-466.
- Mitchell, M. (1993) Analogy-making as Perception: A com- puter model. Cambridge, MA: MIT Press.
- Munro, P. (1996) Shared network resources and shared task properties. In: Proceedings of the Eighteenth Annual Conference of the Cognitive Science Society. Mahwah NJ: Erlbaum
- Pratt, L. Y., Mostow, J., and Kamm, C. A. (1991) Direct transfer of learned information among neural networks. In: Proceedings of the Ninth National Conference on Arti- ficial Intelligence (AAAI-91) Anaheim CA
- Thorndike, E. L.., & Woodworth, R. S. (1901). The influ- ence of improvement in one mental function upon the ef- ficiency of other functions. Psychological Review, 8, 247- 261. a