Academia.eduAcademia.edu

Outline

Deep Spatial Learning with Molecular Vibration

2020

Abstract

Machine learning over-fitting caused by data scarcity greatly limits the application of machine learning for molecules. Due to manufacturing processes difference, big data is not always rendered available through computational chemistry methods for some tasks, causing data scarcity problem for machine learning algorithms. Here we propose to extract the natural features of molecular structures and rationally distort them to augment the data availability. This method allows a machine learning project to leverage the powerful fit of physics-informed augmentation for providing significant boost to predictive accuracy. Successfully verified by the prediction of rejection rate and flux of thin film polyamide nanofiltration membranes, with the relative error dropping from 16.34% to 6.71% and the coefficient of determination rising from 0.16 to 0.75, the proposed deep spatial learning with molecular vibration is widely instructive for molecular science. Experimental comparison unequivocally...

References (13)

  1. Brenden M Lake, Ruslan Salakhutdinov, and Joshua B Tenenbaum. Human-level concept learning through probabilistic program induction. Science, 350(6266):1332-1338, 2015.
  2. Keith T Butler, Daniel W Davies, Hugh Cartwright, Olexandr Isayev, and Aron Walsh. Machine learning for molecular and materials science. Nature, 559(7715):547-555, 2018.
  3. Chunpeng Wang, Ullrich Steiner, and Alessandro Sepe. Synchrotron big data science. Small, 14(46):1802291, 2018.
  4. Sheng Ye, Wei Hu, Xin Li, Jinxiao Zhang, Kai Zhong, Guozhen Zhang, Yi Luo, Shaul Mukamel, and Jun Jiang. A neural network protocol for electronic excitations of n-methylacetamide. Proceedings of the National Academy of Sciences, 116(24):11612-11617, 2019.
  5. Andrew W Young, Deborah Hellawell, and Dennis C Hay. Configurational information in face perception. Perception, 42(11):1166-1178, 2013.
  6. Lunyang Liu, Wenduo Chen, and Yunqi Li. A statistical study of proton conduction in nafion®-based composite membranes: Prediction, filler selection and fabrication methods. Journal of Membrane Science, 549:393-402, 2018.
  7. Matthew K Nielsen, Derek T Ahneman, Orestes Riera, and Abigail G Doyle. Deoxyfluorination with sulfonyl fluorides: navigating reaction space with machine learning. Journal of the American Chemical Society, 140 (15):5004-5008, 2018.
  8. Sean Ekins, Ana C Puhl, Kimberley M Zorn, Thomas R Lane, Daniel P Russo, Jennifer J Klein, Anthony J Hickey, and Alex M Clark. Exploiting machine learning for end-to-end drug discovery and development. Nature materials, 18(5):435, 2019.
  9. Yao-Shen Guo, Yan-Li Ji, Bin Wu, Nai-Xin Wang, Ming-Jie Yin, Quan-Fu An, and Cong-Jie Gao. High-flux zwit- terionic nanofiltration membrane constructed by in-situ introduction method for monovalent salt/antibiotics separation. Journal of Membrane Science, 593:117441, 2020.
  10. Hui-Fang Xiao, Chang-Hui Chu, Wang-Ting Xu, Bo-Zhi Chen, Xiao-Hui Ju, Weihong Xing, and Shi-Peng Sun. Amphibian-inspired amino acid ionic liquid functionalized nanofiltration membranes with high water permeability and ion selectivity for pigment wastewater treatment. Journal of Membrane Science, 586:44-52, 2019.
  11. Fangmeng Sheng, Linxiao Hou, Xiuxia Wang, Muhammad Irfan, Muhammad A Shehzad, Bin Wu, Xuemei Ren, Liang Ge, and Tongwen Xu. Electro-nanofiltration membranes with positively charged polyamide layer for cations separation. Journal of Membrane Science, 594:117453, 2020.
  12. Jing Wang, Si Zhang, Pengfei Wu, Wenxiong Shi, Zhi Wang, and Yunxia Hu. In situ surface modification of thin-film composite polyamide membrane with zwitterions for enhanced chlorine resistance and transport properties. ACS applied materials & interfaces, 11(12):12043-12052, 2019.
  13. MJ Frisch, GW Trucks, HB Schlegel, GE Scuseria, MA Robb, JR Cheeseman, G Scalmani, V Barone, GA Pe- tersson, H Nakatsuji, et al. Gaussian 16 revision a. 03. 2016; gaussian inc. Wallingford CT, 2(4), 2016.