Copyright (c) 2014 AJC
This work is licensed under a Creative Commons Attribution 4.0 International License.
Novel Framework for Improving the Desired Structure Prediction on Imbalance Data Set
Corresponding Author(s) : Yinghua Lu
Asian Journal of Chemistry,
Vol. 26 No. 17 (2014): Vol 26 Issue 17
Abstract
Desired target prediction has important significance in rational synthesis of materials. In this paper, taken (6,12)-ring-containing structure as the desired target, a novel framework is proposed for improving the prediction performance on the microporous aluminophosphates (AlPOs) database. In contrast to existing data processing techniques for class imbalance problem, the proposed framework first explores the intra-class distribution of majority class by clustering such that multiple specific models can be established according to the distribution. The main advantage is that one model can characterize the clustered data well. Then, Q times n-fold cross-validation procedure is applied to assess the prediction performance. Finally, we compare the proposed framework with two existing data processing procedures. The comparison results demonstrate that the desired target predictions can achieve improved performance remarkably.
Keywords
Download Citation
Endnote/Zotero/Mendeley (RIS)BibTeX
- H. Lee, S.I. Zones and M.E. Davis, Nature, 425, 385 (2003); doi:10.1038/nature01980.
- X.D. Liu, Y.H. Xu, J.H. Yu, Y. Li, W. Zeng, C. Chen, J.Y. Li, W.Q. Pang, R.R. Xu and Y. Xu, Chem. J. Chinese Univ., 24, 949 (2003).
- L.A. Baumes, M. Moliner and A. Corma, QSAR Comb. Sci., 26, 255 (2007); doi:10.1002/qsar.200620064.
- J.Y. Li, M. Qi, J. Kong, J.Z. Wang, Y. Yan, W.F. Huo, J.H. Yu, R.R. Xu and Y. Xu, Micropor. Mesopor. Mater., 129, 251 (2010); doi:10.1016/j.micromeso.2009.10.001.
- M. Qi, Y.H. Lu, J.Z. Wang and J. Kong, Mol. Inform., 29, 203 (2010); doi:10.1002/minf.200900052.
- W.F. Huo, N. Gao, Y. Yan, J.Y. Li, J.H. Yu and R.R. Xu, Acta Phys. Chim. Sin., 27, 2111 (2011).
- J.S. Li, Y.H. Lu, J. Kong, N. Gao, J.H. Yu, R.R. Xu, J.Z. Wang, M. Qi and J.Y. Li, Micropor. Mesopor. Mater., 173, 197 (2013); doi:10.1016/j.micromeso.2012.03.007.
- M. Qi, J.S. Li, J.Z. Wang, Y.H. Lu and J. Kong, Ind. Eng. Chem. Res., 51, 16734 (2012); doi:10.1021/ie3019774.
- http://zeobank.jlu.edu.cn.
- B.J. Frey and D. Dueck, Science, 315, 972 (2007); doi:10.1126/science.1136800.
- D.V. Nguyen and D.M. Rocke, Bioinformatics, 18, 39 (2002); doi:10.1093/bioinformatics/18.1.39.
- J. Manuel Serra, L. Allen Baumes, M. Moliner, P. Serna and A. Corma, Comb. Chem. High Throughput Screen., 10, 13 (2007); doi:10.2174/138620707779802779.
References
H. Lee, S.I. Zones and M.E. Davis, Nature, 425, 385 (2003); doi:10.1038/nature01980.
X.D. Liu, Y.H. Xu, J.H. Yu, Y. Li, W. Zeng, C. Chen, J.Y. Li, W.Q. Pang, R.R. Xu and Y. Xu, Chem. J. Chinese Univ., 24, 949 (2003).
L.A. Baumes, M. Moliner and A. Corma, QSAR Comb. Sci., 26, 255 (2007); doi:10.1002/qsar.200620064.
J.Y. Li, M. Qi, J. Kong, J.Z. Wang, Y. Yan, W.F. Huo, J.H. Yu, R.R. Xu and Y. Xu, Micropor. Mesopor. Mater., 129, 251 (2010); doi:10.1016/j.micromeso.2009.10.001.
M. Qi, Y.H. Lu, J.Z. Wang and J. Kong, Mol. Inform., 29, 203 (2010); doi:10.1002/minf.200900052.
W.F. Huo, N. Gao, Y. Yan, J.Y. Li, J.H. Yu and R.R. Xu, Acta Phys. Chim. Sin., 27, 2111 (2011).
J.S. Li, Y.H. Lu, J. Kong, N. Gao, J.H. Yu, R.R. Xu, J.Z. Wang, M. Qi and J.Y. Li, Micropor. Mesopor. Mater., 173, 197 (2013); doi:10.1016/j.micromeso.2012.03.007.
M. Qi, J.S. Li, J.Z. Wang, Y.H. Lu and J. Kong, Ind. Eng. Chem. Res., 51, 16734 (2012); doi:10.1021/ie3019774.
B.J. Frey and D. Dueck, Science, 315, 972 (2007); doi:10.1126/science.1136800.
D.V. Nguyen and D.M. Rocke, Bioinformatics, 18, 39 (2002); doi:10.1093/bioinformatics/18.1.39.
J. Manuel Serra, L. Allen Baumes, M. Moliner, P. Serna and A. Corma, Comb. Chem. High Throughput Screen., 10, 13 (2007); doi:10.2174/138620707779802779.