
Chinese Agricultural Science Bulletin, 2015, Vol. 31, Issue (6): 241-246. doi: 10.11924/j.issn.1000-6850.casb14110089

Special Issue: Tobacco Cultivation and Production


Study on the Classification of Flue-cured Tobacco Based on the Random Forest Algorithm

Guo Dongfeng (1), Hu Haizhou (2), Wang Jitao (1), Yao Zhongda (1), Yang Hui (3), Xu Wei (3), Liu Xinmin (2)

  (1) Technology Center of Anhui Cigarette Industrial Co. Ltd., Hefei 230088; (2) Tobacco Research Institute of Chinese Academy of Agricultural Sciences, Qingdao, Shandong 266101; (3) China Tobacco Guizhou Industrial Co., Ltd., Guiyang 550001
  • Received: 2014-11-16  Revised: 2015-02-10  Accepted: 2015-01-09  Online: 2015-03-20  Published: 2015-03-20

Abstract: To identify the key factors affecting tobacco flavor classification, flue-cured tobacco leaves of 3 types from 6 tobacco-growing areas in China were classified using the random forest algorithm. The overall correct classification rate was 82.35% of all samples. The correct rate for the QING type reached 100%, but the predictions for the other types did not fully match the actual types. The random forest algorithm also reports the importance of each variable. In this study the key factors were as follows: benzyl alcohol, β-ionone, 2-cyclopentene-1,4-dione, 2-acetyl-5-methylfuran, methyl palmitate, 5-(hydroxymethyl)furfural, 3-hydroxy-β-damascone, β-dihydro damascene, furfural, phenylethanol, dihydroactinidiolide, etc. The random forest algorithm can therefore be applied to tobacco flavor classification: it classified the overall set of flue-cured tobacco samples well, identified the key classification factors, and merits further exploration in tobacco research.
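
The abstract reports both an overall correct rate and a per-type correct rate. As a minimal sketch of how such figures are typically computed from true and predicted class labels, the Python below may help. The function name `correct_rates`, the class labels "A", "B", "C", and all label values are hypothetical illustrations. They are not the study's data or the authors' code.

```python
from collections import Counter


def correct_rates(true_labels, predicted_labels):
    """Return (overall_rate, per_class_rate), each as a fraction in [0, 1]."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label lists must have the same length")
    if not true_labels:
        raise ValueError("at least one sample is required")
    # Number of samples that truly belong to each class.
    totals = Counter(true_labels)
    # Number of samples per class whose prediction matches the true class.
    hits = Counter(t for t, p in zip(true_labels, predicted_labels) if t == p)
    overall = sum(hits.values()) / len(true_labels)
    per_class = {cls: hits[cls] / n for cls, n in totals.items()}
    return overall, per_class


# Hypothetical labels for three made-up classes (NOT the study's samples).
true_labels = ["A"] * 4 + ["B"] * 3 + ["C"] * 3
predicted = ["A", "A", "A", "A", "B", "B", "C", "C", "C", "B"]

overall, per_class = correct_rates(true_labels, predicted)
print(f"overall correct rate: {overall:.2%}")
for cls in sorted(per_class):
    print(f"class {cls}: {per_class[cls]:.2%}")
```

In this made-up example one class is classified perfectly while the others are not. That is the same pattern the abstract describes, where a high overall rate can hide weaker per-type results.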