Parkinson's disease (PD) is a neurodegenerative disease with various clinical manifestations caused by multiple risk factors. However, the effect of different factors and relationships between different features related to PD and the extent of those factors leading to the incidence of PD remains unclear. we employed Bayesian network to construct a prediction model. The prediction system was trained on the data of 35 patients and 26 controls. The structure learning and parameter learning of Bayesian Network was completed through the tree-augmented network (TAN) and Netica software, respectively. We employed four Bayesian Networks in terms of the syllable, including monosyllables, disyllables, multisyllables and unsegmented syllables. The area under the curve (AUC) of monosyllabic, disyllabic, multisyllabic, and unsegmented-syllable models were 0.95, 0.83, 0.80 and 0.84, respectively. In the monosyllabic tests, the best predictor of PD was duration, the posterior probability of which was 92.70%. Meanwhile, minimum f0 (61.60%) predicted best in the disyllabic tests and the variables that predicted best in multisyllables and unsegmented syllables were end f0 (59.40%) and maximum f0 (58.40%). In the cross-sectional comparison, the prediction effect of each variable in the monosyllabic tests was generally higher than that of other test groups. The monosyllabic models had the highest predicted performance of PD. Among acoustic parameters, duration was the strongest feature in predicting the prevalence of PD in monosyllabic tests. We believe that this network methodology will be a useful tool for the clinical prediction of Parkinson's disease.Supplementary InformationThe online version contains supplementary material available at 10.1007/s11571-024-10194-x.