I want to use split() to divide the English article into sentences, but the "." in the abbreviation (such as U.S) in the article will affect the segmentation of the sentence.
Is there any way to avoid the interference of symbols in abbreviations?I tried to use the nltk package, but it didn't work.Is there any way to make changes to the code on top of split()?