methods for categorizing textual data 9790107