Text this: Identifying academic phrases in self-compiled corpora: A case study from mathematical sciences