Spacy: Handling of Hashtags and DollarTags

David Doherty
4 min readMar 1, 2021

There are many NLP models floating around, and my favourite framework for dealing with NLP models is Spacy. The out of the box models are great for general well-written English which is great; but poses a problem when we start dealing with Reddit / Twitter :D

Firstly we have hashtags, and company symbols (e.g. $GME). Many frameworks do not have native handling of these. Generally the lowest…

--

--

David Doherty

I write about Fintech, it's past & future, leveraging 20+ years of experience in leadership roles at large Fintechs