In the world of nonprofit data management, we work really hard to smart code all of our interactions with our audience so that we can successfully report on them. However, we work in a relationship industry and that often requires detailed explanation, which, translated to data speak, is free text. And there is our conundrum.
Processing free text is the domain of artificial intelligence, a discipline that we nonprofit data scientists are learning now. The new processing program, Python, even has a package called Beautiful Soup which parses websites, the text within them, and the HTML tags marking any variety of content. The R program also has a package called SentimentAnalysis which assigns a sentiment to different words by using the package’s dictionary, called a lexicon.
But we can do some of this analysis without an artificial intelligence program. Here are some steps to work your way into trying out text parsing, starting with the easy stuff and working toward the sophisticated stuff.