Dear All,
My apologies for what is likely a very basic question, but I am doing my first text analysis and am looking for some guidance.
I am working with a colleague in our romance language department. She is analyzing speeches of a politician over time to see how they have changed. She has identified a two key words and we are trying to assess the concordance of the words in proximity to each other, and how that has shifted over time. We are comparing the likelihood of the key word #1 occurring within a certain # of words when key word #2 is observed vs. the likelihood of key word #1 occurring within a certain # of words when key word #2 is not observed. Is this a common method of analyzing concordance of words. If so, can someone share a citation with me. Also, is there a recommended method for choosing the "within a certain # of words" limit? And is there some obvious method of analyzing this that we are omitting?
I appreciate any feedback you have on this.
Sincerely,
Michael
------------------------------
Michael Posner
Associate Professor of Statistics
Director, Center for Statistics Education
Villanova University
------------------------------