tag:blogger.com,1999:blog-6535481649727720492.post969398043980307998..comments2024-03-27T21:43:02.451-04:00Comments on NEW SAVANNA: Topic Analysis: Goldstone & Underwood and the "economy" of word distribution across topicsBill Benzonhttp://www.blogger.com/profile/08360044945265178991noreply@blogger.comBlogger4125tag:blogger.com,1999:blog-6535481649727720492.post-39945995768402906632015-09-26T05:53:41.514-04:002015-09-26T05:53:41.514-04:00If you take a look at this little chronology I put...If you take a look at <a href="http://new-savanna.blogspot.com/2011/03/for-historical-record-cog-sci-and-lit.html" rel="nofollow">this little chronology</a> I put together several years ago you’ll see that I like to use 1957 as a reference point. That’s when Frye published his <i>Anatomy</i> and when Chomsky published <i>Syntactic Structures</i>. The year before the term “artificial intelligence”Bill Benzonhttps://www.blogger.com/profile/08360044945265178991noreply@blogger.comtag:blogger.com,1999:blog-6535481649727720492.post-80821055914238961492015-09-25T16:20:07.373-04:002015-09-25T16:20:07.373-04:00Interesting about the 1950s as watershed moment.
...Interesting about the 1950s as watershed moment.<br /><br />It's often true that large topics have words at the top of the list in common -- driven partly by the labeling strategy that chooses most-common-words to label the topic. Doesn't necessarily mean they're similar when you dig down more deeply. And I think you're probably right that there's some kind of Ted Underwoodhttps://www.blogger.com/profile/04012428899328561750noreply@blogger.comtag:blogger.com,1999:blog-6535481649727720492.post-60233376047984489282015-09-25T07:00:44.372-04:002015-09-25T07:00:44.372-04:00On stopword strategy, you might take every word th...On stopword strategy, you might take every word that’s in two or more megatopic, put it on the stoplist; and then rerun the analysis. The expectation being that those topics would break apart into smaller more interpretable topics or even that their tokens would be ‘absorbed’ into other topics. Is that the idea?<br /><br />I’ve been wondering if we might be looking at different overall Bill Benzonhttps://www.blogger.com/profile/08360044945265178991noreply@blogger.comtag:blogger.com,1999:blog-6535481649727720492.post-31170905316915791502015-09-24T15:58:52.356-04:002015-09-24T15:58:52.356-04:00I looked up what the big four topics got labeled u...I looked up what the big four topics got labeled using my "correlation" strategy -- take the top 50 words in a topic and sort them by their correlation with the aggregate topic frequency.<br /><br />14: Instead of "other like even great" I got "great far found made" (older)<br />52 Instead of "other like even much" I got "way kind something like" Ted Underwoodhttps://www.blogger.com/profile/04012428899328561750noreply@blogger.com