-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Develop 8. Next Steps 1: comparing corpora #12
Comments
Based on #34 (comment) move to 'Dev Tasks'. |
Based on #34 (comment) scope here to:
|
rough outlineintroThis episode introduces potential next steps for comparing corpora. In the context of catalogue data, this is important because: provides an alternative point of analysis for recognising the features of the catalogue data under analysis; can be used to compare sub-sets of catalogue data, e.g. use an exemplar subset to understand what linguistic features of the comparative sub-set need adjusting/repairing; allows comparison of catalogue data to everyday speech, in order to tease out - in an evidential way - the special language that should be used in guides to cataloguing at your institution (because, you'll - probably - have a style you want based on some exemplar cataloguing) main bodyThree parts:
|
Potentially, use comparing with Photo db subjects as a way of thinking about comparing between parts of the catalogue entry (so, 'description' is not in isolation) |
@rossi-uk Made a big update today! Are you able to work on the four remaining points at the top of the ticket? #12 (comment) |
@drjwbaker Thanks, yes, will do before our meeting. |
for numbers in line 31 what settings should be used for the wordlist - I had unticked treat all as lowercase in Tool Preferences and got 73295 vs 63100; once ticked it give the numbers in the lesson |
Keyness section - should we mention that the selection3 file needs to be open and a word list created. After the previous section I was working with the wordlist that I had generated to compare the corpora and that confused me. Also clarify the settings for the word list - Tool Preference - untick treat all data as lower case. |
Re task 2 are we suggesting that people run this with both corpora and then export Iams keyness txt file and BMC keyness txt file and open side by side in notepad and compare there? |
Comparing concordances section - again I get a different number of results for both corpora - what settings are we using in Tool preferences for the wordlists? I got 3103 results for behind across both corpora |
From meeting 4/12:
|
implement first 5 changes from #12 (comment)
https://github.com/CatalogueLegacies/antconc.github.io/blob/gh-pages/_episodes/09-comparing.md
Draft at #12 (comment) To do:
Keyword List
tool preference, adjusts from default: untick 'Treat all data as lowercase', tick 'show negative keywords', change 'effect size measure' to 'ratio of relative frequencies', change 'Reference Corpus' toglge to 'use word list(s)', and ensure that corpus is loaded (with 'load' button)The text was updated successfully, but these errors were encountered: