All Tags

1-grams 18th-century 18th-century-literature 19th-century 19th-century-literature abstracts academia academic-articles academic-publishing academic-research acoustic african-american african-american-history african-history african-languages african-names-database afrikaans ages ai airbnb albania albanian album-covers albums algorithmic-normalization alignment amazon america american-civil-war american-english american-literature american-south ancient-documents ancient-greek ancient-history ancient-roman ancient-rome ancient-world anglo-saxons animation anime annotated-texts annotations anthropology aozora api arabic aragonese archeology architecture argument armenian art articles artificial-intelligence asia asr attitudes audio audio-transcription audio-transcriptions australia australian-poetry author authority-files authorship-attribution autocorrection azerbaijani baby-names baltimore bash basque belarusian bengali bestellers bias bibliographic-data bibliographic-records bibliography billboard bloggers blogs book-history book-history-data books bosnian boundaries bourdieu brazil brazilian-portuguese breast-cancer breton brightkite british-english british-library british-literature british-newspapers buddhism bulgaria bulgarian bulgarian-history byu byzantine canada canadian-history canon captions catalan categories cdc censorship census-data ceramic-studies ceramics character chat-format chicago china chinese church circulation citations clan classification clinical clustering code-switching college collocates computational-linguistics computational-social-science computer-vision conference-program contemporary-literature cornell corpora corpus correspondence cover-art covid-19 credit creoles crf-parsing crime croatian csv cultural-analytics cultural-heritage culture cuneiform czech danish data-extraction data-journalism data.gov dates dative-alternation deaths debt demographic demographics dependency-grammar dependency-trees description detective dh2013 dh2014 dh2015 dialectology dialogue digital-humanities diplomacy directed diversity download dpla drama ds dutch dutch-literature earnings east-asian-history eastern-mediterranean ebooks ecco ecco-tcp economic economics edges education eebo eebo-tcp egypt eigen-decomposition eighteenth-century eighteenth-century-poetry elections email emolex emotion employment england english epidemiology esperanto estonian ethos europe european-history event-detection facebook fan-fiction fantasy faroese federalist-papers fiction film film-history finnish flickr folklore forced-alignment framing french full-text function-words galician game-of-thrones gameboiy-advance gameboy gender genome-data genre genres gentrification geo-data geographical-data george-r-r-martin georgian geospatial-data german gestures github global-data global-english global-history goodreads google-books gothic government gowalla greece greek groceries gujarati gutenberg haitian-creole hand-corrected handwriting handwritten-text-recognition hate-crimes hathi hathitrust health-data healthcare hebrew higgs-boson hindi hiring historical historical-data historical-linguistics historical-manuscripts historical-maps historical-records historiography history-of-science hollywood homicide horror hotels housing html htrc humor hungarian hypothes.is i-vectors icelandic identity ido ids image image-recognition images incarceration india indian-history indonesian instagram interlingua internet-archive internet-trolls irish isbn islam islamic-history italian italy james-joyce japan japanese javascript jeopardy jewish-history job-listings jokes json kannada keywords khmer kirghiz knowledge-graph koineization korean kurdish land-use language-change language-contact language-typology language-variation large-scale-repository latin latvian law legal-history lemmas lesbian-and-gay-history letter-writing letters lgbt-history liberia libraries library library-data life-expectancy linguistic-variation linguistics linked-data linked-open-data literary-careers literary-characters literary-fiction literary-plot literary-prizes literature lithuanian livejournal loc locations london los-angeles luxembourgish lynching macedonian machine-learning machine-translation magazines making-of-america malay maltese mandarin manuscripts maps marathi marc mary-russell-mitford media medicine medieval medieval-archaeology medieval-history medieval-music menus metadata meter microsoft migration-studies moa modern-south-arabian morphology movie-reviews movie-scripts movies mp3 multiclass-classification multimedia multiple-languages museums mushrooms music muslim-history names natural-language-processing nepali ner nes netflix network-analysis new-york-times newgate news news-articles newspaper newspapers ngrams nineteenth-century nlp noaa nodes non-consumptive nonfiction north-america north-american-review northern-ireland norwegian.norwegian novels nutrition obesity objective-c ocr old-bailey old-english ontology oral-history oral-literature ottoman page segmentation page-level pakistan pandas paris parliament part-of-speech pdf persian personal-narratives personography phd-theses phonetics photos phrases pidgins pill place-names placeography plaintext plays podcast poetry pokec police-data policy-positions polish politics pop portrait portugal portuguese pos post-45 post-medieval-archaeology pragmatics predicates predictions prescriptions prison production products programmable-web pronunciation proper-name-disambiguation prosopography public-data publishers-weekly publishing-industry pubs pymarc python python2 questions r race race-films racism ratings raw readership reasoning reception recommendation records reddit reference regression religion religious-documents requires-key research research-institutes resolutions reviews roman-empire romanian romansh ruby russian sacred-texts saudi-arabic scholarly-communications scholarly-publishing science-fiction scottish sega-genesis semantics semeval sentiment sentiment-analysis serbian serial-fiction sermons shakespeare sign-languages signed-languages silent-films silicon-valley similarity sin singapore singapore-english slave-trade slavery slavic slovak slovakia slovene sms snes social social-computing social-links social-media social-network social-networks social-policy sociolinguistics song-of-ice-and-fire sonnets soqotra soqotri southern-united-states spanish speech speech-recognition spelling spelling-correction spelling-errors spelling-variation spotify squeeze statistical-analysis stop-words stopwords subsets subtitles sumerian supervised survey swahili sweden swedish sydney syllables symmetry-calculations syntax tagalog tags tamil tar.gz tcp technology tei television telugu term-frequencies term-frequency-tables test-data text text-analysis text-messaging text-mining textual-reuse thai theater thomas-gray tibet tibetan-buddhim title tokens topic-modeling topics trademarks training-data transcripts translation trivia turkey turkish turkmen tutorial tweepy tweets twitch twitter txtlab type typos ukrainian united-kingdom united-nations united-states unsupervised urdu usa uzbek vai-script venice video video-games vietnamese violence volapük voting voyages vsm walloon weather web-data web-scraping websites welsh western-frisian wget who wikipedia womens-history word-counts workplace world-english world-literature world-religion worldcat writing-systems xls xml yahoo yemen yiddish youtube zip zone classification zotero