NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Using Word Frequency Lists to Measure Corpus Homogeneity and Similarity between Corpora
Adam Kilgarriff
|
Paper Details:
Year: 1997
Venue:
VLC |
WS |
SIG: SIGDAT
Citations
URL
An Unsupervised Probabilistic Approach for the Detection of Outliers in Corpora
David Guthrie
|
Louise Guthrie
|
Yorick Wilks
|
Comparing Corpora using Frequency Profiling
Paul Rayson
|
Roger Garside
|
A Bayesian Mixture Model for Term Re-occurrence and Burstiness
Avik Sarkar
|
Paul H Garthwaite
|
Anne De Roeck
|
How Comparable are Parallel Corpora? Measuring the Distribution of General Vocabulary and Connectives
Bruno Cartoni
|
Sandrine Zufferey
|
Thomas Meyer
|
Andrei Popescu-Belis
|
The Effects of Corpus Size and Homogeneity on Language Model Quality
Tony G. Rose
|
No URLs Found
Field Of Study
Task
Tagging
Information Retrieval
Language
Chinese
English
Dataset
Biographies
Similar Papers
Creating a Large Multi-Layered Representational Repository of Linguistic Code Switched Arabic Data
Mona Diab
|
Mahmoud Ghoneim
|
Abdelati Hawwari
|
Fahad AlGhamdi
|
Nada AlMarwani
|
Mohamed Al-Badrashiny
|
Sentiment after Translation: A Case-Study on Arabic Social Media Posts
Mohammad Salameh
|
Saif Mohammad
|
Svetlana Kiritchenko
|
Natural Language Processing for Dialectical Arabic: A Survey
Abdulhadi Shoufan
|
Sumaya Alameri
|
Semi-supervised Structured Prediction with Neural CRF Autoencoder
Xiao Zhang
|
Yong Jiang
|
Hao Peng
|
Kewei Tu
|
Dan Goldwasser
|
A Survey of Arabic Named Entity Recognition and Classification
Khaled Shaalan
|