(NOTE: Some of the older papers are hard to find in electronic form. Links marked with an asterisk (*) are as similar to the published paper as I could find.)

Journals

1. Church, K., and Patil, R. (1982) “Coping with Syntactic Ambiguity or How to Put the Block in the Box on the Table,” American Journal of Computational Linguistics, Volume 8, Number 3-4, July-December. pdf (from LDC)
2. Church, K. (1987) “Phonological Parsing and Lexical Retrieval” Cognition, Volume 25, pp. 53-69. ps
3. Church, K., and Hanks, P., “Word Association Norms, Mutual Information and Lexicography,” Computational Linguistics, Vol 16:1, pp. 22-29, (1991). ps
4. Church, K., and Gale, W., “A Comparison of the Enhanced Good-Turing and Deleted Estimation Methods for Estimating Probabilities of English Bigrams,” Computer Speech and Language, 5:1, (1991).
5. Church, K., and Gale, W. (1991) “Probability Scoring for Spelling Correction,” Statistics and Computing. ps (missing figures)
6. Church, K., review of Aarts, J., and Meijs, W. (
7. Gale, W., and Church, K. (1993) “A Program for Aligning Sentences in Bilingual Corpora,” Computational Linguistics, 19:1, pp. 75-102. pdf (via LDC) ps (figures missing) text (containing useful appendix) pdf (via ACM)
8. Gale, W., Church, K. and Yarowsky, D. (1993) “A Method for Disambiguating Word Senses in a Large Corpus,” Computers and Humanities, 26: 415-439. ps
9. Church, K. and Helfman, J. (1993) “Dotplot: a Program for Exploring Self-Similarity in Millions of Lines of Text and Code,” The Journal of Computational and Graphical Statistics, 2:2, pp. 153-174. ps related pages
10. Church, K. and Mercer, R. (1993) “Introduction to the Special Issue on Computational Linguistics Using Large Corpora,” Computational Linguistics, 19:1, pp. 1-24. pdf (via LDC) ps pdf (via ACM)
11. Church, K. and Hovy, E. (1993) “Good Applications for Crummy Machine Translation,” Machine Translation, vol 8, pp. 239-258. ps
12. Church, K. and Rau, L. (1995) “Commercial Applications of Natural Language Processing,” Communications of the ACM, pp. 71-79. ps pdf (via ACM)
13. Church, K. and Gale, W. (1995) “Poisson Mixtures,” Journal of Natural Language Engineering. ps
14. Isabelle, P. and Church, K. (
15. Dagan,
16. Yamamoto, M. and Church, K. (2001) “Using Suffix Arrays to Compute Term Frequency and Document Frequency for All Substrings in a Corpus,” Computational Linguistics, vol 27:1, pp. 1-30, MIT Press. pdf related links
17. Amir, A., Church, K. and Dar, E. (2004), “The submatrices character count problem: an efficient solution using separable values,” Information and Computation, 190:1, pp. 100-116.
18. Church, K. (2005) “Reviewing the Reviewers,” Computational Linguistics, 31:4, pp. 575-578. (via ACM)
19. Li, P. and Church, K. (2007) A sketch algorithm for estimating two-way and multi-way associations, Computational Linguistics 33(3), 305-354.
20. Li, P., Hastie, T., and Church, K. (2007),

Patents

1. 4,829,580 “Text analysis system with letter sequence recognition and speech stress assignment arrangement.”
2. 5,146,405 “Methods for part-of-speech determination and usage.”
3. 5,283,833 “Method and apparatus for speech processing using morphology and rhyming.”
4. 5,541,836 “Word disambiguation apparatus and methods.”
5. 5,572,423 “Method for correcting spelling using error frequencies.”
6. 5,608,622 “System for analyzing translations.”
7. 5,647,023 “Method of nonlinear filtering of degraded document images.”

Conference Proceedings

1. Church, K., (1979) “Co-ordinate Squares: A Solution to Many Chess Pawn Endgames,” (abbreviated version of B.S. Thesis), International Joint Conference on Artificial Intelligence,
2. Church, K. (1980) “On Parsing Strategies and Closure,” Association for Computational Linguistics. pdf (direct from ACL)
3. Church, K. (1983) “A Finite-State Parser for Use in Speech Recognition,” Association for Computational Linguistics. pdf (direct from ACL) pdf (via ACM)
4. Church, K. (1983) “Allophonic and Phonotactic Constraints are Useful,” International Joint Conference on Artificial Intelligence,
5. Church, K. (1983) “A Finite-State Parser for Use in Speech Recognition,” Journal of the Acoustical Society of
6. Ejerhed, E., and Church, K. (1983) “Finite State Parsing,” in Karlsson, F. (ed), Papers from the Seventh Scandinavian Conference of Linguistics,
7. Church, K. (1985) “Stress Assignment in Letter to Sound Rules for Speech Synthesis,” Association for Computational Linguistics. pdf (direct from ACL) ps
8. Church, K. (1986) “Morphological Decomposition and Stress Assignment for Speech Synthesis,” Association for Computational Linguistics. pdf (direct from ACL)
9. Church, K. (1987) “Three Classes of + Boundaries,” The Eleventh International Congress of Phonetic Sciences,
10. Church, K. (1988) “A Stochastic Parts Program and Noun Phrase Parser for Unrestricted Text,” Second Conference on Applied Natural Language Processing,
11. Koskenniemi, K., and Church, K. (1988) “Complexity, Two-Level Morphology and Finnish,” Coling,
12. Church, K. (1989) “Syntactic Parsing May Not Help Speech Recognition Very Much,” in Working Notes of the AAAI Symposium: Text-Based Intelligent Systems.
13. Church, K., and Hanks, P. (1989) “Word Association Norms, Mutual Information and Lexicography,” Association for Computational Linguistics,
14. Church, K., and Gale, W. (1989) “Enhanced Good-Turing and Cat-Cal: Two New Methods for Estimating Probabilities of English Bigrams,” Second DARPA Workshop on Speech and Natural Language,
15. Church, K., and Hindle, D. (1990) “Collocational Constraints and Corpus-Based Linguistics,” in Working Notes of the AAAI Symposium: Text-Based Intelligent Systems.
16. Church, K., and Gale, W. (1990) “Poor Estimates of Context are Worse than None,” Third DARPA Workshop on Speech and Natural Language,
17. Kernighan, M., Church, K., Gale, W. (1990) “A Spelling Correction Program Based on a Noisy Channel Model,” Coling,
18. Coker, C., Church, K., Liberman, M. (1990) “Morphology and Rhyming: Two Powerful Alternatives to Letter-to-Sound Rules for Speech Synthesis,” European Speech Communication Association, Conference on Speech Synthesis. ps
19. Gale, W. A. and K. W. Church (1990) “Estimation Procedures for Language Context: Poor Estimates are Worse than None,” Proceedings in Computational Statistics, 1990, pp. 69-74, Physica-Verlag,
20. Gale, W., and Church, K. (1991) “Identifying Word Correspondences in Parallel Text,” Fourth DARPA Workshop on Speech and Natural Language, Asilomar, pp. 152-157. ps*
21. Gale, W., and Church, K. (1991) “A Program for Aligning Sentences in Bilingual Corpora,” Association for Computational Linguistics. pdf (direct from ACL)
22. Church, K. (1991) “Some Statistical Opportunities in Speech and Language,” 23rd Symposium on the Interface, Computing Science and Statistics,
23. Church, K. (1991) “Concordances for Parallel Text,” Seventh Annual Conference of the UW Centre for the New OED and Text Research,
24. Church, K. and Hovy, E. (1991) “Good Applications for Crummy Machine Translation,” in Neal, J. and Walter, S. Natural Language Processing Systems Evaluation Workshop, University of California, Berkeley, CA.
25. Church, K. and Gale, W. (1992) “One Sense per Discourse,” Proceedings of Fifth DARPA Workshop on Speech and Natural Language. ps
26. Gale, W., Church, K., Yarowsky, D. (1992) “Estimating Upper and Lower Bounds on the Performance of Word-Sense Disambiguation Programs,” Association for Computational Linguistics. pdf (via LDC) ps
27. Gale, W., Church, K., Yarowsky, D. (1992) “Using Bilingual Materials to Develop Word Sense Disambiguation Methods,” TMI,
28. Church, K. and Helfman, J. (1992) “Dotplot: a Program for Exploring Self-Similarity in Millions of Lines of Text and Code,” Interface.
29. Gale, W., Church, K. and Yarowsky, D. (1992) “Work on Statistical Methods for Word Sense Disambiguation,” Proceedings AAAI Fall Workshop on Statistics in Natural Language.
30. Church, K. (1993) “Char_align: A Program for Aligning Parallel Texts at the Character Level,” Association for Computational Linguistics, pp. 1-8. pdf (direct from ACL) ps
31. Dagan,
32. Church, K., Dagan,
33. Church, K., Gale, W., Helfman, J., Lewis, D. (1994) “Fax: An Alternative to SGML,” Coling. pdf (via LDC) ps pdf (via ACM)
34. Fung, P. and Church, K. (1994) “K-vec: A New Approach for Aligning Parallel Texts,” Coling. ps
35. Dagan,
36. Agazzi, O., Church, K., and Gale, W. (1994), “Using OCR and Equalization to Downsample Documents,” Proceedings of the 12th International Conference on Pattern Recognition,
37. Church, K. (1995) “One Term or Two?” Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 310-318. pdf (via ACM) ps
38. Church, K. and Gale, W. (1995) “Inverse Document Frequency (IDF): A Measure of Deviation from Poisson,” Proceedings of the Third Workshop on Very Large Corpora, pp. 121-130. ps
39. Caldwell, D., Church, K. and Fowler, G. (1998), “Using Quantitative Methods to Compress Call Detail: From Data Warehousing to Data Publishing,” Symposium on Quantitative Analysis for Decision Making, pp. 71-79.
40. Yamamoto, M. and Church, K. (1998) “Using Suffix Arrays to Compute Term Frequency and Document Frequency for All Substrings in a Corpus,” Proceedings of the Sixth Workshop on Very Large Corpora, pp. 28-37.
41. Belanger, D., Church, K. and Hume, A. (1999) Virtual Data Warehousing, Data Publishing, and Call Detail, pp. 106-117, W. Jonker (ed.) “Databases in Telecommunications 1999, International Workshop Co-located with VLDB-99,” Edinburgh, Scotland, Proceedings. Lecture Notes in Computer Science, Vol. 1819, Springer, 2000, ISBN 3-540-67667-9 http://www.informatik.uni-trier.de/~ley/db/conf/vldbw/vldbw1999.html. doc*
42. Ozawa, T., Yamamoto, M., Umemura, K., Church, K. (1999) Japanese word segmentation using similarity measure for IR. In Proceedings of the First NTCIR Workshop on Research in Japanese Text Retrieval and Term Recognition.
43. Buchsbaum, A., Caldwell, D., Church, K., Fowler, G. and Muthukrishnan, S. (2000), “Engineering the Compression of Massive Tables: An Experimental Approach,” in Proc. 11th ACM-SIAM Symp. on Discrete Algorithms (SODA), pp. 175-184. pdf related links pdf (via ACM)
44. Church, K. (2000), “Empirical Estimates of Adaptation: The chance of Two Noriega's is closer to p/2 than p²,” Coling, pp. 173-179. pdf (via LDC) pdf ps pdf (via ACM)
45. Umemura, K. and Church, K. (2000), “Empirical Term Weighting and Expansion Frequency,” Empirical Methods in Natural Language Processing and Very Large Corpora, pp. 117-123. pdf
46. Yamamoto, E., Yamamoto, M., Umemura, K. and Church, K. (2000) “Dynamic Programming: A Method for Taking Advantage of Technical Terminology in Japanese Documents,” Information Retrieval with Asian Languages (IRAL), pp. 125-132. pdf (via ACM)
47. Feng, L., Umemura, K., Yamamoto, M. and Church, K. (2000) “Using Variable Length Ngrams for Retrieving Technical Abstracts in Japanese,” Information Retrieval with Asian Languages (IRAL), pp. 213-214. pdf (via ACM)
48. Sable, C. and Church, K. (2001) “Using Bins to Empirically Estimate Term Weights for Text Categorization,” 2001 Conference on Empirical Methods in Natural Language Processing (EMNLP 2001). ps alternate
49. Sable, C., McKeown, K., and Church, K. (2002) “NLP Found Helpful (at least for one Text Categorization Task),” in Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing (EMNLP),
50. Amir, A., Church, K. and Dar, E. (2002), “Separable attributes: a technique for solving the sub matrices character count problem,” ACM-SIAM symposium on Discrete algorithms (SODA), pp. 400-401. pdf (from ACM)
51. Carey E. Priebe, David J. Marchette, Youngser Park, Edward J. Wegman, Jeffrey L. Solka, Diego A. Socolinsky, Damianos Karakos, Ken W. Church, Roland Guglielmi, Ronald R. Coifman, Dekang Lin, Dennis M. Healy, Marc Q. Jacobs, Anna Tsao (2004), “Iterative Denoising for Cross-Corpus Discovery,” COMPSTAT, Prague, Cz. ps (see also, http://www.ams.jhu.edu/~priebe/confri2004.html)
52.
53.
54. Li, P., Hastie, T., Church, K. (2006), “Improving Random Projections Using Marginal Information,” Conference on Learning Theory (COLT).
55. Li, P., Hastie, T., Church, K. (2006), “Very Sparse Random Projections,” KDD.
56. Li, P., Hastie, T., Church, K. (2007), “Nonlinear Estimators and Tail Bounds for Dimension Reduction in L1 Using Cauchy Random Projections,” Conference on Learning Theory (COLT).
57. Li, P., Church, K., Hastie, T. (2007), Conditional Random Sampling: A Sketch-based Sampling Technique for Sparse Data, NIPS.
58. Church, K., Hart, T. and Gao, J. (2007), Compressing Trigram Language Models with Golomb Coding, Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), pp. 199-207, http://www.aclweb.org/anthology/D/D07/D07-1021.
59. Mei, Q. and Church, K. (2008) “Entropy of Search Logs: How Hard is Search? With Personalization? With Backoff?,” The First ACM International Conference on Web Search and Data Mining (WSDM'08), pages 45-54. [pdf] [slides] [video]

Books

1. Church, K. (1988) Phonological Parsing in Speech Recognition, Kluwer.
2. Armstrong, S., Church, K., Isabelle, P., Manzi, S., Tzoukermann, E. and Yarowsky, D. (eds.) (1999), Natural Language Processing Using Very Large Corpora, Kluwer.

Book Chapters

1. Church, R., and Church, K. (1977) “Plans, Goals, and Search Strategies for the Selection of a Move in Chess,” in Frey, P. (ed), Chess Skill in Man and Machine, Springer-Verlag.
2. Martin, W., Church, K., and Patil, R. (1987) “Preliminary Analysis of a Breadth-First Parsing Algorithm: Theoretical and Experimental Results,” L. Bolc (ed.), Natural Language Parsing Systems, Springer-Verlag.
3. Church, K. (1987) “Phonological Parsing and Lexical Retrieval” in Frauenfelder, U. and Tyler, L., (ed) Spoken Word Recognition, MIT Press.
4. Church, K., Gale, W., Hanks, P., Hindle, D. (1991) “Parsing, Word Associations and Typical Predicate-Argument Relations,” in Tomita, M. (ed.) Current Issues in Parsing Technology, Kluwer Academic Publishers, Boston, MA. ps
5. Church, K., Hanks, P., Hindle, D., Gale, W. (1991) “Using Statistics in Lexical Analysis,” in Zernik (ed), Lexical Acquisition: Using On-line Resources to Build a Lexicon, Lawrence Erlbaum, pp. 115-164. ps
6. Liberman, M., and Church, K. (1991) “Text Analysis and Word Pronunciation in Text-to-Speech Synthesis,” in Furui, S., and Sondhi, M. (eds.), Advances in Speech Signal Processing. ps*
7. Church, K., Hanks, P., Hindle, D., Gale, W., Moon, R., “Substitutability,” in Atkins and Zampolli (ed.), (1994), Computational Approaches to the Lexicon Automating the Lexicon II Schema, Oxford University Press, pp. 153-180. ps
8. Church, K., “Text Analysis,” in Mellish (ed.), (to appear) Encyclopedia of Language and Linguistics, Pergamon Press, Aberdeen University Press.
9. Church, K., “Comments on Computational Learning Model for Metrical Phonology,” Levine, R. (ed.), Formal Grammar: Theory and Implementation, Vancouver Studies in Cognitive Science Series, UBC Press, Vol. 2 (1992). ps
10. Church, K. (1992) “Current Practice in Part of Speech Tagging and Suggestions for the Future,” in Simmons (ed.), Sbornik praci: In Honor of
11. Gale, W. and Church, K. (1993) “Statistical Approaches to Aligning Sentences and Identifying Word Correspondences in Parallel Texts: a Report on Work in Progress,” in D. J. Hand (ed.) Artificial Intelligence Frontiers in Statistics, AI and Statistics III, pp. 281-294.
12. Gale, W. and Church, K. (1994) “What's Wrong with Adding One?” in N. Oostdijk and P. de Haan (eds.), Corpus-Based Research into Language: In honour of Jan Aarts, Rodopi, Amsterdam, pp. 189-200. ps
13. Gale, W. and Church, K. (1994) “Discrimination Decisions in 100,000 Dimensional Spaces,” in A. Zampolli, N. Calzolari and M. Palmer (eds.), Current Issues in Computational Linguistics: In honour of Don Walker, Kluwer Academic Publishers, The Netherlands, pp. 429-550. ps
14.

Tutorials

1. Ngrams, ACL 1995. ps
2. NLP Techniques and Text Retrieval, Lugano
3. Unix for Poets ps
4. Applications of Text Analysis ps
5. Empirical Methods in Natural Language Processing: What's Happened Since the First SIGDAT Meeting? NAACL-2000 abstract, slides & handout

Invited Talks

1. EACL-1993
2. What’s Happened Since the First SIGDAT Meeting?, SIGDAT-1999 ppt
3. Empiricism from TMI-1992 to AMTA-2002 to AMTA-2012: Have IBM Models 1-5 failed to solve all the world's problems? AMTA-2002 ppt
4. Church, K. (2003) “Speech and Language Processing: Where have we been and where are we going,” Eurospeech,
5. LREC 2004 Workshop CFP: MEMURA-2004 Methodologies and Evaluation of Multiword Units in Real-world Applications, http://memura2004.di.ubi.pt/ ppt
6. Joint talk to EMNLP-2004 (http://www.cs.ualberta.ca/~lindek/emnlp04/) and Senseval-2004 (http://www.cs.unt.edu/~rada/senseval/senseval3/workshop.html) (2004) ppt
7. Church, K. (2004) “Speech and Language Processing: Can we use the past to predict the future,” Seventh International Conference on TEXT, SPEECH and DIALOGUE, Brno, Czech Republic. slides pdf

Panels

1. KDD-2004. slides