Introduction to the special issue on--688IT编程网

The MIT Press Journals

mitpress.mit.edu/journals

This article is provided courtesy of The MIT Press.

To join an e-mail alert list and receive the latest news on our publications, please visit: mitpress.mit.edu/e-mail

Introduction to the Special Issue on Computational Anaphora Resolution

Ruslan Mitkov∗Branimir Boguraev†

University of Wolverhampton IBM T.J.Watson Research Center Shalom Lappin‡

King’s College,London

Anaphora accounts for cohesion in texts and is a phenomenon under active study in formal and computational linguistics alike.The correct interpretation of anaphora is vital for natural language processing(NLP).For example,anaphora resolution is a key task in natural language interfaces,machine t

ranslation,text summarization, information extraction,question answering,and a number of other NLP applications.

After considerable initial research,followed by years of relative silence in the early 1980s,anaphora resolution has attracted the attention of many researchers in the last10 years and a great deal of successful work on the topic has been carried out.Discourse-oriented theories and formalisms such as Discourse Representation Theory and Cen-tering Theory inspired new research on the computational treatment of anaphora.The drive toward corpus-based robust NLP solutions further stimulated interest in alterna-tive and/or data-enriched approaches.Last,but not least,application-driven research in areas such as automatic abstracting and information extraction independently high-lighted the importance of anaphora and coreference resolution,boosting research in this area.

Much of the earlier work in anaphora resolution heavily exploited domain and lin-guistic knowledge(Sidner1979;Carter1987;Rich and LuperFoy1988;Carbonell and Brown1988),which was difﬁcult both to represent and to process,and which required considerable human input.However,the pressing need for the development of robust and inexpensive solutions to meet the demands of practical NLP systems encouraged many researchers to move away from extensive domain and linguistic knowledge and to embark instead upon knowledge-poor anaphora resolution strategies.A nu

mber of proposals in the1990s deliberately limited the extent to which they relied on domain and/or linguistic knowledge and reported promising results in knowledge-poor oper-ational environments(Dagan and Itai1990,1991;Lappin and Leass1994;Nasukawa 1994;Kennedy and Boguraev1996;Williams,Harvey,and Preston1996;Baldwin1997; Mitkov1996,1998b).

The drive toward knowledge-poor and robust approaches was further motivated by the emergence of cheaper and more reliable corpus-based NLP tools such as part-of-speech taggers and shallow parsers,alongside the increasing availability of corpora and other NLP ,ontologies).In fact,the availability of corpora,both raw and annotated with coreferential links,provided a strong impetus to anaphora resolu-∗School of Humanities,Language and Social Sciences,Stafford Street,Wolverhampton WV11SB,UK.

E-mail:r.mitkov@wlv.ac.uk

†30Saw Mill River Road,Hawthorne,NY10532,USA.E-mail:bkb@watson.ibm

‡Department of Computer Science,King’s College,The Strand,London WC2R2LS,UK.

E-mail:lappin@dcs.kcl.ac.uk

c 2001Association for Computational Linguistics

Computational Linguistics Volume27,Number4 tion with regard to both training and evaluation.Corpora(especially when annotated) are an invaluable source not only for empirical research but also for automated learning (e.g.,machine learning)methods aiming to develop new rules and approaches;they also provide an important resource for evaluation of the implemented approaches. From simple co-occurrence rules(Dagan and Itai1990)through training decision trees to identify anaphor-antecedent pairs(Aone and Bennett1995)to genetic algorithms to optimize the resolution factors(Or˘a san,Evans,and Mitkov2000),the successful per-formance of more and more modern approaches was made possible by the availability of suitable corpora.

While the shift toward knowledge-poor strategies and the use of corpora repre-sented the main trends of anaphora resolution in the1990s,there are other signiﬁ-cant highlights in recent anaphora resolution research.The inclusion of the corefer-ence task in the Sixth and Seventh Message Understanding Conferences(MUC-6and MUC-7)gave a considerable impetus to the development of coreference resolution algorithms and systems,such as those described in Baldwin et al.(1995),Gaizauskas and Humphreys(1996),and Kameyama(1997).The last decade of the20th century saw a number of anaphora resolution projects for languages other than English such as French,German,Japanese,Spa

nish,Portuguese,and Turkish.Against the background of a growing interest in multilingual NLP,multilingual anaphora/coreference reso-lution has gained considerable momentum in recent years(Aone and McKee1993; Azzam,Humphreys,and Gaizauskas1998;Harabagiu and Maiorano2000;Mitkov and Barbu2000;Mitkov1999;Mitkov and Stys1997;Mitkov,Belguith,and Stys1998). Other milestones of recent research include the deployment of probabilistic and ma-chine learning techniques(Aone and Bennett1995;Kehler1997;Ge,Hale,and Char-niak1998;Cardie and Wagstaff1999;the continuing interest in centering,used either in original or in revised form(Abra¸c os and Lopes1994;Strube and Hahn1996;Hahn and Strube1997;Tetreault1999);and proposals related to the evaluation methodology in anaphora resolution(Mitkov1998a,2001b).For a more detailed survey of the state of the art in anaphora resolution,see Mitkov(forthcoming).

The papers published in this issue reﬂect the major trends in anaphora resolution in recent years.Some of them describe approaches that do not exploit full syntactic knowledge(as in the case of Palomar et al.’s and Stuckardt’s work)or that employ machine learning techniques(Soon,Ng,and Lim);others present centering-based pro-noun resolution(Tetreault)or discuss theoretical centering issues(Kibble).Almost all of the papers feature extensive evaluation(including comparative evaluation as in the case of Tetreault’s and Palomar et al.’s work)or discuss general evaluation issues (Byron as well as Stuckardt).

Palomar et al.’s paper describes an approach that works from the output of a partial parser and handles third person personal,demonstrative,reﬂexive,and zero pronouns,featuring among other things syntactic conditions on Spanish NP-pronoun noncoreference and an enhanced set of resolution preferences.The authors also im-plement several known methods and compare their performance with that of their own algorithm.An indirect conclusion from this work is that an algorithm requires semantic knowledge in order to hope for a success rate higher than75%.

Soon,Ng,and Lim describe a C5-based learning approach to coreference resolu-tion of noun phrases in unrestricted text.The approach learns from a small,annotated corpus and tackles pronouns,proper names,and deﬁnite descriptions.The coreference resolution module is part of a larger coreference resolution system that also includes sentence segmentation,tokenization,morphological analysis,part-of-speech tagging, noun phrase identiﬁcation,named entity recognition,and semantic class determina-tion(via WordNet).The evaluation is carried out on the MUC-6and MUC-7test 474

Mitkov,Boguraev,and Lappin Anaphora Resolution:Introduction corpora.The paper reports on experiments aimed at quantifying the contribution of each resolution factor and features error analysis.

Stuckardt’s work presents an anaphor resolution algorithm for systems where only partial syntactic info

centering

rmation is available.Stuckardt applies Government and Bind-ing Theory principles A,B,and C to the task of coreference resolution on partially parsed texts.He also argues that evaluation of anaphora resolution systems should take into account several factors beyond simple accuracy of resolution.In particular, both ,related to the selection of optimal resolution factors) and ,related to the requirement of the application,as in the case of information extraction,where a proper name antecedent is needed)evaluation metrics should be considered.

Tetreault’s contribution features comparative evaluation involving the author’s own centering-based pronoun resolution algorithm called the Left-Right Centering algorithm(LRC)as well as three other pronoun resolution methods:Hobbs’s naive algorithm(Hobbs1978),BFP(Brennan,Friedman,and Pollard1987),and Strube’s S-list approach(Strube1998).The LRC is an alternative to the original BFP algorithm in that it processes utterances incrementally.It works byﬁrst searching for an antecedent in the current sentence;if none can be found,it continues the search on the Cf-list of the previous and the other preceding utterances in a left-to-right fashion.

In her squib,Byron maintains that additional kinds of information should be included in an evaluation in order to make the performance of algorithms on pronoun resolution more transparent.In particular,she suggests that the pronoun coverage be explicitly reported and proposes that the evaluation details be

presented in a concise and compact tabular format called standard disclosure.Byron also proposes a measure, the resolution rate,which is computed as the number of pronouns resolved correctly divided by the number of(only)referential pronouns.

Finally,in his squib Kibble discusses a reformulation of the centering transitions (Continue,Retain,and Shift),which specify the center movement across sentences. Instead of deﬁning a total preference ordering,Kibble argues that a partial ordering emerges from the interaction among cohesion(maintaining the same center),salience (realizing the center as subject),and cheapness(realizing the anticipated center of a following utterance as subject).

The last years have seen considerable advances in theﬁeld of anaphora resolution, but a number of outstanding issues either remain unsolved or need more attention and,as a consequence,represent major challenges to the further development of the ﬁeld(Mitkov2001a).A fundamental question that needs further investigation is how far the performance of anaphora resolution algorithms can go and what the limitations of knowledge-poor methods are.In particular,more research should be carried out on the factors inﬂuencing the performance of these algorithms.One of the impediments to the evaluation or fuller utilization of machine learning techniques is the lack of widely available corpora annotated for anaphoric or coreferential links.More work toward the proposal of consistent and compre

hensive evaluation is necessary;so too is work in multilingual contexts.Some of these challenges have been addressed in the papers published in this issue,but ongoing research will continue to address them in the near future.

References

Abra¸c os,Jose and Jos´e Lopes.1994. Extending DRT with a focusing mechanism for pronominal anaphora and ellipsis resolution.In Proceedings of the15th

International Conference on Computational Linguistics(COLING’94),pages1128–1132, Kyoto,Japan.

Aone,Chinatsu and Scott Bennett.1995. Evaluating automated and manual

475

Computational Linguistics Volume27,Number4

acquisition of anaphora resolution strategies.In Proceedings of the33rd Annual Meeting of the Association for Computational Linguistics(ACL’95),pages122–129,Las Cruces,NM.

Aone,Chinatsu and Douglas McKee.1993.

A language-independent anaphora resolution system for understanding multilingual texts.In Proceedings of the31st Annual Meeting of the Association for Computational Linguistics(ACL’93),

pages156–163,Columbus,OH. Azzam,Saliha,Kevin Humphreys,and Robert Gaizauskas.1998.Coreference resolution in a multilingual information extraction.In Proceedings of a Workshop on Linguistic Coreference,Granada,Spain. Baldwin,Breck.1997.CogNIAC:High precision coreference with limited knowledge and linguistic resources.In Proceedings of the ACL’97/EACL’97 Workshop on Operational Factors in Practical, Robust Anaphora Resolution for Unrestricted Texts,pages38–45,Madrid,Spain. Baldwin,Breck,Jeff Reynar,Mike Collins, Jason Eisner,Adwait Ratnaparki,Joseph Rosenzweig,Anoop Sarkar,and Srivinas Bangalore.1995.Description of the University of Pennsylvania system used for MUC-6.In Proceedings of the Sixth Message Understanding Conference

(MUC-6),pages177–191,Columbia,MD. Brennan,Susan,Marilyn Friedman,and Carl Pollard.1987.A centering approach to pronouns.In Proceedings of the25th Annual Meeting of the Association for Computational Linguistics(ACL’87),

pages155–162,Stanford,CA. Carbonell,Jaime and Ralf Brown.1988. Anaphora resolution:A multi-strate

gy approach.In Proceedings of the12th International Conference on Computational Linguistics(COLING’88),volume1, pages96–101,Budapest,

Hungary.

Cardie,Claire and Kiri Wagstaff.1999. Noun phrase coreference as clustering.In Proceedings of the1999Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, pages82–89,College Park,MD. Carter,David M.1987.Interpreting Anaphors in Natural Language Texts.Ellis Horwood, Chichester,UK.

Dagan,Ido and Alon Itai.1990.Automatic processing of large corpora for the resolution of anaphora references.In Proceedings of the13th International Conference on Computational Linguistics (COLING’90),volume3,pages1–3, Helsinki,Finland.Dagan,Ido and Alon Itai.1991.A statistical ﬁlter for resolving pronoun references.In Yishai A.Feldman and Alfred Bruckstein, editors,Artiﬁcial Intelligence and Computer Vision.Elsevier Science Publishers B.V. (North-Holland),Amsterdam,pages

125–135.

Gaizauskas,Robert and Kevin Humphreys. 1996.Quantitative evaluation of coreference algorithms in a

n information extraction system.Presented at Discourse Anaphora and Anaphor Resolution Colloquium(DAARC-1),Lancaster,UK. Reprinted in Simon Botley and Tony McEnery,editors,Corpus-Based and Computational Approaches to Discourse Anaphora.John Benjamins,Amsterdam, 2000,pages143–167.

Ge,Niyu,John Hale,and Eugene Charniak. 1998.A statistical approach to anaphora resolution.In Proceedings of the Sixth Workshop on Very Large Corpora,

pages161–170,Montreal,Canada. Hahn,Udo and Michael Strube.1997. Centering-in-the-large:Computing referential discourse segments.In Proceedings of the35th Annual Meeting of the Association for Computational Linguistics (ACL’97/EACL’97),pages104–111, Madrid,Spain.

Harabagiu,Sanda and Steven Maiorano. 2000.Multilingual coreference resolution. In Proceedings of Conference on Applied Natural Language Processing/North American Chapter of the Association for Computational Linguistics(ANLP-NAACL2000),pages 142–149,Seattle,WA.

Hobbs,Jerry.1978.Resolving pronoun references.Lingua,44:311–338. Kameyama,Megumi.1997.Recognizing referential links:An information extraction perspective.In Proceedings of the ACL’97/EACL’97Workshop on Operational Factors in Practical,Robust Anaphora R

esolution for Unrestricted Texts,

pages46–53,Madrid,Spain.

Kehler,Andrew.1997.Probabilistic coreference in information extraction.In Proceedings of the2nd Conference on Empirical Methods in Natural Language Processing(EMNLP-2),pages163–173, Providence,RI.

Kennedy,Christopher and Branimir Boguraev.1996.Anaphora for everyone: Pronominal anaphora resolution without a parser.In Proceedings of the16th International Conference on Computational Linguistics(COLING’96),pages113–118, Copenhagen,Denmark.

Lappin,Shalom and Herbert Leass.1994. An algorithm for pronominal anaphora

476

Mitkov,Boguraev,and Lappin Anaphora Resolution:Introduction

resolution.Computational Linguistics,

20(4):535–561.

Mitkov,Ruslan.1996.Pronoun resolution: The practical alternative.Presented at the Discourse Anaphora and Anaphor Resolution Colloquium(DAARC-1), Lancaster,UK.Reprinted in Simon Botley and Tony McEnery,editors,Corpus-Based and Computational Approaches to Discourse Anaphora.John Benjamins,Amsterdam, 2000,189–212.

Mitkov,Ruslan.1998a.Evaluating anaphora resolution approaches.In Proceedings of the Discourse Anaphora and Anaphora Resolution Colloquium(DAARC-2),Lancaster,UK. Mitkov,Ruslan.1998b.Robust pronoun resolution with limited knowledge.In Proceedings of the36th Annual Meeting of the Association for Computational Linguistics and the17th International Conference on Computational Linguistics

(COLING’98/ACL’98),pages869–875, Montreal,Canada.

Mitkov,Ruslan.1999.Multilingual anaphora resolution.Machine Translation,

14(3–4):281–299.

Mitkov,Ruslan.2001a.Outstanding issues in anaphora resolution.In Alexander Gelbukh,editor,Computational Linguistics and Intelligent Text Processing.Springer, Berlin,pages110–125.

Mitkov,Ruslan.2001b.Towards a more consistent and comprehensive evaluation of anaphora resolution algorithms and systems.Applied Artiﬁcial Intelligence:An International Journal,15:253–276. Mitkov,Ruslan.Forthcoming.Anaphora Resolution.Longman,Harlow,UK. Mitkov,Ruslan,Lamia Belguith,and Malgorzata Stys.1998.Multilingual robust anaphora resolution.In Proceedings of the Third International Conference on Empirical Methods in Natural Language Processing (EMNLP-3),pages7–16,Granada,Spain. Mitkov,Ruslan and Malgorzata Stys.1997. Robust reference resolution with limited knowledge:High precision genre-speciﬁc approach for English and Polish.In Proceedings of the International Conference on Recent Advances in Natural Language Processing(RANLP’97),pages74–81, Tzigov Chark,Bulgaria.Mitkov,Ruslan and Catalina Barbu.2000. Improving pronoun resolution in two languages by means of bilingual corpora. In Proceedings of the Discourse,Anaphora and Reference Resolution Conference(DAARC 2000),pages133–137,Lancaster,UK. Nasukawa,Tetsuya.1994.Robust method of pronoun resolution using full-text information.In Proceedings of the15th International Conference on Computational Linguistics(COLING’94),pages1157–1163, Kyoto,Japan.

Or˘a san,Constantin,Richard Evans,and Ruslan Mitkov.2000.Enhancing preference-based anaphora resolution with genetic algorithms.In Proceedings of NLP-2000,pages185–195,Patras,Greece. Rich,El

aine and Susann LuperFoy.1988.An architecture for anaphora resolution.In Proceedings of the Second Conference on Applied Natural Language Processing (ANLP-2),pages18–24,Austin,TX. Sidner,Candace.1979.Toward a computational theory of deﬁnite anaphora comprehension in English.Technical Report AI-TR-537,MIT,Cambridge,MA. Strube,Michael.1998.Never look back:An alternative to centering.In Proceedings of the36th Annual Meeting of the Association for Computational Linguistics and the17th International Conference on Computational Linguistics(COLING’98/ACL’98),

pages1251–1257,Montreal,Canada. Strube,Michael and Udo Hahn.1996. Functional centering.In Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics(ACL’96),

pages270–277,Santa Cruz,CA. Tetreault,Joel.1999.Analysis of

syntax-based pronoun resolution methods.In Proceedings of the37th Annual Meeting of the Association for Computational Linguistics(ACL’99),pages602–605, College Park,MD.

Williams,Sandra,Mark Harvey,and Keith Preston.1996.Rule-based reference resolution for unrestricted text using

part-of-speech tagging and noun phrase parsing.In Proceedings of the Discourse Anaphora and Anaphora Resolution Colloquium(DAARC-1),pages441–456, Lancaster,UK.

477

688IT编程网

Introduction to the special issue on

发表评论

推荐文章

应用程序的安全检测方法、装置、电子设备和存储介质

nginx map用法正则

VBA之正则表达式(1)--基础篇

Prometheus监控学习笔记之初识PromQL

关于PHP中的webshell

热门文章

m函数数字提取

jest断言方法大全

中兴ZXSEC US 管理员手册

keras系列(一):参数设置

Qt从QString中提取出数字

element input 金额千分位格式化

freemaker 参数解析正则

C#正则验证数字

form表单验证正则

scanf正则表达式用法

grafana value的正则表达式

Android平台浮点数运算应用

js-(JS正则表达式验证数字)

判断Python输入是否是整数,字符,或浮点数

c语言 sscanf 正则规则

从文本中提取数值技巧

js将整数转换成两位浮点数的方法

vue正则限制浮点数

8到20的结尾的正则

shell 正则表达式最后一行

最新文章

应用程序的安全检测方法、装置、电子设备和存储介质

VBA之正则表达式(1)--基础篇

代码编辑的辅助方法、装置及电子设备

SHELL查字符串中包含字符的命令

String方法中replace和replaceAll的区别详解(源码分析)

双字节符号正则

标签列表

688IT编程网

Introduction to the special issue on

发表评论

推荐文章

应用程序的安全检测方法、装置、电子设备和存储介质

nginx map用法 正则

VBA之正则表达式(1)--基础篇

Prometheus监控学习笔记之初识PromQL

关于PHP中的webshell

热门文章

m函数数字提取

jest断言方法大全

中兴ZXSEC US 管理员手册

keras系列(一):参数设置

Qt从QString中提取出数字

element input 金额千分位格式化

freemaker 参数解析正则

C#正则验证数字

form表单验证正则

scanf正则表达式用法

grafana value的正则表达式

Android平台浮点数运算应用

js-(JS正则表达式验证数字)

判断Python输入是否是整数,字符,或浮点数

c语言 sscanf 正则规则

从文本中提取数值技巧

js将整数转换成两位浮点数的方法

vue正则限制浮点数

8到20的结尾的正则

shell 正则表达式 最后一行

最新文章

应用程序的安全检测方法、装置、电子设备和存储介质

VBA之正则表达式(1)--基础篇

代码编辑的辅助方法、装置及电子设备

SHELL查字符串中包含字符的命令

String方法中replace和replaceAll的区别详解(源码分析)

双字节符号正则

标签列表

nginx map用法正则

shell 正则表达式最后一行