Currently my main research focus is on parsing algorithms, both from a theoretical and practical standpoint.
I work with parsing
schemata as a framework for designing, studying and comparing
parsers for constituency-based (context-free
grammars and tree-adjoining
grammars) and dependency-based formalisms, including robust,
error-repair parsers. The practical aspect
consists of implementing these parsers, evaluating their performance
and using them on tasks such as information
retrieval.
Most of this was included in my PhD dissertation, titled "Parsing
Schemata for Practical Text Analysis", which was defended in June 2009, and in the book of the same title published in 2010. The book is based on the thesis, but it contains extensive changes, additions and improvements.
Lately, I am also working on the formalism of linear
context-free rewriting systems (LCFRS), which is a general class of
mildly context-sensitive grammar systems whose synchronous rewriting
capabilities make it useful for machine translation.
Another line of research, related to the previous two, is that of non-projective dependency parsing:
the search for parsing algorithms that can efficiently handle
linguistic structures that contain crossing dependency links (or,
roughly equivalently, that contain discontinuous phrases). I am
attacking this problem both from the point of view of parsing schemata
and from that of transition-based dependency parsing.
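As a small illustration of what "crossing dependency links" means, the following sketch lists the pairs of links that cross in a dependency tree. It is only an illustrative check, not one of the parsing algorithms from the publications below. The function name `crossing_arcs`, the head-list encoding (`heads[i]` is the 1-based position of the head of word `i+1`, with 0 marking the root) and the choice to ignore the link from the artificial root are all assumptions made for this example.

```python
from itertools import combinations


def crossing_arcs(heads):
    """Return all pairs of dependency links that cross each other.

    heads[i] is the 1-based position of the head of word i+1;
    0 marks the root word (assumed encoding, links from the
    artificial root are ignored here).
    """
    # Represent each link by its (left endpoint, right endpoint).
    arcs = sorted(
        (min(head, dep), max(head, dep))
        for dep, head in enumerate(heads, start=1)
        if head != 0
    )
    crossings = []
    for (a, b), (c, d) in combinations(arcs, 2):
        # After sorting, a <= c. Two links cross when exactly one
        # endpoint of the second lies strictly inside the first.
        if a < c < b < d:
            crossings.append(((a, b), (c, d)))
    return crossings


# "A hearing is scheduled on the issue today":
# "on" (5) depends on "hearing" (2), "today" (8) on "scheduled" (4).
example = [2, 3, 0, 3, 2, 7, 5, 4]
print(crossing_arcs(example))
```

A tree for which this function returns an empty list has no crossing links, in the sense used above.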
The current results of this work can be seen in the publications below.
Other subjects I am interested in, but have not touched much due to
insufficient multitasking capabilities, are text summarization, machine
translation, supervised and unsupervised grammar induction, dialogue
systems and natural language generation.
Publications
Publications are listed in reverse chronological order. A
list of publications by all members of the COLE research group can be
found here.
-2012- Carlos Gómez-Rodríguez and Daniel Fernández-González, Dependency Parsing with Undirected Graphs, in
Proc. of the 13th Conference of the European Chapter of the Association
for Computational Linguistics (EACL 2012), Avignon, France, 2012.
Pending publication.
-2011- Carlos Gómez-Rodríguez, John Carroll and David Weir, Dependency Parsing Schemata and Mildly Non-Projective Dependency Parsing, Computational Linguistics, 37(3):541-586, 2011. ISSN 0891-2017.
[GomCarWei2011a.pdf]
Shay Cohen, Carlos Gómez-Rodríguez and Giorgio Satta, Exact Inference for Generative Probabilistic Non-Projective Dependency Parsing, in
Proc. of the 2011 Conference on Empirical Methods in Natural Language
Processing (EMNLP 2011), pp. 1234-1245. Edinburgh, UK, 2011. ISBN
978-1-937284-11-4.
[CohGomSat2011a.pdf]
Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta, Dynamic Programming Algorithms for Transition-Based Dependency Parsers, in
Proc. of 49th Annual Meeting of the Association for Computational
Linguistics: Human Language Technologies (ACL HLT 2011), pp. 673-682, Portland,
Oregon, USA, 2011. ISBN 978-1-932432-87-9.
[KuhGomSat2011a.pdf]
-2010- Carlos Gómez-Rodríguez, Parsing
Schemata for Practical Text Analysis,
Volume 1 of Mathematics, Computing, Language and Life: Frontiers in
Mathematical Linguistics and Language Theory.
Imperial College Press, 2010. ISBN 978-1-84816-560-1.
[Book
website] [Buy at Amazon]
Joakim Nivre, Laura Rimell, Ryan McDonald and Carlos Gómez-Rodríguez, Evaluation of Dependency Parsers on Unbounded Dependencies,
in Proc. of the 23rd International Conference on Computational
Linguistics (COLING 2010), pp. 833-841, Beijing, China, 2010. ISBN
978-7-900268-00-6.
[NivRimMcDGom2010a.pdf]
Carlos Gómez-Rodríguez and Joakim Nivre, A Transition-Based Parser for 2-Planar Dependency Structures,
in Proc. of the 48th Annual Meeting of the Association for
Computational Linguistics (ACL 2010), pp. 1492-1501, Uppsala, Sweden, 2010. ISBN 978-1-932432-67-1.
[GomNiv2010a.pdf]
Carlos Gómez-Rodríguez, Marco Kuhlmann and Giorgio Satta, Efficient Parsing of Well-Nested Linear Context-Free Rewriting Systems, in
NAACL HLT 2010. Human Language Technologies: The 11th Annual Conference
of the North American Chapter of the Association for Computational
Linguistics. Proceedings of the Conference, pp. 276-284, Los Angeles, California,
USA, 2010. ISBN 978-1-932432-65-7. [GomKuhSat2010a.pdf]
Carlos Gómez-Rodríguez, Miguel A.
Alonso and Manuel Vilares, Error-repair
parsing schemata,
Theoretical Computer Science, 411(7-9):1121-1139, 2010. ISSN 0304-3975. DOI 10.1016/j.tcs.2009.12.007.
[GomAloVil2010a.pdf]
-2009- Carlos Gómez-Rodríguez, Parsing
Schemata for Practical Text Analysis,
Ph.D. Thesis, Departamento de Computación, Universidade da Coruña,
Spain, 2009 (xviii + 290 pp).
Carlos Gómez-Rodríguez and Giorgio
Satta, An
Optimal-Time Binarization Algorithm for Linear Context-Free Rewriting
Systems with Fan-Out Two,
in Proc. of Joint conference of the 47th Annual Meeting of the
Association for Computational Linguistics and the 4th International
Joint Conference on Natural Language Processing of the Asian Federation
of Natural Language Processing (ACL-IJCNLP 2009), pp. 985-993, Suntec,
Singapore, 2009. ISBN 1-932432-46-9.
[GomSat2009a.pdf]
Carlos Gómez-Rodríguez, Marco
Kuhlmann, Giorgio Satta and David Weir, Optimal
Reduction of Rule Length in Linear Context-Free Rewriting Systems,
in Proc. of the North American Chapter of the Association for
Computational Linguistics - Human Language Technologies Conference
(NAACL'09:HLT), pp. 539-547, Boulder, Colorado,
2009. ISBN 978-1-932432-41-1.
[GomKuhSatWei2009a.pdf]
Carlos Gómez-Rodríguez, David Weir and
John Carroll, Parsing
Mildly Non-Projective Dependency Structures,
in Proc. of the 12th Conference of the European Chapter of the
Association for Computational Linguistics (EACL-09), pp. 291-299,
Athens, Greece,
2009. ISBN 978-1-932432-16-9.
[GomWeiCar2009a.pdf]
Carlos Gómez-Rodríguez, Miguel A.
Alonso and Manuel Vilares, A
general method for transforming standard parsers into error-repair
parsers,
in
Alexander Gelbukh (ed.), Computational
Linguistics and Intelligent Text Processing, volume 5449 of
Lecture Notes in
Computer Science, pp. 207-219, Springer-Verlag, Berlin-Heidelberg-New
York, 2009.
ISSN 0302-9743, DOI 10.1007/978-3-642-00382-0_17.
[GomAloVil2009a.pdf]
Carlos Gómez-Rodríguez, Jesús Vilares
and Miguel A. Alonso, A
compiler for parsing schemata,
Software: Practice and Experience, 39(5):441-470, 2009. ISSN 0038-0644,
DOI 10.1002/spe.904.
(preprint: [GomVilAlo2009a.pdf])
-2008- Carlos Gómez-Rodríguez, John Carroll
and David Weir, A
Deductive Approach to Dependency Parsing,
in Proc. of The 46th Annual Meeting of the Association for
Computational Linguistics: Human Language Technologies (ACL'08:HLT),
pp. 968-976,
Columbus, Ohio, USA, 2008. ISBN 978-1-932432-04-6.
[GomCarWei2008a.pdf]
Carlos Gómez-Rodríguez,
David Weir and John Carroll, Parsing
Mildly Non-Projective Dependency Structures (extended version of the
homonymous paper to appear in EACL-09),
Technical Report CSRP 600, Department of Informatics, University of
Sussex. ISSN 1350-3162.
[Download]
-2007- Carlos Gómez-Rodríguez, Jesús Vilares
and Miguel A. Alonso, Compiling
Declarative Specifications of Parsing Algorithms,
in R. Wagner, R. Newell and G. Pernul (eds.), Database and Expert
Systems Applications, volume 4653 of Lecture Notes in Computer Science,
pp. 529-538, Springer-Verlag, Berlin-Heidelberg-New York, 2007. ISSN
0302-9743.
[GomVilAlo2007a.pdf]
Carlos Gómez-Rodríguez, Miguel A.
Alonso and Manuel Vilares, Generation
of indexes for compiling efficient parsers from formal specifications,
in Proc. of Eleventh International Conference on Computer Aided Systems
Theory (EUROCAST 2007), Las Palmas, Spain, 2007.
(as Extended Abstract: [GomAloVil2007a.pdf])
and in Roberto
Moreno-Díaz, Franz
Pichler, and Alexis Quesada-Arencibia (eds.), Computer Aided Systems
Theory, volume of Lecture Notes in Computer Science, Springer-Verlag,
Berlin-Heidelberg-New York, 2007. ISSN 0302-9743.
[GomAloVil2007b.pdf]
Carlos Gómez-Rodríguez, Miguel A. Alonso and Manuel Vilares, Técnicas
deductivas para el análisis sintáctico con corrección de errores,
Procesamiento del Lenguaje Natural, 39:295-296, 2007. ISSN 1135-5948.
[GomAloVil2007c.pdf]
Carlos Gómez-Rodríguez, Jesús Vilares and Miguel A. Alonso, Prototyping
Efficient Natural Language Parsers,
in Proc. of International Conference RANLP 2007, Recent Advances in
Natural Language Processing, pp. 246-250, Borovets, Bulgaria, 2007.
ISBN 978-954-91743-7-3. [GomVilAlo2007b.pdf]
-2006-
Jesús Vilares, Carlos Gómez-Rodríguez
and Miguel A. Alonso, Syntactic
and pseudo-syntactic approaches for text retrieval,
in Vicente P. Guerrero-Bote (ed.), Current Research in Information
Sciences and Technologies: multidisciplinary approaches to global
information systems. Proceedings of the First International Conference
on Multidisciplinary Information Sciences and Technologies --- InSciT
2006. October 25-28th, 2006. Mérida, Spain, pp. 104-108,
Open Institute of Knowledge, Badajoz, Spain, 2006. ISBN 84-611-3104-5.
[VilGomAlo2006b.pdf]
Carlos Gómez-Rodríguez, Jesús Vilares
and Miguel A. Alonso, Automatic
Generation of Natural Language Parsers from Declarative Specifications,
in Loris Penserini, Pavlos Peppas and Anna Perini (eds.), STAIRS 2006 -
Proceedings of the Third Starting AI Researchers' Symposium, Riva del
Garda, Italy, August 28-29, 2006, volume 142 of Frontiers in Artificial
Intelligence and Applications, pp. 259-260, IOS Press,
Amsterdam/Berlin/Oxford/Tokyo/Washington DC, 2006. ISSN 0922-6389 /
ISBN 1-58603-645-9.
[GomVilAlo2006a.pdf]
Carlos Gómez-Rodríguez, Miguel A. Alonso and
Manuel Vilares, On
Theoretical and Practical Complexity of TAG Parsers,
in Paola Monachesi, Gerald Penn, Giorgio Satta and Shuly Wintner
(eds.), FG 2006: The 11th conference on Formal Grammar. Malaga, Spain,
July 29-30, 2006, chapter 5, pp. 61-75, Center for the Study of
Language and Information, Stanford, 2006.
[GomAloVil2006b.pdf]
Carlos Gómez-Rodríguez, Miguel A. Alonso and
Manuel Vilares, Generating
XTAG Parsers from Algebraic Specifications,
in Proceedings of the 8th International Workshop on Tree Adjoining
Grammar and Related Formalisms. Sydney, July 2006, pp. 103-108,
Association for Computational Linguistics, East Stroudsburg, PA, 2006.
ISBN: 1-932432-85-X.
[GomAloVil2006a.pdf]
Carlos Gómez-Rodríguez, Miguel A. Alonso and
Manuel Vilares, Estudio
comparativo del rendimiento de analizadores sintácticos para
gramáticas de adjunción de árboles,
Procesamiento del Lenguaje Natural, 37:179-186, 2006. ISSN 1135-5948.
[GomAloVil2006c.pdf]
Jesús Vilares, Carlos Gómez-Rodríguez
and Miguel A. Alonso, Enfoques
sintáctico y pseudo-sintáctico para la
recuperación de información en español,
in Alejandro Sobrino and José Ángel Olivas
(eds.), Recuperación de información textual:
aspectos lógicos y ecológicos --- Text
Information Retrieval: Soft-Computing and Ecological Aspects, pp.
127-137, Servizo de Publicacións e Intercambio
Científico, Universidade de Santiago de Compostela, 2006.
ISBN 84-9750-525-5.
[VilGomAlo2006a.pdf]
-2005-
Carlos Gómez-Rodríguez, Jesús Vilares
and Miguel A. Alonso, Generación
automática de analizadores sintácticos a partir
de esquemas de análisis,
Procesamiento del Lenguaje Natural, 35:401-408, 2005. ISSN 1135-5948.
[GomVilAlo2005a.pdf]
Jesús Vilares, Carlos Gómez-Rodríguez
and Miguel A. Alonso, Managing
Syntactic Variation in Text Retrieval,
in Peter R. King (ed.), Proceedings of the 2005 ACM Symposium on
Document Engineering. November 2-4, 2005. Bristol, United Kingdom, pp.
162-164, ACM Press, New York, USA, 2005. ISBN 1-59593-240-2.
[VilGomAlo2005a.pdf]
Carlos Gómez-Rodríguez, Jesús Vilares
and Miguel A. Alonso, Compilación
eficiente de esquemas de análisis sintáctico,
in Francisco Javier López Fraguas (ed.), Actas de las V
Jornadas sobre Programación y Lenguajes (PROLE 2005).
Granada, 13 al 16 de Septiembre de 2005, pp. 175-184. Thomson
Paraninfo, Madrid, 2005. ISBN 84-9732-438-2.
[GomVilAlo2005b.pdf]
Last update in which I
remembered to revise this line: 2012-02-03