Fully Quantized Transformer for Machine Translation. Gabriele Prato, Ella Charlaix and Mehdi Rezagholizadeh.
Summarizing Chinese Medical Answer with Graph Convolution Networks and Question-focused Dual Attention. Ningyu Zhang, Shumin Deng, Juan Li, xi chen, Wei Zhang and Huajun Chen.
Stay Hungry, Stay Focused: Generating Informative and Specific Questions in Information-Seeking Conversations. Peng Qi, Yuhao Zhang and Christopher D. Manning.
GRACE: Gradient Harmonized and Cascaded Labeling for Aspect-based Sentiment Analysis. Huaishao Luo, Lei Ji, Tianrui Li, Daxin Jiang and Nan Duan.
Reducing Sentiment Bias in Language Models via Counterfactual Evaluation. Po-Sen Huang, Huan Zhang, Ray Jiang, Robert Stanforth, Johannes Welbl, Jack Rae, Vishal Maini, Dani Yogatama and Pushmeet Kohli.
Improving Text Understanding via Deep Syntax-Semantics Communication. Hao Fei, Yafeng Ren and Donghong Ji.
GRUEN for Evaluating Linguistic Quality of Generated Text. Wanzheng Zhu and Suma Bhat.
Difference-aware Knowledge Selection for Knowledge-grounded Conversation Generation. Chujie Zheng, Yunbo Cao, Daxin Jiang and Minlie Huang.
An Attentive Recurrent Model for Incremental Prediction of Sentence-final Verbs. Wenyan Li, Alvin Grissom II and Jordan Boyd-Graber.
Transformer-GCRF: Recovering Chinese Dropped Pronouns with General Conditional Random Fields. Jingxuan Yang, Kerui Xu, Jun Xu, Si Li, Sheng GAO, Jun Guo, Ji-Rong Wen and Nianwen Xue.
Converting the Point of View of Messages Spoken to Virtual Assistants. Gunhee Lee, Vera Zu, Sai Srujana Buddi, Dennis Liang, Purva Kulkarni and Jack FitzGerald.
Few-shot Natural Language Generation for Task-Oriented Dialog. Baolin Peng, Chenguang Zhu, Chunyuan Li, Xiujun Li, Jinchao Li, Michael Zeng and Jianfeng Gao.
Mimic and Conquer: Heterogeneous Tree Structure Distillation for Syntactic NLP. Hao Fei, Yafeng Ren and Donghong Ji.
A Hierarchical Network for Abstractive Meeting Summarization with Cross-Domain Pretraining. Chenguang Zhu, Ruochen Xu, Michael Zeng and Xuedong Huang.
Active Testing: An Unbiased Evaluation Method for Distantly Supervised Relation Extraction. Pengshuai Li, Xinsong Zhang, Weijia Jia and Wei Zhao.
Semantic Matching for Sequence-to-Sequence Learning. Ruiyi Zhang, Changyou Chen, Xinyuan Zhang, Ke Bai and Lawrence Carin.
How Decoding Strategies Affect the Verifiability of Generated Text. Luca Massarelli, Fabio Petroni, Aleksandra Piktus, Myle Ott, Tim Rocktäschel, Vassilis Plachouras, Fabrizio Silvestri and Sebastian Riedel.
Minimize Exposure Bias of Seq2Seq Models in Joint Entity and Relation Extraction. Ranran Haoran Zhang, Qianying Liu, Aysa Xuemo Fan, Heng Ji, Daojian Zeng, Fei Cheng, Daisuke Kawahara and Sadao Kurohashi.
Gradient-based Analysis of NLP Models is Manipulable. Junlin Wang, Jens Tuyls, Eric Wallace and Sameer Singh.
A Self-Refinement Strategy for Noise Reduction in Grammatical Error Correction. Masato Mita, Shun Kiyono, Masahiro Kaneko, Jun Suzuki and Kentaro Inui.
Understanding tables with intermediate pre-training. Julian Eisenschlos, Syrine Krichene and Thomas Müller.
Multilingual Argument Mining: Datasets and Analysis. Orith Toledo-Ronen, Matan Orbach, Yonatan Bilu, Artem Spector and Noam Slonim.
Improving Grammatical Error Correction with Machine Translation Pairs. Wangchunshu Zhou, Tao Ge, Chang Mu, Ke Xu, Furu Wei and Ming Zhou.
Machines Getting with the Program: Understanding Intent Arguments of Non-Canonical Directives. Won Ik Cho, Youngki Moon, Sangwhan Moon, Seok Min Kim and Nam Soo Kim.
The RELX Dataset and Matching the Multilingual Blanks for Cross-Lingual Relation Classification. Abdullatif Köksal and Arzucan Özgür.
Control, Generate, Augment: A Scalable Framework for Multi-Attribute Text Generation. Giuseppe Russo, Nora Hollenstein, Claudiu Cristian Musat and Ce Zhang.
Open-Ended Visual Question Answering by Multi-Modal Domain Adaptation. Yiming Xu, Lin Chen, Zhongwei Cheng, Lixin Duan and Jiebo Luo.
Dual Low-Rank Multimodal Fusion. Tao Jin, Siyu Huang, Yingming Li and Zhongfei Zhang.
Contextual Modulation for Relation-Level Metaphor Identification. Omnia Zayed, John P. McCrae and Paul Buitelaar.
Dialogue Generation on Infrequent Sentence Functions via Structured Meta-Learning. Yifan Gao, Piji Li, Wei Bi, Xiaojiang Liu, Michael Lyu and Irwin King.
A Fully Hyperbolic Neural Model for Hierarchical Multi-Class Classification. Federico López and Michael Strube.
Claim Check-Worthiness Detection as Positive Unlabelled Learning. Dustin Wright and Isabelle Augenstein.
ConceptBert: Concept-Aware Representation for Visual Question
Answering. François Gardères, Maryam Ziaeefard, baptiste abeloos and Freddy Lecue.
Bootstrapping a Crosslingual Semantic Parser. Tom Sherborne, Yumo Xu and Mirella Lapata.
Revisiting Representation Degeneration Problem in Language Modeling. Zhong Zhang, Chongming Gao, Cong Xu, Rui Miao, Qinli Yang and Junming Shao.
The workweek is the best time to start a family – A Study of GPT-2 Based Claim Generation. Shai Gretz, Yonatan Bilu, Edo Cohen-Karlik and Noam Slonim.
Dynamic Data Selection for Curriculum Learning via Ability Estimation. John P. Lalor and hong yu.
Fixed Encoder Self-Attention Patterns in Transformer-Based Machine Translation. Alessandro Raganato, Yves Scherrer and Jörg Tiedemann.
ZEST: Zero-shot Learning from Text Descriptions using Textual Similarity and Visual Summarization. Tzuf Paz-Argaman, Reut Tsarfaty, Gal Chechik and Yuval Atzmon.
A structure-enhanced graph convolutional network for sentiment analysis. Fanyu Meng, Junlan Feng, danping yin, Si Chen and Min Hu.
PBoS: Probabilistic Bag-of-Subwords for Generalizing Word Embedding. Zhao Jinman, Shawn Zhong, Xiaomin Zhang and Yingyu Liang.
Interpretable Entity Representations through Large-Scale Typing. Yasumasa Onoe and Greg Durrett.
Empirical Studies of Institutional Federated Learning For Natural Language Processing. Xinghua Zhu, Jianzong Wang, Zhenhou Hong and Jing Xiao.
NeuReduce: Reducing Mixed Boolean-Arithmetic Expressions by Recurrent Neural Network. Weijie Feng, Binbin Liu, Dongpeng Xu, Qilong Zheng and Yun Xu.
From Language to Language-ish: How Brain-Like is an LSTM's Representation of Nonsensical Language Stimuli?. Maryam Hashemzadeh, Greta Kaufeld, Martha White, Andrea E. Martin and Alona Fyshe.
Revisiting Pre-Trained Models for Chinese Natural Language Processing. Yiming Cui, Wanxiang Che, Ting Liu, Bing Qin, Shijin Wang and Guoping Hu.
Cascaded Semantic and Positional Self-Attention Network for Document Classification. Juyong Jiang, Jie Zhang and Kai Zhang.
Toward Recognizing More Entity Types in NER: An Efficient Implementation using Only Entity Lexicons. Minlong Peng, Ruotian Ma, Qi Zhang, Lujun Zhao, Mengxi Wei, Changlong Sun and Xuanjing Huang.
From Disjoint Sets to Parallel Data to Train Seq2Seq Models for Sentiment Transfer. Paulo Cavalin, Marisa Vasconcelos, Marcelo Grave, Claudio Pinhanez and Victor Henrique Alves Ribeiro.
Document Ranking with a Pretrained Sequence-to-Sequence Model. Rodrigo Nogueira, Zhiying Jiang, Ronak Pradeep and Jimmy Lin.
Pruning Redundant Mappings in Transformer Models via Spectral-Normalized Identity Prior. Zi Lin, Jeremiah Liu, Zi Yang, Nan Hua and Dan Roth.
Rethinking Self-Attention: Towards Interpretability in Neural Parsing. Khalil Mrini, Franck Dernoncourt, Quan Hung Tran, Trung Bui, Walter Chang and Ndapa Nakashole.
A Linguistic Analysis of Visually Grounded Dialogues Based on Spatial Expressions. Takuma Udagawa, Takato Yamazaki and Akiko Aizawa.
Efficient Context and Schema Fusion Networks for Multi-Domain Dialogue State Tracking. Su Zhu, Jieyu Li, Lu Chen and Kai Yu.
Syntactic and Semantic-driven Learning for Open Information Extraction. Jialong Tang, Yaojie Lu, Hongyu Lin, Xianpei Han, Le Sun, Xinyan Xiao and Hua Wu.
Group-wise Contrastive Learning for Neural Dialogue Generation. Hengyi Cai, Hongshen Chen, Yonghao Song, zhuoye Ding, Yongjun Bao, Weipeng Yan and Xiaofang Zhao.
E-BERT: Efficient-Yet-Effective Entity Embeddings for BERT. Nina Poerner, Ulli Waltinger and Hinrich Schütze.
A Multi-task Learning Framework for Opinion Triplet Extraction. Chen Zhang, Qiuchi Li, Dawei Song and benyou wang.
Event Extraction as Multi-turn Question Answering. Fayuan Li, Weihua Peng, Yuguang Chen, Quan Wang, Lu Pan, Yajuan Lyu and Yong Zhu.
Improving QA Generalization by Concurrent Modeling of Multiple Biases. Mingzhu Wu, Nafise Sadat Moosavi, Andreas Rücklé and Iryna Gurevych.
Actor-Double-Critic: Incorporating Model-Based Critic for Task-Oriented Dialogue Systems. Yen-chen Wu, Bo-Hsiang Tseng and Milica Gasic.
AirConcierge: Generating Task-Oriented Dialogue via Efficient Large-Scale Knowledge Retrieval. Chieh-Yang Chen, Pei-Hsin Wang, Shih-Chieh Chang, Da-Cheng Juan, Wei Wei and Jia-Yu Pan.
DocStruct: A Multimodal Method to Extract Hierarchy Structure in Document for General Form Understanding. Zilong Wang, Mingjie Zhan, xuebo liu and ding liang.
A Study in Improving BLEU Reference Coverage with Diverse Automatic Paraphrasing. Rachel Bawden, Biao Zhang, Lisa Yankovskaya, Andre Tättar and Matt Post.
Cross-lingual Alignment Methods for Multilingual BERT: A Comparative Study. Saurabh Kulshreshtha, Jose Luis Redondo Garcia and Ching-Yun Chang.
SeNsER: Learning Cross-Building Sensor Metadata Tagger. Yang Jiao, Jiacheng Li, Jiaman Wu, Dezhi Hong, Rajesh Gupta and Jingbo Shang.
Persian \textitEzafe Recognition Using Transformers and Its Role in Part-Of-Speech Tagging. Ehsan Doostmohammadi, Minoo Nassajian and Adel Rahimi.
Scene Graph Modification Based on Natural Language Commands. Xuanli He, Quan Hung Tran, Gholamreza Haffari, Walter Chang, Zhe Lin, Trung Bui, Franck Dernoncourt and Nhan Dam.
LiMiT: The Literal Motion in Text Dataset. Irene Manotas, Ngoc Phuoc An Vo and Vadim Sheinin.
Generative Data Augmentation for Commonsense Reasoning. Yiben Yang, Chaitanya Malaviya, Jared Fernandez, Swabha Swayamdipta, Ronan Le Bras, Ji-Ping Wang, Chandra Bhagavatula, Yejin Choi and Doug Downey.
HybridQA: A Dataset of Multi-Hop Question Answering \\over Tabular and Textual Data. Wenhu Chen, Hanwen Zha, Zhiyu Chen, Wenhan Xiong, Hong Wang and William Yang Wang.
ESTeR: Combining Word Co-occurrences and Word Associations for Unsupervised Emotion Detection. Sujatha Das Gollapalli, Polina Rozenshtein and See-Kiong Ng.
GCDST: A Graph-based and Copy-augmented Multi-domain Dialogue State Tracking. Peng Wu, Bowei Zou, Ridong Jiang and AiTi Aw.
Why do you think that? Exploring Faithful Sentence-Level Rationales Without Supervision. Max Glockner, Ivan Habernal and Iryna Gurevych.
Semi-Supervised Learning for Video Captioning. Ke Lin, Zhuoxin Gan and Liwei WANG.
Multi$^2$OIE: Multilingual Open Information Extraction Based on Multi-Head Attention with BERT. Youngbin Ro, Yukyung Lee and Pilsung Kang.
Using the Past Knowledge to Improve Sentiment Classification. Qi Qin, Wenpeng Hu and Bing Liu.
High-order Semantic Role Labeling. Zuchao Li, Hai Zhao, Rui Wang and Kevin Parnow.
Undersensitivity in Neural Reading Comprehension. Johannes Welbl, Pasquale Minervini, Max Bartolo, Pontus Stenetorp and Sebastian Riedel.
AutoETER: Automated Entity Type Representation for Knowledge Graph Embedding. Guanglin Niu, Bo Li, Yongfei Zhang, Shiliang Pu and Jingyang Li.
Learning Robust and Multilingual Speech Representations. Kazuya Kawakami, Luyu Wang, Chris Dyer, Phil Blunsom and Aaron van den Oord.
FQuAD: French Question Answering Dataset. Martin D'Hoffschmidt, Wacim Belblidia, Quentin Heinrich, Tom Brendlé and Maxime Vidal.
Dynamic Semantic Matching and Aggregation Network for Few-shot Intent Detection. Hoang Nguyen, Chenwei Zhang, Congying Xia and Philip Yu.
Quantifying the Contextualization of Word Representations with Semantic Class Probing. Mengjie Zhao, Philipp Dufter, Yadollah Yaghoobzadeh and Hinrich Schütze.
FELIX: Flexible Text Editing Through Tagging and Insertion. Jonathan Mallinson, Aliaksei Severyn, Eric Malmi and Guillermo Garrido.
Unsupervised Relation Extraction from Language Models using Constrained Cloze Completion. Ankur Goswami, Akshata Bhat, Hadar Ohana and Theodoros Rekatsinas.
Language Generation via Combinatorial Constraint Satisfaction: A Tree Search Enhanced Monte-Carlo Approach. Maosen Zhang, Nan Jiang, Lei Li and Yexiang Xue.
Evaluating Models’ Local Decision Boundaries via Contrast Sets. Matt Gardner, Yoav Artzi, Victoria Basmov, Jonathan Berant, Ben Bogin, Sihao Chen, Pradeep Dasigi, Dheeru Dua, Yanai Elazar, Ananth Gottumukkala, Nitish Gupta, Hannaneh Hajishirzi, Gabriel Ilharco, Daniel Khashabi, Kevin Lin, Jiangming Liu, Nelson F. Liu, Phoebe Mulcaire, Qiang Ning, Sameer Singh, Noah A. Smith, Sanjay Subramanian, Reut Tsarfaty, Eric Wallace, Ally Zhang and Ben Zhou.
Optimizing Word Segmentation for Downstream Task. Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki and Naoaki Okazaki.
A Compare Aggregate Transformer for Understanding Document-grounded Dialogue. Longxuan Ma, Wei-Nan Zhang, runxin sun and Ting Liu.
TextHide: Tackling Data Privacy in Language Understanding Tasks. Yangsibo Huang, Zhao Song, Danqi Chen, Kai Li and Sanjeev Arora.
Modeling Intra and Inter-modality Incongruity for Multi-Modal Sarcasm Detection. Hongliang Pan, Zheng Lin, Peng Fu, Yatao Qi and Weiping Wang.
Improving Knowledge-Aware Dialogue Response Generation by Using Human-Written Prototype Dialogues. Sixing Wu, Ying Li, Dawei Zhang and Zhonghai Wu.
Filtering before Iteratively Referring for Knowledge-Grounded Response Selection in Retrieval-Based Chatbots. Jia-Chen Gu, Zhenhua Ling, Quan Liu, Zhigang Chen and Xiaodan Zhu.
Privacy-Preserving News Recommendation Model Learning. Tao Qi, Fangzhao Wu, Chuhan Wu, Yongfeng Huang and Xing Xie.
Balancing via Generation for Multi-Class Text Classification Improvement. Naama Tepper, Esther Goldbraich, Naama Zwerdling, George Kour, Ateret Anaby Tavor and Boaz Carmeli.
Conditional Neural Generation using Sub-Aspect Functions for Extractive News Summarization. Zhengyuan Liu, Ke Shi and Nancy Chen.
Research Replication Prediction Using Weakly Supervised Learning. Tianyi Luo, xingyu li, Hainan Wang and Yang Liu.
Semantically Driven Sentence Fusion: Modeling and Evaluation. Eyal Ben-David, Orgad Keller, Eric Malmi, Idan Szpektor and Roi Reichart.
Predicting Responses to Psychological Questionnaires from Participants’ Social Media Posts and Question Text Embeddings. Huy Vu, Suhaib Abdurahman, Sudeep Bhatia and Lyle Ungar.
Will it Unblend?. Yuval Pinter, Cassandra L. Jacobs and Jacob Eisenstein.
CodeBERT: A Pre-Trained Model for Programming and Natural Languages. Zhangyin Feng, Daya Guo, Duyu Tang, Nan Duan, Xiaocheng Feng, Ming Gong, Linjun Shou, Bing Qin, Ting Liu, Daxin Jiang and Ming Zhou.
StyleDGPT: Stylized Response Generation with Pre-trained Language Models. Ze Yang, wei wu, Can Xu, Xinnian Liang, jiaqi bai, Liran Wang, Wei Wang and Zhoujun Li.
Enhancing Automated Essay Scoring Performance via Fine-tuning Pre-trained Language Models with Combination of Regression and Ranking. Ruosong Yang, Jiannong Cao, Zhiyuan Wen, Youzheng Wu and Xiaodong He.
Neural Dialogue State Tracking with Temporally Expressive Networks. Junfan Chen, Richong Zhang, Yongyi Mao and Jie Xu.
Inferring about fraudulent collusion risk on Brazilian public works
contracts in official texts using a Bi-LSTM approach. MARCOS LIMA, Roberta Silva, Felipe Lopes de Souza Mendes, Leonardo R. de Carvalho, Aleteia Araujo and Flavio de Barros Vidal.
Data-to-Text Generation with Style Imitation. Shuai Lin, Wentao Wang, Zichao Yang, Xiaodan Liang, Frank F. Xu, Eric Xing and Zhiting Hu.
Teaching Machine Comprehension with Compositional Explanations. Qinyuan Ye, Xiao Huang, Elizabeth Boschee and Xiang Ren.
A Knowledge-Driven Approach to Classifying Object and Attribute Coreferences in Opinion Mining. Jiahua Chen, Shuai Wang, Sahisnu Mazumder and Bing Liu.
SimAlign: High Quality Word Alignments Without Parallel Training Data Using Static and Contextualized Embeddings. Masoud Jalili Sabet, Philipp Dufter, François Yvon and Hinrich Schütze.
Octa: Omissions and Conflicts in Target-Aspect Sentiment Analysis. Zhe Zhang, Chung-Wei Hang and Munindar Singh.
On the Language Neutrality of Pre-trained Multilingual Representations. Jindřich Libovický, Rudolf Rosa and Alexander Fraser.
Improving Constituency Parsing with Span Attention. Yuanhe Tian, Yan Song, Fei Xia and Tong Zhang.
RecoBERT: A Catalog Language Model for Text-Based Recommendations. Itzik Malkiel, Oren Barkan, Avi Caciularu, Noam Razin, Ori Katz and Noam Koenigstein.
Multi-Agent Mutual Learning at Sentence-Level and Token-Level for Neural Machine Translation. Baohao Liao, Yingbo Gao and Hermann Ney.
RMM: A Recursive Mental Model for Dialogue Navigation. Homero Roman Roman, Yonatan Bisk, Jesse Thomason, Asli Celikyilmaz and Jianfeng Gao.
Will This Idea Spread Beyond Academia? Understanding Knowledge Transfer of Scientific Concepts across Text Corpora. Hancheng Cao, Mengjie Cheng, Zhepeng Cen, Daniel McFarland and Xiang Ren.
Recurrent Inference in Text Editing. Ning Shi, Ziheng Zeng, Haotian Zhang and Yichen Gong.
An Empirical Exploration of Local Ordering Pre-training for Structured Prediction. Zhisong Zhang, Xiang Kong, Lori Levin and Eduard Hovy.
Unsupervised Extractive Summarization by Pre-training Hierarchical Transformers. Shusheng Xu, Xingxing Zhang, Yi Wu, Furu Wei and Ming Zhou.
Active Learning Approaches to Enhancing Neural Machine Translation. Yuekai Zhao, Haoran Zhang, Shuchang Zhou and Zhihua Zhang.
AGIF: An Adaptive Graph-Interactive Framework for Joint Multiple Intent Detection and Slot Filling. Libo Qin, Xiao Xu, Wanxiang Che and Ting Liu.
CommonGen: A Constrained Text Generation Challenge for Generative Commonsense Reasoning. Bill Yuchen Lin, Wangchunshu Zhou, Ming Shen, Pei Zhou, Chandra Bhagavatula, Yejin Choi and Xiang Ren.
On the Potential of Lexico-logical Alignments for Semantic Parsing to SQL Queries. Tianze Shi, Chen Zhao, Jordan Boyd-Graber, Hal Daumé III and Lillian Lee.
TED: A Pretrained Unsupervised Summarization Model with Theme Modeling and Denoising. Ziyi Yang, Chenguang Zhu, Robert Gmyr, Michael Zeng, Xuedong Huang and Eric Darve.
Improving End-to-End Bangla Speech Recognition with Semi-supervised Training. Nafis Sadeq, Nafis Tahmid Chowdhury, Farhan Tanvir Utshaw, Shafayat Ahmed and Muhammad Abdullah Adnan.
No Gestures Left Behind: Learning Relationships between Spoken Language and Freeform Gestures. Chaitanya Ahuja, Dong Won Lee, Ryo Ishii and Louis-Philippe Morency.
UNIFIEDQA: Crossing Format Boundaries with a Single QA System. Daniel Khashabi, Sewon Min, Tushar Khot, Ashish Sabharwal, Oyvind Tafjord, Peter Clark and Hannaneh Hajishirzi.
Robust and Interpretable Grounding of Spatial References with Relation Networks. Tsung-Yen Yang, Andrew Lan and Karthik Narasimhan.
Pragmatic Issue-Sensitive Image Captioning. Allen Nie, Reuben Cohn-Gordon and Christopher Potts.
SynET: Synonym Expansion using Transitivity. Jiale Yu, Yongliang Shen, Xinyin Ma, Chenghao Jia, Chen Chen and Weiming Lu.
Scheduled DropHead: A Regularization Method for Transformer Models. Wangchunshu Zhou, Tao Ge, Furu Wei, Ming Zhou and Ke Xu.
Multi-Turn Dialogue Generation in E-Commerce Platform with the Context of Historical Dialogue. WeiSheng Zhang, Kaisong Song, Yangyang Kang, Zhongqing Wang, Changlong Sun, Xiaozhong Liu, Shoushan Li, Min Zhang and Luo Si.
Ruler: Data Programming by Demonstration for Document Labeling. Sara Evensen, Chang Ge and Cagatay Demiralp.
Dual Reconstruction: a Unifying Objective for Semi-Supervised Neural Machine Translation. Weijia Xu, Xing Niu and Marine Carpuat.
Focus-Constrained Attention Mechanism for CVAE-based Response Generation. Zhi Cui, Yanran Li, Jiayi ZHANG, Jianwei Cui, Chen Wei and Bin Wang.
Chunk-based Chinese Spelling Check with Global Optimization. Zuyi Bao, Chen Li and Rui Wang.
Multi-pretraining for Large-scale Text Classification. Kang-Min Kim, Bumsu Hyeon, Yeachan Kim, Jun-Hyung Park and SangKeun Lee.
End-to-End Speech Recognition and Disfluency Removal. Paria Jamshid Lou and Mark Johnson.
Characterizing the Value of Information in Medical Notes. Chao-Chun Hsu, Shantanu Karnwal, Sendhil Mullainathan, Ziad Obermeyer and Chenhao Tan.
KLearn: Background Knowledge Inference from Summarization Data. Maxime Peyrard and Robert West.
Extracting Chemical-Protein Interactions via Calibrated Deep Neural Network and Self-training. Dongha Choi and Hyunju Lee.
Logic2Text: High-Fidelity Natural Language Generation from Logical Forms. Zhiyu Chen, Wenhu Chen, Hanwen Zha, Xiyou Zhou, Yunkai Zhang, Sairam Sundaresan and William Yang Wang.
Diversify Question Generation with Continuous Content Selectors and Question Type Modeling. Zhen Wang, Siwei Rao, Jie Zhang, Zhen Qin, Guangjian Tian and Jun Wang.
Participatory Research for Low-resourced Machine Translation: A Case Study in African Languages. Wilhelmina Nekoto, Vukosi Marivate, Tshinondiwa Matsila, Timi Fasubaa, Taiwo Fagbohungbe, Solomon Oluwole Akinola, Shamsuddeen Muhammad, Salomon KABONGO KABENAMUALU, Salomey Osei, Freshia Sackey, Rubungo Andre Niyongabo, Ricky Macharm, Perez Ogayo, Orevaoghene Ahia, Musie Meressa Berhe, Mofetoluwa Adeyemi, Masabata Mokgesi-Selinga, Lawrence Okegbemi, Laura Martinus, Kolawole Tajudeen, Kevin Degila, Kelechi Ogueji, Kathleen Siminyu, Julia Kreutzer, Jason Webster, Jamiil Toure Ali, Jade Abbott, Iroro Orife, Ignatius Ezeani, Idris Abdulkadir Dangana, Herman Kamper, Hady Elsahar, Goodness Duru, ghollah kioko, Murhabazi Espoir, Elan van Biljon, Daniel Whitenack, Christopher Onyefuluchi, Chris Chinenye Emezue, Bonaventure F. P. Dossou, Blessing Sibanda, Blessing Bassey, Ayodele Olabiyi, Arshath Ramkilowan, Alp Öktem, Adewale Akinfaderin and Abdallah Bashir.
ConveRT: Efficient and Accurate Conversational Representations from Transformers. Matthew Henderson, Iñigo Casanueva, Nikola Mrkšić, Pei-Hao Su, Tsung-Hsien Wen and Ivan Vulić.
Computer Assisted Translation with Neural Quality Estimation and Automatic Post-Editing. Ke Wang, Jiayi Wang, Niyu Ge, Yangbin Shi, Yu Zhao and Kai Fan.
Zero-Shot Rationalization by Multi-Task Transfer Learning from Question Answering. Po-Nien Kung, Tse-Hsuan Yang, Yi-Cheng Chen, Sheng-Siang Yin and Yun-Nung Chen.
The Role of Reentrancies in Abstract Meaning Representation Parsing. Marco Damonte, Ida Szubert, Shay B. Cohen and Mark Steedman.
Cross-Lingual Suicidal-Oriented Word Embedding toward Suicide Prevention. Daeun Lee, Soyoung Park, Jiwon Kang, Daejin Choi and Jinyoung Han.
Reinforcement Learning with Imbalanced Dataset for Data-to-Text Medical Report Generation. Toru Nishino, Ryota Ozaki, Yohei Momoki, Tomoki Taniguchi, Ryuji Kano, Norihisa Nakano, Yuki Tagawa, Motoki Taniguchi, Tomoko Ohkuma and keigo nakamura.
Reducing Quantity Hallucinations in Abstractive Summarization. Zheng Zhao, Shay B. Cohen and Bonnie Webber.
Rethinking Topic Modelling: From Document-Space to Term-Space. Magnus Sahlgren.
A Semi-supervised Approach to Generate the Code-Mixed Text using Pre-trained Encoder and Transfer Learning. Deepak Gupta, Asif Ekbal and Pushpak Bhattacharyya.
BERT-MK: Integrating Graph Contextualized Knowledge into Pre-trained Language Models. Bin He, Di Zhou, Jinghui Xiao, Xin Jiang, Qun Liu, Nicholas Jing Yuan and Tong Xu.
Recursive Top-Down Production for Sentence Generation with Latent Trees. Shawn Tan, Yikang Shen, Alessandro Sordoni, Aaron Courville and Timothy J. O'Donnell.
Guided Dialogue Policy Learning without Adversarial Learning in the Loop. Ziming Li, Sungjin Lee, Baolin Peng, Jinchao Li, Julia Kiseleva, Maarten de Rijke, Shahin Shayandeh and Jianfeng Gao.
MultiDM-GCN: Aspect-guided Response Generation in Multi-domain Multi-modal Dialogue System using Graph Convolutional Network. Mauajama Firdaus, NIDHI THAKUR and Asif Ekbal.
Edge-Enhanced Graph Convolution Networks for Event Detection with Syntactic Relation. Shiyao Cui, Bowen Yu, Tingwen Liu, Zhenyu Zhang, Xuebin Wang and Jinqiao Shi.
Semi-supervised Formality Style Transfer using Language Model Discriminator and Mutual Information Maximization. Kunal Chawla and Diyi Yang.
Differentially Private Representation for NLP: Formal Guarantee and An Empirical Study on Privacy and Fairness. Lingjuan Lyu, Xuanli He and Yitong Li.
Learning Knowledge Bases with Parameters for Task-Oriented Dialogue Systems. Andrea Madotto, Samuel Cahyawijaya, Genta Indra Winata, Yan Xu, Zihan Liu, Zhaojiang Lin and Pascale Fung.
ProphetNet: Predicting Future N-gram for Sequence-to-SequencePre-training. Weizhen Qi, Yu Yan, Yeyun Gong, Dayiheng Liu, Nan Duan, Jiusheng Chen, Ruofei Zhang and Ming Zhou.
DivGAN: Towards Diverse Paraphrase Generation via Diversified Generative Adversarial Network. Yue Cao and Xiaojun Wan.
Plug-and-Play Conversational Models. Andrea Madotto, Etsuko Ishii, Zhaojiang Lin, Sumanth Dathathri and Pascale Fung.
Event-Driven Learning of Systematic Behaviours in Stock Markets. Xianchao Wu.
COSMIC: COmmonSense knowledge for eMotion Identification in Conversations. Deepanway Ghosal, Navonil Majumder, Alexander Gelbukh, Rada Mihalcea and Soujanya Poria.
Improving Compositional Generalization in Semantic Parsing. Inbar Oren, Jonathan Herzig, Nitish Gupta, Matt Gardner and Jonathan Berant.
On the Interplay Between Fine-tuning and Sentence-level Probing for Linguistic Knowledge in Pre-trained Transformers. Marius Mosbach, Anna Khokhlova, Michael A. Hedderich and Dietrich Klakow.
How Does Context Matter? On the Robustness of Event Detection with Context-Selective Mask Generalization. Jian Liu, Yubo Chen, Kang Liu, Yantao Jia and Zhicheng Sheng.
Adaptive Feature Selection for End-to-End Speech Translation. Biao Zhang, Ivan Titov, Barry Haddow and Rico Sennrich.
Abstractive Multi-Document Summarization via Joint Learning with Single-Document Summarization. Hanqi Jin and Xiaojun Wan.
Blockwise Self-Attention for Long Document Understanding. Jiezhong Qiu, Hao Ma, Omer Levy, Wen-tau Yih, Sinong Wang and Jie Tang.
Unsupervised Few-Bits Semantic Hashing with Implicit Topics Modeling. Fanghua Ye, Jarana Manotumruksa and Emine Yilmaz.
Grid Tagging Scheme for Aspect-oriented Fine-grained Opinion Extraction. Zhen Wu, Chengcan Ying, Fei Zhao, Zhifang Fan, Xinyu Dai and Rui Xia.
Fast End-to-end Coreference Resolution for Korean. Cheoneum Park, Jamin Shin, Sungjoon Park, Joonho Lim and Changki Lee.
Toward Stance-based Personas for Opinionated Dialogues. Thomas Scialom, Serra Sinem Tekiroğlu, Jacopo Staiano and Marco Guerini.
Hierarchical Pre-training for Sequence Labelling in Spoken Dialog. Emile Chapuis, Pierre Colombo, Matteo Manica, Matthieu Labeau and Chloé Clavel.
Out-of-Sample Representation Learning for Knowledge Graphs. Marjan Albooyeh, Rishab Goel and Seyed Mehran Kazemi.
Fine-Grained Grounding for Multimodal Speech Recognition. Tejas Srinivasan, Ramon Sanabria, Florian Metze and Desmond Elliott.
Unsupervised Expressive Rules Provide Explainability and Assist Human Experts Grasping New Domains. Eyal Shnarch, Leshem Choshen, Guy Moshkowich, Ranit Aharonov and Noam Slonim.
Textual Supervision for Visually Grounded Spoken Language Understanding. Bertrand Higy, Desmond Elliott and Grzegorz Chrupała.
Universal Dependencies According to BERT: Both More Specific and More General. Tomasz Limisiewicz, David Mareček and Rudolf Rosa.
Visual Objects As Context: Exploiting Visual Objects for Lexical Entailment. Masayasu Muraoka, Tetsuya Nasukawa and Bishwaranjan Bhattacharjee.
Learning to Plan and Realize Separately for Open-Ended Dialogue Systems. Sashank Santhanam, Zhuo Cheng, Brodie Mather, Bonnie Dorr, Archna Bhatia, Bryanna Hebenstreit, Alan Zemel, Adam Dalton, Tomek Strzalkowski and Samira Shaikh.
Be Different to Be Better! A Benchmark to Leverage the Complementarity of Language and Vision. Sandro Pezzelle, Claudio Greco, Greta Gandolfi, Eleonora Gualdoni and Raffaella Bernardi.
Improving Word Embedding Factorization for Compression Using Distilled Nonlinear Neural Decomposition. Vasileios Lioutas, Ahmad Rashid, Krtin Kumar, Md. Akmal Haidar and Mehdi Rezagholizadeh.
PharmMT: A Neural Machine Translation Approach to Simplify Prescription Directions. Jiazhao Li, Corey Lester, Xinyan Zhao, Yuting Ding, Yun Jiang and V.G.Vinod Vydiswaran.
LSTMs Compose—and Learn—Bottom-Up. Naomi Saphra and Adam Lopez.
Natural Language Rationales with Full-Stack Visual Reasoning: From Pixels to Semantic Frames to Commonsense Graphs. Ana Marasović, Chandra Bhagavatula, Jae sung Park, Ronan Le Bras, Noah A. Smith and Yejin Choi.
Corpora Evaluation and System Bias Detection in Multi-document Summarization. Alvin Dey, Tanya Chowdhury, Yash Kumar and Tanmoy Chakraborty.
Graph-to-Tree Neural Networks for Learning Structured Input-Output Translation with Applications to Semantic Parsing and Math Word Problem. Shucheng Li, Lingfei Wu, Shiwei Feng, Fangli Xu, Fengyuan Xu and Sheng Zhong.
Target Conditioning for One-to-Many Generation. marie-anne lachaux, Armand Joulin and Guillaume Lample.
Inferring symmetry in natural language. Chelsea Tanchip, Lei Yu, Aotao Xu and Yang Xu.
A Concise Model for Multi-Criteria Chinese Word Segmentation with Transformer Encoder. Xipeng Qiu, Hengzhi Pei, Hang Yan and Xuanjing Huang.
Enhancing Content Planning for Table-to-Text Generation with Data Understanding and Verification. Heng Gong, Wei Bi, Xiaocheng Feng, Bing Qin, Xiaojiang Liu and Ting Liu.
Contextual Text Style Transfer. Yu Cheng, Zhe Gan, Yizhe Zhang, Oussama Elachqar, Dianqi Li and Jingjing Liu.
DiPair: Fast and Accurate Distillation for Trillion-Scale Text Matching and Pair Modeling. Jiecao Chen, Liu Yang, Karthik Raman, Michael Bendersky, Jung-Jung Yeh, Yun Zhou, Marc Najork, Danyang Cai and Ehsan Emadzadeh.
Cross-Lingual Dependency Parsing by POS-Guided Word Reordering. Lu Liu, Yi Zhou, Jianhan Xu, Xiaoqing Zheng, Kai-Wei Chang and Xuanjing Huang.
Assessing Robustness of Text Classification through Maximal Safe Radius Computation. Emanuele La Malfa, Min Wu, Luca Laurenti, Benjie Wang, Anthony Hartshorn and Marta Kwiatkowska.
Social Commonsense Reasoning with Multi-Head Knowledge Attention. Debjit Paul and Anette Frank.
TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialog. Erik Ekstedt and Gabriel Skantze.
A little goes a long way: Improving toxic language classification despite data scarcity. Mika Juuti, Tommi Gröndahl, Adrian Flanagan and N. Asokan.
General Purpose Text Embeddings from Pre-trained Language Models for Scalable Inference. Jingfei Du, Myle Ott, Haoran Li, Xing Zhou and Veselin Stoyanov.
Learning to Model and Ignore Dataset Bias with Mixed Capacity Ensembles. Christopher Clark, Mark Yatskar and Luke Zettlemoyer.
Learning to Generalize for Sequential Decision Making. Xusen Yin, Ralph Weischedel and Jonathan May.
Effective Crowd-Annotation of Participants, Interventions, and Outcomes in the Text of Clinical Trial Reports. Markus Zlabinger, Marta Sabou, Sebastian Hofstätter and Allan Hanbury.
Adversarial Grammatical Error Correction. Vipul Raheja and Dimitris Alikaniotis.
CLAR: A Cross-Lingual Argument Regularizer for Semantic Role Labeling. Ishan Jindal, Yunyao Li, Siddhartha Brahma and Huaiyu Zhu.
Neutralizing Gender Bias in Word Embeddings with Latent Disentanglement and Counterfactual Generation. Seungjae Shin, Kyungwoo Song, JoonHo Jang, Hyemi Kim, Weonyoung Joo and Il-Chul Moon.
Towards Domain-Independent Text Structuring Trainable on Large Discourse Treebanks. Grigorii Guz and Giuseppe Carenini.
A Multilingual View of Unsupervised Machine Translation. Xavier Garcia, Pierre Foret, Thibault Sellam and Ankur Parikh.
An Evaluation Method for Diachronic Word Sense Induction. Ashjan Alsulaimani, Erwan Moreau and Carl Vogel.
Efficient Transformer-based Large Scale Language Representations using Hardware-friendly Block Structured Pruning. Bingbing Li, Zhenglun Kong, Tianyun Zhang, Ji Li, Zhengang Li, Hang Liu and Caiwen Ding.
Towards Zero-Shot Conditional Summarization with Adaptive Multi-Task Fine-Tuning. Travis Goodwin, Max Savery and Dina Demner-Fushman.
Multilingual Knowledge Graph Completion via Ensemble Knowledge Transfer. Xuelu Chen, Muhao Chen, Changjun Fan, Ankith Uppunda, Yizhou Sun and Carlo Zaniolo.
Towards Controllable Biases in Language Generation. Emily Sheng, Kai-Wei Chang, Prem Natarajan and Nanyun Peng.
RobBERT: a Dutch RoBERTa-based Language Model. Pieter Delobelle, Thomas Winters and Bettina Berendt.
Regularization of Distinct Strategies for Unsupervised Question Generation. Junmo Kang, Giwon Hong, Haritz Puerto San Roman and Sung-Hyon Myaeng.
Graph-to-Graph Transformer for Transition-based Dependency Parsing. Alireza Mohammadshahi and James Henderson.
DeSMOG: Detecting Stance in Media On Global Warming. Yiwei Luo, Dallas Card and Dan Jurafsky.
Improve Transformer Models with Better Relative Position Embeddings. zhiheng huang, Davis Liang, Peng Xu and Bing Xiang.
A Sentiment-Controllable Topic-to-Essay Generator with Topic Knowledge Graph. Lin Qiao, Jianhao Yan, Fandong Meng, Zhendong Yang and Jie Zhou.
What-if I ask you to explain: Explaining the effects of perturbations in procedural text. Dheeraj Rajagopal, Niket Tandon, Peter Clark, Bhavana Dalvi and Eduard Hovy.
RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language Models. Samuel Gehman, Suchin Gururangan, Maarten Sap, Yejin Choi and Noah A. Smith.
Composed Variational Natural Language Generation for Few-shot Intents. Congying Xia, Caiming Xiong, Philip Yu and Richard Socher.
Literature Retrieval for Precision Medicine with Neural Matching and Faceted Summarization. Jiho Noh and Ramakanth Kavuluru.
On the Importance of Adaptive Data Collection for Extremely Imbalanced Pairwise Tasks. Stephen Mussmann, Robin Jia and Percy Liang.
A Dual-Attention Network for Joint Named Entity Recognition and Sentence Classification of Adverse Drug Events. Susmitha Wunnava, Xiao Qin, Tabassum Kakar, Xiangnan Kong and Elke Rundensteiner.
BERT-kNN:
Adding a kNN Search Component to Pretrained Language Models
for Better QA. Nora Kassner and Hinrich Schütze.
Identifying Spurious Correlations for Robust Text Classification. Zhao Wang and Aron Culotta.
HoVer: A Dataset for Many-Hop Fact Extraction And Claim Verification. Yichen Jiang, Shikha Bordia, ZHENG ZHONG, Charles Dognin, Maneesh Singh and Mohit Bansal.
Continual Learning for Natural Language Generation in Task-oriented Dialog Systems. Fei Mi, Liangwei Chen, Mengjie Zhao, Minlie Huang and Boi Faltings.
UNQOVERing Stereotyping Biases via Underspecified Questions. Tao Li, Daniel Khashabi, Tushar Khot, Ashish Sabharwal and Vivek Srikumar.
A Semantics-based Approach to Disclosure Classification in User-Generated Online Content. Chandan Akiti, Anna squicciarini and Sarah Rajtmajer.
Mining Knowledge for Natural Language Inference from Wikipedia Categories. Mingda Chen, Zewei Chu, Karl Stratos and Kevin Gimpel.
OCNLI: Original Chinese Natural Language Inference. Hai Hu, Kyle Richardson, Liang Xu, Lu Li, Sandra Kübler and Lawrence Moss.
Margin-aware Unsupervised Domain Adaptation for Cross-lingual Text Labeling. Dejiao Zhang, Ramesh Nallapati, Henghui Zhu, Feng Nan, Cicero Nogueira dos Santos, Kathleen McKeown and Bing Xiang.
Rethinking Supervised Learning and Reinforcement Learning in Task-Oriented Dialogue Systems. Ziming Li, Julia Kiseleva and Maarten de Rijke.
Accurate polyglot semantic parsing with DAG grammars. Federico Fancellu, Ákos Kádár, Ran Zhang and Afsaneh Fazly.
Approximation of Response Knowledge Retrieval in Knowledge-grounded Dialogue Generation. Wen Zheng, Natasa Milic-Frayling and Ke Zhou.
Evaluating Factuality in Generation with Dependency-level Entailment. Tanya Goyal and Greg Durrett.
Cross-Lingual Text Classification with Minimal Resources by Transferring a Sparse Teacher. Giannis Karamanolakis, Daniel Hsu and Luis Gravano.
A Multi-Persona Chatbot for Hotline Counselor Training. Orianna Demasi, Yu Li and Zhou Yu.
Narrative Text Generation with a Latent Discrete Plan. Harsh Jhamtani and Taylor Berg-Kirkpatrick.
Graph Transformer Networks with Syntactic and Semantic Structures for Event Argument Extraction. Amir Pouran Ben Veyseh, Tuan Ngo Nguyen and Thien Huu Nguyen.
The Box is in the Pen: Evaluating Commonsense Reasoning in Neural Machine Translation. Jie He, Tao Wang, Deyi Xiong and Qun Liu.
CDEvalSumm: An Empirical Study of Cross-Dataset Evaluation for Neural Summarization Systems. Yiran Chen, Pengfei Liu, Ming Zhong, Zi-Yi Dou, Danqing Wang, Xipeng Qiu and Xuanjing Huang.
Attending to Long-Distance Document Context for Sequence Labeling. Matthew Jörke, Jon Gillick, Matthew Sims and David Bamman.
Global Bootstrapping Neural Network for Entity Set Expansion. Lingyong Yan, Xianpei Han, Ben He and Le Sun.
Adversarial Augmentation Policy Search for Domain and Cross-Lingual Generalization in Reading Comprehension. Adyasha Maharana and Mohit Bansal.
Denoising Multi-Source Weak Supervision for Neural Text Classification. Wendi Ren, Yinghao Li, Hanting Su, David Kartchner, Cassie Mitchell and Chao Zhang.
Dr. Summarize: Global Summarization of Medical Dialogue by Exploiting Local Structures.. Anirudh Joshi, Namit Katariya, Xavier Amatriain and Anitha Kannan.
Generating Accurate Electronic Health Assessment from Medical Graph. Zhichao Yang and hong yu.
Do Models of Mental Health Based on Social Media Data Generalize?. Keith Harrigian, Carlos Aguirre and Mark Dredze.
Context Analysis for Pre-trained Masked Language Models. Yi-An Lai, Garima Lalwani and Yi Zhang.
Controllable Text Generation with Focused Variation. Lei Shu, Alexandros Papangelis, Yi-Chia Wang, Gokhan Tur, Hu Xu, Zhaleh Feizollahi, Bing Liu and Piero Molino.
Modeling Preconditions in Text with a Crowd-sourced Dataset. Heeyoung Kwon, Mahnaz Koupaee, Pratyush Singh, Gargi Sawhney, Anmol Shukla, Keerthi Kumar Kallur, Nathanael Chambers and Niranjan Balasubramanian.
Reevaluating Adversarial Examples in Natural Language. John Morris, Eli Lifland, Jack Lanchantin, Yangfeng Ji and Yanjun Qi.
Question Answering with Long Multiple-Span Answers. Ming Zhu, Aman Ahuja, Da-Cheng Juan, Wei Wei and Chandan K Reddy.
Event-Related Bias Removal for Real-time Disaster Events. SALVADOR MEDINA MAZA, Evangelia Spiliopoulou, Eduard Hovy and Alexander Hauptmann.
It's not a Non-Issue: Negation as a Source of Error in Machine Translation. Md Mosharaf Hossain, Antonios Anastasopoulos, Eduardo Blanco and Alexis Palmer.
Incremental Text-to-Speech Synthesis with Prefix-to-Prefix Framework. Mingbo Ma, Baigong Zheng, Kaibo Liu, Renjie Zheng, Hairong Liu, Kainan Peng, Kenneth Church and Liang Huang.
Joint Turn and Dialogue level User Satisfaction Estimation on Multi-Domain Conversations. Praveen Kumar Bodigutla, Aditya Tiwari, Spyros Matsoukas, Josep Valls-Vargas and Lazaros Polymenakos.
ArraMon: A Joint Navigation-Assembly Instruction Interpretation Task in Dynamic Environments. Hyounghun Kim, Abhaysinh Zala, Graham Burri, Hao Tan and Mohit Bansal.
Fluent and Low-latency Simultaneous Speech-to-Speech Translation with Self-adaptive Training. Renjie Zheng, Mingbo Ma, Baigong Zheng, Kaibo Liu, Jiahong Yuan, Kenneth Church and Liang Huang.
Towards Context-Aware Code Comment Generation. Xiaohan Yu, Quzhe Huang, Zheng Wang, Yansong Feng and Dongyan Zhao.
Finding the Optimal Vocabulary Size for Neural Machine Translation. Thamme Gowda and Jonathan May.
Making Information Seeking Easier: An Improved Pipeline for Conversational Search. Vaibhav Kumar and Jamie Callan.
Generalizable and Explainable Dialogue Generation via Explicit Action Learning. Xinting Huang, Jianzhong Qi, Yu Sun and Rui Zhang.
Determining Event Outcomes: The Case of #fail. Srikala Murugan, Dhivya Chinnappa and Eduardo Blanco.
WikiLingua: A New Benchmark Dataset for Cross-Lingual Abstractive Summarization. Faisal Ladhak, Esin Durmus, Claire Cardie and Kathleen McKeown.
Adversarial Training for Code Retrieval with Question-Description Relevance Regularization. Jie Zhao and Huan Sun.
Large Product Key Memory for Pretrained Language Models. Gyuwan Kim and Tae Hwan Jung.
STANDER: An Expert-Annotated Dataset for News Stance Detection and Evidence Retrieval. Costanza Conforti, Jakob Berndt, Mohammad Taher Pilehvar, Chryssi Giannitsarou, Flavio Toxvaerd and Nigel Collier.
SupMMD: A Sentence Importance Model for Extractive Summarization using Maximum Mean Discrepancy. Umanga Bista, Alexander Mathews, Aditya Menon and Lexing Xie.
Connecting the Dots: A Knowledgeable Path Generator for Commonsense Question Answering. Peifeng Wang, Nanyun Peng, Filip Ilievski, Pedro Szekely and Xiang Ren.
No Answer is Better Than Wrong Answer: A Reflection Model for Document Level Machine Reading Comprehension. Xuguang Wang, Linjun Shou, Ming Gong, Nan Duan and Daxin Jiang.
Reference Language based Unsupervised Neural Machine Translation. Zuchao Li, Hai Zhao, Rui Wang, Masao Utiyama and Eiichiro Sumita.
TinyBERT: Distilling BERT for Natural Language Understanding. Xiaoqi Jiao, Yichun Yin, Lifeng Shang, Xin Jiang, Xiao Chen, Linlin Li, Fang Wang and Qun Liu.
Poison Attacks against Text Datasets with Conditional Adversarially Regularized Autoencoder. Alvin Chan, Yi Tay, Yew-Soon Ong and Aston Zhang.
Assessing Human-Parity in Machine Translation on the Segment Level. Yvette Graham, Christian Federmann, Maria Eskevich and Barry Haddow.
Multichannel Generative Language Model: Learning All Possible Factorizations Within and Across Channels. Harris Chan, Jamie Kiros and William Chan.
Factorized Transformer for Multi-Domain Neural Machine Translation. Yongchao Deng, Hongfei Yu, Heng Yu, Xiangyu Duan and Weihua Luo.
Improving Named Entity Recognition with Attentive Ensemble of Syntactic Information. Yuyang Nie, Yuanhe Tian, Yan Song, Xiang Ao and Xiang Wan.
Contract Discovery: Dataset and a Few-Shot Semantic Retrieval Challenge with Competitive Baselines. Łukasz Borchmann, Dawid Wisniewski, Andrzej Gretkowski, Izabela Kosmala, Dawid Jurkiewicz, Łukasz Szałkiewicz, Gabriela Pałka, Karol Kaczmarek, Agnieszka Kaliska and Filip Graliński.
Vocabulary Adaptation for Domain Adaptation in Neural Machine Translation. Shoetsu Sato, Jin Sakuma, Naoki Yoshinaga, Masashi Toyoda and Masaru Kitsuregawa.
A Shared-Private Representation Model with Coarse-to-Fine Extraction
for Target Sentiment Analysis. Peiqin Lin and Meng Yang.
Detecting Media Bias in News Articles using Gaussian Bias Distributions. Wei-Fan Chen, Khalid Al Khatib, Benno Stein and Henning Wachsmuth.
Looking inside Noun Compounds: Unsupervised Prepositional and Free Paraphrasing. Girishkumar Ponkiya, Rudra Murthy, Pushpak Bhattacharyya and Girish Palshikar.
BERT for Monolingual and Cross-Lingual Reverse Dictionary. Hang Yan, Xiaonan Li, Xipeng Qiu and bocao deng.
What's so special about BERT's layers? A closer look at the NLP pipeline in monolingual and multilingual models. Wietse de Vries, Andreas van Cranenburgh and Malvina Nissim.
Leakage-Adjusted Simulatability: Can Models Generate Non-Trivial Explanations of Their Behavior in Natural Language?. Peter Hase, Shiyue Zhang, Harry Xie and Mohit Bansal.
A Pointer Network Architecture for Joint Morphological Segmentation and Tagging. Amit Seker and Reut Tsarfaty.
Beyond Language: Learning Commonsense from Images for Reasoning. Wanqing Cui, Yanyan Lan, Liang Pang, Jiafeng Guo and Xueqi Cheng.
A BERT-based Distractor Generation Scheme with Multi-tasking and Negative Answer Training Strategies.. Ho-Lam Chung, Ying-Hong Chan and Yao-Chung Fan.
Consistent Response Generation with Controlled Specificity. Junya Takayama and Yuki Arase.
Internal and external pressures on language emergence:
least effort, object constancy and frequency. Diana Rodríguez Luna, Edoardo Maria Ponti, Dieuwke Hupkes and Elia Bruni.
Parsing All: Syntax and Semantics, Dependencies and Spans. Junru Zhou, Zuchao Li and Hai Zhao.
LIMIT-BERT : Linguistics Informed Multi-Task BERT. Junru Zhou, Zhuosheng Zhang, Hai Zhao and Shuailiang Zhang.
Improving Limited Labeled Dialogue State Tracking with Self-Supervision. Chien-Sheng Wu, Steven C.H. Hoi and Caiming Xiong.
PrivNet: Safeguarding Private Attributes in Transfer Learning for Recommendation. Guangneng Hu and Qiang Yang.
Learning to Learn to Disambiguate: Meta-Learning for Few-Shot Word Sense Disambiguation. Nithin Holla, Pushkar Mishra, Helen Yannakoudakis and Ekaterina Shutova.
An Empirical Investigation of Beam-Aware Training in Supertagging. Renato Negrinho, Matthew R. Gormley and Geoff Gordon.
Decoding Language Spatial Relations to 2D Spatial Arrangements. Gorjan Radevski, Guillem Collell, Marie-Francine Moens and Tinne Tuytelaars.
Why and when should you pool? Analyzing Pooling in Recurrent Architectures. Pratyush Maini, Keshav Kolluru, Danish Pruthi and Mausam -.
Long Document Ranking with Query-Directed Sparse Transformer. Jyun-Yu Jiang, Chenyan Xiong, Chia-Jung Lee and Wei Wang.
Visuo-Linguistic Question Answering (VLQA) Challenge. Shailaja Keyur Sampat, Yezhou Yang and Chitta Baral.
Exploring BERT's Sensitivity to Lexical Cues using Tests from Semantic Priming. Kanishka Misra, Allyson Ettinger and Julia Rayz.
Multi-hop Question Generation with Graph Convolutional Network. Dan Su, Yan Xu, Wenliang Dai, Ziwei Ji, Tiezheng Yu and Pascale Fung.
MMFT-BERT: Multimodal Fusion Transformer with BERT Encodings for Visual Question Answering. Aisha Urooj, Amir Mazaheri, Niels Da vitoria lobo and Mubarak Shah.
Thinking Like a Skeptic: Defeasible Inference in Natural Language. Rachel Rudinger, Vered Shwartz, Jena D. Hwang, Chandra Bhagavatula, Maxwell Forbes, Ronan Le Bras, Noah A. Smith and Yejin Choi.
Guiding Attention for Self-Supervised Learning with Transformers. Ameet Deshpande and Karthik Narasimhan.
Language-Conditioned Feature Pyramids for Visual Selection Tasks. Taichi Iki and Akiko Aizawa.
BERT-QE: Contextualized Query Expansion for Document Re-ranking. Zhi Zheng, Kai Hui, Ben He, Xianpei Han, Le Sun and Andrew Yates.
ZEN: Pre-training Chinese Text Encoder Enhanced by N-gram Representations. Shizhe Diao, Jiaxin Bai, Yan Song, Tong Zhang and Yonggang Wang.
Probabilistic Case-based Reasoning for \\Open-World Knowledge Graph Completion. Rajarshi Das, Ameya Godbole, Nicholas Monath, Manzil Zaheer and Andrew McCallum.
TLDR: Extreme Summarization of Scientific Documents. Isabel Cachola, Kyle Lo, Arman Cohan and Daniel Weld.
Tri-Train: Automatic Pre-Fine Tuning between Pre-Training and Fine-Tuning for SciNER. Qingkai Zeng, Wenhao Yu, Mengxia Yu, Tianwen Jiang, Tim Weninger and Meng Jiang.
On the Sub-layer Functionalities of Transformer Decoder. Yilin Yang, Longyue Wang, Shuming Shi, Prasad Tadepalli, Stefan Lee and Zhaopeng Tu.
Extremely Low Bit Transformer Quantization for On-Device Neural Machine Translation. Insoo Chung, Byeongwook Kim, Yoonjung Choi, Se Jung Kwon, Yongkweon Jeon, Baeseong Park, Sangha Kim and Dongsoo Lee.
Robust Backed-off Estimation of Out-of-Vocabulary Embeddings. Nobukazu Fukuda, Naoki Yoshinaga and Masaru Kitsuregawa.
Tensorized Embedding Layers. Oleksii Hrinchuk, Valentin Khrulkov, Leyla Mirvakhabova, Elena Orlova and Ivan Oseledets.
Speaker or Listener? The Role of a Dialog Agent. Yafei Liu, Hongjin Qian, Hengpeng Xu and Jinmao Wei.
Bridging Textual and Tabular Data for Cross-Domain Text-to-SQL Semantic Parsing. Xi Victoria Lin, Richard Socher and Caiming Xiong.
Paraphrasing vs Coreferring: Two Sides of the Same Coin. Yehudit Meged, Avi Caciularu, Vered Shwartz and Ido Dagan.
Active Sentence Learning by Adversarial Uncertainty Sampling in Discrete Space. Dongyu Ru, Jiangtao Feng, Lin Qiu, Hao Zhou, Mingxuan Wang, Weinan Zhang, Yong Yu and Lei Li.
Coming to Terms: Automatic Formation of Neologisms in Hebrew. Moran Mizrahi, Stav Yardeni Seelig and Dafna Shahaf.
Joint Intent Detection and Entity Linking on Spatial Domain Queries. lei zhang, Runze Wang, Jingbo Zhou, Jingsong Yu, Zhenhua Ling and Hui Xiong.
IndicNLPSuite: Monolingual Corpora, Evaluation Benchmarks and Pre-trained Multilingual Language Models for Indian Languages. Divyanshu Kakwani, Anoop Kunchukuttan, Satish Golla, Gokul N.C., Avik Bhattacharyya, Mitesh M. Khapra and Pratyush Kumar.
Weakly-Supervised Modeling of Contextualized Event Embedding for Discourse Relations. I-Ta Lee, Maria Leonor Pacheco and Dan Goldwasser.
Short Papers
Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences. Boon Peng Yap, Andrew Koh and Eng Siong Chng.
Adversarial Text Generation via Sequence Contrast Discrimination. Ke Wang and Xiaojun Wan.
A Greedy Bit-flip Training Algorithm for Binarized Knowledge Graph Embeddings. Katsuhiko Hayashi, Koki Kishimoto and Masashi Shimbo.
Neural Speed Reading Audited. Anders Søgaard.
Robustness to Modification with Shared Words in Paraphrase Identification. Zhouxing Shi and Minlie Huang.
Pretrain-KGE: Learning Knowledge Representation from Pretrained Language Models. Zhiyuan Zhang, Xiaoqian Liu, Yi Zhang, Qi Su, Xu SUN and Bin He.
Enhance Robustness of Sequence Labelling with Masked Adversarial Training. Luoxin Chen, Xinyue Liu, Weitong Ruan and Jianhua Lu.
Context-aware Stand-alone Neural Spelling Correction. Xiangci Li, Hairong Liu and Liang Huang.
A Novel Workflow for Accurately and Efficiently Crowdsourcing Predicate Senses and Argument Labels. Youxuan Jiang, Huaiyu Zhu, Jonathan K. Kummerfeld, Yunyao Li and Walter Lasecki.
KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding. Jiyeon Ham, Yo Joong Choe, Kyubyong Park, Ilji Choi and Hyungjoon Soh.
Exploring Versatile Generative Language Model Via Parameter-Efficient Transfer Learning. Zhaojiang Lin, Andrea Madotto and Pascale Fung.
Few-Shot Multi-Hop Relation Reasoning over Knowledge Bases. Chuxu Zhang, Lu Yu, Mandana Saebi, Meng Jiang and Nitesh Chawla.
Learning to Stop: A Simple yet Effective Approach to Urban Vision-Language Navigation. Jiannan Xiang, Xin Wang and William Yang Wang.
PolicyQA: A Reading Comprehension Dataset for Privacy Policies. Wasi Ahmad, Jianfeng Chi, Yuan Tian and Kai-Wei Chang.
Controlled Hallucinations: Learning to Generate Faithfully from Noisy Data. Katja Filippova.
Sequential Span Classification with Neural Semi-Markov CRFs for Biomedical Abstracts. Kosuke Yamada, Tsutomu Hirao, Ryohei Sasano, Koichi Takeda and Masaaki Nagata.
Where to Submit? Helping Researchers to Choose the Right Venue. Konstantin Kobs, Tobias Koopmann, Albin Zehe, David Fernes, Philipp Krop and Andreas Hotho.
Pretrained Language Models for Dialogue Generation with Multiple Input Sources. Yu Cao, Wei Bi, Meng Fang and Dacheng Tao.
Hybrid Emoji-Based Masked Language Models for Zero-Shot Abusive Language Detection. Michele Corazza, Stefano Menini, Elena Cabrio, Sara Tonelli and Serena Villata.
Transition-based Parsing with Stack-Transformers. Ramón Fernandez Astudillo, Miguel Ballesteros, Tahira Naseem, Austin Blodgett and Radu Florian.
PhoBERT: Pre-trained language models for Vietnamese. Dat Quoc Nguyen and Anh Tuan Nguyen.
Make Templates Smarter: A Template Based Data2Text System Powered by Text Stitch Model. Bingfeng Luo, Zuo Bai, Kunfeng Lai and Jianping Shen.
Incorporating Stylistic Lexical Preferences in Generative Language Models. Hrituraj Singh, Gaurav Verma and Balaji Vasan Srinivasan.
HyperText: Endowing FastText with Hyperbolic Geometry. Yudong Zhu, Di Zhou, Jinghui Xiao, Xin Jiang, Xiao Chen and Qun Liu.
Learning to Generate Clinically Coherent Chest X-Ray Reports. Justin Lovelace and Bobak Mortazavi.
What Can We Do to Improve Peer Review in NLP?. Anna Rogers and Isabelle Augenstein.
Biomedical Event Extraction with Hierarchical Knowledge Graphs. Kung-Hsiang Huang, Mu Yang and Nanyun Peng.
Examining the Ordering of Rhetorical Strategies in Persuasive Requests. Omar Shaikh, Jiaao Chen, Jon Saad-Falcon, Polo Chau and Diyi Yang.
Parsing with Multilingual BERT, a Small Corpus, and a Small Treebank. Ethan C. Chau, Lucy H. Lin and Noah A. Smith.
OptSLA: an Optimization-Based Approach for Sequential Label Aggregation. Nasim Sabetpour, Adithya Kulkarni and Qi Li.
Dynamically Updating Event Representations for Temporal Relation
Classification with Multi-category Learning. Fei Cheng, Masayuki Asahara, Ichiro Kobayashi and Sadao Kurohashi.
Investigating Transferability in Pretrained Language Models. Alex Tamkin, Trisha Singh, Davide Giovanardi and Noah Goodman.
exBERT: Extending Pre-trained Models with Domain-specific Vocabulary Under Constrained Training Resources. Wen Tai, H. T. Kung, Xin Dong, Marcus Comiter and Chang-Fu Kuo.
Open Domain Question Answering based on Text Enhanced Knowledge Graph with Hyperedge Infusion. Jiale Han, Bo Cheng and Xu Wang.
Inexpensive Domain Adaptation of Pretrained Language Models: Case Studies on Biomedical NER and Covid-19 QA. Nina Poerner, Ulli Waltinger and Hinrich Schütze.
Pseudo-Bidirectional Decoding for Local Sequence Transduction. Wangchunshu Zhou, Tao Ge and Ke Xu.
TweetEval: Unified Benchmark and Comparative Evaluation for Tweet Classification. Francesco Barbieri, Jose Camacho-Collados, Luis Espinosa Anke and Leonardo Neves.
Cost-effective Selection of Pretraining Data: A Case Study of Pretraining BERT on Social Media. Xiang Dai, Sarvnaz Karimi, Ben Hachey and Cecile Paris.
TopicBERT for Energy Efficient Document Classification. Yatin Chaudhary, Pankaj Gupta, Khushbu Saxena, Vivek Kulkarni, Thomas Runkler and Hinrich Schütze.
DomBERT: Domain-oriented Language Model for Aspect-based Sentiment Analysis. Hu Xu, Bing Liu, Lei Shu and Philip Yu.
Continual Learning Long Short Term Memory. Xin Guo, Yu Tian, Qinghan Xue, Panos Lampropoulos, steven eliuk, Kenneth Barner and Xiaolong wang.
Constrained Decoding for Computationally Efficient Named Entity Recognition Taggers. Brian Lester, Daniel Pressel, Amy Hemmeter, Sagnik Ray Choudhury and Srinivas Bangalore.
PTUM: Pre-training User Model from Unlabeled User Behaviors via Self-supervision. Chuhan Wu, Fangzhao Wu, Tao Qi, Jianxun Lian, Yongfeng Huang and Xing Xie.
Adversarial Subword Regularization for Robust Neural Machine Translation. Jungsoo Park, Mujeen Sung, Jinhyuk Lee and Jaewoo Kang.
Learning Visual-Semantic Embeddings for Reporting Abnormal Findings on Chest X-rays. Jianmo Ni, Chun-Nan Hsu, Amilcare Gentili and Julian McAuley.
Automatically Identifying Gender Issues in Machine Translation using Perturbations. Hila Gonen and Kellie Webster.
MedICaT: A Dataset of Medical Images, Captions, and Textual References. Sanjay Subramanian, Lucy Lu Wang, Ben Bogin, Sachin Mehta, Madeleine van Zuylen, Sravanthi Parasa, Sameer Singh, Matt Gardner and Hannaneh Hajishirzi.
TSDG: Content-aware Neural Response Generation with Two-stage Decoding Process. Junsheng Kong, Zhicheng Zhong, Yi Cai, Xin Wu and Da Ren.
Unsupervised Cross-Lingual Adaptation of Dependency Parsers Using CRF Autoencoders. Zhao Li and Kewei Tu.
Service-oriented Text-to-SQL Parsing. Wangsu Hu and Jilei Tian.
Sparse and Decorrelated Representations for Stable Zero-shot NMT. Bokyung Son and Sungwon Lyu.
Helpful or Hierarchical? Predicting the Communicative Strategies of Chat Participants, and their Impact on Success. Farzana Rashid, Tommaso Fornaciari, Dirk Hovy, Eduardo Blanco and Fernando Vega-Redondo.
Adapting Open Domain Fact Extraction and Verification to COVID-FACT through In-Domain Language Modeling. Zhenghao Liu, Chenyan Xiong, Zhuyun Dai, Si Sun, Maosong Sun and Zhiyuan Liu.
Learning Improvised Chatbots from Adversarial Modifications of Natural Language Feedback. Makesh Narsimhan Sreedhar, Kun Ni and Siva Reddy.
Adapting Coreference Resolution to Twitter Conversations. Berfin Aktaş, Veronika Solopova, Annalena Kohnert and Manfred Stede.
On Romanization for Model Transfer Between Scripts in Neural Machine Translation. Chantal Amrhein and Rico Sennrich.
Answer Span Correction in Machine Reading Comprehension. Revanth Gangi Reddy, Md Arafat Sultan, Efsun Sarioglu Kayi, Rong Zhang, Vittorio Castelli and Avi Sil.
Zero-shot Entity Linking with Efficient Long Range Sequence Modeling. Zonghai Yao, Liangliang Cao and Huapu Pan.
An Investigation of Potential Function Designs for Neural CRF. Zechuan Hu, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang and Kewei Tu.
Extending Multilingual BERT to Low-Resource Languages. Zihan Wang, Karthikeyan K, Stephen Mayhew and Dan Roth.
Cross-Lingual Training of Neural Models for Document Ranking. Peng Shi, He Bai and Jimmy Lin.
Can Pre-training help VQA with Lexical Variations?. Shailza Jolly and Shubham Kapoor.
FENAS: Flexible and Expressive Neural Architecture Search. Ramakanth Pasunuru and Mohit Bansal.
LEGAL-BERT: The Muppets straight out of Law School. Ilias Chalkidis, Manos Fergadiotis, Prodromos Malakasiotis, Nikolaos Aletras and Ion Androutsopoulos.
An Instance Level Approach for Shallow Semantic Parsing in Scientific Procedural Text. Daivik Swarup, Ahsaas Bajaj, Sheshera Mysore, Tim O'Gorman, Rajarshi Das and Andrew McCallum.
On Long-Tailed Phenomena in Neural Machine Translation. Vikas Raunak, Siddharth Dalmia, Vivek Gupta and Florian Metze.
Knowing What You Know: Calibrating Dialogue Belief State Distributions via Ensembles. Carel van Niekerk, Michael Heck, Christian Geishauser, Hsien-chin Lin, Nurul Lubis, Marco Moresi and Milica Gasic.
Domain Adversarial Fine-Tuning as an Effective Regularizer. Giorgos Vernikos, Katerina Margatina, Alexandra Chronopoulou and Ion Androutsopoulos.
Data Annealing for Informal Language Understanding Tasks. Jing Gu and Zhou Yu.
Integrating Task Specific Information into Pretrained Language Models for Low Resource Fine Tuning. Rui Wang, Shijing Si, Guoyin Wang, Lei Zhang, Lawrence Carin and Ricardo Henao.
KoBE: Knowledge-Based Machine Translation Evaluation. Zorik Gekhman, Roee Aharoni, Genady Beryozkin, Markus Freitag and Wolfgang Macherey.
Pushing the Limits of AMR Parsing with Self-Learning. Young-Suk Lee, Ramón Fernandez Astudillo, Tahira Naseem, Revanth Gangi Reddy, Radu Florian and Salim Roukos.
WER we are and WER we think we are. Piotr Szymański, Piotr Żelasko, Mikolaj Morzy, Adrian Szymczak, Marzena Żyła-Hoppe, Joanna Banaszczak, Lukasz Augustyniak, Jan Mizgajski and Yishay Carmiel.
A Novel Challenge Set for Hebrew Morphological Disambiguation and Diacritics Restoration. Avi Shmidman, Joshua Guedalia, Shaltiel Shmidman, Moshe Koppel and Reut Tsarfaty.
Improving Event Duration Prediction via Time-aware Pre-training. Zonglin Yang, Xinya Du, Alexander Rush and Claire Cardie.
What do we expect from Multiple-choice QA Systems?. Krunal Shah, Nitish Gupta and Dan Roth.
Resource-Enhanced Neural Model for Event Argument Extraction. Jie Ma, Shuai Wang, Rishita Anubhai, Miguel Ballesteros and Yaser Al-Onaizan.
Improving Target-side Lexical Transfer in Multilingual Neural Machine Translation. Luyu Gao, Xinyi Wang and Graham Neubig.
Using Visual Feature Space as a Pivot Across Languages. Ziyan Yang, Leticia Pinto-Alva, Franck Dernoncourt and Vicente Ordonez.
Document Classification for COVID-19 Literature. Bernal Jimenez Gutierrez, Jucheng Zeng, Dongdong Zhang, Ping Zhang and Yu Su.
Inserting Information Bottlenecks for Attribution in Transformers. Zhiying Jiang, Raphael Tang, Ji Xin and Jimmy Lin.
MCMH: Learning Multi-Chain Multi-Hop Rules for Knowledge Graph Reasoning. Lu Zhang, Mo Yu, Tian Gao and yue yu.
Weakly- and Semi-supervised Evidence Extraction. Danish Pruthi, Bhuwan Dhingra, Graham Neubig and Zachary C. Lipton.
More Embeddings, Better Sequence Labelers?. Xinyu Wang, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang and Kewei Tu.
NLP Service APIs and Models for Efficient Registration of New Clients. Sahil Shah, Vihari Piratla, Soumen Chakrabarti and Sunita Sarawagi.
Effects of Naturalistic Variation in Goal-Oriented Dialog. Jatin Ganhotra, Robert Moore, Sachindra Joshi and Kahini Wadhawan.
Temporal Reasoning in Natural Language Inference. Siddharth Vashishtha, Adam Poliak, Yash Kumar Lal, Benjamin Van Durme and Aaron Steven White.
A Pilot Study of Text-to-SQL Semantic Parsing for Vietnamese. Anh Tuan Nguyen, Mai Hoang Dao and Dat Quoc Nguyen.
An Empirical Methodology for Detecting and Prioritizing Needs during Crisis Events. M. Janina Sarol, Ly Dinh, Rezvaneh Rezapour, Chieh-Li Chin, Pingjing Yang and Jana Diesner.
Towards Low-Resource Semi-Supervised Dialogue Generation with Meta-Learning. Yi Huang, Junlan Feng, Shuo Ma, Xiaoyu Du and Xiaoting Wu.
#Turki$hTweets: A Benchmark Dataset for Turkish Text Correction. Asiye Tuba Koksal, Ozge Bozal, Emre Yürekli and Gizem Gezici.
Query-Key Normalization for Transformers. Alex Henry, Prudhvi Raj Dachapally, Shubham Shantaram Pawar and Yuxuan Chen.
How Can Self-Attention Networks Recognize Dyck-n Languages?. Javid Ebrahimi, Dhruv Gelda and Wei Zhang.
Training Flexible Depth Model by Multi-Task Learning for Neural Machine Translation. Qiang Wang, Tong Xiao and Jingbo Zhu.
The birth of Romanian BERT. Stefan Dumitrescu, Andrei-Marius Avram and Sampo Pyysalo.
How Effective is Task-Agnostic Data Augmentation for Pretrained Transformers?. Shayne Longpre, Yu Wang and Chris DuBois.
Visually-Grounded Planning without Vision: Language Models Infer Detailed Plans from High-level Instructions. Peter Jansen.
On the Branching Bias of Syntax Extracted from Pre-trained Language Models. Huayang Li, Lemao Liu, Guoping Huang and Shuming Shi.
The Pragmatics behind Politics: Modelling Metaphor, Framing and Emotion in Political Discourse. Pere-Lluís Huguet Cabot, Verna Dankers, David Abadi, Agneta Fischer and Ekaterina Shutova.
SMRT Chatbots: Improving Non-Task-Oriented Dialog with Simulated Multiple Reference Training. Huda Khayrallah and João Sedoc.
Improving Aspect-based Sentiment Analysis with Gated Graph Convolutional Networks and Syntax-based Regulation. Amir Pouran Ben Veyseh, Nasim Nouri, Franck Dernoncourt, Quan Hung Tran, Dejing Dou and Thien Huu Nguyen.
The Dots Have Their Values: Exploiting the Node-Edge Connections in Graph-based Neural Models for Document-level Relation Extraction. Hieu Minh Tran, Minh Trung Nguyen and Thien Huu Nguyen.
Structural and Functional Decomposition for Personality Image Captioning in a Communication Game. Minh Thu Nguyen, Duy Phung, Minh Hoai and Thien Huu Nguyen.
Byte Pair Encoding is Suboptimal for Language Model Pretraining. Kaj Bostrom and Greg Durrett.
Learning to Classify Events from Human Needs Category Descriptions. Haibo Ding and Zhe Feng.
Automatic Term Name Generation for Gene Ontology: Task and Dataset. Yanjian Zhang, Qin Chen, Yiteng Zhang, Zhongyu Wei, Yixu Gao, Jiajie Peng, Zengfeng Huang, Weijian Sun and Xuanjing Huang.
Compressing Transformer-Based Semantic Parsing Models using Compositional Code Embeddings. Prafull Prakash, Saurabh Kumar Shashidhar, Wenlong Zhao, Subendhu Rongali, Haidar Khan and Michael Kayser.
Finding Friends and Flipping Frenemies:
Automatic Paraphrase Dataset Augmentation Using Graph Theory. Hannah Chen, Yangfeng Ji and David Evans.
Hierarchical Region Learning for Nested Named Entity Recognition. Xinwei Long, Shuzi Niu and Yucheng Li.
Understanding User Resistance Strategies in Persuasive Conversations. Youzhi Tian, Weiyan Shi, Chen Li and Zhou Yu.
Exploiting Unsupervised Data for Emotion Recognition in Conversations. Wenxiang Jiao, Michael Lyu and Irwin King.
Do Language Embeddings capture Scales?. Xikun Zhang, Deepak Ramachandran, Ian Tenney, Yanai Elazar and Dan Roth.
Dual Inference for Improving Language Understanding and Generation. Shang-Yu Su, Yung-Sung Chuang and Yun-Nung Chen.
Enhancing Generalization in Natural Language Inference by Syntax. Qi He, Han Wang and Yue Zhang.