izmyon's Diary

A study log of a graduate student doing research deep in the mountains of Nara.

List of articles for the one-month period from 2023-05-01

Today's Paper 2023/05/26: Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models arxiv.org Wei, Jason, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Ed Chi, Quoc Le, and Denny Zhou. "Chain of thought prompting elicits reasoning in large language model…

Today's Paper 2023/05/24,25: Controlled Hallucinations: Learning to Generate Faithfully from Noisy Data

Controlled Hallucinations: Learning to Generate Faithfully from Noisy Data aclanthology.org Katja Filippova. 2020. Controlled Hallucinations: Learning to Generate Faithfully from Noisy Data. In Findings of the Association for Computational…

Today's Paper 2023/05/21,22: Retrieval Augmentation Reduces Hallucination in Conversation

Retrieval Augmentation Reduces Hallucination in Conversation aclanthology.org Kurt Shuster, Spencer Poff, Moya Chen, Douwe Kiela, and Jason Weston. 2021. Retrieval Augmentation Reduces Hallucination in Conversation. In Findings of the Asso…

Today's Paper 2023/05/20: On the Origin of Hallucinations in Conversational Models: Is it the Datasets or the Models?

On the Origin of Hallucinations in Conversational Models: Is it the Datasets or the Models? aclanthology.org ©2022 Association for Computational Linguistics License: Creative Commons Attribution 4.0 International License (CC-BY) This article is a summary by the author based on the original…

Today's Paper 2023/05/18,19: Diving Deep into Modes of Fact Hallucinations in Dialogue Systems

Diving Deep into Modes of Fact Hallucinations in Dialogue Systems aclanthology.org This article is a summary by the author based on the original…

Today's Paper 2023/05/15,16: The Curious Case of Hallucinations in Neural Machine Translation

The Curious Case of Hallucinations in Neural Machine Translation aclanthology.org This article is a summary by the author based on the original…

Today's Paper 2023/05/13,14: A Distributional Lens for Multi-Aspect Controllable Text Generation

A Distributional Lens for Multi-Aspect Controllable Text Generation aclanthology.org This article is a summary by the author based on the original…

Today's Paper 2023/05/11,12: AttentionViz: A Global View of Transformer Attention

AttentionViz: A Global View of Transformer Attention arxiv.org Yeh, Catherine, Yida Chen, Aoyu Wu, Cynthia Chen, Fernanda Viégas, and Martin Wattenberg. "AttentionViz: A Global View of Transformer Attention." arXiv preprint arXiv:2305.0321…

Today's Paper 2023/05/09,10: CIKQA: Learning Commonsense Inference with a Unified Knowledge-in-the-loop QA Paradigm

CIKQA: Learning Commonsense Inference with a Unified Knowledge-in-the-loop QA Paradigm aclanthology.org This article is a summary by the author based on the original…

Today's Paper 2023/05/07,08: Human Evaluation of Conversations is an Open Problem: comparing the sensitivity of various methods for evaluating dialogue agents

Human Evaluation of Conversations is an Open Problem: comparing the sensitivity of various methods for evaluating dialogue agents aclanthology.org Eric Smith, Orion Hsu, Rebecca Qian, Stephen Roller, Y-Lan Boureau, and Jason Weston. 2022. …

Today's Paper 2023/05/04,05: Long-term Control for Dialogue Generation: Methods and Evaluation

Long-term Control for Dialogue Generation: Methods and Evaluation aclanthology.org Ramya Ramakrishnan, Hashan Narangodage, Mauro Schilman, Kilian Weinberger, and Ryan McDonald. 2022. Long-term Control for Dialogue Generation: Methods and E…

Today's Paper 2023/05/03: SKILL: Structured Knowledge Infusion for Large Language Models

SKILL: Structured Knowledge Infusion for Large Language Models. aclanthology.org Fedor Moiseev, Zhe Dong, Enrique Alfonseca, and Martin Jaggi. 2022. SKILL: Structured Knowledge Infusion for Large Language Models. In Proceedings of the 2022…

Today's Paper 2023/05/01,02: Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space

Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space aclanthology.org Mor Geva, Avi Caciularu, Kevin Wang, and Yoav Goldberg. 2022. Transformer Feed-Forward Layers Build Predictions by Promoting C…