license: cc-by-4.0

GenRead: FiD model trained on TQA

-- This is a model checkpoint of GenRead [2], based on T5-3B and trained on TriviaQA [1].

-- Hyperparameters: 8 x 80GB A100 GPUs; batch size 16; AdamW optimizer; learning rate 6e-5; best dev-set result at 8,500 steps
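
-- For convenience, the settings above can be written down as a small config object. This is only a sketch: the class name `TrainConfig` and its field names are hypothetical and do not come from the GenRead code, and the example count assumes that 16 is the global batch size (the README does not say whether it is global or per GPU).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainConfig:
    """Hypothetical container for the hyperparameters listed in this README."""

    num_gpus: int = 8              # 8 x 80GB A100
    batch_size: int = 16           # assumed to be the global batch size
    optimizer: str = "AdamW"
    learning_rate: float = 6e-5
    best_dev_step: int = 8500      # step with the best dev-set result

    def examples_seen_at_best_step(self) -> int:
        # Valid only under the global-batch-size assumption above.
        return self.batch_size * self.best_dev_step


config = TrainConfig()
print(config.examples_seen_at_best_step())
```

Under that assumption, the best checkpoint corresponds to 16 x 8,500 training examples processed.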

References:

[1] TriviaQA: A Large Scale Dataset for Reading Comprehension and Question Answering. ACL 2017

[2] Generate rather than Retrieve: Large Language Models are Strong Context Generators. arXiv 2022

Model performance

We evaluate the model on the TriviaQA dataset; the exact match (EM) score is 71.55.
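
For readers unfamiliar with the metric, here is a minimal sketch of how an exact match score is commonly computed for open-domain QA. It assumes the widely used SQuAD-style answer normalization (lowercasing, removing punctuation and English articles, collapsing whitespace); the README does not state which evaluation script produced the number above, and the function names are illustrative.

```python
import re
import string


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, gold_answers: list[str]) -> bool:
    """True if the normalized prediction equals any normalized gold answer."""
    pred = normalize_answer(prediction)
    return any(pred == normalize_answer(gold) for gold in gold_answers)


def em_score(predictions: list[str], references: list[list[str]]) -> float:
    """Percentage of predictions that exactly match a gold answer."""
    hits = sum(exact_match(p, golds) for p, golds in zip(predictions, references))
    return 100.0 * hits / len(predictions)


# Toy data, not taken from TriviaQA.
predictions = ["The Beatles!", "Paris"]
references = [["Beatles"], ["London"]]
print(em_score(predictions, references))
```

Each question may have several acceptable answers, so a prediction counts as correct if it matches any of them after normalization.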
