Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation Paper • 2510.22115 • Published Oct 25 • 83
A Comprehensive Study of Knowledge Editing for Large Language Models Paper • 2401.01286 • Published Jan 2, 2024 • 21
Know Your Needs Better: Towards Structured Understanding of Marketer Demands with Analogical Reasoning Augmented LLMs Paper • 2401.04319 • Published Jan 9, 2024 • 1
A Fast Fourier Convolutional Deep Neural Network For Accurate and Explainable Discrimination Of Wheat Yellow Rust And Nitrogen Deficiency From Sentinel-2 Time-Series Data Paper • 2306.17207 • Published Jun 29, 2023 • 1
Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts Paper • 2405.19893 • Published May 30, 2024 • 33
OneKE: A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System Paper • 2412.20005 • Published Dec 28, 2024 • 17
OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs Paper • 2409.05152 • Published Sep 8, 2024 • 32
Retrieve, Summarize, Plan: Advancing Multi-hop Question Answering with an Iterative Approach Paper • 2407.13101 • Published Jul 18, 2024
KAG: Boosting LLMs in Professional Domains via Knowledge Augmented Generation Paper • 2409.13731 • Published Sep 10, 2024 • 1
A Natural Language Processing Pipeline of Chinese Free-text Radiology Reports for Liver Cancer Diagnosis Paper • 2004.13848 • Published Apr 10, 2020
LookAhead Tuning: Safer Language Models via Partial Answer Previews Paper • 2503.19041 • Published Mar 24 • 5
Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs Paper • 2503.05139 • Published Mar 7 • 4
K-ON: Stacking Knowledge On the Head Layer of Large Language Model Paper • 2502.06257 • Published Feb 10
Improving Natural Language Understanding for LLMs via Large-Scale Instruction Synthesis Paper • 2502.03843 • Published Feb 6
Have We Designed Generalizable Structural Knowledge Promptings? Systematic Evaluation and Rethinking Paper • 2501.00244 • Published Dec 31, 2024 • 1
Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging Paper • 2502.06876 • Published Feb 8
Think-in-Memory: Recalling and Post-thinking Enable LLMs with Long-Term Memory Paper • 2311.08719 • Published Nov 15, 2023
Toward Stable and Consistent Evaluation Results: A New Methodology for Base Model Evaluation Paper • 2503.00812 • Published Mar 2
Bi'an: A Bilingual Benchmark and Model for Hallucination Detection in Retrieval-Augmented Generation Paper • 2502.19209 • Published Feb 26