InvBERT: Reconstructing Text from Contextualized Word Embeddings by inverting the BERT pipeline Paper • 2109.10104 • Published Sep 21, 2021 • 4
Towards Simulating Social Media Users with LLMs: Evaluating the Operational Validity of Conditioned Comment Prediction Paper • 2602.22752 • Published 13 days ago • 6
Next Reply Prediction X Dataset: Linguistic Discrepancies in Naively Generated Content Paper • 2602.19177 • Published 17 days ago • 4
Don't Trust Generative Agents to Mimic Communication on Social Networks Unless You Benchmarked their Empirical Realism Paper • 2506.21974 • Published Jun 27, 2025 • 5
Sudden Drops in the Loss: Syntax Acquisition, Phase Transitions, and Simplicity Bias in MLMs Paper • 2309.07311 • Published Sep 13, 2023 • 4