[54] Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language Models

August 25, 2022 ยท 2 min ยท long8v ยท 

[16] Counterfactual Memorization in Neural Language Models

March 25, 2022 ยท 3 min ยท long8v ยท 

[15] Quantifying Memorization Across Neural Language Models

March 24, 2022 ยท 3 min ยท long8v ยท