Follow
Marius Mosbach
Marius Mosbach
Mila - Quebec AI Institute, McGill University
Verified email at mila.quebec - Homepage
Title
Cited by
Cited by
Year
On the stability of fine-tuning BERT: Misconceptions, explanations, and strong baselines
M Mosbach, M Andriushchenko, D Klakow
ICLR 2021, 2021
4242021
Adapting pre-trained language models to African languages via multilingual adaptive fine-tuning
JO Alabi, DI Adelani, M Mosbach, D Klakow
🏆 Best Paper Award 🏆 COLING 2022, 2022
1052022
LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders
P BehnamGhader, V Adlakha, M Mosbach, D Bahdanau, N Chapados, ...
COLM 2024, 2024
962024
Few-shot fine-tuning vs. in-context learning: A fair comparison and evaluation
M Mosbach, T Pimentel, S Ravfogel, D Klakow, Y Elazar
ACL 2023 (Findings), 2023
942023
Logit pairing methods can fool gradient-based attacks
M Mosbach, M Andriushchenko, T Trost, M Hein, D Klakow
NeurIPS 2018 - Workshop on Security in Machine Learning, 2018
812018
On the interplay between fine-tuning and sentence-level probing for linguistic knowledge in pre-trained transformers
M Mosbach, A Khokhlova, MA Hedderich, D Klakow
EMNLP 2020 (Findings), 2020
502020
Measuring Causal Effects of Data Statistics on Language Model's `Factual' Predictions
Y Elazar, N Kassner, S Ravfogel, A Feder, A Ravichander, M Mosbach, ...
arXiv preprint arXiv:2207.14251, 2022
482022
MCSE: Multimodal Contrastive Learning of Sentence Embeddings
M Zhang, M Mosbach, DI Adelani, MA Hedderich, D Klakow
NAACL 2022, 2022
332022
Fusion Models for Improved Image Captioning
M Kalimuthu, A Mogadala, M Mosbach, D Klakow
Pattern Recognition. ICPR International Workshops and Challenges: Virtual …, 2021
182021
Do Acoustic Word Embeddings Capture Phonological Similarity? An Empirical Study
BM Abdullah, M Mosbach, I Zaitova, B Möbius, D Klakow
Interspeech 2021, 2021
162021
Weaker Than You Think: A Critical Look at Weakly Supervised Learning
D Zhu, X Shen, M Mosbach, A Stephan, D Klakow
🏆 Best Paper Award 🏆 ACL 2023, 2023
142023
StereoKG: Data-Driven Knowledge Graph Construction for Cultural Knowledge and Stereotypes
A Deshpande, D Ruiter, M Mosbach, D Klakow
Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH), 2022
122022
On the Security Relevance of Initial Weights in Deep Neural Networks
K Grosse, TA Trost, M Mosbach, M Backes, D Klakow
Artificial Neural Networks and Machine Learning–ICANN 2020: 29th …, 2020
12*2020
Graph-based argument quality assessment
E Saveleva, V Petukhova, M Mosbach, D Klakow
Proceedings of the International Conference on Recent Advances in Natural …, 2021
112021
incom. py-A Toolbox for Calculating Linguistic Distances and Asymmetries between Related Languages
M Mosbach, I Stenger, T Avgustinova, D Klakow
Proceedings of the International Conference on Recent Advances in Natural …, 2019
102019
The impact of demonstrations on multilingual in-context learning: A multidimensional analysis
M Zhang, V Gautam, M Wang, JO Alabi, X Shen, D Klakow, M Mosbach
ACL 2024 (Findings), 2024
82024
Some steps towards the generation of diachronic WordNets
Y Bizzoni, M Mosbach, D Klakow, S Degaetano-Ortlieb
Proceedings of the 22nd Nordic conference on computational linguistics, 55-64, 2019
82019
Large GPT-like Models are Bad Babies: A Closer Look at the Relationship between Linguistic Competence and Psycholinguistic Measures
J Steuer, M Mosbach, D Klakow
🏆 Best Paper Award 🏆 BabyLM workshop 2023, 2023
72023
Artefact retrieval: Overview of NLP models with knowledge base access
V Zouhar, M Mosbach, D Biswas, D Klakow
arXiv preprint arXiv:2201.09651, 2022
62022
A Closer Look at Linguistic Knowledge in Masked Language Models: The Case of Relative Clauses in American English
M Mosbach, S Degaetano-Ortlieb, MP Krielke, BM Abdullah, D Klakow
COLING 2020, 2020
52020
The system can't perform the operation now. Try again later.
Articles 1–20