Amir H. Kargaran

@amir_nlp

On job market / 🤖 PhD student @CisLmu / 🛠️ Multilingual NLP / Previous: Intern @huggingface. My views!

Munich, Germany

kargaranamir.github.io

Joined August 2019

390 Posts, 812 Followers, 3K Following


Amir H. Kargaran reposted

Mohammad Sadegh Akhondzadeh

@MS_Akhondzadeh

Nov 5

Excited to share our latest work, “KurTail: Kurtosis-based LLM Quantization”, which will be presented as a poster at EMNLP 2025! You can find our paper here: aclanthology.org/2025.findings-…

I'm tired of hallucinated citations.* I never use AI for literature review, and I always, especially as I gain more experience, verify every paper I cite. *As a reviewer, if I encounter such issues, I will view the paper negatively, as they reflect a lack of academic integrity.

Amir H. Kargaran

@amir_nlp

Oct 29

Nice feature from @openreviewnet. You can track publication versions across different venues (e.g., arXiv, ICLR, ACL)!

Amir H. Kargaran reposted

Sahand Sharifzadeh

@sahandsharif

Oct 23

It's been a while, but I'm glad that my DeepMind M2L lecture on Transformers is still proving useful to the community. My goal was to explain Transformers through the lens of images and graphs, a perspective I personally helped shape in earlier days. youtu.be/IvWHJvasGb0?si…

2023 1.3 Transformers - Sahand Sharifzadeh (youtube.com)

Amir H. Kargaran reposted

Haneul Yoo

@HaneulYoo13

Oct 7

📢 I'm not physically attending #COLM2025 @COLM_conf, but organizing @MeltWorkshop ✨Multilingual and Equitable Language Technologies✨ 📍Rm 520D 📅 Oct 10 (Fri) 🔗 melt-workshop.github.io Please stop by our workshop if you're working on multilingual/multicultural LLMs 🙌

Amir H. Kargaran reposted

Guilherme Penedo

@gui_penedo

Oct 7

We're in 🇨🇦 for @COLM_conf Come talk to me and @HKydlicek on Wednesday at the FineWeb2 poster session (Session 3, Poster #58) @LoubnaBenAllal1 will be on Session 5, Poster #23 (SmolLM2)


Amir H. Kargaran

@amir_nlp

Aug 23

Stella Biderman

@BlancheMinerva

Aug 22

What do you call those units of semantic text the LLM compresses English and German into when you brag about the compression rate? It's not UTF-8 bytes... there's a word for it, maybe starts with a T?

Amir H. Kargaran reposted

Guilherme Penedo

@gui_penedo

Aug 22

> SmolLM3
> GLM-4.5
> NVIDIA-Nemotron-Nano

These are just some of the recent OS releases relying on 🥂 FineWeb2 for their multilingual data

Proud that the community trusts us for their data supply 🫡

Amir H. Kargaran reposted

Sarath Chandar

@apsarathchandar

Aug 22

Molecules speak in atoms and bonds. LLMs can learn that language. Even with SOTA #denovo design, our largest molecular LLM study finds a plot twist: early saturation, weak scaling, and proxy metrics that mislead on real tasks! Led by @kchitsaz and @roshan_msb 🧵 More in thread:

Amir H. Kargaran

@amir_nlp

Aug 7

Crazy how much attention OCR is getting. I couldn't find any that work well with less common scripts. Stress-test it with Cuneiform, at least it's part of Unicode!

Amir H. Kargaran

@amir_nlp

Jul 27

I'm at the @aclmeeting to present our papers on multilingual evaluation, programming languages, and translation! #ACL2025 Feel free to stop by to exchange ideas and discuss! I’m also on the job market. If you think there’s a potential fit, I’d love to hear from you.


Amir H. Kargaran reposted

Guilherme Penedo

@gui_penedo

Jul 9

FineWeb2 🥂 has been accepted to @COLM_conf See you in October 🇨🇦

Guilherme Penedo

@gui_penedo

Jun 27

We have finally released the 📝paper for 🥂FineWeb2, our large multilingual pre-training dataset. Along with general (and exhaustive) multilingual work, we introduce a concept that can also improve English performance: deduplication-based upsampling, which we call rehydration.


Amir H. Kargaran reposted

Guilherme Penedo

@gui_penedo

Jun 27

Amir H. Kargaran reposted

Haneul Yoo

@HaneulYoo13

Jun 5

Are you working on multilingual, multicultural #LLM? Interested in diverse & inclusive language modeling? 😎 Stay tuned at our MELT workshop collocated with #COLM2025 🔗 melt-workshop.github.io 🫶 We welcome 2p (EA), 4p (short), 8p (long) papers as well as talented reviewers!

MELT Workshop

@MeltWorkshop

Jun 4

🌍 ✨ Introducing Melt Workshop 2025: Multilingual, Multicultural, and Equitable Language Technologies A workshop on building inclusive, culturally-aware LLMs! 🧠 Bridging the language divide in AI 📅 October 10, 2025 | Co-located with @COLM_conf 🔗 melt-workshop.github.io