
TinyLLM

@tiny_language

TinyLLM: Bringing language models to constrained edge devices for supporting embedded sensing applications

TinyLLM reposted

Automating the Search for Artificial Life with Foundation Models arxiv.org/abs/2412.17799


TinyLLM reposted

Whoa! In @NEJM_AI: A Multimodal Biomedical Foundation Model Trained from 15 Million Image–Text Pairs… fully open-access foundation model! ai.nejm.org/doi/full/10.10…


TinyLLM reposted

150GB of math data, holy shit


TinyLLM reposted

🧠💡 Our LLMs just had a ‘memory augmentation’—now they can deliberate like seasoned thinkers! arxiv.org/abs/2412.17747


TinyLLM reposted

Sloth: a scaling law for LLM skills that predicts benchmark scores (even from a single model per family). It is based on factor analysis and has family-specific parameters, making it interpretable and accurate!

Paper: arxiv.org/abs/2412.06540
GitHub: github.com/felipemaiapolo…

🧵1/8
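For intuition about the factor-analysis framing, here is a toy sketch on synthetic data (not the Sloth implementation, which lives at the GitHub link above): fit a low-rank latent-skill model to a models × benchmarks score matrix.

import numpy as np
from sklearn.decomposition import FactorAnalysis

# Synthetic stand-in for a (models x benchmarks) score matrix:
# each model has 2 latent "skills", each benchmark loads on them.
rng = np.random.default_rng(0)
skills = rng.normal(size=(12, 2))
loadings = rng.normal(size=(2, 6))
scores = skills @ loadings + 0.1 * rng.normal(size=(12, 6))

# Factor analysis recovers a low-dimensional skill space; a new
# model's benchmark scores can then be predicted from its factors.
fa = FactorAnalysis(n_components=2, random_state=0)
latent = fa.fit_transform(scores)   # per-model skill estimates, (12, 2)
print(fa.components_.shape)         # benchmark loadings, (2, 6)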

TinyLLM reposted

Our new paper "Fooling LLM graders into giving better grades through neural activity guided adversarial prompting", expertly led by @atsushi_y1230 arxiv.org/abs/2412.15275 With the potential rise of automated grading, we examine the fragility of these grading systems to attacks…

Excited to share our latest work, "Fooling LLM graders into giving better grades through neural activity-guided adversarial prompting" (w/ @SuryaGanguli)! We investigate distorting AI decision-making to build fair and robust AI judges/graders. arxiv.org/abs/2412.15275 #AISafety 1/n



TinyLLM reposted

I'm more and more confident that tokenization will be gone.

Humans don't think in "tokens".

Tokens are hardcoded abstractions in LLMs that lead to weird behavior: LLMs can solve PhD-level math questions but cannot answer "Is 9.9 > 9.11?"

Meta is shifting LLMs to LCMs (Large…
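A quick way to see the kind of tokenization artifact the post is pointing at, as a hypothetical illustration using the tiktoken package and its cl100k_base encoding (other tokenizers may split these strings differently):

import tiktoken  # pip install tiktoken

# Compare how a BPE tokenizer splits the two numerals.
enc = tiktoken.get_encoding("cl100k_base")
for text in ("9.9", "9.11"):
    ids = enc.encode(text)
    pieces = [enc.decode([tok]) for tok in ids]
    print(f"{text!r} -> {len(ids)} token(s): {pieces}")

If the two numbers split into different piece patterns, the model never sees a digit-aligned representation of them, which is one plausible source of the 9.9 vs 9.11 confusion.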

TinyLLM reposted

2023: the year prompt engineering was deprecated
2024: the year supervised finetuning was deprecated
what will be in 2025?

