KeepCurrent

@_keepcurrent

We organize #meetups, #hackathons and conduct #tailormade #trainings and #workshops for companies: #MachineLearning #MLOps, #DataScience & #SoftwareDevelopment.

Science & Technology

Vienna, Austria

keep-current.com

Joined May 2018

114Posts 96Followers 295Following

You might like

@Blindspot_AI

@GapDataInst

@thomaspamminger

KeepCurrent reposted

Hila Gonen

@hila_gonen

Aug 9, 2024

Do you like yellow? Then, according to LLMs, you are probably a school bus driver! Excited to share our new paper about Semantic Leakage in Language Models! Joint work with my wonderful collaborators @terra @alisawuffles @luke @nlpnoah Paper: gonenhila.github.io/files/Semantic… 1/10

hila_gonen's tweet image. Do you like yellow? Then, according to LLMs, you are probably a school bus driver!
Excited to share our new paper about Semantic Leakage in Language Models!
Joint work with my wonderful collaborators @terra @alisawuffles @luke @nlpnoah

Paper: gonenhila.github.io/files/Semantic…

1/10

KeepCurrent reposted

Vaibhav (VB) Srivastav

@reach_vb

Jul 21, 2024

Probably the craziest week in Open Source AI (yet): 1. Mistral (in collaboration with Nvidia) dropped Apache 2.0 licensed NeMo 12B LLM, better than L3 8B and Gemma 2 9B. Models are multilingual with 128K context and a highly efficient tokenizer - tekken. 2. Apple released DCLM…

KeepCurrent reposted

Tiago Pimentel

@tpimentelms

Dec 8, 2023

Are you interested in word lengths and natural language’s efficiency? If yes, our new #EMNLP2023 paper has everything you need: drama, suspense, a new derivation of Zipf’s law, an update to Piantadosi et al’s classic word length paper, transformers... 🧵 arxiv.org/abs/2312.03897

tpimentelms's tweet image. Are you interested in word lengths and natural language’s efficiency? If yes, our new #EMNLP2023 paper has everything you need: drama, suspense, a new derivation of Zipf’s law, an update to Piantadosi et al’s classic word length paper, transformers... 🧵

arxiv.org/abs/2312.03897

KeepCurrent reposted

Zhiqiu (Oscar) Xu

@oscar_zhiqiu_xu

Dec 1, 2023

You don’t have to train from scratch whenever developing a smaller model of an existing model family. Sharing our latest work - “Initializing Models with Larger Ones” arxiv preprint: arxiv.org/abs/2311.18823 code: github.com/OscarXZQ/weigh…

oscar_zhiqiu_xu's tweet image. You don’t have to train from scratch whenever developing a smaller model of an existing model family.

Sharing our latest work - “Initializing Models with Larger Ones”

arxiv preprint: arxiv.org/abs/2311.18823
code: github.com/OscarXZQ/weigh…

KeepCurrent reposted

Aviv Slobodkin

@lovodkin93

Nov 14, 2023

🎉Excited to announce our paper's acceptance at #EMNLP2023! We explore a fascinating question: When faced with (un)answerable queries, do LLMs actually grasp the concept of (un)answerability?🧐 This work is a collaborative effort with @clu_avi @ravfogel @omerNLP and Ido Dagan 1/n

KeepCurrent reposted

Samuel Müller

@SamuelMullr

Nov 9, 2023

There is a paper by Google trending right now, that claims transformer in-context learning cannot generalize between two function classes I have reproduced their experiment in a colab and come to a very different conclusion...

SamuelMullr's tweet image. There is a paper by Google trending right now, that claims transformer in-context learning cannot generalize between two function classes

I have reproduced their experiment in a colab and come to a very different conclusion...

KeepCurrent reposted

ACL 2025

@aclmeeting

Oct 21, 2023

ACL org announcement: 📢The list of accepted workshops in the ACL Conferences (@aclmeeting, @eaclmeeting, @naaclmeeting, @emnlpmeeting) in 2024 is out! Please help spread the word. Retweeting w/ references, esp. w/organisers information is very much appreciated - thanks! #NLProc

KeepCurrent reposted

Bojan Tunguz

@tunguz

Apr 5, 2023

Pandas 2.0 is here! This is the biggest overhaul of Pandas since its inception, and it has been years in the making. However, you will probably not notice too many changes, and all your existing Pandas code will most likely run the same as before. All the major changes are under…

tunguz's tweet image. Pandas 2.0 is here! This is the biggest overhaul of Pandas since its inception, and it has been years in the making. However, you will probably not notice too many changes, and all your existing Pandas code will most likely run the same as before. All the major changes are under…

KeepCurrent reposted

Steve Stewart-Williams

@SteveStuWill

Mar 18, 2023

Psychologists have posited hundreds of cognitive biases over the years. A new paper argues that they all boil down to one of a handful of fundamental beliefs coupled with confirmation bias. doi.org/10.1177/174569…

SteveStuWill's tweet image. Psychologists have posited hundreds of cognitive biases over the years. A new paper argues that they all boil down to one of a handful of fundamental beliefs coupled with confirmation bias. doi.org/10.1177/174569…

KeepCurrent reposted

Guillaume Lample @ NeurIPS 2024

@GuillaumeLample

Feb 24, 2023

Today we release LLaMA, 4 foundation models ranging from 7B to 65B parameters. LLaMA-13B outperforms OPT and GPT-3 175B on most benchmarks. LLaMA-65B is competitive with Chinchilla 70B and PaLM 540B. The weights for all models are open and available at research.facebook.com/publications/l… 1/n

GuillaumeLample's tweet image. Today we release LLaMA, 4 foundation models ranging from 7B to 65B parameters.
LLaMA-13B outperforms OPT and GPT-3 175B on most benchmarks. LLaMA-65B is competitive with Chinchilla 70B and PaLM 540B.
The weights for all models are open and available at research.facebook.com/publications/l…
1/n

KeepCurrent reposted

DeepAI

@DeepAI

Feb 3, 2023

Read about Non-Parametric Model deepai.org/machine-learni… #OrdinalNumber #NonParametricModel

Non-Parametric Model

Source: deepai.org

KeepCurrent reposted

NeurIPS Conference

@NeurIPSConf

Jan 13, 2023

You can now watch the recorded material from #NeurIPS2022 online without registration at: slideslive.com/neurips-2022

KeepCurrent reposted

Science Is Strategic

@scienceisstrat1

Jan 10, 2023

1/ The 2022 climate data is out. And it’s terrifying. Overall, 2022 was the fifth warmest year on record. Below is a short🧵on 2022’s record-breaking droughts, floods, ‘rain bombs,’ wildfires & more Cc: @dwallacewells @JesseJenkins @ylecun @leahstokes @Noahpinion @MichaelEMann

scienceisstrat1's tweet image. 1/ The 2022 climate data is out. And it’s terrifying.

Overall, 2022 was the fifth warmest year on record.

Below is a short🧵on 2022’s record-breaking droughts, floods, ‘rain bombs,’ wildfires &amp; more

Cc: @dwallacewells @JesseJenkins @ylecun @leahstokes @Noahpinion @MichaelEMann

KeepCurrent reposted

Sasha Rush

@srush_nlp

Dec 21, 2022

If you are looking for a winter break project, here is the full collection of ML/coding puzzles. (I think this is more useful than prompting, but who knows?) * github.com/srush/tensor-p… * github.com/srush/gpu-puzz… * github.com/srush/autodiff… * github.com/srush/raspy

srush_nlp's tweet image. If you are looking for a winter break project, here is the full collection of ML/coding puzzles.
(I think this is more useful than prompting, but who knows?)
* github.com/srush/tensor-p…
* github.com/srush/gpu-puzz…
* github.com/srush/autodiff…
* github.com/srush/raspy

KeepCurrent reposted

Simone Scardapane

@s_scardapane

Dec 23, 2022

*Thinking Like Transformers* Awesome blog post by @srush_nlp based on the paper by the same name. If you write a programming language inspired by the way Transformers work, how easy would it be to program in it? 👀 Blog: srush.github.io/raspy/ Paper: arxiv.org/pdf/2106.06981…

s_scardapane's tweet image. *Thinking Like Transformers*

Awesome blog post by @srush_nlp based on the paper by the same name.

If you write a programming language inspired by the way Transformers work, how easy would it be to program in it? 👀

Blog: srush.github.io/raspy/
Paper: arxiv.org/pdf/2106.06981…

KeepCurrent reposted

Fangyu Liu

@hardy_qr

Dec 21, 2022

📍🧵🚨 QA on plots & charts is a complex task requiring sophisticated reasoning - our visual language models struggle with this. LLMs are super strong reasoners - but they only work for text. What do we do? We translate plots & charts to text so LLM can understand!

hardy_qr's tweet image. 📍🧵🚨 QA on plots &amp; charts is a complex task requiring sophisticated reasoning - our visual language models struggle with this.

LLMs are super strong reasoners - but they only work for text.

What do we do? We translate plots &amp; charts to text so LLM can understand!

KeepCurrent reposted

Abhilasha Ravichander

@lasha_nlp

Nov 8, 2022

🚨Help NLP models process negation🚨 Introducing ℂ𝕆ℕ𝔻𝔸ℚ𝔸, a *contrastive* reading comprehension dataset that requires reasoning about negation w/ @nlpmattg & @anmarasovic @ai2_allennlp, at #EMNLP2022 📝Paper arxiv.org/abs/2211.00295 🚀Data github.com/AbhilashaRavic… [1/8]

lasha_nlp's tweet image. 🚨Help NLP models process negation🚨

Introducing ℂ𝕆ℕ𝔻𝔸ℚ𝔸, a *contrastive* reading comprehension dataset that requires reasoning about negation

w/ @nlpmattg &amp; @anmarasovic @ai2_allennlp, at #EMNLP2022

📝Paper arxiv.org/abs/2211.00295
🚀Data github.com/AbhilashaRavic… [1/8]

KeepCurrent reposted

Aman Jha

@amanjha__

Oct 24, 2022

It's here! Upload *any paper* to Explainpaper and start instantly getting explanations! Ask follow up questions if you need a more in-depth answer. Go to explainpaper.com and go read all the papers you've been saving! 📝📝📝

KeepCurrent reposted

Mathias Lechner

@mlech26l

Oct 13, 2022

We (joint work with @ramin_m_h) have released PyHopper, a hyperparameter tuning platform for streamlining machine learning research. Pyhopper's goal is to enable people of any skill level to set up advanced multi-GPU hyperparameter tuning processes in less than a minute.

mlech26l's tweet image. We (joint work with @ramin_m_h) have released PyHopper, a hyperparameter tuning platform for streamlining machine learning research.
Pyhopper's goal is to enable people of any skill level to set up advanced multi-GPU hyperparameter tuning processes in less than a minute.

KeepCurrent reposted

GenBench

@GenBench

Oct 7, 2022

Ever wanted to know more about generalisation in NLP but overwhelmed with the number of papers on ArXiv? Fear not! We read 400+ papers, 600+ experiments, and designed a taxonomy 📝 to categorise the research for you! (1/n) 🧵 arxiv.org/abs/2210.03050