#regression_analysis_in_machine_learning 搜尋結果

Devvrit

年10月16日

Wish to build scaling laws for RL but not sure how to scale? Or what scales? Or would RL even scale predictably? We introduce: The Art of Scaling Reinforcement Learning Compute for LLMs

Devvrit_Khatri's tweet image. Wish to build scaling laws for RL but not sure how to scale? Or what scales? Or would RL even scale predictably?
We introduce: The Art of Scaling Reinforcement Learning Compute for LLMs

Dr. Ganapathi Pulipaka 🇺🇸

@gp_pulipaka

1 分鐘

Important Statistical Techniques. @abacusai #BigData #Analytics #DataScience #AI #MachineLearning #IoT #IIoT #Python #RStats #TensorFlow #Java #JavaScript #ReactJS #CloudComputing #Serverless #Linux #Statistics #Programming #Coding #100DaysofCode…

gp_pulipaka's tweet image. Important Statistical Techniques. @abacusai #BigData #Analytics #DataScience #AI #MachineLearning #IoT #IIoT #Python #RStats #TensorFlow #Java #JavaScript #ReactJS #CloudComputing #Serverless #Linux #Statistics #Programming #Coding #100DaysofCode…

LLMs are injective and invertible. In our new paper, we show that different prompts always map to different embeddings, and this property can be used to recover input tokens from individual embeddings in latent space. (1/6)

GladiaLab's tweet image. LLMs are injective and invertible.

In our new paper, we show that different prompts always map to different embeddings, and this property can be used to recover input tokens from individual embeddings in latent space.

(1/6)

Beginnersblog

@beginnersblog1

年10月26日

90% of ML portfolios look identical. If yours only has Titanic survival predictions, you’re invisible. These 5 projects? They’ll make hiring managers stop scrolling and start DM’ing you Most job seekers showcase the same old projects: Titanic survival, MNIST digit recognition,…

beginnersblog1's tweet image. 90% of ML portfolios look identical. If yours only has Titanic survival predictions, you’re invisible. These 5 projects? They’ll make hiring managers stop scrolling and start DM’ing you

Most job seekers showcase the same old projects: Titanic survival, MNIST digit recognition,…

Python Programming

@PythonPr

年10月22日

TOP 8 Machine Learning Regression Metrics

Suhas Kotha

@kothasuhas

年9月19日

Since compute grows faster than the web, we think the future of pre-training lies in the algorithms that will best leverage ♾ compute We find simple recipes that improve the asymptote of compute scaling laws to be 5x data efficient, offering better perf w/ sufficient compute

kothasuhas's tweet image. Since compute grows faster than the web, we think the future of pre-training lies in the algorithms that will best leverage ♾ compute

We find simple recipes that improve the asymptote of compute scaling laws to be 5x data efficient, offering better perf w/ sufficient compute

Alex L Zhang

@a1zhang

年10月15日

What if scaling the context windows of frontier LLMs is much easier than it sounds? We’re excited to share our work on Recursive Language Models (RLMs). A new inference strategy where LLMs can decompose and recursively interact with input prompts of seemingly unbounded length,…

a1zhang's tweet image. What if scaling the context windows of frontier LLMs is much easier than it sounds?

We’re excited to share our work on Recursive Language Models (RLMs). A new inference strategy where LLMs can decompose and recursively interact with input prompts of seemingly unbounded length,…

ueaj

@_ueaj

年10月26日

New blog post! This one is a purely theoretical one attempting identifying the central reason why LLMs suffer from mode collapse in RL and fail to generate novel or truly diverse outputs. It's actually a way more complicated problem than you think! Naively encouraging…

_ueaj's tweet image. New blog post! This one is a purely theoretical one attempting identifying the central reason why LLMs suffer from mode collapse in RL and fail to generate novel or truly diverse outputs. It's actually a way more complicated problem than you think!

Naively encouraging…

Probability and Statistics

@probnstat

年9月1日

Network regression analyzes relationships in networked data, like social media connections or biological pathways. It goes beyond traditional models by accounting for interdependencies between nodes. In statistics and ML, it’s used for tasks like link prediction and community…

probnstat's tweet image. Network regression analyzes relationships in networked data, like social media connections or biological pathways. It goes beyond traditional models by accounting for interdependencies between nodes. In statistics and ML, it’s used for tasks like link prediction and community…

Probability and Statistics

@probnstat

年10月27日

Statistics by simulation, or Monte Carlo methods, uses computational power to solve problems that are too hard for traditional math. Instead of solving complex equations, we program a computer to mimic a random process (like rolling dice or modeling stock prices) millions of…

probnstat's tweet image. Statistics by simulation, or Monte Carlo methods, uses computational power to solve problems that are too hard for traditional math. Instead of solving complex equations, we program a computer to mimic a random process (like rolling dice or modeling stock prices) millions of…

Quant Science

@quantscience_

年10月26日

159 page PDF download. The best examples of how machine learning is used in finance and algorithmic trading. Grab the paper here:

quantscience_'s tweet image. 159 page PDF download.

The best examples of how machine learning is used in finance and algorithmic trading.

Grab the paper here:

StudyGuides.com

@studyguides

39 分鐘

Analysis of Variance (ANOVA) is a statistical method used to test differences between two or more means. It helps determine if the means of different groups are significantly different from each other. ANOVA is particularly useful when comparing three or more groups, as it avoids…

studyguides's tweet image. Analysis of Variance (ANOVA) is a statistical method used to test differences between two or more means. It helps determine if the means of different groups are significantly different from each other. ANOVA is particularly useful when comparing three or more groups, as it avoids…

Alex Prompter

@alex_prompter

17 小時

Everyone says “LLMs are black boxes.” This paper "How Do LLMs Use Their Depth?” just opened one and showed how intelligence forms layer by layer. They follow a “Guess → Refine” strategy: • Early layers make statistical guesses using frequent tokens (“the”, “of”, “and”) •…

alex_prompter's tweet image. Everyone says “LLMs are black boxes.”

This paper "How Do LLMs Use Their Depth?” just opened one and showed how intelligence forms layer by layer.

They follow a “Guess → Refine” strategy:

• Early layers make statistical guesses using frequent tokens (“the”, “of”, “and”)
•…

Probability and Statistics

@probnstat

年10月26日

PCA (Principal Component Analysis) is a powerful machine learning method for dimensionality reduction. It finds the most important patterns (principal components) in high-dimensional data, allowing it to be compressed with minimal information loss. In ML, this speeds up model…

probnstat's tweet image. PCA (Principal Component Analysis) is a powerful machine learning method for dimensionality reduction. It finds the most important patterns (principal components) in high-dimensional data, allowing it to be compressed with minimal information loss. In ML, this speeds up model…

Nathan Lambert

@natolambert

年10月16日

The first fantastic paper on scaling RL with LLMs just dropped. I strongly recommend taking a look and will be sharing more thoughts on the blog soon. The Art of Scaling Reinforcement Learning Compute for LLMs Khatri & Madaan et al.

natolambert's tweet image. The first fantastic paper on scaling RL with LLMs just dropped. I strongly recommend taking a look and will be sharing more thoughts on the blog soon.

The Art of Scaling Reinforcement Learning Compute for LLMs
Khatri &amp; Madaan et al.

tokenbender

@tokenbender

年10月17日

this paper costed 4.2 mil USD to write holy... most labs haven't reached the point of releasing models that costed that much let alone a paper that covers all the details

tokenbender's tweet image. this paper costed 4.2 mil USD to write
holy...
most labs haven't reached the point of releasing models that costed that much
let alone a paper that covers all the details

Selçuk Korkmaz

@selcukorkmaz

年12月2日

Key Takeaways from “Regression Modeling Strategies” by Frank Harrell (@f2harrell) A must-read for anyone working with predictive modeling. Here’s what you need to know: Plan your model with clear goals—whether prediction, effect estimation, or hypothesis testing. Avoid…

selcukorkmaz's tweet image. Key Takeaways from “Regression Modeling Strategies” by Frank Harrell (@f2harrell)

A must-read for anyone working with predictive modeling. Here’s what you need to know:

Plan your model with clear goals—whether prediction, effect estimation, or hypothesis testing. Avoid…

B. Noted

@LetIt_BNoted

20 小時

***Ordinary Regression versus Logistic Regression *** Mathematical Foundation • Ordinary Regression uses the least squares method to minimize the sum of squared errors between predicted and actual values. • Logistic Regression uses maximum likelihood estimation (MLE) to find…