#bestopensourcedeeplearningtools résultats de recherche

Aucun résultat pour "#bestopensourcedeeplearningtools"

i’ve tested 99% of AI image generation methods since i started making these videos... 99% of outputs are slop, but me and my team have discovered a few really good frameworks that let’s us create extremely tasteful and high quality images We train our images off of the…

beechinour's tweet image. i’ve tested 99% of AI image generation methods since i started making these videos...

99% of outputs are slop, but me and my team have discovered a few really good frameworks that let’s us create extremely tasteful and high quality images

We train our images off of the…
beechinour's tweet image. i’ve tested 99% of AI image generation methods since i started making these videos...

99% of outputs are slop, but me and my team have discovered a few really good frameworks that let’s us create extremely tasteful and high quality images

We train our images off of the…
beechinour's tweet image. i’ve tested 99% of AI image generation methods since i started making these videos...

99% of outputs are slop, but me and my team have discovered a few really good frameworks that let’s us create extremely tasteful and high quality images

We train our images off of the…
beechinour's tweet image. i’ve tested 99% of AI image generation methods since i started making these videos...

99% of outputs are slop, but me and my team have discovered a few really good frameworks that let’s us create extremely tasteful and high quality images

We train our images off of the…

If you want to learn Deep Learning from the ground up to advanced techniques, this open resource is a gem. Full notebook suite -> Link in comments

HeyNina101's tweet image. If you want to learn Deep Learning from the ground up to advanced techniques, this open resource is a gem.

Full notebook suite -> Link in comments

🚀Introducing #CapRL, the first study of applying GRPO for the open-ended and subjective image captioning task. 🤯 🤖The trained CapRL-3B model achieves image captioning performance comparable to Qwen2.5-VL-72B. ✨CapRL introduces a novel training framework that redefines caption…

intern_lm's tweet image. 🚀Introducing #CapRL, the first study of applying GRPO for the open-ended and subjective image captioning task. 🤯
🤖The trained CapRL-3B model achieves image captioning performance comparable to Qwen2.5-VL-72B.
✨CapRL introduces a novel training framework that redefines caption…
intern_lm's tweet image. 🚀Introducing #CapRL, the first study of applying GRPO for the open-ended and subjective image captioning task. 🤯
🤖The trained CapRL-3B model achieves image captioning performance comparable to Qwen2.5-VL-72B.
✨CapRL introduces a novel training framework that redefines caption…
intern_lm's tweet image. 🚀Introducing #CapRL, the first study of applying GRPO for the open-ended and subjective image captioning task. 🤯
🤖The trained CapRL-3B model achieves image captioning performance comparable to Qwen2.5-VL-72B.
✨CapRL introduces a novel training framework that redefines caption…
intern_lm's tweet image. 🚀Introducing #CapRL, the first study of applying GRPO for the open-ended and subjective image captioning task. 🤯
🤖The trained CapRL-3B model achieves image captioning performance comparable to Qwen2.5-VL-72B.
✨CapRL introduces a novel training framework that redefines caption…

The 45 Best AI Tools for 2025 (Tried and Tested) Get started for FREE The best AI tools by category Part 5 Knowledge management: Presentations: Gamma, Copilot for PowerPoint Voice generation: ElevenLabs, Murf Music generation: Suno, Udio Sales: Attio Like+Rt Drop A Follow

anu_youraiwoman's tweet image. The 45 Best AI Tools for 2025 (Tried and Tested)

Get started for FREE
The best AI tools by category
Part 5

Knowledge management: 
Presentations: Gamma, Copilot for PowerPoint

Voice generation: ElevenLabs, Murf

Music generation: Suno, Udio

Sales: Attio

Like+Rt 
Drop A
Follow

Top 4 open-source LLM finetuning libraries!

DailyDoseOfDS_'s tweet image. Top 4 open-source LLM finetuning libraries!

🚨 This might be the biggest leap in AI agents since ReAct. Researchers just dropped DeepAgent a reasoning model that can think, discover tools, and act completely on its own. No pre-scripted workflows. No fixed tool lists. Just pure autonomous reasoning. It introduces…

rryssf_'s tweet image. 🚨 This might be the biggest leap in AI agents since ReAct.

Researchers just dropped DeepAgent a reasoning model that can think, discover tools, and act completely on its own.

No pre-scripted workflows. No fixed tool lists. Just pure autonomous reasoning.

It introduces…

🚨 DeepSeek just did something wild. They built an OCR system that compresses long text into vision tokens literally turning paragraphs into pixels. Their model, DeepSeek-OCR, achieves 97% decoding precision at 10× compression and still manages 60% accuracy even at 20×. That…

godofprompt's tweet image. 🚨 DeepSeek just did something wild.

They built an OCR system that compresses long text into vision tokens  literally turning paragraphs into pixels.

Their model, DeepSeek-OCR, achieves 97% decoding precision at 10× compression and still manages 60% accuracy even at 20×. That…

Quick tips for using AI to make banger images for your or others PFPs: <> Chatgpt. Better for creating images with sceneries <> Gemini. Better for creating specific adjustments ( Making your pfp bald or make it hold something ) Bookmark this 2 images for comparison

AtnsXBT's tweet image. Quick tips for using AI to make banger images for your or others PFPs:

&amp;lt;&amp;gt; Chatgpt. Better for creating images with sceneries 
&amp;lt;&amp;gt; Gemini. Better for creating specific adjustments ( Making your pfp bald or make it hold something ) 

Bookmark this 

2 images for comparison
AtnsXBT's tweet image. Quick tips for using AI to make banger images for your or others PFPs:

&amp;lt;&amp;gt; Chatgpt. Better for creating images with sceneries 
&amp;lt;&amp;gt; Gemini. Better for creating specific adjustments ( Making your pfp bald or make it hold something ) 

Bookmark this 

2 images for comparison

DeepSeek-OCR looks impressive, but its core idea is not new. Input “Text” as “Image” — already explored by: LANGUAGE MODELING WITH PIXELS (Phillip et al., ICLR 2023) CLIPPO: Image-and-Language Understanding from Pixels Only (Michael et al. CVPR 2023) Pix2Struct: Screenshot…

awinyimgprocess's tweet image. DeepSeek-OCR looks impressive, but its core idea is not new.

Input “Text” as “Image” — already explored by:
LANGUAGE MODELING WITH PIXELS (Phillip et al., ICLR 2023)
CLIPPO: Image-and-Language Understanding from Pixels Only (Michael et al. CVPR 2023)
Pix2Struct: Screenshot…
awinyimgprocess's tweet image. DeepSeek-OCR looks impressive, but its core idea is not new.

Input “Text” as “Image” — already explored by:
LANGUAGE MODELING WITH PIXELS (Phillip et al., ICLR 2023)
CLIPPO: Image-and-Language Understanding from Pixels Only (Michael et al. CVPR 2023)
Pix2Struct: Screenshot…
awinyimgprocess's tweet image. DeepSeek-OCR looks impressive, but its core idea is not new.

Input “Text” as “Image” — already explored by:
LANGUAGE MODELING WITH PIXELS (Phillip et al., ICLR 2023)
CLIPPO: Image-and-Language Understanding from Pixels Only (Michael et al. CVPR 2023)
Pix2Struct: Screenshot…
awinyimgprocess's tweet image. DeepSeek-OCR looks impressive, but its core idea is not new.

Input “Text” as “Image” — already explored by:
LANGUAGE MODELING WITH PIXELS (Phillip et al., ICLR 2023)
CLIPPO: Image-and-Language Understanding from Pixels Only (Michael et al. CVPR 2023)
Pix2Struct: Screenshot…

what a bold direction by deepseek once again. they took "a picture is worth a thousand words" literally or the idea of "photographic memory" if i am to commit the crime of anthropomorphisation.

tokenbender's tweet image. what a bold direction by deepseek once again. 
they took &quot;a picture is worth a thousand words&quot; literally or the idea of &quot;photographic memory&quot; if i am to commit the crime of anthropomorphisation.

Top 4 open-source LLM finetuning libraries:

DailyDoseOfDS_'s tweet image. Top 4 open-source LLM finetuning libraries:

I found 20 channels that’ll teach you AI for free. _______ P.S introducing FlowithOS — the world's first operating system natively built for ai agents. self-evolving. memory-powered. lightning-fast. try.flowith.io

socialwithaayan's tweet image. I found 20 channels that’ll teach you AI for free.

_______
P.S introducing FlowithOS — the world&apos;s first operating system natively built for ai agents. self-evolving. memory-powered. lightning-fast.

try.flowith.io

FlowithOS is scary GOOD. I ran: “Compare MacBook Air M3 (16GB RAM) prices across Amazon, BestBuy, and Apple in the US.” Results summarized in seconds



A more serious thread on the DeepSeek-OCR hype / serious misinterpretation going on. 1. On token reduction via representing text in images, researchers from Cambridge have previously shown that 500x prompt token compression is possible (ACL'25, Li, Su, and Collier). Without…

Kangwook_Lee's tweet image. A more serious thread on the DeepSeek-OCR hype / serious misinterpretation going on.

1. 
On token reduction via representing text in images, researchers from Cambridge have previously shown that 500x prompt token compression is possible (ACL&apos;25, Li, Su, and Collier). 

Without…
Kangwook_Lee's tweet image. A more serious thread on the DeepSeek-OCR hype / serious misinterpretation going on.

1. 
On token reduction via representing text in images, researchers from Cambridge have previously shown that 500x prompt token compression is possible (ACL&apos;25, Li, Su, and Collier). 

Without…
Kangwook_Lee's tweet image. A more serious thread on the DeepSeek-OCR hype / serious misinterpretation going on.

1. 
On token reduction via representing text in images, researchers from Cambridge have previously shown that 500x prompt token compression is possible (ACL&apos;25, Li, Su, and Collier). 

Without…

🚀 DeepSeek-OCR — the new frontier of OCR from @deepseek_ai , exploring optical context compression for LLMs, is running blazingly fast on vLLM ⚡ (~2500 tokens/s on A100-40G) — powered by vllm==0.8.5 for day-0 model support. 🧠 Compresses visual contexts up to 20× while keeping…

vllm_project's tweet image. 🚀 DeepSeek-OCR — the new frontier of OCR from @deepseek_ai , exploring optical context compression for LLMs, is running blazingly fast on vLLM ⚡ (~2500 tokens/s on A100-40G) — powered by vllm==0.8.5 for day-0 model support.

🧠 Compresses visual contexts up to 20× while keeping…
vllm_project's tweet image. 🚀 DeepSeek-OCR — the new frontier of OCR from @deepseek_ai , exploring optical context compression for LLMs, is running blazingly fast on vLLM ⚡ (~2500 tokens/s on A100-40G) — powered by vllm==0.8.5 for day-0 model support.

🧠 Compresses visual contexts up to 20× while keeping…
vllm_project's tweet image. 🚀 DeepSeek-OCR — the new frontier of OCR from @deepseek_ai , exploring optical context compression for LLMs, is running blazingly fast on vLLM ⚡ (~2500 tokens/s on A100-40G) — powered by vllm==0.8.5 for day-0 model support.

🧠 Compresses visual contexts up to 20× while keeping…

Generating an image from 1,000 words. Very excited to release Fibo 😃, the first ever open-source model trained exclusively on long, structured captions. Fibo sets a new standard for controllability and disentanglement in image generation [1/6] 🧵

MokadyRon's tweet image. Generating an image from 1,000 words.

Very excited to release Fibo 😃, the first ever open-source model trained exclusively on long, structured captions.

Fibo sets a new standard for controllability and disentanglement in image generation

 [1/6] 🧵

NVIDIA Just Released 8M Sample Open Dataset + OCR Tooling on @huggingface - 3x larger than v1 (just 2 months ago!) - Image/video QA, reasoning, multilingual OCR - Commercial-ready (CC-BY-4.0) @NVIDIAAI is one of the few major AI labs releasing datasets 🤗

vanstriendaniel's tweet image. NVIDIA Just Released 8M Sample Open Dataset + OCR Tooling  on @huggingface 

- 3x larger than v1 (just 2 months ago!)
- Image/video QA, reasoning, multilingual OCR
- Commercial-ready (CC-BY-4.0)

@NVIDIAAI is one of the few major AI labs releasing datasets 🤗

🚀 Deep‑Eye v1.4.0 Now Released 🔍 What’s new & Github Repo👇🏻

_0b1d1's tweet image. 🚀 Deep‑Eye v1.4.0 Now Released 🔍

What’s new &amp;amp; Github Repo👇🏻
_0b1d1's tweet image. 🚀 Deep‑Eye v1.4.0 Now Released 🔍

What’s new &amp;amp; Github Repo👇🏻
_0b1d1's tweet image. 🚀 Deep‑Eye v1.4.0 Now Released 🔍

What’s new &amp;amp; Github Repo👇🏻
_0b1d1's tweet image. 🚀 Deep‑Eye v1.4.0 Now Released 🔍

What’s new &amp;amp; Github Repo👇🏻

Google offers several powerful AI tools for free. They’re useful for professionals, creators, and businesses trying to use AI. From content to visuals to coding, they simplify how you work and enhance your creativity. Here are 11 free AI tools from Google (and how to use…

AndrewBolis's tweet image. Google offers several powerful AI tools for free.

They’re useful for professionals, creators, and businesses trying to use AI.

From content to visuals to coding, they simplify how you work and enhance your creativity.

Here are 11 free AI tools from Google (and how to use…

I should charge $99 for this. But I’m giving away my ChatGPT Images Mastery Guide for free. It has: → Step-by-step prompting walkthroughs → JSON templates to generate any image → Side-by-side comparisons with Gemini → My personal prompt cheatsheet This guide turns ChatGPT…

godofprompt's tweet image. I should charge $99 for this.

But I’m giving away my ChatGPT Images Mastery Guide for free.

It has:

→ Step-by-step prompting walkthroughs
→ JSON templates to generate any image
→ Side-by-side comparisons with Gemini
→ My personal prompt cheatsheet

This guide turns ChatGPT…

Loading...

Something went wrong.


Something went wrong.


United States Trends