#visionlanguagemodel search results

We are excited to be among the very first groups selected by @NVIDIARobotics to test the new @NVIDIA #Thor. We have managed to run a #VisionLanguageModel (Qwen 2.5 VL) for semantic understanding of the environment, along with a monocular depth model (#DepthAnything v2), for safe…
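The team's demo code isn't shared in the thread, but the two-model setup they describe can be sketched with Hugging Face transformers. A minimal sketch, assuming a recent transformers release, a local scene.jpg, and the 3B Qwen2.5-VL-Instruct and Depth-Anything-V2-Small checkpoints (all assumptions, not the authors' actual pipeline):

```python
# Hypothetical sketch: pair a VLM (Qwen2.5-VL) with a monocular depth model
# (Depth Anything v2) on the same frame, roughly as the tweet describes.
# Model IDs, prompt, and file names are assumptions.
from PIL import Image
from transformers import AutoProcessor, Qwen2_5_VLForConditionalGeneration, pipeline

image = Image.open("scene.jpg")  # placeholder input frame

# 1) Semantic understanding: ask the VLM what is in the scene.
vlm_id = "Qwen/Qwen2.5-VL-3B-Instruct"
processor = AutoProcessor.from_pretrained(vlm_id)
vlm = Qwen2_5_VLForConditionalGeneration.from_pretrained(
    vlm_id, torch_dtype="auto", device_map="auto"
)
messages = [{"role": "user", "content": [
    {"type": "image"},
    {"type": "text", "text": "List the objects and potential obstacles in this scene."},
]}]
prompt = processor.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = processor(text=[prompt], images=[image], return_tensors="pt").to(vlm.device)
out = vlm.generate(**inputs, max_new_tokens=128)
print(processor.batch_decode(out[:, inputs["input_ids"].shape[1]:], skip_special_tokens=True)[0])

# 2) Geometry: dense relative depth from the same frame, to fuse with the semantics above.
depth = pipeline("depth-estimation", model="depth-anything/Depth-Anything-V2-Small-hf")
result = depth(image)
print(result["predicted_depth"].shape)
```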


The 32×32 Patch Grid: Why does ColPali “see” so well? Each page is divided into patch grids—so it knows exactly where an image ends and text begins. That local + global context means no detail is missed, from small icons to big headers. #colpali #visionlanguagemodel
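For context on the mechanism the tweet is describing: ColPali scores a query against a page's patch embeddings with ColBERT-style late interaction, so a query token only needs to match one patch strongly for a small local detail to lift the score. A minimal MaxSim sketch with made-up shapes (the 128-dim embeddings and 12 query tokens are illustrative, not ColPali's exact configuration):

```python
# Illustrative late-interaction (MaxSim) scoring over a 32x32 grid of patch
# embeddings. Shapes and dimensions are invented for the sketch.
import torch

num_patches, dim = 32 * 32, 128      # one embedding per page patch
num_query_tokens = 12                # one embedding per query token

page = torch.nn.functional.normalize(torch.randn(num_patches, dim), dim=-1)
query = torch.nn.functional.normalize(torch.randn(num_query_tokens, dim), dim=-1)

# For each query token, take its best-matching patch, then sum over tokens.
# A tiny icon only needs to win a few of these max operations to matter.
sim = query @ page.T                  # (num_query_tokens, num_patches)
score = sim.max(dim=-1).values.sum()
print(float(score))
```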


buff.ly/41bdyNy New, open-source AI vision model emerges to take on ChatGPT — but it has issues #AIImpactTour #NousHermes2Vision #VisionLanguageModel


#UITARS Desktop: The Future of Computer Control through Natural Language 🖥️ 🎯 #ByteDance introduces a GUI agent powered by a #VisionLanguageModel for intuitive computer control Code: lnkd.in/eNKasq56 Paper: lnkd.in/eN5UPQ6V Models: lnkd.in/eVRAwA-9 #ai 🧵 ↓


Google has released PaliGemma, a new vision-language model that takes images and text as input and outputs text. PaliGemma comes in three variants: pretrained, mix, and fine-tuned models, and covers image captioning, visual question answering, object detection, referring segmentation, and more. #GoogleAI #PaliGemma #VisionLanguageModel huggingface.co/blog/paligemma
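The linked blog post demonstrates PaliGemma through Hugging Face transformers; a minimal captioning sketch along those lines, assuming the paligemma-3b-mix-224 checkpoint and the conventional "caption en" task prefix (both assumptions, not necessarily the post's exact example):

```python
# Minimal PaliGemma captioning sketch via Hugging Face transformers.
# Checkpoint name, prompt prefix, and image path are assumptions.
from PIL import Image
from transformers import AutoProcessor, PaliGemmaForConditionalGeneration

model_id = "google/paligemma-3b-mix-224"
processor = AutoProcessor.from_pretrained(model_id)
model = PaliGemmaForConditionalGeneration.from_pretrained(
    model_id, torch_dtype="auto", device_map="auto"
)

image = Image.open("example.jpg")
inputs = processor(text="caption en", images=image, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=32)
# Decode only the newly generated tokens, skipping the prompt.
print(processor.decode(out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```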


2/ 🎯 MiniGPT-4 empowers image description generation, story writing, problem-solving, and more! 💻 Open source availability fuels innovation and collaboration. ✨ The future of vision-language models is here! minigpt-4.github.io #AI #MiniGPT4 #VisionLanguageModel


Seeing #VisionLanguageModel with Qwen 2.5 VL + DepthAnything v2 running live on Jetson Thor is next-level for robotics. Fusing semantic context with real-time depth makes agile, adaptive bots possible. What benchmarks should we watch for? #AI


Clinical notes, X-rays, charts—our new VLM interprets them all. See the model in AWS Marketplace: 🔗 hubs.li/Q03sfVDZ0 #VisionLanguageModel #RadiologyAI #ClinicalAI #MedicalImaging #GenerativeAI


Conventional AI models (VLMs) are good at captioning an entire image, but struggle to describe a specified "region" in detail. Zooming in loses the surrounding context, and high-quality training data has also been scarce 📉. #VisionLanguageModel #VLM #AI課題


This 6-hour video from Umar Jamil @hkproj has to be the finest video on building a VLM from scratch. Next goal: fine-tuning on image segmentation or object detection. youtube.com/watch?v=vAmKB7… #LargeLanguageModel #VisionLanguageModel



Qwen AI Releases Qwen2.5-VL: A Powerful Vision-Language Model for Seamless Computer Interaction #QwenAI #VisionLanguageModel #AIInnovation #TechForBusiness #MachineLearning itinai.com/qwen-ai-releas…


Save time QC'ing label quality with @Labellerr1's new feature and do it 10X faster. See the demo below. #qualitycontrol #imagelabeling #visionlanguagemodel #visionai


1/ ⚙️ Efficient training with only a single linear projection layer. 🌐 Promising results from finetuning on high-quality, well-aligned datasets. 📈 Comparable performance to the impressive GPT-4 model. #MiniGPT4 #VisionLanguageModel #MachineLearning
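The "single linear projection layer" point is the core of MiniGPT-4's design: the vision encoder and the LLM stay frozen, and the only trained bridge is one linear map from visual features into the LLM's token-embedding space. A schematic sketch with illustrative dimensions (not MiniGPT-4's exact sizes):

```python
# Schematic of MiniGPT-4's alignment idea: a single trainable linear layer maps
# frozen vision features into the frozen LLM's token-embedding space.
# Dimensions below are illustrative, not the real configuration.
import torch
import torch.nn as nn

vision_dim, llm_dim = 768, 4096          # vision feature size -> LLM hidden size
num_visual_tokens, seq_len = 32, 16

proj = nn.Linear(vision_dim, llm_dim)    # the only trained component

visual_feats = torch.randn(1, num_visual_tokens, vision_dim)  # frozen vision encoder output
text_embeds = torch.randn(1, seq_len, llm_dim)                # frozen LLM token embeddings

# Projected visual tokens are prepended to the text embeddings and fed to the LLM.
llm_inputs = torch.cat([proj(visual_feats), text_embeds], dim=1)
print(llm_inputs.shape)  # (1, num_visual_tokens + seq_len, llm_dim)
```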


Had a fantastic time at the event where @ritwik_raha delivered an insightful session on PaliGemma! It was very interactive and informative. #PaliGemma #google #visionlanguagemodel #AI


Comparing image generation with various vision (VLM) models via Ollama (Llama 3.2, llava-llama3, llama-phi) <ComfyUI workflow included> URL: blog.naver.com/beyond-zero/22… #ComfyUI #Llama #visionlanguagemodel #VLM #LLM #이미지생성형
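Outside ComfyUI, the same kind of side-by-side comparison can be run with the ollama Python client. A hypothetical sketch: the model tags, prompt, and image path are assumptions, and each model must already be pulled locally with `ollama pull`:

```python
# Hypothetical sketch: compare local vision models through the ollama Python
# client (pip install ollama). Model tags and the prompt are assumptions,
# not the blog post's exact setup.
import ollama

models = ["llama3.2-vision", "llava-llama3"]
for name in models:
    response = ollama.chat(
        model=name,
        messages=[{
            "role": "user",
            "content": "Describe this image in one sentence.",
            "images": ["example.jpg"],  # placeholder local image
        }],
    )
    print(name, "->", response["message"]["content"])
```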

