manish
@manish_iitg
AI/ML, Programming, Finance, Coding
You might like
I admitted my son to a hospital today. I have a ₹1.2 crore Acko Platinum Health Plan the one with no room rent limit. @ACKOIndia @duavarun Guess what? The hospital flat out denied me a suite room. 👇🏻
Weekends are made up. It’s just another day to do what you love.
shiva burns ignorance with fire—ai burns data with loss: L = -log p(y|x). each epoch’s a tandava, shredding dumb. happy mahashivratri
Large heap of garbage is accumulating behind Bhutani Alphathum Noida (Near Sector 137). Please get it removed @noida_authority @CeoNoida @myogiadityanath
And so it begins. The Pune Porsche crash accused, including the underage son, have ALL named the Agarwal family driver. He is being questioned today.
Tired of MMLU? The current models already hit the ceiling? It's time to upgrade MMLU! Introducing our new benchmark MMLU-Pro, a more robust and challenging massive multi-task language understanding benchmark with 12K questions. What's New? 1. MMLU-Pro uses 10 options instead of…
The EU has found cancer-causing chemicals in 527 Indian products reaching their borders. Last night, after an hour or two of Internet search, I believe I succeeded in finding out the likely source of that list, and what I saw is really worrying. I will share all my findings in…
Releasing aditi synthetic dataset containing high quality multi turn/instruct conversation on hinglish/hindi huggingface.co/datasets/manis… Also open sourcing the data pipeline used to generate these conversations so you can extend the same to your use case github.com/manishiitg/adi…
github.com
GitHub - manishiitg/aditi_dataset
Contribute to manishiitg/aditi_dataset development by creating an account on GitHub.
Want to learn how to generate synthetic data for fine-tuning large language models? Check out the airoboros repo: github.com/jondurbin/airo… This awesome open-source tool provides a full framework to create high-quality synthetic datasets tailored to your LLM use case.
github.com
GitHub - jondurbin/airoboros: Customizable implementation of the self-instruct paper.
Customizable implementation of the self-instruct paper. - jondurbin/airoboros
When fine tuning LLM's what's the best to reduce the size of the dataset to filter out the lower quality document? Should word embedding be used to remove similar documents?
Not able to get dpo working at all.. what's the secret? What am I missing
We’ve backed several of the fastest growing and largest open source founders in India over the last few years via @OSSCapital … Many millions of users. If this does not get reversed, we will reverse our historical position of encouraging founders to stay put and build their…
India just kissed its future goodbye! Every company deploying a GenAI model now requires approval from the Indian government! That is, you now need approval for merely deploying a 7b open source model 🤯🤯 If you know the Indian government, you know this will a huge drag!…
🚀 Exciting News! Introducing Open-Aditi-Hi-v2, our cutting-edge Language Model for Hindi! 🌟 With superior performance over existing models like Airavata and OpenHati, it's perfect for content generation, chatbots, and more. huggingface.co/manishiitg/ope…
huggingface.co
manishiitg/open-aditi-hi-v2 · Hugging Face
manishiitg/open-aditi-hi-v2 · Hugging Face
Introducing Eagle-7B Based on the RWKV-v5 architecture, bringing into opensource space, the strongest - multi-lingual model (beating even mistral) - attention-free transformer today (10-100x+ lower inference) With comparable English performance with the best 1T 7B models
United States Trends
- 1. Florida 102K posts
- 2. Texas 174K posts
- 3. #SmallBusinessSaturday 2,321 posts
- 4. Ohio State 27.6K posts
- 5. Kentucky 14.5K posts
- 6. Go Blue 6,745 posts
- 7. Buckeyes 4,947 posts
- 8. Go Bucks 2,252 posts
- 9. Good Saturday 35.9K posts
- 10. Tyler Adams 2,898 posts
- 11. Saban 5,471 posts
- 12. Raphinha 18.9K posts
- 13. Leeds 24.8K posts
- 14. Georgia 51.6K posts
- 15. #MeAndTheeSeriesEP3 1.46M posts
- 16. #SaturdayVibes 4,012 posts
- 17. Gameday 13.8K posts
- 18. Bernal 14.6K posts
- 19. #StrayKidsAtMAMA2025 13.5K posts
- 20. Grade 3 2,801 posts
Something went wrong.
Something went wrong.