
George Ho
@_eigenfoo
Natural language processing, Bayesian modeling, open source, crosswords, donuts and coffee. Currently ML at @flatironhealth (he/him/his)
hello I'm new to the stock market is it good when the intel ceo starts praying
“Let your eyes look straight ahead; fix your gaze directly before you. Give careful thought to the paths for your feet and be steadfast in all your ways” Proverbs 4:25-26
It was hard to find quality OCR data... until today! Super excited to announce the release of the 2 largest public OCR datasets ever 📜 📜 OCR is critical for document AI: here, 26M+ pages, 18b text tokens, 6TB! Thanks to @ucsf_library, @industrydocs and @PDFAssociation 🧶 ↓

The perfect peer-reviewed article title does not exi-

Very excited to introduce DocLLM, a multimodal LLM developed by my colleagues @jpmorgan. DocLLM-7B outperforms other SotA LLMs on 12/16 benchmarks within four core Document AI tasks! Incredibly proud of the team for their hard work. Check it out at arxiv.org/abs/2401.00908

JPMorgan announces DocLLM: A layout-aware generative language model for multimodal document understanding. Paper page: huggingface.co/papers/2401.00… Enterprise documents such as forms, invoices, receipts, reports, contracts, and other similar records, often carry rich semantics at the…


I sawed my copy of the power broker in half so that it’s easier to carry around. When a book’s size becomes an impediment to reading it, I feel like something’s gone seriously wrong

Hi yes hello good morning. I was on a podcast, talking about crossword archivism and milk cartons. You can listen to it here: podcast.data-is-plural.com/2159594/141791…
S2E5: Crosswords - Data Is Plural
Gerty and Carl Cori won the Nobel Prize together in 1947. Then 6 of their students won Nobel Prizes, all in physiology/medicine and chemistry. (Five separate prizes in total; one was shared.) amazon.com/Crucible-Scien…
Beyond ecstatic for our Cooper Brue team from @cooperunion for winning both best beer label and 3rd place overall in the annual beer brewing competition at AIChE. Go team and thanks Ana for helping us compete! And yes, the poster is hand drawn!



i would retire too if i had to rewrite the entire HuggingFace Trainer to work with HuggingFace Accelerate, jesus that must have been a nightmare
Yesterday was my last day at Hugging Face. The past three years have been exhilarating and I am very proud of what the team has accomplished during that time! Taking a bit of a break with opensource full time (though I will still contribute to Transformers and Accelerate)
Hello, long time no #crossword! A new #cryptic is up, and I’m pretty happy with it! My favorite clue: I'm about to stuff fruit with trace of radium — it might bring death (4,6) georgeho.org/crosswords/019/
#ML can extract clinically relevant information from EHRs at scale, but evaluating its quality has focused on single variables. This @flatironhealth study aims to evaluate ML's usefulness for research & RWE generation at scale: flatiron.com/resources/repl… @Cancers_MDPI
Big reveal of Flatiron Health #machinelearning with #language and documents in EHR. The full text explainer from our team is here: medrxiv.org/content/10.110…

Extracting meaningful clinical detail from EHRs for millions of patients with cancer is challenging. @FlatironHealth uses #NLP & #ML to extract key information from unstructured documents in the curation of high quality #RWD. Read more on our approach: flatiron.com/resources/appr…
Today I capitulated and finally learnt how to save places on Google Maps and I think this is about to change my life. Maybe my hyperfixated techie friends know a thing or two about using technology to improve lives after all