jayvanzyl's profile picture. Interested in real-time predictions and experimentation http://ecosystem.Ai

Jay van Zyl

@jayvanzyl

Interested in real-time predictions and experimentation http://ecosystem.Ai

Important factors to consider wrt cost of model training and serving: “SOTA models these days have about ~500B parameters and that represents at least ~1TB of GPU memory to operate with specialized infrastructure. That's a minimum of ~$60,000 - $100,000 p…lnkd.in/gKRDCbfa


StableLM is trained on a new experimental dataset built on The Pile, but three times larger with 1.5 trillion tokens of content. The richness of this dataset gives StableLM surprisingly high performance in conversational and coding…lnkd.in/g3UA_jPw lnkd.in/gQ47SQU7


The analogy between the syntax-semantics of natural languages and the sequence-function of proteins has revolutionized the way humans inves- tigate the language of life. lnkd.in/g_wnAatf


With YouTube creators becoming increasingly empowered by versatile generative AI tools, it will only amplify the rising trend of audiences consuming more user-generated content on TVs, conducive to more YouTube advertising revenue,…lnkd.in/g-ZSaH7p lnkd.in/g6xjWzgM


They say a good craftsman shouldn't blame his tools, but can a good tool [LLM] blame a shoddy craftsman? But Large language models specialize in generating human-like text. Correct answers are a bonus. lnkd.in/gVXfvhSE


Another key concept to understand: Most of the AI-generated images currently produced rely on Diffusion Models as their foundation. lnkd.in/gxdM5sAJ


Together with ecosystem.Ai real-time behavioral capabilities, generative models add a much needed angle to AI for business usefulness. Here is a another outline in summary for those who need a quick reference: Generativ…lnkd.in/g7pGgkep lnkd.in/gxkQY9KW


Cape Town looks like a safe option while we're working on solving all of this :) lnkd.in/gMYRVb_M


Excellent share @dxbrob. "It is perhaps uncontroversial to say that this claim that one of us made eight years ago (Soman, 2015) is now accepted as universal truth. Governments, for-profit organizations, not for profits, startups, consumer protect…lnkd.in/gGprFd8y


FinGPT emphasizes the critical significance of data collecting, cleaning, and preprocessing in creating open-source FinLLMs using a data-centric approach. FinGPT seeks to advance financial research, cooperation, and innovation by p…lnkd.in/gcDUmgy7 lnkd.in/g69ivMnZ


Great paper on transformers: “Transformer large language models (LLMs) have sparked admiration for their exceptional performance on tasks that demand intricate multi-step reasoning. Yet, these models simultaneously show failures on…lnkd.in/gsxXReqV lnkd.in/gwxwhsaN


Gorilla is a major addition to the list of language models, as it even addresses the issue of writing API calls. Its capabilities enable the reduction of problems related to hallucination and reliability. lnkd.in/g7g_qd-E


Another great set of models. Why use Falcon-40B? 1. It is the best open-source model currently available. Falcon-40B outperforms LLaMA, StableLM, RedPajama, MPT, etc. See the OpenLLM Leaderboard. 2. It features an architecture optimized for inference, wit…lnkd.in/gR-sq7cK

linkedin.com

Another great set of models. Why use Falcon-40B? 1. It is the best open-source model currently...

Another great set of models. Why use Falcon-40B? 1. It is the best open-source model currently available. Falcon-40B outperforms LLaMA, StableLM, RedPajama, MPT, etc. See the OpenLLM Leaderboard. 2....


As the commoditization of LLM models continue, here's a list to review. lnkd.in/g2ruTgi7


Loading...

Something went wrong.


Something went wrong.