Ethan Mollick’s Post

Ethan Mollick

Remember BloombergGPT, which was a specially trained finance LLM, drawing on all of Bloomberg's data? It made a bunch of firms decide to train their own models to reap the benefits of their special information and data. You may not have seen that GPT-4 (the old, pre-turbo version with a small context window), without specialized finance training or special tools, beat it on almost all finance tasks. It is part of a pattern - the smartest generalist frontier models beat specialized models in specialized topics. Your special proprietary data may be less useful than you think in the world of LLMs... https://lnkd.in/e4QKBFPK


Yes, smaller models have advantages over larger models in areas like speed & cost, which is why we are likely to see many types of LLMs work together 👇 But BloombergGPT was trained on financial data specifically to be better than generalist models at financial analysis, which it wasn't. https://www.oneusefulthing.org/p/an-ai-haunted-world

Sanchit Garg

Generalist | 3x Startups | IIM Indore | IIIT Delhi

4mo

Could one of the reasons be that Bloomberg's proprietary data comprised only 0.7% of the overall training dataset? Bloomberg portrayed the majority of the data as their own financial data; however, per their research paper, 99.3% of it was generally available public data.

Gang Lee

Founder & CEO at ELGO Technologies

4mo

It is always a balancing act. While the best generalist model might outperform specialized models, the compute and hosting costs of generalist models are usually huge compared to specialized ones. Companies look at several aspects beyond just cost, such as control, security, and privacy, when choosing the right model to deploy for their use case. I believe that specialized models are still relevant (if not more relevant) in the future. We will start seeing more ensembles of a generalist model (for reasoning and routing tasks) and specialized models (for specialized tasks).
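The ensemble pattern described in this comment can be sketched as a small dispatcher. Everything below is an illustrative assumption, not a real API: the "models" are stand-in functions, and the keyword router is a placeholder for what would in practice be a classifier or the generalist model itself making the routing decision.

```python
from typing import Callable, Dict

# Stand-ins for specialized models; in a real system these would be
# API calls or local inference endpoints (names are hypothetical).
def finance_model(query: str) -> str:
    return f"[finance-specialist] {query}"

def legal_model(query: str) -> str:
    return f"[legal-specialist] {query}"

def generalist_model(query: str) -> str:
    return f"[generalist] {query}"

SPECIALISTS: Dict[str, Callable[[str], str]] = {
    "finance": finance_model,
    "legal": legal_model,
}

def route(query: str) -> str:
    """Naive keyword router: send in-domain queries to a specialist,
    everything else to the generalist model."""
    lowered = query.lower()
    if any(w in lowered for w in ("10-q", "earnings", "ebitda")):
        return SPECIALISTS["finance"](query)
    if any(w in lowered for w in ("contract", "clause", "liability")):
        return SPECIALISTS["legal"](query)
    return generalist_model(query)
```

In production the routing step is often itself an LLM call (e.g. via tool/function-calling), trading a little latency for much better dispatch accuracy than keyword matching.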

Gary Longsine

Fractional CTO. Collaborate • Deliver • Iterate. 📱

4mo

Some technology races are won by watching them for a while, from the sidelines, before entering. If one assumes that this won't be a "winner take all" game (and be warned, many people assume that it *is* just that) then it might be best to build some infrastructure and practice a bit, and build up team skills — but with the expectation that this round is just preparation.

Harsh Singhal

I solve business problems with data+algos | ML@Adobe | Led the ML team at Koo | Prev at Netflix, LinkedIn California | Relocated to Blr 2021 | Visiting faculty at MSRIT | LLMs are epistemology probes

4mo

This has serious implications for the open-source LLM ecosystem, especially since the primary advantage of open-source LLMs is fine-tuning. And come to think of it, GPT-4 continues to be the go-to solution for generating SFT data.

Manprit Singh

Data and AI CTO Healthcare and Fintech

4mo

Other recent studies show that prompting strategies alone can be effective in evoking this kind of domain-specific expertise from generalist foundation models. https://www.microsoft.com/en-us/research/blog/the-power-of-prompting/
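A minimal sketch of the idea (this is not Microsoft's actual Medprompt code): a prompting strategy can steer a generalist model toward domain expertise by combining a role instruction, a few in-domain examples, and a chain-of-thought cue into a single prompt. The function name and structure here are illustrative assumptions.

```python
def build_expert_prompt(role, examples, question):
    """Assemble a domain-expert prompt from a role instruction,
    few-shot (question, answer) examples, and a reasoning cue."""
    parts = [f"You are {role}. Reason step by step before answering."]
    for q, a in examples:
        parts.append(f"Q: {q}\nA: {a}")
    # Chain-of-thought cue on the final, unanswered question.
    parts.append(f"Q: {question}\nA: Let's think step by step.")
    return "\n\n".join(parts)
```

The resulting string would be sent as the model input; the few-shot examples supply the domain framing that a specialized fine-tune would otherwise bake into the weights.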

James V Baber

Technology Leadership | AI Transformation Consulting | AI Startup Investor

4mo

RAG is a hack primarily because we have to chunk data into context windows with the same 8KB memory capacity as a calculator from the mid-1980s, then feed it to an LLM with significant token costs, so the quantity of chunks delivered for processing is necessarily kept to a minimum. This results in missed content, even with Vector Search, Semantic Ranking, and Knowledge Graphs.

I challenge anyone to build a RAG model with your favorite LLM where you upload 500 SEC Form 10-Q documents that are 25-page PDFs, then ask your RAG-enabled LLM to list all 500 10-Qs by company name and summarize the operational challenges of each. It either won't work due to limits, or you'll spend a fortune on tokens. Your expectations for what you can achieve with a RAG model have to be constrained to its structural ability, context window size, and token costs.

However, a recent RAG model I built for an engineering (materials testing) lab far exceeds generalist frontier models, drafting with a Professional Engineer's terminology and context; but that's because I knew exactly how to build it (with some NLP summarization tricks up my sleeve), knew my client's expectations, and gave explicit instructions and sample queries on how to use it effectively.
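The token-cost point in this comment can be checked with back-of-envelope arithmetic. Every number below is an assumption for illustration: tokens per page and price per 1K tokens vary widely by model, tokenizer, and document density.

```python
DOCS = 500                 # SEC Form 10-Q filings (from the comment)
PAGES_PER_DOC = 25         # pages per filing (from the comment)
TOKENS_PER_PAGE = 500      # assumed average for dense PDF text
PRICE_PER_1K_INPUT = 0.01  # assumed dollars per 1K input tokens

# Tokens required just to read every page of every filing once.
total_tokens = DOCS * PAGES_PER_DOC * TOKENS_PER_PAGE

# Cost of a single full pass over the corpus (input tokens only;
# output tokens for 500 summaries would add more on top).
cost_one_pass = total_tokens / 1000 * PRICE_PER_1K_INPUT

print(total_tokens)              # 6250000
print(round(cost_one_pass, 2))   # 62.5
```

Even under these modest assumptions, one exhaustive pass is millions of tokens, which is why retrieval keeps the chunk count low and why exhaustive list-and-summarize queries strain RAG systems.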

Abhishek Gupta

Simplifying technology adoption, step-by-step | AE @ Whatfix

4mo

Is there a case for adding guardrails when training purpose-specific models, thereby decreasing the degrees of freedom?

Dan Wasserman

Less talk, more prompting

4mo

Every business leader wants a model trained on their proprietary data, but this is a great example of why I hesitate to recommend that right now. It's not the slam dunk you'd expect.

Joseph Pareti

AI Consultant @ Joseph Pareti's AI Consulting Services | AI in CAE, HPC, Health Science

4mo

'Your special proprietary data may be less useful than you think' --- I would be careful with that type of statement. One example in #healthscience: #BioNeMo provides large-scale, optimized training on YOUR OWN DATA.
