Serving the Quantitative Finance Community

 
jasonbell
Topic Author
Posts: 310
Joined: May 6th, 2022, 4:16 pm
Location: Limavady, NI, UK

Re: AI's Mean Reversion.

January 29th, 2025, 9:06 pm

Have to admit I love Reinforcement Learning. I also love the amount of backpedalling the US companies are doing at the moment.
LinkedIn: https://www.linkedin.com/in/jasonbelldata/
Author of Machine Learning: Hands-On for Developers and Technical Professionals (Wiley).
Contributor: Machine Learning in the City (Wiley).
 
katastrofa
Posts: 7929
Joined: August 16th, 2007, 5:36 am
Location: Event Horizon

Re: AI's Mean Reversion.

January 30th, 2025, 10:21 am

RL never went away.
Reading the DS paper: arXiv:2501.12948
While they aim to explore innovative solutions, they are by necessity following in the footsteps of OpenAI. They started with pure RL, but it went badly, so they added human feedback as a cold start. ChatGPT went all the way with this approach (RLHF, RL from Human Feedback) to make the chat more "human-like". There are many models out there similar or superior to DS, Chinese or not (I'm sure China has better models behind the digital wall). The only difference is that DS is marketed outside of China and as a research innovation.