So Essentially

So Essentially

Home
Podcast
Archive
About
WebWatcher Wins!
Alibaba model WebWatch processes visual web search with ease!
Aug 19 • 
Dhruv Diddi

May 2025

Reward Modeling Reasoning Tuned!
Fine tuning should consider reasoning are a foundational standard
May 8 • 
Dhruv Diddi
Reinforcement Learning with Tools
Your AI agent should be reasonably tuned to your tools
May 6 • 
Dhruv Diddi
Identification of Gaps in AI Governance
We need to focus on real world AI deployment
May 6 • 
Dhruv Diddi
3

April 2025

Tom and Jerry Videos
Generate your own one minute Tom and Jerry episodes with 5B diffusion model
Apr 8 • 
Dhruv Diddi
1
DeepMind releases Distributed Low-Communication LLM Training
DiLoCo does not seem like loco idea
Apr 8 • 
Dhruv Diddi

February 2025

Latent Space Reasoning
Latent Space reasoning is at least 10X better than word space reasoning
Feb 10 • 
Dhruv Diddi
MM-IQ Test for AI
AI has low Visual IQ
Feb 5 • 
Dhruv Diddi
Do you even PhysBench bro?
PhysBench shows VLMs exhibit poor understanding of the physical world
Feb 4 • 
Dhruv Diddi
DiLoCo Distributed Lunch
DiLoCo trained distributed better than co-located accelerators
Feb 4 • 
Dhruv Diddi
1

January 2025

On-the-Fly Persona Alignment with TPO
Test-time Preference Optimization implies better persona aligned responses
Jan 23 • 
Dhruv Diddi
3
ByteDance releases Agent R
Agent R has Reflective Self Training
Jan 22 • 
Dhruv Diddi
© 2025 So Essentially
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture