So Essentially
Subscribe
Sign in
Home
Podcast
Archive
About
Latest
Top
Discussions
WebWatcher Wins!
Alibaba model WebWatch processes visual web search with ease!
Aug 19
•
Dhruv Diddi
May 2025
Reward Modeling Reasoning Tuned!
Fine tuning should consider reasoning are a foundational standard
May 8
•
Dhruv Diddi
Reinforcement Learning with Tools
Your AI agent should be reasonably tuned to your tools
May 6
•
Dhruv Diddi
Identification of Gaps in AI Governance
We need to focus on real world AI deployment
May 6
•
Dhruv Diddi
3
April 2025
Tom and Jerry Videos
Generate your own one minute Tom and Jerry episodes with 5B diffusion model
Apr 8
•
Dhruv Diddi
1
DeepMind releases Distributed Low-Communication LLM Training
DiLoCo does not seem like loco idea
Apr 8
•
Dhruv Diddi
February 2025
Latent Space Reasoning
Latent Space reasoning is at least 10X better than word space reasoning
Feb 10
•
Dhruv Diddi
MM-IQ Test for AI
AI has low Visual IQ
Feb 5
•
Dhruv Diddi
Do you even PhysBench bro?
PhysBench shows VLMs exhibit poor understanding of the physical world
Feb 4
•
Dhruv Diddi
DiLoCo Distributed Lunch
DiLoCo trained distributed better than co-located accelerators
Feb 4
•
Dhruv Diddi
1
January 2025
On-the-Fly Persona Alignment with TPO
Test-time Preference Optimization implies better persona aligned responses
Jan 23
•
Dhruv Diddi
3
ByteDance releases Agent R
Agent R has Reflective Self Training
Jan 22
•
Dhruv Diddi
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts