Archive - So Essentially

WebWatcher Wins!

Alibaba's WebWatcher model handles visual web search with ease!

Aug 19

May 2025

Reward Modeling Reasoning Tuned!

Fine-tuning should treat reasoning as a foundational standard

May 8

Reinforcement Learning with Tools

Your AI agent should be reasonably tuned to your tools

May 6

Identification of Gaps in AI Governance

We need to focus on real-world AI deployment

May 6

April 2025

Tom and Jerry Videos

Generate your own one-minute Tom and Jerry episodes with a 5B diffusion model

Apr 8

DeepMind releases Distributed Low-Communication LLM Training

DiLoCo does not seem like a loco idea

Apr 8

February 2025

Latent Space Reasoning

Latent-space reasoning is at least 10X better than word-space reasoning

Feb 10

MM-IQ Test for AI

AI has a low visual IQ

Feb 5

Do you even PhysBench bro?

PhysBench shows VLMs exhibit poor understanding of the physical world

Feb 4

DiLoCo Distributed Lunch

DiLoCo trains better distributed than on co-located accelerators

Feb 4

January 2025

On-the-Fly Persona Alignment with TPO

Test-time Preference Optimization yields better persona-aligned responses

Jan 23

ByteDance releases Agent R

Agent R uses reflective self-training

Jan 22
