The Agent Company: Benchmarking LLM Agents on Consequential Real World Tasks
Samuel Albanie
@samuelalbanie1About
AI research. Note that opinions expressed are my own. However, for conflict of interest purposes, please note that I'm employed by Google DeepMind (GDM). This almost certainly biases my judgment to some degree. I think GDM is pretty great. Other related content: - misc/outdoor stuff channel: https://www.youtube.com/@samuelalbaniemisc - https://x.com/SamuelAlbanie - https://bsky.app/profile/samuelalbanie.bsky.social FAQ: Software used to make videos - keynote on mac (to make slides) - Adobe Premiere Pro for editing
Latest Posts
No results found. Try different keywords.
Video Description
A video summary of "TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks" by Xu et al. (2024) Links: - paper: https://arxiv.org/abs/2412.14161 - website: https://the-agent-company.com/ - code: https://github.com/TheAgentCompany/TheAgentCompany
Empower Your AI Journey
AI-recommended products based on this video

NEXPOW Car Jump Starter,Car Battery Jump Starter Pack 1500A Peak Q10S for Up to 7.0L Gas and 5.5L Diesel Engine12V Auto Battery Booster,Jumper Cables,Portable Lithium Jump Box with LED Light/USB QC3.0

Firefly Variety 8 Pack - Fire Starter Accessory for Swiss Army Victorinox Knives (Neon Green-Yellow Glow)

9-in-1 5000A 150PSI Car Battery Booster Jump Starter with Air Compressor (All Gas/9L Diesel), Portable Car Battery Booster Pack, Safe Durable Car Jump Starter with Extended Jumper Cables, Glove, Light
