The Agent Company: Benchmarking LLM Agents on Consequential Real World Tasks

Samuel Albanie January 5, 2025
Video Thumbnail
Samuel Albanie Logo

Samuel Albanie

@samuelalbanie1

About

AI research. Note that opinions expressed are my own. However, for conflict of interest purposes, please note that I'm employed by Google DeepMind (GDM). This almost certainly biases my judgment to some degree. I think GDM is pretty great. Other related content: - misc/outdoor stuff channel: https://www.youtube.com/@samuelalbaniemisc - https://x.com/SamuelAlbanie - https://bsky.app/profile/samuelalbanie.bsky.social FAQ: Software used to make videos - keynote on mac (to make slides) - Adobe Premiere Pro for editing

Latest Posts

No results found. Try different keywords.

Video Description

A video summary of "TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks" by Xu et al. (2024) Links: - paper: https://arxiv.org/abs/2412.14161 - website: https://the-agent-company.com/ - code: https://github.com/TheAgentCompany/TheAgentCompany

You May Also Like