Today, we’re diving into Hugging Face’s smolagents – a new development that gives AI models more autonomy. Hugging Face, the open-source AI powerhouse behind technologies like Transformers, has now turned its attention to AI agents – programs where AI models can plan and execute tasks on their own – and its latest library, smolagents, makes building these agents simpler than ever. In this short episode, I’ll break down what smolagents is, how it works, and why it’s a big deal for developers, businesses, and researchers alike.
What Is Hugging Face’s smolagents?
So, what exactly is smolagents? In a nutshell, it’s a new Python library from Hugging Face that simplifies the creation of AI agents, making them more accessible to developers. Hugging Face themselves describe smolagents as a “library that unlocks agentic capabilities for language models”. This means you can take an LLM (a proprietary OpenAI model, an open-weight Llama model from Meta, or a fully open-source model from DeepSeek) and easily give it the ability to act – to use tools, call APIs, and perform multi-step reasoning to accomplish a task.
Under the hood, smolagents is designed with minimal complexity. The core logic fits in only a few thousand lines of code, keeping the abstractions to a thin layer above raw Python. This simplicity is intentional: it lets developers understand and tweak agent behaviors without wading through overly complex framework code. Smolagents is actually the successor to Hugging Face’s earlier “transformers.agents” module (which it will eventually replace), reflecting lessons learned to make this library lean and user-friendly.
One of the standout features is its “CodeAgent” approach. Unlike some agent frameworks that represent actions in abstract JSON or text, smolagents agents literally write Python code as their way of reasoning and taking actions. In other words, the AI agent decides what to do next by generating code (for example, calling a tool or performing a calculation), which is then executed. This code-centric approach is powerful because it leverages the full flexibility of Python for the agent’s actions. To keep things safe, these code executions can be sandboxed – run in isolated environments so that the agent doesn’t do anything harmful on your actual system. (So you get the flexibility of letting the agent write code, with safeguards in place.) And if you prefer a more traditional style, smolagents still supports a standard “ToolCallingAgent” that uses structured text for actions, so you have options.
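To make the code-as-action idea concrete, here’s a minimal sketch of what a code-writing agent looks like in practice. It assumes the CodeAgent and HfApiModel classes described in the smolagents documentation; exact class names and defaults may differ between library versions.

```python
# Minimal sketch of smolagents' code-centric agent loop, assuming the
# CodeAgent and HfApiModel wrappers from the smolagents docs (names and
# defaults may vary between versions).
from smolagents import CodeAgent, HfApiModel

# HfApiModel wraps a model served via Hugging Face's inference API.
model = HfApiModel()

# Even with no tools attached, the agent reasons by writing and executing
# Python code step by step until it reaches a final answer.
agent = CodeAgent(tools=[], model=model)

result = agent.run(
    "Could you give me the 118th number in the Fibonacci sequence?"
)
print(result)
```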
Smolagents also comes with first-class tool integrations. You can easily turn any Python function into a tool that the agent can use – just by decorating it with @tool and adding some type hints and docstrings. These tools could be anything from a web search function to a database query or a calculator. Even better, Hugging Face has integrated this with their Hub: you can share and load tools from the Hugging Face Hub. This means a community of users can publish useful agent tools (say, a weather API fetcher or a text translator) and others can plug those into their agents in seconds. It’s a bit like an app store for AI agent tools.
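As a rough illustration of how tool definition works, the sketch below turns an ordinary Python function into an agent tool with the @tool decorator. The get_weather function and its return value are made up for illustration; a real tool would call an actual weather API.

```python
from smolagents import CodeAgent, HfApiModel, tool

@tool
def get_weather(city: str) -> str:
    """Return a short weather summary for a city.

    Args:
        city: Name of the city to look up.
    """
    # Hypothetical stand-in for a real weather API call.
    return f"It is currently 18°C and partly cloudy in {city}."

# The agent can now decide to call get_weather from its generated code.
agent = CodeAgent(tools=[get_weather], model=HfApiModel())
print(agent.run("Should I bring an umbrella to Paris today?"))
```

For sharing, the library’s docs describe pushing a tool to the Hub with a push_to_hub method and pulling someone else’s with load_tool (with remote code execution explicitly allowed), though it’s worth checking the current documentation for exact signatures.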
Another key feature is that smolagents is model-agnostic. You’re not locked into a single AI model or provider. It supports any large language model – you can use open-source models from the Hugging Face Hub (with the library loading them via Transformers or via Hugging Face’s inference API), or you can use API-based models from OpenAI, Anthropic, and others through an integration called LiteLLM. You could even hook up local models if you have them. Essentially, whether it’s a 7-billion-parameter open model running on your laptop or a cutting-edge model via cloud API, you can plug it into smolagents. This flexibility is great for developers who want to experiment or deploy agents in different environments.
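As a hedged sketch of that flexibility, the snippet below shows three interchangeable model wrappers. The class names follow the smolagents docs at the time of writing, the model identifiers are purely illustrative, and any API keys are assumed to be set as environment variables.

```python
from smolagents import CodeAgent, HfApiModel, LiteLLMModel, TransformersModel

# Option 1: an open model served through Hugging Face's inference API.
hf_model = HfApiModel(model_id="Qwen/Qwen2.5-Coder-32B-Instruct")

# Option 2: a local open-weight model loaded via Transformers on your own hardware.
local_model = TransformersModel(model_id="meta-llama/Llama-3.2-3B-Instruct")

# Option 3: a proprietary API model routed through LiteLLM
# (expects e.g. OPENAI_API_KEY in your environment).
api_model = LiteLLMModel(model_id="gpt-4o")

# The agent code itself stays the same regardless of which backend you pick.
agent = CodeAgent(tools=[], model=hf_model)
print(agent.run("What are the prime factors of 2024?"))
```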
To recap the technical highlights, smolagents offers the following five features:
Simplicity: A lightweight, clear codebase (on the order of thousands of lines) that’s easy to understand. Minimal abstraction means you work almost directly with Python code, not obscure configuration.
Code-Based Actions: Agents that write and execute Python code to do things, enabling complex actions and multi-step reasoning in a familiar language (with sandboxing for safety). This is a distinctive “agents that write code” design.
Tool Integrations & Sharing: A convenient way to define tools (just Python functions) and an ability to share those tools or entire agents via Hugging Face Hub. This fosters reusability and community collaboration.
Any Model Support: Compatibility with a wide range of LLMs – from local open-source models to popular APIs (OpenAI, Anthropic, etc.) – through a unified interface. You choose the model that fits your needs or constraints.
Out-of-the-Box Usability: It even provides command-line interfaces (like a smolagent CLI to run an agent with one command, and a specialized webagent for web browsing tasks) to quickly test or deploy agents. This makes it easy to get started without writing a lot of boilerplate.
Real-World Use Cases for smolagents
Alright, this sounds cool – but what can you actually do with smolagents? Well, quite a lot. Because smolagents lets an LLM use tools and perform multi-step reasoning, it opens up many real-world applications across industries. Here are a few interesting examples:
Complex Question Answering and Research: You can build an agent that, given a hard question, will search the web, find information, and then compose an answer. For instance, Hugging Face provided a demo that showed an agent using a search tool to figure out “How many seconds would it take for a leopard at full speed to run through Pont des Arts?” Much like OpenAI’s $200/month Deep Research feature, you could use smolagents as a kind of research assistant, fetching facts and data from various sources to answer user queries or to write reports. (There’s a short code sketch of this search-agent pattern right after this list of examples.)
Task Automation and Planning: No piece of content on agents is complete without a travel planning example, so here we go… Suppose you need to plan a trip or an event with many parameters. With smolagents, you could create an agent that takes a high-level goal like “Plan a trip to Tokyo, Kyoto, and Osaka between March 28 and April 7” and then calls tools to gather information – searching for flights, checking train schedules, looking up hotel reviews, etc. In fact, one example command runs: smolagent "Plan a trip to Tokyo, Kyoto and Osaka between Mar 28 and Apr 7." with a travel planning tool. The agent could break down the task, perhaps use a maps API for distances or a web scraper for attractions, and return an organized itinerary. This kind of multi-step planning agent could be useful for travel agencies, event planners, or personal assistants.
Web Browsing and Data Extraction: Smolagents can also drive a web browser to perform actions. Hugging Face provides a webagent that can take instructions like “Go to an e-commerce website, navigate to the sales section, click the first product, and get its details and price”.
Data Analysis and Reports: Because smolagents agents can execute Python code, you could have them perform data analysis on the fly. Imagine an agent given a prompt to “Analyze last month’s sales figures and identify the top three products by growth.” The agent could load a CSV using pandas (as a tool or via an authorized import), compute the results, and then output a summary. Essentially, it can combine data processing and language generation in a single workflow. (A second code sketch after this list illustrates the idea.)
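First, to ground the research-assistant example, here’s a minimal sketch of the leopard demo. It assumes the DuckDuckGoSearchTool, CodeAgent, and HfApiModel classes described in the smolagents documentation; exact names and defaults may differ between library versions.

```python
from smolagents import CodeAgent, DuckDuckGoSearchTool, HfApiModel

# A web-search tool lets the agent look up the leopard's top speed and the
# bridge's length, then combine them with a quick calculation in generated code.
agent = CodeAgent(tools=[DuckDuckGoSearchTool()], model=HfApiModel())

agent.run(
    "How many seconds would it take for a leopard at full speed "
    "to run through Pont des Arts?"
)
```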
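Second, a rough sketch of the data-analysis pattern. It relies on the additional_authorized_imports argument that the smolagents docs describe for whitelisting extra modules in the agent’s generated code; the sales.csv file and its columns are hypothetical, purely for illustration.

```python
from smolagents import CodeAgent, HfApiModel

# Authorize pandas so the agent's generated code is allowed to import it;
# by default only a small set of safe modules is permitted.
agent = CodeAgent(
    tools=[],
    model=HfApiModel(),
    additional_authorized_imports=["pandas"],
)

# Hypothetical file and column names, for illustration only.
summary = agent.run(
    "Load 'sales.csv' with pandas and identify the top three products "
    "by month-over-month revenue growth, then summarize the result."
)
print(summary)
```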
These examples barely scratch the surface. The key is that smolagents allows LLMs to interact with the world (through tools and code) in a controlled way. This means any task that involves finding information, transforming data, or making decisions based on intermediate results could potentially be handled by an AI agent. And since Hugging Face has made it easy to share components, the community can rapidly build out a library of tools and agents for countless niche applications.
Broader Industry Impact
Now, let’s zoom out: how does smolagents fit into the broader AI ecosystem, and what does it mean for businesses and data scientists? In 2024 and particularly 2025, there’s been a lot of buzz about AI agents – from autonomous research assistants to AI that can use software like humans do. Smolagents is part of this movement, and its introduction is likely to have a notable impact.
For businesses, smolagents could lower the barrier to automating complex tasks with AI. Instead of needing a whole team of engineers to stitch together a custom solution, a company could use smolagents to prototype an AI agent that, say, handles customer inquiries or performs market research, with just a few dozen lines of Python. Because it supports local and open-source models, companies worried about data privacy can even keep everything in-house, running agents on their own secure servers. This flexibility means businesses can choose between using powerful proprietary models or cheaper/free open models, whichever suits their needs and budget – all within the same framework. In short, smolagents can help companies inject AI-driven automation into their workflows faster and more cost-effectively.
For data scientists and software developers, smolagents is a powerful new tool in their toolbox that’s both accessible and versatile. If you’re a developer who has used frameworks like LangChain or worked with OpenAI’s function-calling, you’ll find smolagents gives you similar capabilities but in a lightweight, transparent package. It’s all open-source, so you can inspect how it works, debug it, or extend it. The minimalistic design means you can probably learn it quickly. This empowers individual developers (not just big teams) to experiment with building their own agents.
For researchers, smolagents provides a convenient platform to study how AI models behave when given more autonomy. Researchers interested in reasoning, tool use, and multi-step decision-making can use the library to test new ideas (for example, plugging in a new reasoning algorithm or comparing how different size models handle agent tasks). The fact that you can swap in any model means it’s great for benchmarking – e.g., how does a smaller open model vs. a larger proprietary model perform in an agent role?
Conclusion
To sum up, Hugging Face’s smolagents is a convenient new framework that brings the concept of AI agents to the mainstream in a small, simple, and accessible form. In today’s short episode, I covered how smolagents works and its key features – from code-writing agents and easy tool integration to support for any model you want. We explored examples of what you can build, like travel planners, web scrapers, and research assistants, all of which could be applied across any industry. Been waiting to get started with your own agentic AI application in your particular commercial or academic niche? With smolagents now available, it could be exactly the right time to get going! Looking to brainstorm ideas on products or features you could build with AI-agent functionality? No problem; talk to your colleagues, your friends, or your favorite conversational LLM like Claude 3.7 Sonnet, Google Gemini, or OpenAI’s GPT-4.5.
And, while this episode obviously focused on smolagents, it of course isn’t your only option for developing and deploying AI agents:
Earlier in this episode, I mentioned LangChain, which is the most popular agentic framework today. Robust and feature-rich, LangChain surpasses smolagents in scalability, memory management, and built-in integrations, making it ideal for complex agent workflows.
LlamaIndex is another solid option. It excels at managing large datasets and memory-intensive retrieval tasks, offering out-of-the-box data integration capabilities not present in comparatively lightweight smolagents.
Pydantic AI provides structured, type-safe agent outputs through explicit schema validation, contrasting smolagents' simpler but less controlled output handling.
Microsoft AutoGen specializes in collaborative, multi-agent systems with dynamic coordination and communication, whereas smolagents focuses primarily on single-agent tasks.
And, finally, the popular CrewAI framework was built explicitly for orchestrating multi-agent teams through clearly defined roles and tasks, making it superior to smolagents in coordinated agent collaborations.
The SuperDataScience podcast is available on all major podcasting platforms, YouTube, and at SuperDataScience.com.