Behavior & cognition

Problem-solving

Problem-solving is one of the most discussed topics in animal cognition, and also one of the easiest to misread. When a crow bends a wire to reach food, or a chimpanzee stacks boxes to climb toward a reward, it is tempting to read the act as a window into a mind much like our own. Comparative-cognition researchers are far more cautious. They distinguish what an animal does, which can be observed and measured, from why it does it, which must be inferred carefully and is often debated.

This guide gives a broad overview of how scientists study problem-solving across very different animals, drawing on examples that are reasonably well supported in the research literature. It deliberately avoids ranking animals, naming a "smartest" species, or assigning intelligence scores. As you will see, performance on a problem depends heavily on how the problem is designed, whether the animal was tested in captivity or in the wild, and what the species' way of life calls for — so a single ladder of intelligence is the wrong picture from the start.

What "problem-solving" means in animal-cognition research

In comparative cognition, problem-solving usually refers to an animal reaching a goal — often food, escape, or access — when the direct route is blocked and some novel or flexible behaviour is required. Researchers are interested in whether the animal can find a solution it has not simply been trained to perform, and how it gets there: by trial and error, by learning from others, by reorganising familiar actions, or in some debated cases by appearing to work out a relationship before acting.

A key distinction runs through the whole field: observable behaviour versus inferred internal states. We can record that an octopus removed a screw-top lid, or that a crow dropped stones into a tube to raise a water level. What we cannot read directly is what the animal understood while doing it. Careful studies try to separate associative learning (gradual reinforcement of what works) from more flexible cognition, but the boundary is genuinely difficult, and many behaviours involve elements of both rather than a clean either-or.

Because of this, researchers tend to describe results in cautious language — "is consistent with", "may indicate", "in this studied population" — rather than declaring that an animal "understands" or "thinks". That caution is not vagueness; it reflects how hard it is to rule out simpler explanations for an impressive-looking result.

Corvids and tool use: studied behaviour, careful claims

Some members of the crow family — including certain populations of New Caledonian crows (Corvus moneduloides) and some rooks and ravens in captivity — have been studied solving multi-step physical tasks. Documented examples include using and shaping stick-like tools to extract food, and dropping stones into a water-filled tube to bring a floating reward within reach. These are real, repeatable findings from controlled studies, not viral clips, and they are striking because tool use of this kind is uncommon among birds.

It is important not to overgeneralise from them. Tool-related behaviour is well documented in particular corvid species and populations; it is not a property of "all crows" or of birds in general. Some of the most cited stick-tool behaviour is associated with wild New Caledonian crows specifically, where it appears in natural foraging, while other corvids show flexible problem-solving mainly in experimental settings. Whether a given solution reflects insight, prior experience, or step-by-step learning is often still discussed, so the safest summary is that these species show flexible, sometimes tool-assisted problem-solving under the conditions studied.

Great apes, octopuses, and elephants: different bodies, different tasks

Great apes such as chimpanzees (Pan troglodytes) and orangutans (Pongo species) have a long history in cognition research, including using objects as tools, combining actions to reach rewards, and learning from watching others. Wild chimpanzee populations have been observed using sticks and stones in foraging, with local differences between communities. These descriptions stay close to what is observed; claims about apes "reasoning like people" go well beyond the evidence and are avoided in careful work.

Octopuses (class Cephalopoda) are a striking contrast — invertebrates with a very different nervous system, much of it distributed through the arms. In captivity some individuals have been reported opening containers, navigating mazes, and manipulating objects, and there is research interest in their flexible behaviour. Findings vary between individuals and studies, and much octopus work involves small numbers of captive animals, so conclusions are held loosely. Elephants (family Elephantidae) have likewise been studied on cooperative and physical tasks, and have been a focus of mirror-mark studies in a few individuals; results are intriguing but limited in sample size and should not be stretched into broad claims about the whole group.

The wider lesson is that intelligence is not one thing measured on one scale. A corvid, an ape, an octopus, and an elephant inhabit different bodies, senses, and ecological pressures. Comparing them on a single "who is smarter" axis ignores that each is solving the problems its own way of life presents, which is why researchers describe cognition as context-specific rather than ranked.

Why task design shapes the result

How a problem is presented can change the outcome as much as the animal's ability. A task that suits one species' body, senses, and motivation may be nearly impossible for another for reasons that have nothing to do with cognition. A test relying on visual cues disadvantages an animal that leads with smell or touch; a food reward an animal does not value, an apparatus it cannot grip, or an unfamiliar testing room can all depress performance. Researchers spend considerable effort designing tasks that are fair to the species being tested, and re-examining tasks where the design may have favoured one group.

This is why the field treats cross-species comparisons with caution. If two species are tested on the same apparatus, the one whose body and senses suit the apparatus has an advantage built into the test itself. Differences in results may reflect the task as much as the mind behind it. Good comparative studies try to control for these factors, report their methods openly, and avoid sweeping claims — an approach consistent with how FaunaHub handles sourcing through its animal research sources methodology.

Failing a task is not proof of inability

A central principle in this research is that a negative result is weak evidence. If an animal does not solve a problem, the reason might be the design — wrong reward, wrong sensory channel, stress, lack of motivation, or simply that the individual tried a different strategy. It does not establish that the species cannot solve such problems, only that these animals did not, under these conditions. This is the mirror image of the rule for positive results, where researchers work hard to rule out simpler explanations before crediting an impressive performance.

The same caution applies to widely discussed tests such as the mirror self-recognition test, where an animal is checked for whether it reacts to a mark visible only in a mirror. Passing it is not proof of human-like consciousness, and failing it is not proof that an animal lacks any sense of self. The test has known limitations and is biased toward species that rely heavily on vision and naturally attend to mirrors, so results are best read narrowly rather than as a verdict on awareness.

Captivity, the wild, and reading internal states

Where an animal is studied matters. Captive settings allow controlled, repeatable tasks but can also encourage behaviours rarely seen in nature, shaped by training, boredom, or the apparatus itself. Wild observations capture problem-solving in its natural context but are harder to control and interpret. A behaviour reliably produced by captive individuals should not be assumed typical of wild populations, and vice versa; the two lines of evidence inform each other but are not interchangeable. Enrichment in captivity is sometimes part of how these behaviours are studied, and is mentioned here only as research context, not as advice.

Finally, problem-solving sits next to questions about emotion and communication, and the same discipline applies. Stress, persistence, or apparent frustration during a task are described from observable behaviour, not asserted as definite inner feelings. And solving a problem, or signalling about one, is communication or cognition — it is not evidence of human-like language or of a mind that works like ours. Keeping observation and inference separate is what allows this field to say something meaningful about animal problem-solving without overclaiming.

Related research methods

Related animal groups

How whole groups of animals show this behavior:

Explore more behavior guides

This guide is part of FaunaHub's animal intelligence & behavior cluster. For how these claims are sourced, see animal research sources, and for the biology behind behavior, see animal senses & adaptations.

Frequently asked questions

Which animal is the best problem-solver?
Comparative-cognition research does not support a single "best" answer, and reputable scientists avoid the question framed that way. Intelligence is context-specific: a corvid, a great ape, an octopus, and an elephant each solve the problems their own bodies, senses, and environments present. Performance also depends heavily on how a task is designed and whether the animal was tested in captivity or the wild. Ranking species on one scale ignores all of that, so this guide describes what is observed rather than naming a winner.
Does using a tool prove an animal is intelligent?
Tool use is interesting behaviour, but on its own it does not prove a particular kind of intelligence. Some behaviours that look like clever reasoning can also arise from learning what works over many attempts, and the two are genuinely hard to separate. Researchers study the details — whether the animal adjusts flexibly to new situations, and whether simpler explanations can be ruled out — before drawing conclusions. Documented tool use in certain corvids and apes is well supported, but it describes what those studied animals do, not a verdict on their whole class or order.
If an animal fails a problem-solving test, does that mean it cannot do it?
No. A failure under test conditions is weak evidence about ability. The animal may have been given the wrong kind of reward, a task that did not suit its senses or body, an unfamiliar or stressful setting, or simply tried a different strategy. A negative result shows that these individuals did not solve the problem this time, not that the species is incapable. This is why task design and fair testing matter so much in comparative cognition.
Is animal communication the same as language?
Not in the human sense. Many animals have rich communication systems — alarm calls, the honeybee waggle dance, whale song, and more — that carry real information. But communication systems are not the same as human language, which has features such as open-ended grammar and combining symbols into endless new meanings. Solving a problem or signalling about it is communication or cognition; describing it as "language" overstates what the evidence shows.