Categorization is a type of cognition involving conceptual differentiation between characteristics of conscious experience, such as objects, events, or ideas. It involves the abstraction and differentiation of aspects of experience by sorting and distinguishing between groupings, through classification or typification on the basis of traits, features, similarities or other criteria that are universal to the group. Categorization is considered one of the most fundamental cognitive abilities, and it is studied particularly by psychology and cognitive linguistics.

Categorization is sometimes considered synonymous with classification (cf., Classification synonyms). Categorization and classification allow humans to organize things, objects, and ideas that exist around them and simplify their understanding of the world. Categorization is something that humans and other organisms do: "doing the right thing with the right kind of thing." The activity of categorizing things can be nonverbal or verbal. For humans, both concrete objects and abstract ideas are recognized, differentiated, and understood through categorization. Objects are usually categorized for some adaptive or pragmatic purposes.

Categorization is grounded in the features that distinguish the category's members from nonmembers. Categorization is important in learning, prediction, inference, decision making, language, and many forms of organisms' interaction with their environments.

Overview

Categories are distinct collections of concrete or abstract instances (category members) that are considered equivalent by the cognitive system. Using category knowledge requires one to access mental representations that define the core features of category members (cognitive psychologists refer to these category-specific mental representations as concepts).

To categorization theorists, the categorization of objects is often considered using taxonomies with three hierarchical levels of abstraction. For example, a plant could be identified at a high level of abstraction by simply labeling it a flower, a medium level of abstraction by specifying that the flower is a rose, or a low level of abstraction by further specifying this particular rose as a dog rose. Categories in a taxonomy are related to one another via class inclusion, with the highest level of abstraction being the most inclusive and the lowest level of abstraction being the least inclusive.

  • Basic Level, Species (e.g., Rose) - The middle level of abstraction. Rosch and colleagues (1976) suggest the basic level to be the most cognitively efficient. This development succeeds in organisms that only demonstrate simple reflexes (see articles on the binding problem, cognition, cognitive development, infant cognitive development, multisensory integration, and perception). For their nervous systems, the environment is a cacophony of sensory stimuli: electromagnetic waves, chemical interactions, and pressure fluctuations.

Categorization thought involves the abstraction and differentiation of aspects of experience that rely upon such power of mind as intentionality and perception. The problem is that these young organisms should already grasp the abilities of intentionality and perception to categorize the environment. This is a vicious circle: categorization needs intentionality and perception, which only appear in the categorized environment. So, the young, inexperienced organism does not have abstract thinking and cannot independently accomplish conceptual differentiation between characteristics of conscious experience if it solves the categorization problem alone.

Studying the origins of social cognition in child development, developmental psychologist Michael Tomasello developed the notion of Shared intentionality to account for unaware processes during social learning after birth to explain processes in shaping intentionality. Further, Latvian professor Igor Val Danilov expanded this concept to the intrauterine period by introducing a Mother-Fetus Neurocognitive model: a hypothesis of neurophysiological processes occurring during Shared intentionality. Evidence in neuroscience supports the hypothesis. Hyperscanning research studies observed inter-brain activity under conditions without communication in pairs while subjects were solving the shared cognitive problem, and they registered an increased inter-brain activity in contrast to the condition when subjects solved a similar problem alone. These data show that collaborative interaction without sensory cues can emerge in mother-child dyads, providing Shared intentionality. Aristotle's categorical method of analysis was transmitted to the scholastic medieval university through Porphyry's Isagoge. The classical view of categories can be summarized into three assumptions: a category can be described as a list of necessary and sufficient features that its membership must have, categories are discrete in that they have clearly defined boundaries (either an element belongs to one or not, with no possibilities in between), and all the members of a category have the same status. (There are no members of the category which belong more than others). In the classical view, categories need to be clearly defined, mutually exclusive and collectively exhaustive; this way, any entity in the given classification universe belongs unequivocally to one, and only one, of the proposed categories.

The classical view of categories first appeared in the context of Western Philosophy in the work of Plato, who, in his Statesman dialogue, introduces the approach of grouping objects based on their similar properties. This approach was further explored and systematized by Aristotle in his Categories treatise, where he analyzes the differences between classes and objects. Aristotle also applied intensively the classical categorization scheme in his approach to the classification of living beings (which uses the technique of applying successive narrowing questions such as "Is it an animal or vegetable?", "How many feet does it have?", "Does it have fur or feathers?", "Can it fly?"...), establishing this way the basis for natural taxonomy.

Examples of the use of the classical view of categories can be found in the western philosophical works of Descartes, Blaise Pascal, Spinoza and John Locke, and in the 20th century in Bertrand Russell, G.E. Moore, the logical positivists. It has been a cornerstone of analytic philosophy and its conceptual analysis, with more recent formulations proposed in the 1990s by Frank Cameron Jackson and Christopher Peacocke.

The classical model of categorization has been used at least since the 1960s from linguists of the structural semantics paradigm, by Jerrold Katz and Jerry Fodor in 1963, which in turn have influenced its adoption also by psychologists like Allan M. Collins and M. Ross Quillian.

Modern versions of classical categorization theory study how the brain learns and represents categories by detecting the features that distinguish members from nonmembers.

Prototype theory

The pioneering research by psychologist Eleanor Rosch and colleagues since 1973, introduced the prototype theory, according to which categorization can also be viewed as the process of grouping things based on prototypes. This approach has been highly influential, particularly for cognitive linguistics. Under the prototype theory, this stored representation consists of a summary representation of the category's members. This prototype stimulus can take various forms. It might be a central tendency that represents the category's average member, a modal stimulus representing either the most frequent instance or a stimulus composed of the most common category features, or, lastly, the "ideal" category member, or a caricature that emphasizes the distinct features of the category. An important consideration of this prototype representation is that it does not necessarily reflect the existence of an actual instance of the category in the world. For example, while one's prototype for the category of beverages may be soda or seltzer, the context of brunch might lead them to select mimosa as a prototypical beverage.

The prototype theory claims that members of a given category share a family resemblance, and categories are defined by sets of typical features (as opposed to all members possessing necessary and sufficient features).

Exemplar theory

Another instance of the similarity-based approach to categorization, the exemplar theory likewise compares the similarity of candidate category members to stored memory representations. Under the exemplar theory, all known instances of a category are stored in memory as exemplars. When evaluating an unfamiliar entity's category membership, exemplars from potentially relevant categories are retrieved from memory, and the entity's similarity to those exemplars is summed to formulate a categorization decision. This effectively biases categorization decisions towards exemplars most similar to the entity to be categorized.

Conceptual clustering

Conceptual clustering is a machine learning paradigm for unsupervised classification that was defined by Ryszard S. Michalski in 1980. It is a modern variation of the classical approach of categorization, and derives from attempts to explain how knowledge is represented. In this approach, classes (clusters or entities) are generated by first formulating their conceptual descriptions and then classifying the entities according to the descriptions.

Conceptual clustering developed mainly during the 1980s, as a machine paradigm for unsupervised learning. It is distinguished from ordinary data clustering by generating a concept description for each generated category.

Conceptual clustering is closely related to fuzzy set theory, in which objects may belong to one or more groups, in varying degrees of fitness. A cognitive approach accepts that natural categories are graded (they tend to be fuzzy at their boundaries) and inconsistent in the status of their constituent members. The idea of necessary and sufficient conditions is almost never met in categories of naturally occurring things.

Category learning

While an exhaustive discussion of category learning is beyond the scope of this article, a brief overview of category learning and its associated theories is useful in understanding formal models of categorization.

If categorization research investigates how categories are maintained and used, the field of category learning seeks to understand how categories are acquired in the first place. To accomplish this, researchers often employ novel categories of arbitrary objects (e.g., dot matrices) to ensure that participants are entirely unfamiliar with the stimuli. Category learning researchers have generally focused on two distinct forms of category learning. Classification learning tasks participants with predicting category labels for a stimulus based on its provided features. Classification learning is centered around learning between-category information and the diagnostic features of categories. In contrast, inference learning tasks participants with inferring the presence/value of a category feature based on a provided category label and/or the presence of other category features. Inference learning is centered on learning within-category information and the category's prototypical features. Prevailing theories of category learning include the prototype theory, the exemplar theory, and the decision bound theory.

Formal models

Computational models of categorization have been developed to test theories about how humans represent and use category information. Medin and Schaffer's (1978) context model was expanded upon by Nosofsky (1986) in the mid-1980s, resulting in the production of the Generalized Context Model (GCM). While simple logical rules are ineffective at learning poorly defined category structures, some proponents of the rule-based theory of categorization suggest that an imperfect rule can be used to learn such category structures if exceptions to that rule are also stored and considered. To formalize this proposal, Nosofsky and colleagues (1994) designed the RULEX model. The RULEX model attempts to form a decision tree composed of sequential tests of an object's attribute values. Categorization of the object is then determined by the outcome of these sequential tests. The RULEX model searches for rules in the following ways:

  • Exact Search for a rule that uses a single attribute to discriminate between classes without error.
  • Imperfect Search for a rule that uses a single attribute to discriminate between classes with few errors
  • Conjunctive Search for a rule that uses multiple attributes to discriminate between classes with few errors.
  • Exception Search for exceptions to the rule.

The method that RULEX uses to perform these searches is as follows: It is often the case that learned category representations vary depending on the learner's goals, as well as how categories are used during learning.

Social categories based on age, race, and gender are used by people when they encounter a new person. Because some of these categories refer to physical traits, they are often used automatically when people do not know each other. These categories are not objective and depend on how people see the world around them. They allow people to identify themselves with similar people, and to identify people who are different. They are useful in one's identity formation with the people around them. One can build their own identity by identifying themselves in a group or by rejecting another group.

Social categorization is similar to other types of categorization since it aims at simplifying the understanding of people. However, creating social categories implies that people will position themselves in relation to other groups. A hierarchy in group relations can appear as a result of social categorization. These hierarchical relations participate in the promotion of stereotypes about people and groups, sometimes based on subjective criteria. Social categories can encourage people to associate stereotypes to groups of people. Associating stereotypes to a group, and to people who belong to this group, can lead to forms of discrimination towards people of this group. The perception of a group and the stereotypes associated with it have an impact on social relations and activities.

Some social categories have more weight than others in society. For instance, in history and still today, the category of "race" is one of the first categories used to sort people. However, only a few categories of race are commonly used such as "Black", "White", "Asian" etc. It participates in the reduction of the multitude of ethnicities to a few categories based mostly on people's skin color.

The process of sorting people creates a vision of the other as 'different', leading to the dehumanization of people. Scholars talk about intergroup relations with the concept of social identity theory developed by H. Tajfel. To do "the right thing with the right kind of thing.", there has to be both a right and a wrong thing to do. Not only does a category of which "everything" is a member lead logically to the Russell paradox ("is it or is it not a member of itself?"), but without the possibility of error, there is no way to detect or define what distinguishes category members from nonmembers.

An example of the absence of nonmembers is the problem of the poverty of the stimulus in language learning by the child: children learning the language do not hear or make errors in the rules of Universal Grammar (UG). Hence they never get corrected for errors in UG. Yet children's speech obeys the rules of UG, and speakers can immediately detect that something is wrong if a linguist generates (deliberately) an utterance that violates UG. Hence speakers can categorize what is UG-compliant and UG-noncompliant. Linguists have concluded from this that the rules of UG must be somehow encoded innately in the human brain.

Ordinary categories, however, such as "dogs," have abundant examples of nonmembers (cats, for example). So it is possible to learn, by trial and error, with error-correction, to detect and define what distinguishes dogs from non-dogs, and hence to correctly categorize them. This kind of learning, called reinforcement learning in the behavioral literature and supervised learning in the computational literature, is fundamentally dependent on the possibility of error, and error-correction. Miscategorization—examples of nonmembers of the category—must always exist, not only to make the category learnable, but for the category to exist and be definable at all.

See also

  • Categorical perception
  • Characterization (mathematics)
  • Classification (general theory)
  • Knolling
  • Library classification
  • Multi-label classification
  • Pattern recognition
  • Shared intentionality
  • Statistical classification
  • Symbol grounding problem

References

  • To Cognize is to Categorize: Cognition is Categorization
  • Wikipedia Categories Visualizer
  • Interdisciplinary Introduction to Categorization: Interview with Dvora Yanov (political sciences), Amie Thomasson (philosophy) and Thomas Serre (artificial intelligence)