TextContents[text]
gives a dataset of information about entities, dates, quantities and other content-related elements found in text.
TextContents[text,form]
searches for cases of the type form.
TextContents[text,{form1,form2,…}]
searches for cases of types form1, form2, …
TextContents[text,forms,props]
includes the property props for each object in the dataset produced.
TextContents
Listing of Text Content Types »TextContents[text]
gives a dataset of information about entities, dates, quantities and other content-related elements found in text.
TextContents[text,form]
searches for cases of the type form.
TextContents[text,{form1,form2,…}]
searches for cases of types form1, form2, …
TextContents[text,forms,props]
includes the property props for each object in the dataset produced.
Details and Options
- In TextContents[text,…], text can be a string, a file with plain text represented by File[…], a ContentObject expression or a list of these text objects.
- TextContents[{text1,text2,…},…] gives cases for each texti.
- Identification type form can be:
-
"type" any text content type (e.g. "Noun", "City") Entity[…,…] a specific entity of a text content type form1|form2|… form matching any of the formi Containing[outer,inner] forms of type outer containing type inner Verbatim["string"] a specific string to be matched exactly pattern a string pattern to be matched Automatic entities, dates, quantities and other content-related elements - Possible choices for the property prop include:
-
"String" string of the identified text (default) "Position" start and end position of the string in text "Probability" estimated probability that the identification is correct "Type" type of content (entity type, …) "Interpretation" standard interpretation of the identified string "Snippet" a snippet around the identified string "HighlightedSnippet" a snippet with the identified string highlighted All all the preceding properties {prop1,prop2,…} a list of property specifications - The following options can be given:
-
AcceptanceThreshold Automatic minimum probability to accept identification TargetDevice "CPU" whether CPU or GPU computation should be used for entity detection VerifyInterpretation False whether interpretability should be verified
Examples
open all close allBasic Examples (1)
TextContents["The flag of Italy is green, white and red. Since 1861, the capital is Rome, which also serves as the capital of the Lazio region. With 2,872,800 residents in 1,285 km2 (496.1 sq mi)"]Only get the results for locations:
TextContents["The flag of Italy is green, white and red. Since 1861, the capital is Rome, which also serves as the capital of the Lazio region. With 2,872,800 residents in 1,285 km2 (496.1 sq mi)", "Location"]Only get the results for locations and quantities:
TextContents["The flag of Italy is green, white and red. Since 1861, the capital is Rome, which also serves as the capital of the Lazio region. With 2,872,800 residents in 1,285 km2 (496.1 sq mi)", {"Location", "Quantity"}]Get interpretations for all cases:
TextContents["The flag of Italy is green, white and red. Since 1861, the capital is Rome, which also serves as the capital of the Lazio region. With 2,872,800 residents in 1,285 km2 (496.1 sq mi)", {"Location", "Quantity"}, All]Get a specified set of properties for entities:
TextContents["The flag of Italy is green, white and red. Since 1861, the capital is Rome, which also serves as the capital of the Lazio region. With 2,872,800 residents in 1,285 km2 (496.1 sq mi)", Automatic, {"HighlightedSnippet", "Interpretation"}]Options (2)
AcceptanceThreshold (1)
By default, all the detected entities have an estimated probability higher than 0.5:
TextContents[ExampleData[{"Text", "JFKInaugural"}]]Get only the entities that are highly probable to be correct by setting a high AcceptanceThreshold:
TextContents[ExampleData[{"Text", "JFKInaugural"}], AcceptanceThreshold -> 0.9]VerifyInterpretation (1)
By default, some entities cannot be interpreted, either because they are not correct or because they are not yet in the knowledgebase:
TextContents["We visited Toulouse and Auterive in Midi-Pyrénées in France.", Automatic, {"String", "Interpretation"}]Use VerifyInterpretation to filter out the entities that cannot be interpreted:
TextContents["We visited Toulouse and Auterive in Midi-Pyrénées in France.", Automatic, {"String", "Interpretation"}, VerifyInterpretation -> True]Properties & Relations (1)
TextContents handles the same types as TextPosition and TextCases and always identifies the same substrings as these functions for a given type:
TextContents["Boston, Worcester, and Springfield are the largest cities in Massachusetts.", "City"]TextCases["Boston, Worcester, and Springfield are the largest cities in Massachusetts.", "City"]TextPosition["Boston, Worcester, and Springfield are the largest cities in Massachusetts.", "City"]A dataset that is similar to the output of TextContents can be obtained using TextCases:
Dataset@TextCases["Boston, Worcester, and Springfield are the largest cities in Massachusetts.", "City" -> Identity]Dataset@TextCases["Boston, Worcester, and Springfield are the largest cities in Massachusetts.", "City" -> Function[<|"City" -> Last[#Interpretation][[1]], "State" -> Last[#Interpretation][[2]], "Country" -> Last[#Interpretation][[3]], "Position" -> #Position, "HighlightedSnippet" -> #HighlightedSnippet|>]]Neat Examples (1)
Load the text of a Wikipedia page about the Moon:
moon = WikipediaData["Moon"];Snippet[moon, 5]Extract notable text contents from the page:
contents = TextContents[moon, VerifyInterpretation -> True]Visualize the frequency of content types found on the page:
counts = ReverseSort@CountsBy[contents, #Type&]WordCloud[counts]Find potential notable persons identified on the page:
Normal[Select[contents, #Type === "Person"&][[All, "String"]]]Interpret these persons as entities:
persons = Normal[Select[contents, #Type === "Person"&][[All, "Interpretation"]]]Visualize occupations of these persons:
WordCloud[Counts[Flatten@EntityValue[persons, EntityProperty["Person", "Occupation"]]]]See Also
TextCases TextPosition TextSearch FindTextualAnswer TextStructure ImageContents
Function Repository: GenerateQuestionsFromSentence
Related Guides
History
Text
Wolfram Research (2019), TextContents, Wolfram Language function, https://reference.wolfram.com/language/ref/TextContents.html.
CMS
Wolfram Language. 2019. "TextContents." Wolfram Language & System Documentation Center. Wolfram Research. https://reference.wolfram.com/language/ref/TextContents.html.
APA
Wolfram Language. (2019). TextContents. Wolfram Language & System Documentation Center. Retrieved from https://reference.wolfram.com/language/ref/TextContents.html
BibTeX
@misc{reference.wolfram_2026_textcontents, author="Wolfram Research", title="{TextContents}", year="2019", howpublished="\url{https://reference.wolfram.com/language/ref/TextContents.html}", note=[Accessed: 13-June-2026]}
BibLaTeX
@online{reference.wolfram_2026_textcontents, organization={Wolfram Research}, title={TextContents}, year={2019}, url={https://reference.wolfram.com/language/ref/TextContents.html}, note=[Accessed: 13-June-2026]}