If you are looking to connect painters into a network: the e-flux web scraping project connected painters based on co-exhibitions and then analyzed the resulting network. ArtProject has other networks of painters (and styles) too, e.g. based on time and place.
The aim of this project is to create a dataset of painters from sources such as WikiArt and Art500k by combining features, filling in missing painter data via the Wikipedia API, and making corrections and additions both automatically and manually. After mapping painters between the two sources and Wikidata, the dataset includes around 10000 painters with many attributes.
Currently, the dataset includes 29 attributes:
- Biography data: Nationality/citizenship, name, birth and death years and places, gender
- Artistic style data
- Locations of activity (sometimes with years)
- Occupations (e.g. painter, sculptor, lithographer, etc.)
- Influences: on painters, and from painters, pupils, teachers
- Friends, coworkers (limited data)
- Quantities of paintings, in styles, etc.
The dataset is intended to be used for various purposes, including data analysis, machine learning, and visualization projects.
One long-term goal is to create a JSON file that combines everything hierarchically. The top level of the structure could be the art movement; inside each movement are its artists with some basic bio data; an even lower layer could hold the paintings of each painter (better still, each painter could have a substructure of eras, with the paintings inside them).
We have created multiple networks of painters in another project (see: ArtProject): networks based on being at the same places at the same time plus nationality, networks that additionally use style similarity, and who-influenced-whom networks. A network of styles and movements was also created.
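As a rough illustration of the hierarchy described above, here is a minimal sketch that groups flat painter records by movement. The function name `build_hierarchy` and the key names are assumptions made for this example, not the project's actual schema; the values are copied from sample rows shown in this README.

```python
import json
from collections import defaultdict

# Flat records; values copied from sample rows in this README.
painters = [
    {"artist": "Francisco de Zurbaran", "movement": "Baroque", "birth_year": 1598, "death_year": 1664},
    {"artist": "Jean-Honore Fragonard", "movement": "Rococo", "birth_year": 1732, "death_year": 1806},
    {"artist": "Jean-Baptiste-Simeon Chardin", "movement": "Baroque", "birth_year": 1699, "death_year": 1779},
]

def build_hierarchy(records):
    """Group painters under their movement (hypothetical layout: movement -> artist -> data)."""
    tree = defaultdict(dict)
    for rec in records:
        tree[rec["movement"]][rec["artist"]] = {
            "birth_year": rec["birth_year"],
            "death_year": rec["death_year"],
            "paintings": [],  # placeholder for the lower "paintings" (or "eras") layer
        }
    return dict(tree)

hierarchy = build_hierarchy(painters)
print(json.dumps(hierarchy, indent=2))
```

An "eras" layer could be added the same way, by nesting one more dictionary between the artist and the paintings.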
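To make the "same place at the same time" idea concrete, here is a small sketch that checks whether two painters have overlapping stays in a common location. It is not the ArtProject implementation; it only assumes the `locations_with_years` cell format visible in the sample rows (a stringified list such as `['Seville:1614-1658', 'Madrid:1658-1664']`). The helper names and the third painter are hypothetical.

```python
import ast

def parse_location_years(cell):
    """Turn "['Paris:1809-1815,1812-1812']" into {'Paris': [(1809, 1815), (1812, 1812)]}."""
    periods = {}
    for entry in ast.literal_eval(cell):  # the CSV stores the list as a string
        place, _, spans = entry.partition(":")
        for span in spans.split(","):
            start, _, end = span.partition("-")
            periods.setdefault(place, []).append((int(start), int(end)))
    return periods

def overlaps(a, b):
    """True if two (start, end) year ranges share at least one year."""
    return a[0] <= b[1] and b[0] <= a[1]

def shared_places(p, q):
    """Places where both painters were present during overlapping years."""
    return sorted(
        place for place in set(p) & set(q)
        if any(overlaps(s, t) for s in p[place] for t in q[place])
    )

zurbaran = parse_location_years("['Seville:1614-1658', 'Madrid:1658-1664']")
hanselaere = parse_location_years("['Paris:1809-1815,1812-1812', 'Italy:1815-1815']")
hypothetical = parse_location_years("['Madrid:1660-1670']")  # made-up painter for illustration

print(shared_places(zurbaran, hypothetical))
print(shared_places(zurbaran, hanselaere))
```

Two painters would get an edge in such a network whenever `shared_places` returns a non-empty list; any additional filtering (nationality, style similarity) would sit on top of this check.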
Download the latest version (1.0.2) here
In any case, the final ("compiled") dataset is stored in the artists.csv file (raw file here: raw; the raw link is often better for importing or viewing, since its URL has no commit ID and therefore always returns the latest version).
Don't forget to scroll
artist | Nationality | citizenship | gender | styles | movement (Wiki) | Movements (Art500k) | birth place | death place | birth year | death_year | FirstYear | LastYear | wikiart_pictures_count | locations | locations_with_years | styles_extended | StylesCount | StylesYears | occupations | PaintingsExhibitedAt | PaintingsExhibitedAtCount | PaintingSchool | Influencedby | Influencedon | Pupils | Teachers | FriendsandCoworkers | Contemporary | ArtMovement | Type |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Bracha L. Ettinger | French,Jewish,Israeli | Israel | female | New European Painting | New European Painting | {New European Painting:21} | Tel Aviv | NaN | 1948 | NaN | 1991 | 2009 | 21 | [] | [] | {New European Painting:21} | {New European Painting:21} | New European Painting:1991-2009 | philosopher, psychoanalyst, painter, photographer | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Yes | NaN | NaN | |
Friedrich Schroder-Sonnenstern | German | Germany | male | Art Brut | Outsider art (Art brut) | {Outsider art (Art brut):10} | Sovetsk | Berlin | 1892 | 1982 | 1948 | 1965 | 10 | ['Berlin'] | ['Berlin:1949-1982'] | {Art Brut:10} | {Art Brut:10} | Art Brut:1948-1965 | painter, drawer | NaN | NaN | NaN | NaN | NaN | NaN | NaN | No | NaN | NaN | |
Nuno Gonçalves | Portuguese | Kingdom of Portugal | male | Early Renaissance | Early Renaissance | {Early Renaissance:10} | NaN | NaN | 1425 | 1492 | 1450 | 1490 | 10 | [] | [] | {Early Renaissance:10} | {Early Renaissance:10} | Early Renaissance:1450-1490 | portraitist, painter, musician, researcher | Lisbon, Portugal | {Lisbon:1},{Portugal:1} | NaN | National Museum of Ancient Art (MNAA), Lisbon,... | NaN | NaN | NaN | NaN | No | NaN | NaN |
Jose Ferraz de Almeida Junior | Brazilian | Brazil | male | Academicism, Realism | Realism | {Realism:64} | Itu | Piracicaba | 1850 | 1899 | 1850 | 1899 | 65 | [] | [] | {Academicism:13},{Realism:52} | {Realism:51}, {Academicism:15} | Realism:1850-1899,Academicism:1850-1895 | painter | Brazil, Rio de Janeiro, Sao Paulo | {Sao Paulo:30},{Brazil:36},{Rio de Janeiro:4} | NaN | NaN | NaN | NaN | NaN | NaN | No | NaN | NaN |
Frank W. Benson | American | United States of America | male | Impressionism, Naturalism, Realism | Impressionism | {Impressionism:103} | San Jose | Redlands | 1858 | 1911 | 1884 | 1924 | 103 | [] | [] | {Impressionism:66},{Naturalism:5},{Realism:32} | {Realism:48}, {Naturalism:5}, {Impressionism:68} | Realism:1884-1922,Naturalism:1895-1924,Impress... | lawyer, politician, painter, graphic artist | US, Chicago, Washington DC, IL, NY, New York City | {Chicago:1},{IL:1},{US:4},{New York City:1},{N... | Ten (Ten American Painters) | Johannes Vermeer,Diego Velazquez,Claude Monet,... | NaN | NaN | NaN | Edmund Charles Tarbell,Robert Lewis Reid,Abbot... | No | NaN | NaN |
Maria Helena Vieira da Silva | French,Portuguese | Portugal | female | Abstract Art, Expressionism, Surrealism, Tachi... | Abstract Art | {Abstract Art:38} | Lisbon | Paris | 1908 | 1992 | 1932 | 1995 | 38 | [] | [] | {Abstract Art:5},{Expressionism:5},{Surrealism... | {Cubism:2}, {Expressionism:8}, {Surrealism:9},... | Cubism:1932-1932,Expressionism:1932-1942,Surre... | artist, painter, graphic artist, stained-glass... | Lisbon, Portugal | {Lisbon:1},{Portugal:1} | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
Jean-Baptiste-Simeon Chardin | NaN | France | male | Baroque, Rococo | Baroque | {Realism:1} | Paris | Paris | 1699 | 1779 | 1728 | 1753 | 79 | ['Paris'] | [] | {Baroque:17},{Rococo:62} | NaN | NaN | painter, visual artist | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | No | NaN | NaN |
... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
- Find the aliases of painters in the Art500k dataset (one painter appearing as multiple instances under different names, e.g. Rembrandt and Rembrandt van Rijn). The methods are still being developed and discussed; the two leading candidates are finding aliases through Wikipedia and Wikidata, and using word embeddings to find very similar names.
- Broader combination of datasets (handle aliases, add more painters to the final dataset)
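As a simple baseline for the alias problem above, the sketch below flags candidate alias pairs with plain string matching from the Python standard library. This is a stand-in technique, not one of the two candidate methods named in the list (Wikipedia/Wikidata lookup or word embeddings); the function names and the 0.9 threshold are assumptions chosen for illustration.

```python
import unicodedata
from difflib import SequenceMatcher
from itertools import combinations

def normalize(name):
    """Lowercase and strip accents, so 'Gonçalves' and 'Goncalves' compare equal."""
    decomposed = unicodedata.normalize("NFKD", name)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch)).lower()

def is_alias_candidate(a, b, threshold=0.9):
    """Flag a pair if one name's words are contained in the other's, or the strings are near-identical."""
    na, nb = normalize(a), normalize(b)
    ta, tb = set(na.split()), set(nb.split())
    if ta and tb and (ta <= tb or tb <= ta):
        return True
    return SequenceMatcher(None, na, nb).ratio() >= threshold

def find_alias_candidates(names):
    """Return all pairs of names that look like aliases of each other."""
    return [(a, b) for a, b in combinations(names, 2) if is_alias_candidate(a, b)]

names = ["Rembrandt", "Rembrandt van Rijn", "Nuno Gonçalves", "Nuno Goncalves",
         "Claude Monet", "Edouard Manet"]
pairs = find_alias_candidates(names)
print(pairs)
```

Candidates found this way would still need manual review: the word-containment rule can wrongly merge different painters who share a surname-only entry.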
These were taken from ArtProject.
Figure 1: Painters connected based on time and place (roughly if they painted at the same places at the same time), arranged in birth year order. The color of the nodes represents the movement most common in the painter's styles. An updated, more accurate and larger version of this network is visualized below, but not ordered by time, unlike above.
Figure 1b: Painters connected based on time and place + post filtering, arranged in birth year order but not ordered by time. Out of >10000 painters in the dataset, around 3000 had enough connections; these were selected and their network was visualized.
Figure 2: Network of movements: two movements are connected if it's common enough that painters painted in both styles.
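The movement network in Figure 2 can be approximated by counting how many painters have paintings in both of two styles. The sketch below is an assumption about how such edges could be derived from the `styles` column, not the ArtProject code; `min_painters` is a hypothetical cutoff standing in for "common enough".

```python
from collections import Counter
from itertools import combinations

# `styles` cells copied from sample rows in this README.
styles_per_painter = [
    "Academicism, Realism",
    "Impressionism, Naturalism, Realism",
    "Abstract Art, Abstract Expressionism, Academicism",
    "Baroque, Rococo",
]

def style_edges(cells, min_painters=2):
    """Count painters per pair of styles and keep pairs shared by at least `min_painters`."""
    counts = Counter()
    for cell in cells:
        styles = sorted({s.strip() for s in cell.split(",") if s.strip()})
        counts.update(combinations(styles, 2))  # each unordered pair once per painter
    return {pair: n for pair, n in counts.items() if n >= min_painters}

# The sample is tiny, so a cutoff of 1 is used here to show some edges.
edges = style_edges(styles_per_painter, min_painters=1)
for (a, b), n in sorted(edges.items()):
    print(f"{a} -- {b}: {n} painter(s)")
```

On the full dataset a higher cutoff (or a ratio relative to the number of painters in each style) would be needed to keep only meaningful connections.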
You don't have to download the file: you can load the dataset directly into your Python environment (e.g. with pandas):
import pandas as pd
url = "https://raw.githubusercontent.com/me9hanics/PainterPalette/main/datasets/artists.csv"
artists = pd.read_csv(url)
artists
The artists.csv file contains all information about the painters: each row represents a painter and each column an attribute. An example of a few painters from the dataset:
ID | artist | Nationality | citizenship | gender | styles | movement | Art500k_Movements | birth_place | death_place | birth_year | death_year | FirstYear | LastYear | wikiart_pictures_count | locations | locations_with_years | styles_extended | StylesCount | StylesYears | occupations | PaintingsExhibitedAt | PaintingsExhibitedAtCount | PaintingSchool | Influencedby | Influencedon | Pupils | Teachers | FriendsandCoworkers | Contemporary |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | Richard Pousette-Dart | American | United States of America | male | Abstract Art, Abstract Expressionism, Academicism | Abstract Art | {Abstract Expressionism:54} | Saint Paul | Rockland County | 1916.0 | 1992.0 | 1930.0 | 1992.0 | 54.0 | [] | [] | {Abstract Art:10},{Abstract Expressionism:43},{Academicism:1} | {Abstract Expressionism:43}, {Abstract Art:11}, {Academicism:1} | Abstract Expressionism:1940-1992,Abstract Art:1930-1992,Academicism:1944-1944 | photographer, painter, drawer | NY, New York City, US | {New York City:2},{NY:2},{US:2} | New York School,Irascibles | NaN | NaN | NaN | NaN | NaN | NaN |
1 | Ethel Léontine Gabain | French,British | United Kingdom | female | Neo-Romanticism | Neo-Romanticism | NaN | Le Havre | London | 1883.0 | 1950.0 | 1930.0 | 1944.0 | 45.0 | [] | [] | {Neo-Romanticism:45} | NaN | NaN | lithographer, painter | London, Manchester, UK | {London:2},{UK:3},{Manchester:1} | NaN | NaN | NaN | NaN | NaN | NaN | No |
2 | Charles-Amable Lenoir | NaN | France | male | Academicism, Unknown | Academic Art | {Academic Art:9} | Châtelaillon-Plage | Paris | 1860.0 | 1926.0 | NaN | NaN | 9.0 | [] | [] | {Academicism:1},{Unknown:8} | {Academicism:1} | NaN | painter | NaN | NaN | NaN | NaN | NaN | NaN | |||
3 | Francisco de Zurbaran | Spanish | Spain | male | Baroque, Unknown | Baroque | {Baroque:96} | Fuente de Cantos | Madrid | 1598.0 | 1664.0 | 1625.0 | 1664.0 | 154.0 | ['Seville', 'Madrid'] | ['Seville:1614-1658', 'Madrid:1658-1664'] | {Baroque:150},{Unknown:4} | {Baroque:94} | Baroque:1625-1664 | painter | Hungary, Museo del Prado, Paris, Barcelona, Budapest,Seville, Moscow, Pasadena, London, Sweden, France, Hartford, Grenoble, Munich, CA, Nationalmuseum, ... | {Grenoble:7},{France:19},{Seville:31},{Spain:36},{Bordeaux:1},{Besançon:1},{Barcelona:1},{Paris:4},{Budapest:3},{Hungary:3},{Saint Petersburg:1},{Russia:2},{Museo del Prado:14},{Madrid:19}, ... | NaN | Caravaggio | Gustave Courbet | NaN | Francisco Pacheco | NaN | No |
4 | Pieter van Hanselaere | Belgian | Belgium | male | Neoclassicism | Neoclassicism | {Neoclassicism:8} | Ghent | Ghent | 1786.0 | 1862.0 | 1817.0 | 1827.0 | 8.0 | ['Paris', 'Ghent', 'Italy'] | ['Paris:1809-1815,1812-1812', 'Italy:1815-1815'] | {Neoclassicism:8} | {Neoclassicism:8} | Neoclassicism:1817-1827 | painter | Netherlands, Amsterdam | {Amsterdam:2},{Netherlands:2} | NaN | NaN | NaN | NaN | Jacques-Louis David | NaN | No |
5 | Jean-Honore Fragonard | French | France | male | Rococo, Unknown | Rococo | {Rococo:72},{Renaissance:1} | Grasse | Paris | 1732.0 | 1806.0 | 1750.0 | 1790.0 | 69.0 | ['Vienna', 'Tivoli', 'Rome', 'Paris', 'Naples', 'Strasburg', 'Grasse', 'Prague', 'Dresden', 'Frankfurt'] | [] | {Rococo:64},{Unknown:5} | {Rococo:70} | Rococo:1750-1790 | illustrator, painter, printmaker, architectural draftsperson, drawer | Netherlands, Paris,London, Pasadena, Moscow, NY, Washington DC, France, Marseille, Munich, CA, Germany, New York City, MA, UK, Toledo, Madrid, OH, Rotterdam, ... | {France:21},{Paris:8},{Moscow:1},{Russia:3},{Saint Petersburg:2},{Washington DC:2},{US:9},{New York City:2},{NY:2},{Rotterdam:1},{Netherlands:1},{Toledo:1},{OH:1},{Munich:1},{Germany:1},{Williamstown:1},... | NaN | NaN | NaN | NaN | NaN | NaN | No |
6 | Ion Theodorescu-Sion | Romanian | Romania | male | Art Nouveau (Modern), Impressionism, Post-Impressionism, Symbolism | Post-Impressionism | {Post-Impressionism:43} | Ianca | Bucharest | 1882.0 | 1939.0 | 1909.0 | 1938.0 | 43.0 | [] | [] | {Art Nouveau (Modern):1},{Impressionism:8},{Post-Impressionism:33},{Symbolism:1} | {Post-Impressionism:33}, {Impressionism:8}, {Cubism:2}, {Art Nouveau (Modern):1}, {Symbolism:1} | Post-Impressionism:1912-1938,Impressionism:1913-1934,Cubism:1925-1936,Art Nouveau (Modern):1925-1925,Symbolism:1909-1909 | trade unionist, caricaturist, painter | NaN | NaN | Balchik School | NaN | NaN | NaN | NaN | NaN | No |
7 | Janos Mattis-Teutsch | Hungarian,Romanian | Romania | male | Abstract Art, Constructivism, Cubism, Expressionism, Fauvism, Socialist Realism | Constructivism | {Art Nouveau:1},{Socialist realism:1},{Abstract art:1},{Modern art:1},{Constructivism:109} | Brașov | Brașov | 1884.0 | 1960.0 | 1909.0 | 1947.0 | 108.0 | [] | [] | {Abstract Art:59},{Constructivism:7},{Cubism:4},{Expressionism:2},{Fauvism:31},{Socialist Realism:5} | {Constructivism:11}, {Abstract Art:61}, {Expressionism:2}, {Cubism:4}, {Fauvism:31}, {Socialist Realism:5} | Constructivism:1925-1930,Abstract Art:1918-1925,Expressionism:1947-1947,Cubism:1926-1928,Fauvism:1909-1947,Socialist Realism:1928-1945 | writer, poet, painter, sculptor, journalist | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
A column name starting with a capital letter means the attribute was collected from the Art500k paintings data; a lowercase first letter means it was collected from WikiArt or Wikidata.
- Nationality and citizenship: The first is gathered from Art500k, the second is fetched from Wikidata.
- Styles and movements: Paintings have an artistic style, and painters have (mostly one) artistic movement. `movement` stores the movement most commonly associated with the painter; it comes from the WikiArt paintings data, where each painter has only one movement assigned. Styles are listed in `styles`; `styles_extended`, `StylesCount`, `StylesYears` (and `Art500k_Movements`) store the number of paintings by the painter in each style and the periods in which they were painted.
- WikiArt pictures count: how many paintings the painter has in the WikiArt dataset.
- First and last year: The earliest and the latest year found in the date column of the painter's paintings in the Art500k dataset.
- Locations: Places where the painter lived or worked, and the years when they were there. The locations where the painter's paintings are exhibited are also stored, under `PaintingsExhibitedAt` and `PaintingsExhibitedAtCount`.
- Influences, teachers, friends, coworkers: The names in these cells are artists, e.g. the painters by whom the painter was influenced (this data is very sparse).
- Others: mostly bio data, such as birth.
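Several of the columns described above store counts and year ranges as strings. The following is a minimal parsing sketch, assuming the cell formats visible in the sample rows (`{Realism:51}, {Academicism:15}` for counts and `Realism:1850-1899,Academicism:1850-1895` for years); the helper names are hypothetical and not part of the repository.

```python
import re

def parse_counts(cell):
    """Parse '{Realism:51}, {Academicism:15}' into {'Realism': 51, 'Academicism': 15}."""
    if not isinstance(cell, str):  # pandas reads missing cells as float NaN
        return {}
    return {name.strip(): int(n) for name, n in re.findall(r"\{([^{}:]+):(\d+)\}", cell)}

def parse_style_years(cell):
    """Parse 'Realism:1850-1899,Academicism:1850-1895' into {'Realism': (1850, 1899), ...}."""
    if not isinstance(cell, str):
        return {}
    matches = re.findall(r"([^,:]+):(\d+)-(\d+)", cell)
    return {name.strip(): (int(first), int(last)) for name, first, last in matches}

counts = parse_counts("{Realism:51}, {Academicism:15}")
years = parse_style_years("Realism:1850-1899,Academicism:1850-1895")
print(counts)
print(years)
```

With pandas, these could be applied column-wise, e.g. `artists['StylesCount'].apply(parse_counts)`.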
Here are some examples of using the code to work with the dataset:
- Filter Renaissance painters:
display(artists[artists['styles'].str.contains('Renaissance', na=False)])  # na=False skips painters with no style data
artist | Nationality | citizenship | gender | styles | movement | Art500k_Movements | birth_place | death_place | birth_year | occupations | PaintingsExhibitedAt | PaintingsExhibitedAtCount | PaintingSchool | Influencedby | Influencedon | Pupils | Teachers | FriendsandCoworkers | Contemporary |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Nuno Gonçalves | Portuguese | Kingdom of Portugal | male | Early Renaissance | Early Renaissance | {Early Renaissance:10} | NaN | NaN | 1425.0 | portraitist, painter, musician, researcher | Lisbon, Portugal | {Lisbon:1},{Portugal:1} | NaN | National Museum of Ancient Art (MNAA), Lisbon,... | NaN | NaN | NaN | NaN | No |
Cristovao de Figueiredo | Portuguese | Kingdom of Portugal | male | Northern Renaissance | Northern Renaissance | {Northern Renaissance:12} | NaN | NaN | 1450.0 | painter | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | No |
Giovanni da Milano | Italian | NaN | male | Proto Renaissance | Proto Renaissance | {Early Renaissance:86} | Valmorea | NaN | 1325.0 | painter, teacher | Netherlands, Amsterdam | {Amsterdam:1},{Netherlands:1} | Lombard School | Giotto, | NaN | NaN | NaN | NaN | No |
Bernard Van Orley | NaN | Southern Netherlands | male | Northern Renaissance | Northern Renaissance | NaN | Brussels metropolitan area | City of Brussels | 1488.0 | painter | Bélgica | {Bélgica:1} | NaN | NaN | NaN | NaN | NaN | NaN | No |
... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
- List painters who lived in Paris:
display(artists[artists['locations'].str.contains('Paris', na=False)])  # 'locations' holds the places where the painter lived or worked
artist | Nationality | citizenship | gender | styles | movement | Art500k_Movements | birth_place | death_place | birth_year | occupations | PaintingsExhibitedAt | PaintingsExhibitedAtCount | PaintingSchool | Influencedby | Influencedon | Pupils | Teachers | FriendsandCoworkers | Contemporary |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Jean-Baptiste-Simeon Chardin | NaN | France | male | Baroque, Rococo | Baroque | {Realism:1} | Paris | Paris | 1699.0 | painter, visual artist | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | No |
Johann Anton de Peters | German | Germany | male | Rococo | Rococo | {Rococo:12} | Cologne | Cologne | 1725.0 | engraver, painter | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | No |
... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
- Get Monet's number of paintings per style (those available on WikiArt), sorted by number of paintings:
import pandas as pd
style_counts = pd.read_csv('https://raw.githubusercontent.com/me9hanics/PainterPalette/main/datasets/wikiart_artists_styles_grouped.csv')
display(style_counts[style_counts['artist']=="Claude Monet"].sort_values(by='count', ascending=False))
style | artist | movement | count |
---|---|---|---|
Impressionism | Claude Monet | Impressionism | 1341 |
Realism | Claude Monet | Impressionism | 12 |
Unknown | Claude Monet | Impressionism | 12 |
Academicism | Claude Monet | Impressionism | 1 |
Japonism | Claude Monet | Impressionism | 1 |
Many more uses of the dataset can be found in ArtProject.
This project is licensed under the MIT License. Corresponding author: Mihaly Hanics (CEU Vienna, Austria).
Contact: [email protected]
Linking painters/people/entities together:
PageRank / Wiki Connections
Wiki Connections: partial dataset
http://www.iesl.cs.umass.edu/data/data-wiki-links
smaller dataset: https://snap.stanford.edu/data/wikispeedia.html
Philosopher's web: only available after paying $10 for a pro user account
Philosophy NLP data: https://philosophydata.com/phil_nlp.zip
Network of the people connected to Francis Bacon. The network mostly contains English people born in the 16th century, so most philosophers in this list are not very relevant (there is no Kant, Nietzsche, etc.), but it is a good example of a network:
http://www.sixdegreesoffrancisbacon.com/?ids=10000473&min_confidence=60&type=network
Code for obtaining graph
import pandas as pd
import igraph as ig  # To install: conda install -c conda-forge python-igraph
people = pd.read_csv('datasets/SDFB_people_.csv')
relationships = pd.read_csv('datasets/SDFB_relationships_.csv')
#I used igraph, because it's faster than networkx, and graph-tool sucks on Windows
network = relationships.rename(columns={'id': 'relationship_id', }).drop(columns=['created_by', 'approved_by', 'citation'])
print(network.head(), '\n')
cols = network.columns.tolist()
cols = cols[1:3] + cols[0:1] + cols[3:]
network = network[cols]
network = network[network['person1_index'] != 10050190] #for some reason, there is no person with this id, I did a loop
# I followed the documentation here: https://python.igraph.org/en/stable/generation.html#from-pandas-dataframe-s
# this is important too: https://python.igraph.org/en/stable/api/igraph.Graph.html#DataFrame
g = ig.Graph.DataFrame(network, directed=False, vertices=people[['id', 'display_name','historical_significance','birth_year','death_year']], use_vids=False)
print(g.summary().replace(',', '\n'))
Code for filtering
filtered = g.vs.select(_degree = 0) #https://python.igraph.org/en/stable/tutorial.html#selecting-vertices-and-edges
g.delete_vertices(filtered)
import cairo #Needed for plotting #import cairocffi as cairo # can do matplotlib too
#layout = g.layout(layout='auto')
#ig.plot(g, layout = layout) #ig.plot(g) #looks even worse
Code for layout and plotting
layout = g.layout(layout='reingold_tilford_circular') #kamada_kawai requires too much computing, 'fruchterman_reingold' is too dense
visual_style = {}
visual_style["vertex_size"] = 5
visual_style["vertex_color"] = "blue"
visual_style['bbox'] = (900, 900)
visual_style["layout"] = layout
#ig.plot(g, **visual_style) #Commented out because it takes big memory
# Needs improvement, but it's a start
https://global.health/ has nice data on diseases such as monkeypox and Ebola, probably time-variant too.
Modeling of Biological + Socio-tech systems (MOBS) Lab: https://www.mobs-lab.org/