Cleopatra


Hi again! I learned about this outlier on twitter. I saw many people wondering why was Cleopatra a trending page on Wikipedia Spanish:

Screenshot showing two tweets. The first one says 'why is Cleopatra' always a top read article on Wikipedia'. The second one says in Spanish: Could someone explain me why is Cleopatra the most visited term on Wikipedia each day'.

Then, I was curious and went to do some research on the available data to see what was going on. I came across the awesome package pageview that allows us to get some data about the number of visits of Wikipedia articles. Exactly what I needed =)! So I got the information about which were the top visited articles in the past months, starting from January 2021. After removing the Home page and Search page from Wikipedia, I realized that indeed Cleopatra was the top visited article. The following bump chart (that I made thanks to the ggbump package following this code) shows the ranking of the top 5 wikipedia articles in Spanish between January and May 2021. When the articles are not included in the top 5, they are represented on the bottom of the chart in faded gray. I hightlighted the ones that repeted the most on the top 5, these are: Cleopatra, the Periodic Table, Isabel II, and Translation. The rest of the articles of this ranking appear to be visited in relation to some punctual events.

A figure that shows a bump or ranking plot of the top visited articles on Wikipedia in Spanish. 'Cleopatra', 'Tabla periódica de los elementos', 'Isabel II del Reino Unido' and 'Traduccion' are highlighted because they appear in more than one month. Cleopatra appears always first.

After this, I went to see the exact number of visits per day that the articles were having to understand the visit patterns. I plotted some of the top articles on the following line chart and highlighted the patterns corresponding to Cleopatra, the Periodic Table (that has a periodic pattern of visits!!!), and Marie Curie. On one hand, the figure shows a huge spike on the Marie Curie article pattern that corresponds to the launch of the Radioactive movie on Netflix in Spanish on April, 15. On the other hand, Cleopatra article is constantly super visited with a daily number of visits around 100000, a lot more than the rest of the popular articles.

Time series that shows the patterns of the number of visits to some Wikipedia articles with 'Cleopatra' and 'Tabla periódica de los Elementos' highlighted.

And after seeing this, of course, I wanted to know the reason of this outlier. Related to this topic, @dredondosanchez has a bot that publishes every day the most visited articles on Wikipedia in Spanish. And I saw that he twitted this:

He made a twitter poll asking: “Should I delete Cleopatra from the top of the daily most visited articles?.” It turns out that one of the default suggestions that Google Assistant make to users is to search Cleopatra on Wikipedia. And a lot of people seem to be doing this unintentionally after some seconds in case of no answer to the app. It seems that the mistery is solved. I show you here a screenshot of my sister’s phone asking her to search Cleopatra on Wikipedia:

Screenshot of my sister's phone that shows the Google Assistant suggesting to search Cleopatra on Wikipedia.

See you soon for more random facts!

EN | ES