Portfolio

Academic and artistic projects founded in solid coding in R, Stata, Python, and php.

 

Data Scraping

From online sources data is brought home to do our bidding.

Graph Eulerisation

Graph eulerisation

A programmatical approach to the Chineese postman problem.

 

Voting networks

Tracing patterns in the voting of the Eurovision Song Contest.

 

Networks that organised competition

Doctoral thesis in Economic History - Umeå University.

Eastwards

Österled / Eastwards

My podcast - a backpacker's literary journey through time and space in the Russian-speaking world.

 

Alternative geography

A network-based world map visualisation to re-think the world.

Data scraping

Web scraping is the automated process of pulling ordered formats of data from publicly available sources where data-retrevial would be time-consuming and monotonous for humans to do manually.

I have tought data-scraping at companies and study-circles and university workshops since I got into mining data for a research project a few years back.

I've also mined to create comprehensive databases of political speeches, erotic short-stories, Swedish schools, Swedish house-prises, EU-institution documents, American patents and more. If there is publicly available data, their retrieval can be automated.

Scraped Steam traffic-speeds to Belgium 2016-2019

Result from daily data scraping of the online gaming platform Steam.

Check out my git for some data-mining tutorials as part of workshops that I have given in data-mining.

Networks that organised competion

My doctoral thesis (published here) in 2019 is a study of emerging market structures and corporate strategies related to different kinds of interfirm corporate relations and resource sharing. It is a historic case-study of 75 years of property insurance and proposes a new history of that industry in Sweden.

The study proposes a network perspective on the organisation of competition and collaboration. It finds that networks lowered firms’ cost-threshold for underwriting diversification, causing wellconnected firms to expand into new markets more easily. An essential resource to underwriters was information, and information exchange motivated several interfirm rapprochements. The driving forces for the organisational shift towards increased networking were, however, complex, and included both socioeconomic and strategic factors.

Market shares of leading Swedish property underwriters 1875-1950

Bump chart over leading Swedish property underwriters and their market-share by insurance premiums. Not published in thesis

Through networks of mutual resource sharing, the consolidation that appears in the industry after 1950 was preceded by a long historical process in which firms who would later merge developed measurably clustered network structures as early as in the 1910s. In the 1920s the networks already contributed to a high market concentration. Networks thereby conditioned the underwriting operation of individual firms as well as the structural evolution of the Swedish insurance market as a whole.

Eurovision voting patterns

Since the late 1990s, the Eurovision Song Contest has applied a system of per-country telephone votes. The voting patterns have been argued to contain predictable themes, such as mutually high points awarded between Greece and Cyprus, or between the Scandinavian countries.

Using a community-detection algorithm that assigns groups of countries based on the voting networks between 1999 and 2019, an interesting pattern emerges. The sum of points given between each pair of participating countries is much stronger between certain country-pairs than between others.

Clusters of countries by voting-patterns 1999-2019

The cluster analysis is based on a cut-off algorythm where clusters are found where within-community mutual voting is significatnly stronger than outside-comunity votes. Visualisation algorythm draws same-community countries in proximity. Country-size represents total number of points recieved 1999-2019.

Telephone-voters of culturally close countries is found to appreciate the musical contributions of their in-group members. The country-groups found using the voter-patterns show that cultural and historical clusters of countries like Scandinavia, former Yogoslavia or former Soviet states, still generally prefer the music from each other's countries.

This visualisation uses the igraph-package for R and visualises community-members near each other using the Fruchterman-Reingold algorithm which takes into account the edge-weights defined by the total mutual points awarded between each pair of countries in the Eurovision Song Contest finals of the past decades.

The scripts to mine the voting-data and perform the analysis are available as tutorials on my git.

Eastwards / Österled

Eastwards is an economic-historic backpacker's podcast journeying through time and space in the Russian-speaking world. Each leg of my trips in Russia, Georgia, Ukraine and Moldova are accompanied by historical or literary characters and episodes explore themes in 200 years of Russian history.

Eastwards

Photo: Josef Lilljegren © 2016

The podcast is in Swedish and has reached few but enthusiastic listeners. I have scheduled English summary episodes for spring 2020. As this is an ongoing low-priority project, new episodes still emerge with regular (and sometimes not so regular) intervals.

The material was recorded during my summer travels in 2018 and again during 2019. The podcast is loosely scripted. It alternates between contextualising post-production philosophical reflections about all things Ruissan, and the rougher local recordings from Russia, Georgia, Moldova and the Ukraine where I travelled for four months.

The podcast lives on iTunes and Spotify. You can also listen and explore the project on the Österled web-page

Chinese postman problem

In network analysis, a graph contains eulerian cycles if one can visit all nodes without travelling any of its edges more than once. The problem, commonly known as the Chinese postman problem or the Seven Bridges of Königsberg was solved by mathematical father-figure Leonard Euler in the 18th century as an answer to the question if one could pass across neighbourhoods of Prussian Königsberg without crossing any of its bridges more than once. The answer was no. But what is the minimum number of bridges one would have had to construct to enable a Eulerian cycle?

Old bridge Euler's new bridge

Carl Hierholzer (1873) had explained how eulerian cycles exist for graphs that are 1) connected, and 2) contain only vertices with even degrees. In a pedagogical script answering a stack overflow question, I wrote this function in R which takes any graph and attempts to make minimal manipulations of it to achieve a structure where a eulerian path is possible.

Visit my stackexchange answer or download the scripts written in R.

Alternative geography using networks

This network visualises the land borders of countries of the world. It truthfully depicts relative population sizes and honors neighbor-relations, but otherwise disreguards geograph space. How does our view of the world exist in our minds?

This alternative world map was made to facilitate the re-thinking of concepts like countries and continents, and to trick the beholder into turning the attention inwards - towards the beholder of such consepts and the cognitive functions on which we rely to order and navigate in the world. We can recognise the world in it, but need to rediscover and realign the alternative map to our established perseption in order to read the image.

Network-generated world-map layout

A world map visualised by algoritmic placement of country-nodes in a network of countries joined by land-borders.

Infographs communicate factual information about the world and permit us form our understanding of it. Once acquired, though, our understanding of the world is sticky. Humans rely on habits and acquired patterns to function since a constantly reevaluation of our surrounding would be cognatively exhausting. Still, sticking to old info is a cognitive bias and problematic trait in human behaviour. Public helath super-lecturer Hans Rosling noted that the world view of his students corresponded roughly to the socioeconomic state of the world at the time when their teachers were children.

I am fasinated by the waking up from perseptional slumbers. The moment when captured atention or kindeled interest precipitates re-thinking. Here, the statistician and the artist meet and converge over bothe objectives and means to, as Picaso said, brush away from the soul the dust of everyday life.

Download here the edited pdf and the original script in R using igraph and relational data to create the world-map network.

More portfolio projects:

Résumé

Recent professional projects and publications are listed below. Download full CV as pdf here.

Curriculum Vitæ

  • (2019) Investigator at Transport Analysis, the Swedish governement agency for transport policy advice.
  • (2019) PhD in Economic history, Umeå University
  • (2017-2019) Research assistant, Umeå Center for Gender Studies
  • (2018-2019) Freelance researcher and lecturer in informatics.
  • (2018 - ) Podcaster for Österled: recording, scripting, audio-editing and web development.

Publications

Contact

Email me at contact at Lilljegren dot com or fill in the following form to reach me. I am based in Brussels - Belgium.