Tag Archives: altmetrics

Chat about Research Repository #theta2015

Elements Integration – lets chat about Research Repository and populating Researcher Profiles (abstract)
Leonie Hayes and Anne Harvey

[Facilitated audience discussion of various questions only loosely related. Probably unintended that largely drew an audience of people perhaps more interested in learning about Elements than of people who had already implemented it.]

Discussion of data – Creative Commons licenses not very appropriate to datasets because immediately locks down opportunities for reuse. Creative Commons Zero is better here.

“Sunshine cleaning” – when you hang your data out to dry and everyone sees how dirty it is so you quickly clean it. [Very effective but terrifying for many researchers so I suggest an alternative might be to put the data, like the journal article, out for peer review.]

Looking at impact for Creative Works – altmetrics. Many don’t see themselves as researchers but as practitioners. Uptake of workshops is low as often working from home. The institution needs to focus on areas outside STEM and traditional metrics – these alienate scholars in other fields.

Open Access policy. Many have ideals but doesn’t translate into practice. Especially license issues. Difficulties when managing a PBRF version vs an open access version.

Social media as an agent of socio-economic change #vala14 p2

Johan Bollen Social media as an agent of socio-economic change: analytics and applications

World we live in increasingly about online connections. First computer had 1KB RAM and programmable by BASIC. Now can wake up parents in Belgium by FaceTime. Data from 2012 2.4billion internet users worldwide (15.6% Africa to 78.6% North America, 67.6% Oceania/Australia). Amount of online content staggering.

Facebook, LiveJournal, Twitter… We’re not using these networks to broadcast – they’re to collaborate socially. Many-to-many. Generates content and establishes social relations — collaboratively.

Displays xkcd cartoon re ubiquity of phones and map of usage of Twitter and Flickr. Visualising languages spoken; what things are being downloaded. Using Twitter to map discussion of beer vs church. And using it to monitor outbreaks of flu.

Wikipedia using collaboration to create content. Estimize using it to predict markets.
“Prevailing pessimism about large groups collaborating in a productive manner, absent central authority, may not be justified.” From the “madness of crowds” (wacky ideas) to “the wisdom of crowds”. On “Who wants to be a millionaire”, asking an expert gets it right 65%, asking the audience 91% right. When you ask people questions they have to guestimate an answer to, “the average of two guesses from one individual was more accurate than either guess alone”.

Galton (1907), Nature, 1949(75):450-451 – aggregating judgements of people of weight of dressed ox got within 1% of accuracy.
Condorcet Jury Theorem (1785) – even if jurors individually are rarely right, going for a majority vote the chance of being right approaches unity.
Collective intelligence – birds flocking, ants finding food.

We have telescopes to look at huge things, microscopes to look at tiny things – we need a macroscope to look at really complex things: this is computational social science studying data generated by social media. Network analysis. Natural language processing.

Epictetus “Men are disturbed, not by things, but by the principles and notions which they form concerning things”.

Sentiment analysis. eg “Affective Norms for English Words” rated along valence, arousal and dominance, OpinionFinder, SentiWordnet. We understand individual emotions well, not so much collective emotions. Diagram charting fluctuations in collective mood based on Twitter feeds; correlating with market fluctuations – discovered that the Twitter ‘calm’ mood correlated with increase in DOW three days in advance 85%. Other results have largely confirmed this using Google trends, using dataset from LiveJournal posts.

Where does collective emotion come from? Is it more than the sum of individual emotions? Do sad people flock together or do they make each other sad? Homophily (bird of a feather) prevalent in social networks. People connected to lots of people tend to be connected to other people who are connected to lots of people. (Ie the popular kids hang out with each other.) Image of political homophily on Twitter. So does mood act in the same way? Looked at reciprocal following on Twitter. Found small cluster of negative-emotion users, and larger cluster of positive-emotion users. (Don’t know where causation is.) The closer the friendship, the more reliable this was.

Application to bibliometrics: got rejected from journals so published on arXiv and got massively read and within a month cited. So looked at arXiv papers and found a weak correlation between Twitter mentions and early citations. But the problem with altmetrics: the biggest nodes are the media, big blogs etc. The number mentions doesn’t matter as who is mentioning.

Radical proposal for funding science (developed over alcohol-fueled Christmas party grumps about writing funding proposals). (Motto: “What would the aliens say?”) Fund people not projects. Science as gift-economy. Encourage innovation. Change scholarly incentives for the better. Congress should give money to scientific community – every scientist gets an equal chunk, but you have to donate a certain percentage to anyone you want (who have to donate a percentage of what they’ve received). Would lead to an uneven “but fair” distribution. [My criticism: would be susceptible to issues of implicit bias against women, people of colour, etc. However don’t know if it’d be more or less susceptible to these problems than the current system is.] Ran a simulation using network data: when F=0.5 it matches the distribution by the NSF and NIH.

Q: Risk of feedback loops?
A: Yes – citing hacking of Twitter account to post about bombs in White House leading to massive market shorting – not just people getting freaked out, algorithms getting freaked out. Positive feedback loops bad news – hopefully can set up things so instead you’ll get negative feedback loops that lead to homeostasis. Can only mitigate problems by understanding how things work.