I live blogged Cameron Neylon’s talk today at Newcastle University, and I did it in a Wave. There were a few pluses, and a number of minuses. Still, it’s early days yet and I’m willing to take a few hits and see if things get better (perhaps by trying to write my own robots, who knows?). In effect, today was just an exercise, and what I wrote in the Wave could have equally well been written directly in this blog.
(You’ll get the context of this post if you read my previous post on trying to play around with Google Wave. Others, since, have had a similar experience to mine. Even so, I’m still smiling – most of the time
)
Pluses: The Wave was easy to write in, and easy to create. It was a very similar experience to my normal WordPress blogging experience.
Minuses: I wanted to make the Wave public from the start, but have yet to succeed in this. Adding public@a.googlewave.com or public@a.gwave.com just didn’t work: nothing I tried was effective. Also, the copying and pasting simply failed to work when copying the content of the Wave from Iron into my WordPress post in Firefox: while I could copy into other windows and editors, I simply couldn’t copy into WordPress. When I logged into Wave via Firefox, the copy-and-paste worked, but automatically included the highlighting that occurred due to my selecting the text, and then I couldn’t un-highlight the wave! What follows is a very colorful copy of my notes. I’ve removed the highlighting now, to make it more readable.
I’d like to embed the Wave here directly. In theory, I can do this with the following command:
[wave id="googlewave.com!w%252BtZ-uDfrYA.2"]
Unfortunately, it seems this Wavr plugin is not available via the wordpress.com setup. So, I’ll just post the content of the Wave below, so you can all read about Cameron Neylon’s fantastic presentation today, even if my first experiment in Wave wasn’t quite what I expected. Use the Wave id above to add this Wave to your inbox, if you’d like to discuss his presentation or fix any mistakes of mine. It should be public, but I’m having some issues with that, too!
Cameron Neylon’s talk on Capturing Process and Science Online. Newcastle University, 15 October 2009.
Please note that all the mistakes are mine, and no-one else’s. I’m happy to fix anything people spot!
We’re either on top of a dam about to burst, or under it about to get flooded. He showed a graph of data entering GenBank. Interestingly, the graph is no longer exponential, and this is because most of the sequence data isn’t goinginto GenBank, but is being put elsehwere.
The human scientist does not scale. But the web does scale! The scientist needs help with their data, with their analysis etc. They’ll go to a computer scientist to help them out. The CS person gives them a load of technological mumbo jumbo that they are suspicious of. What they need is someone to interpolate the computer stuff and the biologist. They may try an ontologist, however, that also isn’t always too productive: the message they’re getting is that they’re being told how to do stuff, which doesn’t go down very well. People are shouting, but not communicating. This is because all the people might want different things (scientists want to record what’s happening in the lab, the ontologist wants to ensure that communication works, and the CS person wants to be able to take the data and do cool stuff with it).
Scientists are worried that other people might want to use their work. Let’s just assume they think that sharing data is exciting. Science wants to capture first and communicate second, ontologists want to communicate, and CS wants to process. There are lots of ways to publish on the web, in an appropriate way. However, useful sharing is harder than publishing. We need the agreed structure to do the communication, because machines need structure. However, that’s not the way humans work: humans tell stories. We’ve created a disconnect between these two things. The journal article is the story, but isn’t necessarily providing access to all the science.
So, we need to capture research objects, publish those objects, and capture the structure through the storytelling. Use the MyTea project as a example/story: a fully semantic (RDF-backed) laboratory record for synthetic chemistry. This is a structured discipline which has very consistent workflows. This system was tablet-based. It is effective and is still being used. However, what it didn’t work for was molecular biology / bioengineering etc — a much wider range of things than just chemistry. So Cameron and others got some money to modify the system: take MyTea (highly structured and specific system) and extend it into molecular biology. Could they make it more general, more unstructured? One thing that immediately stands out for unstructured/flexible is blogs. So, they thought that they could make a blog into a lab notebook. Blogs already have time stamps and authors, but there isn’t much revision history therefore that got built into the new system.
However, was this unstructured system a recipe for disaster? Well, yes it is — to start with. What warrants a post, for example? Should a day be one post? An experiment? There was little in the way of context or links. People who also kept a physical lab book ended up having huge lists of lab book references. So, even though there was a decent amount of good things (google indexing etc) it was still too messy. However, as more information was added, help came from an unexpected source: post metadata. They found that pull-down menus for templates were being populated by the titles of the posts. They used the metadata from the posts and used that to generate the pull-down menu. In the act of choosing that post, a link is created from that post to the new page made by the template. The templates depend on the metadata, and because the templates are labor saving, users will put in metadata! Templates feed on metadata, which feed the templates, and so on: a reinforcing system.
An ontology was “self-assembled” out of this research work and the metadata used for the templates. Their terms were compared to the Sequence Ontology and found some exact matches and some places where they identified some possible errors in the sequence ontology (e.g. conflation of purpose into one term). They’re capturing first, and then the structure gets added afterwards. They can then map their process and ontologies onto agreed vocabularies for the purpose of a particular story. They do this because we want to communicate to other communities and researchers that are interested in their work.
So, you need tools to do this. Luckily, there are tools available that exploit structure where it already exists (like they’ve done in their templates, aka workflows). You can imagine instruments as bloggers (take the human out of the loop). However, we also need tools to tell stories: to wire up the research objects into particular stories / journal articles. This allows people who are telling different stories to connect to the same objects. You could aggregate a set of web objects into one feed, and link them together with specific predicates such as vocabs, relationships, etc. This isn’t very narrative, though. So, we need tools that interact with people while they’re doing things – hence Google Wave.
An example is Igor, the Google Wave citation robot. You’re having a “conversation” with this Robot: it’s offering you links, choices, etc while having it look and feel like you’re writing a document. Also is the ChemSpider Robot, written by Cameron. Here, you can create linked data without knowing you’ve done it. The Robots will automatically link your story to the research objects behind it. Robots can work off of each other, even if they aren’t intended to work together. Example: Janey-robot plus Graphy. If you pull the result from a series of robots into a new Wave, the entire provenance from the original wave is retained, and is retained over time. Workflows, data, or workflows+data can be shared.
Where does this take us? Let’s say we type “the new rt-pcr sample”. The system could check for previous rt-pcr samples, and choose the most recent one to link to in the text (after asking them if they’re sure). As a result of typing this (and agreeing with the robot), another robot will talk to a MIBBI standard to get the required minimum information checklist and create a table based on that checklist. And always, adding links as you type. Capture the structure – it’s coming from the knowledge that you’re talking about a rt-pcr reaction. This is easier than writing out by hand. As you get a primer, you drop it into your database of primers (which is also a Wave), and then it can be automatically linked in your text. Allows you to tell a structured story.
Natural user interaction: easy user interaction with web services and databases. You have to be careful: you don’t want to be going back to the chemical database every time you type He, is, etc. In the Wave, you could somehow state that you’re NOT doing arsenic chemistry (the robot could learn and save your preferences on a per-user, per-wave basis. There are problems about Wave: one is the client interface, another is user understanding. In the client, some strange decisions have been made – it seems to have been made the way that people in Google think. However, the client is just a client. Specialized clients, or just better clients, will be some of the first useful tools. In terms of user understanding, all of us don’t quite understand yet what Wave is.
We’re not getting any smarter. Experimentalists need help, and many recognize this and are hoping to use these new technologies. To provide help, we need structure so machines can understand things. However, we need to recognize and leverage the fact that humans tell stories. We need to have structure, but we need to use that structure in a narrative. Try to remember that capturing and communication are two different things.
The sound of two hands Waving
October 13, 2009
I got a Google Wave account (grin) via Cameron Neylon on Monday morning (thanks, Cameron!). I’m trying not to get caught up in all the hype, but I can’t help grinning when I’m using it, even though I don’t really know what I’m doing, and even after seeing the Science Online Demo and a couple Google videos.
But where and how will we get the benefit of the Wave?
I’ve read a few articles, and played around a little, and chatted with people, but I’m still a complete novice. So, I’m not going to talk about technical aspects of waving here. However, even now I can see that the power of Wave will not be in what’s available by default (as was the case with Gmail – you got an account, started using it, and that was pretty much it). It will be in the new applications, interfaces and most especially the Robots that will be riding the Wave with us where the most value will be. OK, so I’ve only had an account for one day, but I think even as a beginner, I can see it is in what we will create for ourselves and our communities to use that will make or break this new thing. And, as ‘we‘ are so much a requirement for this to work, my next point becomes pretty important.
What it will really take to get the best out of Wave for us researchers and scientists?
It will take many, many scientists participating. Social networking needs to get a lot more important to people who currently may just make use of e-mail and web browsing. This is exciting, but we’ll need their help. A very good slideshow by Sacha Chua about this can be found on Slideshare. Use it to convince your friends!
First steps.
As for me, I’ll be waving with both hands this Thursday at 2pm, when Cameron Neylon comes to talk about open science, Google Wave, and more. Unless Cameron is a fantastic multitasker, I may be the only one with an account at the presentation. Not sure how interesting it will be if I am the only one waving. I’ll keep you updated, and post my experience with live blogging with Wave here, and let you know how it goes.
I’m also hoping that I can get some of my research out there into the wider world via Wave robots. I have an interest in structured information (ontologies, data standards etc) and think this may lead to some interesting things.
So, the sound of two hands waving? Pretty quiet, I think. But add another few hundred pairs of hands, and things may get a lot louder.
Inspiring Science Autumn Newsletter
October 12, 2009
I recently attended an open day at the Science Learning Centre North-East (SLCNE) in my role as half of a Teacher Scientist Network (TSN) partnership. There Louise, my partnered teacher, and I gave a short presentation on how the TSN works, and more specifically about our efforts last year. I enjoyed talking about what a positive experience it was, and also enjoyed seeing the other initiatives (such as Science in the Spotlight and Scientists@Work) that the SLCNE manages.
As an extra bonus, the newsletter for this Centre for Autumn had an article on my TSN partnership with Louise (hence the categorization of this post into the “Self Reference” section). Not only can you read the interview with me and Louise, but you can also read about:
- ‘Liquid Science’ in March 2010 at Newcastle’s Liquid and Diva Nightclub
- How you can get funding from the Royal Society (up to £3000!) for “teachers and scientists or engineers to work together on creative investigations involving 5–16 year olds”. The funding goes straight to the school, and the closing date is November 6th. More information: www.royalsoc.ac.uk/education/partnership.htm.
- Details on the 2009 SLCNE Christmas Lecture from Dr. Laura Grant. She’ll be giving a ‘Cool Science’ presentation “which looks at some of the strange things that happen at low temperatures. The lectures will be performed at four venues across the North East during the first week of December and are suitable for Year 6/7 pupils.” More information: www.slcne.org.uk/christmas.
I strongly encourage you all to join in with your local SLC or branch of TSN, and to have a look at this season’s newsletter!
Science Online London 09: Thoughts, not Transcript
August 24, 2009
First off, I’d like to thank the many people who re-tweeted my blog posts throughout Science Online London this past Saturday. With your help, Saturday was my best day ever for visits to the site. I hope people enjoyed my posts, and perhaps stayed long enough to find out what I blog about when I’m not at conferences (those I’m most proud of include a day I spent at a primary school last year, and a co-authored post with Frank Gibson on attribution versus citation).
Those solo09 posts I wrote on Saturday were intended mainly as notes, as a transcript of what went on. It helps me concentrate to take notes, and due to my fabulous parents talking me into taking typing classes in high school, I am able to (mostly) keep up with presentations! But I wasn’t the only one blogging, and many people since Saturday have been writing up and posting their thoughts: Martin Fenner has been keeping track of what seem to be all blog posts about solo09, so please visit his post to find out what everyone thought of the day.
My blog posts on the day were a record of the day’s presentations, from my point of view. Today’s post is more personal – it was my first time at a Science Online conference, and this is a record of my impressions.
The day started very early for me, though I was not alone in this. I was on a 6am train, and managed to find my way to the Royal Institue (my first visit) before 8:30am. Luckily, they had already laid out the name badges of people whose first name began with “A”, and I grabbed my badge and went to see how many people were around. After geeking out way too much when I met Cameron Neylon for the first time in the physical world (when discussing online avatars with him I tried a bad pun referencing the recent Guild music video about avatars which fell a bit flat), I went for a wander around the building. In one of the libraries I found this book, which amused me:
Then I wandered upstairs and had a look at the Faraday Theatre, with its surprisingly uncomfortable seating but beautiful fittings and fantastic ambience. Just a tip though – watch out for the Ambulatory Displays up there on the first floor. The British Library had a table set up in a prime position opposite the Faraday Theatre, and at that table I met some BL people as well as Stewart Wills, an Editor for Science. I had never spoken with a Science editor before, and I had a really enjoyable conversation with him and the BL people about wildflowers and ontologies for 20 minutes or so, until it was time for the conference to start.
I won’t go heavily into the presentations, as I have already covered them. Suffice to say I thought they were all very interesting, often entertaining, and definitely educational. While I would have loved to have much more time for open discussion at the end of each presentation, that didn’t spoil my enjoyment. I had my first experience with Second Life, and watching the odd behaviors of the avatars in it was almost hypnotic. One seemed to be playing the spoons or typing on an invisible keyboard or something. Many others seemed to be hanging off an invisible wire in their back, and others flounced, tilted alarmingly, or even looked attentive.
I will choose a favorite presentation though: I loved the theatrics and the content of John Gilbey. He presented a number of speculations about the far future, and said that we could all vote for our favorite by emailing him in the next week. Then, he’ll do his best to write about it in the context of the University of Rural England and get it into print
Fun! You can email him at gilbey@bcs.org.uk.
I had a number of good conversations with Sara Fletcher of Diamond Light Source about power cables, last year’s Science Online, and meeting people in the real world who you’ve gotten to know only through the (unreal?) world of the Internet. We were the ones sitting near the annoying ringing iPhone during the metrics/statistics talk by Richard Grant and others. No, it was NOT our phone, and yes, we tried to find it to turn it off but were unsuccessful.
It was great seeing bloggers made flesh: Petra Boynton, Jack of Kent, Cameron Neylon and Peter Murray-Rust were just a few of the people I either listened to or spoke with for the first time. Peter, Phil Lord and I had a great conversation about ontologies OWL ontologies – well, about semantics.
I left London that evening, this time on a full train of tired people wanting to get home that was in stark contrast to the quiet, empty train and the beautiful sunrise that began the day. I had a great experience and my thanks goes out to all the organizers and people who helped make Science Online London work. I am now more interested in Google Wave, still want a single unifying identifier for me and my online personas (one identifier per persona, or one per person?) and am more aware of the legal implications of blogging. I feel like I’ve increased not just my knowledge of all things science and online, but also the size of my online science community, which is a community that has enriched my research environment and work life more in the past year than I ever thought possible. The Life Scientists, Science 2.0, Twitter and my good friend Google Reader keep me in touch with all of the other blogs of science of friends and colleagues, and I’m following many more after Science Online. I am a better scientist and researcher because of my connections to this community – Thank you all!
Google Wave: Just another ripple or science communication tsunami? (Science Online London 2009)
August 22, 2009
Cameron Neylon, Chris Thorpe, Ian Mulvany
Google Wave is a new tool for communication and collaboration on the web that will be released later this year. For this session we plan a live demo of the prerelease version of Google Wave to show off the potential for scientists.
What can you do with a wave? Make robots, embed into blog, build gadgets. Robots (server side) can inspect data within a wave, then go and do something about it and change the content within a wave. For the geeks, it’s powered by webhooks. You can put waves anywhere, into any HTML file. Changes are immediately propogated to every embedded wave. Therefore, if you make a comment on a waved blog, that comment appears wherever people have requested it. It makes flame wars almost immediate
Gadgets (client side) extend the functionality of waves, and are xml-based and store their data within a Wave. Changes can be replayed and are stored on a per-user/wavelet basis.
Cameron then live-demoed a wave by writing something “like an email” and showed how it propogated to other users. (Ian said “o noes! i iz in ur wave editing ur text”. Highly amusing. But they’re just showing versioned instant messaging, right now. cool, but I would like to see more.) He can invoke the Guardian robot with “?guardian” and the search results are put right back into the wave. There’s also a robot for chemspider, and another for producing Latex figures (Watexy).
They also showed Igor, a robot which helps retrieve citations. Also Graphy which, as the name suggests, produces basic graphs from text that look suspiciously like what you might want an SBML pathway to look like!
The entire Google Wave system is going to be open-sourced. Most of the client architecture is HTML 5 and Javascript. Google had a robot (not public) that would translate into another language as you typed – supposedly quite resource hungy?
What would make people use it who aren’t geeks? At the moment, it is difficult to get used to using the interface. Also, it doesn’t yet integrate with email as we know it. However, Cameron Neylon says that it’s easier than it looks to use, so once they sort the interface it should become popular.
IM: If Google wave is as easy to install by institutions as a wiki setup, then it might work and really help collaborations and sharing. Even more so if Wave successfully integrates email.
More short notes about the demo and discussion:
- CN: I have the feeling it will be very very good at taking collaborative note taking during talks.
- People can edit each other’s comments, and there is versioning so you can see how things have changed.
- Wave is much more efficient in terms of resources – not a whole series of gets, but instead a few puts (if I understand this correctly).
- One problem: Google Wave can’t be used offline. Is there any way to get some limited functionality offline?
Phil Lord suggested that google wave might be good for collaborative ontology development. (I agree!)
Please note that this post is merely my notes on the presentation. They are not guaranteed to be correct, and unless explicitly stated are not my opinions. They do not reflect the opinions of my employers. Any errors you can happily assume to be mine and no-one else’s. I’m happy to correct any errors you may spot – just let me know!
Far out: Speculations on science communication 50 years from now (Science Online London 2009)
August 22, 2009
John Gilbey
This session will discuss future models for online science communication – but on a timescale well beyond the usual technology horizon. To judge the role of science communication in possible futures, we need to assess how research itself will be carried out in the future. In many scenarios online communication becomes the core enabling force – rather than a useful adjunct – and we can speculate as to the form that communication might best take.
He will be discussing science communication “in the broadest sense”.
(Who is he? A science fiction author; a former research scientist. You may have seen his work in Nature, New Scientist, Times Higher Education, Guardian, Nature Physics. He is speaking only on behalf of himself, and not on behalf of any of his employers.)
If he could distill everything he’s learnt about the scientific process and create the fictional “University of Rural England (URE)”, where things are not always as they seem, and where students and faculty suffer the same weaknesses. Then he switches to a synopsis of the first Nature journal in 1869, where in the editorial TH Huxley said the people in 50 years would look at the back issues of Nature “not without a smile”. We’re in danger of losing that connection, he says.
Then he moves on to talking about Second Life, and speaks about physical representations of a virtual space being captured in a digital media and re-presented back in SL.
To his generation, internet/computers/etc are still the future, even though they’re here. To younger people, they are the present – this is a different way of thinking.
Three options for the future: 1) steady state 2) step change (significant developments) 3) surprise parties (major unexpected advances completely changing the game). So, back to URE, the fictional place where he has sci-fi story ideas: machine-enhanced clairvoyance for science quality auditors; network developments expose a temporal portal to allow historic (dead) research leaders to be employed on projects; digitally-supported thought control of higher mammals. Speculation 1: in 50 years’ time, the world political, economic and social structure will change radically. In that case, who will our sponsors be for research? How “free” will the science community be? Will science be encouraged to engage with the wider social environment? If it was your job on the line, would you lie or toe the party line?
Will you suffer for your integrity?
Speculation 2: Virtual reality in some form will become ubiquitous in society across the globe: location becomes irrelevant, scientists become nomadic, opportunities for citizen science increase, social involvement with science grows. Speculation 3: significant environment events will spur major increases in research activity: science profile is raised significantly, there is a greater need for communication of science. Speculation 4: Society crashes totally following an unrecoverable Internet failure.
Email him in the next week to vote as to which scenario you’d like to see the URE address, and he’ll do his best to get it into print! He’s at gilbey@bcs.org.uk
(This was one of my favorite talks of today.)
Question: will our universities be around in 50 years’ time? JG: I think they will be, but in a radically-different form. “VR” classes, for one.
And there was at least one other Science Online London attendee blogging this presentation – take a look!
Please note that this post is merely my notes on the presentation. They are not guaranteed to be correct, and unless explicitly stated are not my opinions. They do not reflect the opinions of my employers. Any errors you can happily assume to be mine and no-one else’s. I’m happy to correct any errors you may spot – just let me know!
…of managing online scientific communities
Arikia Millikan, Corie Lok, Ijad Madisch
This session will provide you with an inside look into how online science communities are built and maintained. We will discuss how to manage expectations, social/cultural issues, the role of moderation, differences between science communities and ‘other communities’, and how to encourage diversity/debate whilst maintaining some sort of order. You’ll come away with tips on how to successfully build community and maintain it throughout flame wars and other tribulations.
Arikia Millikan (scienceblogs):
Scienceblogs started as a network, and ended up as a community. It was a very successful project. To do things well, you need a diverse selection of bloggers, which makes the cat herding more difficult. To have a successful blogging community, you first need a solid technological foundation. Secondly, you need acknowledgement, accessibility and analytics (it can be a very good motivational tool to see who is looking at your site and how often). Higher up in the hierarchy, you need to allow identity and individuality for your bloggers. When all these aspects are present you have a good community, but if any of these components fail you start to have problems.
If needs aren’t met, a lot of the energy that normally gets sent to achieving science is made more destructive. This is when you get the dreaded flame wars, which has happened in the past at scienceblogs. This is where the community management skills come in. Something important to know about such situations is that they are not always bad – you can learn important lessons from them. There are also a lot of rewarding aspects working at scienceblogs: it’s never boring, and there’s lots of nice science, and it is emotionally rewarding (all the benefits of a strong community).
Corie Lok (Nature Network):
She described what she’s seen work in a community of bloggers. Bloggers in such a community need to remain online and engage with the commenters. Friends can help you do this. You have to find the right balance in the volume of your postings. The main thing that the bloggers struggle with is incentive: it’s hard to get payback from the mainstream community. Seeding the blogging community with people other scientists want to interact with has been very important in making the community successful. Having forums such as the Ask the Nature Editor forum, and on fluorescence imaging in the life sciences has been really useful.
There’s a new website for Parkinson’s set up by the Michael J Fox foundation that has been really successful, and is a really great model of how things could proceed.
Ijad Madisch (co-founder of ResearchGATE):
He’s originally a medical doctor and is also working in Computer Science. ResearchGATE helps researchers with targeted, rapid-response Q&A and provides efficient literature search and professional bookmarking. For institutional users, ResearchGATE offers a comprehensive communication platform and collaboration tools for promoting inter-disciplinary research.
The greatest challenges are to serve a variety of disciplines and to perform community management on a large scale. However, they’re proud of the global online community they’ve created, and the feedback they’ve received. There is a new API coming soon to enable scientists to add their own applications to the ResearchGATE platform.
Discussion:
Question: What percentage of your registered users are active? IM: 30-35% log in at least once per month in ResearchGATE. Similar numbers for Nature Network.
Question (Matt Brown): How do you guard against spam attacks, and what do you do about potentially libelous (or similar) comments, or other legal problems? CL: There is spam software in place. In terms of legal aspects, UK libel law puts publishers in a difficult position: publishers are better off not moderating content in a legal sense. So you have to find a balance. IM: They don’t have that many spam problems, but they have created a reporting system. ResearchGATE has one lawyer, working full time. AM: They’ve had some pretty bad spam problems in the past, but the community helps out a lot with this. Bloggers also moderate their own comments. Legally, there hasn’t been that many problems yet.
Question: Is there an optimum group size for a blogging or online research community? CL: She hasn’t seen any correlation between group size and activity. AM: There may be capacity issues – as the group gets larger, it’s harder to sustain the interpersonal aspects of it.
Question: What is the effect of networks like this on scientists’ productivity – it would be great if that information were published more? Are the networks ever acknowledged in the paper? CL: There have been fruitful collaborations that came out of Nature Network. IM: Collaborations at ResearchGATE have resulted in at least one paper. However, less tangible things such as discussions are harder to quantify.
Question (Cameron Neylon): It would be really nice to see some serious social anthropology happening with these communities, and even for the communities themselves to consider funding.
Question: There are a number of social networks available. How do you coordinate where and when to pull the information from? CL: Over time, we just see which ones survive. But it’s good to have choice. IM: They want to connect ResearchGATE to science-related comments on FF and Twitter. AM: It’s quite exciting right now to see what features and useability and tools will take off and be successful.
Please note that this post is merely my notes on the presentation. They are not guaranteed to be correct, and unless explicitly stated are not my opinions. They do not reflect the opinions of my employers. Any errors you can happily assume to be mine and no-one else’s. I’m happy to correct any errors you may spot – just let me know!


