In The News Outreach SOC

Finding Ada in Scientific Data: Ada Lovelace Day 2020

Today is Ada Lovelace Day, as run by the lovely people at Finding Ada and as advocated by the STEM Ambassador Programme of which I am a part. I thought about choosing a famous woman from history, or a contemporary of ours to inspire us. But what really caught my imagination was the wonderful conglomerate of women in science that I have met, worked with, and become friends with since the start of my career.

But how to properly acknowledge them? As an ontologist, my mind immediately leapt to the creation of an ontology; I could describe the women, our various associations, and how we interrelated in specific and intricate detail. I was all ready to do it when I realized that just uploading an OWL file to my blog wasn’t very visually stimulating. Also, I realized that my eagerness to create an ontology would result in my spending far more time on getting it exactly right than I actually have.1 So, although only yesterday I was scoffing at spreadsheets, I ended up using exactly that kind of “unsuitable” method to quickly do what I needed. The graph below can be copied and modified to allow you to correct any errors and add any of us that I (gasp) am bound to have forgotten.

Update 17.10.20: Groups are from: Me (blue), Dagmar (yellow), Melanie S. (green), Melanie C. (purple), Jane (turquoise), Katherine (orange), and Rachael (dark purple). Details of each person, including ORCIDs, are in the graph and further down the page. Previous iterations of the graph are at the end of the post.

Latest version with a few more connections added by Dagmar:

As this is about Ada Lovelace Day, this is a network of women. And, because of the way I have chosen to celebrate women in STEM, it includes all of those women with whom I have worked directly2. (It is necessarily self-centered, though I was certainly not aiming to center myself!) At every stage of my career, I was one of the lucky ones to have female and male bosses who actively sought out excellence based on merit. I want to celebrate the collective research power of women in the field of data science that I have chosen. I hope it’s the interconnectedness of this graph, rather than the small singular point it began from, that gives you an idea of how much of an impact women have had in our research area.3

I’ve included the ORCIDs, where I know them, to allow you all to peruse the outputs of their research should you so desire. For a few I don’t know their ORCIDs, but you’ll just have to take my word for how fantastic they are, with a little detail on just a few of them.

There’s Katherine C., who was my roommate back at Rice University during our undergraduate days. She majored in Computer Science while I was studying Biology; I didn’t understand that funny programming stuff at all back then. Little did I know that I was headed slowly but surely directly down that path! And Katherine’s interest and brilliance were definitely factors when I changed direction from Biology to Bioinformatics and ultimately to the data curation, ontologies, and data science that I’m involved with today. There’s Melanie, who I always seek out at conferences as I look forward to her sharp mind and great company. There’s Maria-Jesus and Claire, who took a chance and gave me my very first job in this industry in my early 20s. Susanna is the most insightful, gregarious and focused woman I have ever met in my career, and constantly amazes me with her understanding of the research community and how to draw the best out of all of us. Dawn was just marvellous in many ways, and a person I felt a secret association with as we both worked in the UK and came from the US. Trish is smart, engaging and I see way too little of her.

At every stage of my career, and in every conference I went to or workshop I, uh, worked at, I found brilliant women who helped each other along. There are many ways to slice a population of researchers – only one of which is by gender – but I am proud of the women with whom I have worked over the years, and this is just one small way to say thank you. Where do I find Ada? In every single woman I’ve worked with.

Thanks, Ladies.

The STEM Ambassador Hub at DEBP challenged Ambassadors to join in today, and to write about who inspired us, though lots of other groups (such as WISE) are taking part. I’d like to think that some aspiring scientists might come across this, and realize that perhaps they wouldn’t be as alone as they might think, if they chose to take a career in STEM. There are gender issues in STEM that should not be forgotten about, but today is about raising up and celebrating.

  1. I have been researching visualization tools for OWL, RDF and similar formats and have yet to find something I am completely happy with. OLS does virtually everything that I need, but you need to install a local version of it if you want to visualize your own ontologies (I think they would be justified in not accepting conglomeration of female scientists as a community-driven ontology suitable for inclusion on their site!). WebVOWL is beautiful and allows upload of your own OWL files, but I find it difficult to do all the tweaks I would like. All I wanted to do was to provide a website with a list of ORCIDs and have it pop out a suitable bit of RDF or similar as to how all of these researchers were connected (via their publications, organizations, etc). Then I could tweak the resulting RDF and run it through WebVOWL. I even tweeted about it (without success)… But, as I couldn’t find a tool to do that, and I didn’t have the time to write it, I had to find a quicker alternative. To allow the quick conversion of a list of nodes and edges to a nice visualization, I found Flourish, which is what I used to make the graph in this post.
  2. As this day is focusing on women in STEM I have not added nodes for the many men I have worked with, but you know who you are, and you’re great. 🙂
  3. I’ve included ORCIDs where I can find them, and all of the connections are from my memory (which as I have said is faulty), backed up by publicly-available information. In other words, I haven’t added any information that isn’t already out there on the interwebs. However, if you prefer not to be included in the graph, then please do let me know privately and I’ll remove you.
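
For anyone who wants to build a similar graph, the "list of nodes and edges" step mentioned in footnote 1 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the people, the group colours, and the column names (`source`, `target`, `id`, `group`) are placeholders invented for the example, not the real network and not a documented Flourish format, so adjust them to whatever your visualization tool expects.

```python
import csv
import io

# Hypothetical sample data: names and group colours are placeholders,
# not the real network from this post.
people = {
    "Person A": "blue",
    "Person B": "yellow",
    "Person C": "green",
}

# Each pair means "these two people have worked together".
worked_with = [
    ("Person A", "Person B"),
    ("Person B", "Person A"),  # the same link entered in the other direction
    ("Person A", "Person C"),
]


def unique_edges(pairs):
    """Return sorted, de-duplicated undirected edges, dropping self-links."""
    seen = set()
    for a, b in pairs:
        if a != b:
            seen.add(tuple(sorted((a, b))))
    return sorted(seen)


def to_csv(header, rows):
    """Serialise rows to CSV text that can be pasted into a spreadsheet."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(header)
    writer.writerows(rows)
    return buffer.getvalue()


edges = unique_edges(worked_with)
links_csv = to_csv(["source", "target"], edges)          # one row per connection
points_csv = to_csv(["id", "group"], sorted(people.items()))  # one row per person

print(links_csv)
print(points_csv)
```

Treating each connection as undirected keeps the sheet tidy when several people add the same link from their own side, which is what tends to happen once a graph like this is shared around.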

Previous Revisions:
Groups were added by: Me (blue), Dagmar (yellow), Melanie S. (green), Melanie C. (purple), Jane (turquoise), and Katherine (orange).


Update after Rachael Huntley (0000-0001-6718-3559) added her connections, : see

Latest update by Katherine James ( Please feel free to duplicate+edit, then let me know and I’ll include it here!

Update 16.10.20: Jane’s version plus extra edits by Melanie Courtot and me.

A version with additional small updates by Melanie Courtot and by me! Find it at to duplicate+edit.

Update 16.10.20: The stupendous Jane Lomax (ORCID: 0000-0001-8865-4321) has also extended the graph! Here it is – feel free to move things about, as it’s getting rather crowded now – perfect!

Jane Lomax’s graph from – you know you want to add to it! 🙂

Update 15.10.20: The fantastic Melanie Courtot (ORCID: 0000-0002-9551-6370) has also extended the graph! I’ve added hers here, with permission. Thanks!

Melanie Courtot’s graph from – thank you all!

Update 15.10.20: The amazing Melanie Stefan (ORCID: 0000-0002-6086-7357) has also extended the graph! I’ve added hers here, with permission. Thanks!

Melanie Stefan’s graph from

Update 14.10.20: The fabulous Dagmar Waltemath (ORCID: 0000-0002-5886-5563) has extended the graph! I’ve added hers here, with permission. Thanks!

Dagmar Waltemath’s graph from . Love how much she has added! (And apologies – I really should have added her in the first place!)
The highly non-scientific network of women in STEM that I have had the pleasure of working with over the years. Completely inadequate as I’m sure I’m missing people (feel free to edit my graph and republish it), but my point isn’t so much about the individuals as it is about how, every step of the way, there are women who help each other and lift each other up in science.
In The News Outreach

What’s your favorite taxonomic controversy?

In January I’ll be running an event with another STEM ambassador for Years 5 and 6 at a local primary school. One year will be getting the fantastic Mystery Boxes, which I love doing with any age group, and the other year is currently studying Taxonomy and Classification. I love the idea of talking about the big debates that scientists have, and how we scientists aren’t a bunch of homogeneous fact-tellers. Instead we’re messy humans who like having arguments, and I think taxonomy is one of those areas that has many arguments.

So, what debates (historical or modern) do you most enjoy hearing about within taxonomic research? Here are some ideas I have, but would love to hear some specific examples from you all:

  • DNA Barcoding (summarized nicely by the Dept of Sociology at Lancaster Uni, and a 2005 POV article in Systematic Biology),
  • Taxonomy “vandalism” (see this Smithsonian piece), which I hadn’t realized was a thing,
  • Where do hominids fit in with respect to great apes (e.g. this opinion piece)?

I’ll probably simplify the general idea behind this lesson plan and throw in some soft toy animals for the kids to classify, but if you have any interesting ideas please let me know!

Housekeeping & Self References In The News Outreach

WISE-ing up: Encouraging girls (and kids generally) in STEM

Kids Love Science

Kids love science (you should see their hands up at a STEM event!), but somehow as they get older many of them learn (or are taught) that it’s boring, or not cool. I do a decent amount of STEM Ambassador volunteering to try to ensure this change in perception never happens: I’ve made Jelly baby DNA with Key Stage 1, talked about non-standard career trajectories with kids almost ready to start university, built birds’ nests with 4 year olds… I’ve even single-handedly done combination presentation-and-practicals for an entire Junior School over the course of one day! I usually get really good feedback from teachers about the events I run, and I also get lots of support from my local STEM Ambassador Hub (one lovely lady even dropping off supplies for an event at my house on her way home!), but it’s not often that I get a letter from a child.

So imagine my pleasure and surprise when I received a letter this week from a child in the Junior School where I did the day-long event. She wrote so eloquently and earnestly. Of course I felt great that she said some lovely things about me. But what was even better is that the event seemed to really spark an interest. Irrespective of her (and all the other children’s) ultimate careers, I’m hoping that the work I do with them encourages them to face the world with open eyes and a thoughtful mind. Words like this are what really keep us STEM Ambassadors going:

Thank you so much for teaching us about DNA. You have sparked my curiosity […] I loved learning all the interesting facts […] This amazed and confused me too! I would love to learn even more about DNA […] Science week would not have been the same without you.

I absolutely agree – Science can be amazing and confusing. And weird, and wonderful, and mind blowing.

Women and Girls in STEM

Encouraging an interest in STEM for all children is at the heart of the volunteering that I do. Recently, however, I have started to learn more about how to specifically encourage women and girls into STEM careers. There’s a lot of talk in the news about gender balance and pay equality, and even the big names in tech like Microsoft have been struggling both to retain women and provide an equal playing field.

It’s not all bad news, though. Every single group and department I’ve worked in (that’s right, every single job) has had lots of diversity, and I’ve never felt neglected, belittled or sidelined. For example, the Oxford e-Research Centre (where I am currently employed) published an article today about my STEM volunteering and the recent career profiles I’ve been a part of (more on that next). But there’s still a lot of work to be done.

There is a huge drop off in the number of girls studying core STEM subjects at the age of 16. Just 35% of girls choose maths, physics, computing or a technical vocational qualification compared to 94% of boys. This reduces the number going on to do a degree or level 4 qualification in maths, physics, computer science or engineering – 9% of girls compared to 29% of boys. Source: WISE Campaign

As such, I’ve jumped on the chances I’ve been given recently to make a positive difference. The North Yorkshire Business and Education Partnership’s ‘Pen Portraits’ have been designed to give female students a glimpse into the variety of STEM based careers available to them. Through my work as a STEM ambassador, I was asked to provide one of these portraits – if you follow the above link, you’ll find me in there along with a number of other great women in STEM.

People Like Me front cover

As a direct result of NYBEP’s work, I became more involved with (and became a member of) WISE and attended a workshop discussing women and girls in STEM. Part of what WISE does is the People Like Me campaign, which creates a series of packs that STEM Ambassadors and schools can use to help girls identify the parts of their personalities that align with STEM careers. If you take a look at the “Careers in North Yorkshire and East Riding” People Like Me pack, you find me there too! The Science Museum are also doing a series of tweets about STEM Ambassadors, which I highly encourage you to peruse (FYI, you may find me amongst them).

It may seem like I’m tooting my own horn (which I am, to a certain extent, after all – this is my blog!), but the main thing that interests me is getting kids engaged in STEM, and I’m hoping that all the volunteering and STEM education skills I’m learning now together with the increased visibility of these issues will ultimately help kids get interested in STEM, stay interested in STEM, and have equal opportunities in STEM careers.


In The News Science Online

Social filtering of scientific information – a view beyond Twitter

It’s not information overload, it’s filter failure. (Clay Shirky)

Bonetta (2009) gave an excellent introduction to the micro-blogging service Twitter and its uses and limitations for scientific communication. We believe that other social networking tools merit a similar introduction, especially those that provide more effective filtering of scientifically relevant information than Twitter. We find that FriendFeed (already mentioned in the first online comment on the article, by Jo Badge) shares all of the features of Twitter but few of its limitations and provides many additional features valuable for scientists. Bonetta quotes Jonathan Weissman, a Howard Hughes Medical Institute investigator at the University of California, San Francisco: “I could see something similar to Twitter might be useful as a way for a group of scientists to share information. To ask questions like ‘Does anyone have a good antibody?’ ‘How much does everyone pay for oligos?’ ‘Does anyone have experience with this technique?'” It is precisely for such and many more purposes that scientists use FriendFeed, which allows the collection of many kinds of contributions, not just short text messages.

Also in contrast to Twitter, comments to each contribution are archived in that context (and without a time limit), providing a solid base for fruitful, threaded discussions. In your user profile, you can choose to aggregate any number of individual RSS or Atom feeds, including scientific publications you bookmark in your online reference manager (e.g. CiteULike or Connotea), your blog entries, social bookmarks (Google Reader, etc.), and Tweets; and any other items you wish to post directly to your feed. You then look for other users whose profile is relevant to your work and subscribe to them. Every individual item posted in your subscriptions will then appear on your personalized FriendFeed homepage, plus optionally a configurable subset of the feeds you subscribed to. You can choose to bookmark (‘like‘) any of these items (Facebook copied this ‘like’ functionality just before it bought FriendFeed), comment on them, and share discussion threads in various ways.

At first, this aggregation of information and threaded discussions might seem daunting. However, the stream of information can be channeled by organizing it into separate sub-channels (‘lists’; similar to but more versatile than ‘folders’ in email), according to your personal preferences (e.g. one for search alerts). In addition to individual users, you can also subscribe to rooms that revolve around particular topics. For example, “The Life Scientists” room currently has 1,267 members and imports one feed.

The feature that makes FriendFeed truly useful is its social filtering system. Active discussions move to the top of your FriendFeed homepage with each new addition, which automatically brings them to the attention of you and everyone else who reads those feeds. In a sense, the most current and the most popular entries compete for attention at the top, making notifications unnecessary. This means that your choice of both rooms and subscriptions affects and filters the content you see. In that way, for instance, you could set your preferences such that you would only see papers with a certain minimum number of ‘likes’ among your colleagues. Alternatively, you can opt to hide items with zero likes or comments, ensuring that only those that someone found interesting will reach you. Thanks to a very fine-grained search functionality, threads also remain easily retrievable.
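
The filtering rules described above can be made concrete with a small Python sketch. It is an illustration only, not FriendFeed's actual implementation or API: the `Item` fields, the `social_filter` function and the sample feed are all hypothetical, and "zero likes or comments" is read here as "no likes and no comments".

```python
from dataclasses import dataclass


@dataclass
class Item:
    """One feed entry (hypothetical structure, for illustration only)."""
    title: str
    likes: int = 0
    comments: int = 0
    last_activity: int = 0  # larger value = more recent like or comment


def social_filter(items, min_likes=0, hide_silent=False):
    """Keep items that pass the reader's preferences, most recently active first."""
    kept = []
    for item in items:
        if item.likes < min_likes:
            continue  # below the minimum number of 'likes'
        if hide_silent and item.likes == 0 and item.comments == 0:
            continue  # nobody has reacted to this item yet
        kept.append(item)
    # Active discussions rise to the top with each new addition.
    return sorted(kept, key=lambda item: item.last_activity, reverse=True)


feed = [
    Item("Paper on oligos", likes=3, comments=1, last_activity=10),
    Item("Unnoticed preprint", last_activity=12),
    Item("Antibody question", likes=0, comments=4, last_activity=15),
]

visible = social_filter(feed, hide_silent=True)  # drops the unnoticed preprint
popular = social_filter(feed, min_likes=2)       # only well-liked items

print([item.title for item in visible])
print([item.title for item in popular])
```

Sorting by the latest activity rather than by posting time is what lets an old but lively thread compete with brand-new entries for the top of the page.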

Some of the synergistic effects of the many scientists interacting on FriendFeed are already apparent at this early stage of adoption. FriendFeed provides a convenient way to microblog from conferences by means of dedicated threads or discussion rooms created for the event, thus allowing participants to share comments within and across sessions, or even with people not physically present at the meeting. Such conference coverage has even received direct (e.g. ISMB09, BioSysBio09) or indirect (e.g. ISMB08) support from the conference organizers.

Above and beyond conference coverage, scientists use FriendFeed to share papers, experiences on laboratory equipment, resources for teaching, or anything else commonly asked at mailing lists. A number of real-world scientific collaborations have already been sparked from such interactions. Collaborative grant proposals have been initiated, submitted and some of them approved after the idea was passed around and discussed on FriendFeed. Several bioinformatics problems have been solved by code-sharing and advice. Articles in scientific journals have been published by FriendFeed users after meeting and discussing on the platform [1-5].

Of course, since FriendFeed was not designed for scientists, there is room for improvement in terms of usability for scientific purposes. For instance, files can only be uploaded upon starting a thread, not while commenting on it, and there is currently no functionality that derives a measure of reputation for a user from his/her contributions (though the wide-spread use of real names somewhat allows that to be imported). As with all online contributions, citability and long-term archiving are unresolved issues, as is the permanence of services whose source code is not public. Fortunately, the development of social networks tailored to the needs of scientists is actively being pursued from various angles. The Polymath projects, in which researchers collaborate online to solve mathematical problems, provide a number of examples. The recent award of two NIH grants of over US$10M each for exactly such purposes is another. Ultimately, the continued enthusiastic adoption of the sophisticated variants of social filtering tools by a broad community of researchers interested in sharing their science will only increase the usefulness for and thus the capabilities of the online scientific community.


Bonetta, L. (2009). Should You Be Tweeting? Cell, 139 (3), 452-453 DOI: 10.1016/j.cell.2009.10.017

1 Lister A, Charoensawan V, De S, James K, Janga SC, Huppert J (2009) Interfacing systems biology and synthetic biology. Genome Biol 10(6): 309.
2 Saunders N, Beltrão P, Jensen L, Jurczak D, Krause R, et al. (2009) Microblogging the ISMB: A New Approach to Conference Reporting. PLoS Comput Biol 5(1): e1000263.
3 Neylon C, Wu S (2009) Article-Level Metrics and the Evolution of Scientific Impact. PLoS Biol 7(11): e1000242.
4 Daub J, Gardner PP, Tate J, Ramsköld D, Manske M, Scott WG, Weinberg Z, Griffiths-Jones S, Bateman A (2008) The RNA WikiProject: community annotation of RNA families. RNA 14(12): 2462-4.
5 Huss et al. The Gene Wiki: community intelligence applied to human gene annotation.

Acknowledgment: This comment has received input from a number of FriendFeed users, as detailed in this thread, and was jointly blogged today by Björn Brembs (FriendFeed; blog post), Allyson Lister (FriendFeed; this blog post) and Daniel Mietchen (FriendFeed; blog post).

In The News Outreach

Inspiring Science Autumn Newsletter

I’ve been meaning to link to this Autumn’s Inspiring Science newsletter, put out by Claire Willis and others at the Science Learning Centre North-East. Not only does it have interesting articles on the science outreach they’ve been involved with recently and what’s coming up in the near future, but it also has a short article on me and my partnered teacher, Louise, as part of the Teacher Scientist Network. Find more about the programme on the Inspiring Science website. Enjoy!

In The News Semantics and Ontologies Standards

Science Commons provide a list of considerations for researchers looking to license their ontology

Back in March, I wrote a blog post about my experiences trying to find out a) if ontologies should be licensed, b) if ontologies could be licensed, and c) what sort of license would be appropriate. After all, it isn’t clear what sort of thing an ontology is: is it software, or is it a document, or is it something else completely? In this post, I included a response I had received from the nice folks over at Science Commons, giving their perspective on the situation.

Today, I came across a Science Commons blog post by Kaitlin Thaney announcing OWL 2. In it, she also mentions that Science Commons now have a Reading Room article on Ontology Copyright Licensing Considerations which is well worth a read. It updates the information contained in my March post, and provides some useful thoughts on how we should go about licensing ontologies. The section below was the part that particularly caught my eye:

For sharing ontologies in a community or publicly, it would be prudent to think about copyright and licensing. For example, the ontology creator could say that “to the extent I may have copyright in my ontology, I license it in the following way.” In that way, she can reassure the community that even in the event copyright is later found to exist, they may rely upon her offer of a license. This provides an important “safety net” for the community of users, given the uncertainty about whether a given ontology may be copyrightable.

The above section seems to be the biggest new point compared with their earlier statement. While they primarily recommend CC0, they do acknowledge that many researchers may wish to choose an attribution-based license such as the CC Attribution license.

If you create ontologies, then you should read this article: it’s short, easy to understand, and gives you the information you need to make your own decisions.

Housekeeping & Self References In The News Outreach

Inspiring Science Autumn Newsletter

I recently attended an open day at the Science Learning Centre North-East (SLCNE) in my role as half of a Teacher Scientist Network (TSN) partnership. There Louise, my partnered teacher, and I gave a short presentation on how the TSN works, and more specifically about our efforts last year. I enjoyed talking about what a positive experience it was, and also enjoyed seeing the other initiatives (such as Science in the Spotlight and Scientists@Work) that the SLCNE manages.

As an extra bonus, the newsletter for this Centre for Autumn had an article on my TSN partnership with Louise (hence the categorization of this post into the “Self Reference” section). Not only can you read the interview with me and Louise, but you can also read about:

  • ‘Liquid Science’ in March 2010 at Newcastle’s Liquid and Diva Nightclub
  • How you can get funding from the Royal Society (up to £3000!) for “teachers and scientists or engineers to work together on creative investigations involving 5–16 year olds”. The funding goes straight to the school, and the closing date is November 6th. More information:
  • Details on the 2009 SLCNE Christmas Lecture from Dr. Laura Grant. She’ll be giving a ‘Cool Science’ presentation “which looks at some of the strange things that happen at low temperatures. The lectures will be performed at four venues across the North East during the first week of December and are suitable for Year 6/7 pupils.” More information:

I strongly encourage you all to join in with your local SLC or branch of TSN, and to have a look at this season’s newsletter!

Data Integration In The News

Two Journal Special Issues: Big Data, and Semantic Mashups for Bioinformatics

Both of these special issues are worth a look, as some of the papers look pretty interesting. I'll spend a little time in a later post on any articles I find particularly relevant.

  • Semantic Mashup of Biomedical Data Special Issue of the Journal of Biomedical Informatics. This includes a review article by Carole Goble and Robert Stevens: State of the nation in data integration for bioinformatics
  • Nature's Big Data Special Issue. The article entitled "How do your data grow?" was one of the many articles in this issue that I enjoyed. It's interesting to note that these problems in management and curation of big data are only now getting special attention in Nature. When I worked at the EBI, it was common knowledge among the database curators that 1) it would be very difficult for them to find other work as curators if they left the EBI, and 2) the time and high skill level it takes to annotate and curate biological database entries means that it is very difficult to get high coverage in such databases. It's nice to finally see some recognition of all the work the biocurators do by a journal such as Nature. Finally, there are high-profile articles stating that curation begins at home, with the researcher, and that curation needs much more support from researcher-level all the way up to the level of the database curators.



In The News Software and Tools

Google Notebook and the Google Cloud in Bioinformatics

Ever since my friend Frank posted an article about how he has one foot in the internet "cloud", I have been a little worried about how much of my cloud is composed of Google apps. I currently have a Google cloud composed of Google Mail, Calendar, Reader, Docs, Groups, and most recently Notebook. It's convenient, the apps are useful, and it means I can access much of the information relevant to my work in bioinformatics wherever I am. I'm by no means a Google-only person (I use Vox rather than Blogger, for instance), and my set of a dozen or so "home" pages that I like to load whenever I open Firefox doesn't even have Google in a majority of tabs. But I can't help feeling slightly guilty and a little bit worried about how dependent I've become on Google products. Then on top of that, I can't help feeling silly to be feeling guilty. It's my choice, after all!

So, moving past all that: today I've added Google Notebook to my cloud. I wanted to have somewhere where I could stash all those interesting papers that are sent to me each week via my personal tame searches on PubMed (aka MyNCBI: perform a regular search, click on "Save Search" and then ask for email updates to be sent to you). I tried Connotea but never went and actually looked at it. I tried reading them when the emails arrived, but the time I have to read papers is never regular. So I deliberately had a look at the Google App list, and found Notebook. You can make notes on anything, but there is also a Firefox plugin that allows you to just highlight any portion of a web page you want and then place it directly in its own section of one of your notebooks. Further, Google Notebook is private unless you make it public. Here I can put all of my thoughts on these papers, whenever I have a minute to do it. Each clipping can be labelled, and comments can be added. The plugin is nice, but I've already made it break on one of the PubMed result pages. I suppose I'm talented?

Perhaps I'll finally get both the time to read all those interesting-sounding papers as well as have a single location that I'll actually visit. Only time will tell.



In The News

More Neanderthal News: FOXP2 and language

I've got a lot of curiosity when it comes to history and the history of the human species, so stories that pop up about Neanderthals always arouse my interest. Previously I have posted about the great likelihood that humans and Neanderthals lived side by side for a couple thousand years in Europe, and just last week there was some new information about Neanderthals. As The Register reports, it turns out that their version of the FOXP2 (info, more info) gene matches the human version. The Reg had picked up on this information via an interview with Dr. Paabo in the New York Times on 18 October. Some take-home points:

  • This pushes the date back for the mutation of FOXP2 into its human form from up to 200,000 years ago (with 50,000 years ago being the best guess by many) to at least 350,000 years ago, or the time of the evolutionary split between humans and Neanderthals. This is Big News.
  • It could have been contamination with human DNA. Both articles go into this problem in some detail. There were a number of checks put into the work by Paabo et al, and hopefully these measures were enough to ensure that there was no contamination.

It could have also been – through some freak chance – convergent evolution, with the same mutation developing separately in the two species. However, even though it is my own idea, I don't think there's much chance of this being right 🙂 Of course FOXP2 cannot be the only contributory factor to our facility for speech. However, it is interesting to see that more information is coming out about it, and about our relationship to Neanderthals.
