I’m very proud to announce that I’ve been elected one of 19 Fellows of the Software Sustainability Institute (SSI). The SSI is a virtual institute funded by the EPSRC with the mission of supporting scientific software. Fellows from all over the UK, both from academia and industry, advocate best practices in software development, open source, open data and education.
I’m aiming to bring my experience from working with data integration methods (e.g. ontologies) and machine learning that we have successfully used in the context of biological research to the people who work on developing the Internet of Things. I recently attended the Thingmonk conference and already presented a talk “What the IoT should learn from the life sciences” with my SSI hat on.
I’m following big footsteps of two former Cambridge SSI Fellows, Laurent Gatto and Stephen Eglen.
Our manuscript “Estimating binding properties of transcription factors from genome-wide binding profiles” has now been accepted in Nucleic Acids Research. For more scientific background, see the blog post about the pre-published in arXiv version.
It was a long, painful uphill battle to get this piece published. But it was also one of the (unfortunately rare!) examples how a manuscript can really be improved through various iterations of peer review. We like to thank the anonymous reviewers for their continuous support throughout the entire process, for spotting a few conceptual problems and mis-calculations (that eventually even uncovered a problematic piece of code). They were very critical, but always fair!
Our friend and colleague Edward Kinrade suddenly died last month. He had been instrumental for the growth of the Adryan group over six years, taking care of all lab management duties and the training of students at the wet bench.
I wrote a small series of blog posts how we dealt with the situation: 1/ How to communicate bad news to the group. 2/ Disaster management in the lab. 3/ When cloud services bite you in the arse. Maybe someone will find them useful one day.
After about four years in the making, we’ve finally published our paper “A combination of computational and experimental approaches identifies DNA sequence constraints associated with target site binding specificity of the transcription factor CSL” (the title, I know!!!). Please find the PDF here.
This has been an exciting journey. About four years ago I stared at a sequence motif that I got out of motif inference from a CSL/Su(H) ChIP dataset I analysed in collaboration with Sarah Bray’s group. There were a few good reasons to believe it was legit, but Sarah wasn’t convinced at all — not in the light of a crystal structure that Rhett Kovall had published in 2004. The paper discussed why the canonical motif was such a good fit for CSL, but it didn’t offer any entry point for why other sequences might be bound as well. A couple of hours with my favourite crystallographer Ilka Müller convinced me that CSL might prefer it’s canonical motif CGTGGGAA, but she also suggested to me that the space of possible sequences that are “tolerated” by the protein might be significantly larger. I presented these thoughts at a conference that was also attended by Bobby Glen, and we soon decided to team up to tackle the question more systematically using the tools of computational chemistry. Bobby and I proposed this to Apple for an ARTS Award, who presented us with some $30k of hardware, and the rest is history…
In the meantime, Sarah Bray took over the experimental validation of our computational results and because in vivo behaviour often doesn’t tell you much about what the protein really does, the very Rhett Kovall joined forces and provided additional biophysical measurements about CSL/Su(H) binding to DNA.
A truly international and interdisciplinary team. I’m proud of having been a member of it. And well done, Eddie and Rob from my CSBC team.
The Game-of-Life is an example of cellular automata, a conceptual framework that is inspired by the interaction of living cells in a confined space over time. Computer science students learn about Conway’s classic game because it teaches them about Turing completeness, biology students may learn about it as an introduction into discrete stochastic simulations.
We’ve previously used a Game-of-Life implementation on KL25Z microcontrollers (another donation from the ARM University Programme) to provide a more haptic way for students to explore the properties of the game world. Each controller represents a cell whose survival is dependent on the neighbouring devices. However, for technical reasons the implementation relies on a serial ring bus to enable communication between the boards, which is very distinct from biological systems where all cells are in immediate contact with their neighbours.
We discussed this with the ARM University Programme, who suggested the use of wireless radio technology to communicate between our “cells”. Bluetooth Low Energy (BLE) is such an approach, and the Nordic nRF51822 mbed Kit is a low-power Cortex M0-controller with BLE on board. After a few emails with ARM, we’re now in a position to explore the properties of concurrent dynamic systems and compare them to the behaviour of living cells: A great learning experience for technical people and life scientists alike!
Better late than never, one might think. Once upon a time when I spent time in Tien Hsu’s lab, back in 2001 at the Medical University of South Carolina, I conducted a 2-hybrid experiment with the fly von Hippel-Lindau (VHL) tumor suppressor. Amongst a few other candidates, we found the nm23 tumour suppressor as interaction partner for VHL. We published a very nice paper on nm23 function in tracheal development, but never quite revealed where we had found the inspiration to chose exactly that gene to work on. Now in this semi-review article we’re finally coming clean and show about 13 years later that VHL and nm23 do stuff together. Unfortunately, pay per view in Naunyn Schmiedeberg’s Archives of Pharmacology: Interaction between Nm23 and the tumor suppressor VHL.
We are very happy to announce the submission of another article to arXiv prior to publication in a peer-reviewed open access journal:
arXiv:1404.5544: Estimating transcription factor abundance and specificity from genome-wide binding profiles
This is a rather curious story, as we had actually been working on something else! If you’re interested in determining the number of bound TF molecules or how to translate PWM scores into actual binding profiles, please have a look, send us comments and suggestions, tweet it, write about it on your blog (but let us know so we can engage in a fruitful scientific discussion!).
In brief, being challenged by the sometimes lengthy compute times of our GRiP framework, Radu Zabet derived an analytical solution for the prediction of genomic occupancy on the basis of TF binding site preference (in the form of a PWM), DNA accessibility (from genome-wide foot printing data), TF copy number and a scaling factor that translates the PWM score into a specificity. However, when benchmarking the method to actual ChIP-seq data, we realised that the same analytical model also allows the inference of TF copy number and specificity if the other two genome-wide measurements (occupancy and DNA accessibility) are given.
This is now an alternative entry point for studies into TF copy numbers, something that our group are also interested in addressing experimentally using quantitative mass-spec.
It’s not always big news that excites us. In their recent newsletter, the Prime-XS Consortium features an article Pushing the limits of biology. It’s an introduction into the cutting-edge proteomics that the Functional Genomics Centre Zurich does for their Prime-XS partners. Under the heading Next-generation proteomics they report about their work with us on the quantification of Drosophila transcription factors. While they obviously deserve all the credit for the proteomics, we are proud of having asked the question for the next generation of MS!
A lot of the freedom that I have in my work (pursuing research in fly developmental biology, computational studies on bacterial gene regulation, and even my application of microelectronics for teaching purposes) comes from being a Royal Society University Research Fellow (URF). These fellowships give scientists between five and, under exceptional circumstances, ten years to build up a research programme, with a lot of flexibility when it comes to the nature of the work. In fact, URFs can even retrieve voluntary training in media communication, innovation and business, or policy making.
Two years ago the Royal Society has started a now biannual gathering of URFs. This year I was co-organiser of the event that saw eight keynote speakers from a wide range of topics from earth sciences to black holes to genome regulation. The echo of the conference is still visible under #urfcon14 on Twitter. Go check it out and contact any of the speakers if you find their work interesting!
We’ve had a successful 2013 and that apparently hasn’t gone unnoticed. This December alone, we’ve received more applications for summer projects than in all of lent. This is particularly true for the Amgen Scholarship, for which we have chosen a candidate, and Erasmus internships. Therefore, please, do not send any more applications until further notice. You will be ignored! For doctoral and postdoctoral scientists, please see our recommendations how to apply.