The DeepCLIP tool takes a single or more RNA oligonucleotides (short RNA sequences) as input and predicts binding probability and calculates a binding profile. The main purpose of DeepCLIP is to identify binding sites of proteins in novel untested sequences using trained models that have extracted binding site information provided by CLIP data, to predict the effect of sequence variants on the binding, and to identify the importance of individual nucleotides for protein binding affinity.
deepclip.compbio.sdu.dk/
DeepCLIP: predicting the effect of mutations on protein–RNA binding with deep learning
We present DeepCLIP, a novel deep learning approach to modeling RNA-binding protein sites using a shallow neural network composed of CNN and LSTM layers to capture context-dependent binding. DeepCLIP generalizes well across a diverse set of sequences in both in vitro and in vivo settings, and produces a profile of the sequence, which indicates sequence elements important for the binding of the RNA binding protein in question.
The core of DeepCLIP is a convolutional BLSTM network implemented in Theano using the Lasagne library and a few customized network layers and functions.
academic.oup.com/nar/article…
Theano 1.0.5
pypi.org/project/Theano/#des…
Theano: a {Python} framework for fast computation of mathematical expressions
arxiv.org/abs/1605.02688
Theano was a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It is being continued as PyTensor. PyTensor is a fork of Aesara,
aesara.readthedocs.io/en/lat… which is a fork of Theano.
PyTensor
PyTensor is a Python library that allows one to define, optimize, and efficiently evaluate mathematical expressions involving multi-dimensional arrays. It provides the computational backend for PyMC.(
pymc.io/projects/docs/learn.…)
github.com/pymc-devs/pytenso…
Lasagne: First release
A lightweight neural network library built on top of Theano.
zenodo.org/records/27878