Blog | BioTech Consulting

top of page

Search

PROSIT

PROSIT

PROSIT

Last year I dedicated three blogposts to mass spectrometry-based proteomics (MS-proteomics) data analysis: PEPTIDE SEARCH ENGINES THE PEPTIDE-SPECTRUM MATCH DATA LOSS AND WAYS TO CONTROL IT To continue with this subject, I now discuss how using deep learning algorithms to predict fragment (MS2) ion mass intensities increases the number of peptide sequences identified by a peptide search engine. From the handful of deep learning-based MS2 ion intensity predictors reported so f

Genaro Pimienta

Feb 177 min read

DEEPVARIANT

DEEPVARIANT

DEEPVARIANT

In this blogpost I explain the architecture of DeepVariant (Google Brain team). Published in 2018, DeepVariant is a neural network-based variant caller that discovers genomic variants with 99.9% accuracy. A universal SNP and small-indel variant caller using deep neural networks —2018 Unlike Bayesian-based variant callers (e.g., HaplotypeCaller), DeepVariant discovers SNPs and INDELs regardless of the sequencing technology used: Illumina short reads PacBio long reads Oxford

Genaro Pimienta

Feb 74 min read

TRANSFORMERS

TRANSFORMERS

TRANSFORMERS

The deep learning architecture named Transformer first appeared in the literature 2017. You can check out the publication by clicking on this hyperlink: Attention Is All You Need — 2017 Conceived to overcome the limitations of the natural language processing models of the day, the Transformer architecture went from making text translation and next-word prediction more accurate, to inspiring the development of chatGPT (OpenAI). (GPT stands for Generative Pre-trained Transfo

Genaro Pimienta

Jan 226 min read

ENCODER-DECODER NEURAL NETWORKS

ENCODER-DECODER NEURAL NETWORKS

ENCODER-DECODER NEURAL NETWORKS

In my previous blogpost, I explained what recurrent neural networks (RNNs) are. To read more about this topic, click on the following hyperlink: RECURRENT NEURAL NETWORKS RNNs are a specialized form of the classic feed forward network, which I explained in WHAT IS AN ARTIFICIAL NEURAL NETWORK? . Used to translate words or summarize text, RNNs ingest sequential inputs, such as the words in a sentence or paragraph ( Figure 1 ). Figure 1. RNNs are feedforward networks with a hi

Genaro Pimienta

Jan 45 min read

CONVOLUTIONAL NEURAL NETWORKS

CONVOLUTIONAL NEURAL NETWORKS

CONVOLUTIONAL NEURAL NETWORKS

In this blogpost I talk to you about convolutional neural networks ( CNNs ), a specialized deep learning architecture used in computer vision. When processing images, CNNs outperform feedforward networks (FNNs) because they overcome the curse of dimensionality . FNNs assign an artificial neuron to each pixel in an image, and, because FNNs are densely interconnected, the number of neurons required to process a large image increases disproportionally (multidimensionally). This

Genaro Pimienta

Dec 27, 20255 min read

THE SMALLPOX VACCINE

THE SMALLPOX VACCINE

THE SMALLPOX VACCINE

Humanity was for centuries under the spell of smallpox, a deadly disease caused by a virus named variola . Smallpox was eradicated in 1977, but for this to happen, vaccine technology had to be invented. And it was a worldwide vaccination campaign that drove smallpox to extinction . Nothing else would have eliminated smallpox. A human-specific Orthopoxvirus, variola was encoded by a DNA genome. To propagate continuously, variola established endemicity in highly populated hum

Genaro Pimienta

May 14, 202514 min read

COCOLIZTLI: THE LOST EPIDEMIC

COCOLIZTLI: THE LOST EPIDEMIC

COCOLIZTLI: THE LOST EPIDEMIC

THE GREAT SICKNESS Centuries ago, a deadly epidemic swept through Mesoamerica. Those who lived through it, called this malady cocoliztli , which means sickness or pestilence in the Nahuatl language. It was a hemorrhagic fever . One which in the spring of 1545, appeared out of nowhere in central Mexico's highlands. Cocoliztli spread with the wind, stopping only because the roads ended. Sonora to the north. Guatemala in the far south. Estimates from population censuses put th

Genaro Pimienta

Jan 7, 20256 min read

WHAT IS AN ARTIFICIAL NEURAL NETWORK?

WHAT IS AN ARTIFICIAL NEURAL NETWORK?

WHAT IS AN ARTIFICIAL NEURAL NETWORK?

ARTIFICIAL NEURAL NETWORKS Artificial neural networks , also known as deep learning networks, are a type of machine learning algorithm, which can extract exceedingly complex features from input data. Examples of input data features are the words in a sentence or the patterns in an image. In analogy to the huma n brain’s cognitive processes, deep learning networks acquire the ability to identify feature patterns ( learn ) by modulating the activation state of the artificial n

Genaro Pimienta

Jan 1, 20256 min read

RECURRENT NEURAL NETWORKS

RECURRENT NEURAL NETWORKS

RECURRENT NEURAL NETWORKS

I provide in this post a high-level introduction to artificial neural networks in the context of MS-proteomics.

Genaro Pimienta

Oct 14, 20244 min read

THE INTRODUCTION IN A SCIENTIFIC PAPER

THE INTRODUCTION IN A SCIENTIFIC PAPER

THE INTRODUCTION IN A SCIENTIFIC PAPER

The grammar and sentence construction rules in scientific writing are well-defined and accept little or no variation. It is therefore...

Genaro Pimienta

Sep 6, 20246 min read

THE TITLE AND ABSTRACT IN A SCIENTIFIC PAPER

THE TITLE AND ABSTRACT IN A SCIENTIFIC PAPER

THE TITLE AND ABSTRACT IN A SCIENTIFIC PAPER

This blogpost is about the first two components in a scientific research article ( paper ) — the title and abstract . Below I define...

Genaro Pimienta

Aug 27, 20246 min read

DATA LOSS AND WAYS TO CONTROL IT

DATA LOSS AND WAYS TO CONTROL IT

DATA LOSS AND WAYS TO CONTROL IT

This blogpost is a postscript to the previous one — “THE PEPTIDE-SPECTRUM MATCH” — , in which I wrote about the peptide-spectrum match...

Genaro Pimienta

Aug 1, 20248 min read

THE PEPTIDE-SPECTRUM MATCH

THE PEPTIDE-SPECTRUM MATCH

THE PEPTIDE-SPECTRUM MATCH

In my previous blogpost “PROTEOMICS SEARCH ENGINES” I wrote about peptide search engines in the context of the data-dependent acquisition (DDA) approach used in shotgun proteomics. In this blogpost I discuss the strategies used by peptide search engines to mitigate the false discovery rate when analyzing DDA data. For those not familiar with proteomics jargon, the abbreviations used throughout the text are: Data-dependent acquisition ( DDA ) False discovery rate ( FDR ) Fra

Genaro Pimienta

Jul 2, 20246 min read

PEPTIDE SEARCH ENGINES

PEPTIDE SEARCH ENGINES

PEPTIDE SEARCH ENGINES

In this blogpost I talk about peptide search engines—the algorithms used in MS-proteomics to assign peptide sequences to raw mass spectra. For those not familiar with proteomics jargon, the abbreviations used throughout the text are: Data-dependent acquisition ( DDA ) Data-independent acquisition ( DIA ) Mass spectrometry-based proteomics ( MS-proteomics ), AKA shotgun proteomics Peptide fragmentation spectra (MS2) Precursor ion spectra (MS1) Peptide-spectrum match ( PSM )

Genaro Pimienta

Jun 11, 20248 min read

THE PRE-PROTEOMICS ERA

THE PRE-PROTEOMICS ERA

THE PRE-PROTEOMICS ERA

But mass spectrometry-based proteomics (MS-proteomics), as we know it, was not born in 1995, which is when the term proteome first...

Genaro Pimienta

Jun 2, 20243 min read

bottom of page