This document has last been compiled on 2021-12-14 22:00:22.

We ran the same analysis pipeline from preprocessing to differential gene expression (DE) analysis on data when it was mapped to BTx623 (we will call it Year1) and on data when it was mapped to specific genotype, BTx642 or RTx430, (we will call it BT642Year1). In this report, we will analyze the overlap of DE genes identified between Year1 and BT642Year1, focusing on the results from DE spline analysis.

In this file, we compare the raw fpkm counts of samples from root and Preflowering between Year1 and RemapYear1.

Below is the analysis of overlap from the raw fpkm counts between Year1 and BT642Year1:

Load data

Load fpkm counts of all samples

Year1:

## Reading in metadata from file: results/Year1/data/rnaMetaData.txt
## Reading in data from file: results/Year1/data/fpkm_counts.txt
## Total number of all samples: 396 
## Total number of genes in all samples: 34211 
## There are no samples to remove beyond those used in the main experiment
## After filtering all samples: 396 samples, 34211 genes

BT642Year1:

## Reading in metadata from file: results/BT642Year1/data/rnaMetaData.txt
## Reading in data from file: results/BT642Year1/data/fpkm_counts.txt
## Total number of all samples: 199 
## Total number of genes in all samples: 35115 
## There are no samples to remove beyond those used in the main experiment
## After filtering all samples: 199 samples, 35115 genes

Filter samples not in root_preflowering

Year1:

## After filtering: 75 samples, 34211 genes

BT642Year1:

## After filtering: 37 samples 35115 genes

Match BTx623 to OrthologGroup

To compare between Year1 and BT642Year1, we need to match the gene naming. In Year1, genes are named with BT623; and in BT642Year1, genes are names with Ortholog Group. So we implemented mapping the BTx623 gene names to the ortholog groups according to the annotation file.

Mapping BTx623 to OrthologGroup:

## Out of 34211 genes with BT623 names, 
##      28538 genes are matched to remap gene names.
## Since 0 out of 34056 remap gene names are mapped to multiple BT623 gene names, 
##   we collapsed on BT623 with average values of counts so that each remap gene has a unique row.
## After mapping the BTx623 gene names to the remap gene names,
##    Year1      root Preflowering : 75 samples, 34056 genes
##    BT642Year1 root Preflowering : 37 samples, 35115 genes
## 

Compare gene expressions in Year1 and BT642Year1

In this section, we will compare the log fpkm counts of genes in Year1 and BT642Year1. Particularly, since there are many samples for each gene, we will use the 3rd max log fpkm counts, the average of log fpkm counts in Day063, and the log log fpkm count from a particular sample of each gene to compare.

There are genes that their counts are results from collapsing and we provide comparison of Year1 and BT642Year1 of these genes. Some genes do not have one-to-one relationship of RT430-to-BT642, so their counts were collapsed by summation when we were merging data from two genotypes. They are flaged as Collasped On Ortholog. Some genes do not have one-to-one relationship of BT623-to-OrthologGroup. so their counts were collapsed by averaging when we were mapping BT623 gene names to ortholog groups. They are flaged as Collapsed on BT623. We also provide analysis that focusing on these genes with collapsed values.

3rd max log expression values

Compare the 3rd max log fpkm counts of genes in Year1 and BT642Year1

Scatter plots

Here are some genes that are only expressed in one year.
10 genes that were expressed in BT642Year1 but not in Year1 (sort by expression vales in BT642Year1):

##                    GeneID collapsed value_remap value_623 value_diff
## 14751 SbiBTX642.04G091700       1-1    6.996389         0   6.996389
## 13918 SbiBTX642.04G003700       1-1    6.967053         0   6.967053
## 15990 SbiBTX642.04G233300    not1-1    6.674687         0   6.674687
## 15991 SbiBTX642.04G233300    not1-1    6.674687         0   6.674687
## 28764 SbiBTX642.09G172500       1-1    6.585563         0   6.585563
## 18732 SbiBTX642.05G111600       1-1    4.801676         0   4.801676
## 16742 SbiBTX642.04G316200       1-1    4.611763         0   4.611763
## 7769  SbiBTX642.02G264200       1-1    4.569856         0   4.569856
## 15657 SbiBTX642.04G197700       1-1    4.448240         0   4.448240
## 31950 SbiBTX642.10G242600       1-1    3.744161         0   3.744161

10 genes that were expressed in Year1 but not in BT642Year1 (sort by expression vales in Year1):

## 10 genes that were expressed in BT642Year1 but not in Year1 :
##                    GeneID collapsed value_remap value_623 value_diff
## 13480 SbiBTX642.03G436300       1-1           0  7.011898  -7.011898
## 8174  SbiBTX642.02G304600    not1-1           0  6.761684  -6.761684
## 21025 SbiBTX642.06G103500       1-1           0  6.314515  -6.314515
## 1943  SbiBTX642.01G204000       1-1           0  5.954894  -5.954894
## 25363 SbiBTX642.08G007300    not1-1           0  5.499208  -5.499208
## 5447  SbiBTX642.02G010300       1-1           0  5.448571  -5.448571
## 11939 SbiBTX642.03G266300       1-1           0  5.297925  -5.297925
## 10553 SbiBTX642.03G104200       1-1           0  4.221104  -4.221104
## 30135 SbiBTX642.10G041000       1-1           0  4.004501  -4.004501
## 26561 SbiBTX642.08G143100       1-1           0  3.821710  -3.821710

Density Plots

Histogram of differences

Mean log expression values in Day63

Compare the average of log fpkm counts of genes in DAY63 in Year1 and BT642Year1

Scatter plots

Here are some genes that are only expressed in one year.
10 genes that were expressed in BT642Year1 but not in Year1 (sort by expression vales in BT642Year1):

##                    GeneID collapsed value_remap value_623 value_diff
## 13918 SbiBTX642.04G003700       1-1    6.672140         0   6.672140
## 14751 SbiBTX642.04G091700       1-1    6.411169         0   6.411169
## 28764 SbiBTX642.09G172500       1-1    6.087188         0   6.087188
## 18732 SbiBTX642.05G111600       1-1    4.170646         0   4.170646
## 15657 SbiBTX642.04G197700       1-1    4.115077         0   4.115077
## 16742 SbiBTX642.04G316200       1-1    4.062025         0   4.062025
## 7769  SbiBTX642.02G264200       1-1    3.896621         0   3.896621
## 15990 SbiBTX642.04G233300    not1-1    3.093783         0   3.093783
## 15991 SbiBTX642.04G233300    not1-1    3.093783         0   3.093783
## 860   SbiBTX642.01G090700       1-1    2.718687         0   2.718687

10 genes that were expressed in Year1 but not in BT642Year1 (sort by expression vales in Year1):

## 10 genes that were expressed in BT642Year1 but not in Year1 :
##                    GeneID collapsed value_remap value_623 value_diff
## 8174  SbiBTX642.02G304600    not1-1           0  5.791184  -5.791184
## 13480 SbiBTX642.03G436300       1-1           0  5.529867  -5.529867
## 11939 SbiBTX642.03G266300       1-1           0  5.002889  -5.002889
## 21025 SbiBTX642.06G103500       1-1           0  4.995369  -4.995369
## 5447  SbiBTX642.02G010300       1-1           0  4.979439  -4.979439
## 1943  SbiBTX642.01G204000       1-1           0  4.927686  -4.927686
## 25363 SbiBTX642.08G007300    not1-1           0  3.856403  -3.856403
## 18299 SbiBTX642.05G076400    not1-1           0  3.612673  -3.612673
## 18300 SbiBTX642.05G076400    not1-1           0  3.612673  -3.612673
## 22121 SbiBTX642.06G215400       1-1           0  3.402607  -3.402607

Density Plots

Histogram of differences

Log expression values of a single sample

Compare the log fpkm counts of genes from a single sample in Year1 and BT642Year1

## The randomly selected sample: 0824164R06 from BT642:Day084:Preflowering:Roots

Scatter plots

Here are some genes that are only expressed in one year.
10 genes that were expressed in BT642Year1 but not in Year1 (sort by expression vales in BT642Year1):

##                    GeneID collapsed value_remap value_623 value_diff
## 14751 SbiBTX642.04G091700       1-1    6.993788         0   6.993788
## 13918 SbiBTX642.04G003700       1-1    6.988912         0   6.988912
## 28764 SbiBTX642.09G172500       1-1    5.784242         0   5.784242
## 15990 SbiBTX642.04G233300    not1-1    5.484783         0   5.484783
## 15991 SbiBTX642.04G233300    not1-1    5.484783         0   5.484783
## 18732 SbiBTX642.05G111600       1-1    4.800641         0   4.800641
## 7769  SbiBTX642.02G264200       1-1    4.253989         0   4.253989
## 15657 SbiBTX642.04G197700       1-1    3.998196         0   3.998196
## 16742 SbiBTX642.04G316200       1-1    3.571677         0   3.571677
## 19408 SbiBTX642.05G181900    not1-1    3.241840         0   3.241840

10 genes that were expressed in Year1 but not in BT642Year1 (sort by expression vales in Year1):

## 10 genes that were expressed in BT642Year1 but not in Year1 :
##                    GeneID collapsed value_remap value_623 value_diff
## 13480 SbiBTX642.03G436300       1-1           0  5.526695  -5.526695
## 21025 SbiBTX642.06G103500       1-1           0  5.237258  -5.237258
## 5447  SbiBTX642.02G010300       1-1           0  5.142822  -5.142822
## 8174  SbiBTX642.02G304600    not1-1           0  5.080231  -5.080231
## 11939 SbiBTX642.03G266300       1-1           0  5.014802  -5.014802
## 1943  SbiBTX642.01G204000       1-1           0  4.456149  -4.456149
## 10405 SbiBTX642.03G088000    not1-1           0  3.891653  -3.891653
## 10406 SbiBTX642.03G088000    not1-1           0  3.891653  -3.891653
## 10407 SbiBTX642.03G088000    not1-1           0  3.891653  -3.891653
## 10408 SbiBTX642.03G088000    not1-1           0  3.891653  -3.891653

Density Plots

Histogram of differences