This document was last compiled on 2021-12-14 23:38:48.

We ran the same analysis pipeline, from preprocessing to differential gene expression (DE) analysis, on two versions of the data: one mapped to the BTx623 reference (called Year2) and one mapped to the genotype-specific reference, BTx642 or RTx430 (called BT642Year2). In this report, we analyze the overlap of DE genes identified between Year2 and BT642Year2, focusing on the results from the DE spline analysis.

In this file, we compare the raw fpkm counts of the leaf Postflowering samples between Year2 and BT642Year2.

Below is the analysis of overlap from the raw fpkm counts between Year2 and BT642Year2:

Load data

Load fpkm counts of all samples

Year2:

## Reading in metadata from file: results/Year2/data/rnaMetaData.txt
## Reading in data from file: results/Year2/data/fpkm_counts.txt
## Total number of all samples: 578 
## Total number of genes in all samples: 34211 
## Removing the 50  samples identified as not part of the main experiment, from the all samples
## After filtering all samples: 528 samples, 34211 genes

BT642Year2:

## Reading in metadata from file: results/BT642Year2/data/rnaMetaData.txt
## Reading in data from file: results/BT642Year2/data/fpkm_counts.txt
## Total number of all samples: 290 
## Total number of genes in all samples: 35115 
## Removing the 26  samples identified as not part of the main experiment, from the all samples
## After filtering all samples: 264 samples, 35115 genes

Filter samples not in leaf_postflowering

Year2:

## After filtering: 51 samples, 34211 genes

BT642Year2:

## After filtering: 26 samples 35115 genes
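The two filtering steps above (dropping samples outside the main experiment, then keeping only leaf Postflowering samples) can be sketched as follows. This is a minimal illustration, not the pipeline's code: the record fields (`id`, `tissue`, `condition`, `main_experiment`) and the sample values are hypothetical and are not taken from rnaMetaData.txt.

```python
# Hypothetical metadata records; the field names are assumptions and do not
# reflect the actual columns of rnaMetaData.txt.
samples = [
    {"id": "S1", "tissue": "Leaves", "condition": "Postflowering", "main_experiment": True},
    {"id": "S2", "tissue": "Roots", "condition": "Postflowering", "main_experiment": True},
    {"id": "S3", "tissue": "Leaves", "condition": "Preflowering", "main_experiment": True},
    {"id": "S4", "tissue": "Leaves", "condition": "Postflowering", "main_experiment": False},
]


def filter_samples(records, tissue, condition):
    """Keep main-experiment samples that match the given tissue and condition."""
    return [
        r
        for r in records
        if r["main_experiment"] and r["tissue"] == tissue and r["condition"] == condition
    ]


kept = filter_samples(samples, "Leaves", "Postflowering")
kept_ids = [r["id"] for r in kept]
print(f"After filtering: {len(kept_ids)} samples")
```

The same sample IDs would then be used to subset the columns of the fpkm table, so the gene count is unchanged by this step.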

Match BTx623 to OrthologGroup

To compare Year2 and BT642Year2, we need to match the gene naming. In Year2, genes are named with BTx623 gene names; in BT642Year2, genes are named by ortholog group. We therefore mapped the BTx623 gene names to the ortholog groups according to the annotation file.

Mapping BTx623 to OrthologGroup:

## Out of 34211 genes with BT623 names, 
##      28538 genes are matched to remap gene names.
## Since 0 out of 34056 remap gene names are mapped to multiple BT623 gene names, 
##   we collapsed on BT623 with average values of counts so that each remap gene has a unique row.
## After mapping the BTx623 gene names to the remap gene names,
##    Year2      leaf Postflowering : 51 samples, 34056 genes
##    BT642Year2 leaf Postflowering : 26 samples, 35115 genes
## 
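The mapping step can be illustrated with a small sketch: each BTx623 gene is looked up in the annotation, genes without an ortholog group are dropped, and when several BTx623 genes land in the same group their per-sample values are averaged so that each group has one row. The gene names, group names, and values below are hypothetical, and the sketch covers only the many-to-one case; it is not the code that produced the log above.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical annotation: BTx623 gene name -> ortholog group name.
annotation = {"geneA": "OG1", "geneB": "OG1", "geneC": "OG2"}

# Hypothetical fpkm table: BTx623 gene name -> values for two samples.
fpkm = {
    "geneA": [2.0, 4.0],
    "geneB": [4.0, 8.0],
    "geneC": [1.0, 1.0],
    "geneD": [5.0, 5.0],  # not in the annotation, so it is dropped
}


def map_and_average(fpkm_by_gene, gene_to_group):
    """Rename rows by ortholog group and average rows that share a group."""
    rows_by_group = defaultdict(list)
    for gene, values in fpkm_by_gene.items():
        group = gene_to_group.get(gene)
        if group is not None:
            rows_by_group[group].append(values)
    # Average sample by sample across all rows of a group.
    averaged = {g: [mean(col) for col in zip(*rows)] for g, rows in rows_by_group.items()}
    # Groups built from more than one BTx623 gene hold collapsed values.
    collapsed = {g for g, rows in rows_by_group.items() if len(rows) > 1}
    return averaged, collapsed


averaged, collapsed = map_and_average(fpkm, annotation)
print(averaged, collapsed)
```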

Compare gene expressions in Year2 and BT642Year2

In this section, we compare the log fpkm counts of genes in Year2 and BT642Year2. Since there are many samples for each gene, we compare three per-gene summaries: the 3rd max log fpkm count across samples, the average of the log fpkm counts in Day063, and the log fpkm count from a single randomly selected sample.
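As a rough illustration of these three per-gene summaries, the sketch below computes them for one gene. It assumes that "log fpkm" means log2(fpkm + 1) and that "3rd max" means the third-largest value across samples; both are assumptions, since the report does not state the transform or the definition. Sample names, days, and values are hypothetical.

```python
import math
from statistics import mean


def log_fpkm(x):
    """Assumed transform: log2(fpkm + 1), which maps an fpkm of 0 to 0."""
    return math.log2(x + 1.0)


def third_max(values):
    """Third-largest value; requires at least three values."""
    return sorted(values, reverse=True)[2]


# Hypothetical fpkm values of one gene, keyed by sample, and the sampling day
# of each sample.
gene_fpkm = {"S1": 0.0, "S2": 3.0, "S3": 7.0, "S4": 15.0}
sample_day = {"S1": "Day063", "S2": "Day063", "S3": "Day084", "S4": "Day084"}

logs = {sample: log_fpkm(value) for sample, value in gene_fpkm.items()}

third = third_max(list(logs.values()))
day063_mean = mean(v for s, v in logs.items() if sample_day[s] == "Day063")
single = logs["S3"]  # value from one chosen sample

print(third, day063_mean, single)
```

Each summary is computed separately for Year2 and BT642Year2, which gives one pair of values per gene to plot against each other.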

Some genes have counts that result from collapsing, and we compare Year2 and BT642Year2 for these genes separately. Some genes do not have a one-to-one RTx430-to-BTx642 relationship, so their counts were collapsed by summation when we merged the data from the two genotypes. They are flagged as Collapsed On Ortholog. Other genes do not have a one-to-one BTx623-to-OrthologGroup relationship, so their counts were collapsed by averaging when we mapped the BTx623 gene names to ortholog groups. They are flagged as Collapsed on BT623. We also provide an analysis that focuses on these genes with collapsed values.
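The summation case can be sketched in the same spirit. The example sums the per-sample counts of all genotype-specific genes that share an ortholog group and records a flag per group. The flag values `1-1` and `not1-1` mirror the `collapsed` column in the tables of this report, but reading that column as "one member gene versus several" is an assumption, and all names and values below are hypothetical.

```python
from collections import defaultdict

# Hypothetical mapping: genotype-specific gene -> ortholog group.
gene_to_group = {"rt_g1": "OG1", "rt_g2": "OG1", "rt_g3": "OG2"}

# Hypothetical counts: gene -> values for two samples.
counts = {"rt_g1": [1.0, 2.0], "rt_g2": [3.0, 4.0], "rt_g3": [5.0, 6.0]}


def collapse_by_sum(counts_by_gene, mapping):
    """Sum rows that share an ortholog group and flag groups with several members."""
    members = defaultdict(list)
    for gene, group in mapping.items():
        if gene in counts_by_gene:
            members[group].append(counts_by_gene[gene])
    # Sum sample by sample across all member genes of a group.
    summed = {g: [sum(col) for col in zip(*rows)] for g, rows in members.items()}
    flags = {g: "1-1" if len(rows) == 1 else "not1-1" for g, rows in members.items()}
    return summed, flags


summed, flags = collapse_by_sum(counts, gene_to_group)
print(summed, flags)
```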

3rd max log expression values

Compare the 3rd max log fpkm counts of genes in Year2 and BT642Year2

Scatter plots

Here are some genes that are expressed in only one of the two datasets.
10 genes that were expressed in BT642Year2 but not in Year2 (sorted by expression values in BT642Year2):

##                    GeneID collapsed value_remap value_623 value_diff
## 15990 SbiBTX642.04G233300    not1-1    8.784471         0   8.784471
## 15991 SbiBTX642.04G233300    not1-1    8.784471         0   8.784471
## 14751 SbiBTX642.04G091700       1-1    6.959654         0   6.959654
## 13918 SbiBTX642.04G003700       1-1    6.855367         0   6.855367
## 23732 SbiBTX642.07G071800       1-1    5.142822         0   5.142822
## 7769  SbiBTX642.02G264200       1-1    4.624686         0   4.624686
## 19957 SbiBTX642.05G227500    not1-1    4.405992         0   4.405992
## 19958 SbiBTX642.05G227500    not1-1    4.405992         0   4.405992
## 28764 SbiBTX642.09G172500       1-1    4.256256         0   4.256256
## 3210  SbiBTX642.01G354500       1-1    4.027685         0   4.027685

10 genes that were expressed in Year2 but not in BT642Year2 (sorted by expression values in Year2):

## 10 genes that were expressed in Year2 but not in BT642Year2 :
##                    GeneID collapsed value_remap value_623 value_diff
## 21025 SbiBTX642.06G103500       1-1           0 10.564254 -10.564254
## 13480 SbiBTX642.03G436300       1-1           0  5.788425  -5.788425
## 1943  SbiBTX642.01G204000       1-1           0  5.585864  -5.585864
## 8174  SbiBTX642.02G304600    not1-1           0  5.436628  -5.436628
## 11939 SbiBTX642.03G266300       1-1           0  5.061344  -5.061344
## 23056 SbiBTX642.07G007500       1-1           0  4.912171  -4.912171
## 32461 SbiBTX642.10G291100       1-1           0  4.637494  -4.637494
## 7179  SbiBTX642.02G199300       1-1           0  4.504620  -4.504620
## 10553 SbiBTX642.03G104200       1-1           0  4.452200  -4.452200
## 2383  SbiBTX642.01G252900       1-1           0  4.276497  -4.276497
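Lists like the two above can be produced by keeping genes whose summary value is positive in one dataset and exactly zero in the other, then sorting by the positive value. The sketch below shows this selection on hypothetical values; the dictionary names follow the `value_remap` and `value_623` column names, and everything else is an assumption.

```python
def expressed_only_in_first(first, second, n=10):
    """Return up to n (gene, value) pairs with first > 0 and second == 0, largest first."""
    hits = [
        (gene, value)
        for gene, value in first.items()
        if value > 0 and second.get(gene) == 0
    ]
    hits.sort(key=lambda item: item[1], reverse=True)
    return hits[:n]


# Hypothetical per-gene summary values (for example, the 3rd max log fpkm).
value_remap = {"g1": 8.5, "g2": 0.0, "g3": 4.0, "g4": 2.0}
value_623 = {"g1": 0.0, "g2": 5.0, "g3": 0.0, "g4": 2.5}

only_remap = expressed_only_in_first(value_remap, value_623)
only_623 = expressed_only_in_first(value_623, value_remap)
print(only_remap)
print(only_623)
```

With `value_diff` defined as `value_remap - value_623`, genes in the first list have a positive difference and genes in the second list have a negative one, as in the tables.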

Density Plots

Histogram of differences

Mean log expression values in Day063

Compare the average of log fpkm counts of genes in Day063 in Year2 and BT642Year2

Scatter plots

Here are some genes that are expressed in only one of the two datasets.
10 genes that were expressed in BT642Year2 but not in Year2 (sorted by expression values in BT642Year2):

##                    GeneID collapsed value_remap value_623 value_diff
## 15990 SbiBTX642.04G233300    not1-1    8.711649         0   8.711649
## 15991 SbiBTX642.04G233300    not1-1    8.711649         0   8.711649
## 13918 SbiBTX642.04G003700       1-1    6.468219         0   6.468219
## 14751 SbiBTX642.04G091700       1-1    6.298524         0   6.298524
## 19957 SbiBTX642.05G227500    not1-1    4.111258         0   4.111258
## 19958 SbiBTX642.05G227500    not1-1    4.111258         0   4.111258
## 7769  SbiBTX642.02G264200       1-1    4.100590         0   4.100590
## 23732 SbiBTX642.07G071800       1-1    3.730946         0   3.730946
## 7603  SbiBTX642.02G247100       1-1    3.624333         0   3.624333
## 16017 SbiBTX642.04G236000    not1-1    3.597104         0   3.597104

10 genes that were expressed in Year2 but not in BT642Year2 (sorted by expression values in Year2):

## 10 genes that were expressed in Year2 but not in BT642Year2 :
##                    GeneID collapsed value_remap value_623 value_diff
## 21025 SbiBTX642.06G103500       1-1           0  9.946977  -9.946977
## 1943  SbiBTX642.01G204000       1-1           0  5.214626  -5.214626
## 8174  SbiBTX642.02G304600    not1-1           0  5.043900  -5.043900
## 11939 SbiBTX642.03G266300       1-1           0  4.746360  -4.746360
## 13480 SbiBTX642.03G436300       1-1           0  4.577632  -4.577632
## 1909  SbiBTX642.01G199500    not1-1           0  3.788697  -3.788697
## 1910  SbiBTX642.01G199500    not1-1           0  3.788697  -3.788697
## 23056 SbiBTX642.07G007500       1-1           0  3.636933  -3.636933
## 10513 SbiBTX642.03G099500       1-1           0  3.633959  -3.633959
## 2383  SbiBTX642.01G252900       1-1           0  3.411900  -3.411900

Density Plots

Histogram of differences

Log expression values of a single sample

Compare the log fpkm counts of genes from a single sample in Year2 and BT642Year2

## The randomly selected sample: 0906179L15 from BT642:Day084:Postflowering:Leaves

Scatter plots

Here are some genes that are expressed in only one of the two datasets.
10 genes that were expressed in BT642Year2 but not in Year2 (sorted by expression values in BT642Year2):

##                    GeneID collapsed value_remap value_623 value_diff
## 14751 SbiBTX642.04G091700       1-1    6.412782         0   6.412782
## 15990 SbiBTX642.04G233300    not1-1    5.908813         0   5.908813
## 15991 SbiBTX642.04G233300    not1-1    5.908813         0   5.908813
## 13918 SbiBTX642.04G003700       1-1    5.839204         0   5.839204
## 23732 SbiBTX642.07G071800       1-1    4.445594         0   4.445594
## 28764 SbiBTX642.09G172500       1-1    4.440288         0   4.440288
## 2592  SbiBTX642.01G288400       1-1    4.334854         0   4.334854
## 19957 SbiBTX642.05G227500    not1-1    4.064366         0   4.064366
## 19958 SbiBTX642.05G227500    not1-1    4.064366         0   4.064366
## 7603  SbiBTX642.02G247100       1-1    3.598127         0   3.598127

10 genes that were expressed in Year2 but not in BT642Year2 (sorted by expression values in Year2):

## 10 genes that were expressed in Year2 but not in BT642Year2 :
##                    GeneID collapsed value_remap value_623 value_diff
## 21025 SbiBTX642.06G103500       1-1           0  9.665816  -9.665816
## 13480 SbiBTX642.03G436300       1-1           0  5.987548  -5.987548
## 5447  SbiBTX642.02G010300       1-1           0  5.896272  -5.896272
## 1943  SbiBTX642.01G204000       1-1           0  5.585864  -5.585864
## 8174  SbiBTX642.02G304600    not1-1           0  5.193378  -5.193378
## 11939 SbiBTX642.03G266300       1-1           0  4.629939  -4.629939
## 23056 SbiBTX642.07G007500       1-1           0  4.421560  -4.421560
## 14619 SbiBTX642.04G077000    not1-1           0  4.329124  -4.329124
## 1909  SbiBTX642.01G199500    not1-1           0  3.993368  -3.993368
## 1910  SbiBTX642.01G199500    not1-1           0  3.993368  -3.993368

Density Plots

Histogram of differences