Introduction:

This tool inputs a peak table file and performs correlation analysis on it.

Input files:

Peak table file in Tab-delimited txt format, first column is the compound identifier, the others are samples.

For example:

HU_011

HU_014

HU_015

HU_017

HU_018

HU_019

(2-methoxyethoxy)propanoic acid isomer

3.019766

3.814339

3.519691

2.562183

3.781922

4.161074

(gamma)Glu-Leu/Ile

3.888479

4.277149

4.195649

4.32376

4.629329

4.412266

1-Methyluric acid

3.869006

3.837704

4.102254

4.53852

4.178829

4.516805

1-Methylxanthine

3.717259

3.776851

4.291665

4.432216

4.11736

4.562052

1,3-Dimethyluric acid

3.535461

3.932581

3.955376

4.228491

4.005545

4.320582

1,7-Dimethyluric acid

3.325199

4.025125

3.972904

4.109927

4.024092

4.326856

2-acetamido-4-methylphenyl acetate

4.204754

5.181858

3.88568

4.237915

1.852994

4.080681

2-Aminoadipic acid

4.080204

4.359246

4.249111

4.231404

4.323679

4.244485

Parameter:

Correlation methods:

1.     Kendall rank correlation:

The Kendall rank correlation coefficient, commonly referred to as Kendall's tau coefficient (after the Greek letter τ), is a statistic used to measure the ordinal association between two measured quantities. A tau test is a non-parametric hypothesis test for statistical dependence based on the tau coefficient.

2.     Pearson correlation:

the Pearson correlation coefficient, also referred to as Pearson's r is a measure of the linear correlation between two variables X and Y. It has a value between +1 and −1, where 1 is total positive linear correlation, 0 is no linear correlation, and −1 is total negative linear correlation.

3.     Spearman correlation:

Spearman's rank correlation coefficient or Spearman's rho is a nonparametric measure of rank correlation. It assesses how well the relationship between two variables can be described using a monotonic function.

 

Cluster methods:

1.     ward: Ward's minimum variance method aims at finding compact, spherical clusters.

2.     complete: The complete linkage method finds similar clusters.

3.     single: The single linkage method (which is closely related to the minimal spanning tree) adopts a ‘friends of friends’ clustering strategy.

The other methods can be regarded as aiming for clusters with characteristics somewhere between the single and complete link methods.

4.     centroid: Method "centroid" is typically meant to be used with squared Euclidean distances.

5.     average: The average distance method measures the average distance between each pair of observations

6.     mcquitty: It finds similar cluster.

7.     median: Median distance method.

Output files:

1.     ' sample_cor_plot.pdf', correlation heatmap plot result.

 

2.     ' sample_cor.txt', correlation matrix result.