Commit

limiting to contigs over 1000bp
linsalrob committed Nov 12, 2024
1 parent db15af9 commit cc009f6
Showing 1 changed file with 10 additions and 0 deletions.
10 changes: 10 additions & 0 deletions Workshops/COMBINE_WA_2024.md
@@ -642,6 +642,16 @@ And now we plot _all_ the raw data.
sns.lineplot(data=melted_df, x='Sample', y='Depth', hue='contig', legend=False)
```

## Filter for longer contigs

We filter our data frame to keep only contigs longer than 1000 bp. You can change this limit by editing `minlength`.

```
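# Minimum contig length in bp; only contigs strictly longer than this are kept
minlength = 1000
# IDs of the contigs that pass the length filter (seqlengths is indexed by contig ID)
longcontigs = seqlengths[seqlengths['length'] > minlength].index
dfs = dfs[dfs.index.isin(longcontigs)]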
```
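
To see what the filter does, here is a minimal, self-contained sketch on toy data. The contig names, lengths, and depths are invented for illustration, and the sketch assumes that `seqlengths` and `dfs` are both indexed by contig ID, as the snippet above implies.

```python
import pandas as pd

# Hypothetical contig lengths, indexed by contig ID (values invented for illustration)
seqlengths = pd.DataFrame(
    {'length': [500, 1000, 1500, 4000]},
    index=['contig_1', 'contig_2', 'contig_3', 'contig_4'],
)

# Hypothetical depth table: one row per contig, one column per sample
dfs = pd.DataFrame(
    {'sample_A': [1.0, 2.0, 3.0, 4.0], 'sample_B': [2.0, 3.0, 4.0, 5.0]},
    index=['contig_1', 'contig_2', 'contig_3', 'contig_4'],
)

minlength = 1000
# Strictly greater than: a contig of exactly 1000 bp is dropped
longcontigs = seqlengths[seqlengths['length'] > minlength].index
filtered = dfs[dfs.index.isin(longcontigs)]

print(filtered.index.tolist())  # ['contig_3', 'contig_4']
```

Because the comparison is `>`, a contig of exactly `minlength` bp is excluded; use `>=` if you want to keep it.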

## Calculate the correlations

Now that we have the contigs and their average depth across the samples, we calculate a pairwise correlation between all contigs and all other contigs.
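
As a rough illustration of a pairwise correlation between contigs, the sketch below uses pandas' `DataFrame.corr` with Pearson correlation on a toy depth table. The data and the name `depths` are hypothetical, and the choice of Pearson is an assumption; this excerpt does not show which method the workshop uses.

```python
import pandas as pd

# Hypothetical depth table: one row per contig, one column per sample
depths = pd.DataFrame(
    {'s1': [1.0, 2.0, 3.0], 's2': [2.0, 4.0, 1.0], 's3': [3.0, 6.0, 2.0]},
    index=['contig_a', 'contig_b', 'contig_c'],
)

# DataFrame.corr correlates columns, so transpose to put contigs in the columns.
# The result is a square contig-by-contig matrix.
corr = depths.T.corr(method='pearson')

print(corr.round(2))
```

Here `contig_a` and `contig_b` have proportional depth profiles, so their correlation is 1.0, whereas `contig_a` and `contig_c` are negatively correlated.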