Failed to read fasta index (error 2) #319

ramnageena11 · 2024-12-20T22:53:10Z

Please look into this error and suggest how to resolve it.

I have an indexed .bam (from dorado), which I then converted to FASTA (reads).
Reference: either the assembled genome from NCBI, or an assembly built from the same long reads that I am using for modification analysis.

modkit pileup -t 4 --cpg --ref /home/Documents/......../genome_assemblies/bc01/ /home/Documents/demux-pod5/EXP-NBD104_barcode01.bam bc01.bed

calculated chunk size: 6, interval size 100000, processing 600000 positions concurrently
filtering to only CpG motifs
Error! Failed to read fasta index from "/home/Documents/genome_assemblies/bc01/.fai"
caused by No such file or directory (os error 2)

ArtRand · 2024-12-21T00:05:47Z

Hello @ramnageena11,

Could you try passing a path to the FASTA-formatted file containing the reference sequence? Something like /home/Documents/genome_assemblies/bc01/ref.fa. There also needs to be an index alongside it, such as /home/Documents/genome_assemblies/bc01/ref.fa.fai.
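The error message in the first post suggests the index path is formed by appending .fai to whatever is given to --ref, which is why a directory path produced ".../bc01/.fai". A minimal sketch of a pre-flight check follows; check_ref is a hypothetical helper written for illustration and is not part of modkit or samtools.

```shell
# Hypothetical helper (not a modkit command): classify a --ref argument
# before running modkit pileup. Prints one word and returns non-zero on a problem.
check_ref() {
    ref="$1"
    if [ -d "$ref" ]; then
        # A directory was passed; --ref should point at the FASTA file inside it.
        echo "directory"
        return 1
    fi
    if [ ! -f "$ref" ]; then
        # The FASTA file itself does not exist at this path.
        echo "missing-fasta"
        return 1
    fi
    if [ ! -f "$ref.fai" ]; then
        # The FASTA exists but has no index; samtools faidx "$ref" creates one.
        echo "missing-index"
        return 1
    fi
    echo "ok"
}
```

For example, check_ref /home/Documents/genome_assemblies/bc01/ would print "directory", which corresponds to the situation in the original command.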

ramnageena11 · 2025-01-02T19:54:05Z

Hi ArtRand,
Thanks for the suggestion. Do you mean I need to create an index file for the reference genome (NCBI), or for the assembled genome (built from the same reads used for the epigenomic analysis)?

Thanks
Ram

ArtRand · 2025-01-03T20:32:04Z

Hello @ramnageena11,

You need to create an index for the reference FASTA you aligned the reads to. So if this is the assembly, you should use the same reference sequence.
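If it is unclear which FASTA a BAM was aligned to, one option is to compare the contig names in the BAM header against the names in the FASTA index. The sketch below assumes the header has been saved to a text file (for example with samtools view -H aligned.bam > header.txt) and that the .fai was written by samtools faidx; missing_contigs and the file names are hypothetical.

```shell
# Hypothetical helper: print contigs named in the BAM header (@SQ lines)
# that do not appear in the FASTA index.
#   $1: text file holding the BAM header
#   $2: FASTA index (.fai); column 1 is the contig name
missing_contigs() {
    awk -F'\t' '
        # First file given to awk is the .fai: remember every contig name.
        FILENAME == ARGV[1] { in_ref[$1] = 1; next }
        # Second file is the header: inspect the SN: field of each @SQ line.
        $1 == "@SQ" {
            for (i = 2; i <= NF; i++) {
                if ($i ~ /^SN:/) {
                    name = substr($i, 4)
                    if (!(name in in_ref)) print name
                }
            }
        }
    ' "$2" "$1"
}
```

Empty output means every contig in the header is present in the index. Any printed names suggest the BAM was aligned to a different reference than the one being passed to --ref.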

ramnageena11 · 2025-01-03T22:03:25Z

Hi ArtRand,
I indexed ref.fa to produce ref.fa.fai, but the pileup command then failed with another error (101), saying no such file exists.
Please see the errors:
modkit pileup /home/dnasequencer/Documents/payal_epi/dorado_aling_bam/aligned_bc01.bam /home/dnasequencer/Documents/payal_epi/demux-pod5/pileup/pileup_01.bed --ref /home/dnasequencer/Documents/payal_epi/OA-G20_genome/ref.fa --preset traditional

Error! unable to open SAM/BAM/CRAM index for /home/dnasequencer/Documents/payal_epi/dorado_aling_bam/aligned_bc01.bam; please create an index

modkit pileup /home/dnasequencer/Documents/payal_epi/dorado_aling_bam/aligned_bc01.bam /home/dnasequencer/Documents/payal_epi/demux-pod5/pileup/pileup_01.bed --ref /home/dnasequencer/Documents/payal_epi/OA-G20_genome/ref.fa.fai --preset traditional

Error! unable to open SAM/BAM/CRAM index for /home/dnasequencer/Documents/payal_epi/dorado_aling_bam/aligned_bc01.bam; please create an index

Let me describe what I have done:
Experiment design: 10 barcode files (5 samples with 2 replicates).

QC using dorado with SUP (nanopore).
Demuxed the reads by barcode (.fastq), and also produced demuxed .bam files.
Assembled all the sequenced reads (all barcodes) into a separate assembly (.fa).
Downloaded the genome reference from NCBI. Should this be the reference, or should the assembly be the reference?

When I ran the pileup command, I got an error.

Now I have used dorado aligner to align the reads (demuxed .fastq) to the reference (.fa) and created a .bam. It is again giving an error:
modkit pileup /home/dnasequencer/Documents/payal_epi/dorado_aling_bam/aligned_bc01.bam /home/dnasequencer/Documents/payal_epi/demux-pod5/pileup/pileup_01.bed --ref /home/dnasequencer/Documents/payal_epi/OA-G20_genome/ref.fa.fai --preset traditional

Error! unable to open SAM/BAM/CRAM index for /home/dnasequencer/Documents/payal_epi/dorado_aling_bam/aligned_bc01.bam; please create an index

What index is it asking me to create?

Can you please tell me all the steps for modkit (consider me a beginner), starting from raw sequence data through to visualization of results?
I would be highly grateful.

Thanks
rgds
Ram

ArtRand · 2025-01-03T22:26:31Z

Hello @ramnageena11,

A few things to check.

1. To run pileup you need two indices: one for the aligned, sorted modBAM file (usually .bai) and one for the FASTA reference. You create the former with samtools index ${bam} and the latter with samtools faidx ${ref}.
2. (optional) Make sure that you haven't lost the modified base information in your reads by running modkit summary ${modbam} --threads ${threads}. If you used dorado aligner to align your sequencing reads you should be fine.
3. Run pileup using the commands you have posted, passing the FASTA file itself (ref.fa) to --ref rather than the .fai index.
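The steps above can be sketched as a small shell function. This is an illustration under assumptions, not an official workflow: pileup_workflow, run, and DRY_RUN are hypothetical names, the file names are placeholders, and the --preset traditional flag is taken from the commands posted earlier in the thread. By default it only prints the commands; set DRY_RUN=0 to execute them.

```shell
# Dry-run by default so the commands can be reviewed before anything is executed.
DRY_RUN="${DRY_RUN:-1}"

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Usage: pileup_workflow <aligned_sorted.bam> <ref.fa> <out.bed> [threads]
pileup_workflow() {
    bam="$1"
    ref="$2"
    out="$3"
    threads="${4:-4}"

    # Step 1a: BAM index (writes ${bam}.bai); needs a coordinate-sorted BAM.
    run samtools index "$bam"
    # Step 1b: FASTA index (writes ${ref}.fai next to the FASTA).
    run samtools faidx "$ref"
    # Step 2 (optional): check that modified-base information is still present.
    run modkit summary "$bam" --threads "$threads"
    # Step 3: pileup; --ref takes the FASTA itself, not the .fai file.
    run modkit pileup "$bam" "$out" --ref "$ref" --preset traditional --threads "$threads"
}
```

Calling pileup_workflow aligned_bc01.bam ref.fa pileup_01.bed 4 prints the four commands in order. If samtools index reports that the BAM is not sorted, sorting it first with samtools sort should resolve that.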

Downloaded the genome reference from NCBI. Should this be the reference, or should the assembly be the reference?

Use the reference sequence you aligned the reads to. It sounds like this is either the assembly or the NCBI reference.

ArtRand added the troubleshooting workflow and data preparation questions label Dec 21, 2024
