read duplication percent
I would like to know if someone could help me to understand a bit more the read duplication. It seems that the percentage of read duplication is pretty high in my dataset and I was wondering why that could be? Does anyone has an answer or maybe a link to a post that could help me here?
Thanks a lot,
- 46.06 % ==> fraction of reads that are duplicates
- 34.12 % ==> nonduplicate and phased reads; ideal 45-50