Unclear output from ESPRESSO_Q summary.txt #78

@pclavell

Hello,
I would like clarification on what exactly is shown in each row of the summary.txt produced by ESPRESSO_Q.
For example, in this output:

number of FSM splice junction chains: 25922
number of ISM splice junction chains: 49296
number of NIC splice junction chains: 14954
number of validated NIC chains: 3894
number of NNC splice junction chains: 13508
number of validated NNC chains: 5656
number of splice junction chains with a failed junction: 37451
total FSM abundance: 7204779.9
total novel ISM abundance: 22660.36
total NIC abundance: 65848.54
total NNC abundance: 110658.77
total single exon abundance: 145342.99
number of detected FSM isoforms: 26322
number of detected novel ISM isoforms: 1417
number of detected NIC isoforms: 3728
number of detected NNC isoforms: 5413
number of detected single exon isoforms: 2271
number of internal exon boundary check failures: 85497
number of terminal exon boundary check failures: 2208080
  1. What is the difference between splice junction (SJ) chains and validated SJ chains?
  2. What is the difference between a validated chain and a detected isoform? Is it only about different TSS and TTS for the same splice junction chain?
  3. How are the structural categories (FSM, ISM, NIC, NNC) obtained? Is SQANTI run in the background, with only these 4 categories then displayed?
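
To make questions 1 and 2 concrete, the rows can be put side by side with a small script. This is only a sketch: it assumes every row of summary.txt has the form `label: value`, and `parse_summary` is a hypothetical helper written for this issue, not part of ESPRESSO. The numbers are copied from the output above.

```python
def parse_summary(text):
    """Parse 'label: value' rows into a dict mapping label -> float.

    Assumption: each row of summary.txt is a single 'label: value' pair.
    Rows that do not match this shape are skipped.
    """
    rows = {}
    for line in text.splitlines():
        # Split on the last colon so any colon inside a label is preserved.
        label, sep, value = line.rpartition(":")
        if not sep:
            continue
        try:
            rows[label.strip()] = float(value)
        except ValueError:
            continue
    return rows


# Rows copied from the summary.txt shown above.
example = """\
number of NIC splice junction chains: 14954
number of validated NIC chains: 3894
number of detected NIC isoforms: 3728
number of NNC splice junction chains: 13508
number of validated NNC chains: 5656
number of detected NNC isoforms: 5413
"""

rows = parse_summary(example)
for cat in ("NIC", "NNC"):
    chains = rows[f"number of {cat} splice junction chains"]
    validated = rows[f"number of validated {cat} chains"]
    detected = rows[f"number of detected {cat} isoforms"]
    print(
        f"{cat}: {validated / chains:.1%} of chains validated; "
        f"{detected:.0f} detected isoforms vs {validated:.0f} validated chains"
    )
```

For both NIC and NNC, fewer chains are validated than reported, and fewer isoforms are detected than chains validated, which is what the first two questions are about.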

Thanks a lot
