; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001578 (gene) of Snake gourd v1 genome

Gene IDTan0001578
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCaspase-6 protein
Genome locationLG01:6311640..6316247
RNA-Seq ExpressionTan0001578
SyntenyTan0001578
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR043459 - NFD6/NOXY2-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7014957.1 hypothetical protein SDJN02_22588, partial [Cucurbita argyrosperma subsp. argyrosperma]8.6e-3989.9Show/hide
Query:  MAWRSSGSLSRSLISTVRASSLRSAPSLPRLRPPPLDPRPRLQSRRLSFSTSRNLGELGCTQSLLPM-NIMGATCLTSHLCVSARAFCELSHGRNGKDG
        MAWRSSGSLSR+LIS+VR+SSLRS+PSLPRLR PP+  RPRLQSRRLSFSTSRNLGELGCTQSLLPM +IMGATCLTSHLCVSARAFCELSHGRNGKDG
Subjt:  MAWRSSGSLSRSLISTVRASSLRSAPSLPRLRPPPLDPRPRLQSRRLSFSTSRNLGELGCTQSLLPM-NIMGATCLTSHLCVSARAFCELSHGRNGKDG

XP_022149349.1 uncharacterized protein LOC111017781 isoform X2 [Momordica charantia]1.9e-3895.74Show/hide
Query:  SGSLSRSLISTVRASSLRSAPSLPRLRPPPLDPRPRLQSRRLSFSTSRNLGELGCTQSLLPMN-IMGATCLTSHLCVSARAFCELSHGRNGKDG
        SGSLSRSLISTVRASSLRSAPSLPRLRPPPL  RPRLQSRRLSFSTSRNLGELGCTQSLLPM+ IMGATCLTSHLCVSARAFCELSHGRNGKDG
Subjt:  SGSLSRSLISTVRASSLRSAPSLPRLRPPPLDPRPRLQSRRLSFSTSRNLGELGCTQSLLPMN-IMGATCLTSHLCVSARAFCELSHGRNGKDG

XP_022941570.1 uncharacterized protein LOC111446884 isoform X3 [Cucurbita moschata]3.0e-3990.82Show/hide
Query:  MAWRSSGSLSRSLISTVRASSLRSAPSLPRLRPPPLDPRPRLQSRRLSFSTSRNLGELGCTQSLLPMNIMGATCLTSHLCVSARAFCELSHGRNGKDG
        MAWR SGSLSRSLISTVRASSLRSAPSLPRL PPPL  RPRLQSRRLSFSTSRNLGELGCTQSLL M+I+GATCLTSHLCVS RAFCELSHG NGKDG
Subjt:  MAWRSSGSLSRSLISTVRASSLRSAPSLPRLRPPPLDPRPRLQSRRLSFSTSRNLGELGCTQSLLPMNIMGATCLTSHLCVSARAFCELSHGRNGKDG

XP_022983440.1 uncharacterized protein LOC111482036 isoform X2 [Cucurbita maxima]7.8e-4091.84Show/hide
Query:  MAWRSSGSLSRSLISTVRASSLRSAPSLPRLRPPPLDPRPRLQSRRLSFSTSRNLGELGCTQSLLPMNIMGATCLTSHLCVSARAFCELSHGRNGKDG
        MAWR SGSLSRSLISTVRASSLRSAPSLPRLRPPPL  RPRLQSRRLSFSTSRNLGELGCTQSLL M+I+GATCLTSHLCVS RAFCELSHG NGKDG
Subjt:  MAWRSSGSLSRSLISTVRASSLRSAPSLPRLRPPPLDPRPRLQSRRLSFSTSRNLGELGCTQSLLPMNIMGATCLTSHLCVSARAFCELSHGRNGKDG

XP_038883079.1 uncharacterized protein LOC120074027 isoform X2 [Benincasa hispida]2.7e-4093.94Show/hide
Query:  MAWRSSGSLSRSLISTVRASSLRSAPSLPRLRPPPLDPRPRLQSRRLSFSTSRNLGELGCTQSLLPM-NIMGATCLTSHLCVSARAFCELSHGRNGKDG
        MAWRSSGSLSRSLIS+VRASSLRSAPSLPRLRPPPL  RPR QSRRLSFSTSRNLGELGCTQSLLPM +IMGATCLTSHLCVSARAFCELSHGRNGKDG
Subjt:  MAWRSSGSLSRSLISTVRASSLRSAPSLPRLRPPPLDPRPRLQSRRLSFSTSRNLGELGCTQSLLPM-NIMGATCLTSHLCVSARAFCELSHGRNGKDG

TrEMBL top hitse value%identityAlignment
A0A6J1D6T0 uncharacterized protein LOC111017781 isoform X29.3e-3995.74Show/hide
Query:  SGSLSRSLISTVRASSLRSAPSLPRLRPPPLDPRPRLQSRRLSFSTSRNLGELGCTQSLLPMN-IMGATCLTSHLCVSARAFCELSHGRNGKDG
        SGSLSRSLISTVRASSLRSAPSLPRLRPPPL  RPRLQSRRLSFSTSRNLGELGCTQSLLPM+ IMGATCLTSHLCVSARAFCELSHGRNGKDG
Subjt:  SGSLSRSLISTVRASSLRSAPSLPRLRPPPLDPRPRLQSRRLSFSTSRNLGELGCTQSLLPMN-IMGATCLTSHLCVSARAFCELSHGRNGKDG

A0A6J1FU19 uncharacterized protein LOC111446884 isoform X31.4e-3990.82Show/hide
Query:  MAWRSSGSLSRSLISTVRASSLRSAPSLPRLRPPPLDPRPRLQSRRLSFSTSRNLGELGCTQSLLPMNIMGATCLTSHLCVSARAFCELSHGRNGKDG
        MAWR SGSLSRSLISTVRASSLRSAPSLPRL PPPL  RPRLQSRRLSFSTSRNLGELGCTQSLL M+I+GATCLTSHLCVS RAFCELSHG NGKDG
Subjt:  MAWRSSGSLSRSLISTVRASSLRSAPSLPRLRPPPLDPRPRLQSRRLSFSTSRNLGELGCTQSLLPMNIMGATCLTSHLCVSARAFCELSHGRNGKDG

A0A6J1J1R9 uncharacterized protein LOC111482625 isoform X23.5e-3888.89Show/hide
Query:  MAWRSSGSLSRSLISTVRASSLRSAPSLPRLRPPPLDPRPRLQSRRLSFSTSRNLGELGCTQSLLPM-NIMGATCLTSHLCVSARAFCELSHGRNGKDG
        MAWRSSGSLSR+LIS+VR SSLRS+PSLPRLR PPL  RPRLQSRRL+FSTSRNLGELGCTQSLLPM +IMGATCLTSHLCV ARAFCELSHGRNGKDG
Subjt:  MAWRSSGSLSRSLISTVRASSLRSAPSLPRLRPPPLDPRPRLQSRRLSFSTSRNLGELGCTQSLLPM-NIMGATCLTSHLCVSARAFCELSHGRNGKDG

A0A6J1J270 uncharacterized protein LOC111482036 isoform X16.6e-3792.39Show/hide
Query:  MAWRSSGSLSRSLISTVRASSLRSAPSLPRLRPPPLDPRPRLQSRRLSFSTSRNLGELGCTQSLLPMNIMGATCLTSHLCVSARAFCELSHG
        MAWR SGSLSRSLISTVRASSLRSAPSLPRLRPPPL  RPRLQSRRLSFSTSRNLGELGCTQSLL M+I+GATCLTSHLCVS RAFCELSHG
Subjt:  MAWRSSGSLSRSLISTVRASSLRSAPSLPRLRPPPLDPRPRLQSRRLSFSTSRNLGELGCTQSLLPMNIMGATCLTSHLCVSARAFCELSHG

A0A6J1J5V3 uncharacterized protein LOC111482036 isoform X23.8e-4091.84Show/hide
Query:  MAWRSSGSLSRSLISTVRASSLRSAPSLPRLRPPPLDPRPRLQSRRLSFSTSRNLGELGCTQSLLPMNIMGATCLTSHLCVSARAFCELSHGRNGKDG
        MAWR SGSLSRSLISTVRASSLRSAPSLPRLRPPPL  RPRLQSRRLSFSTSRNLGELGCTQSLL M+I+GATCLTSHLCVS RAFCELSHG NGKDG
Subjt:  MAWRSSGSLSRSLISTVRASSLRSAPSLPRLRPPPLDPRPRLQSRRLSFSTSRNLGELGCTQSLLPMNIMGATCLTSHLCVSARAFCELSHGRNGKDG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G15000.1 unknown protein2.0e-1757.29Show/hide
Query:  MAWRSSGSLSRSLIS-TVRASSLRS-APSLPRLRPPPLDPRPRLQSRRLSFST-SRNLGELGCTQSLLPM-NIMGATCLTSHLCVSARAFCELSHG
        MAWR++GS +RS +S T R+ SLRS   +LPRLRP    P+  L SRR +FS+ SRNLG LGCTQS LP+ +++  + LTSHL V+ RAFCELS+G
Subjt:  MAWRSSGSLSRSLIS-TVRASSLRS-APSLPRLRPPPLDPRPRLQSRRLSFST-SRNLGELGCTQSLLPM-NIMGATCLTSHLCVSARAFCELSHG

AT2G15000.2 unknown protein2.3e-1857.84Show/hide
Query:  MAWRSSGSLSRSLIS-TVRASSLRS-APSLPRLRPPPLDPRPRLQSRRLSFST-SRNLGELGCTQSLLPM-NIMGATCLTSHLCVSARAFCELSHGRNGK
        MAWR++GS +RS +S T R+ SLRS   +LPRLRP    P+  L SRR +FS+ SRNLG LGCTQS LP+ +++  + LTSHL V+ RAFCELS+G  GK
Subjt:  MAWRSSGSLSRSLIS-TVRASSLRS-APSLPRLRPPPLDPRPRLQSRRLSFST-SRNLGELGCTQSLLPM-NIMGATCLTSHLCVSARAFCELSHGRNGK

Query:  DG
        DG
Subjt:  DG

AT2G15000.3 unknown protein2.3e-1857.84Show/hide
Query:  MAWRSSGSLSRSLIS-TVRASSLRS-APSLPRLRPPPLDPRPRLQSRRLSFST-SRNLGELGCTQSLLPM-NIMGATCLTSHLCVSARAFCELSHGRNGK
        MAWR++GS +RS +S T R+ SLRS   +LPRLRP    P+  L SRR +FS+ SRNLG LGCTQS LP+ +++  + LTSHL V+ RAFCELS+G  GK
Subjt:  MAWRSSGSLSRSLIS-TVRASSLRS-APSLPRLRPPPLDPRPRLQSRRLSFST-SRNLGELGCTQSLLPM-NIMGATCLTSHLCVSARAFCELSHGRNGK

Query:  DG
        DG
Subjt:  DG

AT2G15000.4 unknown protein2.0e-1757.29Show/hide
Query:  MAWRSSGSLSRSLIS-TVRASSLRS-APSLPRLRPPPLDPRPRLQSRRLSFST-SRNLGELGCTQSLLPM-NIMGATCLTSHLCVSARAFCELSHG
        MAWR++GS +RS +S T R+ SLRS   +LPRLRP    P+  L SRR +FS+ SRNLG LGCTQS LP+ +++  + LTSHL V+ RAFCELS+G
Subjt:  MAWRSSGSLSRSLIS-TVRASSLRS-APSLPRLRPPPLDPRPRLQSRRLSFST-SRNLGELGCTQSLLPM-NIMGATCLTSHLCVSARAFCELSHG

AT2G15000.5 unknown protein2.0e-1757.29Show/hide
Query:  MAWRSSGSLSRSLIS-TVRASSLRS-APSLPRLRPPPLDPRPRLQSRRLSFST-SRNLGELGCTQSLLPM-NIMGATCLTSHLCVSARAFCELSHG
        MAWR++GS +RS +S T R+ SLRS   +LPRLRP    P+  L SRR +FS+ SRNLG LGCTQS LP+ +++  + LTSHL V+ RAFCELS+G
Subjt:  MAWRSSGSLSRSLIS-TVRASSLRS-APSLPRLRPPPLDPRPRLQSRRLSFST-SRNLGELGCTQSLLPM-NIMGATCLTSHLCVSARAFCELSHG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTGGCGCTCTTCTGGTTCTCTATCTCGCTCCTTGATTTCCACTGTTAGGGCTTCTTCTCTCCGATCGGCTCCTTCTCTTCCTCGGCTACGCCCGCCACCTCTCGA
TCCTCGACCTCGTCTTCAATCTCGTCGGCTTTCATTTTCCACTTCCAGGAATTTGGGAGAACTGGGATGCACGCAGTCTCTTCTGCCCATGAACATCATGGGTGCTACCT
GCCTGACTTCACACCTATGTGTTAGCGCACGAGCTTTTTGCGAGCTGTCTCATGGAAGGAATGGAAAAGATGGGTGA
mRNA sequenceShow/hide mRNA sequence
CACCATAGCATTTCCTCGTAAGGATCTGTACCCTTGTAAAACCCTCGCAGTGCTGCACTTCGACTTCGTCTTGTCGAACAAAGAGCTTTCACTACTAAAATGGCTTGGCG
CTCTTCTGGTTCTCTATCTCGCTCCTTGATTTCCACTGTTAGGGCTTCTTCTCTCCGATCGGCTCCTTCTCTTCCTCGGCTACGCCCGCCACCTCTCGATCCTCGACCTC
GTCTTCAATCTCGTCGGCTTTCATTTTCCACTTCCAGGAATTTGGGAGAACTGGGATGCACGCAGTCTCTTCTGCCCATGAACATCATGGGTGCTACCTGCCTGACTTCA
CACCTATGTGTTAGCGCACGAGCTTTTTGCGAGCTGTCTCATGGAAGGAATGGAAAAGATGGGTGATGCACGAGGACATTCTATCATTCGAAAGCAGGAGACTGTCGCAA
CATTTGTACTTGAGTGATCTCCAGTAGAAGCACACATGTGAGCAATTCTATGGGAGTGAAGTCCAGTATAGAGAGAGAGAGAGTTTCTGGTAACTTTATTAGGTTGGTAG
ATGTGATAAAAAATTTCCTAGCTGTTCAGGGAATTCATCCTATTTTTCCACACTACTCGGTTTGACTATGGATTCCATAAAATTATCAATTGTTGATTATCTGACCTGCT
TTGTTTCTTCTGGGCCTGAATTTTGTAATTATTTACCTTCAATTACTTTCATACCAGTTACTGGATTTAGAATCACATTATGATATTCTTGTTTCTATTAGCTTTGTTAG
TACCTTAAACCTAAGAATTATCGATTTGGACACAGATCAACAGAGAGATAAATAATAATAGTCCATCTTTAAGTTTCCAACATGAAAGTGAC
Protein sequenceShow/hide protein sequence
MAWRSSGSLSRSLISTVRASSLRSAPSLPRLRPPPLDPRPRLQSRRLSFSTSRNLGELGCTQSLLPMNIMGATCLTSHLCVSARAFCELSHGRNGKDG