CuGenDBv2

Tan0015631 (gene) of Snake gourd v1 genome

Gene ID: Tan0015631
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Unknown protein
Genome location: LG01:25738662..25739293
RNA-Seq Expression: Tan0015631
Synteny: Tan0015631
Gene Ontology terms: NA
InterPro domains: NA
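
The genome location field packs the linkage group and the coordinates into one string. A minimal Python sketch for splitting it apart, assuming the `seqid:start..end` layout seen in this record and assuming the coordinates are 1-based and inclusive (a common annotation convention that this page does not state). The names `Location` and `parse_location` are hypothetical helpers for illustration, not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string (hypothetical helper)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("LG01:25738662..25739293")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, `loc.span` is the number of genomic bases covered by the gene model.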


Homology
GenBank top hits (e value, % identity, alignment)
XP_008448684.1 PREDICTED: uncharacterized protein LOC103490783 [Cucumis melo] (e value: 2.1e-28; % identity: 68.07)
Query:  MSLNCLSCQLLQRSDSDRDRDHQ---DYFSDPSHSPERSWSGNLSFRPPTRQNRGG-FRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRD
        MSLNCLSCQLLQR+DS+R RD Q    Y SD     +RSWSGNLS     RQNRGG FR MA+ KVAP+ HRR     AV+FG  GKEPRL+RSSGMRRD
Subjt:  MSLNCLSCQLLQRSDSDRDRDHQ---DYFSDPSHSPERSWSGNLSFRPPTRQNRGG-FRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRD

Query:  WSFEDLRAIREEKGPSANS
        WSFEDLR IREEK PS NS
Subjt:  WSFEDLRAIREEKGPSANS

XP_022952265.1 uncharacterized protein LOC111454968 [Cucurbita moschata] (e value: 4.7e-44; % identity: 80.17)
Query:  MSLNCLSCQL-LQRSDSDRDRDHQDYFSDPSHSPERSWSGNLSFRPPTRQNRGGFRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRDWSF
        MSLNCLSCQL LQRSDSD+D +  +YF+D + SP+RSWSGNLSFRPPTRQN  GFR   E KVAPMGHRRLHSTGAVAFGGPGKEPRLIRS+GMRRDWSF
Subjt:  MSLNCLSCQL-LQRSDSDRDRDHQDYFSDPSHSPERSWSGNLSFRPPTRQNRGGFRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRDWSF

Query:  EDLRAIREEKGPSANS
        EDLRAIRE+K  S NS
Subjt:  EDLRAIREEKGPSANS

XP_023539506.1 uncharacterized protein LOC111800149 [Cucurbita pepo subsp. pepo] (e value: 1.0e-43; % identity: 80.17)
Query:  MSLNCLSCQL-LQRSDSDRDRDHQDYFSDPSHSPERSWSGNLSFRPPTRQNRGGFRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRDWSF
        MSLNCLSCQL LQRSDSD+D +  +YF+D + SP+RSWSGNLSFRPPTRQN  GFR   E KVAP GHRRLHSTGAVAFGGPGKEPRLIRS+GMRRDWSF
Subjt:  MSLNCLSCQL-LQRSDSDRDRDHQDYFSDPSHSPERSWSGNLSFRPPTRQNRGGFRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRDWSF

Query:  EDLRAIREEKGPSANS
        EDLRAIREEK  S NS
Subjt:  EDLRAIREEKGPSANS

XP_031737872.1 uncharacterized protein LOC116402546 [Cucumis sativus] (e value: 2.5e-29; % identity: 64.62)
Query:  MSLNCLSCQLLQRSDSDRDRDHQ---DYFSDPSHSPERSWSGNLSFRPPTRQNRG------------GFRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEP
        MSLNCLSCQ+LQR+DS+R RD Q    Y SD  +S ERSWSGNL  RP    NRG            GFR MA+ KVAP+GHRR     AV+FG  GKEP
Subjt:  MSLNCLSCQLLQRSDSDRDRDHQ---DYFSDPSHSPERSWSGNLSFRPPTRQNRG------------GFRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEP

Query:  RLIRSSGMRRDWSFEDLRAIREEKGPSANS
        RLIRSSGMRRDWSFEDLRAIREEK PS NS
Subjt:  RLIRSSGMRRDWSFEDLRAIREEKGPSANS

XP_038905810.1 uncharacterized protein LOC120091761 [Benincasa hispida] (e value: 4.1e-32; % identity: 69.83)
Query:  MSLNCLSCQLLQRSDSDRDRDHQ---DYFSDPSHSPERSWSGNLSFRPPTRQNRGGFRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRDW
        MSLNCLSCQ+LQR+DS+R RD Q    Y SD   S ERSWSGNLSFRP  R NRGGFR + E KVAP+ HRR     AV+FG  GKEPRL+RSSGMRRDW
Subjt:  MSLNCLSCQLLQRSDSDRDRDHQ---DYFSDPSHSPERSWSGNLSFRPPTRQNRGGFRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRDW

Query:  SFEDLRAIREEKGPSA
        SFEDLR IREE+ PSA
Subjt:  SFEDLRAIREEKGPSA

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L198 Uncharacterized protein (e value: 4.1e-30; % identity: 66.67)
Query:  MSLNCLSCQLLQRSDSDRDRDHQ---DYFSDPSHSPERSWSGNLSFRPPTRQNRG--------GFRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIR
        MSLNCLSCQ+LQR+DS+R RD Q    Y SD  +S ERSWSGNL  RP    NRG        GFR MA+ KVAP+GHRR     AV+FG  GKEPRLIR
Subjt:  MSLNCLSCQLLQRSDSDRDRDHQ---DYFSDPSHSPERSWSGNLSFRPPTRQNRG--------GFRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIR

Query:  SSGMRRDWSFEDLRAIREEKGPSANS
        SSGMRRDWSFEDLRAIREEK PS NS
Subjt:  SSGMRRDWSFEDLRAIREEKGPSANS

A0A1S3BJN6 uncharacterized protein LOC103490783 (e value: 1.0e-28; % identity: 68.07)
Query:  MSLNCLSCQLLQRSDSDRDRDHQ---DYFSDPSHSPERSWSGNLSFRPPTRQNRGG-FRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRD
        MSLNCLSCQLLQR+DS+R RD Q    Y SD     +RSWSGNLS     RQNRGG FR MA+ KVAP+ HRR     AV+FG  GKEPRL+RSSGMRRD
Subjt:  MSLNCLSCQLLQRSDSDRDRDHQ---DYFSDPSHSPERSWSGNLSFRPPTRQNRGG-FRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRD

Query:  WSFEDLRAIREEKGPSANS
        WSFEDLR IREEK PS NS
Subjt:  WSFEDLRAIREEKGPSANS

A0A5A7UAS3 Uncharacterized protein (e value: 1.0e-28; % identity: 68.07)
Query:  MSLNCLSCQLLQRSDSDRDRDHQ---DYFSDPSHSPERSWSGNLSFRPPTRQNRGG-FRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRD
        MSLNCLSCQLLQR+DS+R RD Q    Y SD     +RSWSGNLS     RQNRGG FR MA+ KVAP+ HRR     AV+FG  GKEPRL+RSSGMRRD
Subjt:  MSLNCLSCQLLQRSDSDRDRDHQ---DYFSDPSHSPERSWSGNLSFRPPTRQNRGG-FRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRD

Query:  WSFEDLRAIREEKGPSANS
        WSFEDLR IREEK PS NS
Subjt:  WSFEDLRAIREEKGPSANS

A0A6J1GL99 uncharacterized protein LOC111454968 (e value: 2.3e-44; % identity: 80.17)
Query:  MSLNCLSCQL-LQRSDSDRDRDHQDYFSDPSHSPERSWSGNLSFRPPTRQNRGGFRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRDWSF
        MSLNCLSCQL LQRSDSD+D +  +YF+D + SP+RSWSGNLSFRPPTRQN  GFR   E KVAPMGHRRLHSTGAVAFGGPGKEPRLIRS+GMRRDWSF
Subjt:  MSLNCLSCQL-LQRSDSDRDRDHQDYFSDPSHSPERSWSGNLSFRPPTRQNRGGFRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRDWSF

Query:  EDLRAIREEKGPSANS
        EDLRAIRE+K  S NS
Subjt:  EDLRAIREEKGPSANS

A0A6J1L834 uncharacterized protein LOC111500186 (e value: 8.6e-28; % identity: 62.16)
Query:  MSLNCLSCQLLQRSDSDRDRDH--QDYFSDPSHSPERSWSGNLSFRPPTRQNRGGFRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRDWS
        M+LNCLSCQLLQR+DS+RD D   Q+Y+S       RSWSGNLSFRPP R  +   RA+ E +  P+  RRLHS+G ++ G   KEP+L+RSSGMRRDWS
Subjt:  MSLNCLSCQLLQRSDSDRDRDH--QDYFSDPSHSPERSWSGNLSFRPPTRQNRGGFRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRDWS

Query:  FEDLRAIREEK
        FEDLRAIREEK
Subjt:  FEDLRAIREEK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G35215.1 unknown protein (e value: 6.6e-12; % identity: 39.64)
Query:  MSLNCLSCQLLQRSDSDRD---RDHQDYFSDPSHSPERSWSGNLSFRPPTRQNRGGFRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRDW
        MSLNCL+C +LQR+DSDRD   R    +  + + S       N S  P  R+                GHRRL+S   + + G   EP+L+RSSG+RRDW
Subjt:  MSLNCLSCQLLQRSDSDRD---RDHQDYFSDPSHSPERSWSGNLSFRPPTRQNRGGFRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRDW

Query:  SFEDLRAIREE
        SFEDL+  +++
Subjt:  SFEDLRAIREE

AT5G46770.1 unknown protein (e value: 1.4e-09; % identity: 39.39)
Query:  MSLNCLSCQLLQRSDSDRDRDHQDYFSDP-----------------SHSPERSWSGNLSFRPPTRQNR-GGFRAMAEKKVAPMGHRRLHS-TGAVAFGGP
        MSLNCLSCQ L R+DS++D D     S P                 +    R+WSGNLS R   +  R G   A   KKV  + H RL    G+     P
Subjt:  MSLNCLSCQLLQRSDSDRDRDHQDYFSDP-----------------SHSPERSWSGNLSFRPPTRQNR-GGFRAMAEKKVAPMGHRRLHS-TGAVAFGGP

Query:  GK--EPRLIRSSGMRRDWSFEDLR--AIREEK
         +  +P+L+RS+G+RR+WSFE+LR   + EEK
Subjt:  GK--EPRLIRSSGMRRDWSFEDLR--AIREEK
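
In the alignments above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The Python sketch below counts these for a single block, using the second block of the AT2G35215.1 alignment as input. The function name `midline_stats` is hypothetical, and the code assumes the middle line has already been trimmed to the same columns as the query segment. The % identity reported for each hit covers the whole alignment, so a single block will generally give a different figure.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one BLAST midline block."""
    # Letters in the midline mark identical residues.
    identities = sum(ch.isalpha() for ch in midline)
    # '+' marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# Second block of the AT2G35215.1 alignment, leading offset removed.
query = "SFEDLRAIREE"
midline = "SFEDL+  +++"

ident, pos, cols = midline_stats(midline)
if len(query) != cols:
    raise ValueError("midline and query segment must cover the same columns")
print(f"{ident}/{cols} identical ({100 * ident / cols:.2f}%), {pos}/{cols} positive")
```

Summing identities and columns over every block of a hit, then dividing, gives a whole-alignment figure that can be compared with the % identity value listed for that hit.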


Sequences
CDS sequence
ATGAGTCTGAATTGCCTCTCATGTCAACTCTTACAGAGATCGGACTCCGACAGAGACCGCGACCACCAAGATTATTTCTCTGATCCCTCTCACTCGCCGGAGAGAAGCTG
GTCCGGCAACCTCTCGTTCCGGCCTCCCACTCGCCAAAACAGAGGAGGGTTTCGGGCCATGGCGGAGAAGAAGGTGGCGCCGATGGGCCACCGCCGTCTTCACAGTACCG
GCGCCGTCGCTTTCGGCGGCCCCGGTAAGGAGCCCAGGCTGATTAGAAGCTCGGGGATGAGGAGGGATTGGAGCTTTGAGGATCTCAGAGCCATTCGAGAGGAAAAGGGG
CCATCTGCCAATTCCTAA
mRNA sequence
CATTAACCCAAAAATTAGAGGGTGGTTAAGCTATAATTCCGGATTACTGAAATTCCCACAACCAAATCCCCAATTTAAAACCCTAAAATTCGTCTCCATCAATCCCCAAT
TCTCCCCGTAAAATTCCCCAAATCCAAAACCCTCTCTCCGTCGTCTTCCTCCTCTGTAACCATGAGTCTGAATTGCCTCTCATGTCAACTCTTACAGAGATCGGACTCCG
ACAGAGACCGCGACCACCAAGATTATTTCTCTGATCCCTCTCACTCGCCGGAGAGAAGCTGGTCCGGCAACCTCTCGTTCCGGCCTCCCACTCGCCAAAACAGAGGAGGG
TTTCGGGCCATGGCGGAGAAGAAGGTGGCGCCGATGGGCCACCGCCGTCTTCACAGTACCGGCGCCGTCGCTTTCGGCGGCCCCGGTAAGGAGCCCAGGCTGATTAGAAG
CTCGGGGATGAGGAGGGATTGGAGCTTTGAGGATCTCAGAGCCATTCGAGAGGAAAAGGGGCCATCTGCCAATTCCTAACAACTTACTAACTTCTTTTTTTTTTCCTTGT
TCTTTTCTATTTTTTTATTATATATATATATTTGAAATTTGTTTGGACATGAATATATAAAAAAGTAGAAGGAATGGGGAGA
Protein sequence
MSLNCLSCQLLQRSDSDRDRDHQDYFSDPSHSPERSWSGNLSFRPPTRQNRGGFRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRDWSFEDLRAIREEKG
PSANS
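
As a consistency check on the sequences above, the CDS can be translated and compared with the listed protein. This is a minimal Python sketch, assuming the standard nuclear genetic code applies; the `translate` helper and the constant names are illustrative and not part of CuGenDB. Line wraps in the CDS are removed before translation.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS up to (not including) the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line wraps and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


CDS = """
ATGAGTCTGAATTGCCTCTCATGTCAACTCTTACAGAGATCGGACTCCGACAGAGACCGCGACCACCAAGATTATTTCTCTGATCCCTCTCACTCGCCGGAGAGAAGCTG
GTCCGGCAACCTCTCGTTCCGGCCTCCCACTCGCCAAAACAGAGGAGGGTTTCGGGCCATGGCGGAGAAGAAGGTGGCGCCGATGGGCCACCGCCGTCTTCACAGTACCG
GCGCCGTCGCTTTCGGCGGCCCCGGTAAGGAGCCCAGGCTGATTAGAAGCTCGGGGATGAGGAGGGATTGGAGCTTTGAGGATCTCAGAGCCATTCGAGAGGAAAAGGGG
CCATCTGCCAATTCCTAA
"""

PROTEIN = (
    "MSLNCLSCQLLQRSDSDRDRDHQDYFSDPSHSPERSWSGNLSFRPPTRQNRGGFRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRDWSFEDLRAIREEKG"
    "PSANS"
)

translated = translate(CDS)
print(translated == PROTEIN, len(translated))
```

The same helper can be pointed at the coding region of the mRNA sequence, which begins at the first `ATG` that starts the CDS, to confirm that the two nucleotide records agree.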