CuGenDBv2

Tan0021451 (gene) of Snake gourd v1 genome

Gene ID: Tan0021451
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrotransposon protein
Genome location: LG11:47665550..47666317
RNA-Seq Expression: Tan0021451
Synteny: Tan0021451
Gene Ontology terms: NA
InterPro domains: NA
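
The genome location field can be split into a linkage group and a coordinate pair. The following is a minimal Python sketch; the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(location: str):
    """Split a 'GROUP:START..END' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the
    span length is END - START + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    group = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return group, start, end, end - start + 1

# Location string taken from the record above.
print(parse_location("LG11:47665550..47666317"))
```

Under that assumption the gene spans 768 bp on LG11.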


Homology
GenBank top hits (e value, % identity, alignment)
KAA0067273.1 retrotransposon protein [Cucumis melo var. makuwa]; e value 5.3e-44; identity 46.77%
Query:  MLGPACSGFGWNEERKCVEAEKDIFDTWVMSHPHAKGLRNRPFPLFNDLAVIFGKDRATGAGAQSPSDMASDAATDL-EDDLNPDFPDSFLRDPPPVDVS
        M+GPACS FGWNEE+KC+EA+K +FD WV  HP+A+GL N+ FP F DL ++FG+DRATG   ++P +M    A D  EDD++ +  D  + +P  ++  
Subjt:  MLGPACSGFGWNEERKCVEAEKDIFDTWVMSHPHAKGLRNRPFPLFNDLAVIFGKDRATGAGAQSPSDMASDAATDL-EDDLNPDFPDSFLRDPPPVDVS

Query:  MGEDTSATPTSRTQEVGSSRPTRRRRLSSGSNMADIMSQGFQLTAEQIGKIAAWLSEKDRIERARVNELYAELKAIPGMTRQDCMMVAKTLLSDSMMLNS
         GED S+TPTS   +VGSS+P+++RR S   ++ D      + T+++I KIAAW  EK  IE +    LYAEL+ IPGM   DC++V ++LL D +ML++
Subjt:  MGEDTSATPTSRTQEVGSSRPTRRRRLSSGSNMADIMSQGFQLTAEQIGKIAAWLSEKDRIERARVNELYAELKAIPGMTRQDCMMVAKTLLSDSMMLNS

Query:  F
        F
Subjt:  F

TYK07921.1 hypothetical protein E5676_scaffold265G00330 [Cucumis melo var. makuwa]; e value 3.9e-47; identity 49.25%
Query:  MLGPACSGFGWNEERKCVEAEKDIFDTWVMSHPHAKGLRNRPFPLFNDLAVIFGKDRATGAGAQSPSDMASDAATDL-EDDLNPDFPDSFLRDPPPVDVS
        M+GPACSGFGWNE +KC+E EK +FD WV  HP+A+GL N+PFP F DL V+FG+DRATG   ++P +M+S  A D  EDD++ +  D  + +P  ++  
Subjt:  MLGPACSGFGWNEERKCVEAEKDIFDTWVMSHPHAKGLRNRPFPLFNDLAVIFGKDRATGAGAQSPSDMASDAATDL-EDDLNPDFPDSFLRDPPPVDVS

Query:  MGEDTSATPTSRTQEVGSSRPTRRRRLSSGSNMADIMSQGFQLTAEQIGKIAAWLSEKDRIERARVNELYAELKAIPGMTRQDCMMVAKTLLSDSMMLNS
         GED  +TPTS T + GSSRP+++RR  SG ++ D      + T+++IGKIA W  EK  IE +    LYAEL+ IPGM   DC++VA++LL D  ML++
Subjt:  MGEDTSATPTSRTQEVGSSRPTRRRRLSSGSNMADIMSQGFQLTAEQIGKIAAWLSEKDRIERARVNELYAELKAIPGMTRQDCMMVAKTLLSDSMMLNS

Query:  F
        F
Subjt:  F

TYK10886.1 retrotransposon protein [Cucumis melo var. makuwa]; e value 9.0e-44; identity 46.77%
Query:  MLGPACSGFGWNEERKCVEAEKDIFDTWVMSHPHAKGLRNRPFPLFNDLAVIFGKDRATGAGAQSPSDMASDAATDL-EDDLNPDFPDSFLRDPPPVDVS
        M+GPACS FGWNEE+KC+EA+K +FD WV  HP+A+GL N+ FP F DL ++FG+DRATG   ++P +M    A D  EDD++ +  D  + +P  ++  
Subjt:  MLGPACSGFGWNEERKCVEAEKDIFDTWVMSHPHAKGLRNRPFPLFNDLAVIFGKDRATGAGAQSPSDMASDAATDL-EDDLNPDFPDSFLRDPPPVDVS

Query:  MGEDTSATPTSRTQEVGSSRPTRRRRLSSGSNMADIMSQGFQLTAEQIGKIAAWLSEKDRIERARVNELYAELKAIPGMTRQDCMMVAKTLLSDSMMLNS
         GED S+TPTS   +VGSSRP+++RR S   ++ D      + T+++I KI AW  EK  IE +    LYAEL+ IPGM   DC++V ++LL D +ML++
Subjt:  MGEDTSATPTSRTQEVGSSRPTRRRRLSSGSNMADIMSQGFQLTAEQIGKIAAWLSEKDRIERARVNELYAELKAIPGMTRQDCMMVAKTLLSDSMMLNS

Query:  F
        F
Subjt:  F

XP_008460440.1 PREDICTED: uncharacterized protein LOC103499248 [Cucumis melo]; e value 2.6e-43; identity 46.5%
Query:  LGPACSGFGWNEERKCVEAEKDIFDTWVMSHPHAKGLRNRPFPLFNDLAVIFGKDRATGAGAQSPSDMASDAATDL-EDDLNPDFPDSFLRDPPPVDVSM
        +GPACS FGWNEE+KC+EA+K +FD WV  HP+A+GL N+ FP F DL ++FG+DRATG   ++P +M    A D  EDD++ +  D  + +P  ++   
Subjt:  LGPACSGFGWNEERKCVEAEKDIFDTWVMSHPHAKGLRNRPFPLFNDLAVIFGKDRATGAGAQSPSDMASDAATDL-EDDLNPDFPDSFLRDPPPVDVSM

Query:  GEDTSATPTSRTQEVGSSRPTRRRRLSSGSNMADIMSQGFQLTAEQIGKIAAWLSEKDRIERARVNELYAELKAIPGMTRQDCMMVAKTLLSDSMMLNSF
        GED S+TPTS   +VGSS+P+++RR S   ++ D      + T+++I KIAAW  EK  IE +    LYAEL+ IPGM   DC++V ++LL D +ML++F
Subjt:  GEDTSATPTSRTQEVGSSRPTRRRRLSSGSNMADIMSQGFQLTAEQIGKIAAWLSEKDRIERARVNELYAELKAIPGMTRQDCMMVAKTLLSDSMMLNSF

XP_022158565.1 uncharacterized protein LOC111025018 [Momordica charantia]; e value 1.1e-33; identity 38.14%
Query:  GFGWNEERKCVEAEKDIFDTWVMSHPHAKGLRNRPFPLFNDLAVIFGKDRATGAGAQSPSDMASDAATDLEDDLNPDFPDSFLRDPP---PVDVSMGEDT
        GFGWN++ KC+EAEK++FD WV SHP+AKGLRN+P P ++DL V FGKDRATGA    P DMAS AA  + +D + +  D ++ DPP     + ++ ED 
Subjt:  GFGWNEERKCVEAEKDIFDTWVMSHPHAKGLRNRPFPLFNDLAVIFGKDRATGAGAQSPSDMASDAATDLEDDLNPDFPDSFLRDPP---PVDVSMGEDT

Query:  SATPTSRTQEVGSSRPTRRRRLSSGSNMADIMSQGFQLTAEQIGKIAAWLSEKDRIERARVNELYAELKAIPGMTRQDCMMVAKTLLSDSMMLNSFFAWT
          TPTS+     SS  ++R+R    S M D++    ++    + K+A W  +K+  + AR   ++ +LK IP +   D + +   L+++     SF    
Subjt:  SATPTSRTQEVGSSRPTRRRRLSSGSNMADIMSQGFQLTAEQIGKIAAWLSEKDRIERARVNELYAELKAIPGMTRQDCMMVAKTLLSDSMMLNSFFAWT

Query:  PDWKYDYCVEVLGKA
         ++K  +C+++LGK+
Subjt:  PDWKYDYCVEVLGKA

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CC17 uncharacterized protein LOC103499248; e value 1.3e-43; identity 46.5%
Query:  LGPACSGFGWNEERKCVEAEKDIFDTWVMSHPHAKGLRNRPFPLFNDLAVIFGKDRATGAGAQSPSDMASDAATDL-EDDLNPDFPDSFLRDPPPVDVSM
        +GPACS FGWNEE+KC+EA+K +FD WV  HP+A+GL N+ FP F DL ++FG+DRATG   ++P +M    A D  EDD++ +  D  + +P  ++   
Subjt:  LGPACSGFGWNEERKCVEAEKDIFDTWVMSHPHAKGLRNRPFPLFNDLAVIFGKDRATGAGAQSPSDMASDAATDL-EDDLNPDFPDSFLRDPPPVDVSM

Query:  GEDTSATPTSRTQEVGSSRPTRRRRLSSGSNMADIMSQGFQLTAEQIGKIAAWLSEKDRIERARVNELYAELKAIPGMTRQDCMMVAKTLLSDSMMLNSF
        GED S+TPTS   +VGSS+P+++RR S   ++ D      + T+++I KIAAW  EK  IE +    LYAEL+ IPGM   DC++V ++LL D +ML++F
Subjt:  GEDTSATPTSRTQEVGSSRPTRRRRLSSGSNMADIMSQGFQLTAEQIGKIAAWLSEKDRIERARVNELYAELKAIPGMTRQDCMMVAKTLLSDSMMLNSF

A0A5A7VGQ0 Retrotransposon protein; e value 2.6e-44; identity 46.77%
Query:  MLGPACSGFGWNEERKCVEAEKDIFDTWVMSHPHAKGLRNRPFPLFNDLAVIFGKDRATGAGAQSPSDMASDAATDL-EDDLNPDFPDSFLRDPPPVDVS
        M+GPACS FGWNEE+KC+EA+K +FD WV  HP+A+GL N+ FP F DL ++FG+DRATG   ++P +M    A D  EDD++ +  D  + +P  ++  
Subjt:  MLGPACSGFGWNEERKCVEAEKDIFDTWVMSHPHAKGLRNRPFPLFNDLAVIFGKDRATGAGAQSPSDMASDAATDL-EDDLNPDFPDSFLRDPPPVDVS

Query:  MGEDTSATPTSRTQEVGSSRPTRRRRLSSGSNMADIMSQGFQLTAEQIGKIAAWLSEKDRIERARVNELYAELKAIPGMTRQDCMMVAKTLLSDSMMLNS
         GED S+TPTS   +VGSS+P+++RR S   ++ D      + T+++I KIAAW  EK  IE +    LYAEL+ IPGM   DC++V ++LL D +ML++
Subjt:  MGEDTSATPTSRTQEVGSSRPTRRRRLSSGSNMADIMSQGFQLTAEQIGKIAAWLSEKDRIERARVNELYAELKAIPGMTRQDCMMVAKTLLSDSMMLNS

Query:  F
        F
Subjt:  F

A0A5D3C7T4 Uncharacterized protein; e value 1.9e-47; identity 49.25%
Query:  MLGPACSGFGWNEERKCVEAEKDIFDTWVMSHPHAKGLRNRPFPLFNDLAVIFGKDRATGAGAQSPSDMASDAATDL-EDDLNPDFPDSFLRDPPPVDVS
        M+GPACSGFGWNE +KC+E EK +FD WV  HP+A+GL N+PFP F DL V+FG+DRATG   ++P +M+S  A D  EDD++ +  D  + +P  ++  
Subjt:  MLGPACSGFGWNEERKCVEAEKDIFDTWVMSHPHAKGLRNRPFPLFNDLAVIFGKDRATGAGAQSPSDMASDAATDL-EDDLNPDFPDSFLRDPPPVDVS

Query:  MGEDTSATPTSRTQEVGSSRPTRRRRLSSGSNMADIMSQGFQLTAEQIGKIAAWLSEKDRIERARVNELYAELKAIPGMTRQDCMMVAKTLLSDSMMLNS
         GED  +TPTS T + GSSRP+++RR  SG ++ D      + T+++IGKIA W  EK  IE +    LYAEL+ IPGM   DC++VA++LL D  ML++
Subjt:  MGEDTSATPTSRTQEVGSSRPTRRRRLSSGSNMADIMSQGFQLTAEQIGKIAAWLSEKDRIERARVNELYAELKAIPGMTRQDCMMVAKTLLSDSMMLNS

Query:  F
        F
Subjt:  F

A0A5D3CKC1 Retrotransposon protein; e value 4.4e-44; identity 46.77%
Query:  MLGPACSGFGWNEERKCVEAEKDIFDTWVMSHPHAKGLRNRPFPLFNDLAVIFGKDRATGAGAQSPSDMASDAATDL-EDDLNPDFPDSFLRDPPPVDVS
        M+GPACS FGWNEE+KC+EA+K +FD WV  HP+A+GL N+ FP F DL ++FG+DRATG   ++P +M    A D  EDD++ +  D  + +P  ++  
Subjt:  MLGPACSGFGWNEERKCVEAEKDIFDTWVMSHPHAKGLRNRPFPLFNDLAVIFGKDRATGAGAQSPSDMASDAATDL-EDDLNPDFPDSFLRDPPPVDVS

Query:  MGEDTSATPTSRTQEVGSSRPTRRRRLSSGSNMADIMSQGFQLTAEQIGKIAAWLSEKDRIERARVNELYAELKAIPGMTRQDCMMVAKTLLSDSMMLNS
         GED S+TPTS   +VGSSRP+++RR S   ++ D      + T+++I KI AW  EK  IE +    LYAEL+ IPGM   DC++V ++LL D +ML++
Subjt:  MGEDTSATPTSRTQEVGSSRPTRRRRLSSGSNMADIMSQGFQLTAEQIGKIAAWLSEKDRIERARVNELYAELKAIPGMTRQDCMMVAKTLLSDSMMLNS

Query:  F
        F
Subjt:  F

A0A6J1DW73 uncharacterized protein LOC111025018; e value 5.4e-34; identity 38.14%
Query:  GFGWNEERKCVEAEKDIFDTWVMSHPHAKGLRNRPFPLFNDLAVIFGKDRATGAGAQSPSDMASDAATDLEDDLNPDFPDSFLRDPP---PVDVSMGEDT
        GFGWN++ KC+EAEK++FD WV SHP+AKGLRN+P P ++DL V FGKDRATGA    P DMAS AA  + +D + +  D ++ DPP     + ++ ED 
Subjt:  GFGWNEERKCVEAEKDIFDTWVMSHPHAKGLRNRPFPLFNDLAVIFGKDRATGAGAQSPSDMASDAATDLEDDLNPDFPDSFLRDPP---PVDVSMGEDT

Query:  SATPTSRTQEVGSSRPTRRRRLSSGSNMADIMSQGFQLTAEQIGKIAAWLSEKDRIERARVNELYAELKAIPGMTRQDCMMVAKTLLSDSMMLNSFFAWT
          TPTS+     SS  ++R+R    S M D++    ++    + K+A W  +K+  + AR   ++ +LK IP +   D + +   L+++     SF    
Subjt:  SATPTSRTQEVGSSRPTRRRRLSSGSNMADIMSQGFQLTAEQIGKIAAWLSEKDRIERARVNELYAELKAIPGMTRQDCMMVAKTLLSDSMMLNSFFAWT

Query:  PDWKYDYCVEVLGKA
         ++K  +C+++LGK+
Subjt:  PDWKYDYCVEVLGKA

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G24960.2 unknown protein; e value 3.2e-07; identity 37.04%
Query:  SGFGWNEERKCVEAEKDIFDTWVMSHPHAKGLRNRPFPLFNDLAVIFGKDRATG
        +GF W+  R  V A+ DI++T++ +HP A+  R +  P + +L  IFGK+ + G
Subjt:  SGFGWNEERKCVEAEKDIFDTWVMSHPHAKGLRNRPFPLFNDLAVIFGKDRATG
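
The % identity reported for a hit can be checked against the alignment's middle line. The sketch below assumes the usual BLAST-style convention, in which a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and the denominator is the aligned length. The function name `percent_identity` is hypothetical.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent identity from a BLAST-style alignment middle line.

    Assumption: letters mark identical positions; '+' and spaces
    mark non-identical positions.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / aligned_length, 2)

# Query and middle line copied from the AT2G24960.2 alignment above.
query = "SGFGWNEERKCVEAEKDIFDTWVMSHPHAKGLRNRPFPLFNDLAVIFGKDRATG"
midline = "+GF W+  R  V A+ DI++T++ +HP A+  R +  P + +L  IFGK+ + G"

print(percent_identity(midline, len(query)))
```

For this hit the middle line carries 20 identical residues over 54 aligned positions, which matches the listed 37.04.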


Sequences
CDS sequence
ATGTTAGGACCTGCATGTAGTGGATTCGGATGGAACGAGGAGCGTAAATGCGTGGAGGCCGAGAAAGACATATTCGATACATGGGTGATGTCTCATCCACATGCAAAAGG
TCTTCGAAATAGGCCATTCCCATTGTTCAACGATCTAGCAGTCATATTTGGAAAAGATAGAGCAACAGGTGCCGGTGCACAATCTCCATCCGACATGGCATCAGATGCTG
CAACTGACTTAGAGGATGATCTAAACCCTGATTTTCCCGATTCATTCCTCCGTGATCCACCGCCAGTGGATGTATCGATGGGCGAGGACACATCAGCCACACCTACGAGT
CGAACACAAGAAGTCGGATCATCCAGACCGACTAGGAGGAGACGTCTGTCCTCGGGGTCAAATATGGCAGACATCATGAGTCAGGGTTTTCAATTGACAGCCGAACAGAT
TGGAAAGATTGCAGCCTGGCTGTCGGAGAAGGATAGAATTGAAAGAGCTCGAGTGAACGAGCTATATGCAGAGTTGAAAGCAATCCCAGGGATGACAAGACAAGATTGCA
TGATGGTTGCAAAGACGCTTCTCTCGGATAGTATGATGTTGAACTCTTTCTTTGCTTGGACCCCAGATTGGAAGTATGATTATTGTGTGGAAGTTCTGGGGAAAGCACCG
GGAACCTGA
mRNA sequence
CGCGATTGCGGAGATGTTAGGACCTGCATGTAGTGGATTCGGATGGAACGAGGAGCGTAAATGCGTGGAGGCCGAGAAAGACATATTCGATACATGGGTGATGTCTCATC
CACATGCAAAAGGTCTTCGAAATAGGCCATTCCCATTGTTCAACGATCTAGCAGTCATATTTGGAAAAGATAGAGCAACAGGTGCCGGTGCACAATCTCCATCCGACATG
GCATCAGATGCTGCAACTGACTTAGAGGATGATCTAAACCCTGATTTTCCCGATTCATTCCTCCGTGATCCACCGCCAGTGGATGTATCGATGGGCGAGGACACATCAGC
CACACCTACGAGTCGAACACAAGAAGTCGGATCATCCAGACCGACTAGGAGGAGACGTCTGTCCTCGGGGTCAAATATGGCAGACATCATGAGTCAGGGTTTTCAATTGA
CAGCCGAACAGATTGGAAAGATTGCAGCCTGGCTGTCGGAGAAGGATAGAATTGAAAGAGCTCGAGTGAACGAGCTATATGCAGAGTTGAAAGCAATCCCAGGGATGACA
AGACAAGATTGCATGATGGTTGCAAAGACGCTTCTCTCGGATAGTATGATGTTGAACTCTTTCTTTGCTTGGACCCCAGATTGGAAGTATGATTATTGTGTGGAAGTTCT
GGGGAAAGCACCGGGAACCTGA
Protein sequence
MLGPACSGFGWNEERKCVEAEKDIFDTWVMSHPHAKGLRNRPFPLFNDLAVIFGKDRATGAGAQSPSDMASDAATDLEDDLNPDFPDSFLRDPPPVDVSMGEDTSATPTS
RTQEVGSSRPTRRRRLSSGSNMADIMSQGFQLTAEQIGKIAAWLSEKDRIERARVNELYAELKAIPGMTRQDCMMVAKTLLSDSMMLNSFFAWTPDWKYDYCVEVLGKAP
GT
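
The relationship between the CDS and protein sequences above can be illustrated by translating codons with the standard genetic code. This is a minimal sketch, assuming the standard code (NCBI translation table 1) applies to this gene; `translate`, `cds_start`, and `cds_end` are hypothetical names, and only the first and last few codons of the CDS are used to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 13 codons and last 3 codons of the CDS listed above.
cds_start = "ATGTTAGGACCTGCATGTAGTGGATTCGGATGGAACGAG"
cds_end = "GGAACCTGA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MLGPACSGFGWNE` and `GT`, which agree with the start and end of the listed protein sequence. Passing the full CDS to `translate` would allow the same comparison over the whole protein.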