; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg003145 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg003145
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationscaffold4:33287992..33291953
RNA-Seq ExpressionSpg003145
SyntenySpg003145
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022897442.1 uncharacterized protein LOC111411108 [Olea europaea var. sylvestris]2.0e-0937.24Show/hide
Query:  EKDIRKAVSTVTEGSTQKG-----QSSDNVEDLNEKEEGE-------GELIPSKMGGSQQKRIKLWEDLKARKFKHFRPSQEGILCARFLAIKQEDTVAE
        E  IR+    V EG+   G     +   +V  L+E+E+ E       GE +       +++ +K WEDLKA   + FRPSQEG LCA+FL+++Q  TV E
Subjt:  EKDIRKAVSTVTEGSTQKG-----QSSDNVEDLNEKEEGE-------GELIPSKMGGSQQKRIKLWEDLKARKFKHFRPSQEGILCARFLAIKQEDTVAE

Query:  YIKRLEVY---LALLPETILENAFLNDLKRAVKAAVVSRRPNGLE
        Y +R E     L  + E ++E +FLN L+  ++A V   RP GLE
Subjt:  YIKRLEVY---LALLPETILENAFLNDLKRAVKAAVVSRRPNGLE

XP_024017591.1 uncharacterized protein LOC112090471 [Morus notabilis]3.5e-0940.82Show/hide
Query:  EGELIPSKMGGSQQKRIKLWEDLKARKFKHFRPSQEGILCARFLAIKQEDTVAEYIKRLEVYLALL---PETILENAFLNDLKRAVKAAVVSRRPNGL
        EGE +       ++  ++ W +LK +  + F  +QEG LC +FL++ QE TV EY ++ E+  ALL   PE +LE+AF+N LK  V+A V   +PNGL
Subjt:  EGELIPSKMGGSQQKRIKLWEDLKARKFKHFRPSQEGILCARFLAIKQEDTVAEYIKRLEVYLALL---PETILENAFLNDLKRAVKAAVVSRRPNGL

XP_031745472.1 uncharacterized protein K02A2.6-like [Cucumis sativus]4.6e-0932.53Show/hide
Query:  MAQKQMEERVDAMEKDIRKAVSTVTEGSTQKGQSSDNVEDLN--EKEEGEGELIPSK------------------MGGSQQKRIKLWEDLKARKFKHFRP
        MAQ+Q+EERV+  EK+I      + E      + +D + + +  +K+E  G    S                      +++KR++ WEDLK R F  F  
Subjt:  MAQKQMEERVDAMEKDIRKAVSTVTEGSTQKGQSSDNVEDLN--EKEEGEGELIPSK------------------MGGSQQKRIKLWEDLKARKFKHFRP

Query:  SQEGILCARFLAIKQEDTVAEYIKRLEVYLALLP---ETILENAFLNDLKRAVKAAVVSRRPNGLE
        + +  L AR + IKQE + +EY+K+   Y A LP   E++L +AFL  L+ +++A V+SR P  LE
Subjt:  SQEGILCARFLAIKQEDTVAEYIKRLEVYLALLP---ETILENAFLNDLKRAVKAAVVSRRPNGLE

XP_038891356.1 uncharacterized protein LOC120080793 isoform X1 [Benincasa hispida]4.1e-1044.58Show/hide
Query:  IKLWEDLKARKFKHFRPSQEGILCARFLAIKQEDTVAEYIKRLEVYLALLP---ETILENAFLNDLKRAVKAAVVSRRPNGLE
        I  WE+LK R F+++RP+ EG L AR L IKQ+ + ++Y+K+   Y   LP   E +L++AF+N L+  ++A V+SR+PN LE
Subjt:  IKLWEDLKARKFKHFRPSQEGILCARFLAIKQEDTVAEYIKRLEVYLALLP---ETILENAFLNDLKRAVKAAVVSRRPNGLE

XP_038893564.1 uncharacterized protein LOC120082453 [Benincasa hispida]5.9e-0943.02Show/hide
Query:  QKRIKLWEDLKARKFKHFRPSQEGILCARFLAIKQEDTVAEYIKRLEVYLALLP---ETILENAFLNDLKRAVKAAVVSRRPNGLE
        ++ I +WE+LK R F+++RP+ EG L AR L IKQE + A+Y+K+   Y A LP   E +L++AF+N L+  ++  V+SR+   LE
Subjt:  QKRIKLWEDLKARKFKHFRPSQEGILCARFLAIKQEDTVAEYIKRLEVYLALLP---ETILENAFLNDLKRAVKAAVVSRRPNGLE

TrEMBL top hitse value%identityAlignment
A0A5A7SSY7 Ty3/gypsy retrotransposon protein1.9e-0843.02Show/hide
Query:  QKRIKLWEDLKARKFKHFRPSQEGILCARFLAIKQEDTVAEYIKRLEVYLALLP---ETILENAFLNDLKRAVKAAVVSRRPNGLE
        +KR + W++LK R + HFR  + G  C RFLAIKQE +V EY++R E   A LP   E +L   F N L   ++  V + R  GLE
Subjt:  QKRIKLWEDLKARKFKHFRPSQEGILCARFLAIKQEDTVAEYIKRLEVYLALLP---ETILENAFLNDLKRAVKAAVVSRRPNGLE

A0A5A7T3B8 Retrotransposon protein4.9e-0943.18Show/hide
Query:  SQQKRIKLWEDLKARKFKHFRPSQEGILCARFLAIKQEDTVAEYIKRLEVYLALLP---ETILENAFLNDLKRAVKAAVVSRRPNGLE
        + +K+I  WEDLK R F+HF+ S EG L AR + I+Q+   A+Y+K+   Y A LP   E++L +AF+  L+  ++A V+SR P  LE
Subjt:  SQQKRIKLWEDLKARKFKHFRPSQEGILCARFLAIKQEDTVAEYIKRLEVYLALLP---ETILENAFLNDLKRAVKAAVVSRRPNGLE

A0A5A7U1I8 Gypsy/ty3 element polyprotein3.8e-0944.94Show/hide
Query:  GSQQKRIKLWEDLKARKFKHFRPSQEGILCARFLAIKQEDTVAEYIKRLEVYLALLP---ETILENAFLNDLKRAVKAAVVSRRPNGLE
        G  +KR +LW++LK R +  FR  + G  CARFLAIKQE +V EY++R E   A LP   E +L  AF N L   ++  V + R  GLE
Subjt:  GSQQKRIKLWEDLKARKFKHFRPSQEGILCARFLAIKQEDTVAEYIKRLEVYLALLP---ETILENAFLNDLKRAVKAAVVSRRPNGLE

A0A5D3BL18 Ty3/gypsy retrotransposon protein1.9e-0843.02Show/hide
Query:  QKRIKLWEDLKARKFKHFRPSQEGILCARFLAIKQEDTVAEYIKRLEVYLALLP---ETILENAFLNDLKRAVKAAVVSRRPNGLE
        +KR + W++LK R + HFR  + G  C RFLAIKQE +V EY++R E   A LP   E +L   F N L   ++  V + R  GLE
Subjt:  QKRIKLWEDLKARKFKHFRPSQEGILCARFLAIKQEDTVAEYIKRLEVYLALLP---ETILENAFLNDLKRAVKAAVVSRRPNGLE

A0A6J1D5H9 uncharacterized protein LOC1110174761.4e-0847.5Show/hide
Query:  WEDLKARKFKHFRPSQEGILCARFLAIKQEDTVAEYIKRLEVYLALLP---ETILENAFLNDLKRAVKAAVVSRRPNGLE
        W++L+ R  K FR S+EG  CAR LAIKQE +VAEY +  E   A LP   + +LE  FLN L   V++ V++  P GLE
Subjt:  WEDLKARKFKHFRPSQEGILCARFLAIKQEDTVAEYIKRLEVYLALLP---ETILENAFLNDLKRAVKAAVVSRRPNGLE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGTCGGGATCCAAAATTACACTTTGGCTTCTGTATATCCACGACTCTACAGGCTTTCCATACGTACTCACATCAATAGTAAATATCTGGAATGTTGATAACGGAGT
TTGGGACTCAGGCTTCCGGAGAAATCTTAACGATTCTGAAATTACTGAATGGGCCTCGCTATCTCACTACCTTATATCTTTTGCTTTAACAAATAATAATGACTCCTGGA
TTTGGAATTATGGAACCTCCAGAATCTTTTCAGTCCATTCTATGATGAAGTTTTTAACATTGCAGAGGGATTATTCTATTGATCCTTTATATTCGATTATATGGAAAGCC
TTTGGATGGAATATAGTTCTGCCTGATGTAAAGGTTTTGGACCAAGATTTTGATGAAAAGCCATACTTATTCCAGATTCCTGTTAGACCTTCAATTATTCATACATATAA
GAGGAAGGGCAAAGAGATACCTAAACAATTGGTATTAGAGCACCTCAACATGGGGAGAACACGAAAAATGGCACAGAAACAGATGGAAGAAAGAGTTGATGCGATGGAGA
AGGACATCAGAAAGGCAGTGTCGACTGTAACTGAAGGGTCTACTCAGAAGGGCCAATCGTCAGACAATGTGGAAGACTTGAACGAGAAAGAGGAAGGCGAAGGGGAACTA
ATCCCTTCGAAGATGGGGGGATCGCAACAAAAGAGGATCAAGTTATGGGAAGATTTGAAAGCTAGGAAGTTCAAGCATTTTCGTCCTTCACAAGAAGGCATACTGTGCGC
ACGTTTCTTAGCGATCAAGCAAGAAGATACAGTGGCAGAGTATATTAAGAGGCTCGAAGTATACTTGGCTCTGTTGCCGGAGACGATTTTGGAAAACGCCTTCTTAAACG
ACCTCAAACGTGCTGTTAAGGCAGCCGTTGTTAGCCGAAGACCTAATGGGCTTGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGATGTCGGGATCCAAAATTACACTTTGGCTTCTGTATATCCACGACTCTACAGGCTTTCCATACGTACTCACATCAATAGTAAATATCTGGAATGTTGATAACGGAGT
TTGGGACTCAGGCTTCCGGAGAAATCTTAACGATTCTGAAATTACTGAATGGGCCTCGCTATCTCACTACCTTATATCTTTTGCTTTAACAAATAATAATGACTCCTGGA
TTTGGAATTATGGAACCTCCAGAATCTTTTCAGTCCATTCTATGATGAAGTTTTTAACATTGCAGAGGGATTATTCTATTGATCCTTTATATTCGATTATATGGAAAGCC
TTTGGATGGAATATAGTTCTGCCTGATGTAAAGGTTTTGGACCAAGATTTTGATGAAAAGCCATACTTATTCCAGATTCCTGTTAGACCTTCAATTATTCATACATATAA
GAGGAAGGGCAAAGAGATACCTAAACAATTGGTATTAGAGCACCTCAACATGGGGAGAACACGAAAAATGGCACAGAAACAGATGGAAGAAAGAGTTGATGCGATGGAGA
AGGACATCAGAAAGGCAGTGTCGACTGTAACTGAAGGGTCTACTCAGAAGGGCCAATCGTCAGACAATGTGGAAGACTTGAACGAGAAAGAGGAAGGCGAAGGGGAACTA
ATCCCTTCGAAGATGGGGGGATCGCAACAAAAGAGGATCAAGTTATGGGAAGATTTGAAAGCTAGGAAGTTCAAGCATTTTCGTCCTTCACAAGAAGGCATACTGTGCGC
ACGTTTCTTAGCGATCAAGCAAGAAGATACAGTGGCAGAGTATATTAAGAGGCTCGAAGTATACTTGGCTCTGTTGCCGGAGACGATTTTGGAAAACGCCTTCTTAAACG
ACCTCAAACGTGCTGTTAAGGCAGCCGTTGTTAGCCGAAGACCTAATGGGCTTGAATAG
Protein sequenceShow/hide protein sequence
MMSGSKITLWLLYIHDSTGFPYVLTSIVNIWNVDNGVWDSGFRRNLNDSEITEWASLSHYLISFALTNNNDSWIWNYGTSRIFSVHSMMKFLTLQRDYSIDPLYSIIWKA
FGWNIVLPDVKVLDQDFDEKPYLFQIPVRPSIIHTYKRKGKEIPKQLVLEHLNMGRTRKMAQKQMEERVDAMEKDIRKAVSTVTEGSTQKGQSSDNVEDLNEKEEGEGEL
IPSKMGGSQQKRIKLWEDLKARKFKHFRPSQEGILCARFLAIKQEDTVAEYIKRLEVYLALLPETILENAFLNDLKRAVKAAVVSRRPNGLE