; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035659 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035659
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr3:26491339..26498043
RNA-Seq ExpressionLag0035659
SyntenyLag0035659
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]2.2e-12856.76Show/hide
Query:  PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQT-----------------------------QVLQDMITNCIR
        P  ++++VM+T+  T E+RM E+++ +N LMKA+EE+D +I  LK+ IE++  AESS T                             Q LQ+MI N I+
Subjt:  PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQT-----------------------------QVLQDMITNCIR

Query:  AQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFD---------------------
         QYGGP Q   LYSKPYTKRIDN+R P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFD                     
Subjt:  AQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFD---------------------

Query:  ----CTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNM
             TRR VSM ELT TKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R N DLL+P +
Subjt:  ----CTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNM

Query:  RKEGRNDEET-------IEESM-------------------------------ERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYC
        RKE +  + T        +E+M                               ERQ+K+YPFPD+D+PDML+QLLE QLI+LP+CKRP EM +V+DP YC
Subjt:  RKEGRNDEET-------IEESM-------------------------------ERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYC

Query:  KYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATI
        KYHRVI HPVE+CFVLK+LILKLA+++KIEL+LD+VAQ+N A +
Subjt:  KYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATI

XP_031735972.1 uncharacterized protein LOC116401693 [Cucumis sativus]1.3e-12851.92Show/hide
Query:  SFKTSSMVAVEIKSYMGSTAHCCFNELRLQEDKASIIASEETTLQGAYTNDK--FLVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQ
        S K +S  +    +Y G         +   +D+ S IA  ++ L+    + K   ++K NPL+            E   DV++VMM +   +E  M EM+
Subjt:  SFKTSSMVAVEIKSYMGSTAHCCFNELRLQEDKASIIASEETTLQGAYTNDK--FLVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQ

Query:  EHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQT------------------------------QVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDN
          IN LMK ++E+D +I  LK Q++ +  AESSQT                              Q LQDMITN IRAQYGGP+Q S +YSKPYTKRIDN
Subjt:  EHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQT------------------------------QVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDN

Query:  LRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFD-------------------------CTRRTVSMFELTNTKQRKG
        LR P+GYQPPKFQQFDGKGNPKQH+AHFVETCENAG+RGD LV+QFVR+LKGNAF+                          TRRTVSM ELTNTKQRKG
Subjt:  LRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFD-------------------------CTRRTVSMFELTNTKQRKG

Query:  ELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRN-------DEETIEESM-
        E V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIASR  +D L+P ++K+ +         + T +ESM 
Subjt:  ELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRN-------DEETIEESM-

Query:  -------------------------------ERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILK
                                       ERQ+K+YPFPD+DI DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LIL+
Subjt:  -------------------------------ERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILK

Query:  LAMERKIELDLDEVAQSNLATI
        LA E++IELDL+EVAQ+N A +
Subjt:  LAMERKIELDLDEVAQSNLATI

XP_031739134.1 uncharacterized protein LOC116402863 [Cucumis sativus]3.7e-12851.72Show/hide
Query:  SFKTSSMVAVEIKSYMGSTAHCCFNELRLQEDKASIIASEETTLQGAYTNDK--FLVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQ
        S K +S  +    +Y G         +   +D+ S IA  ++ L+    + K   ++K NPL+            E   DV++VMM +   +E  M EM+
Subjt:  SFKTSSMVAVEIKSYMGSTAHCCFNELRLQEDKASIIASEETTLQGAYTNDK--FLVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQ

Query:  EHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQT------------------------------QVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDN
          IN LMK ++E+D +I  LK Q++ +  AESSQT                              Q LQDMIT+ IRAQYGGP+Q S +YSKPYTKRIDN
Subjt:  EHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQT------------------------------QVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDN

Query:  LRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFD-------------------------CTRRTVSMFELTNTKQRKG
        LR P+GYQPPKFQQFDGKGNPKQH+AHFVETCENAG+RGD LV+QFVR+LKGNAF+                          TRRTVSM ELTNTKQRKG
Subjt:  LRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFD-------------------------CTRRTVSMFELTNTKQRKG

Query:  ELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRN-------DEETIEESM-
        E V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIASR  +D L+P ++K+ +         + T +ESM 
Subjt:  ELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRN-------DEETIEESM-

Query:  -------------------------------ERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILK
                                       ERQ+K+YPFPD+DI DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LIL+
Subjt:  -------------------------------ERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILK

Query:  LAMERKIELDLDEVAQSNLATI
        LA E++IELDL+EVAQ+N A +
Subjt:  LAMERKIELDLDEVAQSNLATI

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus]1.7e-12851.72Show/hide
Query:  SFKTSSMVAVEIKSYMGSTAHCCFNELRLQEDKASIIASEETTLQGAYTNDK--FLVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQ
        S K +S  +    +Y G         +   +D+ S IA  ++ L+    + K   ++K NPL+            E   DV++VMM +   +E  M EM+
Subjt:  SFKTSSMVAVEIKSYMGSTAHCCFNELRLQEDKASIIASEETTLQGAYTNDK--FLVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQ

Query:  EHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQT------------------------------QVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDN
          IN LMK ++E+D +I  LK Q++ +  AESSQT                              Q LQDMIT+ IRAQYGGP+Q S +YSKPYTKRIDN
Subjt:  EHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQT------------------------------QVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDN

Query:  LRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFD-------------------------CTRRTVSMFELTNTKQRKG
        LR P+GYQPPKFQQFDGKGNPKQH+AHFVETCENAG+RGD LV+QFVR+LKGNAF+                          TRRTVSM ELTNTKQRKG
Subjt:  LRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFD-------------------------CTRRTVSMFELTNTKQRKG

Query:  ELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRN-------DEETIEESM-
        E V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIASR  +D L+P ++K+ +         + T++ESM 
Subjt:  ELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRN-------DEETIEESM-

Query:  -------------------------------ERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILK
                                       ERQ+K+YPFPD+DI DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LIL+
Subjt:  -------------------------------ERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILK

Query:  LAMERKIELDLDEVAQSNLATI
        LA E++IELDL+EVAQ+N A +
Subjt:  LAMERKIELDLDEVAQSNLATI

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus]1.3e-12851.92Show/hide
Query:  SFKTSSMVAVEIKSYMGSTAHCCFNELRLQEDKASIIASEETTLQGAYTNDK--FLVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQ
        S K +S  +    +Y G         +   +D+ S IA  ++ L+    + K   ++K NPL+            E   DV++VMM +   +E  M EM+
Subjt:  SFKTSSMVAVEIKSYMGSTAHCCFNELRLQEDKASIIASEETTLQGAYTNDK--FLVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQ

Query:  EHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQT------------------------------QVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDN
          IN LMK ++E+D +I  LK Q++ +  AESSQT                              Q LQDMITN IRAQYGGP+Q S +YSKPYTKRIDN
Subjt:  EHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQT------------------------------QVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDN

Query:  LRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFD-------------------------CTRRTVSMFELTNTKQRKG
        LR P+GYQPPKFQQFDGKGNPKQH+AHFVETCENAG+RGD LV+QFVR+LKGNAF+                          TRRTVSM ELTNTKQRKG
Subjt:  LRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFD-------------------------CTRRTVSMFELTNTKQRKG

Query:  ELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRN-------DEETIEESM-
        E V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIASR  +D L+P ++K+ +         + T +ESM 
Subjt:  ELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRN-------DEETIEESM-

Query:  -------------------------------ERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILK
                                       ERQ+K+YPFPD+DI DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LIL+
Subjt:  -------------------------------ERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILK

Query:  LAMERKIELDLDEVAQSNLATI
        LA E++IELDL+EVAQ+N A +
Subjt:  LAMERKIELDLDEVAQSNLATI

TrEMBL top hitse value%identityAlignment
A0A5A7TZU9 Ribonuclease H1.0e-12856.76Show/hide
Query:  PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQT-----------------------------QVLQDMITNCIR
        P  ++++VM+T+  T E+RM E+++ +N LMKA+EE+D +I  LK+ IE++  AESS T                             Q LQ+MI N I+
Subjt:  PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQT-----------------------------QVLQDMITNCIR

Query:  AQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFD---------------------
         QYGGP Q   LYSKPYTKRIDN+R P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFD                     
Subjt:  AQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFD---------------------

Query:  ----CTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNM
             TRR VSM ELT TKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R N DLL+P +
Subjt:  ----CTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNM

Query:  RKEGRNDEET-------IEESM-------------------------------ERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYC
        RKE +  + T        +E+M                               ERQ+K+YPFPD+D+PDML+QLLE QLI+LP+CKRP EM +V+DP YC
Subjt:  RKEGRNDEET-------IEESM-------------------------------ERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYC

Query:  KYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATI
        KYHRVI HPVE+CFVLK+LILKLA+++KIEL+LD+VAQ+N A +
Subjt:  KYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATI

A0A5A7URH1 Ty3-gypsy retrotransposon protein5.7e-12745.15Show/hide
Query:  KASIIASEETTLQGAYTNDKFLVKYNPLFE--PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESS-----------
        K  I+  E   +   Y++ K      P  E  P  ++++VM+T   T E RM E+++ +N LMK +EE+D +I  LK+ IE++  AESS           
Subjt:  KASIIASEETTLQGAYTNDKFLVKYNPLFE--PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESS-----------

Query:  ------------------QTQVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQ
                            Q LQ+MI + I+ QYGGP Q   LY KPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF++TCE AGTRGDLLVKQ
Subjt:  ------------------QTQVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQ

Query:  FVRTLKGNAFD-------------------------CTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGI
        FVRTLKGNA D                          TR  VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GI
Subjt:  FVRTLKGNAFD-------------------------CTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGI

Query:  KPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESM---------------------------------ERQKKIYPFPDAD
        KPRTFEELATRAHDMELSIA+R  +D L+P  R +    ++T       I+ESM                                 ERQ+K+YPFPD+D
Subjt:  KPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESM---------------------------------ERQKKIYPFPDAD

Query:  IPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--
        + DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA E+KIELD+DEVAQ+N A I+  S   + KD   LQ +R   
Subjt:  IPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--

Query:  ------RSKKFSQPQQLVML-------------NKSFSKTFHKKKKENLATS-----------------YCINVEEVDN------------SRRVN----
              RS     P++++ +             N   SK  +   + N  TS                   +  +E +N            S+R++    
Subjt:  ------RSKKFSQPQQLVML-------------NKSFSKTFHKKKKENLATS-----------------YCINVEEVDN------------SRRVN----

Query:  -KGLPSLIASSLQLLVLR---------------------DKKLHISIPSRMKRKFSVLINTEGSLKVKSNLIILTNPANE
         K  PS   SS   L +                      D K+H  +PSRMKRK  V INTEGSL VK   II TNP NE
Subjt:  -KGLPSLIASSLQLLVLR---------------------DKKLHISIPSRMKRKFSVLINTEGSLKVKSNLIILTNPANE

A0A5A7UXF0 Ty3-gypsy retrotransposon protein2.6e-12754.15Show/hide
Query:  KASIIASEETTLQGAYTNDKFLVKYNPLFEPDS--DVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESS-----------
        K  I+  E   +   Y++ K      P  E  S  ++++VM+T   T E RM E+++ +N LMK +EE+D +I  LK+ IE++  AESS           
Subjt:  KASIIASEETTLQGAYTNDKFLVKYNPLFEPDS--DVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESS-----------

Query:  ------------------QTQVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQ
                            Q LQ+MI + I+ QYGGP Q   LYSKPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQ
Subjt:  ------------------QTQVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQ

Query:  FVRTLKGNAFD-------------------------CTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGI
        FVRTLKGNAFD                          TRR VSM ELTNT+Q+KGELV+NYINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GI
Subjt:  FVRTLKGNAFD-------------------------CTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGI

Query:  KPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESM---------------------------------ERQKKIYPFPDAD
        KPRTFEELATRAHDMELSIA+R  +D L+P  R +    ++T       I+ESM                                 ERQ+K+YPFPD+D
Subjt:  KPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESM---------------------------------ERQKKIYPFPDAD

Query:  IPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKS
        + DMLEQLLE QLI+LP+CKRPE+ EKVDDP YCKYHRVI HPVE+CFVLK+LILKLA E+KIELD+DEVAQ+N   I+  S
Subjt:  IPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKS

A0A5A7VCV9 Retrotransposon gag protein8.8e-12859.13Show/hide
Query:  KYNPLFE------PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVLQ-------DMITNCIRAQYGGPTQD
        K+N L E      P  ++++VM+T+    E+RM E+++ +N LMK +EE+D +I  LK+ IE++  AESS    ++       +MI N I+ QYGG  Q 
Subjt:  KYNPLFE------PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVLQ-------DMITNCIRAQYGGPTQD

Query:  SLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDCTRRTVSMFELTNTKQRKGELVVNYINRWR
          LYSKPYTKRIDNLR P GYQP KFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFDCTRR VSM ELTNT+Q+KGELV++Y+NRWR
Subjt:  SLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDCTRRTVSMFELTNTKQRKGELVVNYINRWR

Query:  AMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESM------------
        A+SLDCK+RLTELS+VEMC QGMHW LLYIL+GIKP TFEELATRAHDMELSI +R  +D L+   R +    ++T       I+ESM            
Subjt:  AMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESM------------

Query:  ---------------------ERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELD
                             ERQ+K+YPFPD+D+ D LEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA E+KIELD
Subjt:  ---------------------ERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELD

Query:  LDEVAQSNLATIKEKS
        +DEVAQ+N   I+  S
Subjt:  LDEVAQSNLATIKEKS

A0A5D3D4X3 Ty3-gypsy retrotransposon protein3.7e-12655.08Show/hide
Query:  KYNPLFE------PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESS-----------------------------Q
        K+N L E      P  ++++VM+T+    E+RM E+++ +N LMK +EE+D +I  LK+ IE++  AESS                              
Subjt:  KYNPLFE------PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESS-----------------------------Q

Query:  TQVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFD--------
         Q LQ+MI + I+ QYGGP Q   LYSKPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFD        
Subjt:  TQVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFD--------

Query:  -----------------CTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSI
                          TRR VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI
Subjt:  -----------------CTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSI

Query:  ASRENQDLLLPNMRKEGRNDEET-------IEESM---------------------------------ERQKKIYPFPDADIPDMLEQLLEAQLIKLPKC
        A+R  +D L+P  R +    ++T       I+ESM                                 ERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+C
Subjt:  ASRENQDLLLPNMRKEGRNDEET-------IEESM---------------------------------ERQKKIYPFPDADIPDMLEQLLEAQLIKLPKC

Query:  KRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKS
        KRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA E+KIELD+DEVAQ+N   I+  S
Subjt:  KRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATTCAAGACTTCTTCGATGGTCGCTGTCGAGATCAAGTCTTACATGGGTTCAACTGCCCATTGTTGCTTCAATGAACTGAGGTTGCAAGAAGATAAAGCTTCTAT
CATTGCAAGCGAAGAAACAACCTTGCAGGGGGCATATACCAATGACAAGTTTCTTGTTAAGTATAACCCTCTGTTTGAACCTGATTCTGACGTAGTGACTGTCATGATGA
CTGAGACAAGAACTATGGAAGAAAGAATGACTGAGATGCAAGAACACATCAACAACTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATTGAGCAACTAAAGAGCCAA
ATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAGTCCTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGATTCCCTCTTGTA
CTCCAAACCTTATACTAAGAGGATTGATAATTTGAGAACGCCAATCGGGTATCAGCCACCGAAATTTCAACAATTTGATGGAAAGGGCAATCCTAAACAACATATTGCCC
ACTTCGTTGAGACATGCGAGAACGCTGGTACTCGAGGGGACCTGCTAGTCAAACAGTTCGTTCGAACACTTAAAGGAAATGCTTTTGACTGCACTAGACGAACCGTTAGC
ATGTTCGAGCTCACAAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAAATCGCTGGAGAGCTATGAGTCTAGATTGCAAAGATCGCCTCACTGAACTCTC
TTCCGTTGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGTGCCCACGATATGGAGC
TAAGTATTGCTAGTCGAGAAAACCAAGACCTTCTCCTCCCTAACATGAGAAAAGAAGGAAGGAACGATGAAGAGACTATAGAAGAATCTATGGAAAGACAGAAGAAAATC
TATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAAAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGA
TCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTCTAAAGCTGGCTATGGAAAGAAAAATTGAGCTCGACC
TTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGAAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGAGAAAGAGAAGTAAAAAGTTC
TCTCAACCTCAACAACTGGTGATGTTGAATAAATCATTCTCCAAAACTTTCCACAAAAAGAAAAAAGAGAACCTTGCGACTTCTTACTGCATCAACGTAGAAGAAGTTGA
CAATTCAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGTGACAAGAAGCTTCATATTAGCATCCCGTCACGTATGAAGA
GGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCCTTGAAGGTGAAGTCAAATCTCATTATCTTGACCAATCCTGCAAATGAAGATCTGATCAAGACCATGACAAAGATA
AGAGCTTTTAAATGTAAAAGCTCCTTATCGCAAGAGCCTAAACTGCATGATGCTCTAGCCCACACGAGCTTAAAAGTTCCTTCTCCAAGTTCGAAGGTTCTCACGTTGCT
TCGCTGCAGTTCTTTCCTCCAAGTTCAAAGGGGTTCTCATGCAACACCTTCCTCCAAGTTCGAAGGATCTCACGCATTTCGTTGCAGTTCCTTCTCTCAGAGTTCAAAGT
TCCTTCTCTCCAAGTTCGAAGGTTCTCACGTTGCTTTGTTGCAGTTTCCTTCCTCCAAGTCCGAAGGCTCCCCCAAGTCGAGTCGAAGACTCACACGTTGCTTCGCTGTA
GTTTCCTTCCTCCAAGTCCGAAGGCTCTCCCAAGTCAAGTCGAAGGCTCACACGTTGCTTCGCTGTAGTTTCCTTCCTCCAAGTTTGAAGGCAAATCTGGCGACTACTCC
TGCAGGAAACTACAGTCATCAAAGTGACTGGTCTGGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATC
AAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAG
GTGGTGAAATCACTGCAAGTGAAAGTTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAATTCGGAGATTCAGATCAGAGAACTCAGAGTCGAGA
GAATTCTGTCAGAGTCCAGAGTCATCAGAAGTCAGAGAGTCTAGAGAATTCAGAAGATCCTAGATTCAGAATTCAACCAACTCAAGACTCAGAAGATCATCAAGCCAATC
GATCGATCAAGAAGATCAACAAGTCAGCAGACCGATCATCCAAGAAGATCAACAAGTCAGCAGACCGATCATCAAACCGATCATCCGAGAAGATCAACAAGTCAGTAGAC
CGATCATCCAAAAAGATCAACAAGTCACAACAGGCCGATCCAAGAGATCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCATTCAAGACTTCTTCGATGGTCGCTGTCGAGATCAAGTCTTACATGGGTTCAACTGCCCATTGTTGCTTCAATGAACTGAGGTTGCAAGAAGATAAAGCTTCTAT
CATTGCAAGCGAAGAAACAACCTTGCAGGGGGCATATACCAATGACAAGTTTCTTGTTAAGTATAACCCTCTGTTTGAACCTGATTCTGACGTAGTGACTGTCATGATGA
CTGAGACAAGAACTATGGAAGAAAGAATGACTGAGATGCAAGAACACATCAACAACTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATTGAGCAACTAAAGAGCCAA
ATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAGTCCTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGATTCCCTCTTGTA
CTCCAAACCTTATACTAAGAGGATTGATAATTTGAGAACGCCAATCGGGTATCAGCCACCGAAATTTCAACAATTTGATGGAAAGGGCAATCCTAAACAACATATTGCCC
ACTTCGTTGAGACATGCGAGAACGCTGGTACTCGAGGGGACCTGCTAGTCAAACAGTTCGTTCGAACACTTAAAGGAAATGCTTTTGACTGCACTAGACGAACCGTTAGC
ATGTTCGAGCTCACAAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAAATCGCTGGAGAGCTATGAGTCTAGATTGCAAAGATCGCCTCACTGAACTCTC
TTCCGTTGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGTGCCCACGATATGGAGC
TAAGTATTGCTAGTCGAGAAAACCAAGACCTTCTCCTCCCTAACATGAGAAAAGAAGGAAGGAACGATGAAGAGACTATAGAAGAATCTATGGAAAGACAGAAGAAAATC
TATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAAAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGA
TCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTCTAAAGCTGGCTATGGAAAGAAAAATTGAGCTCGACC
TTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGAAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGAGAAAGAGAAGTAAAAAGTTC
TCTCAACCTCAACAACTGGTGATGTTGAATAAATCATTCTCCAAAACTTTCCACAAAAAGAAAAAAGAGAACCTTGCGACTTCTTACTGCATCAACGTAGAAGAAGTTGA
CAATTCAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGTGACAAGAAGCTTCATATTAGCATCCCGTCACGTATGAAGA
GGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCCTTGAAGGTGAAGTCAAATCTCATTATCTTGACCAATCCTGCAAATGAAGATCTGATCAAGACCATGACAAAGATA
AGAGCTTTTAAATGTAAAAGCTCCTTATCGCAAGAGCCTAAACTGCATGATGCTCTAGCCCACACGAGCTTAAAAGTTCCTTCTCCAAGTTCGAAGGTTCTCACGTTGCT
TCGCTGCAGTTCTTTCCTCCAAGTTCAAAGGGGTTCTCATGCAACACCTTCCTCCAAGTTCGAAGGATCTCACGCATTTCGTTGCAGTTCCTTCTCTCAGAGTTCAAAGT
TCCTTCTCTCCAAGTTCGAAGGTTCTCACGTTGCTTTGTTGCAGTTTCCTTCCTCCAAGTCCGAAGGCTCCCCCAAGTCGAGTCGAAGACTCACACGTTGCTTCGCTGTA
GTTTCCTTCCTCCAAGTCCGAAGGCTCTCCCAAGTCAAGTCGAAGGCTCACACGTTGCTTCGCTGTAGTTTCCTTCCTCCAAGTTTGAAGGCAAATCTGGCGACTACTCC
TGCAGGAAACTACAGTCATCAAAGTGACTGGTCTGGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATC
AAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAG
GTGGTGAAATCACTGCAAGTGAAAGTTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAATTCGGAGATTCAGATCAGAGAACTCAGAGTCGAGA
GAATTCTGTCAGAGTCCAGAGTCATCAGAAGTCAGAGAGTCTAGAGAATTCAGAAGATCCTAGATTCAGAATTCAACCAACTCAAGACTCAGAAGATCATCAAGCCAATC
GATCGATCAAGAAGATCAACAAGTCAGCAGACCGATCATCCAAGAAGATCAACAAGTCAGCAGACCGATCATCAAACCGATCATCCGAGAAGATCAACAAGTCAGTAGAC
CGATCATCCAAAAAGATCAACAAGTCACAACAGGCCGATCCAAGAGATCATTAA
Protein sequenceShow/hide protein sequence
MSFKTSSMVAVEIKSYMGSTAHCCFNELRLQEDKASIIASEETTLQGAYTNDKFLVKYNPLFEPDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQ
IENQHIAESSQTQVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDCTRRTVS
MFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEETIEESMERQKKI
YPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRKRSKKF
SQPQQLVMLNKSFSKTFHKKKKENLATSYCINVEEVDNSRRVNKGLPSLIASSLQLLVLRDKKLHISIPSRMKRKFSVLINTEGSLKVKSNLIILTNPANEDLIKTMTKI
RAFKCKSSLSQEPKLHDALAHTSLKVPSPSSKVLTLLRCSSFLQVQRGSHATPSSKFEGSHAFRCSSFSQSSKFLLSKFEGSHVALLQFPSSKSEGSPKSSRRLTRCFAV
VSFLQVRRLSQVKSKAHTLLRCSFLPPSLKANLATTPAGNYSHQSDWSGQVVKSLQVKLMTTVVTTPAGNYSHQSDWSRQVVKSLQVKLMTTVVTTPAGNYSHQSDWSRQ
VVKSLQVKVDDDRGDHPCRKLQSSKFGDSDQRTQSRENSVRVQSHQKSESLENSEDPRFRIQPTQDSEDHQANRSIKKINKSADRSSKKINKSADRSSNRSSEKINKSVD
RSSKKINKSQQADPRDH