; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038470 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038470
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr2:18263361..18267988
RNA-Seq ExpressionLag0038470
SyntenyLag0038470
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]5.4e-16654.53Show/hide
Query:  PDSDVVTVMMAETRTMEERMTEMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESS-----------------------------QTQVLQDMITNCIR
        P  ++++VM+    T E RM E+++ ++ LMK +EE+D +I  LK+ IE++  AESS                               Q LQ+MI + I+
Subjt:  PDSDVVTVMMAETRTMEERMTEMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESS-----------------------------QTQVLQDMITNCIR

Query:  AQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFL
         QYGGP Q   LY KPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF++TCE AGTRGDLLVKQFVRTLKGNA DWY DLEPES+D+WE+LER+FL
Subjt:  AQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDCLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIARRENQDLLLPNM
        NRFYSTR  VSM ELTNT+Q+KGELV++YINRWRA+SLDCKD LTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA R  +D L+P  
Subjt:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDCLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIARRENQDLLLPNM

Query:  RKEGRNDKET-------IEESMVVNTTLPKSSSKEKRQTKGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPK
        R +     +T       I+ESMVV+ T  KS SK K       H        TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP 
Subjt:  RKEGRNDKET-------IEESMVVNTTLPKSSSKEKRQTKGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPK

Query:  YCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRM--------RSKKFSQPQQLVMLNKSFSKTFHKKT
        YCKYHRVI HPVE+CFVLK+LILKLA E+KIELD+DEVAQ+N A I+  S   + KD   LQ +R         RS     P++++ +    + +   + 
Subjt:  YCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRM--------RSKKFSQPQQLVMLNKSFSKTFHKKT

Query:  KENLATSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLK---DLIKTMTKI
          N  +S     +EV+NS +  QRTSVFDRIKP TTR SVFQR+S+A  EEENQC     TR S  +RLS+ST KK R STS FDRLK   D  +   K 
Subjt:  KENLATSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLK---DLIKTMTKI

Query:  RAFKCKSSLSQEPKLHDAPSPHELKRFFVAVPSLQVRGSL
           K     + + K+H        ++ FV    +   GSL
Subjt:  RAFKCKSSLSQEPKLHDAPSPHELKRFFVAVPSLQVRGSL

TYK03695.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.8e-16148.24Show/hide
Query:  SYMGS--TAHCCFNGLTLQEDKASIVAGQETTLQ-GAYPNDKFLVKYNPLFE-------------PDSDVVTVMMAETRTMEERMTEMQEHIDTLMKAIE
        SY+G     H C   +   ED       +    Q    P    ++K NP  +             P  ++++VM+    T E RM E+++ ++ LMK +E
Subjt:  SYMGS--TAHCCFNGLTLQEDKASIVAGQETTLQ-GAYPNDKFLVKYNPLFE-------------PDSDVVTVMMAETRTMEERMTEMQEHIDTLMKAIE

Query:  EKDSQIEQLKSQIENQHIAESS-----------------------------QTQVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKF
        E+D +I  LK+ IE++  AESS                               Q LQ+MI + I+ QYGGP Q   LYSKPYTKRIDNLR P GYQPPKF
Subjt:  EKDSQIEQLKSQIENQHIAESS-----------------------------QTQVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKF

Query:  QQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRA
        QQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFD Y DLEPES+D+WE+LER+FLNRFYSTRR VSM ELTNT+Q+KGELV++YINRWRA
Subjt:  QQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRA

Query:  MSLDCKDCLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIARRENQDLLLPNMRKEGRNDKET-------IEESMVVNTTLPKSSSKE
        +SLDCKD LTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI  R  +D L+P  R +     +T       I+ESMVV+ T  KS SK 
Subjt:  MSLDCKDCLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIARRENQDLLLPNMRKEGRNDKET-------IEESMVVNTTLPKSSSKE

Query:  KRQTKGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDL
        K       H        TLKERQ+K+YPF D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDDP YCKYHRVI HPVE+CFVLK+LILKLA E+KIEL++
Subjt:  KRQTKGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDL

Query:  DEVAQSNLATI------------------------------------------------------------------KEKSKHQR------------KKD
        DEVAQ+N   I                                                                  KE   HQ+            +++
Subjt:  DEVAQSNLATI------------------------------------------------------------------KEKSKHQR------------KKD

Query:  PKKLQPKRMRSK--KFSQPQQLVMLNKSFSKTF---HKKTKENLATSYCINVEEVDNSKKS----------EQRTSVFDRIKPPTTRPSVFQRMSMAATE
         K   PK ++ K   F Q ++ + L +   ++F   H +    + T +  ++ EV+N+  S           QRTSVFDRIKP TTR SVFQR+SMA  E
Subjt:  PKKLQPKRMRSK--KFSQPQQLVMLNKSFSKTF---HKKTKENLATSYCINVEEVDNSKKS----------EQRTSVFDRIKPPTTRPSVFQRMSMAATE

Query:  EENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLK
        EENQC     TR S F+RLS+S SKK+R STS FDRLK
Subjt:  EENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLK

XP_031735972.1 uncharacterized protein LOC116401693 [Cucumis sativus]7.8e-15758.85Show/hide
Query:  SFKTSSMVVVMNKSYMGSTAHCCFNGLTLQEDKASIVAGQETTLQGAYPNDKFLVKYNPLF------------EPDSDVVTVMMAETRTMEERMTEMQEH
        S K +S     + +Y G        G+   +D+ S +A          P    ++K NPL+            E   DV++VMMA+   +E  M EM+  
Subjt:  SFKTSSMVVVMNKSYMGSTAHCCFNGLTLQEDKASIVAGQETTLQGAYPNDKFLVKYNPLF------------EPDSDVVTVMMAETRTMEERMTEMQEH

Query:  IDTLMKAIEEKDSQIEQLKSQIENQHIAESSQT------------------------------QVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR
        I+ LMK ++E+D +I  LK Q++ +  AESSQT                              Q LQDMITN IRAQYGGP+Q S +YSKPYTKRIDNLR
Subjt:  IDTLMKAIEEKDSQIEQLKSQIENQHIAESSQT------------------------------QVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR

Query:  TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGEL
         P+GYQPPKFQQFDGKGNPKQH+AHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPES++SWE+LE+EFLNRFYSTRRTVSM ELTNTKQRKGE 
Subjt:  TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGEL

Query:  VVNYINRWRAMSLDCKDCLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIARRENQDLLLPNMRKEGRN-------DKETIEESMVVN
        V++YINRWRA+SLDCKD LTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA R  +D L+P ++K+ +         K T +ESMVVN
Subjt:  VVNYINRWRAMSLDCKDCLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIARRENQDLLLPNMRKEGRN-------DKETIEESMVVN

Query:  TT-LPKSSSKEKRQTK-----GAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLA
        TT L  S  KE R  K         LTLKERQ+K+YPFPD+DI DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LIL+LA
Subjt:  TT-LPKSSSKEKRQTK-----GAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLA

Query:  MERKIELDLDEVAQSNLATI
         E++IELDL+EVAQ+N A +
Subjt:  MERKIELDLDEVAQSNLATI

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus]1.0e-15658.65Show/hide
Query:  SFKTSSMVVVMNKSYMGSTAHCCFNGLTLQEDKASIVAGQETTLQGAYPNDKFLVKYNPLF------------EPDSDVVTVMMAETRTMEERMTEMQEH
        S K +S     + +Y G        G+   +D+ S +A          P    ++K NPL+            E   DV++VMMA+   +E  M EM+  
Subjt:  SFKTSSMVVVMNKSYMGSTAHCCFNGLTLQEDKASIVAGQETTLQGAYPNDKFLVKYNPLF------------EPDSDVVTVMMAETRTMEERMTEMQEH

Query:  IDTLMKAIEEKDSQIEQLKSQIENQHIAESSQT------------------------------QVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR
        I+ LMK ++E+D +I  LK Q++ +  AESSQT                              Q LQDMIT+ IRAQYGGP+Q S +YSKPYTKRIDNLR
Subjt:  IDTLMKAIEEKDSQIEQLKSQIENQHIAESSQT------------------------------QVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR

Query:  TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGEL
         P+GYQPPKFQQFDGKGNPKQH+AHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPES++SWE+LE+EFLNRFYSTRRTVSM ELTNTKQRKGE 
Subjt:  TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGEL

Query:  VVNYINRWRAMSLDCKDCLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIARRENQDLLLPNMRKEGRN-------DKETIEESMVVN
        V++YINRWRA+SLDCKD LTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA R  +D L+P ++K+ +         K T++ESMVVN
Subjt:  VVNYINRWRAMSLDCKDCLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIARRENQDLLLPNMRKEGRN-------DKETIEESMVVN

Query:  TT-LPKSSSKEKRQTK-----GAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLA
        TT L  S  KE R  K         LTLKERQ+K+YPFPD+DI DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LIL+LA
Subjt:  TT-LPKSSSKEKRQTK-----GAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLA

Query:  MERKIELDLDEVAQSNLATI
         E++IELDL+EVAQ+N A +
Subjt:  MERKIELDLDEVAQSNLATI

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus]7.8e-15758.85Show/hide
Query:  SFKTSSMVVVMNKSYMGSTAHCCFNGLTLQEDKASIVAGQETTLQGAYPNDKFLVKYNPLF------------EPDSDVVTVMMAETRTMEERMTEMQEH
        S K +S     + +Y G        G+   +D+ S +A          P    ++K NPL+            E   DV++VMMA+   +E  M EM+  
Subjt:  SFKTSSMVVVMNKSYMGSTAHCCFNGLTLQEDKASIVAGQETTLQGAYPNDKFLVKYNPLF------------EPDSDVVTVMMAETRTMEERMTEMQEH

Query:  IDTLMKAIEEKDSQIEQLKSQIENQHIAESSQT------------------------------QVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR
        I+ LMK ++E+D +I  LK Q++ +  AESSQT                              Q LQDMITN IRAQYGGP+Q S +YSKPYTKRIDNLR
Subjt:  IDTLMKAIEEKDSQIEQLKSQIENQHIAESSQT------------------------------QVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR

Query:  TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGEL
         P+GYQPPKFQQFDGKGNPKQH+AHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPES++SWE+LE+EFLNRFYSTRRTVSM ELTNTKQRKGE 
Subjt:  TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGEL

Query:  VVNYINRWRAMSLDCKDCLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIARRENQDLLLPNMRKEGRN-------DKETIEESMVVN
        V++YINRWRA+SLDCKD LTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA R  +D L+P ++K+ +         K T +ESMVVN
Subjt:  VVNYINRWRAMSLDCKDCLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIARRENQDLLLPNMRKEGRN-------DKETIEESMVVN

Query:  TT-LPKSSSKEKRQTK-----GAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLA
        TT L  S  KE R  K         LTLKERQ+K+YPFPD+DI DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LIL+LA
Subjt:  TT-LPKSSSKEKRQTK-----GAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLA

Query:  MERKIELDLDEVAQSNLATI
         E++IELDL+EVAQ+N A +
Subjt:  MERKIELDLDEVAQSNLATI

TrEMBL top hitse value%identityAlignment
A0A5A7TZU9 Ribonuclease H2.7e-15561.57Show/hide
Query:  PNDKFLVKYNPLFE-------------PDSDVVTVMMAETRTMEERMTEMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQT--------------
        P    ++K NP  +             P  ++++VM+ +  T E+RM E+++ ++ LMKA+EE+D +I  LK+ IE++  AESS T              
Subjt:  PNDKFLVKYNPLFE-------------PDSDVVTVMMAETRTMEERMTEMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQT--------------

Query:  ---------------QVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTL
                       Q LQ+MI N I+ QYGGP Q   LYSKPYTKRIDN+R P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTL
Subjt:  ---------------QVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTL

Query:  KGNAFDWYTDLEPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDCLTELSSVEMCIQGMHWELLYILKGIKPRTF
        KGNAFDWYTDLEPES+DSWE+LER+FLNRFYSTRR VSM ELT TKQRKGE V++YINRWRA+SLDCKD LTELS+VEMC QGMHW LLYIL+GIKPRTF
Subjt:  KGNAFDWYTDLEPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDCLTELSSVEMCIQGMHWELLYILKGIKPRTF

Query:  EELATRAHDMELSIARRENQDLLLPNMRKEGRNDKET-------IEESMVVNTT----LPKSSSKEKRQTKG-AHHLTLKERQKKIYPFPDADIPDMLEQ
        EELATRAHDMELSIA R N DLL+P +RKE +  K T        +E+MVV+TT    + K    EKRQ +G     TLKERQ+K+YPFPD+D+PDML+Q
Subjt:  EELATRAHDMELSIARRENQDLLLPNMRKEGRNDKET-------IEESMVVNTT----LPKSSSKEKRQTKG-AHHLTLKERQKKIYPFPDADIPDMLEQ

Query:  LLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATI
        LLE QLI+LP+CKRP EM +V+DP YCKYHRVI HPVE+CFVLK+LILKLA+++KIEL+LD+VAQ+N A +
Subjt:  LLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATI

A0A5A7TZU9 Ribonuclease H1.8e-0231.25Show/hide
Query:  KRMRSKKFSQPQQLVMLNKSFSKTFH--KKTKENLATSYCINVEEVDNSKKSE----QRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSA
        K+++ + +S P     +    S+      K K  +A +  I VEE  +S++ +    QR+SVFDRI     RPSVFQR+S +  ++ NQ S  +STR SA
Subjt:  KRMRSKKFSQPQQLVMLNKSFSKTFH--KKTKENLATSYCINVEEVDNSKKSE----QRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSA

Query:  FQRLSVSTSKKSRSSTSVFDRLKDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKRFF
        FQRL+ S  K    S +   R     +    +   + K+S+S   K        E++  F
Subjt:  FQRLSVSTSKKSRSSTSVFDRLKDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKRFF

A0A5A7TZU9 Ribonuclease H2.6e-15063.31Show/hide
Query:  DVVTVMMAETRTMEERMTEMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESS-----------------------------QTQVLQDMITNCIRAQY
        ++++VM+    T E RM E+++ ++ LMK +EE+D +I  LK+ IE++  AESS                               Q LQ+MI + I+ QY
Subjt:  DVVTVMMAETRTMEERMTEMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESS-----------------------------QTQVLQDMITNCIRAQY

Query:  GGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFLNRF
        GGP Q   LYSKPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFDWY DLEPES+D+WE+LER+FLNRF
Subjt:  GGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFLNRF

Query:  YSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDCLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIARRENQDLLLPNMRKE
        YSTRR VSM ELTNT+Q+KGELV+NYINRWRA+SLDCKD LTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA R  +D L+P  R +
Subjt:  YSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDCLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIARRENQDLLLPNMRKE

Query:  GRNDKET-------IEESMVVNTTLPKSSSKEKRQTKGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCK
             +T       I+ESMVV+ T  KS SK K       H        TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+ EKVDDP YCK
Subjt:  GRNDKET-------IEESMVVNTTLPKSSSKEKRQTKGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCK

Query:  YHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKS
        YHRVI HPVE+CFVLK+LILKLA E+KIELD+DEVAQ+N   I+  S
Subjt:  YHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKS

A0A5A7URH1 Ty3-gypsy retrotransposon protein2.6e-16654.53Show/hide
Query:  PDSDVVTVMMAETRTMEERMTEMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESS-----------------------------QTQVLQDMITNCIR
        P  ++++VM+    T E RM E+++ ++ LMK +EE+D +I  LK+ IE++  AESS                               Q LQ+MI + I+
Subjt:  PDSDVVTVMMAETRTMEERMTEMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESS-----------------------------QTQVLQDMITNCIR

Query:  AQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFL
         QYGGP Q   LY KPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF++TCE AGTRGDLLVKQFVRTLKGNA DWY DLEPES+D+WE+LER+FL
Subjt:  AQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDCLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIARRENQDLLLPNM
        NRFYSTR  VSM ELTNT+Q+KGELV++YINRWRA+SLDCKD LTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA R  +D L+P  
Subjt:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDCLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIARRENQDLLLPNM

Query:  RKEGRNDKET-------IEESMVVNTTLPKSSSKEKRQTKGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPK
        R +     +T       I+ESMVV+ T  KS SK K       H        TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP 
Subjt:  RKEGRNDKET-------IEESMVVNTTLPKSSSKEKRQTKGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPK

Query:  YCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRM--------RSKKFSQPQQLVMLNKSFSKTFHKKT
        YCKYHRVI HPVE+CFVLK+LILKLA E+KIELD+DEVAQ+N A I+  S   + KD   LQ +R         RS     P++++ +    + +   + 
Subjt:  YCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRM--------RSKKFSQPQQLVMLNKSFSKTFHKKT

Query:  KENLATSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLK---DLIKTMTKI
          N  +S     +EV+NS +  QRTSVFDRIKP TTR SVFQR+S+A  EEENQC     TR S  +RLS+ST KK R STS FDRLK   D  +   K 
Subjt:  KENLATSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLK---DLIKTMTKI

Query:  RAFKCKSSLSQEPKLHDAPSPHELKRFFVAVPSLQVRGSL
           K     + + K+H        ++ FV    +   GSL
Subjt:  RAFKCKSSLSQEPKLHDAPSPHELKRFFVAVPSLQVRGSL

A0A5D3BX77 Retrotransposon gag protein8.7e-16248.24Show/hide
Query:  SYMGS--TAHCCFNGLTLQEDKASIVAGQETTLQ-GAYPNDKFLVKYNPLFE-------------PDSDVVTVMMAETRTMEERMTEMQEHIDTLMKAIE
        SY+G     H C   +   ED       +    Q    P    ++K NP  +             P  ++++VM+    T E RM E+++ ++ LMK +E
Subjt:  SYMGS--TAHCCFNGLTLQEDKASIVAGQETTLQ-GAYPNDKFLVKYNPLFE-------------PDSDVVTVMMAETRTMEERMTEMQEHIDTLMKAIE

Query:  EKDSQIEQLKSQIENQHIAESS-----------------------------QTQVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKF
        E+D +I  LK+ IE++  AESS                               Q LQ+MI + I+ QYGGP Q   LYSKPYTKRIDNLR P GYQPPKF
Subjt:  EKDSQIEQLKSQIENQHIAESS-----------------------------QTQVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKF

Query:  QQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRA
        QQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFD Y DLEPES+D+WE+LER+FLNRFYSTRR VSM ELTNT+Q+KGELV++YINRWRA
Subjt:  QQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRA

Query:  MSLDCKDCLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIARRENQDLLLPNMRKEGRNDKET-------IEESMVVNTTLPKSSSKE
        +SLDCKD LTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI  R  +D L+P  R +     +T       I+ESMVV+ T  KS SK 
Subjt:  MSLDCKDCLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIARRENQDLLLPNMRKEGRNDKET-------IEESMVVNTTLPKSSSKE

Query:  KRQTKGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDL
        K       H        TLKERQ+K+YPF D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDDP YCKYHRVI HPVE+CFVLK+LILKLA E+KIEL++
Subjt:  KRQTKGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDL

Query:  DEVAQSNLATI------------------------------------------------------------------KEKSKHQR------------KKD
        DEVAQ+N   I                                                                  KE   HQ+            +++
Subjt:  DEVAQSNLATI------------------------------------------------------------------KEKSKHQR------------KKD

Query:  PKKLQPKRMRSK--KFSQPQQLVMLNKSFSKTF---HKKTKENLATSYCINVEEVDNSKKS----------EQRTSVFDRIKPPTTRPSVFQRMSMAATE
         K   PK ++ K   F Q ++ + L +   ++F   H +    + T +  ++ EV+N+  S           QRTSVFDRIKP TTR SVFQR+SMA  E
Subjt:  PKKLQPKRMRSK--KFSQPQQLVMLNKSFSKTF---HKKTKENLATSYCINVEEVDNSKKS----------EQRTSVFDRIKPPTTRPSVFQRMSMAATE

Query:  EENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLK
        EENQC     TR S F+RLS+S SKK+R STS FDRLK
Subjt:  EENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLK

A0A5D3D4X3 Ty3-gypsy retrotransposon protein7.6e-15061.56Show/hide
Query:  KYNPLFE------PDSDVVTVMMAETRTMEERMTEMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESS-----------------------------Q
        K+N L E      P  ++++VM+ +    E+RM E+++ ++ LMK +EE+D +I  LK+ IE++  AESS                              
Subjt:  KYNPLFE------PDSDVVTVMMAETRTMEERMTEMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESS-----------------------------Q

Query:  TQVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPE
         Q LQ+MI + I+ QYGGP Q   LYSKPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFDWY DLEPE
Subjt:  TQVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPE

Query:  SLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDCLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSI
        S+D+WE+LER+FLNRFYSTRR VSM ELTNT+Q+KGELV++YINRWRA+SLDCKD LTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI
Subjt:  SLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDCLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSI

Query:  ARRENQDLLLPNMRKEGRNDKET-------IEESMVVNTTLPKSSSKEKRQTKGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKC
        A R  +D L+P  R +     +T       I+ESMVV+ T  KS SK K       H        TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+C
Subjt:  ARRENQDLLLPNMRKEGRNDKET-------IEESMVVNTTLPKSSSKEKRQTKGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKC

Query:  KRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKS
        KRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA E+KIELD+DEVAQ+N   I+  S
Subjt:  KRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTTCAAGACTTCTTCAATGGTCGTTGTCATGAACAAGTCATACATGGGTTCTACTGCCCATTGTTGCTTCAACGGACTGACGTTGCAAGAAGATAAAGCTTCTAT
CGTTGCAGGTCAAGAAACAACCTTGCAGGGGGCATATCCTAATGACAAATTTCTTGTTAAGTATAACCCTCTGTTTGAACCTGATTCTGACGTAGTGACTGTCATGATGG
CTGAGACAAGAACTATGGAAGAAAGAATGACTGAGATGCAAGAGCACATCGACACCTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATCGAGCAACTAAAGAGCCAA
ATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAGTCCTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGATTCCCTCTTGTA
CTCCAAACCTTATACTAAGAGGATTGATAACTTGAGAACGCCAATCGGGTATCAGCCACCAAAATTTCAGCAGTTTGATGGAAAAGGCAATCCTAAACAACATATTGCCC
ACTTCGTTGAGACATGCGAGAACGCTGGTACTCGAGGGGACCTACTAGTCAAGCAGTTCGTTCGAACACTTAAAGGAAATGCTTTTGACTGGTACACTGATCTAGAACCT
GAGTCATTAGACAGTTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGAAGAACCGTTAGCATGTTCGAGCTCACCAACACTAAACAACGAAAAGG
TGAACTCGTTGTTAACTATATAAACCGCTGGAGAGCCATGAGTCTAGATTGCAAAGATTGTCTCACTGAACTCTCTTCCGTCGAGATGTGCATTCAAGGCATGCACTGGG
AACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGTGCCCACGATATGGAGCTAAGTATTGCTAGACGAGAAAACCAAGACCTTCTC
CTCCCTAACATGAGAAAAGAAGGAAGGAACGATAAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCGACAAACTAA
GGGAGCGCATCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAAAGC
TTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGAC
TTAATTCTAAAGCTGGCTATGGAAAGAAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGAAAAGAGCAAACATCAAAGAAAGAAGGA
TCCTAAGAAACTTCAACCCAAGAGAATGAGAAGTAAAAAGTTCTCTCAACCTCAACAACTGGTGATGTTGAATAAATCATTCTCCAAAACTTTCCACAAAAAGACAAAAG
AGAACCTTGCGACTTCCTACTGCATCAACGTAGAAGAAGTTGACAATTCCAAGAAAAGTGAACAAAGGACTTCTGTCTTCGATCGCATCAAGCCTCCAACTACTCGTCCT
TCAGTATTCCAAAGAATGAGTATGGCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAA
GAAAAGTCGATCTTCAACATCTGTCTTTGATCGCCTCAAAGATCTGATCAAGACCATGACAAAGATAAGAGCTTTTAAATGTAAAAGCTCCTTATCGCAAGAGCCTAAAC
TGCATGATGCTCCTAGCCCACACGAGCTTAAAAGGTTCTTCGTTGCAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCCAAG
TTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCTCGAGGGTTCTCAGTTGTACAACTGCTACGTTGTTCCTCCTCCAAGTGCGAATGATCTTAT
GTGGTGCGTTGTTGCATTGTTCCCTCTACTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCCTCCGTTGCAGT
TCCTTCTTTCCAAGGTCGAAGGTTCTCACTCGCTGCGTTCCAGTTCCTTCTCCCAAATTCGAAGGTTCTCATGTGCTTCGCGCTGCAGTTCCTTCCCTCCAAGTTTGAAG
GTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTT
CCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCT
CACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTC
CTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGC
GCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCCTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCCTCCTT
CCTCCAAGTTCGAAGGGTTCAGAAATTCTACACTCTCACAAAGACAAGAGTTTAGAGTTTCAAAGCTCTCAAGCAGAACCAGAGAATTCAGAGAGACTCCACCAAGTCTG
AAGACCGAAGACTCTCTGCGATCCACAAGTCCAAGTGTCGAAGACTTCTTGAAGACCAAACACTCTTCAAGACTTCAACACTTCTTGAAGACCAAACACTCTTCAAGACT
TCAACACTTCTTGAAGACCAAACACTCCTCAAGACTTCAACACTTCTTGAAGACCCAACACTCTTCAAGACTTCAACACTCCTTGAAGATCAAAGACTCTTCAAGACATC
AACACTTCTTGAAGATTGAAGACTCCTTCAAGACTGGAAGACTTCAAGCTCCAAGAATCCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTTCAAGACTTCTTCAATGGTCGTTGTCATGAACAAGTCATACATGGGTTCTACTGCCCATTGTTGCTTCAACGGACTGACGTTGCAAGAAGATAAAGCTTCTAT
CGTTGCAGGTCAAGAAACAACCTTGCAGGGGGCATATCCTAATGACAAATTTCTTGTTAAGTATAACCCTCTGTTTGAACCTGATTCTGACGTAGTGACTGTCATGATGG
CTGAGACAAGAACTATGGAAGAAAGAATGACTGAGATGCAAGAGCACATCGACACCTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATCGAGCAACTAAAGAGCCAA
ATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAGTCCTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGATTCCCTCTTGTA
CTCCAAACCTTATACTAAGAGGATTGATAACTTGAGAACGCCAATCGGGTATCAGCCACCAAAATTTCAGCAGTTTGATGGAAAAGGCAATCCTAAACAACATATTGCCC
ACTTCGTTGAGACATGCGAGAACGCTGGTACTCGAGGGGACCTACTAGTCAAGCAGTTCGTTCGAACACTTAAAGGAAATGCTTTTGACTGGTACACTGATCTAGAACCT
GAGTCATTAGACAGTTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGAAGAACCGTTAGCATGTTCGAGCTCACCAACACTAAACAACGAAAAGG
TGAACTCGTTGTTAACTATATAAACCGCTGGAGAGCCATGAGTCTAGATTGCAAAGATTGTCTCACTGAACTCTCTTCCGTCGAGATGTGCATTCAAGGCATGCACTGGG
AACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGTGCCCACGATATGGAGCTAAGTATTGCTAGACGAGAAAACCAAGACCTTCTC
CTCCCTAACATGAGAAAAGAAGGAAGGAACGATAAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCGACAAACTAA
GGGAGCGCATCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAAAGC
TTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGAC
TTAATTCTAAAGCTGGCTATGGAAAGAAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGAAAAGAGCAAACATCAAAGAAAGAAGGA
TCCTAAGAAACTTCAACCCAAGAGAATGAGAAGTAAAAAGTTCTCTCAACCTCAACAACTGGTGATGTTGAATAAATCATTCTCCAAAACTTTCCACAAAAAGACAAAAG
AGAACCTTGCGACTTCCTACTGCATCAACGTAGAAGAAGTTGACAATTCCAAGAAAAGTGAACAAAGGACTTCTGTCTTCGATCGCATCAAGCCTCCAACTACTCGTCCT
TCAGTATTCCAAAGAATGAGTATGGCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAA
GAAAAGTCGATCTTCAACATCTGTCTTTGATCGCCTCAAAGATCTGATCAAGACCATGACAAAGATAAGAGCTTTTAAATGTAAAAGCTCCTTATCGCAAGAGCCTAAAC
TGCATGATGCTCCTAGCCCACACGAGCTTAAAAGGTTCTTCGTTGCAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCCAAG
TTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCTCGAGGGTTCTCAGTTGTACAACTGCTACGTTGTTCCTCCTCCAAGTGCGAATGATCTTAT
GTGGTGCGTTGTTGCATTGTTCCCTCTACTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCCTCCGTTGCAGT
TCCTTCTTTCCAAGGTCGAAGGTTCTCACTCGCTGCGTTCCAGTTCCTTCTCCCAAATTCGAAGGTTCTCATGTGCTTCGCGCTGCAGTTCCTTCCCTCCAAGTTTGAAG
GTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTT
CCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCT
CACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTC
CTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGC
GCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCCTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCCTCCTT
CCTCCAAGTTCGAAGGGTTCAGAAATTCTACACTCTCACAAAGACAAGAGTTTAGAGTTTCAAAGCTCTCAAGCAGAACCAGAGAATTCAGAGAGACTCCACCAAGTCTG
AAGACCGAAGACTCTCTGCGATCCACAAGTCCAAGTGTCGAAGACTTCTTGAAGACCAAACACTCTTCAAGACTTCAACACTTCTTGAAGACCAAACACTCTTCAAGACT
TCAACACTTCTTGAAGACCAAACACTCCTCAAGACTTCAACACTTCTTGAAGACCCAACACTCTTCAAGACTTCAACACTCCTTGAAGATCAAAGACTCTTCAAGACATC
AACACTTCTTGAAGATTGAAGACTCCTTCAAGACTGGAAGACTTCAAGCTCCAAGAATCCATTGA
Protein sequenceShow/hide protein sequence
MSFKTSSMVVVMNKSYMGSTAHCCFNGLTLQEDKASIVAGQETTLQGAYPNDKFLVKYNPLFEPDSDVVTVMMAETRTMEERMTEMQEHIDTLMKAIEEKDSQIEQLKSQ
IENQHIAESSQTQVLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEP
ESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDCLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIARRENQDLL
LPNMRKEGRNDKETIEESMVVNTTLPKSSSKEKRQTKGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKD
LILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRMRSKKFSQPQQLVMLNKSFSKTFHKKTKENLATSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRP
SVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKRFFVAVPSLQVRGSLHCTLLRCSFSK
FEGSSLYPAALFLLQVLEGSQLYNCYVVPPPSANDLMWCVVALFPLLSSSMVLTQLCWSFFSPSSKVLTPPLQFLLSKVEGSHSLRSSSFSQIRRFSCASRCSSFPPSLK
VLTSLRCDPSSKFEGSHALRHSSFLQVRRFSRASPQFLPPSSKVLTRIATVPSSKFEGSHALRHSSFLQVRRFSRASPQFLPPSSKVLTRFATVPSSKFEGSHAHRHSSF
LQVRRFSRASPQFLPPSSKVLTRFATVPSSKFEGSHALRHSSFLQVRRFSRASPQFLLPPSSKVLTRFATVPPSSKFEGFRNSTLSQRQEFRVSKLSSRTREFRETPPSL
KTEDSLRSTSPSVEDFLKTKHSSRLQHFLKTKHSSRLQHFLKTKHSSRLQHFLKTQHSSRLQHSLKIKDSSRHQHFLKIEDSFKTGRLQAPRIH