; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041105 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041105
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr13:12067182..12073201
RNA-Seq ExpressionLag0041105
SyntenyLag0041105
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.1e-13846.27Show/hide
Query:  KTSSMVVVMNKSYMGSTAHCCLSELTLQEDKASIAEGQETTLQGAYTNDKFIVKYNPLF------------EPDSDVVTVMMTETRTMEERMAEMQEHID
        K +S    ++ SY+G      L     ++++  + + +             +++ NPLF            E   +VV+VMM +  T E  MAEM+  I+
Subjt:  KTSSMVVVMNKSYMGSTAHCCLSELTLQEDKASIAEGQETTLQGAYTNDKFIVKYNPLF------------EPDSDVVTVMMTETRTMEERMAEMQEHID

Query:  TLMKVIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRTP
         LMKV+EE+D +I  LK Q++    +ESSQT VVK  DKGK +V+++QP Q S S+ASLS+QQLQDMI N IRAQYGG  Q S +YSKPYTKRIDNLR P
Subjt:  TLMKVIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRTP

Query:  IGW--------------------------------EELEREFL-----NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEM
        +G+                                ++L R+F+     N F  TRR VSM ELTNT QRKGE V++YINRWRA+SLDCKD+LTELS+VEM
Subjt:  IGW--------------------------------EELEREFL-----NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEM

Query:  CIQGMHWELLYILKGIKPRTFEELATRAHDMELSITSRENQDLLLPNMRKEGRNDEET-------IEESIVVNTTLPKSSSKEKRQTNGAHH-------L
        C QGMHWELLYIL+GIKPRTFEELATRAHDM+LSI +R  +D L+   R +     +T       + ES++V  T  KS SK K   +  +H        
Subjt:  CIQGMHWELLYILKGIKPRTFEELATRAHDMELSITSRENQDLLLPNMRKEGRNDEET-------IEESIVVNTTLPKSSSKEKRQTNGAHH-------L

Query:  TLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPRCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKNLILKLAMERKIELDLDEVAQSN----------
        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP CKRPE++ KVDDP YCKYHRVI H VE+CFVLK LI KLA E KIELD+DEVAQ+N          
Subjt:  TLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPRCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKNLILKLAMERKIELDLDEVAQSN----------

Query:  ----------------------LATIKEK---SKYQRKKDPKKLQPKR------------------------------------KRKEVDNSKKSEQRTS
                              L   ++K   S  Q K++P + + +                                       +E+DNS + +QRTS
Subjt:  ----------------------LATIKEK---SKYQRKKDPKKLQPKR------------------------------------KRKEVDNSKKSEQRTS

Query:  VFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSFFDRLKVTNDQPKRKMNNLEVKLFDEVNSDKKLQSSIPSRMKRK
        VFD IKP TTR SVF R+SMA  +EENQC   T  + SAF+RLS+S SKK R ST  FDRLK+TNDQ +R+M  L+ K F E N D K+ S +PSRMKRK
Subjt:  VFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSFFDRLKVTNDQPKRKMNNLEVKLFDEVNSDKKLQSSIPSRMKRK

Query:  FSVLINTEDSL
         SV INTE SL
Subjt:  FSVLINTEDSL

KAA0045440.1 retrotransposon gag protein [Cucumis melo var. makuwa]2.9e-14046.41Show/hide
Query:  PDSDVVTVMMTETRTMEERMAEMQEHIDTLMKVIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR
        P  ++++VM+T   T E RMAE+++ ++ LMKV+EE+D +I  LK+ IE+   AESS    VKN DKGK ++Q+ QP+   SIASLS+QQLQ+MIT+ I+
Subjt:  PDSDVVTVMMTETRTMEERMAEMQEHIDTLMKVIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR

Query:  AQYGGLTQDSLLYSKPYTKRIDNLRTPIG-----------------------------------WEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVV
         QYGG  Q   LYSKPYTKRIDNLR P G                                   WE+LER+FLNRFYSTRR +SM ELTNT+Q+KGELV+
Subjt:  AQYGGLTQDSLLYSKPYTKRIDNLRTPIG-----------------------------------WEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVV

Query:  NYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSITSRENQDLLLPNMRKEGRNDEET-------IEESIVVNTT
        +YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTF+ELATRAHDMEL++ ++  +D L+P +R +    ++T       I+ES+VV+ T
Subjt:  NYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSITSRENQDLLLPNMRKEGRNDEET-------IEESIVVNTT

Query:  LPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPRCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKNLILKLAM
          KS SK K     R+ +G      TLKERQ+K+YPFP++D+ DMLEQLLE QLI+L  CKR E+  KVDDP YCKYHRVI HP+E+CFVLK LILKLA 
Subjt:  LPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPRCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKNLILKLAM

Query:  ERKIELDLDEVAQSNLATIK---------------------------------------------------------------------EKSKYQRK---
        E+KIELD+DEVAQ+N   IK                                                                     ++S++ +K   
Subjt:  ERKIELDLDEVAQSNLATIK---------------------------------------------------------------------EKSKYQRK---

Query:  -----------------------KDPKKLQPKR---------------------------------------KRKEVDNSKKSEQRTSVFDRIKPPTTRP
                               KD   LQ +R                                         KEV+NS +  QRTSVFDRIKP TTR 
Subjt:  -----------------------KDPKKLQPKR---------------------------------------KRKEVDNSKKSEQRTSVFDRIKPPTTRP

Query:  SVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSFFDRLKVTNDQPKRKMNNLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTE
        SVF R+SMA  EE+NQC     TR S F+RLS+STSKK+R STS FDRLK+TNDQ +R+M +L+ K F E N D K+ S IPSRMKRK  V INTE
Subjt:  SVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSFFDRLKVTNDQPKRKMNNLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTE

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.9e-14851.98Show/hide
Query:  PDSDVVTVMMTETRTMEERMAEMQEHIDTLMKVIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR
        P  ++++VM+T   T E RMAE+++ ++ LMKV+EE+D +I  LK+ IE++  AESS    VKN DKGK ++Q+ QPQ S SIASLS+QQLQ+MI + I+
Subjt:  PDSDVVTVMMTETRTMEERMAEMQEHIDTLMKVIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR

Query:  AQYGGLTQDSLLYSKPYTKRIDNLRTPIG--------------------------------------------------------------WEELEREFL
         QYGG  Q   LY KPYTKRIDNLR P G                                                              WE+LER+FL
Subjt:  AQYGGLTQDSLLYSKPYTKRIDNLRTPIG--------------------------------------------------------------WEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSITSRENQDLLLPNM
        NRFYSTR  VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI +R  +D L+P  
Subjt:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSITSRENQDLLLPNM

Query:  RKEGRNDEET-------IEESIVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPRCKRPEEMEKVDDPK
        R +    ++T       I+ES+VV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP CKRPE+  KVDDP 
Subjt:  RKEGRNDEET-------IEESIVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPRCKRPEEMEKVDDPK

Query:  YCKYHRVIGHPVERCFVLKNLILKLAMERKIELDLDEVAQSNLATIKEKSKYQRKKDPKKLQPKR-----------------------------------
        YCKYHRVI HPVE+CFVLK LILKLA E+KIELD+DEVAQ+N A I+  S   + KD   LQ +R                                   
Subjt:  YCKYHRVIGHPVERCFVLKNLILKLAMERKIELDLDEVAQSNLATIKEKSKYQRKKDPKKLQPKR-----------------------------------

Query:  ----KRKEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSFFDRLKVTNDQPKRKMNNLEVKLF
              KEV+NS +  QRTSVFDRIKP TTR SVF R+S+A  EEENQC     TR S  +RLS+ST KK R STS FDRLK+TNDQ +R+M + + K F
Subjt:  ----KRKEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSFFDRLKVTNDQPKRKMNNLEVKLF

Query:  DEVNSDKKLQSSIPSRMKRKFSVLINTEDSL
         E N D K+ S +PSRMKRK  V INTE SL
Subjt:  DEVNSDKKLQSSIPSRMKRKFSVLINTEDSL

TYK03695.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.4e-13943.24Show/hide
Query:  SYMGS--TAHCCLSELTLQEDKASIAEGQETTLQ-GAYTNDKFIVKYNPLFE-------------PDSDVVTVMMTETRTMEERMAEMQEHIDTLMKVIE
        SY+G     H C  E+   ED       +    Q         ++K NP  +             P  ++++VM+T   T E RMAE+++ ++ LMKV+E
Subjt:  SYMGS--TAHCCLSELTLQEDKASIAEGQETTLQ-GAYTNDKFIVKYNPLFE-------------PDSDVVTVMMTETRTMEERMAEMQEHIDTLMKVIE

Query:  EKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRTPIG------
        E+D +I  LK+ IE++  AESS    VKN DKGK ++Q+ QPQ S SIASLS+QQLQ+MI + I+ QYGG  Q   LYSKPYTKRIDNLR P G      
Subjt:  EKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRTPIG------

Query:  --------------------------------------------------------WEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRA
                                                                WE+LER+FLNRFYSTRR VSM ELTNT+Q+KGELV++YINRWRA
Subjt:  --------------------------------------------------------WEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRA

Query:  MSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSITSRENQDLLLPNMRKEGRNDEET-------IEESIVVNTTLPKSSSKE
        +SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI +R  +D L+P  R +     +T       I+ES+VV+ T  KS SK 
Subjt:  MSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSITSRENQDLLLPNMRKEGRNDEET-------IEESIVVNTTLPKSSSKE

Query:  K-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPRCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKNLILKLAMERKIELDL
        K     R+ +G      TLKERQ+K+YPF D+D+ DMLEQLLE QLI+LP+CKRP++ EKVDDP YCKYHRVI HPVE+CFVLK LILKLA E+KIEL++
Subjt:  K-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPRCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKNLILKLAMERKIELDL

Query:  DEVAQSN-----------------------------------LATIKEKSKYQRKKD-------------------------------------------
        DEVAQ+N                                   + TI  ++K    KD                                           
Subjt:  DEVAQSN-----------------------------------LATIKEKSKYQRKKD-------------------------------------------

Query:  -----PKKLQPKRK-----------------------------------------------RKEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATE
             PK ++ K +                                                KEV+N  +  QRTSVFDRIKP TTR SVF R+SMA  E
Subjt:  -----PKKLQPKRK-----------------------------------------------RKEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATE

Query:  EENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSFFDRLKVTNDQPKRKMNNLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTE
        EENQC     TR S F+RLS+S SKK+R STS FDRLK+TNDQ +R+M +L+ K F E N D K+ S +PSRMKRK  V INT+
Subjt:  EENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSFFDRLKVTNDQPKRKMNNLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTE

TYK23838.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.2e-13045.6Show/hide
Query:  SFKTSSMVVVMNKSYMGSTAHCCLSELTLQEDKASIAEGQETTLQGAYTNDKFIVKYNPLF------------EPDSDVVTVMMTETRTMEERMAEMQEH
        S K +S     +  Y+G      L     ++++  + + +             I++ NPLF            E   +VV+VMM +  T E  MAEM+  
Subjt:  SFKTSSMVVVMNKSYMGSTAHCCLSELTLQEDKASIAEGQETTLQGAYTNDKFIVKYNPLF------------EPDSDVVTVMMTETRTMEERMAEMQEH

Query:  IDTLMKVIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLR
        I+ LMKV+EE+D +I  LK Q++    AESSQT VVK  DKGK +VQ++QP Q S S+ASLS+QQLQDMI N IRAQYGG  Q S +YSKPYTKRIDNLR
Subjt:  IDTLMKVIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLR

Query:  TPIG-----WEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHD
         P+G       +LE +FLNRFYSTRR VSM ELTNTKQRKGE V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEELATRAHD
Subjt:  TPIG-----WEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHD

Query:  MELSITSRENQDLLLPNMRKEGRNDEET-------IEESIVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLI
        MELSI +R  +D L+   R +     +T       + ES++V  T  KS SK K   +  +H        TL+ERQKK+YPF D+DI DMLEQL+E QLI
Subjt:  MELSITSRENQDLLLPNMRKEGRNDEET-------IEESIVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLI

Query:  KLPRCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKNLILKLAMERKIELDLDEVAQSN----------------------------------------
        +LP CKR E+  +VDDP YCKYH+VI HP+E+CFVLK LILKLA E +IELD+DEVAQ+N                                        
Subjt:  KLPRCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKNLILKLAMERKIELDLDEVAQSN----------------------------------------

Query:  ---------------------LATIKEK--------SKYQRK--------------------------KDPKKLQPKR----------------------
                             +A  KEK        S + RK                          KD   LQP++                      
Subjt:  ---------------------LATIKEK--------SKYQRK--------------------------KDPKKLQPKR----------------------

Query:  ----------------KRKEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSFFDRLKVTNDQP
                          +E+DNS + +QRTS FDRIKP TTR SVF R+SMA  +EENQC  ST  R  AF+RLS+STSKK R STS FDRLK+TNDQ 
Subjt:  ----------------KRKEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSFFDRLKVTNDQP

Query:  KRKM
        +R+M
Subjt:  KRKM

TrEMBL top hitse value%identityAlignment
A0A5A7SRE2 Ty3-gypsy retrotransposon protein1.0e-13846.27Show/hide
Query:  KTSSMVVVMNKSYMGSTAHCCLSELTLQEDKASIAEGQETTLQGAYTNDKFIVKYNPLF------------EPDSDVVTVMMTETRTMEERMAEMQEHID
        K +S    ++ SY+G      L     ++++  + + +             +++ NPLF            E   +VV+VMM +  T E  MAEM+  I+
Subjt:  KTSSMVVVMNKSYMGSTAHCCLSELTLQEDKASIAEGQETTLQGAYTNDKFIVKYNPLF------------EPDSDVVTVMMTETRTMEERMAEMQEHID

Query:  TLMKVIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRTP
         LMKV+EE+D +I  LK Q++    +ESSQT VVK  DKGK +V+++QP Q S S+ASLS+QQLQDMI N IRAQYGG  Q S +YSKPYTKRIDNLR P
Subjt:  TLMKVIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRTP

Query:  IGW--------------------------------EELEREFL-----NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEM
        +G+                                ++L R+F+     N F  TRR VSM ELTNT QRKGE V++YINRWRA+SLDCKD+LTELS+VEM
Subjt:  IGW--------------------------------EELEREFL-----NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEM

Query:  CIQGMHWELLYILKGIKPRTFEELATRAHDMELSITSRENQDLLLPNMRKEGRNDEET-------IEESIVVNTTLPKSSSKEKRQTNGAHH-------L
        C QGMHWELLYIL+GIKPRTFEELATRAHDM+LSI +R  +D L+   R +     +T       + ES++V  T  KS SK K   +  +H        
Subjt:  CIQGMHWELLYILKGIKPRTFEELATRAHDMELSITSRENQDLLLPNMRKEGRNDEET-------IEESIVVNTTLPKSSSKEKRQTNGAHH-------L

Query:  TLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPRCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKNLILKLAMERKIELDLDEVAQSN----------
        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP CKRPE++ KVDDP YCKYHRVI H VE+CFVLK LI KLA E KIELD+DEVAQ+N          
Subjt:  TLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPRCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKNLILKLAMERKIELDLDEVAQSN----------

Query:  ----------------------LATIKEK---SKYQRKKDPKKLQPKR------------------------------------KRKEVDNSKKSEQRTS
                              L   ++K   S  Q K++P + + +                                       +E+DNS + +QRTS
Subjt:  ----------------------LATIKEK---SKYQRKKDPKKLQPKR------------------------------------KRKEVDNSKKSEQRTS

Query:  VFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSFFDRLKVTNDQPKRKMNNLEVKLFDEVNSDKKLQSSIPSRMKRK
        VFD IKP TTR SVF R+SMA  +EENQC   T  + SAF+RLS+S SKK R ST  FDRLK+TNDQ +R+M  L+ K F E N D K+ S +PSRMKRK
Subjt:  VFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSFFDRLKVTNDQPKRKMNNLEVKLFDEVNSDKKLQSSIPSRMKRK

Query:  FSVLINTEDSL
         SV INTE SL
Subjt:  FSVLINTEDSL

A0A5A7TQK6 Retrotransposon gag protein1.4e-14046.41Show/hide
Query:  PDSDVVTVMMTETRTMEERMAEMQEHIDTLMKVIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR
        P  ++++VM+T   T E RMAE+++ ++ LMKV+EE+D +I  LK+ IE+   AESS    VKN DKGK ++Q+ QP+   SIASLS+QQLQ+MIT+ I+
Subjt:  PDSDVVTVMMTETRTMEERMAEMQEHIDTLMKVIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR

Query:  AQYGGLTQDSLLYSKPYTKRIDNLRTPIG-----------------------------------WEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVV
         QYGG  Q   LYSKPYTKRIDNLR P G                                   WE+LER+FLNRFYSTRR +SM ELTNT+Q+KGELV+
Subjt:  AQYGGLTQDSLLYSKPYTKRIDNLRTPIG-----------------------------------WEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVV

Query:  NYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSITSRENQDLLLPNMRKEGRNDEET-------IEESIVVNTT
        +YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTF+ELATRAHDMEL++ ++  +D L+P +R +    ++T       I+ES+VV+ T
Subjt:  NYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSITSRENQDLLLPNMRKEGRNDEET-------IEESIVVNTT

Query:  LPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPRCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKNLILKLAM
          KS SK K     R+ +G      TLKERQ+K+YPFP++D+ DMLEQLLE QLI+L  CKR E+  KVDDP YCKYHRVI HP+E+CFVLK LILKLA 
Subjt:  LPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPRCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKNLILKLAM

Query:  ERKIELDLDEVAQSNLATIK---------------------------------------------------------------------EKSKYQRK---
        E+KIELD+DEVAQ+N   IK                                                                     ++S++ +K   
Subjt:  ERKIELDLDEVAQSNLATIK---------------------------------------------------------------------EKSKYQRK---

Query:  -----------------------KDPKKLQPKR---------------------------------------KRKEVDNSKKSEQRTSVFDRIKPPTTRP
                               KD   LQ +R                                         KEV+NS +  QRTSVFDRIKP TTR 
Subjt:  -----------------------KDPKKLQPKR---------------------------------------KRKEVDNSKKSEQRTSVFDRIKPPTTRP

Query:  SVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSFFDRLKVTNDQPKRKMNNLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTE
        SVF R+SMA  EE+NQC     TR S F+RLS+STSKK+R STS FDRLK+TNDQ +R+M +L+ K F E N D K+ S IPSRMKRK  V INTE
Subjt:  SVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSFFDRLKVTNDQPKRKMNNLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTE

A0A5A7URH1 Ty3-gypsy retrotransposon protein1.4e-14851.98Show/hide
Query:  PDSDVVTVMMTETRTMEERMAEMQEHIDTLMKVIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR
        P  ++++VM+T   T E RMAE+++ ++ LMKV+EE+D +I  LK+ IE++  AESS    VKN DKGK ++Q+ QPQ S SIASLS+QQLQ+MI + I+
Subjt:  PDSDVVTVMMTETRTMEERMAEMQEHIDTLMKVIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR

Query:  AQYGGLTQDSLLYSKPYTKRIDNLRTPIG--------------------------------------------------------------WEELEREFL
         QYGG  Q   LY KPYTKRIDNLR P G                                                              WE+LER+FL
Subjt:  AQYGGLTQDSLLYSKPYTKRIDNLRTPIG--------------------------------------------------------------WEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSITSRENQDLLLPNM
        NRFYSTR  VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI +R  +D L+P  
Subjt:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSITSRENQDLLLPNM

Query:  RKEGRNDEET-------IEESIVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPRCKRPEEMEKVDDPK
        R +    ++T       I+ES+VV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP CKRPE+  KVDDP 
Subjt:  RKEGRNDEET-------IEESIVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPRCKRPEEMEKVDDPK

Query:  YCKYHRVIGHPVERCFVLKNLILKLAMERKIELDLDEVAQSNLATIKEKSKYQRKKDPKKLQPKR-----------------------------------
        YCKYHRVI HPVE+CFVLK LILKLA E+KIELD+DEVAQ+N A I+  S   + KD   LQ +R                                   
Subjt:  YCKYHRVIGHPVERCFVLKNLILKLAMERKIELDLDEVAQSNLATIKEKSKYQRKKDPKKLQPKR-----------------------------------

Query:  ----KRKEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSFFDRLKVTNDQPKRKMNNLEVKLF
              KEV+NS +  QRTSVFDRIKP TTR SVF R+S+A  EEENQC     TR S  +RLS+ST KK R STS FDRLK+TNDQ +R+M + + K F
Subjt:  ----KRKEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSFFDRLKVTNDQPKRKMNNLEVKLF

Query:  DEVNSDKKLQSSIPSRMKRKFSVLINTEDSL
         E N D K+ S +PSRMKRK  V INTE SL
Subjt:  DEVNSDKKLQSSIPSRMKRKFSVLINTEDSL

A0A5D3BX77 Retrotransposon gag protein6.9e-14043.24Show/hide
Query:  SYMGS--TAHCCLSELTLQEDKASIAEGQETTLQ-GAYTNDKFIVKYNPLFE-------------PDSDVVTVMMTETRTMEERMAEMQEHIDTLMKVIE
        SY+G     H C  E+   ED       +    Q         ++K NP  +             P  ++++VM+T   T E RMAE+++ ++ LMKV+E
Subjt:  SYMGS--TAHCCLSELTLQEDKASIAEGQETTLQ-GAYTNDKFIVKYNPLFE-------------PDSDVVTVMMTETRTMEERMAEMQEHIDTLMKVIE

Query:  EKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRTPIG------
        E+D +I  LK+ IE++  AESS    VKN DKGK ++Q+ QPQ S SIASLS+QQLQ+MI + I+ QYGG  Q   LYSKPYTKRIDNLR P G      
Subjt:  EKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRTPIG------

Query:  --------------------------------------------------------WEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRA
                                                                WE+LER+FLNRFYSTRR VSM ELTNT+Q+KGELV++YINRWRA
Subjt:  --------------------------------------------------------WEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRA

Query:  MSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSITSRENQDLLLPNMRKEGRNDEET-------IEESIVVNTTLPKSSSKE
        +SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI +R  +D L+P  R +     +T       I+ES+VV+ T  KS SK 
Subjt:  MSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSITSRENQDLLLPNMRKEGRNDEET-------IEESIVVNTTLPKSSSKE

Query:  K-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPRCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKNLILKLAMERKIELDL
        K     R+ +G      TLKERQ+K+YPF D+D+ DMLEQLLE QLI+LP+CKRP++ EKVDDP YCKYHRVI HPVE+CFVLK LILKLA E+KIEL++
Subjt:  K-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPRCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKNLILKLAMERKIELDL

Query:  DEVAQSN-----------------------------------LATIKEKSKYQRKKD-------------------------------------------
        DEVAQ+N                                   + TI  ++K    KD                                           
Subjt:  DEVAQSN-----------------------------------LATIKEKSKYQRKKD-------------------------------------------

Query:  -----PKKLQPKRK-----------------------------------------------RKEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATE
             PK ++ K +                                                KEV+N  +  QRTSVFDRIKP TTR SVF R+SMA  E
Subjt:  -----PKKLQPKRK-----------------------------------------------RKEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATE

Query:  EENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSFFDRLKVTNDQPKRKMNNLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTE
        EENQC     TR S F+RLS+S SKK+R STS FDRLK+TNDQ +R+M +L+ K F E N D K+ S +PSRMKRK  V INT+
Subjt:  EENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSFFDRLKVTNDQPKRKMNNLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTE

A0A5D3DK07 Ty3-gypsy retrotransposon protein5.9e-13145.6Show/hide
Query:  SFKTSSMVVVMNKSYMGSTAHCCLSELTLQEDKASIAEGQETTLQGAYTNDKFIVKYNPLF------------EPDSDVVTVMMTETRTMEERMAEMQEH
        S K +S     +  Y+G      L     ++++  + + +             I++ NPLF            E   +VV+VMM +  T E  MAEM+  
Subjt:  SFKTSSMVVVMNKSYMGSTAHCCLSELTLQEDKASIAEGQETTLQGAYTNDKFIVKYNPLF------------EPDSDVVTVMMTETRTMEERMAEMQEH

Query:  IDTLMKVIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLR
        I+ LMKV+EE+D +I  LK Q++    AESSQT VVK  DKGK +VQ++QP Q S S+ASLS+QQLQDMI N IRAQYGG  Q S +YSKPYTKRIDNLR
Subjt:  IDTLMKVIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLR

Query:  TPIG-----WEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHD
         P+G       +LE +FLNRFYSTRR VSM ELTNTKQRKGE V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEELATRAHD
Subjt:  TPIG-----WEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHD

Query:  MELSITSRENQDLLLPNMRKEGRNDEET-------IEESIVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLI
        MELSI +R  +D L+   R +     +T       + ES++V  T  KS SK K   +  +H        TL+ERQKK+YPF D+DI DMLEQL+E QLI
Subjt:  MELSITSRENQDLLLPNMRKEGRNDEET-------IEESIVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLI

Query:  KLPRCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKNLILKLAMERKIELDLDEVAQSN----------------------------------------
        +LP CKR E+  +VDDP YCKYH+VI HP+E+CFVLK LILKLA E +IELD+DEVAQ+N                                        
Subjt:  KLPRCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKNLILKLAMERKIELDLDEVAQSN----------------------------------------

Query:  ---------------------LATIKEK--------SKYQRK--------------------------KDPKKLQPKR----------------------
                             +A  KEK        S + RK                          KD   LQP++                      
Subjt:  ---------------------LATIKEK--------SKYQRK--------------------------KDPKKLQPKR----------------------

Query:  ----------------KRKEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSFFDRLKVTNDQP
                          +E+DNS + +QRTS FDRIKP TTR SVF R+SMA  +EENQC  ST  R  AF+RLS+STSKK R STS FDRLK+TNDQ 
Subjt:  ----------------KRKEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSFFDRLKVTNDQP

Query:  KRKM
        +R+M
Subjt:  KRKM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTTCAAGACTTCTTCAATGGTCGTTGTCATGAACAAGTCTTACATGGGTTCTACTGCCCATTGTTGCCTCAGTGAACTGACGTTGCAAGAAGATAAAGCTTCTAT
CGCTGAAGGCCAAGAAACAACCTTGCAGGGGGCATATACTAATGACAAGTTTATTGTTAAGTATAACCCTTTGTTTGAACCTGATTCTGACGTAGTGACTGTCATGATGA
CTGAGACTAGAACTATGGAAGAAAGAATGGCTGAGATGCAGGAGCATATCGACACCTTGATGAAGGTGATTGAAGAAAAAGATTCTCAAATCGAGCAACTAAAGAGCCAA
ATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAGTCGTAAAAAATCACGACAAAGGAAAGACTATAGTGCAAGATGATCAACCACAGTGTTCTGCTTCGATCGC
TTCACTATCCATCCAACAACTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACTTACTCAAGATTCCCTCTTGTATTCCAAACCTTATACTAAGAGGA
TTGATAACTTGAGAACGCCAATCGGTTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGACGAACCGTTAGCATGTTCGAGCTCACAAACACTAAA
CAACGAAAAGGTGAACTCGTTGTTAACTATATAAATCGCTGGAGAGCTATGAGTCTAGATTGCAAAGATCGTCTCACTGAACTCTCTTCTGTTGAAATGTGCATTCAAGG
CATGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGCGCCCACGATATGGAGCTAAGTATTACTAGTCGAGAAAACC
AAGACCTTCTCCTCCCTAACATGAGAAAAGAAGGAAGGAACGATGAAGAGACTATAGAAGAATCCATAGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAG
CGACAAACTAATGGAGCGCATCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCA
ACTGATAAAGCTTCCTAGGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCG
TCCTAAAGAACTTAATTCTAAAGCTGGCTATGGAAAGAAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATAAAAGAAAAGAGCAAATATCAA
AGAAAGAAGGATCCTAAGAAACTTCAACCCAAGAGAAAGAGAAAAGAAGTTGACAATTCTAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAAC
TACTCGTCCTTCGGTATTCCATAGAATGAGTATGGCTGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCT
CCACATCGAAGAAAAGTCGATCTTCAACATCTTTCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAATAACTTGGAGGTGAAACTTTTCGATGAA
GTAAACAGCGACAAGAAGCTTCAAAGTAGCATCCCGTCACGTATGAAGAGGAAGTTCTCTGTCCTCATAAATACAGAAGATTCCTTGAAGTGGGGGCACCACAGCGAATG
GCAGTTGCTTCCTCCAAGTCTGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCCCCAAGTTCGAA
GGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTTCGCGCTGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTC
TCACACGCTTCGCTGCAGTTTCTTCTCCCTAACTTCGAAGGTTCTCACGCGCTTCGCTGCAATTCCTTCCTCCCTAAGTTTGAAGGTTCTCACGTTGCTTCGCTCCAGTT
CCTTCCTCCAAGTTCGAAGGTTTTCATGCGCTTTGCTGCAGTTCCTTCTCTCCACGTTCAAAGGTTCTCACGCATTCCGCTACAGTTCATTTCCTCCAAGTTCAAAGGTT
CTCAAGTCGCTGCGCTGCAATTCGTTCCTCCTAAATTCGAAGGTTCTCTCACGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACGTCGCTTCGTTGCAG
TTCCTTCCTCCAAGTTTGAGGGTTCTTACGTCGATTCGCTGCAGTTCCTTCCTCCCAAATTCGAAGGTTTTTCTCACGCACTTCGTTGCAGTTCCTTCCTCCAAGTTCAA
AGGTTCTCAAGTCGCTGCGCTGCAATTCTTTCCTCCCAAATTCGAAGGTTCTCACGTCGCTGCTTCGTTGCAGTTACCTTCCTCCAAGTTCGAGGGTTCTCACGCGCTTC
GCTGCAGTTGCCTTCCTCCAAAAGTTCGAAGGTTCTCAAGCGCTTCGCTGCAGTTCCTTCCTCCAAATTTGAAGGTTCTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCA
AGTTCAGAGGTTCTCAAGTCTCTGCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGTCGCTTCGTTGCAATTGCCTTCCTCCAAGTTCGAAGGTTCTCAAGCGC
TTCGCTGCATGCAGTTGCCTTCCTCCAAGTTCGAAGGTTCTCCAAGCGCTTCGCTGCATGCAGTTGCCTTCCTCCAAGTTCGAAGGTTCTCCAAGCGCTTCGCTGCATGC
AGTTGCCTTCCTCCAAGTTCTCGAAGGGTTCTCCAAGCGCTTTGCTGCATGCAGTTGCCTTCCTCCAAGTTCTTGAAGGGTTCTCACGCGCTTCGCTGCAGTTGCCTCTT
CCTACCTCCAAAGCAAAGTTCGAAGGGTTCTCACGCGCTTCGCTGCAGTTGCCTTCCTCCAAAAGTTCGAAGGTTCTCAAGCGCTTCGCTGCAGTTCCTTCCTCCAAATT
TGAAGGTTCTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGTTACCTTCCTCCAAGTTCGAGAGTTCTCATGCGCTTCGCTGCAGTTGCCTTCCTCCAAAA
GTTCGAAGGTTCTCAAGCGCTTCGCTGCAGTTCCTTCCTCCAAATTTGAAGGTTCTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCAGAGGTTCTCAAGTCGCT
GCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGTCGCTTCGTTGCAGTTGCCTTCCTCCAAGTTCGAAGGTTCTCAAGCGCTTCGCTGCATGCAGTTGCCTTCC
TCCAAGTTCGAAGGTTCTCCAAGCGCTTCGCTGCATGCAGTTGCCTTCCTCCAAGTTTTCGAAGGGTTCTCCAAGCGCTTCGCTGCATGCAGTTGCCTTCCTCCAAGTTC
TTGAAGGGTTCTCACGCGCTTCGCTGCAGTTGCCTCTTCCTACCTCCAAAGCAAAGTTCGAAGGGTTCTCACGCGCTTCGCTACATTTGCCTTCCTCCAAGTTTGAAGGT
TCTCAAGTCGCTGCGCTGCAATTCGTTCCTCCCAAATTCGAAGGTTCTTTCGTTGCAGTTGCCTTCCTCCAAGCCGATCCAAGAGATCATCAACCTAACAGACCGATCAT
CCAAGAAGGTCAACAAGCCAACAGGTCGATCATAAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATTCGACAGATCATCAA
GCCAAACGACCGATCAAGAAGATCAACAAGTCAGCAGACTGATCATCCAAAAAGATCAACAAGCCAACAGGCTGATCATCCAAGAAGATCAACAAGTAAGCAGACCGATC
ATCAAACCGATCATCCGAGAAGATCAACAAGTCAACAGACCGATCATCCAAGAAGATCAACAAGCCAACAGGTCGATCATCCAAGAAGATCAACAAGCCAATAAGCCGAT
CCAAGAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAACCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTTCAAGACTTCTTCAATGGTCGTTGTCATGAACAAGTCTTACATGGGTTCTACTGCCCATTGTTGCCTCAGTGAACTGACGTTGCAAGAAGATAAAGCTTCTAT
CGCTGAAGGCCAAGAAACAACCTTGCAGGGGGCATATACTAATGACAAGTTTATTGTTAAGTATAACCCTTTGTTTGAACCTGATTCTGACGTAGTGACTGTCATGATGA
CTGAGACTAGAACTATGGAAGAAAGAATGGCTGAGATGCAGGAGCATATCGACACCTTGATGAAGGTGATTGAAGAAAAAGATTCTCAAATCGAGCAACTAAAGAGCCAA
ATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAGTCGTAAAAAATCACGACAAAGGAAAGACTATAGTGCAAGATGATCAACCACAGTGTTCTGCTTCGATCGC
TTCACTATCCATCCAACAACTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACTTACTCAAGATTCCCTCTTGTATTCCAAACCTTATACTAAGAGGA
TTGATAACTTGAGAACGCCAATCGGTTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGACGAACCGTTAGCATGTTCGAGCTCACAAACACTAAA
CAACGAAAAGGTGAACTCGTTGTTAACTATATAAATCGCTGGAGAGCTATGAGTCTAGATTGCAAAGATCGTCTCACTGAACTCTCTTCTGTTGAAATGTGCATTCAAGG
CATGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGCGCCCACGATATGGAGCTAAGTATTACTAGTCGAGAAAACC
AAGACCTTCTCCTCCCTAACATGAGAAAAGAAGGAAGGAACGATGAAGAGACTATAGAAGAATCCATAGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAG
CGACAAACTAATGGAGCGCATCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCA
ACTGATAAAGCTTCCTAGGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCG
TCCTAAAGAACTTAATTCTAAAGCTGGCTATGGAAAGAAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATAAAAGAAAAGAGCAAATATCAA
AGAAAGAAGGATCCTAAGAAACTTCAACCCAAGAGAAAGAGAAAAGAAGTTGACAATTCTAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAAC
TACTCGTCCTTCGGTATTCCATAGAATGAGTATGGCTGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCT
CCACATCGAAGAAAAGTCGATCTTCAACATCTTTCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAATAACTTGGAGGTGAAACTTTTCGATGAA
GTAAACAGCGACAAGAAGCTTCAAAGTAGCATCCCGTCACGTATGAAGAGGAAGTTCTCTGTCCTCATAAATACAGAAGATTCCTTGAAGTGGGGGCACCACAGCGAATG
GCAGTTGCTTCCTCCAAGTCTGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCCCCAAGTTCGAA
GGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTTCGCGCTGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTC
TCACACGCTTCGCTGCAGTTTCTTCTCCCTAACTTCGAAGGTTCTCACGCGCTTCGCTGCAATTCCTTCCTCCCTAAGTTTGAAGGTTCTCACGTTGCTTCGCTCCAGTT
CCTTCCTCCAAGTTCGAAGGTTTTCATGCGCTTTGCTGCAGTTCCTTCTCTCCACGTTCAAAGGTTCTCACGCATTCCGCTACAGTTCATTTCCTCCAAGTTCAAAGGTT
CTCAAGTCGCTGCGCTGCAATTCGTTCCTCCTAAATTCGAAGGTTCTCTCACGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACGTCGCTTCGTTGCAG
TTCCTTCCTCCAAGTTTGAGGGTTCTTACGTCGATTCGCTGCAGTTCCTTCCTCCCAAATTCGAAGGTTTTTCTCACGCACTTCGTTGCAGTTCCTTCCTCCAAGTTCAA
AGGTTCTCAAGTCGCTGCGCTGCAATTCTTTCCTCCCAAATTCGAAGGTTCTCACGTCGCTGCTTCGTTGCAGTTACCTTCCTCCAAGTTCGAGGGTTCTCACGCGCTTC
GCTGCAGTTGCCTTCCTCCAAAAGTTCGAAGGTTCTCAAGCGCTTCGCTGCAGTTCCTTCCTCCAAATTTGAAGGTTCTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCA
AGTTCAGAGGTTCTCAAGTCTCTGCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGTCGCTTCGTTGCAATTGCCTTCCTCCAAGTTCGAAGGTTCTCAAGCGC
TTCGCTGCATGCAGTTGCCTTCCTCCAAGTTCGAAGGTTCTCCAAGCGCTTCGCTGCATGCAGTTGCCTTCCTCCAAGTTCGAAGGTTCTCCAAGCGCTTCGCTGCATGC
AGTTGCCTTCCTCCAAGTTCTCGAAGGGTTCTCCAAGCGCTTTGCTGCATGCAGTTGCCTTCCTCCAAGTTCTTGAAGGGTTCTCACGCGCTTCGCTGCAGTTGCCTCTT
CCTACCTCCAAAGCAAAGTTCGAAGGGTTCTCACGCGCTTCGCTGCAGTTGCCTTCCTCCAAAAGTTCGAAGGTTCTCAAGCGCTTCGCTGCAGTTCCTTCCTCCAAATT
TGAAGGTTCTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGTTACCTTCCTCCAAGTTCGAGAGTTCTCATGCGCTTCGCTGCAGTTGCCTTCCTCCAAAA
GTTCGAAGGTTCTCAAGCGCTTCGCTGCAGTTCCTTCCTCCAAATTTGAAGGTTCTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCAGAGGTTCTCAAGTCGCT
GCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGTCGCTTCGTTGCAGTTGCCTTCCTCCAAGTTCGAAGGTTCTCAAGCGCTTCGCTGCATGCAGTTGCCTTCC
TCCAAGTTCGAAGGTTCTCCAAGCGCTTCGCTGCATGCAGTTGCCTTCCTCCAAGTTTTCGAAGGGTTCTCCAAGCGCTTCGCTGCATGCAGTTGCCTTCCTCCAAGTTC
TTGAAGGGTTCTCACGCGCTTCGCTGCAGTTGCCTCTTCCTACCTCCAAAGCAAAGTTCGAAGGGTTCTCACGCGCTTCGCTACATTTGCCTTCCTCCAAGTTTGAAGGT
TCTCAAGTCGCTGCGCTGCAATTCGTTCCTCCCAAATTCGAAGGTTCTTTCGTTGCAGTTGCCTTCCTCCAAGCCGATCCAAGAGATCATCAACCTAACAGACCGATCAT
CCAAGAAGGTCAACAAGCCAACAGGTCGATCATAAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATTCGACAGATCATCAA
GCCAAACGACCGATCAAGAAGATCAACAAGTCAGCAGACTGATCATCCAAAAAGATCAACAAGCCAACAGGCTGATCATCCAAGAAGATCAACAAGTAAGCAGACCGATC
ATCAAACCGATCATCCGAGAAGATCAACAAGTCAACAGACCGATCATCCAAGAAGATCAACAAGCCAACAGGTCGATCATCCAAGAAGATCAACAAGCCAATAAGCCGAT
CCAAGAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAACCTAG
Protein sequenceShow/hide protein sequence
MSFKTSSMVVVMNKSYMGSTAHCCLSELTLQEDKASIAEGQETTLQGAYTNDKFIVKYNPLFEPDSDVVTVMMTETRTMEERMAEMQEHIDTLMKVIEEKDSQIEQLKSQ
IENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRTPIGWEELEREFLNRFYSTRRTVSMFELTNTK
QRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSITSRENQDLLLPNMRKEGRNDEETIEESIVVNTTLPKSSSKEK
RQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPRCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKNLILKLAMERKIELDLDEVAQSNLATIKEKSKYQ
RKKDPKKLQPKRKRKEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSFFDRLKVTNDQPKRKMNNLEVKLFDE
VNSDKKLQSSIPSRMKRKFSVLINTEDSLKWGHHSEWQLLPPSLKVLTRFAAVPSSQFEGSHALRCSSFPQVRRFSRRFAAVPSSKFEGSHIASLRAALRCSSFLQVRRF
SHASLQFLLPNFEGSHALRCNSFLPKFEGSHVASLQFLPPSSKVFMRFAAVPSLHVQRFSRIPLQFISSKFKGSQVAALQFVPPKFEGSLTCFAAVPSSKFEGSHVASLQ
FLPPSLRVLTSIRCSSFLPNSKVFLTHFVAVPSSKFKGSQVAALQFFPPKFEGSHVAASLQLPSSKFEGSHALRCSCLPPKVRRFSSASLQFLPPNLKVLSRASLQFLPP
SSEVLKSLRCSSFLQVQRFSRRFVAIAFLQVRRFSSASLHAVAFLQVRRFSKRFAACSCLPPSSKVLQALRCMQLPSSKFSKGSPSALLHAVAFLQVLEGFSRASLQLPL
PTSKAKFEGFSRASLQLPSSKSSKVLKRFAAVPSSKFEGSLTRFAAVPSSKFKVTFLQVREFSCASLQLPSSKSSKVLKRFAAVPSSKFEGSLTRFAAVPSSKFRGSQVA
ALQFLPPSSKVLTSLRCSCLPPSSKVLKRFAACSCLPPSSKVLQALRCMQLPSSKFSKGSPSASLHAVAFLQVLEGFSRASLQLPLPTSKAKFEGFSRASLHLPSSKFEG
SQVAALQFVPPKFEGSFVAVAFLQADPRDHQPNRPIIQEGQQANRSIIKKINKSAGRSSKRINKLTSRFDRSSSQTTDQEDQQVSRLIIQKDQQANRLIIQEDQQVSRPI
IKPIIREDQQVNRPIIQEDQQANRSIIQEDQQANKPIQEIIKPTGRSKRSST