; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022073 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022073
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr7:17326511..17337190
RNA-Seq ExpressionLag0022073
SyntenyLag0022073
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.3e-15850.28Show/hide
Query:  KTSSMVAVMNKSYMGSTVHCCFNGLTLQEDKASVVADQETTLQGAYTNDKFFVKYNPLF------------ELDSDVVTVMMTGTRTMKERMVEMQEHID
        K +S  + ++ SY+G            ++++  V+  +              ++ NPLF            E   +VV+VMM    T +  M EM+  I+
Subjt:  KTSSMVAVMNKSYMGSTVHCCFNGLTLQEDKASVVADQETTLQGAYTNDKFFVKYNPLF------------ELDSDVVTVMMTGTRTMKERMVEMQEHID

Query:  TLMKTIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHGKGNTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTP
         LMK +EE+D +I  LK Q++    +ESSQT VVK   KG  +V+++QP Q S S+ASLS+QQLQDMI N IRAQYGGP Q S +YSKPYTKRIDNLR P
Subjt:  TLMKTIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHGKGNTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTP

Query:  IEYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESLDSWEELEREFL-----NRFYSTRRTVSMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELS
        + YQP KFQQFDGKGNPKQHI HFVETCENA        ++L R+F+     N F  TRR VSM ELTNT QRKGE V++YIN WRA+SLDCKD+LTELS
Subjt:  IEYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESLDSWEELEREFL-----NRFYSTRRTVSMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELS

Query:  SVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH----
        +VEMC QGMHWELLYIL+GIKPRTFEELATRAHDM+LSIA++  +D L+   R +     +T       + ESM+V  T  KS SK K   +  +H    
Subjt:  SVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH----

Query:  ---LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKE
            TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE+CFVLK+LI KLA E KIELD+DEVAQ+N   +  
Subjt:  ---LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKE

Query:  KSK--------HQRKK-------DPKKLQPMRKRSKKFSQPQQ----------LVMLNKSFSKNFHKKKKENFA--TSYCIDV-------EEVDNSEMGE
         S          QRK        +P  ++  +K     SQ ++          +  L +SF ++  ++  E  A  T+  ++V       EE+DNS   +
Subjt:  KSK--------HQRKK-------DPKKLQPMRKRSKKFSQPQQ----------LVMLNKSFSKNFHKKKKENFA--TSYCIDV-------EEVDNSEMGE

Query:  QRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKKLQSSIPSR
        QRTSVFD IKP TTR S FQR+SMA  +EENQC   T  + SAF+RLS+S SKK RPST  FDRLK+T+ Q +R+M  L+ K F E N D K+ S +PSR
Subjt:  QRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKKLQSSIPSR

Query:  MKRKFSVLINTE
        MKRK SV INTE
Subjt:  MKRKFSVLINTE

KAA0033746.1 retrotransposon gag protein [Cucumis melo var. makuwa]2.3e-15045.41Show/hide
Query:  KTSSMVAVMNKSYMGSTVHCCFNGLTLQEDKASVVADQETTLQGAYTNDK--FFVKYNPLF------------ELDSDVVTVMMTGTRTMKERMVEMQEH
        K +S  +  + SY+G            ++++  V+  ++ +L+    + K    ++ NPLF            E   +VV+VMM    T +  M EM+  
Subjt:  KTSSMVAVMNKSYMGSTVHCCFNGLTLQEDKASVVADQETTLQGAYTNDK--FFVKYNPLF------------ELDSDVVTVMMTGTRTMKERMVEMQEH

Query:  IDTLMKTIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHGKGNTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR
        I+ LMK  EE+D +I  LK Q++     ESSQT VVK   KG  +VQ++QP Q S S+ASLS+QQLQDMI N IRAQYGGP Q S +YSK YTKRIDNLR
Subjt:  IDTLMKTIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHGKGNTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR

Query:  TPIEYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVE
         P+ YQPPKFQQFDGKGNPKQHIAHFVETCENA        ++L R+F+          + FE     + +GE V++YIN WRA+SLDCKD+LTELS+VE
Subjt:  TPIEYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVE

Query:  MCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRND--------EETIEESMVVNTTLPKSSSKEKRQTNGAHH------
        MC QGMHWELLYIL+GIKPRTFEEL+TRAHDMELSIA+   +D L+   ++  +N+           + ESM+V  T  KS SK K   +  ++      
Subjt:  MCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRND--------EETIEESMVVNTTLPKSSSKEKRQTNGAHH------

Query:  -LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSN--------
          TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA E KIELD+DEVAQ+N        
Subjt:  -LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSN--------

Query:  ----------------------------------------------------------------------------LATIKEKSKHQRKKDPKKLQPMRK
                                                                                       I  K K +R K   K +P++ 
Subjt:  ----------------------------------------------------------------------------LATIKEKSKHQRKKDPKKLQPMRK

Query:  RSKKFSQPQQLVMLNKSFSKNFHKKKKENFA------TSYCIDV-------EEVDNSEMGEQRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTST
        + K F QP++ + L +   ++F +   E         T+  ++V       EEVDNS   +QRT VF RIKP T R S FQR+SMA  EEENQC  ST  
Subjt:  RSKKFSQPQQLVMLNKSFSKNFHKKKKENFA------TSYCIDV-------EEVDNSEMGEQRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTST

Query:  RPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTE
        R SAF+RLS+ST KK RPSTS FDRLK+ + Q +R+M +L+ K F E N D K+ S +PSRMKRK SV INTE
Subjt:  RPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTE

KAA0045440.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.1e-15246.49Show/hide
Query:  KASVVADQETTLQGAYTNDKFFVKYNPLFELDSDVVTVMMTGTRTMKERMVEMQEHIDTLMKTIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHGKGNT
        K  +V  +   +   Y++ K   +  P      ++++VM+T   T + RM E+++ ++ LMK +EE+D +I  LK+ IE+   AESS    VKN  KG  
Subjt:  KASVVADQETTLQGAYTNDKFFVKYNPLFELDSDVVTVMMTGTRTMKERMVEMQEHIDTLMKTIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHGKGNT

Query:  IVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIEYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESLDSWEELE
        ++Q+ QP+   SIASLS+QQLQ+MIT+ I+ QYGG  Q   LYSKPYTKRIDNLR P  YQPPKFQQ+  K      +    +  +   PES+D+WE+LE
Subjt:  IVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIEYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESLDSWEELE

Query:  REFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLL
        R+FLNRFYSTRR +SM ELTNT+Q+KGELV++YIN WRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTF+ELATRAHDMEL++A++  +D L
Subjt:  REFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLL

Query:  LPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKV
        +P +R +    ++T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFP++D+ DMLEQLLE QLI+L +CKR E+  KV
Subjt:  LPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKV

Query:  DDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSN---------------------------------------LATIKEKSK------
        DDP YCKYHRVI HP+E+CFVLK+LILKLA E+KIELD+DEVAQ+N                                       + TI  ++K      
Subjt:  DDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSN---------------------------------------LATIKEKSK------

Query:  ---------HQRKKDPKKLQ------------------------------PMRKRSKKFSQPQQLVMLNKSFSKNFHKKKKENFA------TSYCIDV--
                 HQ+ + P  +Q                              P++ + + F Q ++ + L +   ++F +   E          +  ++V  
Subjt:  ---------HQRKKDPKKLQ------------------------------PMRKRSKKFSQPQQLVMLNKSFSKNFHKKKKENFA------TSYCIDV--

Query:  -----EEVDNSEMGEQRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFD
             +EV+NS    QRTSVFDRIKP TTR S FQR+SMA  EE+NQC     TR S F+RLS+STSKK RPSTS FDRLK+T+ Q +R+M +L+ K F 
Subjt:  -----EEVDNSEMGEQRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFD

Query:  EVNSDKKLQSSIPSRMKRKFSVLINTE
        E N D K+ S IPSRMKRK  V INTE
Subjt:  EVNSDKKLQSSIPSRMKRKFSVLINTE

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.3e-17153.92Show/hide
Query:  KASVVADQETTLQGAYTNDKFFVKYNPLFELDSDVVTVMMTGTRTMKERMVEMQEHIDTLMKTIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHGKGNT
        K  +V  +   +   Y++ K   +  P      ++++VM+T   T + RM E+++ ++ LMK +EE+D +I  LK+ IE++  AESS    VKN  KG  
Subjt:  KASVVADQETTLQGAYTNDKFFVKYNPLFELDSDVVTVMMTGTRTMKERMVEMQEHIDTLMKTIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHGKGNT

Query:  IVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIEYQPPKFQQFDGKGNPKQHIAHFVETCENA------------
        ++Q+ QPQ S SIASLS+QQLQ+MI + I+ QYGGP Q   LY KPYTKRIDNLR P  YQPPKFQQFDGKGNPKQH+AHF++TCE A            
Subjt:  IVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIEYQPPKFQQFDGKGNPKQHIAHFVETCENA------------

Query:  --------------EPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKP
                      EPES+D+WE+LER+FLNRFYSTR  VSM ELTNT+Q+KGELV++YIN WRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKP
Subjt:  --------------EPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKP

Query:  RTFEELATRAHDMELSIASQENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIP
        RTFEELATRAHDMELSIA++  +D L+P  R +    ++T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ 
Subjt:  RTFEELATRAHDMELSIASQENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIP

Query:  DMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPMRK----
        DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA E+KIELD+DEVAQ+N A I+  S   + KD   LQ  R     
Subjt:  DMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPMRK----

Query:  ----RSKKFSQPQQLVMLNKSFSKNFHKKKKENFATSYCIDVEEVDNSEMGEQRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTSTRPSAFQRLS
            RS     P++++ +    + +   +   N+ +S     +EV+NS    QRTSVFDRIKP TTR S FQR+S+A  EEENQC     TR S  +RLS
Subjt:  ----RSKKFSQPQQLVMLNKSFSKNFHKKKKENFATSYCIDVEEVDNSEMGEQRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTSTRPSAFQRLS

Query:  VSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTE
        +ST KK RPSTS FDRLK+T+ Q +R+M + + K F E N D K+ S +PSRMKRK  V INTE
Subjt:  VSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTE

TYK03695.1 retrotransposon gag protein [Cucumis melo var. makuwa]2.7e-16749.58Show/hide
Query:  DVVTVMMTGTRTMKERMVEMQEHIDTLMKTIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHGKGNTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQY
        ++++VM+T   T + RM E+++ ++ LMK +EE+D +I  LK+ IE++  AESS    VKN  KG  ++Q+ QPQ S SIASLS+QQLQ+MI + I+ QY
Subjt:  DVVTVMMTGTRTMKERMVEMQEHIDTLMKTIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHGKGNTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQY

Query:  GGPTQDSLLYSKPYTKRIDNLRTPIEYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESLDSWEELEREFLNRF
        GGP Q   LYSKPYTKRIDNLR P  YQPPKFQQFDGKGNPKQH+AHF+ETCE A                          EPES+D+WE+LER+FLNRF
Subjt:  GGPTQDSLLYSKPYTKRIDNLRTPIEYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESLDSWEELEREFLNRF

Query:  YSTRRTVSMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKE
        YSTRR VSM ELTNT+Q+KGELV++YIN WRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI ++  +D L+P  R +
Subjt:  YSTRRTVSMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKE

Query:  GRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCK
             +T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPF D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDDP YCK
Subjt:  GRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCK

Query:  YHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSN-----------------------------------LATIKEKSKHQRKKD-----------
        YHRVI HPVE+CFVLK+LILKLA E+KIEL++DEVAQ+N                                   + TI  ++K    KD           
Subjt:  YHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSN-----------------------------------LATIKEKSKHQRKKD-----------

Query:  ----PKKLQ------------------------------PMRKRSKKFSQPQQLVMLNKSFSKNF---HKKKKENFATSYCIDVEEVDN--------SEM
            P  +Q                              P++ + + F Q ++ + L +   ++F   H ++     T +   + EV+N        + +
Subjt:  ----PKKLQ------------------------------PMRKRSKKFSQPQQLVMLNKSFSKNF---HKKKKENFATSYCIDVEEVDN--------SEM

Query:  GE--QRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKKLQSS
         E  QRTSVFDRIKP TTR S FQR+SMA  EEENQC     TR S F+RLS+S SKK RPSTS FDRLK+T+ Q +R+M +L+ K F E N D K+ S 
Subjt:  GE--QRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKKLQSS

Query:  IPSRMKRKFSVLINTEEPKL
        +PSRMKRK  V INT+  ++
Subjt:  IPSRMKRKFSVLINTEEPKL

TrEMBL top hitse value%identityAlignment
A0A5A7SRE2 Ty3-gypsy retrotransposon protein6.5e-15950.28Show/hide
Query:  KTSSMVAVMNKSYMGSTVHCCFNGLTLQEDKASVVADQETTLQGAYTNDKFFVKYNPLF------------ELDSDVVTVMMTGTRTMKERMVEMQEHID
        K +S  + ++ SY+G            ++++  V+  +              ++ NPLF            E   +VV+VMM    T +  M EM+  I+
Subjt:  KTSSMVAVMNKSYMGSTVHCCFNGLTLQEDKASVVADQETTLQGAYTNDKFFVKYNPLF------------ELDSDVVTVMMTGTRTMKERMVEMQEHID

Query:  TLMKTIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHGKGNTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTP
         LMK +EE+D +I  LK Q++    +ESSQT VVK   KG  +V+++QP Q S S+ASLS+QQLQDMI N IRAQYGGP Q S +YSKPYTKRIDNLR P
Subjt:  TLMKTIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHGKGNTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTP

Query:  IEYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESLDSWEELEREFL-----NRFYSTRRTVSMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELS
        + YQP KFQQFDGKGNPKQHI HFVETCENA        ++L R+F+     N F  TRR VSM ELTNT QRKGE V++YIN WRA+SLDCKD+LTELS
Subjt:  IEYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESLDSWEELEREFL-----NRFYSTRRTVSMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELS

Query:  SVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH----
        +VEMC QGMHWELLYIL+GIKPRTFEELATRAHDM+LSIA++  +D L+   R +     +T       + ESM+V  T  KS SK K   +  +H    
Subjt:  SVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH----

Query:  ---LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKE
            TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE+CFVLK+LI KLA E KIELD+DEVAQ+N   +  
Subjt:  ---LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKE

Query:  KSK--------HQRKK-------DPKKLQPMRKRSKKFSQPQQ----------LVMLNKSFSKNFHKKKKENFA--TSYCIDV-------EEVDNSEMGE
         S          QRK        +P  ++  +K     SQ ++          +  L +SF ++  ++  E  A  T+  ++V       EE+DNS   +
Subjt:  KSK--------HQRKK-------DPKKLQPMRKRSKKFSQPQQ----------LVMLNKSFSKNFHKKKKENFA--TSYCIDV-------EEVDNSEMGE

Query:  QRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKKLQSSIPSR
        QRTSVFD IKP TTR S FQR+SMA  +EENQC   T  + SAF+RLS+S SKK RPST  FDRLK+T+ Q +R+M  L+ K F E N D K+ S +PSR
Subjt:  QRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKKLQSSIPSR

Query:  MKRKFSVLINTE
        MKRK SV INTE
Subjt:  MKRKFSVLINTE

A0A5A7SUW1 Retrotransposon gag protein1.1e-15045.41Show/hide
Query:  KTSSMVAVMNKSYMGSTVHCCFNGLTLQEDKASVVADQETTLQGAYTNDK--FFVKYNPLF------------ELDSDVVTVMMTGTRTMKERMVEMQEH
        K +S  +  + SY+G            ++++  V+  ++ +L+    + K    ++ NPLF            E   +VV+VMM    T +  M EM+  
Subjt:  KTSSMVAVMNKSYMGSTVHCCFNGLTLQEDKASVVADQETTLQGAYTNDK--FFVKYNPLF------------ELDSDVVTVMMTGTRTMKERMVEMQEH

Query:  IDTLMKTIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHGKGNTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR
        I+ LMK  EE+D +I  LK Q++     ESSQT VVK   KG  +VQ++QP Q S S+ASLS+QQLQDMI N IRAQYGGP Q S +YSK YTKRIDNLR
Subjt:  IDTLMKTIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHGKGNTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR

Query:  TPIEYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVE
         P+ YQPPKFQQFDGKGNPKQHIAHFVETCENA        ++L R+F+          + FE     + +GE V++YIN WRA+SLDCKD+LTELS+VE
Subjt:  TPIEYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVE

Query:  MCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRND--------EETIEESMVVNTTLPKSSSKEKRQTNGAHH------
        MC QGMHWELLYIL+GIKPRTFEEL+TRAHDMELSIA+   +D L+   ++  +N+           + ESM+V  T  KS SK K   +  ++      
Subjt:  MCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRND--------EETIEESMVVNTTLPKSSSKEKRQTNGAHH------

Query:  -LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSN--------
          TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA E KIELD+DEVAQ+N        
Subjt:  -LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSN--------

Query:  ----------------------------------------------------------------------------LATIKEKSKHQRKKDPKKLQPMRK
                                                                                       I  K K +R K   K +P++ 
Subjt:  ----------------------------------------------------------------------------LATIKEKSKHQRKKDPKKLQPMRK

Query:  RSKKFSQPQQLVMLNKSFSKNFHKKKKENFA------TSYCIDV-------EEVDNSEMGEQRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTST
        + K F QP++ + L +   ++F +   E         T+  ++V       EEVDNS   +QRT VF RIKP T R S FQR+SMA  EEENQC  ST  
Subjt:  RSKKFSQPQQLVMLNKSFSKNFHKKKKENFA------TSYCIDV-------EEVDNSEMGEQRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTST

Query:  RPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTE
        R SAF+RLS+ST KK RPSTS FDRLK+ + Q +R+M +L+ K F E N D K+ S +PSRMKRK SV INTE
Subjt:  RPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTE

A0A5A7TQK6 Retrotransposon gag protein5.3e-15346.49Show/hide
Query:  KASVVADQETTLQGAYTNDKFFVKYNPLFELDSDVVTVMMTGTRTMKERMVEMQEHIDTLMKTIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHGKGNT
        K  +V  +   +   Y++ K   +  P      ++++VM+T   T + RM E+++ ++ LMK +EE+D +I  LK+ IE+   AESS    VKN  KG  
Subjt:  KASVVADQETTLQGAYTNDKFFVKYNPLFELDSDVVTVMMTGTRTMKERMVEMQEHIDTLMKTIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHGKGNT

Query:  IVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIEYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESLDSWEELE
        ++Q+ QP+   SIASLS+QQLQ+MIT+ I+ QYGG  Q   LYSKPYTKRIDNLR P  YQPPKFQQ+  K      +    +  +   PES+D+WE+LE
Subjt:  IVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIEYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESLDSWEELE

Query:  REFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLL
        R+FLNRFYSTRR +SM ELTNT+Q+KGELV++YIN WRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTF+ELATRAHDMEL++A++  +D L
Subjt:  REFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLL

Query:  LPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKV
        +P +R +    ++T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFP++D+ DMLEQLLE QLI+L +CKR E+  KV
Subjt:  LPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKV

Query:  DDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSN---------------------------------------LATIKEKSK------
        DDP YCKYHRVI HP+E+CFVLK+LILKLA E+KIELD+DEVAQ+N                                       + TI  ++K      
Subjt:  DDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSN---------------------------------------LATIKEKSK------

Query:  ---------HQRKKDPKKLQ------------------------------PMRKRSKKFSQPQQLVMLNKSFSKNFHKKKKENFA------TSYCIDV--
                 HQ+ + P  +Q                              P++ + + F Q ++ + L +   ++F +   E          +  ++V  
Subjt:  ---------HQRKKDPKKLQ------------------------------PMRKRSKKFSQPQQLVMLNKSFSKNFHKKKKENFA------TSYCIDV--

Query:  -----EEVDNSEMGEQRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFD
             +EV+NS    QRTSVFDRIKP TTR S FQR+SMA  EE+NQC     TR S F+RLS+STSKK RPSTS FDRLK+T+ Q +R+M +L+ K F 
Subjt:  -----EEVDNSEMGEQRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFD

Query:  EVNSDKKLQSSIPSRMKRKFSVLINTE
        E N D K+ S IPSRMKRK  V INTE
Subjt:  EVNSDKKLQSSIPSRMKRKFSVLINTE

A0A5A7URH1 Ty3-gypsy retrotransposon protein1.1e-17153.92Show/hide
Query:  KASVVADQETTLQGAYTNDKFFVKYNPLFELDSDVVTVMMTGTRTMKERMVEMQEHIDTLMKTIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHGKGNT
        K  +V  +   +   Y++ K   +  P      ++++VM+T   T + RM E+++ ++ LMK +EE+D +I  LK+ IE++  AESS    VKN  KG  
Subjt:  KASVVADQETTLQGAYTNDKFFVKYNPLFELDSDVVTVMMTGTRTMKERMVEMQEHIDTLMKTIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHGKGNT

Query:  IVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIEYQPPKFQQFDGKGNPKQHIAHFVETCENA------------
        ++Q+ QPQ S SIASLS+QQLQ+MI + I+ QYGGP Q   LY KPYTKRIDNLR P  YQPPKFQQFDGKGNPKQH+AHF++TCE A            
Subjt:  IVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIEYQPPKFQQFDGKGNPKQHIAHFVETCENA------------

Query:  --------------EPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKP
                      EPES+D+WE+LER+FLNRFYSTR  VSM ELTNT+Q+KGELV++YIN WRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKP
Subjt:  --------------EPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKP

Query:  RTFEELATRAHDMELSIASQENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIP
        RTFEELATRAHDMELSIA++  +D L+P  R +    ++T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ 
Subjt:  RTFEELATRAHDMELSIASQENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIP

Query:  DMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPMRK----
        DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA E+KIELD+DEVAQ+N A I+  S   + KD   LQ  R     
Subjt:  DMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPMRK----

Query:  ----RSKKFSQPQQLVMLNKSFSKNFHKKKKENFATSYCIDVEEVDNSEMGEQRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTSTRPSAFQRLS
            RS     P++++ +    + +   +   N+ +S     +EV+NS    QRTSVFDRIKP TTR S FQR+S+A  EEENQC     TR S  +RLS
Subjt:  ----RSKKFSQPQQLVMLNKSFSKNFHKKKKENFATSYCIDVEEVDNSEMGEQRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTSTRPSAFQRLS

Query:  VSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTE
        +ST KK RPSTS FDRLK+T+ Q +R+M + + K F E N D K+ S +PSRMKRK  V INTE
Subjt:  VSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTE

A0A5D3BX77 Retrotransposon gag protein1.3e-16749.58Show/hide
Query:  DVVTVMMTGTRTMKERMVEMQEHIDTLMKTIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHGKGNTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQY
        ++++VM+T   T + RM E+++ ++ LMK +EE+D +I  LK+ IE++  AESS    VKN  KG  ++Q+ QPQ S SIASLS+QQLQ+MI + I+ QY
Subjt:  DVVTVMMTGTRTMKERMVEMQEHIDTLMKTIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHGKGNTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQY

Query:  GGPTQDSLLYSKPYTKRIDNLRTPIEYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESLDSWEELEREFLNRF
        GGP Q   LYSKPYTKRIDNLR P  YQPPKFQQFDGKGNPKQH+AHF+ETCE A                          EPES+D+WE+LER+FLNRF
Subjt:  GGPTQDSLLYSKPYTKRIDNLRTPIEYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESLDSWEELEREFLNRF

Query:  YSTRRTVSMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKE
        YSTRR VSM ELTNT+Q+KGELV++YIN WRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI ++  +D L+P  R +
Subjt:  YSTRRTVSMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKE

Query:  GRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCK
             +T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPF D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDDP YCK
Subjt:  GRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCK

Query:  YHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSN-----------------------------------LATIKEKSKHQRKKD-----------
        YHRVI HPVE+CFVLK+LILKLA E+KIEL++DEVAQ+N                                   + TI  ++K    KD           
Subjt:  YHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSN-----------------------------------LATIKEKSKHQRKKD-----------

Query:  ----PKKLQ------------------------------PMRKRSKKFSQPQQLVMLNKSFSKNF---HKKKKENFATSYCIDVEEVDN--------SEM
            P  +Q                              P++ + + F Q ++ + L +   ++F   H ++     T +   + EV+N        + +
Subjt:  ----PKKLQ------------------------------PMRKRSKKFSQPQQLVMLNKSFSKNF---HKKKKENFATSYCIDVEEVDN--------SEM

Query:  GE--QRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKKLQSS
         E  QRTSVFDRIKP TTR S FQR+SMA  EEENQC     TR S F+RLS+S SKK RPSTS FDRLK+T+ Q +R+M +L+ K F E N D K+ S 
Subjt:  GE--QRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKKLQSS

Query:  IPSRMKRKFSVLINTEEPKL
        +PSRMKRK  V INT+  ++
Subjt:  IPSRMKRKFSVLINTEEPKL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTTCAAGACTTCTTCAATGGTCGCTGTCATGAACAAGTCATACATGGGTTCTACTGTCCATTGTTGCTTCAACGGACTGACGTTGCAAGAAGATAAAGCTTCTGT
CGTTGCAGACCAAGAAACAACCTTGCAGGGGGCATATACTAATGACAAGTTTTTTGTCAAGTATAACCCTTTGTTTGAACTTGATTCTGACGTAGTGACTGTCATGATGA
CTGGGACTAGAACTATGAAAGAAAGAATGGTTGAGATGCAGGAGCACATCGACACCTTGATGAAGACGATTGAAGAAAAAGATTCTCAAATCGAGCAACTAAAGAGCCAA
ATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAGTCGTAAAAAATCACGGCAAAGGAAATACTATAGTGCAAGATGATCAACCACAGTGTTCTGCTTCGATCGC
TTCACTATCCATCCAACAGCTCCAAGATATGATCACAAACTGTATTAGAGCCCAGTACGGTGGACCTACTCAAGATTCTCTCTTGTATTCCAAACCTTATACTAAGAGGA
TTGATAACTTGAGAACGCCAATCGAGTATCAGCCACCAAAATTTCAGCAGTTTGATGGAAAAGGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGCGAGAAC
GCTGAACCTGAGTCATTAGACAGTTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGAAGAACCGTTAGCATGTTCGAGCTCACCAACACTAAACA
ACGAAAAGGTGAACTCGTTGTTAACTATATAAATCACTGGAGAGCCATGAGTCTAGATTGCAAAGATCGTCTCACTGAACTCTCTTCCGTCGAGATGTGCATTCAAGGCA
TGCACTGGGAACTCCTCTATATCCTTAAAGGTATAAAACCTCGCACCTTTGAAGAACTAGCAACTCGCGCCCACGATATGGAACTAAGTATTGCTAGCCAAGAAAACCAA
GACCTTCTCCTCCCTAACATGAGAAAAGAAGGAAGGAACGATGAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCG
ACAAACTAATGGAGCGCATCACTTAACTTTAAAGGAAAGACAAAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAAC
TGATAAAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTC
CTAAAGGACTTAATTCTAAAGCTGGCTATGGAAAGAAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGAAAAGAGCAAACATCAAAG
AAAGAAGGATCCTAAGAAACTTCAACCCATGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGTGATGTTGAATAAATCTTTCTCCAAAAATTTCCACAAAA
AGAAAAAAGAGAACTTTGCGACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATTCTGAGATGGGTGAACAAAGGACCTCCGTCTTCGATCGCATCAAGCCTCCAACT
ACTCGTCCTTCGGCATTCCAAAGAATGAGTATGGCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTC
CACATCGAAGAAAACTCGACCTTCAACATCTGTTTTTGATCGCCTCAAAGTAACAAGCGGTCAATCTAAAAGAAAGATGGATAACTTGGAGGTGAAACTTTTTGATGAAG
TAAACAGCGATAAAAAGCTTCAAAGTAGCATCCCGTCACGTATGAAGAGGAAGTTCTCTGTTCTCATAAATACAGAAGAGCCTAAACTGCATGATGCTCCTAGCCCACAC
GAGCTTAAAAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACGACTGCTACGTTGTTC
CTCCTCCAAGTGTGAAGGATCTTATGTGGTGCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCG
AAGGTTCTCACGCGCTCCGTTGCAGTTCCTTCTTTTCAAGGTCGAAGGTTCTCACTCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGCA
GTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGTAGTTCCTTCCCCCCAAGTTTGAA
GGTTCTCACGCGCTACGCTGCAATTCCTTCCTCCAAGTTCAAAGAAACTGGGGAAGACGTGACCTGCAAGACCAGCAAAAACCGCTGCACACCGACGTGGTGTAGGGCAC
ACCGCCTCCGATGCTTAAGTCAGAAAACGGGAGGAGAGAAAGAAAATCCGGGGCGGCTAGAGGCTACAGAGACCAAGCAGGGCCCGACGGGCTCGGCCCGCGCAGGGGGT
CGGGCCAAAATTCCGACCCCTTCGGTCTTGGCCCGTCCCGCTCGCCGGTTTCGCCTCTTGGGTCCATCGTCCGGTCTGATTTTTGCTCGGTTGTCCTCGTCAGCTCCTTG
TGCACCGGGGTGGTCCAAAATTTCCTATAACATTAAGCCCCCACTCTTGAATTGGGATTCAGCTAGGATTCTCGGCCTTGGCATGAGGCTGAGCATTTCTTCTGCTTTTC
TTGCTAATTTGCTCTGTTGTATCCATCCCGAGGGTGCGTACACTCAGTCGGGGAAAACTTGGGGAGGTCGAGGAAATACTAGAGCCTCTCTTCAAGTATTGTATTCATCC
CAGGGTCGAGCTTCACTGCAAGCCCTCACAGTCGTCCTGAGGGCTAGAATGCTCGGCCTCGGCCTTGGCGTGAGGCCGGACATTTTTCCTACATTGCTTGTTATTTTGCT
CCTCATGCTGCTGGGTGCGTACACTCTTCTGAGGAAATGCTTGGGGAGGCCCAGGAATTTCCCAAGCCTCTCTTCAAGCTACATAGTCATCCCAGGGGTGCGTACACTCT
TCTGGGGAAAAGCTTTGGGAGGCCTAGGAAGTTCCCGAACCTCTCTTCAAGCTACATGGTCATCCCAGGGGTGCGTACACTCTTCTGAGGAAATGCTTGGGGAGGCCCAG
GAATTTCCCAAGCCTCTCTTCAAGCTACATAGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAAGCTTTGGGAGGCCTAGGAAGTTCCCAAACCTCTCTTCAAGTTG
CATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAAGTTCCCGAGCCTCTCTTCAAGCTGCATGGTCATCCCAGGGGTGCGTACACT
CTTCTGGGGAAATGCTTGGGGAGGCCCAGGAATTTTCCAAGCCTCTCTTCAAGCTACATAGTCATCCCAAGGGTGGTCGGCTCTGGCCTTGGCATGAGGCCGACCAACTC
CTCTGTGTCATCTGCGTGCTCAGTCCGTGCTCGACTTATGTGCAGAGCATCTCCCTTTCGGCTCGAATCGCTTCAGCTACGATAAAAGGTCAAGTGTTGTGGTCTTCCCA
GGGGTGCGTACACTCTTCTGGGGAAAAACTTGAGGAGGCTTGGGATTTTCCCGGGCCCCTCTTCAAGCTGCTGTGTTCATCCCAGGGATCAACAAGCCAAGAAGCCGATC
CAAGAGATCATCACGCCAACAGGCCGATCATCCAAGAAGATCAACAAGCCAATAGGTCGATCCAAGAGATCATCAACCTAGCAGACCGATCATCCAAGAAGCTCAACAAA
CTGGCCCAAGAAGATCCTTCAAGACTGGAAGACTTCAACAATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTTCAAGACTTCTTCAATGGTCGCTGTCATGAACAAGTCATACATGGGTTCTACTGTCCATTGTTGCTTCAACGGACTGACGTTGCAAGAAGATAAAGCTTCTGT
CGTTGCAGACCAAGAAACAACCTTGCAGGGGGCATATACTAATGACAAGTTTTTTGTCAAGTATAACCCTTTGTTTGAACTTGATTCTGACGTAGTGACTGTCATGATGA
CTGGGACTAGAACTATGAAAGAAAGAATGGTTGAGATGCAGGAGCACATCGACACCTTGATGAAGACGATTGAAGAAAAAGATTCTCAAATCGAGCAACTAAAGAGCCAA
ATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAGTCGTAAAAAATCACGGCAAAGGAAATACTATAGTGCAAGATGATCAACCACAGTGTTCTGCTTCGATCGC
TTCACTATCCATCCAACAGCTCCAAGATATGATCACAAACTGTATTAGAGCCCAGTACGGTGGACCTACTCAAGATTCTCTCTTGTATTCCAAACCTTATACTAAGAGGA
TTGATAACTTGAGAACGCCAATCGAGTATCAGCCACCAAAATTTCAGCAGTTTGATGGAAAAGGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGCGAGAAC
GCTGAACCTGAGTCATTAGACAGTTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGAAGAACCGTTAGCATGTTCGAGCTCACCAACACTAAACA
ACGAAAAGGTGAACTCGTTGTTAACTATATAAATCACTGGAGAGCCATGAGTCTAGATTGCAAAGATCGTCTCACTGAACTCTCTTCCGTCGAGATGTGCATTCAAGGCA
TGCACTGGGAACTCCTCTATATCCTTAAAGGTATAAAACCTCGCACCTTTGAAGAACTAGCAACTCGCGCCCACGATATGGAACTAAGTATTGCTAGCCAAGAAAACCAA
GACCTTCTCCTCCCTAACATGAGAAAAGAAGGAAGGAACGATGAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCG
ACAAACTAATGGAGCGCATCACTTAACTTTAAAGGAAAGACAAAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAAC
TGATAAAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTC
CTAAAGGACTTAATTCTAAAGCTGGCTATGGAAAGAAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGAAAAGAGCAAACATCAAAG
AAAGAAGGATCCTAAGAAACTTCAACCCATGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGTGATGTTGAATAAATCTTTCTCCAAAAATTTCCACAAAA
AGAAAAAAGAGAACTTTGCGACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATTCTGAGATGGGTGAACAAAGGACCTCCGTCTTCGATCGCATCAAGCCTCCAACT
ACTCGTCCTTCGGCATTCCAAAGAATGAGTATGGCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTC
CACATCGAAGAAAACTCGACCTTCAACATCTGTTTTTGATCGCCTCAAAGTAACAAGCGGTCAATCTAAAAGAAAGATGGATAACTTGGAGGTGAAACTTTTTGATGAAG
TAAACAGCGATAAAAAGCTTCAAAGTAGCATCCCGTCACGTATGAAGAGGAAGTTCTCTGTTCTCATAAATACAGAAGAGCCTAAACTGCATGATGCTCCTAGCCCACAC
GAGCTTAAAAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACGACTGCTACGTTGTTC
CTCCTCCAAGTGTGAAGGATCTTATGTGGTGCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCG
AAGGTTCTCACGCGCTCCGTTGCAGTTCCTTCTTTTCAAGGTCGAAGGTTCTCACTCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGCA
GTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGTAGTTCCTTCCCCCCAAGTTTGAA
GGTTCTCACGCGCTACGCTGCAATTCCTTCCTCCAAGTTCAAAGAAACTGGGGAAGACGTGACCTGCAAGACCAGCAAAAACCGCTGCACACCGACGTGGTGTAGGGCAC
ACCGCCTCCGATGCTTAAGTCAGAAAACGGGAGGAGAGAAAGAAAATCCGGGGCGGCTAGAGGCTACAGAGACCAAGCAGGGCCCGACGGGCTCGGCCCGCGCAGGGGGT
CGGGCCAAAATTCCGACCCCTTCGGTCTTGGCCCGTCCCGCTCGCCGGTTTCGCCTCTTGGGTCCATCGTCCGGTCTGATTTTTGCTCGGTTGTCCTCGTCAGCTCCTTG
TGCACCGGGGTGGTCCAAAATTTCCTATAACATTAAGCCCCCACTCTTGAATTGGGATTCAGCTAGGATTCTCGGCCTTGGCATGAGGCTGAGCATTTCTTCTGCTTTTC
TTGCTAATTTGCTCTGTTGTATCCATCCCGAGGGTGCGTACACTCAGTCGGGGAAAACTTGGGGAGGTCGAGGAAATACTAGAGCCTCTCTTCAAGTATTGTATTCATCC
CAGGGTCGAGCTTCACTGCAAGCCCTCACAGTCGTCCTGAGGGCTAGAATGCTCGGCCTCGGCCTTGGCGTGAGGCCGGACATTTTTCCTACATTGCTTGTTATTTTGCT
CCTCATGCTGCTGGGTGCGTACACTCTTCTGAGGAAATGCTTGGGGAGGCCCAGGAATTTCCCAAGCCTCTCTTCAAGCTACATAGTCATCCCAGGGGTGCGTACACTCT
TCTGGGGAAAAGCTTTGGGAGGCCTAGGAAGTTCCCGAACCTCTCTTCAAGCTACATGGTCATCCCAGGGGTGCGTACACTCTTCTGAGGAAATGCTTGGGGAGGCCCAG
GAATTTCCCAAGCCTCTCTTCAAGCTACATAGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAAGCTTTGGGAGGCCTAGGAAGTTCCCAAACCTCTCTTCAAGTTG
CATGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAAGTTCCCGAGCCTCTCTTCAAGCTGCATGGTCATCCCAGGGGTGCGTACACT
CTTCTGGGGAAATGCTTGGGGAGGCCCAGGAATTTTCCAAGCCTCTCTTCAAGCTACATAGTCATCCCAAGGGTGGTCGGCTCTGGCCTTGGCATGAGGCCGACCAACTC
CTCTGTGTCATCTGCGTGCTCAGTCCGTGCTCGACTTATGTGCAGAGCATCTCCCTTTCGGCTCGAATCGCTTCAGCTACGATAAAAGGTCAAGTGTTGTGGTCTTCCCA
GGGGTGCGTACACTCTTCTGGGGAAAAACTTGAGGAGGCTTGGGATTTTCCCGGGCCCCTCTTCAAGCTGCTGTGTTCATCCCAGGGATCAACAAGCCAAGAAGCCGATC
CAAGAGATCATCACGCCAACAGGCCGATCATCCAAGAAGATCAACAAGCCAATAGGTCGATCCAAGAGATCATCAACCTAGCAGACCGATCATCCAAGAAGCTCAACAAA
CTGGCCCAAGAAGATCCTTCAAGACTGGAAGACTTCAACAATCCTTGA
Protein sequenceShow/hide protein sequence
MSFKTSSMVAVMNKSYMGSTVHCCFNGLTLQEDKASVVADQETTLQGAYTNDKFFVKYNPLFELDSDVVTVMMTGTRTMKERMVEMQEHIDTLMKTIEEKDSQIEQLKSQ
IENQHIAESSQTQVVKNHGKGNTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIEYQPPKFQQFDGKGNPKQHIAHFVETCEN
AEPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQ
DLLLPNMRKEGRNDEETIEESMVVNTTLPKSSSKEKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
LKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPMRKRSKKFSQPQQLVMLNKSFSKNFHKKKKENFATSYCIDVEEVDNSEMGEQRTSVFDRIKPPT
TRPSAFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTEEPKLHDAPSPH
ELKSSFSPSSRVLTLYAIALFLLQVRGFSVVRLLRCSSSKCEGSYVVRCCIVPSSLKFDGSHAALLEFLLPKFEGSHALRCSSFFSRSKVLTRCVAVLSPQVRRFTHFAA
VPSPKFEGSHALRSAIPSPKFEGSHALRCSSFPPSLKVLTRYAAIPSSKFKETGEDVTCKTSKNRCTPTWCRAHRLRCLSQKTGGEKENPGRLEATETKQGPTGSARAGG
RAKIPTPSVLARPARRFRLLGPSSGLIFARLSSSAPCAPGWSKISYNIKPPLLNWDSARILGLGMRLSISSAFLANLLCCIHPEGAYTQSGKTWGGRGNTRASLQVLYSS
QGRASLQALTVVLRARMLGLGLGVRPDIFPTLLVILLLMLLGAYTLLRKCLGRPRNFPSLSSSYIVIPGVRTLFWGKALGGLGSSRTSLQATWSSQGCVHSSEEMLGEAQ
EFPKPLFKLHSHPRGAYTLLGKSFGRPRKFPNLSSSCMVIPGVRTLFWGKAWGGLGSSRASLQAAWSSQGCVHSSGEMLGEAQEFSKPLFKLHSHPKGGRLWPWHEADQL
LCVICVLSPCSTYVQSISLSARIASATIKGQVLWSSQGCVHSSGEKLEEAWDFPGPLFKLLCSSQGSTSQEADPRDHHANRPIIQEDQQANRSIQEIINLADRSSKKLNK
LAQEDPSRLEDFNNP