CuGenDBv2

Lag0007553 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0007553
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr9:956004..961832
RNA-Seq Expression: Lag0007553
Synteny: Lag0007553
Gene Ontology terms:
GO:0006412 - translation (biological process)
GO:0015074 - DNA integration (biological process)
GO:0005840 - ribosome (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003735 - structural constituent of ribosome (molecular function)
InterPro domains:
IPR000589 - Ribosomal protein S15
IPR001584 - Integrase, catalytic core
IPR009068 - S15/NS1, RNA-binding
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
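The genome location in the record above is written as chromosome:start..end. As a minimal sketch, the string can be split into its parts and the span length computed. The function name `parse_location` is hypothetical, and the length assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr9:956004..961832")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)  # chr9 956004 961832 5829
```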


Homology
GenBank top hits (hit; e value; % identity; alignment)
GAU17915.1 hypothetical protein TSUD_330400, partial [Trifolium subterraneum]; e value 6.5e-95; identity 35.35%
Query:  ILTPPQHSTHVDGYVLGTKSRPSEFLESPGEAGKLQLTPNPKFEEWTTMDQALSGWLFGSMTPAVTADAVSFKTSKEVWKALEQMYGATSKARINQLRGT
        ++ P      +DGY+LG K  P EF+ +   + K     NP+FE+W   DQ L GWL  SMT  +    +  +TS ++W+    + GA ++++I  L+  
Subjt:  ILTPPQHSTHVDGYVLGTKSRPSEFLESPGEAGKLQLTPNPKFEEWTTMDQALSGWLFGSMTPAVTADAVSFKTSKEVWKALEQMYGATSKARINQLRGT

Query:  LQNTKKGSTKMLDYLATMKQASENLQLAGAPVSLSDLISYVFGGLDAEYIPIICTIQEKEISSWQELHAILQTM------------LIISKETS------
          +T+KG  KM DYL  MK  ++ L+LAG P+S SDLI     GLD+EY P++  + ++    W +L A L T             L ++   +      
Subjt:  LQNTKKGSTKMLDYLATMKQASENLQLAGAPVSLSDLISYVFGGLDAEYIPIICTIQEKEISSWQELHAILQTM------------LIISKETS------

Query:  ------QDNNKIRGKTIKEIKTGDIRTISPQTISPQTMVLEIATILEVEGEEDTEIKERFDEDFN-NPHGTSNKGNGENLAYLATSEIVCDPNWLADSGA
                NN  RG   +  + G  R  S +T            +   +     +   RFD+ ++ + H ++N   G +  +LA+   V D +W  DSGA
Subjt:  ------QDNNKIRGKTIKEIKTGDIRTISPQTISPQTMVLEIATILEVEGEEDTEIKERFDEDFN-NPHGTSNKGNGENLAYLATSEIVCDPNWLADSGA

Query:  TSHVTRGDHRSVGVGFRLKPTPTTDMSVSGGRFLSSVLAVFGEPTDRQVGFTLRKTDSDRNYKKTDRPTSV---------RRAKRP-----LELIHCDLW
        ++HVT   H++       +      + V  G  L  V       T R +   LR T  D  Y+ +++ +S          R+   P     LEL+H D+W
Subjt:  TSHVTRGDHRSVGVGFRLKPTPTTDMSVSGGRFLSSVLAVFGEPTDRQVGFTLRKTDSDRNYKKTDRPTSV---------RRAKRP-----LELIHCDLW

Query:  GPSPIPSTAVYRYYISLVDNFSRFTHVFPLKTKGEALNAFIQYKALVEIKLDLKIKTLQSDWGGEFRSFSPFLKQHGIEFRHPCHHTSQQNEIVERKHRH
        GP+PI S++ ++YY+  +D+F+RFT ++PLK K +  +AFIQ+K +VE + + +IKT+Q D GGE+++      + GI+FR  C +TSQQN   ERKHRH
Subjt:  GPSPIPSTAVYRYYISLVDNFSRFTHVFPLKTKGEALNAFIQYKALVEIKLDLKIKTLQSDWGGEFRSFSPFLKQHGIEFRHPCHHTSQQNEIVERKHRH

Query:  IVEMGLTLLAQSSMPLRFWWDAFTSAVFIINRLPTPVLNNSSPWHTLFKSHPDYTFLRVFGSACFPCLRPYQSHKFQFHSTNVFSLAIVSITKG
        I E GLTLLAQ+ MPL +WW+AF++AV++INRLP+PV +N SP+  L K  PDY  L+ FG AC+PCL+PY  HK QFH+T    L   +  KG
Subjt:  IVEMGLTLLAQSSMPLRFWWDAFTSAVFIINRLPTPVLNNSSPWHTLFKSHPDYTFLRVFGSACFPCLRPYQSHKFQFHSTNVFSLAIVSITKG

GAU21262.1 hypothetical protein TSUD_286720 [Trifolium subterraneum]; e value 4.8e-90; identity 34.95%
Query:  ILTPPQHSTHVDGYVLGTKSRPSEFLESPGEAGKLQLTPNPKFEEWTTMDQALSGWLFGSMTPAVTADAVSFKTSKEVWKALEQMYGATSKARINQLRGT
        ++ P      +DGY+LGTK  P +F+ +     K+    NP++EEW   DQAL GWL  SM   +    +  +TSKE+W   + + GA +++RI  L+  
Subjt:  ILTPPQHSTHVDGYVLGTKSRPSEFLESPGEAGKLQLTPNPKFEEWTTMDQALSGWLFGSMTPAVTADAVSFKTSKEVWKALEQMYGATSKARINQLRGT

Query:  LQNTKKGSTKMLDYLATMKQASENLQLAGAPVSLSDLISYVFGGLDAEYIPIICTIQEKEISSWQELHAILQTMLIISKETSQDNNKIRGKTIKEIKTGD
          NT+KG  KM  YL  MK  ++ L++AG+P+S SDL+     GLDAE+ P++  + ++   SW +L A    +L       Q NN I            
Subjt:  LQNTKKGSTKMLDYLATMKQASENLQLAGAPVSLSDLISYVFGGLDAEYIPIICTIQEKEISSWQELHAILQTMLIISKETSQDNNKIRGKTIKEIKTGD

Query:  IRTISPQTISPQTMVLEIATILEVEGEEDTEIKERFDEDFNNPHGTSNKGNGENLAYLATSEIVCDPNWLADSGATSHVTRGDHRSVGVGFRLKPTPTTD
               ++S  T     +       +  T    R       P+  +    G + A++ +     D  W  DSGA++HVT  + +   +           
Subjt:  IRTISPQTISPQTMVLEIATILEVEGEEDTEIKERFDEDFNNPHGTSNKGNGENLAYLATSEIVCDPNWLADSGATSHVTRGDHRSVGVGFRLKPTPTTD

Query:  MSVSGGRFLSSVLAVFGEPTDRQVGFTLRKTDSDRNYKKTDRPTSVRRAKRPLELIHCDLWGPSPIPSTAVYRYYISLVDNFSRFTHVFPLKTKGEALNA
        M ++  +F    L  F                           +S   AK PL+LIH D+WGP+PI S++  +YY+  +D+FSRFT +FPLK K E ++A
Subjt:  MSVSGGRFLSSVLAVFGEPTDRQVGFTLRKTDSDRNYKKTDRPTSVRRAKRPLELIHCDLWGPSPIPSTAVYRYYISLVDNFSRFTHVFPLKTKGEALNA

Query:  FIQYKALVEIKLDLKIKTLQSDWGGEFRSFSPFLKQHGIEFRHPCHHTSQQNEIVERKHRHIVEMGLTLLAQSSMPLRFWWDAFTSAVFIINRLPTPVLN
        F Q+K +VE + + KI+ +Q D GGE++       + GI+FR  C +TSQQN   ERKHRH+ E+G+TLLAQ+ MPL +WW+AF+++V++INRLP+ V  
Subjt:  FIQYKALVEIKLDLKIKTLQSDWGGEFRSFSPFLKQHGIEFRHPCHHTSQQNEIVERKHRHIVEMGLTLLAQSSMPLRFWWDAFTSAVFIINRLPTPVLN

Query:  NSSPWHTLFKSHPDYTFLRVFGSACFPCLRPYQSHKFQFHSTNVFSLAIVSITKG
        N SP+  LF+  PDY  L+ FG AC+PCL+PY  HK QFH+T    L   +  KG
Subjt:  NSSPWHTLFKSHPDYTFLRVFGSACFPCLRPYQSHKFQFHSTNVFSLAIVSITKG

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum]; e value 1.6e-85; identity 32.73%
Query:  VDGYVLGTKSRPSEFLESPGEAGKLQLTPNPKFEEWTTMDQALSGWLFGSMTPAVTADAVSFKTSKEVWKALEQMYGATSKARINQLRGTLQNTKKGSTK
        +DGY+LGT   P +F+ S  ++ K+    NP F +W   DQAL GWL  SM   +    +  +TSK++W   + + GA +K+RI  L+    NT+KG  K
Subjt:  VDGYVLGTKSRPSEFLESPGEAGKLQLTPNPKFEEWTTMDQALSGWLFGSMTPAVTADAVSFKTSKEVWKALEQMYGATSKARINQLRGTLQNTKKGSTK

Query:  MLDYLATMKQASENLQLAGAPVSLSDLISYVFGGLDAEYIPIICTIQEKEISSWQELHAILQTMLIISKETSQDN--------------NKIRGKTIKEI
        M +YL  MK  S+ L+LAG+P+S SDL+     GLDAEY P++  + ++   SW ++ A    +L       Q N              NK   +  K  
Subjt:  MLDYLATMKQASENLQLAGAPVSLSDLISYVFGGLDAEYIPIICTIQEKEISSWQELHAILQTMLIISKETSQDN--------------NKIRGKTIKEI

Query:  KTGDIRTISPQTI----SPQTMVLEIATILEVEGEEDTEIKERFDEDF-NNPHGTSNKGNGENLAYLATSEIVCDPNWLADSGATSHVTR--------GD
          G+ R  + + +        M      +    G    +   RFD  +    + T     G + A++A+     D  W  DSGA +HVT          +
Subjt:  KTGDIRTISPQTI----SPQTMVLEIATILEVEGEEDTEIKERFDEDF-NNPHGTSNKGNGENLAYLATSEIVCDPNWLADSGATSHVTR--------GD

Query:  HRS-----VGVGFRLK--------------------PTPTTD-MSVSGGRFLSSVLAVFG----EPTDRQVGFTLRK-------------------TDSD
        H       VG G +LK                    P  T + +SVS     +++L  F        D+  G TL K                   +  +
Subjt:  HRS-----VGVGFRLK--------------------PTPTTD-MSVSGGRFLSSVLAVFG----EPTDRQVGFTLRK-------------------TDSD

Query:  RNYKKTDRPT---------------------------------------SVRRAKRPLELIHCDLWGPSPIPSTAVYRYYISLVDNFSRFTHVFPLKTKG
          ++K   P                                        S    + PL LIH D+WGP+PI S + ++YY+  +D+FSRFT +FPLK K 
Subjt:  RNYKKTDRPT---------------------------------------SVRRAKRPLELIHCDLWGPSPIPSTAVYRYYISLVDNFSRFTHVFPLKTKG

Query:  EALNAFIQYKALVEIKLDLKIKTLQSDWGGEFRSFSPFLKQHGIEFRHPCHHTSQQNEIVERKHRHIVEMGLTLLAQSSMPLRFWWDAFTSAVFIINRLP
        + ++AFIQ+K L E + + KIK +Q D GGE+++      + GI+FR  C +TSQQN   ERKHRH+ E+GLTLLAQ+ MPLR+WW+AF++AV++INRLP
Subjt:  EALNAFIQYKALVEIKLDLKIKTLQSDWGGEFRSFSPFLKQHGIEFRHPCHHTSQQNEIVERKHRHIVEMGLTLLAQSSMPLRFWWDAFTSAVFIINRLP

Query:  TPVLNNSSPWHTLFKSHPDYTFLRVFGSACFPCLRPYQSHKFQFHSTNVFSLAIVSITKG
        + V  N SP+  +FK  PDY  L+ FG AC+PCL+PY  HK QFH+T    +   +  KG
Subjt:  TPVLNNSSPWHTLFKSHPDYTFLRVFGSACFPCLRPYQSHKFQFHSTNVFSLAIVSITKG

PNX76291.1 gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense]; e value 1.0e-87; identity 31.92%
Query:  ILTPPQHSTHVDGYVLGTKSRPSEFLESPGEAGKLQLTPNPKFEEWTTMDQALSGWLFGSMTPAVTADAVSFKTSKEVWKALEQMYGATSKARINQLRGT
        ++ P      +DGY+LG K  P EF+ +   + K     NP+FE+W   DQ L GWL  SMT  +    +  +TS ++W   + + GA ++++I  L+  
Subjt:  ILTPPQHSTHVDGYVLGTKSRPSEFLESPGEAGKLQLTPNPKFEEWTTMDQALSGWLFGSMTPAVTADAVSFKTSKEVWKALEQMYGATSKARINQLRGT

Query:  LQNTKKGSTKMLDYLATMKQASENLQLAGAPVSLSDLISYVFGGLDAEYIPIICTIQEKEISSWQELHAILQTM------------------LIISKETS
          +T+KG  KM DYL  MK  ++ L+LAG P+S SDLI     GLD+EY P++  + ++   SW +L A L T                     ++K++ 
Subjt:  LQNTKKGSTKMLDYLATMKQASENLQLAGAPVSLSDLISYVFGGLDAEYIPIICTIQEKEISSWQELHAILQTM------------------LIISKETS

Query:  Q-------------DNNKIRGKTIKEIKTGDIRTISPQTISPQTMVLEIATILEVEGEEDTEIKERFDEDFN-NPHGTSNKGNGENLAYLATSEIVCDPN
                       NN  RG   +  + G  R  S +T            +  ++     +   RFD+ ++ + H  +N   G + A+LA+   + D +
Subjt:  Q-------------DNNKIRGKTIKEIKTGDIRTISPQTISPQTMVLEIATILEVEGEEDTEIKERFDEDFN-NPHGTSNKGNGENLAYLATSEIVCDPN

Query:  WLADSGATSHVTR--------GDHRS-----VGVGFRLKPTPTTD---------------------MSVSGGRFLSSVLAVFGE----PTDRQVG-----
        W  DSGA++HVT          +H       VG G +L+   T                       +SVS     +++L  F E      D+  G     
Subjt:  WLADSGATSHVTR--------GDHRS-----VGVGFRLKPTPTTD---------------------MSVSGGRFLSSVLAVFGE----PTDRQVG-----

Query:  -------FTLRKTDS-------DRNYKKTDRP---------------------------------------TSVRRAKRPLELIHCDLWGPSPIPSTAVY
               + L + DS       +  ++K   P                                       TS   AK  LEL+H D+WGP+PI S++ +
Subjt:  -------FTLRKTDS-------DRNYKKTDRP---------------------------------------TSVRRAKRPLELIHCDLWGPSPIPSTAVY

Query:  RYYISLVDNFSRFTHVFPLKTKGEALNAFIQYKALVEIKLDLKIKTLQSDWGGEFRSFSPFLKQHGIEFRHPCHHTSQQNEIVERKHRHIVEMGLTLLAQ
        +YY+  +D+F+RFT ++PLK K +  +AFIQ+K +VE +   KIKT+Q D GGE++       + GI+FR  C +TSQQN   ERKHRHI E GLTLLAQ
Subjt:  RYYISLVDNFSRFTHVFPLKTKGEALNAFIQYKALVEIKLDLKIKTLQSDWGGEFRSFSPFLKQHGIEFRHPCHHTSQQNEIVERKHRHIVEMGLTLLAQ

Query:  SSMPLRFWWDAFTSAVFIINRLPTPVLNNSSPWHTLFKSHPDYTFLRVFGSACFPCLRPYQSHKFQFHSTNVFSLAIVSITKG
        + MPL +WW+AF++AV++INRLP+ V +N SP+  L K  PDY  L+ FG AC+P L+PY  HK QFH+T    L   +  KG
Subjt:  SSMPLRFWWDAFTSAVFIINRLPTPVLNNSSPWHTLFKSHPDYTFLRVFGSACFPCLRPYQSHKFQFHSTNVFSLAIVSITKG

PNX94503.1 putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense]; e value 4.2e-86; identity 33.09%
Query:  DGYVLGTKSRPSEFLESPGEAGKLQLTPNPKFEEWTTMDQALSGWLFGSMTPAVTADAVSFKTSKEVWKALEQMYGATSKARINQLRGTLQNTKKGSTKM
        DGY+LGTK  P +F+ S     K+    NP +++W   DQAL GWL  SMT  +    +  +TSK++W   + + GA +++RI  L+    NT K   KM
Subjt:  DGYVLGTKSRPSEFLESPGEAGKLQLTPNPKFEEWTTMDQALSGWLFGSMTPAVTADAVSFKTSKEVWKALEQMYGATSKARINQLRGTLQNTKKGSTKM

Query:  LDYLATMKQASENLQLAGAPVSLSDLISYVFGGLDAEYIPIICTIQEKEISSWQELHA------------------ILQTMLIISKETSQDNNKI-----
          YLA MK  ++ L+LAG+P+S SDL+     GLD+EY P++  + ++   SW +  A                   L      + +     NK      
Subjt:  LDYLATMKQASENLQLAGAPVSLSDLISYVFGGLDAEYIPIICTIQEKEISSWQELHA------------------ILQTMLIISKETSQDNNKI-----

Query:  -RGKTIKEIKTGDIRTISPQTISPQTMVLEIATILEVEGEEDTEIKERFDEDFNNPHGTSNKGNGENLAYLATSEIVCDPNWLADSGATSHVTRGDHRS-
         RG   + ++ G  R    +   P      I  I    G    +   RFD+ +   +  + +G G + A++A+     D  W  DSGA++HVT   H+S 
Subjt:  -RGKTIKEIKTGDIRTISPQTISPQTMVLEIATILEVEGEEDTEIKERFDEDFNNPHGTSNKGNGENLAYLATSEIVCDPNWLADSGATSHVTRGDHRS-

Query:  ---------------VGVGFRLK--------------------PTPTTD-MSVSGGRFLSSVLAVFGE--------------------------------
                       VG G +LK                    P  T + +SVS     ++ L  F E                                
Subjt:  ---------------VGVGFRLK--------------------PTPTTD-MSVSGGRFLSSVLAVFGE--------------------------------

Query:  PTD--------------RQVGF----TLRKTDSDRNYK--KTDR-----------------PTSVRRAKRPLELIHCDLWGPSPIPSTAVYRYYISLVDN
        PT+              R++G      L K   D N K   +D+                  TS   AK PL+LIH D+WGP+PI S + ++YY+  +D+
Subjt:  PTD--------------RQVGF----TLRKTDSDRNYK--KTDR-----------------PTSVRRAKRPLELIHCDLWGPSPIPSTAVYRYYISLVDN

Query:  FSRFTHVFPLKTKGEALNAFIQYKALVEIKLDLKIKTLQSDWGGEFRSFSPFLKQHGIEFRHPCHHTSQQNEIVERKHRHIVEMGLTLLAQSSMPLRFWW
        FSRFT +FPLK K E ++AF Q+K LVE + + KIK ++ D GGE++         GI+F+  C +TSQQN   ERKHRH+ E+GLTLLAQ+ MPL +WW
Subjt:  FSRFTHVFPLKTKGEALNAFIQYKALVEIKLDLKIKTLQSDWGGEFRSFSPFLKQHGIEFRHPCHHTSQQNEIVERKHRHIVEMGLTLLAQSSMPLRFWW

Query:  DAFTSAVFIINRLPTPVLNNSSPWHTLFKSHPDYTFLRVFGSACFPCLRPYQSHKFQFHSTNVFSLAIVSITKG
        +AF++AV++INRLP+ V  N SP+  +FK  PDYT L+ FG AC+PCL+PY  HK QFH+T    L   +  KG
Subjt:  DAFTSAVFIINRLPTPVLNNSSPWHTLFKSHPDYTFLRVFGSACFPCLRPYQSHKFQFHSTNVFSLAIVSITKG

TrEMBL top hits (hit; e value; % identity; alignment)
A0A2K3LCM1 Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment); e value 4.9e-88; identity 31.92%
Query:  ILTPPQHSTHVDGYVLGTKSRPSEFLESPGEAGKLQLTPNPKFEEWTTMDQALSGWLFGSMTPAVTADAVSFKTSKEVWKALEQMYGATSKARINQLRGT
        ++ P      +DGY+LG K  P EF+ +   + K     NP+FE+W   DQ L GWL  SMT  +    +  +TS ++W   + + GA ++++I  L+  
Subjt:  ILTPPQHSTHVDGYVLGTKSRPSEFLESPGEAGKLQLTPNPKFEEWTTMDQALSGWLFGSMTPAVTADAVSFKTSKEVWKALEQMYGATSKARINQLRGT

Query:  LQNTKKGSTKMLDYLATMKQASENLQLAGAPVSLSDLISYVFGGLDAEYIPIICTIQEKEISSWQELHAILQTM------------------LIISKETS
          +T+KG  KM DYL  MK  ++ L+LAG P+S SDLI     GLD+EY P++  + ++   SW +L A L T                     ++K++ 
Subjt:  LQNTKKGSTKMLDYLATMKQASENLQLAGAPVSLSDLISYVFGGLDAEYIPIICTIQEKEISSWQELHAILQTM------------------LIISKETS

Query:  Q-------------DNNKIRGKTIKEIKTGDIRTISPQTISPQTMVLEIATILEVEGEEDTEIKERFDEDFN-NPHGTSNKGNGENLAYLATSEIVCDPN
                       NN  RG   +  + G  R  S +T            +  ++     +   RFD+ ++ + H  +N   G + A+LA+   + D +
Subjt:  Q-------------DNNKIRGKTIKEIKTGDIRTISPQTISPQTMVLEIATILEVEGEEDTEIKERFDEDFN-NPHGTSNKGNGENLAYLATSEIVCDPN

Query:  WLADSGATSHVTR--------GDHRS-----VGVGFRLKPTPTTD---------------------MSVSGGRFLSSVLAVFGE----PTDRQVG-----
        W  DSGA++HVT          +H       VG G +L+   T                       +SVS     +++L  F E      D+  G     
Subjt:  WLADSGATSHVTR--------GDHRS-----VGVGFRLKPTPTTD---------------------MSVSGGRFLSSVLAVFGE----PTDRQVG-----

Query:  -------FTLRKTDS-------DRNYKKTDRP---------------------------------------TSVRRAKRPLELIHCDLWGPSPIPSTAVY
               + L + DS       +  ++K   P                                       TS   AK  LEL+H D+WGP+PI S++ +
Subjt:  -------FTLRKTDS-------DRNYKKTDRP---------------------------------------TSVRRAKRPLELIHCDLWGPSPIPSTAVY

Query:  RYYISLVDNFSRFTHVFPLKTKGEALNAFIQYKALVEIKLDLKIKTLQSDWGGEFRSFSPFLKQHGIEFRHPCHHTSQQNEIVERKHRHIVEMGLTLLAQ
        +YY+  +D+F+RFT ++PLK K +  +AFIQ+K +VE +   KIKT+Q D GGE++       + GI+FR  C +TSQQN   ERKHRHI E GLTLLAQ
Subjt:  RYYISLVDNFSRFTHVFPLKTKGEALNAFIQYKALVEIKLDLKIKTLQSDWGGEFRSFSPFLKQHGIEFRHPCHHTSQQNEIVERKHRHIVEMGLTLLAQ

Query:  SSMPLRFWWDAFTSAVFIINRLPTPVLNNSSPWHTLFKSHPDYTFLRVFGSACFPCLRPYQSHKFQFHSTNVFSLAIVSITKG
        + MPL +WW+AF++AV++INRLP+ V +N SP+  L K  PDY  L+ FG AC+P L+PY  HK QFH+T    L   +  KG
Subjt:  SSMPLRFWWDAFTSAVFIINRLPTPVLNNSSPWHTLFKSHPDYTFLRVFGSACFPCLRPYQSHKFQFHSTNVFSLAIVSITKG

A0A2K3MUJ9 Putative retrotransposon Ty1-copia subclass protein (Fragment); e value 2.1e-86; identity 33.09%
Query:  DGYVLGTKSRPSEFLESPGEAGKLQLTPNPKFEEWTTMDQALSGWLFGSMTPAVTADAVSFKTSKEVWKALEQMYGATSKARINQLRGTLQNTKKGSTKM
        DGY+LGTK  P +F+ S     K+    NP +++W   DQAL GWL  SMT  +    +  +TSK++W   + + GA +++RI  L+    NT K   KM
Subjt:  DGYVLGTKSRPSEFLESPGEAGKLQLTPNPKFEEWTTMDQALSGWLFGSMTPAVTADAVSFKTSKEVWKALEQMYGATSKARINQLRGTLQNTKKGSTKM

Query:  LDYLATMKQASENLQLAGAPVSLSDLISYVFGGLDAEYIPIICTIQEKEISSWQELHA------------------ILQTMLIISKETSQDNNKI-----
          YLA MK  ++ L+LAG+P+S SDL+     GLD+EY P++  + ++   SW +  A                   L      + +     NK      
Subjt:  LDYLATMKQASENLQLAGAPVSLSDLISYVFGGLDAEYIPIICTIQEKEISSWQELHA------------------ILQTMLIISKETSQDNNKI-----

Query:  -RGKTIKEIKTGDIRTISPQTISPQTMVLEIATILEVEGEEDTEIKERFDEDFNNPHGTSNKGNGENLAYLATSEIVCDPNWLADSGATSHVTRGDHRS-
         RG   + ++ G  R    +   P      I  I    G    +   RFD+ +   +  + +G G + A++A+     D  W  DSGA++HVT   H+S 
Subjt:  -RGKTIKEIKTGDIRTISPQTISPQTMVLEIATILEVEGEEDTEIKERFDEDFNNPHGTSNKGNGENLAYLATSEIVCDPNWLADSGATSHVTRGDHRS-

Query:  ---------------VGVGFRLK--------------------PTPTTD-MSVSGGRFLSSVLAVFGE--------------------------------
                       VG G +LK                    P  T + +SVS     ++ L  F E                                
Subjt:  ---------------VGVGFRLK--------------------PTPTTD-MSVSGGRFLSSVLAVFGE--------------------------------

Query:  PTD--------------RQVGF----TLRKTDSDRNYK--KTDR-----------------PTSVRRAKRPLELIHCDLWGPSPIPSTAVYRYYISLVDN
        PT+              R++G      L K   D N K   +D+                  TS   AK PL+LIH D+WGP+PI S + ++YY+  +D+
Subjt:  PTD--------------RQVGF----TLRKTDSDRNYK--KTDR-----------------PTSVRRAKRPLELIHCDLWGPSPIPSTAVYRYYISLVDN

Query:  FSRFTHVFPLKTKGEALNAFIQYKALVEIKLDLKIKTLQSDWGGEFRSFSPFLKQHGIEFRHPCHHTSQQNEIVERKHRHIVEMGLTLLAQSSMPLRFWW
        FSRFT +FPLK K E ++AF Q+K LVE + + KIK ++ D GGE++         GI+F+  C +TSQQN   ERKHRH+ E+GLTLLAQ+ MPL +WW
Subjt:  FSRFTHVFPLKTKGEALNAFIQYKALVEIKLDLKIKTLQSDWGGEFRSFSPFLKQHGIEFRHPCHHTSQQNEIVERKHRHIVEMGLTLLAQSSMPLRFWW

Query:  DAFTSAVFIINRLPTPVLNNSSPWHTLFKSHPDYTFLRVFGSACFPCLRPYQSHKFQFHSTNVFSLAIVSITKG
        +AF++AV++INRLP+ V  N SP+  +FK  PDYT L+ FG AC+PCL+PY  HK QFH+T    L   +  KG
Subjt:  DAFTSAVFIINRLPTPVLNNSSPWHTLFKSHPDYTFLRVFGSACFPCLRPYQSHKFQFHSTNVFSLAIVSITKG

A0A2Z6M732 Integrase catalytic domain-containing protein; e value 2.3e-90; identity 34.95%
Query:  ILTPPQHSTHVDGYVLGTKSRPSEFLESPGEAGKLQLTPNPKFEEWTTMDQALSGWLFGSMTPAVTADAVSFKTSKEVWKALEQMYGATSKARINQLRGT
        ++ P      +DGY+LGTK  P +F+ +     K+    NP++EEW   DQAL GWL  SM   +    +  +TSKE+W   + + GA +++RI  L+  
Subjt:  ILTPPQHSTHVDGYVLGTKSRPSEFLESPGEAGKLQLTPNPKFEEWTTMDQALSGWLFGSMTPAVTADAVSFKTSKEVWKALEQMYGATSKARINQLRGT

Query:  LQNTKKGSTKMLDYLATMKQASENLQLAGAPVSLSDLISYVFGGLDAEYIPIICTIQEKEISSWQELHAILQTMLIISKETSQDNNKIRGKTIKEIKTGD
          NT+KG  KM  YL  MK  ++ L++AG+P+S SDL+     GLDAE+ P++  + ++   SW +L A    +L       Q NN I            
Subjt:  LQNTKKGSTKMLDYLATMKQASENLQLAGAPVSLSDLISYVFGGLDAEYIPIICTIQEKEISSWQELHAILQTMLIISKETSQDNNKIRGKTIKEIKTGD

Query:  IRTISPQTISPQTMVLEIATILEVEGEEDTEIKERFDEDFNNPHGTSNKGNGENLAYLATSEIVCDPNWLADSGATSHVTRGDHRSVGVGFRLKPTPTTD
               ++S  T     +       +  T    R       P+  +    G + A++ +     D  W  DSGA++HVT  + +   +           
Subjt:  IRTISPQTISPQTMVLEIATILEVEGEEDTEIKERFDEDFNNPHGTSNKGNGENLAYLATSEIVCDPNWLADSGATSHVTRGDHRSVGVGFRLKPTPTTD

Query:  MSVSGGRFLSSVLAVFGEPTDRQVGFTLRKTDSDRNYKKTDRPTSVRRAKRPLELIHCDLWGPSPIPSTAVYRYYISLVDNFSRFTHVFPLKTKGEALNA
        M ++  +F    L  F                           +S   AK PL+LIH D+WGP+PI S++  +YY+  +D+FSRFT +FPLK K E ++A
Subjt:  MSVSGGRFLSSVLAVFGEPTDRQVGFTLRKTDSDRNYKKTDRPTSVRRAKRPLELIHCDLWGPSPIPSTAVYRYYISLVDNFSRFTHVFPLKTKGEALNA

Query:  FIQYKALVEIKLDLKIKTLQSDWGGEFRSFSPFLKQHGIEFRHPCHHTSQQNEIVERKHRHIVEMGLTLLAQSSMPLRFWWDAFTSAVFIINRLPTPVLN
        F Q+K +VE + + KI+ +Q D GGE++       + GI+FR  C +TSQQN   ERKHRH+ E+G+TLLAQ+ MPL +WW+AF+++V++INRLP+ V  
Subjt:  FIQYKALVEIKLDLKIKTLQSDWGGEFRSFSPFLKQHGIEFRHPCHHTSQQNEIVERKHRHIVEMGLTLLAQSSMPLRFWWDAFTSAVFIINRLPTPVLN

Query:  NSSPWHTLFKSHPDYTFLRVFGSACFPCLRPYQSHKFQFHSTNVFSLAIVSITKG
        N SP+  LF+  PDY  L+ FG AC+PCL+PY  HK QFH+T    L   +  KG
Subjt:  NSSPWHTLFKSHPDYTFLRVFGSACFPCLRPYQSHKFQFHSTNVFSLAIVSITKG

A0A803PM38 Uncharacterized protein; e value 6.6e-93; identity 34.72%
Query:  VDGYVLGTKSRPSEFLESPGEAGKLQLT--PNPKFEEWTTMDQALSGWLFGSMTPAVTADAVSFKTSKEVWKALEQMYGATSKARINQLRGTLQNTKKGS
        +DGY+ GT  +P EFL S    G +      NP FE+W   DQ L GWL+GSMT  +  + +   +S  +W ALE+++GA SKA++++ R  +Q  +KG+
Subjt:  VDGYVLGTKSRPSEFLESPGEAGKLQLT--PNPKFEEWTTMDQALSGWLFGSMTPAVTADAVSFKTSKEVWKALEQMYGATSKARINQLRGTLQNTKKGS

Query:  TKMLDYLATMKQASENLQLAGAPVSLSDLISYVFGGLDAEYIPIICTIQEKEISSWQELHAILQTM-------------------------LIISK----
          M DYL   +Q ++ L LAG P   + L+S V  GLD EY+P++  I+ +  ++WQ+L  +L ++                          + +K    
Subjt:  TKMLDYLATMKQASENLQLAGAPVSLSDLISYVFGGLDAEYIPIICTIQEKEISSWQELHAILQTM-------------------------LIISK----

Query:  ---ETSQDNNKIRGKTIKEIKTGDIRTISPQTISPQTMVLE--------------------IATILEVEGEEDTEIKERFDEDFNNPHGTSNKGNGE---
             + +NN   G +         R    +T  P+                          + I ++  +E+   KE+      N     + G G    
Subjt:  ---ETSQDNNKIRGKTIKEIKTGDIRTISPQTISPQTMVLE--------------------IATILEVEGEEDTEIKERFDEDFNNPHGTSNKGNGE---

Query:  -NLAYLATSEIVCDP----NWLADSGATS-------------HVTRGDHRSVGVGFRLK--------PTPTTDMSVS---------GGRFLSSVLA----
         + + L   EI+  P    N L+ S  TS              V   +   V +  +LK        PT TT MS +          G  +S+V +    
Subjt:  -NLAYLATSEIVCDP----NWLADSGATS-------------HVTRGDHRSVGVGFRLK--------PTPTTDMSVS---------GGRFLSSVLA----

Query:  ----------------VFGEPTDRQVGFTLRK------------TDSDRNYKKTDRPTSV--RRAKRPLELIHCDLWGPSPIPSTAVYRYYISLVDNFSR
                          G P+ R +   L K             D+ +  K    P  V  +RA  PLEL+H D+WGPSPI S   +RYYI  +D+FSR
Subjt:  ----------------VFGEPTDRQVGFTLRK------------TDSDRNYKKTDRPTSV--RRAKRPLELIHCDLWGPSPIPSTAVYRYYISLVDNFSR

Query:  FTHVFPLKTKGEALNAFIQYKALVEIKLDLKIKTLQSDWGGEFRSFSPFLKQHGIEFRHPCHHTSQQNEIVERKHRHIVEMGLTLLAQSSMPLRFWWDAF
        +T ++PLK K EAL AF+Q+K LVE + + ++K +Q+DWGGE++ F  F   HGI F+HPC HTS QN   ERKHRHIVEMGLTLLAQ+ +P ++WWDAF
Subjt:  FTHVFPLKTKGEALNAFIQYKALVEIKLDLKIKTLQSDWGGEFRSFSPFLKQHGIEFRHPCHHTSQQNEIVERKHRHIVEMGLTLLAQSSMPLRFWWDAF

Query:  TSAVFIINRLPTPVLNNSSPWHTLFKSHPDYTFLRVFGSACFPCLRPYQSHKFQFHSTNVFSLAIVSITKG
         +AV++INRLPTPVL   +P+  LFK  PDY FL+VFG +CFPCLR YQ+HKFQFHST   +L      KG
Subjt:  TSAVFIINRLPTPVLNNSSPWHTLFKSHPDYTFLRVFGSACFPCLRPYQSHKFQFHSTNVFSLAIVSITKG

A0A803QCY3 Uncharacterized protein; e value 1.7e-96; identity 35.57%
Query:  VDGYVLGTKSRPSEFL--ESPGEAGKLQLTPNPKFEEWTTMDQALSGWLFGSMTPAVTADAVSFKTSKEVWKALEQMYGATSKARINQLRGTLQNTKKGS
        +DG++ GT   PSE++   S  +  K   T NP+FE W   DQ L GWL+ SMT  +  + +   ++  +W ALEQ+YGA SK++++  R  +Q TKKG 
Subjt:  VDGYVLGTKSRPSEFL--ESPGEAGKLQLTPNPKFEEWTTMDQALSGWLFGSMTPAVTADAVSFKTSKEVWKALEQMYGATSKARINQLRGTLQNTKKGS

Query:  TKMLDYLATMKQASENLQLAGAPVSLSDLISYVFGGLDAEYIPIICTIQEKEISSWQELHAILQTMLIISKETSQDNNKIRGKTIKEIKTGDIRTISPQT
        T M++YL   K  +++L LAG P   + L + V   LD  Y+ ++  I+ +  +SWQEL  +L +     +   + NN  RG+       G  R      
Subjt:  TKMLDYLATMKQASENLQLAGAPVSLSDLISYVFGGLDAEYIPIICTIQEKEISSWQELHAILQTMLIISKETSQDNNKIRGKTIKEIKTGDIRTISPQT

Query:  ISPQTMVLEIATILEVEGEED---TEIKERFDEDF--NNPHGTSNKGNGEN----LAYLATSEIVCDPNWLADSGATSHVTR-----------GDHRSVG
         S  T         +V G+ D         FD+ +  ++PH ++    G+N     A++AT E +    W ADSGA++++T            G    V 
Subjt:  ISPQTMVLEIATILEVEGEED---TEIKERFDEDF--NNPHGTSNKGNGEN----LAYLATSEIVCDPNWLADSGATSHVTR-----------GDHRSVG

Query:  VG-------------------------------------FRLKPTPTTDMSV-----SGGRFLSSVL------------AVFGEPTDRQVGFTLRKTD--
        VG                                     F      TTD  V     S   F+  +              ++   T R     LR +   
Subjt:  VG-------------------------------------FRLKPTPTTDMSV-----SGGRFLSSVL------------AVFGEPTDRQVGFTLRKTD--

Query:  ---SDRNYKKTDR---------------PTSVRRAKRPLELIHCDLWGPSPIPSTAVYRYYISLVDNFSRFTHVFPLKTKGEALNAFIQYKALVEIKLDL
           SD N    D                  S  +A + L+L+H DLWGPSPI S   ++YY+  VD+ +RFT ++PLK K EA +AF+ +K+L E + + 
Subjt:  ---SDRNYKKTDR---------------PTSVRRAKRPLELIHCDLWGPSPIPSTAVYRYYISLVDNFSRFTHVFPLKTKGEALNAFIQYKALVEIKLDL

Query:  KIKTLQSDWGGEFRSFSPFLKQHGIEFRHPCHHTSQQNEIVERKHRHIVEMGLTLLAQSSMPLRFWWDAFTSAVFIINRLPTPVLNNSSPWHTLFKSHPD
        KIK L++D GGE++  S F+  HGI F H C HTS QN   ERKHRHIVEMGLTLLAQS MPL++WWDAF++AV++INRLPTP+L++ +P+  L K  PD
Subjt:  KIKTLQSDWGGEFRSFSPFLKQHGIEFRHPCHHTSQQNEIVERKHRHIVEMGLTLLAQSSMPLRFWWDAFTSAVFIINRLPTPVLNNSSPWHTLFKSHPD

Query:  YTFLRVFGSACFPCLRPYQSHKFQFHSTNVFSLAIVSITKG
        Y FL+ FG ACFPCLRPYQ+HKFQFHS    +L      KG
Subjt:  YTFLRVFGSACFPCLRPYQSHKFQFHSTNVFSLAIVSITKG

SwissProt top hits (hit; e value; % identity; alignment)
P59223 40S ribosomal protein S13-1; e value 3.3e-25; identity 67.71%
Query:  ESMMPKGKEPRGVNQTPTLKHQLDGFPLTIQVKSVTGSKILRILKAHGLTPEISEDLYHLIKKVVSIKKHLERNRKDKDSKFRLILVESRIHCLTR
        ES+    K+    +Q   +     G P   QVKSVTGSKILRILKAHGL PEI EDLYHLIKK V+I+KHLERNRKDKDSKFRLILVESRIH L R
Subjt:  ESMMPKGKEPRGVNQTPTLKHQLDGFPLTIQVKSVTGSKILRILKAHGLTPEISEDLYHLIKKVVSIKKHLERNRKDKDSKFRLILVESRIHCLTR

P59224 40S ribosomal protein S13-2; e value 3.3e-25; identity 67.71%
Query:  ESMMPKGKEPRGVNQTPTLKHQLDGFPLTIQVKSVTGSKILRILKAHGLTPEISEDLYHLIKKVVSIKKHLERNRKDKDSKFRLILVESRIHCLTR
        ES+    K+    +Q   +     G P   QVKSVTGSKILRILKAHGL PEI EDLYHLIKK V+I+KHLERNRKDKDSKFRLILVESRIH L R
Subjt:  ESMMPKGKEPRGVNQTPTLKHQLDGFPLTIQVKSVTGSKILRILKAHGLTPEISEDLYHLIKKVVSIKKHLERNRKDKDSKFRLILVESRIHCLTR

Q05761 40S ribosomal protein S13; e value 4.3e-25; identity 62.07%
Query:  SLPYTFTLP--LLALLQNWESMMPK-GKEPRGVNQTPTLKHQLDGFPLTIQVKSVTGSKILRILKAHGLTPEISEDLYHLIKKVVSIKKHLERNRKDKDS
        +LPY  T P  L     + E M+ K  K+ +  +Q   L     G PL   VKSVTGSKILRILKAHGL PEI EDLY LIKK V+I+KHLERNRKDKDS
Subjt:  SLPYTFTLP--LLALLQNWESMMPK-GKEPRGVNQTPTLKHQLDGFPLTIQVKSVTGSKILRILKAHGLTPEISEDLYHLIKKVVSIKKHLERNRKDKDS

Query:  KFRLILVESRIHCLTR
        KFRLILVESRIH L R
Subjt:  KFRLILVESRIHCLTR

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value 2.0e-46; identity 48.96%
Query:  SVRRAKRPLELIHCDLWGPSPIPSTAVYRYYISLVDNFSRFTHVFPLKTKGEALNAFIQYKALVEIKLDLKIKTLQSDWGGEFRSFSPFLKQHGIEFRHP
        S   + RPLE I+ D+W  SPI S   YRYY+  VD+F+R+T ++PLK K +    FI +K L+E +   +I T  SD GGEF +   +  QHGI     
Subjt:  SVRRAKRPLELIHCDLWGPSPIPSTAVYRYYISLVDNFSRFTHVFPLKTKGEALNAFIQYKALVEIKLDLKIKTLQSDWGGEFRSFSPFLKQHGIEFRHP

Query:  CHHTSQQNEIVERKHRHIVEMGLTLLAQSSMPLRFWWDAFTSAVFIINRLPTPVLNNSSPWHTLFKSHPDYTFLRVFGSACFPCLRPYQSHK
          HT + N + ERKHRHIVE GLTLL+ +S+P  +W  AF  AV++INRLPTP+L   SP+  LF + P+Y  LRVFG AC+P LRPY  HK
Subjt:  CHHTSQQNEIVERKHRHIVEMGLTLLAQSSMPLRFWWDAFTSAVFIINRLPTPVLNNSSPWHTLFKSHPDYTFLRVFGSACFPCLRPYQSHK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value 8.1e-48; identity 50%
Query:  AKRPLELIHCDLWGPSPIPSTAVYRYYISLVDNFSRFTHVFPLKTKGEALNAFIQYKALVEIKLDLKIKTLQSDWGGEFRSFSPFLKQHGIEFRHPCHHT
        + +PLE I+ D+W  SPI S   YRYY+  VD+F+R+T ++PLK K +  + FI +K+LVE +   +I TL SD GGEF     +L QHGI       HT
Subjt:  AKRPLELIHCDLWGPSPIPSTAVYRYYISLVDNFSRFTHVFPLKTKGEALNAFIQYKALVEIKLDLKIKTLQSDWGGEFRSFSPFLKQHGIEFRHPCHHT

Query:  SQQNEIVERKHRHIVEMGLTLLAQSSMPLRFWWDAFTSAVFIINRLPTPVLNNSSPWHTLFKSHPDYTFLRVFGSACFPCLRPYQSHKFQ
         + N + ERKHRHIVEMGLTLL+ +S+P  +W  AF+ AV++INRLPTP+L   SP+  LF   P+Y  L+VFG AC+P LRPY  HK +
Subjt:  SQQNEIVERKHRHIVEMGLTLLAQSSMPLRFWWDAFTSAVFIINRLPTPVLNNSSPWHTLFKSHPDYTFLRVFGSACFPCLRPYQSHKFQ

Arabidopsis top hits (hit; e value; % identity; alignment)
AT3G60770.1 Ribosomal protein S13/S15; e value 2.4e-26; identity 67.71%
Query:  ESMMPKGKEPRGVNQTPTLKHQLDGFPLTIQVKSVTGSKILRILKAHGLTPEISEDLYHLIKKVVSIKKHLERNRKDKDSKFRLILVESRIHCLTR
        ES+    K+    +Q   +     G P   QVKSVTGSKILRILKAHGL PEI EDLYHLIKK V+I+KHLERNRKDKDSKFRLILVESRIH L R
Subjt:  ESMMPKGKEPRGVNQTPTLKHQLDGFPLTIQVKSVTGSKILRILKAHGLTPEISEDLYHLIKKVVSIKKHLERNRKDKDSKFRLILVESRIHCLTR

AT4G00100.1 ribosomal protein S13A; e value 2.4e-26; identity 67.71%
Query:  ESMMPKGKEPRGVNQTPTLKHQLDGFPLTIQVKSVTGSKILRILKAHGLTPEISEDLYHLIKKVVSIKKHLERNRKDKDSKFRLILVESRIHCLTR
        ES+    K+    +Q   +     G P   QVKSVTGSKILRILKAHGL PEI EDLYHLIKK V+I+KHLERNRKDKDSKFRLILVESRIH L R
Subjt:  ESMMPKGKEPRGVNQTPTLKHQLDGFPLTIQVKSVTGSKILRILKAHGLTPEISEDLYHLIKKVVSIKKHLERNRKDKDSKFRLILVESRIHCLTR

AT4G29090.1 Ribonuclease H-like superfamily protein; e value 4.3e-04; identity 25.25%
Query:  IMWRIWDHCNKAEIQNQIPSAEKLFQSIDSNLKEWE-ESYLKNQPSKRQRNLVSHAHRENSKPNCW-RLISDVAWNEKMNRKGMGWVVLDSEGSLVYFG
        ++WR+W + N+   + +  +A+++ +  + +L+EW   +  ++  +K Q N  S   R    P+ W +  +D  WN    R G+GWV+ + +G + + G
Subjt:  IMWRIWDHCNKAEIQNQIPSAEKLFQSIDSNLKEWE-ESYLKNQPSKRQRNLVSHAHRENSKPNCW-RLISDVAWNEKMNRKGMGWVVLDSEGSLVYFG

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein; e value 3.9e-05; identity 35.29%
Query:  HRHIVEMGLTLLAQSSMPLRFWWDAFTSAVFIINRLPTPVLNNSSPWHTLFKSHPDYTFLRVFGSACF
        +R I+E   ++L +  +P  F  DA  +AV IIN+ P+  +N   P    F+S P Y++LR FG   +
Subjt:  HRHIVEMGLTLLAQSSMPLRFWWDAFTSAVFIINRLPTPVLNNSSPWHTLFKSHPDYTFLRVFGSACF


Sequences
CDS sequence
ATGTTGAGAATTGTAGTATCCCTGCCTTATACATTCACTTTGCCTCTCTTAGCACTACTACAAAATTGGGAATCCATGATGCCAAAAGGAAAGGAACCAAGGGGTGTCAA
TCAAACACCAACTTTGAAACACCAACTTGATGGCTTTCCTTTGACCATTCAGGTGAAAAGTGTAACTGGGAGCAAGATCTTGCGTATATTGAAGGCTCATGGGCTAACTC
CTGAAATCTCAGAGGATCTTTACCATCTCATCAAGAAAGTTGTTTCAATCAAAAAACATTTGGAGAGAAACAGGAAAGACAAGGATTCCAAGTTTAGGTTGATTTTGGTG
GAAAGCAGGATCCATTGCCTGACCCGGAAAGGTTGGTTTGCTAAGGACTATTGGCTCTGGATGGAGGAAAATTTAAGCAAGGAAGAGCTATCTAAAGGAGTTTTGATTAT
GTGGAGGATTTGGGATCATTGTAACAAAGCAGAAATTCAGAATCAAATCCCATCAGCAGAGAAATTATTCCAGTCTATTGATTCTAACCTAAAAGAATGGGAAGAATCTT
ACCTCAAGAACCAGCCCTCGAAGAGACAGAGAAACCTTGTGAGTCATGCCCATCGAGAAAATTCCAAGCCGAACTGTTGGAGATTGATCTCAGATGTCGCCTGGAACGAG
AAGATGAACCGCAAAGGGATGGGTTGGGTCGTGCTTGACTCGGAAGGATCCTTGGTCTACTTCGGAATGCATCTAGTTATGGCAGACGAGGTCTTAAACACGCCTGAGAG
TAGCATTGCCGCCCGCCACTGTAGCTGTCACGACTCCTGTTACGGGCATCATCAGCTCTTCATTCTGACACCCCCTCAGCACAGTACTCACGTTGACGGATACGTCTTGG
GAACAAAGTCTCGGCCGTCGGAATTTCTTGAATCGCCTGGTGAAGCTGGTAAATTGCAGCTTACCCCAAACCCAAAATTTGAAGAATGGACAACGATGGACCAAGCCCTT
TCAGGATGGCTCTTTGGGTCAATGACACCAGCCGTAACAGCCGATGCAGTAAGCTTCAAAACCTCAAAAGAGGTGTGGAAAGCGTTGGAACAGATGTATGGCGCAACCAG
CAAAGCAAGGATTAATCAGCTTCGAGGTACCTTGCAAAATACTAAGAAAGGGTCTACTAAAATGTTAGATTATTTGGCTACAATGAAACAGGCATCGGAAAACCTTCAGC
TGGCTGGGGCTCCTGTTTCTCTATCGGATTTGATTTCATATGTTTTTGGTGGCTTGGATGCTGAGTATATCCCAATCATCTGCACTATTCAAGAGAAAGAAATCTCTTCT
TGGCAAGAACTACATGCAATTTTGCAAACTATGCTTATAATAAGCAAGGAAACTTCTCAGGACAACAACAAAATTCGGGGCAAAACAATCAAAGAAATCAAAACAGGGGA
TATCAGAACTATCAGCCCTCAAACTATTAGCCCTCAAACTATGGTTCTCGAAATAGCAACAATACTAGAGGTAGAGGGCGAGGAAGATACAGAAATCAAAGAGCGTTTTG
ATGAAGATTTCAACAACCCACATGGAACGAGCAACAAAGGCAATGGTGAGAACTTAGCCTATTTAGCAACCTCTGAGATTGTTTGTGATCCAAATTGGCTGGCCGATAGT
GGTGCTACAAGCCATGTCACTAGGGGTGATCATCGATCGGTCGGCGTCGGTTTTAGGCTCAAACCGACGCCGACCACCGACATGTCGGTTTCCGGCGGTCGGTTCTTGTC
GTCGGTTCTCGCTGTTTTTGGCGAACCGACCGACCGACAAGTCGGTTTTACTTTACGTAAAACCGATTCCGACCGAAATTATAAGAAAACCGACCGACCGACGTCGGTTC
GTCGTGCTAAGAGACCACTTGAACTTATTCATTGTGACCTTTGGGGGCCCTCACCCATTCCTTCGACTGCTGTTTATAGATATTACATTAGCCTAGTTGACAATTTCAGT
CGCTTTACACATGTTTTTCCTCTAAAAACAAAAGGAGAAGCTTTAAATGCCTTCATCCAATACAAAGCTTTAGTTGAGATTAAACTTGACCTTAAGATTAAAACTCTCCA
AAGTGATTGGGGTGGTGAGTTTCGTAGTTTTTCCCCTTTTCTCAAACAACACGGTATAGAATTTAGACACCCTTGCCACCACACTAGCCAACAAAACGAAATTGTTGAAC
GAAAGCATCGTCACATAGTTGAAATGGGCCTCACACTTCTTGCTCAATCATCCATGCCACTTAGGTTTTGGTGGGATGCCTTTACCTCAGCTGTGTTCATTATAAACCGC
CTCCCAACCCCTGTCCTTAACAACTCTTCCCCTTGGCACACCCTTTTTAAATCCCACCCTGACTATACCTTCCTTCGTGTTTTTGGTTCAGCCTGCTTCCCTTGCTTAAG
ACCATATCAATCCCATAAGTTCCAATTCCATTCCACAAATGTGTTTTCCTTGGCTATAGTGAGTATCACAAAAGGTATCGATGTTTAA
mRNA sequence
ATGTTGAGAATTGTAGTATCCCTGCCTTATACATTCACTTTGCCTCTCTTAGCACTACTACAAAATTGGGAATCCATGATGCCAAAAGGAAAGGAACCAAGGGGTGTCAA
TCAAACACCAACTTTGAAACACCAACTTGATGGCTTTCCTTTGACCATTCAGGTGAAAAGTGTAACTGGGAGCAAGATCTTGCGTATATTGAAGGCTCATGGGCTAACTC
CTGAAATCTCAGAGGATCTTTACCATCTCATCAAGAAAGTTGTTTCAATCAAAAAACATTTGGAGAGAAACAGGAAAGACAAGGATTCCAAGTTTAGGTTGATTTTGGTG
GAAAGCAGGATCCATTGCCTGACCCGGAAAGGTTGGTTTGCTAAGGACTATTGGCTCTGGATGGAGGAAAATTTAAGCAAGGAAGAGCTATCTAAAGGAGTTTTGATTAT
GTGGAGGATTTGGGATCATTGTAACAAAGCAGAAATTCAGAATCAAATCCCATCAGCAGAGAAATTATTCCAGTCTATTGATTCTAACCTAAAAGAATGGGAAGAATCTT
ACCTCAAGAACCAGCCCTCGAAGAGACAGAGAAACCTTGTGAGTCATGCCCATCGAGAAAATTCCAAGCCGAACTGTTGGAGATTGATCTCAGATGTCGCCTGGAACGAG
AAGATGAACCGCAAAGGGATGGGTTGGGTCGTGCTTGACTCGGAAGGATCCTTGGTCTACTTCGGAATGCATCTAGTTATGGCAGACGAGGTCTTAAACACGCCTGAGAG
TAGCATTGCCGCCCGCCACTGTAGCTGTCACGACTCCTGTTACGGGCATCATCAGCTCTTCATTCTGACACCCCCTCAGCACAGTACTCACGTTGACGGATACGTCTTGG
GAACAAAGTCTCGGCCGTCGGAATTTCTTGAATCGCCTGGTGAAGCTGGTAAATTGCAGCTTACCCCAAACCCAAAATTTGAAGAATGGACAACGATGGACCAAGCCCTT
TCAGGATGGCTCTTTGGGTCAATGACACCAGCCGTAACAGCCGATGCAGTAAGCTTCAAAACCTCAAAAGAGGTGTGGAAAGCGTTGGAACAGATGTATGGCGCAACCAG
CAAAGCAAGGATTAATCAGCTTCGAGGTACCTTGCAAAATACTAAGAAAGGGTCTACTAAAATGTTAGATTATTTGGCTACAATGAAACAGGCATCGGAAAACCTTCAGC
TGGCTGGGGCTCCTGTTTCTCTATCGGATTTGATTTCATATGTTTTTGGTGGCTTGGATGCTGAGTATATCCCAATCATCTGCACTATTCAAGAGAAAGAAATCTCTTCT
TGGCAAGAACTACATGCAATTTTGCAAACTATGCTTATAATAAGCAAGGAAACTTCTCAGGACAACAACAAAATTCGGGGCAAAACAATCAAAGAAATCAAAACAGGGGA
TATCAGAACTATCAGCCCTCAAACTATTAGCCCTCAAACTATGGTTCTCGAAATAGCAACAATACTAGAGGTAGAGGGCGAGGAAGATACAGAAATCAAAGAGCGTTTTG
ATGAAGATTTCAACAACCCACATGGAACGAGCAACAAAGGCAATGGTGAGAACTTAGCCTATTTAGCAACCTCTGAGATTGTTTGTGATCCAAATTGGCTGGCCGATAGT
GGTGCTACAAGCCATGTCACTAGGGGTGATCATCGATCGGTCGGCGTCGGTTTTAGGCTCAAACCGACGCCGACCACCGACATGTCGGTTTCCGGCGGTCGGTTCTTGTC
GTCGGTTCTCGCTGTTTTTGGCGAACCGACCGACCGACAAGTCGGTTTTACTTTACGTAAAACCGATTCCGACCGAAATTATAAGAAAACCGACCGACCGACGTCGGTTC
GTCGTGCTAAGAGACCACTTGAACTTATTCATTGTGACCTTTGGGGGCCCTCACCCATTCCTTCGACTGCTGTTTATAGATATTACATTAGCCTAGTTGACAATTTCAGT
CGCTTTACACATGTTTTTCCTCTAAAAACAAAAGGAGAAGCTTTAAATGCCTTCATCCAATACAAAGCTTTAGTTGAGATTAAACTTGACCTTAAGATTAAAACTCTCCA
AAGTGATTGGGGTGGTGAGTTTCGTAGTTTTTCCCCTTTTCTCAAACAACACGGTATAGAATTTAGACACCCTTGCCACCACACTAGCCAACAAAACGAAATTGTTGAAC
GAAAGCATCGTCACATAGTTGAAATGGGCCTCACACTTCTTGCTCAATCATCCATGCCACTTAGGTTTTGGTGGGATGCCTTTACCTCAGCTGTGTTCATTATAAACCGC
CTCCCAACCCCTGTCCTTAACAACTCTTCCCCTTGGCACACCCTTTTTAAATCCCACCCTGACTATACCTTCCTTCGTGTTTTTGGTTCAGCCTGCTTCCCTTGCTTAAG
ACCATATCAATCCCATAAGTTCCAATTCCATTCCACAAATGTGTTTTCCTTGGCTATAGTGAGTATCACAAAAGGTATCGATGTTTAA
Protein sequence
MLRIVVSLPYTFTLPLLALLQNWESMMPKGKEPRGVNQTPTLKHQLDGFPLTIQVKSVTGSKILRILKAHGLTPEISEDLYHLIKKVVSIKKHLERNRKDKDSKFRLILV
ESRIHCLTRKGWFAKDYWLWMEENLSKEELSKGVLIMWRIWDHCNKAEIQNQIPSAEKLFQSIDSNLKEWEESYLKNQPSKRQRNLVSHAHRENSKPNCWRLISDVAWNE
KMNRKGMGWVVLDSEGSLVYFGMHLVMADEVLNTPESSIAARHCSCHDSCYGHHQLFILTPPQHSTHVDGYVLGTKSRPSEFLESPGEAGKLQLTPNPKFEEWTTMDQAL
SGWLFGSMTPAVTADAVSFKTSKEVWKALEQMYGATSKARINQLRGTLQNTKKGSTKMLDYLATMKQASENLQLAGAPVSLSDLISYVFGGLDAEYIPIICTIQEKEISS
WQELHAILQTMLIISKETSQDNNKIRGKTIKEIKTGDIRTISPQTISPQTMVLEIATILEVEGEEDTEIKERFDEDFNNPHGTSNKGNGENLAYLATSEIVCDPNWLADS
GATSHVTRGDHRSVGVGFRLKPTPTTDMSVSGGRFLSSVLAVFGEPTDRQVGFTLRKTDSDRNYKKTDRPTSVRRAKRPLELIHCDLWGPSPIPSTAVYRYYISLVDNFS
RFTHVFPLKTKGEALNAFIQYKALVEIKLDLKIKTLQSDWGGEFRSFSPFLKQHGIEFRHPCHHTSQQNEIVERKHRHIVEMGLTLLAQSSMPLRFWWDAFTSAVFIINR
LPTPVLNNSSPWHTLFKSHPDYTFLRVFGSACFPCLRPYQSHKFQFHSTNVFSLAIVSITKGIDV
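
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (the page does not say which table was used). The names `CODON_TABLE` and `translate` are hypothetical helpers, and only the first and last codons of the CDS are pasted in for brevity.

```python
# Standard genetic code, codons ordered with bases T, C, A, G
# at each of the three positions (first position varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 36 and last 12 nucleotides of the CDS listed above.
cds_start = "ATGTTGAGAATTGTAGTATCCCTGCCTTATACATTC"
cds_end = "ATCGATGTTTAA"

print(translate(cds_start))  # MLRIVVSLPYTF
print(translate(cds_end))    # IDV
```

Both fragments agree with the start (MLRIVVSLPYTF) and end (IDV) of the protein sequence above. Passing the full CDS to `translate` would allow the same comparison over the whole record.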