; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021678 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021678
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr7:10609218..10611654
RNA-Seq ExpressionLag0021678
SyntenyLag0021678
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_030477911.1 uncharacterized protein LOC115694948 [Cannabis sativa]7.7e-8032.85Show/hide
Query:  KGRRGWAALKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARGDI
        +GR G++ALKLDMSKA+DRVEW +LE  M  +GF   +V LIM+C+ + ++SF  NGD  G++ PSRGLRQGDPLSPYLFLIC+EGLSR+L+++E++ ++
Subjt:  KGRRGWAALKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARGDI

Query:  KGLRISRWSPSINHLFFADDCFLFFRANVDEA----------HE----------------------------------IADC------------------
        +GL I+R +PSI+HL FADD  LF +A+   A          H+                                  I DC                  
Subjt:  KGLRISRWSPSINHLFFADDCFLFFRANVDEA----------HE----------------------------------IADC------------------

Query:  -----LRLYALLTGWKHKSFSMGGKEVLIKAVLQVIPTYSMSCFKLPKESKGSDSFVWKSLMWGQDLLRMRVRWRVGNDDVGYVLGIPRPRIDKSDARMW
              +++ LL  W  K FS+GGKEVL+KAV+Q IPTY+MSCFKL K+       +  +  WG +                          +  D  +W
Subjt:  -----LRLYALLTGWKHKSFSMGGKEVLIKAVLQVIPTYSMSCFKLPKESKGSDSFVWKSLMWGQDLLRMRVRWRVGNDDVGYVLGIPRPRIDKSDARMW

Query:  HYEKRGRYKVRSGYLLASGLRHSSGGSDGEKMRCWWNFWWNRRIPSKAKHFGWRLFHDMLPTEDNLRRRGVDLQRGCPMCKSKVEMPLHAIWECKWARKI
        H+   G Y V+SG+ LA+ L      S  +  R WW ++W+ ++P K + F W++F ++LPT   L +R +     C +C S  E   HA++ C+ A+ +
Subjt:  HYEKRGRYKVRSGYLLASGLRHSSGGSDGEKMRCWWNFWWNRRIPSKAKHFGWRLFHDMLPTEDNLRRRGVDLQRGCPMCKSKVEMPLHAIWECKWARKI

Query:  WRTSPFSVEGWLHNVSSAADVLFRGMELLVV------EDFEKFFMLCWWIRNKRNQEVFSKRPLNVEQTDAWEWITNYLTQFQGFLC--RSVGVQG----
        W+ S F ++       + A  +F+G  L  +      EDFE F  L W +   RN+     +  +     A  + T +   F       + V V G    
Subjt:  WRTSPFSVEGWLHNVSSAADVLFRGMELLVV------EDFEKFFMLCWWIRNKRNQEVFSKRPLNVEQTDAWEWITNYLTQFQGFLC--RSVGVQG----

Query:  ------EVSEGHLGWQPPMYPFLKLNTDAAIRQNLQRSGVGAVVRDEKGEIMGFLEK
               V    + W PP     KLN DAA     ++ G+GA++RD  G ++  L K
Subjt:  ------EVSEGHLGWQPPMYPFLKLNTDAAIRQNLQRSGVGAVVRDEKGEIMGFLEK

XP_030498122.1 uncharacterized protein LOC115713779 [Cannabis sativa]5.9e-8029.67Show/hide
Query:  KKGRRGWAALKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARGD
        K+G +G+AA+KLDMSKA+DRVEW ++ + M  +GF    VNLI+ C+ SV+YSF  NG   G++ PSRG+RQGDPLSPYLFLICAEGLSR+L+ KE  G 
Subjt:  KKGRRGWAALKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARGD

Query:  IKGLRISRWSPSINHLFFADDCFLFFRANVDEAHEIADCLRLY---------------------------------------------------------
        + GLR+SR +PS++HLFFADD  LF RAN   A  I   L +Y                                                         
Subjt:  IKGLRISRWSPSINHLFFADDCFLFFRANVDEAHEIADCLRLY---------------------------------------------------------

Query:  ----------ALLTGWKHKSFSMGGKEVLIKAVLQVIPTYSMSCFKLPK---------------------------------------------------
                   LL+ WK + FS GGKEVL+KAV+Q IPTY+MSCF+LP                                                    
Subjt:  ----------ALLTGWKHKSFSMGGKEVLIKAVLQVIPTYSMSCFKLPK---------------------------------------------------

Query:  -----------ESKGS------------------------DSFVWKSLMWGQDLLRMRVRWRVGND----------------------------------
                   ES  S                         S  W+S++WG++LL   +RWRVG                                    
Subjt:  -----------ESKGS------------------------DSFVWKSLMWGQDLLRMRVRWRVGND----------------------------------

Query:  -------------------DVGYVLGIPRPRIDKSDARMWHYEKRGRYKVRSGYLLASGLRHSSGGSDGEKMRCWWNFWWNRRIPSKAKHFGWRLFHDML
                           D+  +L IP     K+D  +W+    G Y V+SGY  A+ L              WW+ +W  ++PSK + F W++FH++L
Subjt:  -------------------DVGYVLGIPRPRIDKSDARMWHYEKRGRYKVRSGYLLASGLRHSSGGSDGEKMRCWWNFWWNRRIPSKAKHFGWRLFHDML

Query:  PTEDNLRRRGVDLQRGCPMCKSKVEMPLHAIWECKWARKIWRTSPFSVEGWLHNVSSAADVLFRGMELLVVEDFEKFFMLCWWIRNKRNQEVFSKRPLNV
        P    L RR +     CP+CK + E   HA++ C  A+++W+ S  ++   L   SSA + L          +FE+F  LCW I  +RN E F  +P   
Subjt:  PTEDNLRRRGVDLQRGCPMCKSKVEMPLHAIWECKWARKIWRTSPFSVEGWLHNVSSAADVLFRGMELLVVEDFEKFFMLCWWIRNKRNQEVFSKRPLNV

Query:  EQTDAWEWITNYLTQFQGFLCRS------------VGVQGEVSEGHLGWQPPMYPFLKLNTDAAIRQNLQRSGVGAVVRDEKGEIMGFLEK
          T   ++   Y+ ++Q    +S                   +     W+ P     KLNTDAA  +  +  G+GAV+RD  G I     K
Subjt:  EQTDAWEWITNYLTQFQGFLCRS------------VGVQGEVSEGHLGWQPPMYPFLKLNTDAAIRQNLQRSGVGAVVRDEKGEIMGFLEK

XP_030508852.1 uncharacterized protein LOC115723496 [Cannabis sativa]1.2e-8030Show/hide
Query:  KKGRRGWAALKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARGD
        K+G +G+AA+KLDMSKA+DRVEW F+ + M  +GF    V LI+ C+ SV+YSF  NG  +G + PSRG+RQGDPLSPYLFLICAEGLSR+L+++E  G 
Subjt:  KKGRRGWAALKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARGD

Query:  IKGLRISRWSPSINHLFFADDCFLFFRANVDEAHEIADCL------------------------------------------------------------
        ++GL+ISR +PS++HLFFADD  LF RAN   A  I  CL                                                            
Subjt:  IKGLRISRWSPSINHLFFADDCFLFFRANVDEAHEIADCL------------------------------------------------------------

Query:  -------RLYALLTGWKHKSFSMGGKEVLIKAVLQVIPTYSMSCFKLPKE--------------------------------------------------
               +++ LL+ WK   FS GGKEVL+KAV+Q IPTY+MSCF+LP                                                    
Subjt:  -------RLYALLTGWKHKSFSMGGKEVLIKAVLQVIPTYSMSCFKLPKE--------------------------------------------------

Query:  ---------------------------SKG---------SDSFVWKSLMWGQDLLRMRVRWRVGND----------------------------------
                                   S G         + S  W+SL+WG++LL   +RWRVG+                                   
Subjt:  ---------------------------SKG---------SDSFVWKSLMWGQDLLRMRVRWRVGND----------------------------------

Query:  -------------------DVGYVLGIPRPRIDKSDARMWHYEKRGRYKVRSGYLLASGLRHSSGGSDGEKMRCWWNFWWNRRIPSKAKHFGWRLFHDML
                           D+  VL IP       D  +W+    G Y V+SGY  A  L      +    +  WW+ +W  ++P K + F W++FH  L
Subjt:  -------------------DVGYVLGIPRPRIDKSDARMWHYEKRGRYKVRSGYLLASGLRHSSGGSDGEKMRCWWNFWWNRRIPSKAKHFGWRLFHDML

Query:  PTEDNLRRRGVDLQRGCPMCKSKVEMPLHAIWECKWARKIWRTSPFSVEGWLHNVSSAADVLFRGMELLVVEDFEKFFMLCWWIRNKRNQEVFSKRPLNV
        P    L RR +     C +C S  E   HA++ C  A+ +W  S FS++      SS AD L      L   + E F +LCW I ++RN  ++    +  
Subjt:  PTEDNLRRRGVDLQRGCPMCKSKVEMPLHAIWECKWARKIWRTSPFSVEGWLHNVSSAADVLFRGMELLVVEDFEKFFMLCWWIRNKRNQEVFSKRPLNV

Query:  EQTDAWEWITNYLTQFQGFLCRS---VGVQGEVSEGHLG--------WQPPMYPFLKLNTDAAIRQNLQRSGVGAVVRDEKGEIMGFLEK
            A  +  +YLT+FQ    ++   V   G  +             W  P    LKLNTDAAI +     G+GAV+R+  G I+  L K
Subjt:  EQTDAWEWITNYLTQFQGFLCRS---VGVQGEVSEGHLG--------WQPPMYPFLKLNTDAAIRQNLQRSGVGAVVRDEKGEIMGFLEK

XP_030508858.1 uncharacterized protein LOC115723499 [Cannabis sativa]1.1e-8631.19Show/hide
Query:  KKGRRGWAALKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARGD
        K+G +G+AA+KLDMSKA+DRVEW ++ + M  +GF    V+LI+ C+ SV+YSF  NG   G++ P+RG+RQGDPLSPYLFLICAEGLSR+L+ KE  G 
Subjt:  KKGRRGWAALKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARGD

Query:  IKGLRISRWSPSINHLFFADDCFLFFRANVDEAHEIADCLRLY---------------------------------------------------------
        + GL++SR +PS++HLFFADD  LF RAN   A  I   L +Y                                                         
Subjt:  IKGLRISRWSPSINHLFFADDCFLFFRANVDEAHEIADCLRLY---------------------------------------------------------

Query:  ----------ALLTGWKHKSFSMGGKEVLIKAVLQVIPTYSMSCFKLP-------------------------KESKGS---------------------
                   LL+ WK + FS+GGKEVL+KAV+Q IPTY+MSCF+LP                          ES  S                     
Subjt:  ----------ALLTGWKHKSFSMGGKEVLIKAVLQVIPTYSMSCFKLP-------------------------KESKGS---------------------

Query:  ---DSFVWKSLMWGQDLLRMRVRWRVGND-----------------------------------------------------DVGYVLGIPRPRIDKSDA
            S  W+S++WG++LL   +RWRVG+                                                      ++  +L IP       DA
Subjt:  ---DSFVWKSLMWGQDLLRMRVRWRVGND-----------------------------------------------------DVGYVLGIPRPRIDKSDA

Query:  RMWHYEKRGRYKVRSGYLLASGLRHSSGGSDGEKMRCWWNFWWNRRIPSKAKHFGWRLFHDMLPTEDNLRRRGVDLQRGCPMCKSKVEMPLHAIWECKWA
         +W+    G Y V+SGY  A+ L              WW+ +W  ++PSK + F W++FH+ +P    L R+ +     CP+CK + E   HA++ C  A
Subjt:  RMWHYEKRGRYKVRSGYLLASGLRHSSGGSDGEKMRCWWNFWWNRRIPSKAKHFGWRLFHDMLPTEDNLRRRGVDLQRGCPMCKSKVEMPLHAIWECKWA

Query:  RKIWRTSPFSVEGWLHNVSSAADVLFRGMELLVVEDFEKFFMLCWWIRNKRNQEVFSKRPLNVEQTDAWEWITNYLTQFQGFLCRSV------------G
        +++WR S   +   L   SSA + LF   +     DFE+F  +CW I  +RN E   K P   +      + T+YL ++Q    +SV             
Subjt:  RKIWRTSPFSVEGWLHNVSSAADVLFRGMELLVVEDFEKFFMLCWWIRNKRNQEVFSKRPLNVEQTDAWEWITNYLTQFQGFLCRSV------------G

Query:  VQGEVSEGHLGWQPPMYPFLKLNTDAAIRQNLQRSGVGAVVRDEKGEIMGFLEK
         + +V+     W  P+    KLNTDAA  +  +  G+GAV+RD  G I     K
Subjt:  VQGEVSEGHLGWQPPMYPFLKLNTDAAIRQNLQRSGVGAVVRDEKGEIMGFLEK

XP_030509188.1 uncharacterized protein LOC115723863 [Cannabis sativa]3.5e-8030.51Show/hide
Query:  KKGRRGWAALKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARGD
        K+G +G+AA+KLDMSKA+DRVEW F+ + M  +GF    V+LI+ C+ +VTYSF  NG  +G + PSRG+RQGDPLSPYLFLICAEGLSR+L+ +E+ G 
Subjt:  KKGRRGWAALKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARGD

Query:  IKGLRISRWSPSINHLFFADDCFLFFRANVDEAHEIADCLRLYA--------------------------------------------------------
        ++GL+ISR +PS++HLFFADD  LF RAN   A  I  CL+ Y+                                                        
Subjt:  IKGLRISRWSPSINHLFFADDCFLFFRANVDEAHEIADCLRLYA--------------------------------------------------------

Query:  -----------LLTGWKHKSFSMGGKEVLIKAVLQVIPTYSMSCFKLP----------------KESKGSDSFVWK------------------------
                   LL+ WK   FS GGKE+L+KAV+Q IPTY+MSCF+LP                  S    S  WK                        
Subjt:  -----------LLTGWKHKSFSMGGKEVLIKAVLQVIPTYSMSCFKLP----------------KESKGSDSFVWK------------------------

Query:  -----------------------------------------SLMWGQ----------DLLRMRVRW-------RVGNDDVGYVLGIPRPRIDKSDARMWH
                                                 SL W            DL+  + +W            DV  +L IP       DA +W 
Subjt:  -----------------------------------------SLMWGQ----------DLLRMRVRW-------RVGNDDVGYVLGIPRPRIDKSDARMWH

Query:  YEKRGRYKVRSGYLLASGLRHSSGGSDGEKMRCWWNFWWNRRIPSKAKHFGWRLFHDMLPTEDNLRRRGVDLQRGCPMCKSKVEMPLHAIWECKWARKIW
        +   G Y V+SGY LA         +    M  WW+ +W  ++P K + F W++FH  LP    L RR +     C +C S  E   HA+++C  A+ +W
Subjt:  YEKRGRYKVRSGYLLASGLRHSSGGSDGEKMRCWWNFWWNRRIPSKAKHFGWRLFHDMLPTEDNLRRRGVDLQRGCPMCKSKVEMPLHAIWECKWARKIW

Query:  RTSPFSVEGWLHNVSSAADVLFRGMELLVVEDFEKFFMLCWWIRNKRNQEVFSKRPLNVEQTDAWEWITNYLTQFQGFLCR-------SVGVQGEVSEGH
          S   ++      S++AD+L      L   +FE F +LCW   ++RN  ++    +   Q  A  +  +YL +FQ    +       S+          
Subjt:  RTSPFSVEGWLHNVSSAADVLFRGMELLVVEDFEKFFMLCWWIRNKRNQEVFSKRPLNVEQTDAWEWITNYLTQFQGFLCR-------SVGVQGEVSEGH

Query:  L----GWQPPMYPFLKLNTDAAIRQNLQRSGVGAVVRDEKGEIMGFLEK
              W  P    LKLNTDAAI +   + G+GA +R+  G I+  + K
Subjt:  L----GWQPPMYPFLKLNTDAAIRQNLQRSGVGAVVRDEKGEIMGFLEK

TrEMBL top hitse value%identityAlignment
A0A803PVI9 Uncharacterized protein1.8e-8730.43Show/hide
Query:  QEIKKGRRGWAALKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEA
        + +K+G+ G+AA+KLDMSKA+DRVEW F++  M +LGF+ + VNLI  C+ SV++SF  NG  +G + P+RG+RQGDPLSPYLF++CAEGLSR+L+ +E 
Subjt:  QEIKKGRRGWAALKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEA

Query:  RGDIKGLRISRWSPSINHLFFADDCFLFFRANVDEAHEIADCLRLYALLTG-------------------------------------------------
        RG+++GL+++R +PS++HLFFADD  L  RAN   AH I + L LY   +G                                                 
Subjt:  RGDIKGLRISRWSPSINHLFFADDCFLFFRANVDEAHEIADCLRLYALLTG-------------------------------------------------

Query:  ------------WKHKS------FSMGGKEVLIKAVLQVIPTYSMSCFKLPK------------------------------------------------
                    WKH S      FS+GGKEVL+KAV Q IPTY+MSCF+L K                                                
Subjt:  ------------WKHKS------FSMGGKEVLIKAVLQVIPTYSMSCFKLPK------------------------------------------------

Query:  -------------------------------------ESKGS-DSFVWKSLMWGQDLLRMRVRWRVGN--------------------------------
                                              +KGS  S  W+ ++WG++LL   +RW+VG+                                
Subjt:  -------------------------------------ESKGS-DSFVWKSLMWGQDLLRMRVRWRVGN--------------------------------

Query:  ---------------------DDVGYVLGIPRPRIDKSDARMWHYEKRGRYKVRSGYLLASGLRHSSGGSDGEKMRCWWNFWWNRRIPSKAKHFGWRLFH
                              DV  VL IP      SD  +W++E  G Y V+SGY LA+ L      + G + + WWN +W+  +PSK + F WR  +
Subjt:  ---------------------DDVGYVLGIPRPRIDKSDARMWHYEKRGRYKVRSGYLLASGLRHSSGGSDGEKMRCWWNFWWNRRIPSKAKHFGWRLFH

Query:  DMLPTEDNLRRRGVDLQRGCPMCKSKVEMPLHAIWECKWARKIWRTSPFSVEGWLHNVSSAADVLFRGMELLVVEDFEKFFMLCWWIRNKRNQEV--FSK
        D LPT   L  R +     C +C++  E   HA++ CK  RKIWR S F++   +    +  +++ +  +L      E+F  + W I N+RN+E      
Subjt:  DMLPTEDNLRRRGVDLQRGCPMCKSKVEMPLHAIWECKWARKIWRTSPFSVEGWLHNVSSAADVLFRGMELLVVEDFEKFFMLCWWIRNKRNQEV--FSK

Query:  RPLNVEQTDAWEWITNYLTQFQGFLCRSVGVQGEVSEGHL------GWQPPMYPFLKLNTDAAIRQNLQRSGVGAVVRDEKGEIMGFLEK
        +P N+ +  A     +YL  +Q    +        S  +L       W  P    LKLNTDAAI Q  Q +G GA++RD  GEI+    K
Subjt:  RPLNVEQTDAWEWITNYLTQFQGFLCRSVGVQGEVSEGHL------GWQPPMYPFLKLNTDAAIRQNLQRSGVGAVVRDEKGEIMGFLEK

A0A803Q2K8 Uncharacterized protein3.7e-8030.13Show/hide
Query:  KKGRRGWAALKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARGD
        K+G +G+AA+KLDMSKA+DRVEW F+ + M  +GF    V+LI+ C+ +VTYSF  NG  +G + PSRG+RQGDPLSPYLFLICAEGLSR+L+ +E+ G 
Subjt:  KKGRRGWAALKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARGD

Query:  IKGLRISRWSPSINHLFFADDCFLFFRANVDEAHEIADCLRLYA--------------------------------------------------------
        ++GL+ISR +PS++HLFFADD  LF RAN   A  I  CL+ Y+                                                        
Subjt:  IKGLRISRWSPSINHLFFADDCFLFFRANVDEAHEIADCLRLYA--------------------------------------------------------

Query:  -----------LLTGWKHKSFSMGGKEVLIKAVLQVIPTYSMSCFKLP----------------KESKGSDSFVWK------------------------
                   LL+ WK   FS GGKE+L+KAV+Q IPTY+MSCF+LP                  S    S  WK                        
Subjt:  -----------LLTGWKHKSFSMGGKEVLIKAVLQVIPTYSMSCFKLP----------------KESKGSDSFVWK------------------------

Query:  ---------------SLM--------WGQDLLRMRVRWRVGN-----------------------------------------------------DDVGY
                       SL+        +  +LL   +RWRVG+                                                      DV  
Subjt:  ---------------SLM--------WGQDLLRMRVRWRVGN-----------------------------------------------------DDVGY

Query:  VLGIPRPRIDKSDARMWHYEKRGRYKVRSGYLLASGLRHSSGGSDGEKMRCWWNFWWNRRIPSKAKHFGWRLFHDMLPTEDNLRRRGVDLQRGCPMCKSK
        +L IP       DA +W +   G Y V+SGY LA         +    M  WW+ +W  ++P K + F W++FH  LP    L RR +     C +C S 
Subjt:  VLGIPRPRIDKSDARMWHYEKRGRYKVRSGYLLASGLRHSSGGSDGEKMRCWWNFWWNRRIPSKAKHFGWRLFHDMLPTEDNLRRRGVDLQRGCPMCKSK

Query:  VEMPLHAIWECKWARKIWRTSPFSVEGWLHNVSSAADVLFRGMELLVVEDFEKFFMLCWWIRNKRNQEVFSKRPLNVEQTDAWEWITNYLTQFQGFLCR-
         E   HA+++C  A+ +W  S   ++      S++AD+L      L   +FE F +LCW   ++RN  ++    +   Q  A  +  +YL +FQ    + 
Subjt:  VEMPLHAIWECKWARKIWRTSPFSVEGWLHNVSSAADVLFRGMELLVVEDFEKFFMLCWWIRNKRNQEVFSKRPLNVEQTDAWEWITNYLTQFQGFLCR-

Query:  ------SVGVQGEVSEGHL----GWQPPMYPFLKLNTDAAIRQNLQRSGVGAVVRDEKGEIMGFLEK
              S+                W  P    LKLNTDAAI +   + G+GA +R+  G I+  + K
Subjt:  ------SVGVQGEVSEGHL----GWQPPMYPFLKLNTDAAIRQNLQRSGVGAVVRDEKGEIMGFLEK

A0A803Q6Z2 Uncharacterized protein6.4e-8029.35Show/hide
Query:  KKGRRGWAALKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARGD
        K+GR+G+AA+KLDMSKA+DRVEWFFLE+ M  LGF    V LI+ C+ SV+YSF  NG  +G+I P RG+RQGDPLSPYLFLIC+EG SR+L+++E+ G 
Subjt:  KKGRRGWAALKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARGD

Query:  IKGLRISRWSPSINHLFFADDCFLFFRANVDEAHEIADCLRLYA--------------------------------------------------------
        ++GL++SR +P I HL FADD  LF RA+   A  I  CL LY+                                                        
Subjt:  IKGLRISRWSPSINHLFFADDCFLFFRANVDEAHEIADCLRLYA--------------------------------------------------------

Query:  -----------LLTGWKHKSFSMGGKEVLIKAVLQVIPTYSMSCFKLP-----------------KESKGS-----------------------------
                   L+  W+ + FS+GGKEVL+KAV+Q IPTY+MSCF+LP                 K + G+                             
Subjt:  -----------LLTGWKHKSFSMGGKEVLIKAVLQVIPTYSMSCFKLP-----------------KESKGS-----------------------------

Query:  ----------------------------------------DSFVWKSLMWGQDLLRMRVRWRVGN-----------------------------------
                                                 S  WKS++WG++LL   +RWR+G+                                   
Subjt:  ----------------------------------------DSFVWKSLMWGQDLLRMRVRWRVGN-----------------------------------

Query:  ------------------DDVGYVLGIPRPRIDKSDARMWHYEKRGRYKVRSGYLLASGLRHSSGGSDGEKMRCWWNFWWNRRIPSKAKHFGWRLFHDML
                           DV  +L IP       D  +WHY   G Y V+SGY LAS +  S   S       WW ++W  ++PSK + F WR +H+ L
Subjt:  ------------------DDVGYVLGIPRPRIDKSDARMWHYEKRGRYKVRSGYLLASGLRHSSGGSDGEKMRCWWNFWWNRRIPSKAKHFGWRLFHDML

Query:  PTEDNLRRRGVDLQRGCPMCKSKVEMPLHAIWECKWARKIWRTSPFSVEGWLHNVSSAADVLFRGMELLVVEDFEKFFMLCWWIRNKRNQEVFSKRPLNV
        PT   L+ R +     CP+C+  +E   HA + C  A+++W+    S+   L    S +D L      L  E  E F    W I   RN E  SK P + 
Subjt:  PTEDNLRRRGVDLQRGCPMCKSKVEMPLHAIWECKWARKIWRTSPFSVEGWLHNVSSAADVLFRGMELLVVEDFEKFFMLCWWIRNKRNQEVFSKRPLNV

Query:  EQTDAWEWITNYLTQFQGFLCR--SVGVQGEVSE--------------GHL----------------GWQPPMYP--------FLKLNTDAAIRQNLQRS
         Q   +++ ++YL +F+    +  S G + +VS                HL                G  PP  P         LK+NTDAA+       
Subjt:  EQTDAWEWITNYLTQFQGFLCR--SVGVQGEVSE--------------GHL----------------GWQPPMYP--------FLKLNTDAAIRQNLQRS

Query:  GVGAVVRDEKGEIMGFLEK
        G+ A++R+  G+I+  + K
Subjt:  GVGAVVRDEKGEIMGFLEK

A0A803QB90 Uncharacterized protein1.0e-8233.21Show/hide
Query:  KGRRGWAALKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARGDI
        +GR G++ALKLDMSKA+DRVEW +LE  M  +GF   +V LIM+C+ + ++SF  NGD  G++ PSRGLRQGDPLSPYLFLIC+EGLSR+L+++E++ ++
Subjt:  KGRRGWAALKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARGDI

Query:  KGLRISRWSPSINHLFFADDCFLFFRANVDEA----------HE----------------------------------IADC------------------
        +GL I+R +PSI+HL FADD  LF +A+   A          H+                                  I DC                  
Subjt:  KGLRISRWSPSINHLFFADDCFLFFRANVDEA----------HE----------------------------------IADC------------------

Query:  -----LRLYALLTGWKHKSFSMGGKEVLIKAVLQVIPTYSMSCFKLPKESKGSDSFVWKSLMWGQDLLRMRVRWRVGNDDVGYVLGIPRPRIDKSDARMW
              +++ LL  W  K FS+GGKEVL+KAV+Q IPTY+MSCFKL K+       +  +  WG +    ++ W+  N                 D  +W
Subjt:  -----LRLYALLTGWKHKSFSMGGKEVLIKAVLQVIPTYSMSCFKLPKESKGSDSFVWKSLMWGQDLLRMRVRWRVGNDDVGYVLGIPRPRIDKSDARMW

Query:  HYEKRGRYKVRSGYLLASGLRHSSGGSDGEKMRCWWNFWWNRRIPSKAKHFGWRLFHDMLPTEDNLRRRGVDLQRGCPMCKSKVEMPLHAIWECKWARKI
        H+   G Y V+SG+ LA+ L      S  +  R WW ++W+ ++P K + F W++F ++LPT   L +R +     C +C S  E   HA++ C+ A+ +
Subjt:  HYEKRGRYKVRSGYLLASGLRHSSGGSDGEKMRCWWNFWWNRRIPSKAKHFGWRLFHDMLPTEDNLRRRGVDLQRGCPMCKSKVEMPLHAIWECKWARKI

Query:  WRTSPFSVEGWLHNVSSAADVLFRGMELLVV------EDFEKFFMLCWWIRNKRNQEVFSKRPLNVEQTDAWEWITNYLTQFQGFLC--RSVGVQG----
        W+ S F ++       + A  +F+G  L  +      EDFE F  L W +   RN+     +  +     A  + T +   F       + V V G    
Subjt:  WRTSPFSVEGWLHNVSSAADVLFRGMELLVV------EDFEKFFMLCWWIRNKRNQEVFSKRPLNVEQTDAWEWITNYLTQFQGFLC--RSVGVQG----

Query:  ------EVSEGHLGWQPPMYPFLKLNTDAAIRQNLQRSGVGAVVRDEKGEIMGFLEK
               V    + W PP     KLN DAA     ++ G+GA++RD  G ++  L K
Subjt:  ------EVSEGHLGWQPPMYPFLKLNTDAAIRQNLQRSGVGAVVRDEKGEIMGFLEK

A0A803QCG6 Uncharacterized protein1.2e-8130.66Show/hide
Query:  KGRRGWAALKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARGDI
        +G+ G+AALKLDMSKA+DRVEW +LE  M  +GF   +V LIM+C+ + ++SF  NG+  G++ P RGLRQGDPLSPYLFLIC+EGLSR+L ++E  G++
Subjt:  KGRRGWAALKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARGDI

Query:  KGLRISRWSPSINHLFFADDCFLFFRANVDEAHEIADCLRLY----------------------------------------------------------
         GLR++R SP+++HL FADD  LF RAN   A  I   L +Y                                                          
Subjt:  KGLRISRWSPSINHLFFADDCFLFFRANVDEAHEIADCLRLY----------------------------------------------------------

Query:  ---------ALLTGWKHKSFSMGGKEVLIKAVLQVIPTYSMSCFKLPKE-----------------SKG---------------------------SDSF
                  LL  W  + FS GGKEVL+KAV+Q IPTY+MSCFKL K+                   G                           S S+
Subjt:  ---------ALLTGWKHKSFSMGGKEVLIKAVLQVIPTYSMSCFKLPKE-----------------SKG---------------------------SDSF

Query:  VWKSLMWGQDLLRMRVRWRVGND----------------------------------------------------DVGYVLGIPRPRIDKSDARMWHYEK
         W+S+ WG++LL   +R++VGN                                                     DV  +L IP      +D  +WH+  
Subjt:  VWKSLMWGQDLLRMRVRWRVGND----------------------------------------------------DVGYVLGIPRPRIDKSDARMWHYEK

Query:  RGRYKVRSGYLLASGLRHSSGGSDGEKMRCWWNFWWNRRIPSKAKHFGWRLFHDMLPTEDNLRRRGVDLQRGCPMCKSKVEMPLHAIWECKWARKIWRTS
         G Y V+SG+ LAS L      S  +    WW F+W+  +P K + F W++ H++LPT   L +R +     C +C S  E   HA+++CK ARKIW+ S
Subjt:  RGRYKVRSGYLLASGLRHSSGGSDGEKMRCWWNFWWNRRIPSKAKHFGWRLFHDMLPTEDNLRRRGVDLQRGCPMCKSKVEMPLHAIWECKWARKIWRTS

Query:  PFSVEGWLHNVSSAADVLFRGMELLVVEDFEKFFMLCWWIRNKRNQEVFSKRPLNVEQTDAWEWITNYLTQFQGFLCRSVGVQGEVS-------------
         F  +  +       D L     +   EDFE    + W I   RN      +  +  Q      I  Y T F    CR+    G  +             
Subjt:  PFSVEGWLHNVSSAADVLFRGMELLVVEDFEKFFMLCWWIRNKRNQEVFSKRPLNVEQTDAWEWITNYLTQFQGFLCRSVGVQGEVS-------------

Query:  -EGHLGWQPPMYPFLKLNTDAAIRQNLQRSGVGAVVRDEKGEIMGFLEK
         +  + W  P     KLN DAA     +  G+GA++RD  G ++  L K
Subjt:  -EGHLGWQPPMYPFLKLNTDAAIRQNLQRSGVGAVVRDEKGEIMGFLEK

SwissProt top hitse value%identityAlignment
P08548 LINE-1 reverse transcriptase homolog1.2e-1131.29Show/hide
Query:  LKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARGDIKGLRISRW
        L +D  KA+D ++  F+ + +K +G E  F+ LI       T +   NG +  +     G RQG PLSP LF I  E L+  +  ++A   IKG+ I   
Subjt:  LKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARGDIKGLRISRW

Query:  SPSINHLFFADDCFLFFRANVDEAHEIADCLRLYALLTGWK---HKS
        S  I    FADD  ++     D   ++ + ++ Y+ ++G+K   HKS
Subjt:  SPSINHLFFADDCFLFFRANVDEAHEIADCLRLYALLTGWK---HKS

P0C2F6 Putative ribonuclease H protein At1g657501.7e-1324.71Show/hide
Query:  DARMWHYEKRGRYKVRSGYLLASGLRHSSGGSDGEKMRCWWNFWWNRRIPSKAKHFGWRLFHDMLPTEDNLRRRGVDLQRGCPMCKSKVEMPLHAIWECK
        D   W + + G++ VRS Y +      +        M  ++N  W  R+P + K F W + +  + TE+   RR +     C +CK  VE  LH + +C 
Subjt:  DARMWHYEKRGRYKVRSGYLLASGLRHSSGGSDGEKMRCWWNFWWNRRIPSKAKHFGWRLFHDMLPTEDNLRRRGVDLQRGCPMCKSKVEMPLHAIWECK

Query:  WARKIW-------RTSPF---SVEGWLH----NVSSAADVLFRGMELLVVEDFEKFFMLCWWIRNKRNQEVFSKRPLNVEQTD-AWEWITNYLTQFQGFL
            IW       R   F   S+  WL+    + S   D+ +  +          F ++ WW    R   +F +     ++     EW         G +
Subjt:  WARKIW-------RTSPF---SVEGWLH----NVSSAADVLFRGMELLVVEDFEKFFMLCWWIRNKRNQEVFSKRPLNVEQTD-AWEWITNYLTQFQGFL

Query:  CRSVGVQGEVSEGHLGWQPPMYPFLKLNTDAAIRQNLQRSGVGAVVRDEKGEIMG
           VG+     E  +GW  P   ++K+NTD A R N   +  G V+RD  G   G
Subjt:  CRSVGVQGEVSEGHLGWQPPMYPFLKLNTDAAIRQNLQRSGVGAVVRDEKGEIMG

P0C2F6 Putative ribonuclease H protein At1g657502.5e-0430.59Show/hide
Query:  RANVDEAHEIADCLRLYALLTGWKHKSFSMGGKEVLIKAVLQVIPTYSMSCFKLPKESKGSDSFVWKSLMWGQDLLRMR---VRW
        R N D   EI +  R+ + ++GW+ K+ S  G+  L KAVL  +P +SMS   LP+        + ++ +WG    + +   V+W
Subjt:  RANVDEAHEIADCLRLYALLTGWKHKSFSMGGKEVLIKAVLQVIPTYSMSCFKLPKESKGSDSFVWKSLMWGQDLLRMR---VRW

P11369 LINE-1 retrotransposable element ORF2 protein9.3e-1228.47Show/hide
Query:  LKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARGDIKGLRISRW
        + LD  KA+D+++  F+ K ++  G +  ++N+I         + + NG++   I    G RQG PLSPYLF I  E L+R +  ++   +IKG++I + 
Subjt:  LKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARGDIKGLRISRW

Query:  SPSINHLFFADDCFLFFRANVDEAHEIADCLRLYALLTGWKHKS
           I+ L  ADD  ++     +   E+ + +  +  + G+K  S
Subjt:  SPSINHLFFADDCFLFFRANVDEAHEIADCLRLYALLTGWKHKS

P92555 Uncharacterized mitochondrial protein AtMg012501.8e-1555.88Show/hide
Query:  FRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARGDIKGLRISRWSPSINHLFFADD
        F  NG  +G + PSRGLRQGDPLSPYLF++C E LS +    + +G + G+R+S  SP INHL FADD
Subjt:  FRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARGDIKGLRISRWSPSINHLFFADD

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)2.4e-0730.4Show/hide
Query:  KKGRRGWAALKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFN-GDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARG
        ++ R+ +  + LD+ KA+D V    + + ++ LG ++   N I   +   T + R   G +   IC  RG++QGDPLSP+LF    + L   L+     G
Subjt:  KKGRRGWAALKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFN-GDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARG

Query:  DIKGLRISRWSPSINHLFFADDCFL
           G         I  L FADD  L
Subjt:  DIKGLRISRWSPSINHLFFADDCFL

Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein4.2e-0725.58Show/hide
Query:  QRGCPMCKSKVEMPLHAIWECKWARKIWRTSPFSV--EG-WLHNV-SSAADVLFRGMELLVVEDFEKFF-MLCWWIRNKRNQEVFSKRPLNVEQTDAWEW
        +  C  C    E   H +++C +AR +W  SP     EG W  ++ ++   VL   +E+  +         L W +   RN+ +F  +     + DA E 
Subjt:  QRGCPMCKSKVEMPLHAIWECKWARKIWRTSPFSV--EG-WLHNV-SSAADVLFRGMELLVVEDFEKFF-MLCWWIRNKRNQEVFSKRPLNVEQTDAWEW

Query:  ITNYLTQFQGFLCRSVGVQGEVS----EGHLG--WQPPMYPFLKLNTDAAIRQNLQRSGVGAVVRDEKGEIM
        +   +  F+ +  R   ++G+ S    E +L   W+ P Y ++K NTDA  +    R G+G ++R+E G ++
Subjt:  ITNYLTQFQGFLCRSVGVQGEVS----EGHLG--WQPPMYPFLKLNTDAAIRQNLQRSGVGAVVRDEKGEIM

AT4G20520.1 RNA binding;RNA-directed DNA polymerases3.0e-0551.35Show/hide
Query:  KKGRRGWAALKLDMSKAYDRVEWFFLEKCMKALGFED
        KKG +GW  LKLD+ KAYDR+ W +LE  + + GF +
Subjt:  KKGRRGWAALKLDMSKAYDRVEWFFLEKCMKALGFED

AT4G29090.1 Ribonuclease H-like superfamily protein1.2e-1926.05Show/hide
Query:  DARMWHYEKRGRYKVRSGYLLASGL---RHSSGGSDGEKMRCWWNFWWNRRIPSKAKHFGWRLFHDMLPTEDNLRRRGVDLQRGCPMCKSKVEMPLHAIW
        D+  W Y   G Y V+SGY + + +   R S        +   +   W  +   K +HF W+   + LP    L  R +  +  C  C S  E   H ++
Subjt:  DARMWHYEKRGRYKVRSGYLLASGL---RHSSGGSDGEKMRCWWNFWWNRRIPSKAKHFGWRLFHDMLPTEDNLRRRGVDLQRGCPMCKSKVEMPLHAIW

Query:  ECKWARKIWRTS--PFSVEG-----------WLHNVSSAADVLFRGMELLVVEDFEKFFMLCWWIRNKRNQEVFSKRPLNVEQT------DAWEWITNYL
        +C +AR  W  S  P  + G           W+ N+ +      +  +L+          L W +   RN+ VF  R  N ++       D  EW     
Subjt:  ECKWARKIWRTS--PFSVEG-----------WLHNVSSAADVLFRGMELLVVEDFEKFFMLCWWIRNKRNQEVFSKRPLNVEQT------DAWEWITNYL

Query:  TQFQGFLCRSVGVQGEVSEGHLG-WQPPMYPFLKLNTDAAIRQNLQRSGVGAVVRDEKGEI
        T+       S G + +V+    G W+PP + ++K NTDA   ++ +R G+G V+R+EKGE+
Subjt:  TQFQGFLCRSVGVQGEVSEGHLG-WQPPMYPFLKLNTDAAIRQNLQRSGVGAVVRDEKGEI

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.3e-1655.88Show/hide
Query:  FRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARGDIKGLRISRWSPSINHLFFADD
        F  NG  +G + PSRGLRQGDPLSPYLF++C E LS +    + +G + G+R+S  SP INHL FADD
Subjt:  FRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKEARGDIKGLRISRWSPSINHLFFADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATATCCAAAGACATAATTAACAAGCCAGACCAAGAGATAAAGAAGGGGAGAAGGGGGTGGGCAGCCCTTAAGCTCGATATGAGCAAAGCATATGATAGGGTGGAATG
GTTTTTCCTTGAGAAATGTATGAAGGCGCTGGGTTTTGAGGATAACTTTGTTAACCTAATTATGGATTGTGTGATGTCAGTGACATACTCTTTCAGGTTCAATGGAGATC
GAAGGGGAAATATTTGTCCATCGAGAGGGTTACGTCAAGGAGACCCGTTGTCCCCCTATCTCTTCCTAATATGTGCAGAGGGACTGTCAAGGATGCTAGAATGGAAAGAG
GCGAGAGGAGATATAAAGGGGTTAAGGATTTCGAGGTGGAGTCCTTCGATCAATCACCTTTTCTTTGCAGATGACTGTTTTCTCTTTTTTAGAGCTAATGTGGATGAGGC
TCATGAAATTGCAGACTGTTTGAGATTGTATGCTCTACTAACAGGATGGAAGCATAAGAGTTTTTCTATGGGTGGTAAAGAGGTGCTTATAAAGGCAGTCCTACAAGTGA
TCCCAACCTACTCGATGTCATGTTTTAAGCTACCGAAGGAGAGTAAGGGTAGTGATTCTTTTGTGTGGAAAAGTTTGATGTGGGGGCAAGACTTACTGCGGATGAGGGTG
AGATGGAGAGTAGGGAATGATGATGTGGGGTATGTTCTTGGCATTCCTAGACCAAGAATAGATAAATCGGATGCTCGAATGTGGCATTATGAGAAGAGGGGTCGATACAA
AGTGAGAAGTGGGTATCTTTTAGCTTCTGGCTTGAGGCACAGTTCTGGGGGTTCTGATGGGGAGAAAATGCGATGTTGGTGGAATTTTTGGTGGAATAGAAGAATTCCGA
GTAAGGCTAAGCACTTTGGGTGGAGGCTGTTCCATGATATGCTTCCCACTGAAGACAATTTAAGGAGGAGGGGAGTGGATTTGCAGAGAGGATGCCCGATGTGTAAGTCG
AAAGTTGAAATGCCTTTACATGCCATTTGGGAGTGTAAGTGGGCAAGGAAAATTTGGAGGACCTCTCCATTCAGTGTTGAGGGTTGGCTCCATAATGTATCTAGTGCAGC
TGATGTGTTGTTTCGAGGAATGGAGCTGCTGGTAGTTGAGGATTTTGAGAAGTTTTTTATGCTTTGCTGGTGGATCAGGAATAAACGGAATCAGGAGGTGTTTTCTAAGC
GCCCTCTAAATGTTGAGCAAACAGATGCTTGGGAGTGGATTACGAATTATCTTACACAATTTCAAGGCTTTCTGTGTAGAAGTGTAGGGGTGCAAGGGGAAGTTTCAGAG
GGTCATTTGGGTTGGCAACCACCGATGTACCCTTTCCTCAAATTAAACACAGATGCAGCAATCAGACAAAATCTCCAACGAAGTGGTGTGGGTGCTGTAGTAAGAGATGA
AAAGGGAGAGATTATGGGTTTTTTGGAAAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGATATCCAAAGACATAATTAACAAGCCAGACCAAGAGATAAAGAAGGGGAGAAGGGGGTGGGCAGCCCTTAAGCTCGATATGAGCAAAGCATATGATAGGGTGGAATG
GTTTTTCCTTGAGAAATGTATGAAGGCGCTGGGTTTTGAGGATAACTTTGTTAACCTAATTATGGATTGTGTGATGTCAGTGACATACTCTTTCAGGTTCAATGGAGATC
GAAGGGGAAATATTTGTCCATCGAGAGGGTTACGTCAAGGAGACCCGTTGTCCCCCTATCTCTTCCTAATATGTGCAGAGGGACTGTCAAGGATGCTAGAATGGAAAGAG
GCGAGAGGAGATATAAAGGGGTTAAGGATTTCGAGGTGGAGTCCTTCGATCAATCACCTTTTCTTTGCAGATGACTGTTTTCTCTTTTTTAGAGCTAATGTGGATGAGGC
TCATGAAATTGCAGACTGTTTGAGATTGTATGCTCTACTAACAGGATGGAAGCATAAGAGTTTTTCTATGGGTGGTAAAGAGGTGCTTATAAAGGCAGTCCTACAAGTGA
TCCCAACCTACTCGATGTCATGTTTTAAGCTACCGAAGGAGAGTAAGGGTAGTGATTCTTTTGTGTGGAAAAGTTTGATGTGGGGGCAAGACTTACTGCGGATGAGGGTG
AGATGGAGAGTAGGGAATGATGATGTGGGGTATGTTCTTGGCATTCCTAGACCAAGAATAGATAAATCGGATGCTCGAATGTGGCATTATGAGAAGAGGGGTCGATACAA
AGTGAGAAGTGGGTATCTTTTAGCTTCTGGCTTGAGGCACAGTTCTGGGGGTTCTGATGGGGAGAAAATGCGATGTTGGTGGAATTTTTGGTGGAATAGAAGAATTCCGA
GTAAGGCTAAGCACTTTGGGTGGAGGCTGTTCCATGATATGCTTCCCACTGAAGACAATTTAAGGAGGAGGGGAGTGGATTTGCAGAGAGGATGCCCGATGTGTAAGTCG
AAAGTTGAAATGCCTTTACATGCCATTTGGGAGTGTAAGTGGGCAAGGAAAATTTGGAGGACCTCTCCATTCAGTGTTGAGGGTTGGCTCCATAATGTATCTAGTGCAGC
TGATGTGTTGTTTCGAGGAATGGAGCTGCTGGTAGTTGAGGATTTTGAGAAGTTTTTTATGCTTTGCTGGTGGATCAGGAATAAACGGAATCAGGAGGTGTTTTCTAAGC
GCCCTCTAAATGTTGAGCAAACAGATGCTTGGGAGTGGATTACGAATTATCTTACACAATTTCAAGGCTTTCTGTGTAGAAGTGTAGGGGTGCAAGGGGAAGTTTCAGAG
GGTCATTTGGGTTGGCAACCACCGATGTACCCTTTCCTCAAATTAAACACAGATGCAGCAATCAGACAAAATCTCCAACGAAGTGGTGTGGGTGCTGTAGTAAGAGATGA
AAAGGGAGAGATTATGGGTTTTTTGGAAAAGTAG
Protein sequenceShow/hide protein sequence
MISKDIINKPDQEIKKGRRGWAALKLDMSKAYDRVEWFFLEKCMKALGFEDNFVNLIMDCVMSVTYSFRFNGDRRGNICPSRGLRQGDPLSPYLFLICAEGLSRMLEWKE
ARGDIKGLRISRWSPSINHLFFADDCFLFFRANVDEAHEIADCLRLYALLTGWKHKSFSMGGKEVLIKAVLQVIPTYSMSCFKLPKESKGSDSFVWKSLMWGQDLLRMRV
RWRVGNDDVGYVLGIPRPRIDKSDARMWHYEKRGRYKVRSGYLLASGLRHSSGGSDGEKMRCWWNFWWNRRIPSKAKHFGWRLFHDMLPTEDNLRRRGVDLQRGCPMCKS
KVEMPLHAIWECKWARKIWRTSPFSVEGWLHNVSSAADVLFRGMELLVVEDFEKFFMLCWWIRNKRNQEVFSKRPLNVEQTDAWEWITNYLTQFQGFLCRSVGVQGEVSE
GHLGWQPPMYPFLKLNTDAAIRQNLQRSGVGAVVRDEKGEIMGFLEK