CuGenDBv2

Lag0039763 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0039763
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr2:49499515..49502170
RNA-Seq Expression: Lag0039763
Synteny: Lag0039763
Gene Ontology terms: NA
InterPro domains:
    IPR000477 - Reverse transcriptase domain
    IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
    IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits    e value    % identity    Alignment
XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]    8.0e-153    36.75
Query:  LSEKDDLAWLIGGDFNEIVADSEKKGGARKNPRQMDLFRETINRCKLRAWGYSRNKFTWRRSRNKRNQVYKRLDRYFLNHEMATKMVNFKA---IEKKEE
        L+ +  L WL  GDFNEI++ +EK GGA ++  QMD FR+ +N C     GY    +TW   +   N++  RLDR     + + K V  K    ++   +
Subjt:  LSEKDDLAWLIGGDFNEIVADSEKKGGARKNPRQMDLFRETINRCKLRAWGYSRNKFTWRRSRNKRNQVYKRLDRYFLNHEMATKMVNFKA---IEKKEE

Query:  EILKLSTEN------------------EEQNFEQILEA--------------------------------------------------------------
            L T+N                  + ++ + I+EA                                                              
Subjt:  EILKLSTEN------------------EEQNFEQILEA--------------------------------------------------------------

Query:  ----KVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIYSGIGLWEEDEEKIGNIATEYFRKLFQSSNPDLMNNRKVVNCVQTK
            + E++ LL++EE YW  R++  WLK GD+NT++FH+++S+R+++N I GI+   G W ++EE I   A  YF  ++ SS+P  +   +V   +  K
Subjt:  ----KVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIYSGIGLWEEDEEKIGNIATEYFRKLFQSSNPDLMNNRKVVNCVQTK

Query:  INDDWRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQTCLKVLNEDVDISLLNKTVITLILKVKDSKHMREFRPISLCNVSYK
        + ++    L + +T+ E+   +K + PNKAPG DG  A F+Q YW  VG N     L VLN ++ I  LNKT I+LI K  + K M +FRPISLCNV YK
Subjt:  INDDWRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQTCLKVLNEDVDISLLNKTVITLILKVKDSKHMREFRPISLCNVSYK

Query:  IIAKTIANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQIAIKLDMCKAYDRVKWGFIKEMLQRMGFENRWIKNIMKYVETVSF
        +I+K +ANR K  L  IIS  Q+AF   RLI+DNV++ FE +H + +K  GKEG +AIKLDM KA+DRV+WGFI +++++MGF NRW   +M+ + +VS+
Subjt:  IIAKTIANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQIAIKLDMCKAYDRVKWGFIKEMLQRMGFENRWIKNIMKYVETVSF

Query:  SVLINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFRINNFCPSITHLFFADDSLLFCRSKREECQTLKEIFNLYEKASGQAIN
        S+LING +     PSRG+RQGDPLSP LFL+CAEG S L+ +   N+ + G  IN  CP +THLFFADDS+LFC++  EEC  L+ I   YE+ASGQ IN
Subjt:  SVLINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFRINNFCPSITHLFFADDSLLFCRSKREECQTLKEIFNLYEKASGQAIN

Query:  LEKSMFMASKNVKKEEAQEIGKILGVRQTKELGTYLGMPSQSHRSKTKMFENIKAKV--------SKILQVGDKK-------------------------
         +KS    S N  +E   EI  ILG  Q      YLG+PS   RSK+++F  +K KV         K+L +G K+                         
Subjt:  LEKSMFMASKNVKKEEAQEIGKILGVRQTKELGTYLGMPSQSHRSKTKMFENIKAKV--------SKILQVGDKK-------------------------

Query:  -------------------KLHWTKRNKLCRSKSSGGMGFRELNFFNQAMLAKQSWRILKHPESLLYKVLCGRCFRNEDFLKASVGSNAPQTWKSIVWGR
                           K+ W    ++C SK+SGG+GFR L  FN AMLAKQ+WRIL +P SL+ +VL  R F   D L A +GS+   +W+SI    
Subjt:  -------------------KLHWTKRNKLCRSKSSGGMGFRELNFFNQAMLAKQSWRILKHPESLLYKVLCGRCFRNEDFLKASVGSNAPQTWKSIVWGR

Query:  DLFKEGYRWRIGNGEYTYIDQDPWLIRKGNSSPIW-VLEELRGRRVKEIIREDGK-WDEDHIKRLFLPMDVEGILSIPL
        ++ + G RWR+GNG+  +I +D WL        I   +       V  +I  D K W  + ++ +FLP +VE IL IPL
Subjt:  DLFKEGYRWRIGNGEYTYIDQDPWLIRKGNSSPIW-VLEELRGRRVKEIIREDGK-WDEDHIKRLFLPMDVEGILSIPL

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]    3.5e-148    40.88
Query:  KAIEKKEEEILKLSTENEEQNFEQILEAKVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIYSGIGLWEEDEEKIGNIATEYF
        K  EKKE     + ++       +I   + E+++LL+ EE  W+ RS+  WL  GD+NT++FH+K+S R++RN I GI    G W++  E I  +A  YF
Subjt:  KAIEKKEEEILKLSTENEEQNFEQILEAKVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIYSGIGLWEEDEEKIGNIATEYF

Query:  RKLFQSSNPDLMNNRKVVNCVQTKINDDWRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQTCLKVLNEDVDISLLNKTVITL
        + ++ SS P  ++  +V++ + T + ++    L Q +TR EIE  +  + P KAPG DG  A F+Q YW+ VG + V   L VLN ++ +  +NKT ITL
Subjt:  RKLFQSSNPDLMNNRKVVNCVQTKINDDWRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQTCLKVLNEDVDISLLNKTVITL

Query:  ILKVKDSKHMREFRPISLCNVSYKIIAKTIANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQIAIKLDMCKAYDRVKWGFIKE
        + K+K+   M +FRPISLCNV YK+I+K +ANR K  L  IIS  Q+AF+ GRLI+DNV++ FE +H + +KK+GKEG  AIKLDM KAYDRV+WGFIK+
Subjt:  ILKVKDSKHMREFRPISLCNVSYKIIAKTIANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQIAIKLDMCKAYDRVKWGFIKE

Query:  MLQRMGFENRWIKNIMKYVETVSFSVLINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFRINNFCPSITHLFFADDSLLFCRS
        ++++MGF  +WIK +M  + +VS+S+L+NG +     P+RG+RQGDP+SPY+FL+CA+GFS LL        + G  I   CP ITHLFFADDSLLFC++
Subjt:  MLQRMGFENRWIKNIMKYVETVSFSVLINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFRINNFCPSITHLFFADDSLLFCRS

Query:  KREECQTLKEIFNLYEKASGQAINLEKSMFMASKNVKKEEAQEIGKILGVRQTKELGTYLGMPSQSHRSKTKMFENIKAKVSKIL---------------
          +ECQTL +I  LYE ASGQ IN++KS    S N   E+  E+ ++LG  Q      YLG+PS   +SK ++F  +K +V + L               
Subjt:  KREECQTLKEIFNLYEKASGQAINLEKSMFMASKNVKKEEAQEIGKILGVRQTKELGTYLGMPSQSHRSKTKMFENIKAKVSKIL---------------

Query:  -------------------------------------QVGDKKKLHWTKRNKLCRSKSSGGMGFRELNFFNQAMLAKQSWRILKHPESLLYKVLCGRCFR
                                             Q G + K+ W    KLC++K +GGMGFR L  FN AMLAKQ WR++ +P SL+ ++   R + 
Subjt:  -------------------------------------QVGDKKKLHWTKRNKLCRSKSSGGMGFRELNFFNQAMLAKQSWRILKHPESLLYKVLCGRCFR

Query:  NEDFLKASVGSNAPQTWKSIVWGRDLFKEGYRWRIGNGEYTYIDQDPWL---IRKGNSSPIWVLEELRGRRVKEII-REDGKWDEDHIKRLFLPMDVEGI
        + D  +A +G++   TW+SI  G ++ + G RWR+GNGE   I +D WL   I     SP    ++    RV  +I RE  +W +D ++ LFLP +   I
Subjt:  NEDFLKASVGSNAPQTWKSIVWGRDLFKEGYRWRIGNGEYTYIDQDPWL---IRKGNSSPIWVLEELRGRRVKEII-REDGKWDEDHIKRLFLPMDVEGI

Query:  LSIPLGN
        LSIPL +
Subjt:  LSIPLGN

XP_030479133.1 uncharacterized protein LOC115696372 [Cannabis sativa]    1.4e-144    36.41
Query:  WLIGGDFNEIVADSEKKGGARKNPRQMDLFRETINRCKLRAWGYSRNKFTWRRSRNKRNQVYKRLDRYFLN---------------------HEMATKMV
        WL+ GD NEI +   K  G  ++   M  FR  +++C L A     + FTW ++R K   + +R+D  F+N                     H + +  +
Subjt:  WLIGGDFNEIVADSEKKGGARKNPRQMDLFRETINRCKLRAWGYSRNKFTWRRSRNKRNQVYKRLDRYFLN---------------------HEMATKMV

Query:  NF-------------------KAIEKKEEEILKLST--ENEEQNFEQILEAKVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKG
        +F                   K I K + E+ +L++     + +   +  A+  L++LL  EE YW+ RS+  WLK+GD+NT++FHSK+S R   N+IK 
Subjt:  NF-------------------KAIEKKEEEILKLST--ENEEQNFEQILEAKVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKG

Query:  IYSGIGLWEEDEEKIGNIATEYFRKLFQSSNPDLMNNRKVVNCVQTKINDDWRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTV
        +    G     ++ I ++  +YF K+F +++ D      V++ + T I+      L Q +TR+++ A +KS+  +K+PG DG  A FYQ YW  VG+   
Subjt:  IYSGIGLWEEDEEKIGNIATEYFRKLFQSSNPDLMNNRKVVNCVQTKINDDWRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTV

Query:  QTCLKVLNEDVDISLLNKTVITLILKVKDSKHMREFRPISLCNVSYKIIAKTIANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKE
        Q  L VLN     +  NKT++TLI K+K  K M++FRPISLCNV YKII+K +A R K+ L+S+IS TQ+AF+  RLI+DN+++ FE IH+++++K+G +
Subjt:  QTCLKVLNEDVDISLLNKTVITLILKVKDSKHMREFRPISLCNVSYKIIAKTIANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKE

Query:  GQIAIKLDMCKAYDRVKWGFIKEMLQRMGFENRWIKNIMKYVETVSFSVLINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFR
        G  A+K DM KA+DRV+W FI  ++ +MGF  RWI  IM  + T  FS  ING       P RG+RQGDPLSPYLFL+C+EG S LL  E +   LKG  
Subjt:  GQIAIKLDMCKAYDRVKWGFIKEMLQRMGFENRWIKNIMKYVETVSFSVLINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFR

Query:  INNFCPSITHLFFADDSLLFCRSKREECQTLKEIFNLYEKASGQAINLEKSMFMASKNVKKEEAQEIGKILGVRQTKELGTYLGMPSQSHRSKTKMFENI
        ++   PSI+HLFFADDSLLFC++    C  +K   ++Y +ASGQ +N +KS+   S N          +ILG+   +    YLG+P+ S R K+++F NI
Subjt:  INNFCPSITHLFFADDSLLFCRSKREECQTLKEIFNLYEKASGQAINLEKSMFMASKNVKKEEAQEIGKILGVRQTKELGTYLGMPSQSHRSKTKMFENI

Query:  KAKV--------SKILQVG--------------------------------------------DKKKLHWTKRNKLCRSKSSGGMGFRELNFFNQAMLAK
        K K+         KI  +G                                            D KK+HW K   LC+SK  GGMGFR    FNQA+LAK
Subjt:  KAKV--------SKILQVG--------------------------------------------DKKKLHWTKRNKLCRSKSSGGMGFRELNFFNQAMLAK

Query:  QSWRILKHPESLLYKVLCGRCFRNEDFLKASVGSNAPQTWKSIVWGRDLFKEGYRWRIGNGEYTYIDQDPWLIRKGNSSPIWVLEELRGRRVKEIIREDG
        Q+WRI + P SLL +VL G  F   DF+ A  G  +  TW+ IVWGR+L  +G R ++G G       D W+       P +         V + I    
Subjt:  QSWRILKHPESLLYKVLCGRCFRNEDFLKASVGSNAPQTWKSIVWGRDLFKEGYRWRIGNGEYTYIDQDPWLIRKGNSSPIWVLEELRGRRVKEIIREDG

Query:  KWDEDHIKRLFLPMDVEGILSIPL
        +W+ + ++  F   DV+ IL IPL
Subjt:  KWDEDHIKRLFLPMDVEGILSIPL

XP_030931246.1 uncharacterized protein LOC115957168 [Quercus lobata]    7.3e-146    37.48
Query:  WLIGGDFNEIVADSEKKGGARKNPRQMDLFRETINRCKLRAWGYSRNKFTWRRSRNKRNQVYKRLDRYFLNHEMATKM----------------------
        W+  GDFN I+   EK+        QMD F+E + +C L   G+    FTW   R       +RLDR         K                       
Subjt:  WLIGGDFNEIVADSEKKGGARKNPRQMDLFRETINRCKLRAWGYSRNKFTWRRSRNKRNQVYKRLDRYFLNHEMATKM----------------------

Query:  ------------------VNFKAIEKKEEEILKLSTENEEQNFEQILEAKVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIY
                             K ++K+ E++ K   E+ + +  + L    ELD LL ++E YW   S+ +WLK+GDKNT++FHSK+S R++RN I+GI 
Subjt:  ------------------VNFKAIEKKEEEILKLSTENEEQNFEQILEAKVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIY

Query:  SGIGLWEEDEEKIGNIATEYFRKLFQSSNPDLMNNRKVVNCVQTKINDDWRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQT
        +    W ED   IG +AT YF ++F +   D M   + +N VQ KI +D +  L + Y+  EI+A +  + P KAPG DG +A FYQ +W+ VG++ V  
Subjt:  SGIGLWEEDEEKIGNIATEYFRKLFQSSNPDLMNNRKVVNCVQTKINDDWRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQT

Query:  CLKVLNEDVDISLLNKTVITLILKVKDSKHMREFRPISLCNVSYKIIAKTIANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQ
         L  LN       +N T I LI K+K  + M ++RPISLCNV YKII+K +AN+ K+ L  IIS TQ+AFVP RLI+DN+++ +EC+HA+  +K+GK+G 
Subjt:  CLKVLNEDVDISLLNKTVITLILKVKDSKHMREFRPISLCNVSYKIIAKTIANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQ

Query:  IAIKLDMCKAYDRVKWGFIKEMLQRMGFENRWIKNIMKYVETVSFSVLINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFRIN
        IA+KLD+ KAYDRV+W F+K ++++MGF   WI  +M  V T SFSV ING       PSRGIRQGDPLSPYLFL+CAEGF+ LL +      + G  I 
Subjt:  IAIKLDMCKAYDRVKWGFIKEMLQRMGFENRWIKNIMKYVETVSFSVLINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFRIN

Query:  NFCPSITHLFFADDSLLFCRSKREECQTLKEIFNLYEKASGQAINLEKSMFMASKNVKKEEAQEIGKILGVRQTKELGTYLGMPSQSHRSKTKMFENIKA
           P I++L FADDSL+FC++ R E Q L EI  LY  ASGQ INLEKS    S N +    +EI  +LGV++     +YLG+P+   RSK + F  +K 
Subjt:  NFCPSITHLFFADDSLLFCRSKREECQTLKEIFNLYEKASGQAINLEKSMFMASKNVKKEEAQEIGKILGVRQTKELGTYLGMPSQSHRSKTKMFENIKA

Query:  KVSKIL----------------------------------------------------QVGDKKKLHWTKRNKLCRSKSSGGMGFRELNFFNQAMLAKQS
        ++ K L                                                    QVGD++K+HW   + + + K  GGMGFR++  FN AMLAKQ 
Subjt:  KVSKIL----------------------------------------------------QVGDKKKLHWTKRNKLCRSKSSGGMGFRELNFFNQAMLAKQS

Query:  WRILKHPESLLYKVLCGRCFRNEDFLKASVGSNAPQTWKSIVWGRDLFKEGYRWRIGNGEYTYIDQDPWLIRKGNSSPIWVLEELRGR-RVKEIIRED-G
        WR+++   SL+Y     R F    FL+A   SN+   WKSI+  +++ K G  WR+G G    +  + W+    ++  I   +E+    RV E+I  + G
Subjt:  WRILKHPESLLYKVLCGRCFRNEDFLKASVGSNAPQTWKSIVWGRDLFKEGYRWRIGNGEYTYIDQDPWLIRKGNSSPIWVLEELRGR-RVKEIIRED-G

Query:  KWDEDHIKRLFLPMDVEGILSIPLGNK
         W+++ I+  F   D + IL IPL  +
Subjt:  KWDEDHIKRLFLPMDVEGILSIPLGNK

XP_030936391.1 uncharacterized protein LOC115961572 [Quercus lobata]    3.6e-145    36.19
Query:  MEKLSEKDDLAWLIGGDFNEIVADSEKKGGARKNPRQMDLFRETINRCKLRAWGYSRNKFTWRRSRNKRNQVYKRLDRYFLNHE----------------
        ++ L     + WL  GDFNEI   +EK+GG  +  RQM+ F + IN C  R   +   K+TW   R     + +RLDR   N E                
Subjt:  MEKLSEKDDLAWLIGGDFNEIVADSEKKGGARKNPRQMDLFRETINRCKLRAWGYSRNKFTWRRSRNKRNQVYKRLDRYFLNHE----------------

Query:  ------------------MATKMVNFKAIEKKE---EEILKLSTE-------------------------NEEQ---------NFEQILE----------
                             K   F+++  K+   EEI+K + E                         N+E+           +Q LE          
Subjt:  ------------------MATKMVNFKAIEKKE---EEILKLSTE-------------------------NEEQ---------NFEQILE----------

Query:  -------AKVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIYSGIGLWEEDEEKIGNIATEYFRKLFQSSNPDLMNNRKVVNC
                +V L+K LE+E++ WR RS+  W + GD+NT +FH+K+S R Q+N I GI    G W+EDE KI  +A  YF KLF SS P+  ++  +++ 
Subjt:  -------AKVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIYSGIGLWEEDEEKIGNIATEYFRKLFQSSNPDLMNNRKVVNC

Query:  VQTKINDDWRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQTCLKVLNEDVDISLLNKTVITLILKVKDSKHMREFRPISLCN
        VQ K+  D    L + YT  E+   +K + P KAPG DG    F+Q +W+T GE    T L  LN  +     N+T I LI K+ + KH+ ++RPISLCN
Subjt:  VQTKINDDWRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQTCLKVLNEDVDISLLNKTVITLILKVKDSKHMREFRPISLCN

Query:  VSYKIIAKTIANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQIAIKLDMCKAYDRVKWGFIKEMLQRMGFENRWIKNIMKYVE
        V+YKI +K IANR KK L SIIS TQ+AFV GRLI+DNV++ FE +H I  KK GK G++AIKLDM KAYDRV+W F++++++++GF+      IM+ + 
Subjt:  VSYKIIAKTIANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQIAIKLDMCKAYDRVKWGFIKEMLQRMGFENRWIKNIMKYVE

Query:  TVSFSVLINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFRINNFCPSITHLFFADDSLLFCRSKREECQTLKEIFNLYEKASG
        TVS+++ ING  +    PSRGIRQGDPLSPYLFL+CAEG S L+   V N +++G  I    P ++HLFFADDSL+FC++   EC  L+ +  +YE+ASG
Subjt:  TVSFSVLINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFRINNFCPSITHLFFADDSLLFCRSKREECQTLKEIFNLYEKASG

Query:  QAINLEKSMFMASKNVKKEEAQEIGKILGVRQTKELGTYLGMPSQSHRSKTKMFENIKAKVSKIL-----------------------------------
        Q +N  K+    S N  KE  +EI    G +  K+   YLG+PS   ++K   F +IK K+ K L                                   
Subjt:  QAINLEKSMFMASKNVKKEEAQEIGKILGVRQTKELGTYLGMPSQSHRSKTKMFENIKAKVSKIL-----------------------------------

Query:  -----------------QVGDKKKLHWTKRNKLCRSKSSGGMGFRELNFFNQAMLAKQSWRILKHPESLLYKVLCGRCFRNEDFLKASVGSNAPQTWKSI
                         QV ++ ++ W   +K+C SKS+GGMGF+ L  FN A+LAKQ WR+    +SL+Y+VL  + F   +F+ AS+G+N   +W+SI
Subjt:  -----------------QVGDKKKLHWTKRNKLCRSKSSGGMGFRELNFFNQAMLAKQSWRILKHPESLLYKVLCGRCFRNEDFLKASVGSNAPQTWKSI

Query:  VWGRDLFKEGYRWRIGNGEYTYIDQDPWLIRKGNSSPIWVLEELRG-RRVKEII-REDGKWDEDHIKRLFLPMDVEGILSIPLGNKV
        +  + L KEG +WR+GNG    + +D WL    +   I     L    RV +++  E G+W  + I  +FLP + + I SIP+  ++
Subjt:  VWGRDLFKEGYRWRIGNGEYTYIDQDPWLIRKGNSSPIWVLEELRG-RRVKEII-REDGKWDEDHIKRLFLPMDVEGILSIPLGNKV

TrEMBL top hits    e value    % identity    Alignment
A0A7N2L6Z9 Reverse transcriptase domain-containing protein    4.6e-154    38.06
Query:  LAWLIGGDFNEIVADSEKKGGARKNPRQMDLFRETINRCKLRAWGYSRNKFTWRRSRNKRNQVYKRLDR----------------------------YFL
        L W   GDFNEIV+  EK GGA +   QMD FR  IN C  +  GYS   +TW   +    ++Y RLDR                             F+
Subjt:  LAWLIGGDFNEIVADSEKKGGARKNPRQMDLFRETINRCKLRAWGYSRNKFTWRRSRNKRNQVYKRLDR----------------------------YFL

Query:  NHEMATK-----MVNFKAIEKKEEE----------------------------------------------------ILKLSTENEEQNF--EQILEAKV
        ++    K       +F+A   K+E+                                                    +L   TE +   F   +I   + 
Subjt:  NHEMATK-----MVNFKAIEKKEEE----------------------------------------------------ILKLSTENEEQNF--EQILEAKV

Query:  ELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIYSGIGLWEEDEEKIGNIATEYFRKLFQSSNPDLMNNRKVVNCVQTKINDDWR
        EL+ LL++EE +W  RS+  WLK GD+NT++FH+++S+R+++N I G++   G W ED + I N A  YF  ++ +SNP +++  +V   + T I ++  
Subjt:  ELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIYSGIGLWEEDEEKIGNIATEYFRKLFQSSNPDLMNNRKVVNCVQTKINDDWR

Query:  RMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQTCLKVLNEDVDISLLNKTVITLILKVKDSKHMREFRPISLCNVSYKIIAKTI
          L + +TR EI   +K + P K+PG DG  A F+Q YWD VG N     L VLN  + + ++NKT I LI K  + K M +FRPISLCNV YK+I+KT+
Subjt:  RMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQTCLKVLNEDVDISLLNKTVITLILKVKDSKHMREFRPISLCNVSYKIIAKTI

Query:  ANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQIAIKLDMCKAYDRVKWGFIKEMLQRMGFENRWIKNIMKYVETVSFSVLING
        ANR K  L  II+  Q+AF   RLI+DNV+I +E +H +++KK GK+  +A KLDM KA+DRV+WGFI+ ++++MGF   WI  IM+ + +VS+SV+ING
Subjt:  ANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQIAIKLDMCKAYDRVKWGFIKEMLQRMGFENRWIKNIMKYVETVSFSVLING

Query:  SSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFRINNFCPSITHLFFADDSLLFCRSKREECQTLKEIFNLYEKASGQAINLEKSMF
               P+RG+RQGDPLSPYLFL+CAEG S LL     NQ L G  +   CP ITHLFFADDSLLFC++ REEC+ LKEI   YE ASGQ +N +KS  
Subjt:  SSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFRINNFCPSITHLFFADDSLLFCRSKREECQTLKEIFNLYEKASGQAINLEKSMF

Query:  MASKNVKKEEAQEIGKILGVRQTKELGTYLGMPSQSHRSKTKMFENIKAKV--------SKILQVGDKK-------------------------------
          S N   E  + I  ILG  Q      YLG+PS   RSK  +F  IK +V         K+L  G K+                               
Subjt:  MASKNVKKEEAQEIGKILGVRQTKELGTYLGMPSQSHRSKTKMFENIKAKV--------SKILQVGDKK-------------------------------

Query:  -------------KLHWTKRNKLCRSKSSGGMGFRELNFFNQAMLAKQSWRILKHPESLLYKVLCGRCFRNEDFLKASVGSNAPQTWKSIVWGRDLFKEG
                     K+ W    K+C+ KS GG+GFR L+ FN A+LAKQ+WRIL +P SL  ++L  + F   D L AS+GSN   TW+SI    ++ K+G
Subjt:  -------------KLHWTKRNKLCRSKSSGGMGFRELNFFNQAMLAKQSWRILKHPESLLYKVLCGRCFRNEDFLKASVGSNAPQTWKSIVWGRDLFKEG

Query:  YRWRIGNGEYTYIDQDPWLIRKGN---SSPIWVLEELRGRRVKEIIREDGKWDE-DHIKRLFLPMDVEGILSIPL
         RWR+GNG   +I  D WL         +P  + E+     V  +I  D +W + D I+ LFLP+D E IL IPL
Subjt:  YRWRIGNGEYTYIDQDPWLIRKGN---SSPIWVLEELRGRRVKEIIREDGKWDE-DHIKRLFLPMDVEGILSIPL

A0A803P996 Uncharacterized protein    4.3e-152    35.61
Query:  LAWLIGGDFNEIVADSEKKGGARKNPRQMDLFRETINRCKLRAWGYSRNKFTWRRSRNKRNQVYKRLDRYFLNHE-------------------------
        L WL  GDFNEI+++++K GG+ +    M+ F+ ++++C L+   Y+ + FTW  +R   + + +RLD  F+N++                         
Subjt:  LAWLIGGDFNEIVADSEKKGGARKNPRQMDLFRETINRCKLRAWGYSRNKFTWRRSRNKRNQVYKRLDRYFLNHE-------------------------

Query:  -----------------------------------------------------MATKMVNF------------KAIEKKEEEILKLSTEN--EEQNFEQI
                                                             ++T   N             K I   + ++  L+  N     +F+++
Subjt:  -----------------------------------------------------MATKMVNF------------KAIEKKEEEILKLSTEN--EEQNFEQI

Query:  LEAKVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIYSGIGLWEEDEEKIGNIATEYFRKLFQSSNPDLMNNRKVVNCVQTKI
          AK  L+ LLE+EE YW+  S+  WL  GD+NT++FH+K+S RK  N+IK +++ +G     +E I  +   ++  LF S++ D       ++C+ T +
Subjt:  LEAKVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIYSGIGLWEEDEEKIGNIATEYFRKLFQSSNPDLMNNRKVVNCVQTKI

Query:  NDDWRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQTCLKVLNEDVDISLLNKTVITLILKVKDSKHMREFRPISLCNVSYKI
        + +    L +P+T  E+   + S+ P+K+PG DG  A FYQ  W TVG+   +  L +LN++ D + LNKT+ITLI KVK  +H++E+RPISLCNV  K+
Subjt:  NDDWRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQTCLKVLNEDVDISLLNKTVITLILKVKDSKHMREFRPISLCNVSYKI

Query:  IAKTIANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQIAIKLDMCKAYDRVKWGFIKEMLQRMGFENRWIKNIMKYVETVSFS
        + K + +RFK  L  +IS  Q+AF+P RLI+DNV++ FE +HAI+NK  G+ G  + KLDM KA+DRV+W FI+E++++MGF  RWI  IM  + T +FS
Subjt:  IAKTIANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQIAIKLDMCKAYDRVKWGFIKEMLQRMGFENRWIKNIMKYVETVSFS

Query:  VLINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFRINNFCPSITHLFFADDSLLFCRSKREECQTLKEIFNLYEKASGQAINL
         +ING    +  PSRG++QG PLSPYLFL+C+EGFS LL  E ++ NL GF++    P ITHLFFADDSLLFC++    C  +K + + Y KASGQ +NL
Subjt:  VLINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFRINNFCPSITHLFFADDSLLFCRSKREECQTLKEIFNLYEKASGQAINL

Query:  EKSMFMASKNVKKEEAQEIGKILGVRQTKELGTYLGMPSQSHRSKTKMFENIKAKVSKILQVGDKK---------------------------KLHWTKR
        +KS+   S N          + L +   +    YLG+PS S R K +MF NIK ++ K++   ++K                           ++HW   
Subjt:  EKSMFMASKNVKKEEAQEIGKILGVRQTKELGTYLGMPSQSHRSKTKMFENIKAKVSKILQVGDKK---------------------------KLHWTKR

Query:  NKLCRSKSSGGMGFRELNFFNQAMLAKQSWRILKHPESLLYKVLCGRCFRNEDFLKASVGSNAPQTWKSIVWGRDLFKEGYRWRIGNGEYTYIDQDPWLI
        N LC+SK  GGMGFR    FNQA+LAKQ+WRI + P SLL ++L  R F N +FL+AS+G +   TW+ I W R+L  +G RW++G+G +     DPW+ 
Subjt:  NKLCRSKSSGGMGFRELNFFNQAMLAKQSWRILKHPESLLYKVLCGRCFRNEDFLKASVGSNAPQTWKSIVWGRDLFKEGYRWRIGNGEYTYIDQDPWLI

Query:  RKGNSSPIWVLEELRGRRVKEIIREDGKWDEDHIKRLFLPMDVEGILSIPL
              P        G  V  +I ++ +WD   +++ F  +DVE ILS+PL
Subjt:  RKGNSSPIWVLEELRGRRVKEIIREDGKWDEDHIKRLFLPMDVEGILSIPL

A0A803PI64 Uncharacterized protein    2.4e-150    36.53
Query:  WLIGGDFNEIVADSEKKGGARKNPRQMDLFRETINRCKLRAWGYSRNKFTWRRSRNKRNQVYKRLDRYFLNHE---------------------------
        WL  GDFNEIV+ +EK GG  ++   M+ FRE I+ C+L  +  +++  TW    ++ N V +RLDR   N E                           
Subjt:  WLIGGDFNEIVADSEKKGGARKNPRQMDLFRETINRCKLRAWGYSRNKFTWRRSRNKRNQVYKRLDRYFLNHE---------------------------

Query:  -----------------------------------------------------------------MATKMVNFKAIEKKEEEILKLSTENEEQNFEQILE
                                                                            K V  + I+K +  +++LS+ ++   + ++ +
Subjt:  -----------------------------------------------------------------MATKMVNFKAIEKKEEEILKLSTENEEQNFEQILE

Query:  AKVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIYSGIGLWEEDEEKIGNIATEYFRKLFQSSNPDLMNNRKVVNCVQTKIND
         + +L+ +LE++E YWR RS+  WLK GD NT++FH K+S R+++N IKG+   IG+W  +   +  +   YF  +F SS+       +V+N +  K+ D
Subjt:  AKVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIYSGIGLWEEDEEKIGNIATEYFRKLFQSSNPDLMNNRKVVNCVQTKIND

Query:  DWRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQTCLKVLNEDVDISLLNKTVITLILKVKDSKHMREFRPISLCNVSYKIIA
        D   ML + +T  EI   VK ++P KAPG DG  A FY  +W  + ++ +  CLKVLN   ++  LN+T+I LI KV+  K + EFRPISLCNV YKI++
Subjt:  DWRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQTCLKVLNEDVDISLLNKTVITLILKVKDSKHMREFRPISLCNVSYKIIA

Query:  KTIANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQIAIKLDMCKAYDRVKWGFIKEMLQRMGFENRWIKNIMKYVETVSFSVL
        K +  R    ++ +IS TQ+AF+  RLI DN IIG+E +H +R  +    G++A+KLDM KAYDRV+W F+  ++ R+GF   W+  IM+ V + SFS L
Subjt:  KTIANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQIAIKLDMCKAYDRVKWGFIKEMLQRMGFENRWIKNIMKYVETVSFSVL

Query:  INGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFRINNFCPSITHLFFADDSLLFCRSKREECQTLKEIFNLYEKASGQAINLEK
        ING ++ +  P RG+RQGDPLSP+LFL CAE  S L+ +E     L+G R N    S++HLFFADDSL+F  +  + C   ++I   Y  ASGQ +N  K
Subjt:  INGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFRINNFCPSITHLFFADDSLLFCRSKREECQTLKEIFNLYEKASGQAINLEK

Query:  SMFMASKNVKKEEAQEIGKILGVRQTKELGTYLGMPSQSHRSKTKMFENIKAKVSKILQVGDKKKLHWTKRNKLCRSKSSGGMGFRELNFFNQAMLAKQS
        S      NV  E   ++  ++GVR+    G YLG+PS   R+K +  + IK K         +KK+HW K   LCR K  GG+GFR+L  FNQA+LAKQ 
Subjt:  SMFMASKNVKKEEAQEIGKILGVRQTKELGTYLGMPSQSHRSKTKMFENIKAKVSKILQVGDKKKLHWTKRNKLCRSKSSGGMGFRELNFFNQAMLAKQS

Query:  WRILKHPESLLYKVLCGRCFRNEDFLKASVGSNAPQTWKSIVWGRDLFKEGYRWRIGNGEYTYIDQDPWLIRKGNSSPIWVLEELRGRRVKEIIREDGKW
        WR ++HP+ L  +VL    F ++ FL+A  G+NA   W+S+VWG+ L  +GYRWR+GNGE   + +DPWL R                 V ++ R DG+W
Subjt:  WRILKHPESLLYKVLCGRCFRNEDFLKASVGSNAPQTWKSIVWGRDLFKEGYRWRIGNGEYTYIDQDPWLIRKGNSSPIWVLEELRGRRVKEIIREDGKW

Query:  DEDHIKRLFLPMDVEGILSIPLGN
        DE  I+ +F  +D E IL+IP  +
Subjt:  DEDHIKRLFLPMDVEGILSIPLGN

A0A803PJK4 Uncharacterized protein    9.0e-150    36.91
Query:  LAWLIGGDFNEIVADSEKKGGARKNPRQMDLFRETINRCKLRAWGYSRNKFTWRRSRNKRNQVYKRLDRYFLNHE----MATKMV---------------
        L WL+ GDFNEI++  +K GGA +N  Q+D FRET++ C L    +   +FTW  + ++   V +RLD  F+N +    MAT ++               
Subjt:  LAWLIGGDFNEIVADSEKKGGARKNPRQMDLFRETINRCKLRAWGYSRNKFTWRRSRNKRNQVYKRLDRYFLNHE----MATKMV---------------

Query:  -----------------NFKAIEKKEEEIL---------------------------------------------KLSTENEE----------QNFEQIL
                          F+ I  KE++ L                                             KLS E  E           +F Q+L
Subjt:  -----------------NFKAIEKKEEEIL---------------------------------------------KLSTENEE----------QNFEQIL

Query:  EAKVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIYSGIGLWEEDEEKIGNIATEYFRKLFQSSNPDLMNNRKVVNCVQTKIN
        E++  LD LL +EEDYW  RS+ +WLK+GD NT++FH K+S RK  N+I  +    G+    +  I NI  EYF  +F +        + V+  V   + 
Subjt:  EAKVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIYSGIGLWEEDEEKIGNIATEYFRKLFQSSNPDLMNNRKVVNCVQTKIN

Query:  DDWRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQTCLKVLNEDVDISLLNKTVITLILKVKDSKHMREFRPISLCNVSYKII
         + +  L  P T  E+   +KS++ + +PG DG    FY  YW  VGE+  +T LKVLNE  D S  N T+ITLI KVK    M   RPISLCNV YK++
Subjt:  DDWRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQTCLKVLNEDVDISLLNKTVITLILKVKDSKHMREFRPISLCNVSYKII

Query:  AKTIANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQIAIKLDMCKAYDRVKWGFIKEMLQRMGFENRWIKNIMKYVETVSFSV
        +KTI  R K  LNSIIS +Q AF+P RLI+DN+++ FE IH++++ K+GK+G  AIKLDM KA+DR++W FIK M+  MGF +  +  I++ + TVS+S 
Subjt:  AKTIANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQIAIKLDMCKAYDRVKWGFIKEMLQRMGFENRWIKNIMKYVETVSFSV

Query:  LINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFRINNFCPSITHLFFADDSLLFCRSKREECQTLKEIFNLYEKASGQAINLE
         ING+ Q    P RG+RQGDPLSPYLF++CAEGFS LL  E  N +L GF+++   P+++HLFFADDSLL CR+     ++++     Y +ASGQ +N E
Subjt:  LINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFRINNFCPSITHLFFADDSLLFCRSKREECQTLKEIFNLYEKASGQAINLE

Query:  KSMFMASKNVKKEEAQEIGKILGVRQTKELGTYLGMPSQSHRSKTKMFENIKAKVSKIL--------QVGDK----------------------KKL---
        KS+   S N ++       ++L ++       YLG+PS S R K+ +F  IK K+ K+L         +G K                      KKL   
Subjt:  KSMFMASKNVKKEEAQEIGKILGVRQTKELGTYLGMPSQSHRSKTKMFENIKAKVSKIL--------QVGDK----------------------KKL---

Query:  -------------------HWTKRNKLCRSKSSGGMGFRELNFFNQAMLAKQSWRILKHPESLLYKVLCGRCFRNEDFLKASVGSNAPQTWKSIVWGRDL
                           HW   N LC+SK  GGMGF+    +NQA+LAKQ+WRIL +P SL  ++L  R F++  FL A +GS    TW+ IVWG++L
Subjt:  -------------------HWTKRNKLCRSKSSGGMGFRELNFFNQAMLAKQSWRILKHPESLLYKVLCGRCFRNEDFLKASVGSNAPQTWKSIVWGRDL

Query:  FKEGYRWRIGNGEYTYIDQDPWLIRKGNSSPIWVLEELRGRRVKEIIREDGKWDEDHIKRLFLPMDVEGILSIPL
         + G RW++GNG       DPWL    +  P+          V E+I  D +W    +K  FL +D++ ILSIPL
Subjt:  FKEGYRWRIGNGEYTYIDQDPWLIRKGNSSPIWVLEELRGRRVKEIIREDGKWDEDHIKRLFLPMDVEGILSIPL

A0A803QAN3 Uncharacterized protein    1.2e-149    38.11
Query:  LAWLIGGDFNEIVADSEKKGGARKNPRQMDLFRETINRCKLRAWGYSRNKFTWRRSRNKRNQVYKRLDRYFLN-------HEMATKMVNFKAIEKK----
        L W++ GDFNEI+ +  KKGG R+   QMD FR  ++ C L    ++ + FTW + R K + +++RLD  F N         + T  +++ A + +    
Subjt:  LAWLIGGDFNEIVADSEKKGGARKNPRQMDLFRETINRCKLRAWGYSRNKFTWRRSRNKRNQVYKRLDRYFLN-------HEMATKMVNFKAIEKK----

Query:  EEEILKLSTENEEQNFEQILEAKVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIYSGIGLWEEDEEKIGNIATEYFRKLFQS
        +  +L  +     +N E I      LD LL +EE+YW  RS+  WL+ GD+NT++FH+ ++ RK +N IK + +  G+    ++ +  +   ++  LF +
Subjt:  EEEILKLSTENEEQNFEQILEAKVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIYSGIGLWEEDEEKIGNIATEYFRKLFQS

Query:  SNPDLMNNRKVVNCVQTKINDDWRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQTCLKVLNEDVDISLLNKTVITLILKVKD
              + + V+N +   I  D    L  P+T  E+ A +KS+SP+K+PGSDG  A FYQ YWD VG    Q  L VLN+  D++ LNK++ITLI KV +
Subjt:  SNPDLMNNRKVVNCVQTKINDDWRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQTCLKVLNEDVDISLLNKTVITLILKVKD

Query:  SKHMREFRPISLCNVSYKIIAKTIANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQIAIKLDMCKAYDRVKWGFIKEMLQRMG
           M ++RPISLCNV YK+I+K I  RF+K L  +IS TQ+AF+  RLI+DN+++ FE IH +R+K QG++G  A+KLDM KA+DRV+W +++ ++ +MG
Subjt:  SKHMREFRPISLCNVSYKIIAKTIANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQIAIKLDMCKAYDRVKWGFIKEMLQRMG

Query:  FENRWIKNIMKYVETVSFSVLINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFRINNFCPSITHLFFADDSLLFCRSKREECQ
        F ++W+  IM  + T SFS  +NG      +PSRG+RQGDPLSPYLFL+C+EG S LL  E    NL+G R+    PS++HL FADDSLLFCR+  +   
Subjt:  FENRWIKNIMKYVETVSFSVLINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFRINNFCPSITHLFFADDSLLFCRSKREECQ

Query:  TLKEIFNLYEKASGQAINLEKSMFMASKNVKKEEAQEIGKILGVRQTKELGTYLGMPSQSHRSKTKMFENIKAKV--------SKILQVGDKK-------
         ++   + Y +ASGQ +N  KS+   S N   +        L +  T+    YLG+PS S R K ++F +IK +V         KI  +G K+       
Subjt:  TLKEIFNLYEKASGQAINLEKSMFMASKNVKKEEAQEIGKILGVRQTKELGTYLGMPSQSHRSKTKMFENIKAKV--------SKILQVGDKK-------

Query:  -------------------------------------KLHWTKRNKLCRSKSSGGMGFRELNFFNQAMLAKQSWRILKHPESLLYKVLCGRCFRNEDFLK
                                             K+HW + N LC+SK  GGMGFR    FNQA+LAKQ+WRI + P SLL ++L  R F    F  
Subjt:  -------------------------------------KLHWTKRNKLCRSKSSGGMGFRELNFFNQAMLAKQSWRILKHPESLLYKVLCGRCFRNEDFLK

Query:  ASVGSNAPQTWKSIVWGRDLFKEGYRWRIGNGEYTYIDQDPWLIRKGNSSPIWVLEELRGRRVKEIIREDGKWDEDHIKRLFLPMDVEGILSIPL
        A +G +   TW+SI WGRDL  +G R++IG G       DPW+    N  P+          V   I +  +W+ D +   F P+DV+ ILSIPL
Subjt:  ASVGSNAPQTWKSIVWGRDLFKEGYRWRIGNGEYTYIDQDPWLIRKGNSSPIWVLEELRGRRVKEIIREDGKWDEDHIKRLFLPMDVEGILSIPL

SwissProt top hits    e value    % identity    Alignment
O00370 LINE-1 retrotransposable element ORF2 protein    8.4e-36    25.77
Query:  KRNQVYKRLDRYFLNHEMATKMVNFKAIEKKEEEILKLSTENEEQNFEQILEAKVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQI
        KR Q   ++D         T     K +EK+E+      T ++    ++I + + EL ++  ++     N S+  + +  +K  R       +++++NQI
Subjt:  KRNQVYKRLDRYFLNHEMATKMVNFKAIEKKEEEILKLSTENEEQNFEQILEAKVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQI

Query:  KGIYSGIGLWEEDEEKIGNIATEYFRKLFQSSNPDLMNNRKVVNC-VQTKINDDWRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGE
          I +  G    D  +I     EY++ L+ +   +L      ++     ++N +    L +P T SEI A + SL   K+PG DG  A FYQ Y + +  
Subjt:  KGIYSGIGLWEEDEEKIGNIATEYFRKLFQSSNPDLMNNRKVVNC-VQTKINDDWRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGE

Query:  NTVQTCLKVLNEDVDISLLNKTVITLILKV-KDSKHMREFRPISLCNVSYKIIAKTIANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKK
          ++    +  E +  +   +  I LI K  +D+     FRPISL N+  KI+ K +ANR ++ +  +I   Q  F+PG     N+      I  I   K
Subjt:  NTVQTCLKVLNEDVDISLLNKTVITLILKV-KDSKHMREFRPISLCNVSYKIIAKTIANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKK

Query:  QGKEGQIAIKLDMCKAYDRVKWGFIKEMLQRMGFENRWIKNIMKYVETVSFSVLINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNL
           +  + I +D  KA+D+++  F+ + L ++G +  ++K I    +  + ++++NG   + F    G RQG PLSP LF +  E  +  + +E E   +
Subjt:  QGKEGQIAIKLDMCKAYDRVKWGFIKEMLQRMGFENRWIKNIMKYVETVSFSVLINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNL

Query:  KGFRINNFCPSITHLFFADDSLLFCRSKREECQTLKEIFNLYEKASGQAINLEKSMFMASKNVKKEEAQEIGKILGVRQTKELGTYLGMPSQSHRSKTKM
        KG ++      +    FADD +++  +     Q L ++ + + K SG  IN++KS      N ++ E+Q +G++     +K +  YLG+  Q  R    +
Subjt:  KGFRINNFCPSITHLFFADDSLLFCRSKREECQTLKEIFNLYEKASGQAINLEKSMFMASKNVKKEEAQEIGKILGVRQTKELGTYLGMPSQSHRSKTKM

Query:  F-ENIKAKVSKILQVGDKKK
        F EN K  + +I +  +K K
Subjt:  F-ENIKAKVSKILQVGDKKK

P08548 LINE-1 reverse transcriptase homolog    1.9e-35    27.22
Query:  EMATKMVNFKAIEKKEEEILKLSTENEEQNFEQILEAKVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIYSGIGLWEEDEEK
        E+   M + K +EK+E    K S   E      I + + EL+++  +      N+S+  + +  +K  +   + + +++ ++ I  I +G      D  +
Subjt:  EMATKMVNFKAIEKKEEEILKLSTENEEQNFEQILEAKVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIYSGIGLWEEDEEK

Query:  IGNIATEYFRKLFQSSNPDLMN-NRKVVNCVQTKINDDWRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQTCLKVLNEDVDI
        I  I  EY++KL+     +L   ++ +  C   +++     ML +P + SEI +T+++L   K+PG DG  + FYQ + + +    +     +  E +  
Subjt:  IGNIATEYFRKLFQSSNPDLMN-NRKVVNCVQTKINDDWRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQTCLKVLNEDVDI

Query:  SLLNKTVITLILKV-KDSKHMREFRPISLCNVSYKIIAKTIANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQIAIKLDMCKA
        +   +  ITLI K  KD      +RPISL N+  KI+ K + NR ++ +  II   Q  F+PG     N+      I  I NK + K+  I + +D  KA
Subjt:  SLLNKTVITLILKV-KDSKHMREFRPISLCNVSYKIIAKTIANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQIAIKLDMCKA

Query:  YDRVKWGFIKEMLQRMGFENRWIKNIMKYVETVSFSVLINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFRINNFCPSITHLF
        +D ++  F+   L+++G E  ++K I       + ++++NG   K F    G RQG PLSP LF +  E    L     E + +KG  I +    I    
Subjt:  YDRVKWGFIKEMLQRMGFENRWIKNIMKYVETVSFSVLINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFRINNFCPSITHLF

Query:  FADDSLLFCRSKREECQTLKEIFNLYEKASGQAINLEKSM-FMASKNVKKEEA--QEIGKILGVRQTKELGTYL
        FADD +++  + R+    L E+   Y   SG  IN  KS+ F+ + N + E+     I   +  ++ K LG YL
Subjt:  FADDSLLFCRSKREECQTLKEIFNLYEKASGQAINLEKSM-FMASKNVKKEEA--QEIGKILGVRQTKELGTYL

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 3.8e-36; % identity: 29.83)
Query:  NFKAIEKKEEEILKLSTENEEQNFEQILEAKVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIYSGIGLWEEDEEKIGNIATE
        + KA+EKKE    K S   E      I++ + E++++         N+++  + +  +K  +     +   + +  I  I +  G    D E+I N    
Subjt:  NFKAIEKKEEEILKLSTENEEQNFEQILEAKVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIYSGIGLWEEDEEKIGNIATE

Query:  YFRKLFQSSNPDLMNNRKVVNCVQT-KINDDWRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQTCLKVLNE-DVDISLLN--
        ++++L+ +   +L    K ++  Q  K+N D    L  P +  EIEA + SL   K+PG DG  A FYQ    T  E+ +    K+ ++ +V+ +L N  
Subjt:  YFRKLFQSSNPDLMNNRKVVNCVQT-KINDDWRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQTCLKVLNE-DVDISLLN--

Query:  -KTVITLILK-VKDSKHMREFRPISLCNVSYKIIAKTIANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQIAIKLDMCKAYDR
         +  ITLI K  KD   +  FRPISL N+  KI+ K +ANR ++ + +II P Q  F+PG     N+      IH I NK + K   I I LD  KA+D+
Subjt:  -KTVITLILK-VKDSKHMREFRPISLCNVSYKIIAKTIANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQIAIKLDMCKAYDR

Query:  VKWGFIKEMLQRMGFENRWIKNIMKYVETVSFSVLINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQ-NLKGFRINNFCPSITHLFFA
        ++  F+ ++L+R G +  ++  I         ++ +NG   +      G RQG PLSPYLF +  E    +L R +  Q  +KG +I      I+ L  A
Subjt:  VKWGFIKEMLQRMGFENRWIKNIMKYVETVSFSVLINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQ-NLKGFRINNFCPSITHLFFA

Query:  DDSLLFCRSKREECQTLKEIFNLYEKASGQAINLEKSM-FMASKNVKKE-EAQEIG---------KILGVRQTKEL
        DD +++    +   + L  + N + +  G  IN  KSM F+ +KN + E E +E           K LGV  TKE+
Subjt:  DDSLLFCRSKREECQTLKEIFNLYEKASGQAINLEKSM-FMASKNVKKE-EAQEIG---------KILGVRQTKEL

P14381 Transposon TX1 uncharacterized 149 kDa protein (e value: 2.4e-30; % identity: 27.25)
Query:  IEKKEEEILKLS---TENEEQNFE-QILEAKVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIYSGIGLWEEDEEKIGNIATE
        IE    E+L L    + +E+Q  + + LE K  L  + + +      RS+   L + D+ +R+F++   ++  R QI  +++  G   ED E I + A  
Subjt:  IEKKEEEILKLS---TENEEQNFE-QILEAKVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIYSGIGLWEEDEEKIGNIATE

Query:  YFRKLFQSSNPDLMNNRKVVNCVQTKINDDWRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQTCLKVLNEDVDISLLNKTVI
        +++ LF S +P   +  + +      +++  +  LE P T  E+   ++ +  NK+PG DG    F+Q +WDT+G +  +   +   +        + V+
Subjt:  YFRKLFQSSNPDLMNNRKVVNCVQTKINDDWRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQTCLKVLNEDVDISLLNKTVI

Query:  TLILKVKDSKHMREFRPISLCNVSYKIIAKTIANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQIAIKLDMCKAYDRVKWGFI
        +L+ K  D + ++ +RP+SL +  YKI+AK I+ R K  L  +I P Q+  VPGR I DNV +  + +H  R           + LD  KA+DRV   ++
Subjt:  TLILKVKDSKHMREFRPISLCNVSYKIIAKTIANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQIAIKLDMCKAYDRVKWGFI

Query:  KEMLQRMGFENRWIKNIMKYVETVSFSVLINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFRINNFCPSITHLFFADDSLLFC
           LQ   F  +++  +     +    V IN S        RG+RQG PLS  L+ +  E F  LL      + L G  +      +    +ADD +L  
Subjt:  KEMLQRMGFENRWIKNIMKYVETVSFSVLINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFRINNFCPSITHLFFADDSLLFC

Query:  R-----SKREECQTLKEIFNLYEKASGQAINLEKSMFMASKNVK
        +      + +ECQ       +Y  AS   IN  KS  +   ++K
Subjt:  R-----SKREECQTLKEIFNLYEKASGQAINLEKSMFMASKNVK

P93295 Uncharacterized mitochondrial protein AtMg00310 (e value: 2.2e-20; % identity: 40.34)
Query:  DKKKLHWTKRNKLCRSK-SSGGMGFRELNFFNQAMLAKQSWRILKHPESLLYKVLCGRCFRNEDFLKASVGSNAPQTWKSIVWGRDLFKEGYRWRIGNGE
        +K+K+ W    KLC+SK   GG+GFR+L +FNQA+LAKQS+RI+  P +LL ++L  R F +   ++ SVG+     W+SI+ GR+L   G    IG+G 
Subjt:  DKKKLHWTKRNKLCRSK-SSGGMGFRELNFFNQAMLAKQSWRILKHPESLLYKVLCGRCFRNEDFLKASVGSNAPQTWKSIVWGRDLFKEGYRWRIGNGE

Query:  YTYIDQDPWLIRKGNSSPI
        +T +  D W++ +    P+
Subjt:  YTYIDQDPWLIRKGNSSPI

Arabidopsis top hits (e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein (e value: 4.9e-15; % identity: 30.37)
Query:  EDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIYSGIGLWEEDEEKIGNIATEYFRKLFQSSN----PDLMNNRKVVNCVQTKINDDWRRMLEQ
        E ++R +S+  WL++GD NTR+FH      + +N IK +     +  E+  ++  +   Y+  L  S +    PD +   K ++    + ND     L  
Subjt:  EDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIYSGIGLWEEDEEKIGNIATEYFRKLFQSSN----PDLMNNRKVVNCVQTKINDDWRRMLEQ

Query:  PYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQTCLKVLNEDVDISLLNKTVITLILKVKDSKHMREFRPISLCNVSYKII
          +  EI A V ++  NKAPG D   A F+   W  V ++T+    +       +   N T ITLI KV     +  FRP+S C V YKII
Subjt:  PYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQTCLKVLNEDVDISLLNKTVITLILKVKDSKHMREFRPISLCNVSYKII

AT4G20520.1 RNA binding;RNA-directed DNA polymerases (e value: 1.9e-14; % identity: 37.5)
Query:  IANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQIAIKLDMCKAYDRVKWGFIKEMLQRMGFENRWIKNIMK
        +  R K  + ++I P QA+F+PGR+ +DN++   E +H++R KK G +G + +KLD+ KAYDR++W ++++ L   GF   W+  I +
Subjt:  IANRFKKELNSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQIAIKLDMCKAYDRVKWGFIKEMLQRMGFENRWIKNIMK

AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 5.0e-20; % identity: 34.09)
Query:  DKKKLHWTKRNKLCRSKSSGGMGFRELNFFNQAMLAKQSWRILKHPESLLYKVLCGRCFRNEDFLKASVGSNAPQTWKSIVWGRDLFKEGYRWRIGNGEY
        + K +HW   + L   K+ GG+GF+++  FN A+L KQ WR+L  PESL+ KV   R F   D L A +GS     WKSI   +++ ++G R  +GNGE 
Subjt:  DKKKLHWTKRNKLCRSKSSGGMGFRELNFFNQAMLAKQSWRILKHPESLLYKVLCGRCFRNEDFLKASVGSNAPQTWKSIVWGRDLFKEGYRWRIGNGEY

Query:  TYIDQDPWLIRKGNSSPIWVL----EELRG----RRVKEIIREDGK-WDEDHIKRLFLPMDVEGILSI-PLGNKVL
          I +  WL  K  S+ + +     +E        +V ++I E G+ W +D I+ LF  ++ + I  + P G ++L
Subjt:  TYIDQDPWLIRKGNSSPIWVL----EELRG----RRVKEIIREDGK-WDEDHIKRLFLPMDVEGILSI-PLGNKVL

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 1.6e-21; % identity: 40.34)
Query:  DKKKLHWTKRNKLCRSK-SSGGMGFRELNFFNQAMLAKQSWRILKHPESLLYKVLCGRCFRNEDFLKASVGSNAPQTWKSIVWGRDLFKEGYRWRIGNGE
        +K+K+ W    KLC+SK   GG+GFR+L +FNQA+LAKQS+RI+  P +LL ++L  R F +   ++ SVG+     W+SI+ GR+L   G    IG+G 
Subjt:  DKKKLHWTKRNKLCRSK-SSGGMGFRELNFFNQAMLAKQSWRILKHPESLLYKVLCGRCFRNEDFLKASVGSNAPQTWKSIVWGRDLFKEGYRWRIGNGE

Query:  YTYIDQDPWLIRKGNSSPI
        +T +  D W++ +    P+
Subjt:  YTYIDQDPWLIRKGNSSPI

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) (e value: 5.8e-16; % identity: 55.88)
Query:  LINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFRINNFCPSITHLFFADDS
        +ING+ Q    PSRG+RQGDPLSPYLF++C E  SGL  R  E   L G R++N  P I HL FADD+
Subjt:  LINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFRINNFCPSITHLFFADDS
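
The % identity values listed for each hit can be roughly cross-checked against the alignment blocks. In BLAST-style output the middle line shows a letter where query and subject are identical, `+` for a conservative substitution, and a space otherwise. The sketch below is a minimal illustration under that assumption: it counts the letters on the middle line and divides by the alignment length (gap characters included). The function name `percent_identity` is hypothetical and not part of any CuGenDB or BLAST tool, and the database may compute its figure over a longer alignment than the block shown.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    Assumes a BLAST-style midline: letter = identical residue,
    '+' = positive substitution, space = mismatch or gap.
    The alignment length is taken as len(query), gaps included.
    """
    if not query:
        raise ValueError("empty alignment")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(query), 2)


# Query and middle line of the ATMG01250.1 alignment shown above.
query = "LINGSSQKEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLFREVENQNLKGFRINNFCPSITHLFFADDS"
midline = "+ING+ Q    PSRG+RQGDPLSPYLF++C E  SGL  R  E   L G R++N  P I HL FADD+"

print(percent_identity(query, midline))
```

For multi-block alignments, the query and middle-line segments would need to be concatenated across blocks before calling the function.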


Sequences
CDS sequence
ATGGAGAAGCTGAGCGAGAAGGACGACTTGGCGTGGTTAATAGGCGGTGACTTTAATGAGATTGTAGCGGACTCTGAGAAAAAAGGTGGAGCTAGAAAGAACCCGAGGCA
AATGGACCTTTTCAGAGAAACAATAAATCGTTGTAAACTTAGGGCTTGGGGCTACTCTAGAAATAAATTCACATGGAGAAGGAGCAGAAACAAAAGGAACCAAGTTTATA
AAAGACTTGATAGATATTTTCTAAACCATGAGATGGCAACGAAAATGGTAAACTTCAAGGCCATTGAGAAGAAAGAGGAGGAGATTCTAAAGCTCTCTACTGAGAATGAG
GAGCAGAATTTCGAACAGATTCTTGAAGCAAAAGTAGAACTGGACAAACTTTTGGAGGAAGAAGAAGATTACTGGAGGAACCGATCTCAGGAAACCTGGCTGAAAAATGG
GGACAAAAACACTAGATGGTTTCATTCCAAATCATCCCAAAGAAAGCAAAGGAACCAAATCAAAGGGATATACTCAGGAATTGGCTTGTGGGAGGAGGATGAAGAGAAAA
TAGGAAATATAGCTACTGAATATTTTCGAAAGCTTTTTCAATCCTCCAACCCAGACTTAATGAACAATAGAAAGGTGGTGAATTGTGTCCAAACCAAAATTAATGATGAC
TGGAGGAGGATGCTAGAACAACCTTACACACGAAGTGAGATTGAAGCCACAGTGAAAAGCCTAAGTCCTAACAAAGCTCCCGGAAGTGACGGAGCCCACGCCACCTTCTA
TCAAGGATACTGGGATACGGTTGGGGAAAATACAGTCCAAACATGCCTCAAAGTCCTAAACGAGGATGTTGATATAAGTTTGTTGAACAAGACCGTTATAACTCTAATCC
TGAAAGTAAAAGACTCCAAACACATGAGAGAGTTCAGACCAATCAGTCTATGCAACGTTAGCTATAAAATAATAGCTAAAACAATTGCAAACAGGTTCAAGAAAGAGTTG
AATTCAATAATATCACCCACCCAAGCTGCTTTTGTCCCAGGAAGACTTATATCTGATAATGTCATAATAGGCTTTGAATGTATCCATGCTATTCGAAATAAGAAGCAAGG
TAAGGAAGGGCAGATCGCCATCAAGCTGGACATGTGCAAAGCCTATGACAGGGTGAAATGGGGCTTTATAAAAGAAATGCTTCAAAGAATGGGGTTCGAAAACAGATGGA
TCAAAAACATTATGAAATACGTGGAAACGGTATCATTCTCTGTCCTTATCAATGGCTCGTCGCAGAAAGAATTCAAACCATCCAGAGGAATCAGACAAGGAGACCCGCTA
TCCCCATACCTATTCTTGGTATGTGCTGAAGGTTTTTCGGGTCTGCTATTCAGGGAAGTTGAAAACCAAAACCTCAAAGGCTTTCGTATTAATAATTTTTGTCCAAGTAT
CACCCACTTATTCTTCGCTGATGATAGTCTTTTGTTTTGCAGGTCTAAAAGGGAGGAGTGCCAGACATTGAAAGAGATTTTCAACCTATATGAGAAGGCCTCGGGACAAG
CCATAAACCTAGAGAAATCTATGTTTATGGCGAGCAAAAATGTGAAAAAGGAGGAAGCCCAAGAGATTGGCAAAATCCTAGGTGTGAGACAAACAAAGGAGCTAGGAACA
TACCTAGGAATGCCTTCTCAAAGCCACAGAAGCAAAACCAAAATGTTTGAGAACATTAAAGCAAAAGTTTCCAAAATCCTCCAAGTGGGGGATAAAAAGAAGCTACATTG
GACTAAACGGAACAAGTTATGTCGAAGCAAATCCTCGGGTGGCATGGGTTTTAGGGAACTAAACTTTTTTAACCAAGCGATGTTGGCCAAGCAAAGTTGGAGAATCCTTA
AACACCCTGAAAGTCTTCTCTACAAAGTTCTGTGTGGCCGTTGCTTTAGGAATGAGGATTTTCTTAAAGCCTCGGTAGGATCCAACGCACCCCAAACATGGAAAAGCATT
GTGTGGGGGAGAGACCTCTTTAAGGAGGGGTATAGATGGAGAATTGGCAACGGAGAATATACGTACATAGACCAAGACCCGTGGTTAATCAGAAAAGGCAATAGTTCCCC
TATATGGGTCCTAGAGGAGCTAAGAGGTAGAAGAGTAAAGGAAATCATTAGGGAGGATGGGAAATGGGATGAAGATCATATCAAAAGATTGTTTCTTCCCATGGATGTCG
AAGGCATATTATCAATTCCCTTGGGCAACAAAGTGTTGGGATTTGTGCCCTAA
mRNA sequence
ATGGAGAAGCTGAGCGAGAAGGACGACTTGGCGTGGTTAATAGGCGGTGACTTTAATGAGATTGTAGCGGACTCTGAGAAAAAAGGTGGAGCTAGAAAGAACCCGAGGCA
AATGGACCTTTTCAGAGAAACAATAAATCGTTGTAAACTTAGGGCTTGGGGCTACTCTAGAAATAAATTCACATGGAGAAGGAGCAGAAACAAAAGGAACCAAGTTTATA
AAAGACTTGATAGATATTTTCTAAACCATGAGATGGCAACGAAAATGGTAAACTTCAAGGCCATTGAGAAGAAAGAGGAGGAGATTCTAAAGCTCTCTACTGAGAATGAG
GAGCAGAATTTCGAACAGATTCTTGAAGCAAAAGTAGAACTGGACAAACTTTTGGAGGAAGAAGAAGATTACTGGAGGAACCGATCTCAGGAAACCTGGCTGAAAAATGG
GGACAAAAACACTAGATGGTTTCATTCCAAATCATCCCAAAGAAAGCAAAGGAACCAAATCAAAGGGATATACTCAGGAATTGGCTTGTGGGAGGAGGATGAAGAGAAAA
TAGGAAATATAGCTACTGAATATTTTCGAAAGCTTTTTCAATCCTCCAACCCAGACTTAATGAACAATAGAAAGGTGGTGAATTGTGTCCAAACCAAAATTAATGATGAC
TGGAGGAGGATGCTAGAACAACCTTACACACGAAGTGAGATTGAAGCCACAGTGAAAAGCCTAAGTCCTAACAAAGCTCCCGGAAGTGACGGAGCCCACGCCACCTTCTA
TCAAGGATACTGGGATACGGTTGGGGAAAATACAGTCCAAACATGCCTCAAAGTCCTAAACGAGGATGTTGATATAAGTTTGTTGAACAAGACCGTTATAACTCTAATCC
TGAAAGTAAAAGACTCCAAACACATGAGAGAGTTCAGACCAATCAGTCTATGCAACGTTAGCTATAAAATAATAGCTAAAACAATTGCAAACAGGTTCAAGAAAGAGTTG
AATTCAATAATATCACCCACCCAAGCTGCTTTTGTCCCAGGAAGACTTATATCTGATAATGTCATAATAGGCTTTGAATGTATCCATGCTATTCGAAATAAGAAGCAAGG
TAAGGAAGGGCAGATCGCCATCAAGCTGGACATGTGCAAAGCCTATGACAGGGTGAAATGGGGCTTTATAAAAGAAATGCTTCAAAGAATGGGGTTCGAAAACAGATGGA
TCAAAAACATTATGAAATACGTGGAAACGGTATCATTCTCTGTCCTTATCAATGGCTCGTCGCAGAAAGAATTCAAACCATCCAGAGGAATCAGACAAGGAGACCCGCTA
TCCCCATACCTATTCTTGGTATGTGCTGAAGGTTTTTCGGGTCTGCTATTCAGGGAAGTTGAAAACCAAAACCTCAAAGGCTTTCGTATTAATAATTTTTGTCCAAGTAT
CACCCACTTATTCTTCGCTGATGATAGTCTTTTGTTTTGCAGGTCTAAAAGGGAGGAGTGCCAGACATTGAAAGAGATTTTCAACCTATATGAGAAGGCCTCGGGACAAG
CCATAAACCTAGAGAAATCTATGTTTATGGCGAGCAAAAATGTGAAAAAGGAGGAAGCCCAAGAGATTGGCAAAATCCTAGGTGTGAGACAAACAAAGGAGCTAGGAACA
TACCTAGGAATGCCTTCTCAAAGCCACAGAAGCAAAACCAAAATGTTTGAGAACATTAAAGCAAAAGTTTCCAAAATCCTCCAAGTGGGGGATAAAAAGAAGCTACATTG
GACTAAACGGAACAAGTTATGTCGAAGCAAATCCTCGGGTGGCATGGGTTTTAGGGAACTAAACTTTTTTAACCAAGCGATGTTGGCCAAGCAAAGTTGGAGAATCCTTA
AACACCCTGAAAGTCTTCTCTACAAAGTTCTGTGTGGCCGTTGCTTTAGGAATGAGGATTTTCTTAAAGCCTCGGTAGGATCCAACGCACCCCAAACATGGAAAAGCATT
GTGTGGGGGAGAGACCTCTTTAAGGAGGGGTATAGATGGAGAATTGGCAACGGAGAATATACGTACATAGACCAAGACCCGTGGTTAATCAGAAAAGGCAATAGTTCCCC
TATATGGGTCCTAGAGGAGCTAAGAGGTAGAAGAGTAAAGGAAATCATTAGGGAGGATGGGAAATGGGATGAAGATCATATCAAAAGATTGTTTCTTCCCATGGATGTCG
AAGGCATATTATCAATTCCCTTGGGCAACAAAGTGTTGGGATTTGTGCCCTAA
Protein sequence
MEKLSEKDDLAWLIGGDFNEIVADSEKKGGARKNPRQMDLFRETINRCKLRAWGYSRNKFTWRRSRNKRNQVYKRLDRYFLNHEMATKMVNFKAIEKKEEEILKLSTENE
EQNFEQILEAKVELDKLLEEEEDYWRNRSQETWLKNGDKNTRWFHSKSSQRKQRNQIKGIYSGIGLWEEDEEKIGNIATEYFRKLFQSSNPDLMNNRKVVNCVQTKINDD
WRRMLEQPYTRSEIEATVKSLSPNKAPGSDGAHATFYQGYWDTVGENTVQTCLKVLNEDVDISLLNKTVITLILKVKDSKHMREFRPISLCNVSYKIIAKTIANRFKKEL
NSIISPTQAAFVPGRLISDNVIIGFECIHAIRNKKQGKEGQIAIKLDMCKAYDRVKWGFIKEMLQRMGFENRWIKNIMKYVETVSFSVLINGSSQKEFKPSRGIRQGDPL
SPYLFLVCAEGFSGLLFREVENQNLKGFRINNFCPSITHLFFADDSLLFCRSKREECQTLKEIFNLYEKASGQAINLEKSMFMASKNVKKEEAQEIGKILGVRQTKELGT
YLGMPSQSHRSKTKMFENIKAKVSKILQVGDKKKLHWTKRNKLCRSKSSGGMGFRELNFFNQAMLAKQSWRILKHPESLLYKVLCGRCFRNEDFLKASVGSNAPQTWKSI
VWGRDLFKEGYRWRIGNGEYTYIDQDPWLIRKGNSSPIWVLEELRGRRVKEIIREDGKWDEDHIKRLFLPMDVEGILSIPLGNKVLGFVP
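
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code applies (the gene is annotated on chr2, a nuclear chromosome); the names `CODON_TABLE` and `translate` are illustrative and not taken from the database.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 1.

    Whitespace and line breaks are removed first, so the wrapped
    sequence lines can be pasted in directly. Codons containing
    characters other than A, C, G, T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon reached
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGGAGAAGCTGAGCGAGAAGGACGACTTG"))  # MEKLSEKDDL
```

Passing the full CDS to `translate` and comparing the result with the listed protein sequence (line breaks removed) gives a quick consistency check between the two records.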