; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011141 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011141
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr1:15567387..15570560
RNA-Seq ExpressionLag0011141
SyntenyLag0011141
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]1.7e-15234.75Show/hide
Query:  KVKLGYDHCFDVPSNGLSGSLMLLWKKELTVKVNSFSKGHIDANLLVKNE-WWSFTGFYGSPEREKRSESWELVKRFHALENISWLMGGDFNEVLLDNEK
        +++LG +    V   GL G L L W+    +++ S+S GHI   +   N+  +  TGFYG P+ ++R+ SWE+++R       +W++ GDFNE+L   +K
Subjt:  KVKLGYDHCFDVPSNGLSGSLMLLWKKELTVKVNSFSKGHIDANLLVKNE-WWSFTGFYGSPEREKRSESWELVKRFHALENISWLMGGDFNEVLLDNEK

Query:  KGGNPKKQSDMDAFKDVLNLCSLVDLGYKGEKYTWKRRDRKGDIIKERLDRFVANTSLIDKAYKLEISHLNYHQSDHRPILDIENYSPHPN-NRKNPKKL
        +GG  + Q  M+ FK  L  C L    + G  +TW RR   G +++ERLDR VAN     +   L  SHL    SDH PIL +E  +  P  + K   + 
Subjt:  KGGNPKKQSDMDAFKDVLNLCSLVDLGYKGEKYTWKRRDRKGDIIKERLDRFVANTSLIDKAYKLEISHLNYHQSDHRPILDIENYSPHPN-NRKNPKKL

Query:  KFEERWVQFEECKNIVKGNWMSGRKENAGDINCKVESILHKLADWNKVRLGGSIQGAVERKYKELQELNKGLGQSNEVAIAKAEQELSCLLEEE-----K
         FEE W +  +   +++  W     +    ++  +     +L  WN +    +++  ++  YKEL  L   L     V  AK E+ +S LLE++     +
Subjt:  KFEERWVQFEECKNIVKGNWMSGRKENAGDINCKVESILHKLADWNKVRLGGSIQGAVERKYKELQELNKGLGQSNEVAIAKAEQELSCLLEEE-----K

Query:  HNGSWVEEEEEIGAVANKYFETLFASSNPNPEDIQKAIQHIEPRVSDKQREELRRPFSKIEIEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVC
         +  W  EE+ IG +   YF+TLF+S+    + +++ +  + P ++    + L + F++ E+E  L  +  TKAPG DG  ALF+Q YW IVG+ + K C
Subjt:  HNGSWVEEEEEIGAVANKYFETLFASSNPNPEDIQKAIQHIEPRVSDKQREELRRPFSKIEIEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVC

Query:  LNTLNGEDSLGPLNTTWIAIIPKSASPKHMEGFKPISLCNTVYKIISKALANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFL
        L  LNGE S+   N T IA+IPK   P  +  F+PISLC TVYK+I+K +ANR+K ++  +I+++QS F+P R+I DNV+A F+ +H+I   K G++  +
Subjt:  LNTLNGEDSLGPLNTTWIAIIPKSASPKHMEGFKPISLCNTVYKIISKALANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFL

Query:  AAKLDMSKVYDRVEWEFIRQTMLKL---------------------------------------------------------------------------
        A KLDM+K YDRVEW F+R+ MLKL                                                                           
Subjt:  AAKLDMSKVYDRVEWEFIRQTMLKL---------------------------------------------------------------------------

Query:  ------------DSGAYMR----DYEVLKEILVEYEVVSRQNVNLEKSSCLVSKSVDRGQAALLSNILGIKLTNSLGDYLGMPSQTGRNKNKVFWKVKDK
                    DS  +M+        L+ +   YE VS Q +N  KS+  +S +  R    ++  +L + +      YLG+P+  G+ + ++F  +KDK
Subjt:  ------------DSGAYMR----DYEVLKEILVEYEVVSRQNVNLEKSSCLVSKSVDRGQAALLSNILGIKLTNSLGDYLGMPSQTGRNKNKVFWKVKDK

Query:  VWKVLQRWKEKFFSAGGKEVLIKAVAQAIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKENGGLGFRELSLFNQAMLAKKSW
        +WK +  WKEK  S  GKE+L+KAV QAIP Y+MSCF++P  +C+E+N + A+F W    +K   HW+ W+ LC SK  GGLGFR+L  FNQA+LAK+ W
Subjt:  VWKVLQRWKEKFFSAGGKEVLIKAVAQAIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKENGGLGFRELSLFNQAMLAKKSW

Query:  RIVKNPNILVSRILRGKYFKKSSFLEAPLGPNPSLTWRSILWGRDLFKLGMRWRVGNGQYIKIGEDPWI
        RI++ P  LV+RI R +Y     FLEA +G NPS  WRS+ WG++L   G+RWRVGNG  I++  D W+
Subjt:  RIVKNPNILVSRILRGKYFKKSSFLEAPLGPNPSLTWRSILWGRDLFKLGMRWRVGNGQYIKIGEDPWI

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]2.4e-15435.99Show/hide
Query:  RLKVKLGYDHCFDVPSNGLSGSLMLLWKKELTVKVNSFSKGHIDANLLVK--NEWWSFTGFYGSPEREKRSESWELVKRFHALENISWLMGGDFNEVLLD
        R+K ++G+ +   VP  G SG + LLW +E+ ++V S+++ HIDA +     +  W  TGFYG PE  KR +SW L+   ++   + WL  GDFNE+L  
Subjt:  RLKVKLGYDHCFDVPSNGLSGSLMLLWKKELTVKVNSFSKGHIDANLLVK--NEWWSFTGFYGSPEREKRSESWELVKRFHALENISWLMGGDFNEVLLD

Query:  NEKKGGNPKKQSDMDAFKDVLNLCSLVDLGYKGEKYTWKRRDRKGDIIKERLDRFVANTSLIDKAYKLEISHLNYHQSDHRPILDIENYSPHPNNRKNPK
        NEK GG  + Q  MD F+D++N C   DLGY G  YTW       + I  RLDR +A      K   +++ HL     DH  +L  +N   H   R   K
Subjt:  NEKKGGNPKKQSDMDAFKDVLNLCSLVDLGYKGEKYTWKRRDRKGDIIKERLDRFVANTSLIDKAYKLEISHLNYHQSDHRPILDIENYSPHPNNRKNPK

Query:  KLKFEERWVQFEECKNIVKGNWMSGRKENAGD-INCKVESILHKLADWNKVRLGGSIQGAVERKYKELQEL-NKGLGQSNEVAIAKAEQELSCLLEEEK-
        +  FE +W + E+CK I++ +W  G   +  + I+  +     +L+ W+   + G I   ++ K   L  L  + L +   + I +  +E++ LL++E+ 
Subjt:  KLKFEERWVQFEECKNIVKGNWMSGRKENAGD-INCKVESILHKLADWNKVRLGGSIQGAVERKYKELQEL-NKGLGQSNEVAIAKAEQELSCLLEEEK-

Query:  ---------------------------------------HNGSWVEEEEEIGAVANKYFETLFASSNPNPEDIQKAIQHIEPRVSDKQREELRRPFSKIE
                                                 G W + EE I   A  YF  +++SS+P+   I++  + I  +V+++  E L R F+K E
Subjt:  ---------------------------------------HNGSWVEEEEEIGAVANKYFETLFASSNPNPEDIQKAIQHIEPRVSDKQREELRRPFSKIE

Query:  IEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLNGEDSLGPLNTTWIAIIPKSASPKHMEGFKPISLCNTVYKIISKALANRMKWMMDSI
        +   LK +   KAPGPDG  A+F+Q YW IVG ++T + LN LN    +  LN T I++IPK+ +PK M  F+PISLCN VYK+ISK LANR+K ++  I
Subjt:  IEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLNGEDSLGPLNTTWIAIIPKSASPKHMEGFKPISLCNTVYKIISKALANRMKWMMDSI

Query:  ISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFLAAKLDMSKVYDRVEWEFIRQTMLKLDSGAYMRD----------------------------
        IS++QS F   RLI+DNV+  F+ +H ++ K AGKEGF+A KLDMSK +DRVEW FI + M ++      RD                            
Subjt:  ISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFLAAKLDMSKVYDRVEWEFIRQTMLKLDSGAYMRD----------------------------

Query:  ------------------------------------------------------------YE---VLKEILVEYEVVSRQNVNLEKSSCLVSKSVDRGQA
                                                                    YE   +L+ IL +YE  S Q +N +KSS   S +  +   
Subjt:  ------------------------------------------------------------YE---VLKEILVEYEVVSRQNVNLEKSSCLVSKSVDRGQA

Query:  ALLSNILGIKLTNSLGDYLGMPSQTGRNKNKVFWKVKDKVWKVLQRWKEKFFSAGGKEVLIKAVAQAIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGN
          + NILG    +    YLG+PS  GR+K++VF  +K+KV   L  WK K  S GGKE+LIKAVAQAIP YTMSCF LP  +C++M R+   F WG    
Subjt:  ALLSNILGIKLTNSLGDYLGMPSQTGRNKNKVFWKVKDKVWKVLQRWKEKFFSAGGKEVLIKAVAQAIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGN

Query:  KNKAHWMSWKRLCVSKENGGLGFRELSLFNQAMLAKKSWRIVKNPNILVSRILRGKYFKKSSFLEAPLGPNPSLTWRSILWGRDLFKLGMRWRVGNGQYI
        + K  W+SWKR+C SK +GGLGFR L  FN AMLAK++WRI+ NPN LV R+L+ +YF     L A LG +PS +WRSI    ++ + G RWRVGNG+ I
Subjt:  KNKAHWMSWKRLCVSKENGGLGFRELSLFNQAMLAKKSWRIVKNPNILVSRILRGKYFKKSSFLEAPLGPNPSLTWRSILWGRDLFKLGMRWRVGNGQYI

Query:  KIGEDPWIIGLNGYKPV
         I ED W+   + YK +
Subjt:  KIGEDPWIIGLNGYKPV

XP_028090832.1 uncharacterized protein LOC114291041 [Camellia sinensis]1.4e-15134.84Show/hide
Query:  MKTLCWNIWEVGNPRTVRALR-------------------------LKVKLGYDHCFDVPSNGLSGSLMLLWKKELTVKVNSFSKGHID--ANLLVKNEW
        M  LCWN   +GNPR+VR+L+                         ++ KLG+   F V   GL+G L LLW   + V V S S GHID   +  +    
Subjt:  MKTLCWNIWEVGNPRTVRALR-------------------------LKVKLGYDHCFDVPSNGLSGSLMLLWKKELTVKVNSFSKGHID--ANLLVKNEW

Query:  WSFTGFYGSPEREKRSESWELVKRFHALENISWLMGGDFNEVLLDNEKKGGNPKKQSDMDAFKDVLNLCSLVDLGYKGEKYTWKRRDRKGDIIKERLDRF
        W F GFYG PE  +R  SW L++R      + WL  GDFNE+L   EK G   +  S M+ F++V+    L DLG+ G  +TW        +++ERLDR 
Subjt:  WSFTGFYGSPEREKRSESWELVKRFHALENISWLMGGDFNEVLLDNEKKGGNPKKQSDMDAFKDVLNLCSLVDLGYKGEKYTWKRRDRKGDIIKERLDRF

Query:  VANTSLIDKAYKLEISHLNYH-----QSDHRPI-LDIENYSPHPNNRKNPKKL-KFEERWVQFEECKNIVKGNWMSGRKENAGDINCKVESILHKLADWN
         AN      A++L   H   +      S+H PI +DI        +  +  ++ +FE  W++   C+ +V  NW+     N   +   +  +   L  W+
Subjt:  VANTSLIDKAYKLEISHLNYH-----QSDHRPI-LDIENYSPHPNNRKNPKKL-KFEERWVQFEECKNIVKGNWMSGRKENAGDINCKVESILHKLADWN

Query:  KVRLGGSIQGAVERKYKE-LQELNKGLGQSNEVAIAKAEQ-ELSC----LLEEE-----------------------------KH-----------NGSW
        +     ++ G V+R+ KE   +L + L  S+      AE  +L C    LLE+E                             +H           NG W
Subjt:  KVRLGGSIQGAVERKYKE-LQELNKGLGQSNEVAIAKAEQ-ELSC----LLEEE-----------------------------KH-----------NGSW

Query:  VEEEEEIGAVANKYFETLFASSNPNPEDIQKAIQHIEPRVSDKQREELRRPFSKIEIEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLN
         E++  +      YF+ LF++      ++   +Q I+ RVSD    +L RPFS++E+   L  +  TKAPGPDG  ALF+Q +W +VG  ++ V L  LN
Subjt:  VEEEEEIGAVANKYFETLFASSNPNPEDIQKAIQHIEPRVSDKQREELRRPFSKIEIEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLN

Query:  GEDSLGPLNTTWIAIIPKSASPKHMEGFKPISLCNTVYKIISKALANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFLAAKLD
        G D +  +N T+I +IPK  +PK M  F+PISLCN VYKIISK LANR+K ++ S+IS +QS FIPGRLI+DN +  F+  H + +K+AGK+G L  KLD
Subjt:  GEDSLGPLNTTWIAIIPKSASPKHMEGFKPISLCNTVYKIISKALANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFLAAKLD

Query:  MSKVYDRVEWEFIRQTMLKLD----------------------SGAYMR---------------------------------------------------
        MSK YDRVEW FI Q ML L                       +G + R                                                   
Subjt:  MSKVYDRVEWEFIRQTMLKLD----------------------SGAYMR---------------------------------------------------

Query:  ------------------DYEVLKEILVEYEVVSRQNVNLEKSSCLVSKSVDRGQAALLSNILGIKLTNSLGDYLGMPSQTGRNKNKVFWKVKDKVWKVL
                          +  VL++IL +YE  SRQ +N +KS+   SK+ D     ++++ILG+    S G YLG+P   G +K  V   VK++VW  L
Subjt:  ------------------DYEVLKEILVEYEVVSRQNVNLEKSSCLVSKSVDRGQAALLSNILGIKLTNSLGDYLGMPSQTGRNKNKVFWKVKDKVWKVL

Query:  QRWKEKFFSAGGKEVLIKAVAQAIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKENGGLGFRELSLFNQAMLAKKSWRIVKN
          WKEK  S  G+EVLIK+VAQAIP Y MSCF+LP  +C E+  +   F WG      K H  SWK+L  SK  GG+GFR    FN A+LAK+ WR+V+ 
Subjt:  QRWKEKFFSAGGKEVLIKAVAQAIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKENGGLGFRELSLFNQAMLAKKSWRIVKN

Query:  PNILVSRILRGKYFKKSSFLEAPLGPNPSLTWRSILWGRDLFKLGMRWRVGNGQYIKIGEDPWIIGLNGYKPV
        P  LV+R+ + +YF ++SFLEA +G NPS TWRSI+  R L  +G+RWRVGNG  I +  D W+   + +K +
Subjt:  PNILVSRILRGKYFKKSSFLEAPLGPNPSLTWRSILWGRDLFKLGMRWRVGNGQYIKIGEDPWIIGLNGYKPV

XP_030936391.1 uncharacterized protein LOC115961572 [Quercus lobata]4.0e-15434.72Show/hide
Query:  MKTLCWNIWEVGNPRTVRAL-------------------------RLKVKLGYDHCFDVPSNGLSGSLMLLWKKELTVKVNSFSKGHIDANLLVKNEW--
        M  L WN   +GN RTV+AL                         ++K +    H   V S+G  G L LLWK+ +TVK+N++++ HIDA   ++  W  
Subjt:  MKTLCWNIWEVGNPRTVRAL-------------------------RLKVKLGYDHCFDVPSNGLSGSLMLLWKKELTVKVNSFSKGHIDANLLVKNEW--

Query:  --WSFTGFYGSPEREKRSESWELVKRFHALENISWLMGGDFNEVLLDNEKKGGNPKKQSDMDAFKDVLNLCSLVDLGYKGEKYTWKRRDRKGDIIKERLD
          W FTGFYG+P+  +R ESW  +K      ++ WL  GDFNE+    EK+GG  + +  M+ F D +N C   ++ + G KYTW      G  I+ERLD
Subjt:  --WSFTGFYGSPEREKRSESWELVKRFHALENISWLMGGDFNEVLLDNEKKGGNPKKQSDMDAFKDVLNLCSLVDLGYKGEKYTWKRRDRKGDIIKERLD

Query:  RFVANTSLIDKAYKLEISHLNYHQSDHRPILDIENYSPHPNNRKNPKKLKFEERWVQFEECKNIVKGNWMSGRKENA-GDINCKVESILHKLADWNKVRL
        R +AN   +D     ++ HL+   SDH P+    +  P    +K  K  +FE  W++   C+ IVK  W  G    A G +   +E   H L  WNK   
Subjt:  RFVANTSLIDKAYKLEISHLNYHQSDHRPILDIENYSPHPNNRKNPKKLKFEERWVQFEECKNIVKGNWMSGRKENA-GDINCKVESILHKLADWNKVRL

Query:  GGSIQGAVERKYKELQE------------------------LNKGLGQSNEV---------------------AIAKAEQELSCLLEEEKHNGSWVEEEE
             G V RK  ELQ+                        LNK L + +E+                     A A A  + + +       G W E+E 
Subjt:  GGSIQGAVERKYKELQE------------------------LNKGLGQSNEV---------------------AIAKAEQELSCLLEEEKHNGSWVEEEE

Query:  EIGAVANKYFETLFASSNPNPEDIQKAIQHIEPRVSDKQREELRRPFSKIEIEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLNGEDSL
        +I  VA  YFE LF SS   PE+    +  ++P+V+     EL R ++  E+   LK +   KAPGPDG   LF+Q +W+  GE +T   L+ LN   S 
Subjt:  EIGAVANKYFETLFASSNPNPEDIQKAIQHIEPRVSDKQREELRRPFSKIEIEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLNGEDSL

Query:  GPLNTTWIAIIPKSASPKHMEGFKPISLCNTVYKIISKALANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFLAAKLDMSKVY
           N T I +IPK   PKH+  ++PISLCN  YKI SKA+ANR+K  + SIIS +QS F+ GRLI+DNV+  F+++H I+ KK GK G +A KLDMSK Y
Subjt:  GPLNTTWIAIIPKSASPKHMEGFKPISLCNTVYKIISKALANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFLAAKLDMSKVY

Query:  DRVEWEFIRQTMLKLDSGAYMR------------------------------------------------------------------------------
        DRVEW F+ + M KL     +R                                                                              
Subjt:  DRVEWEFIRQTMLKLDSGAYMR------------------------------------------------------------------------------

Query:  -------------DYEVLKEILVEYEVVSRQNVNLEKSSCLVSKSVDRGQAALLSNILGIKLTNSLGDYLGMPSQTGRNKNKVFWKVKDKVWKVLQRWKE
                     + + L+ +L  YE  S Q +N  K+S   S +  +     +    G ++      YLG+PS  G+NK   F  +K+K+ K L  WKE
Subjt:  -------------DYEVLKEILVEYEVVSRQNVNLEKSSCLVSKSVDRGQAALLSNILGIKLTNSLGDYLGMPSQTGRNKNKVFWKVKDKVWKVLQRWKE

Query:  KFFSAGGKEVLIKAVAQAIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKENGGLGFRELSLFNQAMLAKKSWRIVKNPNILV
        K  S  GKE+LIKAVA A+P YTMSCFKLP ++C+E+  +  KF WG V N+N+  W+SW ++C SK NGG+GF+ L LFN A+LAK+ WR+    + LV
Subjt:  KFFSAGGKEVLIKAVAQAIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKENGGLGFRELSLFNQAMLAKKSWRIVKNPNILV

Query:  SRILRGKYFKKSSFLEAPLGPNPSLTWRSILWGRDLFKLGMRWRVGNGQYIKIGEDPWI
         R+L+ KYF +  F+ A LG NPS +WRSI+  + L K G++WRVGNG  I++ ED W+
Subjt:  SRILRGKYFKKSSFLEAPLGPNPSLTWRSILWGRDLFKLGMRWRVGNGQYIKIGEDPWI

XP_030939658.1 uncharacterized protein LOC115964500 [Quercus lobata]2.5e-15135.22Show/hide
Query:  MKTLCWNIWEVGNPRTVRAL-------------------------RLKVKLGYDHCFDVPSNGLSGSLMLLWKKELTVKVNSFSKGHIDANLLVKNEW--
        M  L WN    GN RTV+AL                         ++K +    H   VPS+G  G L LLWK+ +TVK+N++++ HIDA   ++  W  
Subjt:  MKTLCWNIWEVGNPRTVRAL-------------------------RLKVKLGYDHCFDVPSNGLSGSLMLLWKKELTVKVNSFSKGHIDANLLVKNEW--

Query:  --WSFTGFYGSPEREKRSESWELVKRFHALENISWLMGGDFNEVLLDNEKKGGNPKKQSDMDAFKDVLNLCSLVDLGYKGEKYTWKRRDRKGDIIKERLD
          W FTGFYG+P+  +R +SW  +K      ++ WL  GDFNE+    EK+GG  + +  M+ F D +N C   ++ + G KYTW      G  I+ERLD
Subjt:  --WSFTGFYGSPEREKRSESWELVKRFHALENISWLMGGDFNEVLLDNEKKGGNPKKQSDMDAFKDVLNLCSLVDLGYKGEKYTWKRRDRKGDIIKERLD

Query:  RFVANTSLIDKAYKLEISHLNYHQSDHRPILDIENYSPHPNNRKNPKKLKFEERWVQFEECKNIVKGNWMSGRKENA-GDINCKVESILHKLADWNKVRL
        R +AN   +D     ++ HL+   SDH P+    +  P    +K  K  +FE  W++   C+ IVK  W  G    A G +   +E   H L  WNK   
Subjt:  RFVANTSLIDKAYKLEISHLNYHQSDHRPILDIENYSPHPNNRKNPKKLKFEERWVQFEECKNIVKGNWMSGRKENA-GDINCKVESILHKLADWNKVRL

Query:  GGSIQGAVERKYKELQE------------------------LNKGLGQSNEV---------------------AIAKAEQELSCLLEEEKHNGSWVEEEE
             G V RK  ELQ+                        LNK L + +E+                     A A A  + + +       G W E+E 
Subjt:  GGSIQGAVERKYKELQE------------------------LNKGLGQSNEV---------------------AIAKAEQELSCLLEEEKHNGSWVEEEE

Query:  EIGAVANKYFETLFASSNPNPEDIQKAIQHIEPRVSDKQREELRRPFSKIEIEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLNGEDSL
        +I  VA  YFE LF SS   PE+  + +  ++P+V+     EL R ++  E+   LK +   KAPGPDG   LF+Q +W+  GE +T   L+ LN   S 
Subjt:  EIGAVANKYFETLFASSNPNPEDIQKAIQHIEPRVSDKQREELRRPFSKIEIEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLNGEDSL

Query:  GPLNTTWIAIIPKSASPKHMEGFKPISLCNTVYKIISKALANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFLAAKLDMSKVY
           N T I +IPK   PKH+  ++PISLCN  YKI SKA+ANR+K  + SIIS +QS F+ GRLI+DNV+  F+++H I+ KK  K G +A KLDMSK Y
Subjt:  GPLNTTWIAIIPKSASPKHMEGFKPISLCNTVYKIISKALANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFLAAKLDMSKVY

Query:  DRVEWEFIRQTMLKLDSGAYMRDYEVLKEILVEY----------EVVSRQNVN-------------LEKSSCLVSKSVDRGQ---------AALLSNI--
        DRVEW F+ + M KL     +R   +     + Y           ++  + +               E  S L+  SV  G             LS++  
Subjt:  DRVEWEFIRQTMLKLDSGAYMRDYEVLKEILVEY----------EVVSRQNVN-------------LEKSSCLVSKSVDRGQ---------AALLSNI--

Query:  -----------------------------------LGIKLTNSLGDYLGMPSQTGRNKNKVFWKVKDKVWKVLQRWKEKFFSAGGKEVLIKAVAQAIPNY
                                            G ++      YLG+PS  G+ K   F  +K+K+ K L  WKEK  S  GKE+LIKAVA A+P Y
Subjt:  -----------------------------------LGIKLTNSLGDYLGMPSQTGRNKNKVFWKVKDKVWKVLQRWKEKFFSAGGKEVLIKAVAQAIPNY

Query:  TMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKENGGLGFRELSLFNQAMLAKKSWRIVKNPNILVSRILRGKYFKKSSFLEAPLGPN
        TMSCFKLP ++C+E+  +  KF WG V N+N+  W+SW ++C SK NGG+GF+ L LFN A+LAK+ WR+    + LV R+L+ KYF +  F+ A LG N
Subjt:  TMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKENGGLGFRELSLFNQAMLAKKSWRIVKNPNILVSRILRGKYFKKSSFLEAPLGPN

Query:  PSLTWRSILWGRDLFKLGMRWRVGNGQYIKIGEDPWI
        PS +WRSI+  + L K G++WRVGNG  I++ ED W+
Subjt:  PSLTWRSILWGRDLFKLGMRWRVGNGQYIKIGEDPWI

TrEMBL top hitse value%identityAlignment
A0A2N9G258 Uncharacterized protein7.2e-15734.56Show/hide
Query:  PNAMKTLCWNIWEVGNPRTVRAL-------------------------RLKVKLGYDHCFDVPSNGLSGSLMLLWKKELTVKVNSFSKGHIDANL-LVKN
        P AM  L WN   +GNPRTV+ L                         RL+ +L +++ F   S    G L LLWKK + ++V+SF   HIDA +    +
Subjt:  PNAMKTLCWNIWEVGNPRTVRAL-------------------------RLKVKLGYDHCFDVPSNGLSGSLMLLWKKELTVKVNSFSKGHIDANL-LVKN

Query:  EWWSFTGFYGSPEREKRSESWELVKRFHALENISWLMGGDFNEVLLDNEKKGGNPKKQSDMDAFKDVLNLCSLVDLGYKGEKYTWKRRDRKGDIIKERLD
          W FTGFYG+PE  KR ESW L++R ++   + W   GDFNE++   EK+G + + +  M  F+DVL+ C  VDLG+ G K+TW   +R GD+  ERLD
Subjt:  EWWSFTGFYGSPEREKRSESWELVKRFHALENISWLMGGDFNEVLLDNEKKGGNPKKQSDMDAFKDVLNLCSLVDLGYKGEKYTWKRRDRKGDIIKERLD

Query:  RFVANTSLIDKAYKLEISHLNYHQSDHRPILDIENYSPHPNNRKNPKKLKFEERWVQFEECKNIVKGNWMSGRKENAG----DINCKVESILHKLADWNK
        R VA    + +     + HL+   SDH+P+      S  P  R + K  +FEE W   + C+ ++   W   +K  +G     +  K+ +   +L  W+K
Subjt:  RFVANTSLIDKAYKLEISHLNYHQSDHRPILDIENYSPHPNNRKNPKKLKFEERWVQFEECKNIVKGNWMSGRKENAG----DINCKVESILHKLADWNK

Query:  VRLGG--SIQGAVERKYKELQELNKGLGQSNEVAIAKAEQELSCLLEEEKH----------------------------------------NGSWVEEEE
           G   +    VE + ++ + ++   G+ + V + K  +EL  LL +E+                                         NG W     
Subjt:  VRLGG--SIQGAVERKYKELQELNKGLGQSNEVAIAKAEQELSCLLEEEKH----------------------------------------NGSWVEEEE

Query:  EIGAVANKYFETLFASSNPNPEDIQKAIQHIEPRVSDKQREELRRPFSKIEIEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLNGEDSL
        ++ A+   Y+ +LF + NP+   I++ ++HI P V++   E+L R F   E+   LK +   KAPGPDG   LFY  YW ++G+++TK  L  LN    L
Subjt:  EIGAVANKYFETLFASSNPNPEDIQKAIQHIEPRVSDKQREELRRPFSKIEIEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLNGEDSL

Query:  GPLNTTWIAIIPKSASPKHMEGFKPISLCNTVYKIISKALANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFLAAKLDMSKVY
           N T+I +IPK  +P+ +  F+PISLCN +YK+ISK LANR+K ++ +I+S+SQS F+PGRLI+DN++  F+++H +  ++  + G +A KLDMSK Y
Subjt:  GPLNTTWIAIIPKSASPKHMEGFKPISLCNTVYKIISKALANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFLAAKLDMSKVY

Query:  DRVEWEFIRQTMLKL------------------------------------------------------DSGAYMR----DYEVLKEILVEYEVVSRQNV
        DRVEW+++++ M ++                                                      DS  + +    D   ++ IL +YE  S Q +
Subjt:  DRVEWEFIRQTMLKL------------------------------------------------------DSGAYMR----DYEVLKEILVEYEVVSRQNV

Query:  NLEKSSCLVSKSVDRGQAALLSNILGIKLTNSLGDYLGMPSQTGRNKNKVFWKVKDKVWKVLQRWKEKFFSAGGKEVLIKAVAQAIPNYTMSCFKLPISI
        N +K++   SKS      + + N+LG+        YLG+PS  GR K   F ++K++VW  L+ WKEK  S  GKE+LIK+VAQAIP Y MSCF+LP  +
Subjt:  NLEKSSCLVSKSVDRGQAALLSNILGIKLTNSLGDYLGMPSQTGRNKNKVFWKVKDKVWKVLQRWKEKFFSAGGKEVLIKAVAQAIPNYTMSCFKLPISI

Query:  CEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKENGGLGFRELSLFNQAMLAKKSWRIVKNPNILVSRILRGKYFKKSSFLEAPLGPNPSLTWRSILWG
         +E+  L  +F WG  G+K K HW+ W  LC SK NGG+GFREL  FN+A+LAK+ WR++ N + L  ++ + KYF + S LEA L    S  W+SI+  
Subjt:  CEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKENGGLGFRELSLFNQAMLAKKSWRIVKNPNILVSRILRGKYFKKSSFLEAPLGPNPSLTWRSILWG

Query:  RDLFKLGMRWRVGNGQYIKIGEDPWI
        RDL K G  WRVG+   I+I  D W+
Subjt:  RDLFKLGMRWRVGNGQYIKIGEDPWI

A0A2N9GLL3 Reverse transcriptase domain-containing protein1.8e-15534.13Show/hide
Query:  PNAMKTLCWNIWEVGNPRTVRALR-------------------------LKVKLGYDHCFDVPSNGLSGSLMLLWKKELTVKVNSFSKGHIDANLLVKNE
        P  M+ + WN   +GN RT+R L                          +K+ LG+ +   VPS G SG + L+WK+E+ + + ++S  HIDA +     
Subjt:  PNAMKTLCWNIWEVGNPRTVRALR-------------------------LKVKLGYDHCFDVPSNGLSGSLMLLWKKELTVKVNSFSKGHIDANLLVKNE

Query:  W-WSFTGFYGSPEREKRSESWELVKRFHALENISWLMGGDFNEVLLDNEKKGGNPKKQSDMDAFKDVLNLCSLVDLGYKGEKYTWKRRDRKGDIIKERLD
          W  TGFYG PE   + ESW L++   +L N+ WL  GDFNE+L  +EK+G + +    M  F++ LN C L+D+GY+G  +TW       + ++E LD
Subjt:  W-WSFTGFYGSPEREKRSESWELVKRFHALENISWLMGGDFNEVLLDNEKKGGNPKKQSDMDAFKDVLNLCSLVDLGYKGEKYTWKRRDRKGDIIKERLD

Query:  RFVANTSLIDKAYKLEISHLNYHQSDHRPILDIENYSPHPNNRKNPKKLKFEERWVQFEECKNIVKGNWMSGRKE--NAGDINCKVESI---LHKLADWN
        R VA+ S +       + H+   +SDH  +L +E  +    N +  +  KFEE+W   EEC+ +++  W S  ++      + C VE I     KLA+W 
Subjt:  RFVANTSLIDKAYKLEISHLNYHQSDHRPILDIENYSPHPNNRKNPKKLKFEERWVQFEECKNIVKGNWMSGRKE--NAGDINCKVESI---LHKLADWN

Query:  KVRLGGSIQGAVERKYKELQELNKGLGQSNEV-----AIAKAEQEL-SCLLEEEKH---------------------------------------NGSWV
        K+       GA   K +E  +  + L +SNE      AI   + E+ +CLL EE +                                       +G W 
Subjt:  KVRLGGSIQGAVERKYKELQELNKGLGQSNEV-----AIAKAEQEL-SCLLEEEKH---------------------------------------NGSWV

Query:  EEEEEIGAVANKYFETLFASSNPNPEDIQKAIQHIEPRVSDKQREELRRPFSKIEIEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLNG
        E + E+  +A KYF  LF  S  +P  I   I  IEP VS      L +PF+ IE++  L  +  +KAPGPDG    FYQ YW IVG ++T   L+ LN 
Subjt:  EEEEEIGAVANKYFETLFASSNPNPEDIQKAIQHIEPRVSDKQREELRRPFSKIEIEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLNG

Query:  EDSLGPLNTTWIAIIPKSASPKHMEGFKPISLCNTVYKIISKALANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFLAAKLDM
           L  +N T + +IPK  SP+++  F+PISLCN +YK+ISK LANR++ ++  +IS +QS F+P RLI+DN++  ++ +H++  K++GK G++A KLDM
Subjt:  EDSLGPLNTTWIAIIPKSASPKHMEGFKPISLCNTVYKIISKALANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFLAAKLDM

Query:  SKVYDRVEWEFIRQTMLKLD--------------------------------------------------------------------------------
        SK YDRVEWEF+ QTMLKL                                                                                 
Subjt:  SKVYDRVEWEFIRQTMLKLD--------------------------------------------------------------------------------

Query:  -----------SGAYMRDYEVLKEILVEYEVVSRQNVNLEKSSCLVSKSVDRGQAALLSNILGIKLTNSLGD-YLGMPSQTGRNKNKVFWKVKDKVWKVL
                   + A  ++   +  IL  YE  S Q VN  K+S   S +        + ++LG+  ++   D YLG+P   G++K + F  +K++V K +
Subjt:  -----------SGAYMRDYEVLKEILVEYEVVSRQNVNLEKSSCLVSKSVDRGQAALLSNILGIKLTNSLGD-YLGMPSQTGRNKNKVFWKVKDKVWKVL

Query:  QRWKEKFFSAGGKEVLIKAVAQAIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKENGGLGFRELSLFNQAMLAKKSWRIVKN
          WKEK  S  GKEVLIKAVAQ+IP YTMSCFKLP + C ++    AKF WG    ++K HW+SW ++C  K +GGLGFR+L  FN A+LAK+ WR++ N
Subjt:  QRWKEKFFSAGGKEVLIKAVAQAIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKENGGLGFRELSLFNQAMLAKKSWRIVKN

Query:  PNILVSRILRGKYFKKSSFLEAPLGPNPSLTWRSILWGRDLFKLGMRWRVGNGQYIKIGEDPWI
        P+ L  R+ + KYF +SSFL+A LG +PS  W S +  + L K G+ W+VGNG  I I  D W+
Subjt:  PNILVSRILRGKYFKKSSFLEAPLGPNPSLTWRSILWGRDLFKLGMRWRVGNGQYIKIGEDPWI

A0A2N9HLP3 Uncharacterized protein2.5e-15735.89Show/hide
Query:  PTLPNAMKTLCWNIWEVGNPRTVRAL-------------------------RLKVKLGYDHCFDVPSNGLSGSLMLLWKKELTVKVNSFSKGHIDANLLV
        P  P AM  L WN   +GNPRTV+ L                         RL+ KL +D+ F   S    G L LLWK  + +++NSFS  HIDA +  
Subjt:  PTLPNAMKTLCWNIWEVGNPRTVRAL-------------------------RLKVKLGYDHCFDVPSNGLSGSLMLLWKKELTVKVNSFSKGHIDANLLV

Query:  KN-EWWSFTGFYGSPEREKRSESWELVKRFHALENISWLMGGDFNEVLLDNEKKGGNPKKQSDMDAFKDVLNLCSLVDLGYKGEKYTWKRRDRKGDIIKE
             W FTGFYG+PE  KR ESW L++R ++   + W   GDFNE++   EK+G + + +  M  F+DVL+ C  VDLG+ G K+TW   +R GD+  E
Subjt:  KN-EWWSFTGFYGSPEREKRSESWELVKRFHALENISWLMGGDFNEVLLDNEKKGGNPKKQSDMDAFKDVLNLCSLVDLGYKGEKYTWKRRDRKGDIIKE

Query:  RLDRFVANTSLIDKAYKLEISHLNYHQSDHRPILDIENYSPHPNNRKNPKKLKFEERWVQFEECKNIVKGNW-----------------------MSGRK
        RLDR VA    + +     + HL+   SDH+P+      S +P   +  K  +FEE W   + C+  V   W                        S  K
Subjt:  RLDRFVANTSLIDKAYKLEISHLNYHQSDHRPILDIENYSPHPNNRKNPKKLKFEERWVQFEECKNIVKGNW-----------------------MSGRK

Query:  ENAGDINCKVESILHKLADWNKVRLGG--------------SIQGAVERKYKELQ--ELNKGLGQSNEVAIAKA--EQELSCLLEEEKHNGSWVEEEEEI
         N G+I  K++ + H+L     V + G              S+    ER +++    E  K   ++      +A   Q  + + + +   G W     ++
Subjt:  ENAGDINCKVESILHKLADWNKVRLGG--------------SIQGAVERKYKELQ--ELNKGLGQSNEVAIAKA--EQELSCLLEEEKHNGSWVEEEEEI

Query:  GAVANKYFETLFASSNPNPEDIQKAIQHIEPRVSDKQREELRRPFSKIEIEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLNGEDSLGP
         A+   ++ +LF +    P+ I +  +HI P V+++   +L R F+  E+   LK +   KAPGPDG   LF+Q YW  +G+++T+V L  LN    L  
Subjt:  GAVANKYFETLFASSNPNPEDIQKAIQHIEPRVSDKQREELRRPFSKIEIEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLNGEDSLGP

Query:  LNTTWIAIIPKSASPKHMEGFKPISLCNTVYKIISKALANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFLAAKLDMSKVYDR
        +N T+I +IPK  +P+ +  F+PI LCN +YKIISK LANR+K ++  IIS+SQS F+PGRLI+DN++  F+++H +  +K G+ G +A KLDM   +  
Subjt:  LNTTWIAIIPKSASPKHMEGFKPISLCNTVYKIISKALANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFLAAKLDMSKVYDR

Query:  VEWEFIRQTM--LKLDSG------------------AYMRDYEVLKEILVEYEVVSRQNVNLEKSSCLVSKSVDRGQAALLSNILGIKLTNSLGDYLGMP
        ++ E I   +  + +  G                  A + D E ++ IL +YE  S Q +N +K++   SKS  +   A + N+LG+        YLG+P
Subjt:  VEWEFIRQTM--LKLDSG------------------AYMRDYEVLKEILVEYEVVSRQNVNLEKSSCLVSKSVDRGQAALLSNILGIKLTNSLGDYLGMP

Query:  SQTGRNKNKVFWKVKDKVWKVLQRWKEKFFSAGGKEVLIKAVAQAIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKENGGLG
        S  GR K   F ++K++VW  L+ WKEK  S  G+E+LIK+VAQAIP Y MSCF+LP  + +E+  L  +F WG  G+K K HW+SW  LC SK +GG+G
Subjt:  SQTGRNKNKVFWKVKDKVWKVLQRWKEKFFSAGGKEVLIKAVAQAIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKENGGLG

Query:  FRELSLFNQAMLAKKSWRIVKNPNILVSRILRGKYFKKSSFLEAPLGPNPSLTWRSILWGRDLFKLGMRWRVGNGQYIKIGEDPWI
        FREL  FN+A+LAK+ WR++ NP+ L  ++ + KYF + S LEA      S  W+SI+  RDL K G  WRVGNG  I+I  D W+
Subjt:  FRELSLFNQAMLAKKSWRIVKNPNILVSRILRGKYFKKSSFLEAPLGPNPSLTWRSILWGRDLFKLGMRWRVGNGQYIKIGEDPWI

A0A2N9I946 Uncharacterized protein3.9e-15534.62Show/hide
Query:  PNAMKTLCWNIWEVGNPRTVRALR-------------------------LKVKLGYDHCFDVPSNGLSGSLMLLWKKELTVKVNSFSKGHIDANLLVKNE
        P  MKT   N   +GNP TVR L                          L+V+LG   CF V  NG  G L L+WK  + V + SFS  HIDA++++ + 
Subjt:  PNAMKTLCWNIWEVGNPRTVRALR-------------------------LKVKLGYDHCFDVPSNGLSGSLMLLWKKELTVKVNSFSKGHIDANLLVKNE

Query:  W-WSFTGFYGSPEREKRSESWELVKRFHALENISWLMGGDFNEVLLDNEKKGGNPKKQSDMDAFKDVLNLCSLVDLGYKGEKYTWKRRDRKGDIIKERLD
          W  TGFYG PER  RS SW L+++ +++ N+ WL+ GDFNEVL   E+ G   +  S M AF+  L+ CSL DLGY G  ++W  R   G +++ RLD
Subjt:  W-WSFTGFYGSPEREKRSESWELVKRFHALENISWLMGGDFNEVLLDNEKKGGNPKKQSDMDAFKDVLNLCSLVDLGYKGEKYTWKRRDRKGDIIKERLD

Query:  RFVANTSLIDKAYKLEISHLNYHQSDHRPILDIENYSPHPNNRKNPKKLKFEERWVQFEECKNIVKGNW---MSGRKENAGDINCKVESILHKLADWNKV
        R VAN   +      ++ H+ +  SDH  ++ I N  P P +    K  +FE  WV+   C++ +K  W   +SG       +  K+++   +L  WN+ 
Subjt:  RFVANTSLIDKAYKLEISHLNYHQSDHRPILDIENYSPHPNNRKNPKKLKFEERWVQFEECKNIVKGNW---MSGRKENAGDINCKVESILHKLADWNKV

Query:  RLGGSIQGAVERKYKELQELNKGLGQSNEVAIAKAEQELSCLLEEEK----------------------------------------HNGSWVEEEEEIG
        ++  + +   ++K +  Q  +  L + +   +    +E++ L+E+E+                                          G W  E   I 
Subjt:  RLGGSIQGAVERKYKELQELNKGLGQSNEVAIAKAEQELSCLLEEEK----------------------------------------HNGSWVEEEEEIG

Query:  AVANKYFETLFASSNPNPEDIQKAIQHIEPRVSDKQREELRRPFSKIEIEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLNGEDSLGPL
         +A +YF  LF SS  NP+ I + +  ++  VS    + L R FS  EI++ L  +  +KAPGPDG  ALF+Q YW IVGED++   L+  +    LG +
Subjt:  AVANKYFETLFASSNPNPEDIQKAIQHIEPRVSDKQREELRRPFSKIEIEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLNGEDSLGPL

Query:  NTTWIAIIPKSASPKHMEGFKPISLCNTVYKIISKALANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFLAAKLDMSKVYDRV
        N T I +IPK  +P+ M  F+PISLCN +YKI SK L NRMK ++ +IIS SQS F+PGRLISDN+I  F+++H + + +AG    +A KLDMSK YDRV
Subjt:  NTTWIAIIPKSASPKHMEGFKPISLCNTVYKIISKALANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFLAAKLDMSKVYDRV

Query:  EWEFIRQTMLKL---------------------------------------------------------------------------------------D
        EW F++  +LKL                                                                                       D
Subjt:  EWEFIRQTMLKL---------------------------------------------------------------------------------------D

Query:  SGAYMR----DYEVLKEILVEYEVVSRQNVNLEKSSCLVSKSVDRGQAALLSNILGIKLTNSLGDYLGMPSQTGRNKNKVFWKVKDKVWKVLQRWKEKFF
        S  + R    D   L  IL  YE  S Q +N EK++   SK+      A + ++ G   ++    YLG+P   GR+K + F ++KD++WK LQ WKEK  
Subjt:  SGAYMR----DYEVLKEILVEYEVVSRQNVNLEKSSCLVSKSVDRGQAALLSNILGIKLTNSLGDYLGMPSQTGRNKNKVFWKVKDKVWKVLQRWKEKFF

Query:  SAGGKEVLIKAVAQAIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKENGGLGFRELSLFNQAMLAKKSWRIVKNPNILVSRI
        S  G+E+LIKAV QAIP Y MSCFKLP  +C+E+  L  +F WG    + + HW +  +L   K+ GG+GFR+L LFN+A+LA++ WR+++ P+ L+ RI
Subjt:  SAGGKEVLIKAVAQAIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKENGGLGFRELSLFNQAMLAKKSWRIVKNPNILVSRI

Query:  LRGKYFKKSSFLEAPLGPNPSLTWRSILWGRDLFKLGMRWRVGNGQYIKIGEDPWI
        L+ KYF  +SFLEA +  N S  WRSI   R + + G+RWRVGNG  IKI +D W+
Subjt:  LRGKYFKKSSFLEAPLGPNPSLTWRSILWGRDLFKLGMRWRVGNGQYIKIGEDPWI

A0A2N9IPS8 Reverse transcriptase domain-containing protein2.9e-16634.83Show/hide
Query:  PNAMKTLCWNIWEVGNPRTVRAL-------------------------RLKVKLGYDHCFDVPSNGLSGSLMLLWKKELTVKVNSFSKGHIDANLLVKNE
        P  M+ L WN   +GN  TVR L                         RL+V + +D  F VP  G  G L +LW  +L VK+ ++S+ HIDA ++ K +
Subjt:  PNAMKTLCWNIWEVGNPRTVRAL-------------------------RLKVKLGYDHCFDVPSNGLSGSLMLLWKKELTVKVNSFSKGHIDANLLVKNE

Query:  WWSF--TGFYGSPEREKRSESWELVKRFHALENISWLMGGDFNEVLLDNEKKGGNPKKQSDMDAFKDVLNLCSLVDLGYKGEKYTWKRRDRKGDIIKERL
           F  TGFYG+PE  KR ESW L+K    L +  WL  GDFNE+L +NE+ G   + +  +  F++ +  C L DLGY G  YTW+R+     ++  RL
Subjt:  WWSF--TGFYGSPEREKRSESWELVKRFHALENISWLMGGDFNEVLLDNEKKGGNPKKQSDMDAFKDVLNLCSLVDLGYKGEKYTWKRRDRKGDIIKERL

Query:  DRFVANTSLIDKAYKLEISHLNYHQSDHRPI-LDIENYSPHPNNRKNPKKLKFEERWVQFEECKNIVKGNWMSGRKENAGDINC--KVESILHKLADWNK
        DR +A+ S +       +SHL    SDH PI LDI +    P  ++  K  +FE  W++ E+C+ ++   W  G  E +       K++     L  W++
Subjt:  DRFVANTSLIDKAYKLEISHLNYHQSDHRPI-LDIENYSPHPNNRKNPKKLKFEERWVQFEECKNIVKGNWMSGRKENAGDINC--KVESILHKLADWNK

Query:  VRLGGSIQGAVERKYKELQELNKGLGQSNEVAIAKAEQELSCLLEEEK----------------------------------------HNGSWVEEEEEI
         R  GS+  +++RK ++LQ L           I + + +L+ LLE+E+                                         +G W  E+ +I
Subjt:  VRLGGSIQGAVERKYKELQELNKGLGQSNEVAIAKAEQELSCLLEEEK----------------------------------------HNGSWVEEEEEI

Query:  GAVANKYFETLFASSNPNPEDIQKAIQHIEPRVSDKQREELRRPFSKIEIEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLNGEDSLGP
          +A  YF+ +F SSNP+ E I   +Q +E  V++   ++L+  F+K E+   LK +  TKAPGPDG  A+FYQ+YWDIVG ++T+  L+ L+    L  
Subjt:  GAVANKYFETLFASSNPNPEDIQKAIQHIEPRVSDKQREELRRPFSKIEIEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLNGEDSLGP

Query:  LNTTWIAIIPKSASPKHMEGFKPISLCNTVYKIISKALANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFLAAKLDMSKVYDR
        +N T IA+IPK  +P+++  F+PISLCN +YKI+SK LANR+K ++  +IS++QS F+PGRLI+DNV+  F+ +HS++ K+ GK+G +A KLDMSK YDR
Subjt:  LNTTWIAIIPKSASPKHMEGFKPISLCNTVYKIISKALANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFLAAKLDMSKVYDR

Query:  VEW--------------EFIRQTMLKLDSGAY--------------------------------------------------------------------
        VEW              E+IR  M+ L S +Y                                                                    
Subjt:  VEW--------------EFIRQTMLKLDSGAY--------------------------------------------------------------------

Query:  ---------MRDYEVLKEILVEYEVVSRQNVNLEKSSCLVSKSVDRGQAALLSNILGIKLTNSLGDYLGMPSQTGRNKNKVFWKVKDKVWKVLQRWKEKF
                 + + E +  IL +YE  S Q +N  K+S   +KS   G    + +   +    S   YLG+PS  GR+K+  F ++K +VW+ +  WKEKF
Subjt:  ---------MRDYEVLKEILVEYEVVSRQNVNLEKSSCLVSKSVDRGQAALLSNILGIKLTNSLGDYLGMPSQTGRNKNKVFWKVKDKVWKVLQRWKEKF

Query:  FSAGGKEVLIKAVAQAIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKENGGLGFRELSLFNQAMLAKKSWRIVKNPNILVSR
         S  G+EVLIKAVAQ+IP Y+MSCFKLP S+C ++N +++ F WG      KAHW+ W +LC SK +GGLGFR+L  FN A+LAK+ WR +++ N LV R
Subjt:  FSAGGKEVLIKAVAQAIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKENGGLGFRELSLFNQAMLAKKSWRIVKNPNILVSR

Query:  ILRGKYFKKSSFLEAPLGPNPSLTWRSILWGRDLFKLGMRWRVGNGQYIKIGEDPWIIGLNGYKPVWTEDNI-KKKYVSLVI
        + + KYF +  F+ A LG  PS  WRSI   R + +LG++W +G+G  +KI EDPW+   + +K V  +  +  K+ VS++I
Subjt:  ILRGKYFKKSSFLEAPLGPNPSLTWRSILWGRDLFKLGMRWRVGNGQYIKIGEDPWIIGLNGYKPVWTEDNI-KKKYVSLVI

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.2e-2023.9Show/hide
Query:  LMGGDFNEVL--LDNEKKGGNPKKQSDMDAFKDVLNLCSLVD----LGYKGEKYTWKRRDRKGDIIKERLDRFVANTSLIDKAYKLEISHLNYHQSDHRP
        L+ GDFN  L  LD   +    K   D       L+   L+D    L  K  +YT+            ++D  V + +L+ K  + EI  +  + SDH  
Subjt:  LMGGDFNEVL--LDNEKKGGNPKKQSDMDAFKDVLNLCSLVD----LGYKGEKYTWKRRDRKGDIIKERLDRFVANTSLIDKAYKLEISHLNYHQSDHRP

Query:  I---LDIENYSPHPNNRKNPKKLKFEERWVQ------------------------FEECKNIVKGNWMSGRKENAGDINCKVESILHKLADWNK---VRL
        I   L I+N +   +       L   + WV                         ++  K + +G +++           K++++  +L +  K      
Subjt:  I---LDIENYSPHPNNRKNPKKLKFEERWVQ------------------------FEECKNIVKGNWMSGRKENAGDINCKVESILHKLADWNK---VRL

Query:  GGSIQGAVERKYKELQEL--NKGLGQSNE------VAIAKAEQELSCLLEEEKH----------NGSWVEEEEEIGAVANKYFETLFASSNPNPEDIQKA
          S +  + +   EL+E+   K L + NE        I K ++ L+ L+++++            G    +  EI     +Y++ L+A+   N E++   
Subjt:  GGSIQGAVERKYKELQEL--NKGLGQSNE------VAIAKAEQELSCLLEEEKH----------NGSWVEEEEEIGAVANKYFETLFASSNPNPEDIQKA

Query:  IQ-HIEPRVSDKQREELRRPFSKIEIEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLNGEDSLGPLNTTW----IAIIPK-SASPKHME
        +  +  PR++ ++ E L RP +  EI  ++  L   K+PGPDG  A FYQ Y     E++    L      +  G L  ++    I +IPK        E
Subjt:  IQ-HIEPRVSDKQREELRRPFSKIEIEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLNGEDSLGPLNTTW----IAIIPK-SASPKHME

Query:  GFKPISLCNTVYKIISKALANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFLAAKLDMSKVYDRVEWEFIRQTMLKLD-SGAY
         F+PISL N   KI++K LANR++  +  +I   Q  FIPG     N+      I  IN  +A  +  +   +D  K +D+++  F+ +T+ KL   G Y
Subjt:  GFKPISLCNTVYKIISKALANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFLAAKLDMSKVYDRVEWEFIRQTMLKLD-SGAY

Query:  MR
        ++
Subjt:  MR

P0C2F6 Putative ribonuclease H protein At1g657507.9e-2835.75Show/hide
Query:  MPSQTGRNKNKVFWKVKDKVWKVLQRWKEKFFSAGGKEVLIKAVAQAIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKENGG
        MP    R     F ++ ++V   +  W+EK  S  G+  L KAV  ++P ++MS   LP SI   +++L   FLWGS   K K H + W ++C  K+ GG
Subjt:  MPSQTGRNKNKVFWKVKDKVWKVLQRWKEKFFSAGGKEVLIKAVAQAIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKENGG

Query:  LGFRELSLFNQAMLAKKSWRIVKNPNILVSRILRGKY----FKKSSFLEAPLGPNPSLTWRSILWG-RDLFKLGMRWRVGNGQYIKIGEDPWIIGLNGYK
        LG R     N+A+++K  WR+++  N L + +L+ KY     + S +L  P G + S TWRSI  G RD+   G+ W  G+GQ I+   D W+ G    K
Subjt:  LGFRELSLFNQAMLAKKSWRIVKNPNILVSRILRGKY----FKKSSFLEAPLGPNPSLTWRSILWG-RDLFKLGMRWRVGNGQYIKIGEDPWIIGLNGYK

Query:  PVWTEDN
        P+   DN
Subjt:  PVWTEDN

P11369 LINE-1 retrotransposable element ORF2 protein4.7e-2025.69Show/hide
Query:  RKENAGDINCKVESILHKLADWNKVRLGGSIQGAVERK---YKELQELNKGLGQSNEVAIAKAEQELSCLLEEEKHNGSWVEEEEEIGAVANKYFETLFA
        +KE       + + I+    + N+V    +IQ   + +   ++++ +++K L +     + K  ++   + +     G    + EEI      +++ L++
Subjt:  RKENAGDINCKVESILHKLADWNKVRLGGSIQGAVERK---YKELQELNKGLGQSNEVAIAKAEQELSCLLEEEKHNGSWVEEEEEIGAVANKYFETLFA

Query:  SSNPNPEDIQKAIQHIE-PRVSDKQREELRRPFSKIEIEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLNGEDSLGPLNTTW----IAI
        +   N +++ K +   + P+++  Q + L  P S  EIE V+  L   K+PGPDG  A FYQ++     ED+  +     +  +  G L  ++    I +
Subjt:  SSNPNPEDIQKAIQHIE-PRVSDKQREELRRPFSKIEIEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLNGEDSLGPLNTTW----IAI

Query:  IPK-SASPKHMEGFKPISLCNTVYKIISKALANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFLAAKLDMSKVYDRVEWEFIR
        IPK    P  +E F+PISL N   KI++K LANR++  + +II   Q  FIPG     N+      IH IN  K   +  +   LD  K +D+++  F+ 
Subjt:  IPK-SASPKHMEGFKPISLCNTVYKIISKALANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFLAAKLDMSKVYDRVEWEFIR

Query:  QTMLKLDSGAYMRDYEVLKEILVEYEVVSRQNVNLEKSSCLVSKSVDRGQAALLSNILGIKL
        + + +  SG       ++K I    + V+   VN EK   +  KS  R    L   +  I L
Subjt:  QTMLKLDSGAYMRDYEVLKEILVEYEVVSRQNVNLEKSSCLVSKSVDRGQAALLSNILGIKL

P14381 Transposon TX1 uncharacterized 149 kDa protein8.8e-2724.51Show/hide
Query:  LMGGDFNEVLLDNEKKGGNPKKQSDMDAFKDVLNLCSLVDLGYKGEKYT---WKRRDRKGDIIKERLDRFVANTSLIDKAYKLEISHLNYHQSDHRPILD
        ++GGDFN   LD   +    K+ S     ++++   SLVD+  +    T      R R G + + R+DR   ++ L+ +A    I    +  SDH  +  
Subjt:  LMGGDFNEVLLDNEKKGGNPKKQSDMDAFKDVLNLCSLVDLGYKGEKYT---WKRRDRKGDIIKERLDRFVANTSLIDKAYKLEISHLNYHQSDHRPILD

Query:  IENYSP--------HPNN-----RKNPKKLKFEER-WVQFEECKNIVKGNWMSGR-----------KENAGDINCKVESILHKLADWNKVRLGGSIQGAV
          + +P        H NN         K ++   R W  F++    +   W  G+           K  +G  N ++E++  ++ D  + RL GS   A+
Subjt:  IENYSP--------HPNN-----RKNPKKLKFEER-WVQFEECKNIVKGNWMSGR-----------KENAGDINCKVESILHKLADWNKVRLGGSIQGAV

Query:  ERKYKELQELNKGLGQSN-EVAIAKAEQELSC----------LLEEEKHN-----------GSWVEEEEEIGAVANKYFETLFASSNPNPEDIQKAIQHI
        + +Y E +E  + + Q     A  ++  +L C           LE++K N           G+ +E+ E I   A  +++ LF+    +P+  ++    +
Subjt:  ERKYKELQELNKGLGQSN-EVAIAKAEQELSC----------LLEEEKHN-----------GSWVEEEEEIGAVANKYFETLFASSNPNPEDIQKAIQHI

Query:  EPRVSDKQREELRRPFSKIEIEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLNGEDSLGPLNTTWIAIIPKSASPKHMEGFKPISLCNT
         P VS++++E L  P +  E+ + L+ +   K+PG DG    F+Q +WD +G D  +V        +         ++++PK    + ++ ++P+SL +T
Subjt:  EPRVSDKQREELRRPFSKIEIEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLNGEDSLGPLNTTWIAIIPKSASPKHMEGFKPISLCNT

Query:  VYKIISKALANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKE-GFLAAKLDMSKVYDRVEWEFIRQTMLKLDSGAYMRDYEVLKEI
         YKI++KA++ R+K ++  +I   QS  +PGR I DNV      +H   +++ G    FL+  LD  K +DRV+ +++  T+     G     Y  LK +
Subjt:  VYKIISKALANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKE-GFLAAKLDMSKVYDRVEWEFIRQTMLKLDSGAYMRDYEVLKEI

Query:  LVEYEVVSRQNVNL
            E + + N +L
Subjt:  LVEYEVVSRQNVNL

P93295 Uncharacterized mitochondrial protein AtMg003108.8e-3546.53Show/hide
Query:  AIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKE-NGGLGFRELSLFNQAMLAKKSWRIVKNPNILVSRILRGKYFKKSSFLE
        A+P Y MSCF+L   +C+++     +F W S  NK K  W++W++LC SKE +GGLGFR+L  FNQA+LAK+S+RI+  P+ L+SR+LR +YF  SS +E
Subjt:  AIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKE-NGGLGFRELSLFNQAMLAKKSWRIVKNPNILVSRILRGKYFKKSSFLE

Query:  APLGPNPSLTWRSILWGRDLFKLGMRWRVGNGQYIKIGEDPWII
          +G  PS  WRSI+ GR+L   G+   +G+G + K+  D WI+
Subjt:  APLGPNPSLTWRSILWGRDLFKLGMRWRVGNGQYIKIGEDPWII

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.0e-0924.02Show/hide
Query:  MDAFKDVLNLCSLVDLGYKGEKYTWKRRDRKGDIIKERLDRFVAN----TSLIDKAYKLEISHLNYHQSDHRP-ILDIEN--------------YSPHPN
        ++ F++ L    LVD+  +G  YTW        II+ +LDR +AN    +S        E+S +    SDH P I+ +EN               S HP 
Subjt:  MDAFKDVLNLCSLVDLGYKGEKYTWKRRDRKGDIIKERLDRFVAN----TSLIDKAYKLEISHLNYHQSDHRP-ILDIEN--------------YSPHPN

Query:  ---------NRKNP---KKLKFEERWVQFEECKNIVK----GNWMSGRKE--------------NAGDINCKVESILHKLADWNKVRLGGSIQGAVERKY
                   + P         E     ++C  ++     GN     KE              N  D   +VE +  K   WN            + + 
Subjt:  ---------NRKNP---KKLKFEERWVQFEECKNIVK----GNWMSGRKE--------------NAGDINCKVESILHKLADWNKVRLGGSIQGAVERKY

Query:  KELQELNKGLGQSNEVAIAKAEQELSCLLEEEKHNGSWVEEEEEIGAVANKYFETLFASSNP--NPEDIQKAIQHIEP-RVSDKQREELRRPFSKIEIEK
        K LQ+ +      ++V +A   + L   L  +  +   VE   ++  +   Y+  L  S +    P+ +Q+ I+ I P R +D     L    S  EI  
Subjt:  KELQELNKGLGQSNEVAIAKAEQELSCLLEEEKHNGSWVEEEEEIGAVANKYFETLFASSNP--NPEDIQKAIQHIEP-RVSDKQREELRRPFSKIEIEK

Query:  VLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLNGEDSLGPLNTTWIAIIPKSASPKHMEGFKPISLCNTVYKIIS
         +  +   KAPGPD   A F+   W +V +               L   N T I +IPK      +  F+P+S C  VYKII+
Subjt:  VLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLNGEDSLGPLNTTWIAIIPKSASPKHMEGFKPISLCNTVYKIIS

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.4e-1127.69Show/hide
Query:  YLGMPSQTGRNKNKVFWKVKDKVWKVLQRWKEKFFSAGGKEVLIKAVAQAIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKE
        YLG+P  T +     +  + +K+   + +W  +  S  G+  LI +V  ++ N+ MS F+LP +  +E++ + + FLW       K   ++W  +C  K+
Subjt:  YLGMPSQTGRNKNKVFWKVKDKVWKVLQRWKEKFFSAGGKEVLIKAVAQAIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKE

Query:  NGGLGFRELSLFNQAMLAKKSWRIVKNPNI
         GGLG R L   N+       W I  N  +
Subjt:  NGGLGFRELSLFNQAMLAKKSWRIVKNPNI

AT4G20520.1 RNA binding;RNA-directed DNA polymerases9.1e-1136.49Show/hide
Query:  LANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFLAAKLDMSKVYDRVEWEFIRQTML
        +  R+K +M ++I  +Q++FIPGR+ +DN++   +++HS+  KK G +G++  KLD+ K YDR+ W+++  T++
Subjt:  LANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFLAAKLDMSKVYDRVEWEFIRQTML

AT4G29090.1 Ribonuclease H-like superfamily protein1.9e-2938.03Show/hide
Query:  AIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKENGGLGFRELSLFNQAMLAKKSWRIVKNPNILVSRILRGKYFKKSSFLEA
        A+P YTM+CF LP ++C+++  + A F W +       HW +W  L   K  GG+GF+++  FN A+L K+ WR++  P  L++++ + +YF KS  L A
Subjt:  AIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKENGGLGFRELSLFNQAMLAKKSWRIVKNPNILVSRILRGKYFKKSSFLEA

Query:  PLGPNPSLTWRSILWGRDLFKLGMRWRVGNGQYIKIGEDPWI
        PLG  PS  W+SI   +++ + G R  VGNG+ I I    W+
Subjt:  PLGPNPSLTWRSILWGRDLFKLGMRWRVGNGQYIKIGEDPWI

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein6.2e-3646.53Show/hide
Query:  AIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKE-NGGLGFRELSLFNQAMLAKKSWRIVKNPNILVSRILRGKYFKKSSFLE
        A+P Y MSCF+L   +C+++     +F W S  NK K  W++W++LC SKE +GGLGFR+L  FNQA+LAK+S+RI+  P+ L+SR+LR +YF  SS +E
Subjt:  AIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKE-NGGLGFRELSLFNQAMLAKKSWRIVKNPNILVSRILRGKYFKKSSFLE

Query:  APLGPNPSLTWRSILWGRDLFKLGMRWRVGNGQYIKIGEDPWII
          +G  PS  WRSI+ GR+L   G+   +G+G + K+  D WI+
Subjt:  APLGPNPSLTWRSILWGRDLFKLGMRWRVGNGQYIKIGEDPWII


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCCGAAATCTCTGGAGGGATATCGGCGGAGGCTGGGTTCCAACCCTGCCGAACGCCATGAAAACGTTATGTTGGAACATTTGGGAGGTGGGGAACCCTCGAACGGT
CCGTGCTCTGCGCCTTAAGGTGAAGCTTGGGTACGATCACTGCTTTGATGTCCCTAGTAATGGTTTAAGCGGGAGCCTGATGCTTTTATGGAAGAAGGAGTTGACGGTTA
AGGTCAATTCTTTCTCTAAGGGGCATATAGATGCTAACCTGCTAGTCAAAAACGAATGGTGGAGCTTCACAGGGTTTTACGGCAGTCCAGAGAGAGAAAAAAGAAGCGAA
TCGTGGGAGCTTGTTAAGCGGTTTCATGCTTTGGAAAATATCTCGTGGCTTATGGGTGGCGATTTCAATGAGGTTCTCTTAGATAATGAGAAGAAGGGGGGTAACCCGAA
GAAGCAGAGTGACATGGATGCTTTCAAGGATGTCCTCAATTTGTGCAGCCTTGTAGACTTAGGCTACAAAGGGGAAAAATACACTTGGAAAAGACGAGACAGGAAAGGAG
ATATCATAAAAGAGAGGCTTGACAGATTTGTGGCCAACACAAGTCTTATTGACAAAGCTTATAAACTTGAAATATCCCATTTAAATTATCACCAATCTGACCATAGACCT
ATCCTAGACATCGAAAACTATAGTCCTCATCCCAATAATCGGAAAAATCCAAAAAAGCTCAAATTTGAAGAAAGATGGGTCCAGTTTGAAGAGTGCAAAAACATAGTGAA
AGGGAACTGGATGTCGGGGAGGAAAGAAAATGCAGGGGATATCAATTGCAAAGTAGAGAGCATTTTACATAAGTTGGCAGATTGGAATAAAGTCAGACTTGGAGGATCCA
TTCAAGGGGCAGTGGAGAGGAAATATAAGGAATTGCAAGAGTTGAACAAAGGCTTAGGTCAATCTAACGAGGTGGCCATTGCAAAAGCTGAACAGGAGTTATCCTGCTTG
CTAGAGGAAGAAAAACACAATGGCTCGTGGGTGGAAGAGGAAGAAGAAATTGGGGCGGTGGCTAACAAGTACTTTGAAACCCTTTTTGCCTCATCCAACCCAAACCCCGA
AGATATCCAAAAAGCCATTCAGCACATAGAACCGAGAGTATCAGACAAACAGAGGGAAGAGCTAAGGCGTCCGTTTTCTAAAATTGAGATCGAGAAGGTTTTAAAAGGGT
TGAAGGCTACTAAAGCTCCAGGTCCTGATGGAGCTCATGCATTGTTTTACCAATCCTACTGGGATATTGTGGGGGAAGATATAACTAAAGTGTGCCTCAACACCCTTAAC
GGAGAGGACTCTTTAGGCCCGCTGAATACCACATGGATTGCCATAATTCCAAAGTCGGCTTCCCCTAAGCACATGGAAGGATTCAAGCCTATTAGCCTTTGCAACACTGT
TTACAAGATCATTTCAAAAGCCCTTGCCAACAGAATGAAATGGATGATGGATTCGATTATATCGCAGTCCCAATCAACATTCATCCCTGGGAGGTTGATATCTGATAACG
TGATAGCTGGATTCAAAAGTATCCATTCGATAAATAGCAAAAAAGCTGGTAAAGAAGGTTTTTTGGCTGCAAAGTTGGACATGAGTAAAGTTTATGACAGGGTGGAGTGG
GAGTTCATCCGTCAGACAATGCTTAAGCTTGATTCTGGAGCTTACATGAGGGACTATGAGGTGCTAAAGGAGATTCTGGTTGAATATGAAGTGGTCTCGAGGCAAAATGT
CAACCTAGAGAAATCTTCTTGTTTGGTGAGCAAGAGCGTAGATCGTGGTCAGGCGGCTTTGTTAAGCAATATTTTGGGGATCAAGCTTACCAATTCCTTGGGGGACTATC
TGGGGATGCCATCTCAGACGGGAAGGAACAAAAACAAAGTGTTTTGGAAAGTCAAAGACAAAGTGTGGAAAGTGCTTCAGAGGTGGAAGGAGAAGTTTTTCTCTGCAGGA
GGGAAGGAAGTTCTTATCAAAGCAGTAGCACAAGCGATCCCAAACTACACAATGAGCTGTTTTAAATTGCCAATATCAATCTGTGAGGAGATGAATAGACTTTATGCGAA
ATTCTTGTGGGGGTCGGTAGGAAATAAGAACAAGGCTCACTGGATGAGCTGGAAGAGATTATGTGTGAGTAAGGAGAATGGAGGCTTGGGCTTTAGGGAGCTTAGTCTAT
TCAACCAAGCTATGCTTGCTAAGAAAAGTTGGAGGATAGTAAAGAACCCTAACATCCTAGTCTCTAGAATCCTAAGAGGAAAGTACTTCAAAAAATCATCATTCCTAGAA
GCCCCTTTAGGTCCCAACCCATCTCTTACTTGGAGGAGTATTTTATGGGGTAGAGATTTGTTTAAGTTGGGTATGAGATGGAGAGTTGGAAACGGTCAGTACATCAAAAT
TGGGGAGGACCCGTGGATTATTGGGTTGAATGGTTACAAGCCAGTATGGACAGAGGACAACATTAAAAAGAAGTATGTGAGCCTCGTTATTGGAAGGATCTATGGAGATC
TAAGGCCTTGCCTCGAGAAAAGATCTACTCTTGGAGAGCAATCCAGGACATCCTTCCTACGCAAAGTAATATTGCTTCTAAAGGGATCGACATTAACACCTTATGTTTTC
TTTGCAAGGAATAACGGGAAACGGGGAGCCATGTCATATGGGATTGCAAGGTTTCAGGTAAGGTTTGGAACCATTTCTTCCTTACCTTATGTGTGTCTCGGTTTGGCTGC
AGATCAAACTGTGATCCAAAGAGCCATTGGATTCGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTCCGAAATCTCTGGAGGGATATCGGCGGAGGCTGGGTTCCAACCCTGCCGAACGCCATGAAAACGTTATGTTGGAACATTTGGGAGGTGGGGAACCCTCGAACGGT
CCGTGCTCTGCGCCTTAAGGTGAAGCTTGGGTACGATCACTGCTTTGATGTCCCTAGTAATGGTTTAAGCGGGAGCCTGATGCTTTTATGGAAGAAGGAGTTGACGGTTA
AGGTCAATTCTTTCTCTAAGGGGCATATAGATGCTAACCTGCTAGTCAAAAACGAATGGTGGAGCTTCACAGGGTTTTACGGCAGTCCAGAGAGAGAAAAAAGAAGCGAA
TCGTGGGAGCTTGTTAAGCGGTTTCATGCTTTGGAAAATATCTCGTGGCTTATGGGTGGCGATTTCAATGAGGTTCTCTTAGATAATGAGAAGAAGGGGGGTAACCCGAA
GAAGCAGAGTGACATGGATGCTTTCAAGGATGTCCTCAATTTGTGCAGCCTTGTAGACTTAGGCTACAAAGGGGAAAAATACACTTGGAAAAGACGAGACAGGAAAGGAG
ATATCATAAAAGAGAGGCTTGACAGATTTGTGGCCAACACAAGTCTTATTGACAAAGCTTATAAACTTGAAATATCCCATTTAAATTATCACCAATCTGACCATAGACCT
ATCCTAGACATCGAAAACTATAGTCCTCATCCCAATAATCGGAAAAATCCAAAAAAGCTCAAATTTGAAGAAAGATGGGTCCAGTTTGAAGAGTGCAAAAACATAGTGAA
AGGGAACTGGATGTCGGGGAGGAAAGAAAATGCAGGGGATATCAATTGCAAAGTAGAGAGCATTTTACATAAGTTGGCAGATTGGAATAAAGTCAGACTTGGAGGATCCA
TTCAAGGGGCAGTGGAGAGGAAATATAAGGAATTGCAAGAGTTGAACAAAGGCTTAGGTCAATCTAACGAGGTGGCCATTGCAAAAGCTGAACAGGAGTTATCCTGCTTG
CTAGAGGAAGAAAAACACAATGGCTCGTGGGTGGAAGAGGAAGAAGAAATTGGGGCGGTGGCTAACAAGTACTTTGAAACCCTTTTTGCCTCATCCAACCCAAACCCCGA
AGATATCCAAAAAGCCATTCAGCACATAGAACCGAGAGTATCAGACAAACAGAGGGAAGAGCTAAGGCGTCCGTTTTCTAAAATTGAGATCGAGAAGGTTTTAAAAGGGT
TGAAGGCTACTAAAGCTCCAGGTCCTGATGGAGCTCATGCATTGTTTTACCAATCCTACTGGGATATTGTGGGGGAAGATATAACTAAAGTGTGCCTCAACACCCTTAAC
GGAGAGGACTCTTTAGGCCCGCTGAATACCACATGGATTGCCATAATTCCAAAGTCGGCTTCCCCTAAGCACATGGAAGGATTCAAGCCTATTAGCCTTTGCAACACTGT
TTACAAGATCATTTCAAAAGCCCTTGCCAACAGAATGAAATGGATGATGGATTCGATTATATCGCAGTCCCAATCAACATTCATCCCTGGGAGGTTGATATCTGATAACG
TGATAGCTGGATTCAAAAGTATCCATTCGATAAATAGCAAAAAAGCTGGTAAAGAAGGTTTTTTGGCTGCAAAGTTGGACATGAGTAAAGTTTATGACAGGGTGGAGTGG
GAGTTCATCCGTCAGACAATGCTTAAGCTTGATTCTGGAGCTTACATGAGGGACTATGAGGTGCTAAAGGAGATTCTGGTTGAATATGAAGTGGTCTCGAGGCAAAATGT
CAACCTAGAGAAATCTTCTTGTTTGGTGAGCAAGAGCGTAGATCGTGGTCAGGCGGCTTTGTTAAGCAATATTTTGGGGATCAAGCTTACCAATTCCTTGGGGGACTATC
TGGGGATGCCATCTCAGACGGGAAGGAACAAAAACAAAGTGTTTTGGAAAGTCAAAGACAAAGTGTGGAAAGTGCTTCAGAGGTGGAAGGAGAAGTTTTTCTCTGCAGGA
GGGAAGGAAGTTCTTATCAAAGCAGTAGCACAAGCGATCCCAAACTACACAATGAGCTGTTTTAAATTGCCAATATCAATCTGTGAGGAGATGAATAGACTTTATGCGAA
ATTCTTGTGGGGGTCGGTAGGAAATAAGAACAAGGCTCACTGGATGAGCTGGAAGAGATTATGTGTGAGTAAGGAGAATGGAGGCTTGGGCTTTAGGGAGCTTAGTCTAT
TCAACCAAGCTATGCTTGCTAAGAAAAGTTGGAGGATAGTAAAGAACCCTAACATCCTAGTCTCTAGAATCCTAAGAGGAAAGTACTTCAAAAAATCATCATTCCTAGAA
GCCCCTTTAGGTCCCAACCCATCTCTTACTTGGAGGAGTATTTTATGGGGTAGAGATTTGTTTAAGTTGGGTATGAGATGGAGAGTTGGAAACGGTCAGTACATCAAAAT
TGGGGAGGACCCGTGGATTATTGGGTTGAATGGTTACAAGCCAGTATGGACAGAGGACAACATTAAAAAGAAGTATGTGAGCCTCGTTATTGGAAGGATCTATGGAGATC
TAAGGCCTTGCCTCGAGAAAAGATCTACTCTTGGAGAGCAATCCAGGACATCCTTCCTACGCAAAGTAATATTGCTTCTAAAGGGATCGACATTAACACCTTATGTTTTC
TTTGCAAGGAATAACGGGAAACGGGGAGCCATGTCATATGGGATTGCAAGGTTTCAGGTAAGGTTTGGAACCATTTCTTCCTTACCTTATGTGTGTCTCGGTTTGGCTGC
AGATCAAACTGTGATCCAAAGAGCCATTGGATTCGATTGA
Protein sequenceShow/hide protein sequence
MLRNLWRDIGGGWVPTLPNAMKTLCWNIWEVGNPRTVRALRLKVKLGYDHCFDVPSNGLSGSLMLLWKKELTVKVNSFSKGHIDANLLVKNEWWSFTGFYGSPEREKRSE
SWELVKRFHALENISWLMGGDFNEVLLDNEKKGGNPKKQSDMDAFKDVLNLCSLVDLGYKGEKYTWKRRDRKGDIIKERLDRFVANTSLIDKAYKLEISHLNYHQSDHRP
ILDIENYSPHPNNRKNPKKLKFEERWVQFEECKNIVKGNWMSGRKENAGDINCKVESILHKLADWNKVRLGGSIQGAVERKYKELQELNKGLGQSNEVAIAKAEQELSCL
LEEEKHNGSWVEEEEEIGAVANKYFETLFASSNPNPEDIQKAIQHIEPRVSDKQREELRRPFSKIEIEKVLKGLKATKAPGPDGAHALFYQSYWDIVGEDITKVCLNTLN
GEDSLGPLNTTWIAIIPKSASPKHMEGFKPISLCNTVYKIISKALANRMKWMMDSIISQSQSTFIPGRLISDNVIAGFKSIHSINSKKAGKEGFLAAKLDMSKVYDRVEW
EFIRQTMLKLDSGAYMRDYEVLKEILVEYEVVSRQNVNLEKSSCLVSKSVDRGQAALLSNILGIKLTNSLGDYLGMPSQTGRNKNKVFWKVKDKVWKVLQRWKEKFFSAG
GKEVLIKAVAQAIPNYTMSCFKLPISICEEMNRLYAKFLWGSVGNKNKAHWMSWKRLCVSKENGGLGFRELSLFNQAMLAKKSWRIVKNPNILVSRILRGKYFKKSSFLE
APLGPNPSLTWRSILWGRDLFKLGMRWRVGNGQYIKIGEDPWIIGLNGYKPVWTEDNIKKKYVSLVIGRIYGDLRPCLEKRSTLGEQSRTSFLRKVILLLKGSTLTPYVF
FARNNGKRGAMSYGIARFQVRFGTISSLPYVCLGLAADQTVIQRAIGFD