CuGenDBv2

ClCG04G003600 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG04G003600
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Reverse transcriptase domain-containing protein
Genome location: CG_Chr04:14084607..14087956
RNA-Seq Expression: ClCG04G003600
Synteny: ClCG04G003600
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR000477 - Reverse transcriptase domain
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR026960 - Reverse transcriptase zinc-binding domain
  IPR043502 - DNA/RNA polymerase superfamily
  IPR044730 - Ribonuclease H-like domain, plant type
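
The genome location above uses a `SEQID:START..END` notation. As a minimal sketch of how such a string could be split and the span length computed, the snippet below assumes the coordinates are 1-based and inclusive (a common convention that this page does not state explicitly). The function names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span in bp, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("CG_Chr04:14084607..14087956")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 14087956 - 14084607 + 1 = 3350 bp on CG_Chr04. If the coordinates were 0-based or half-open, the length would differ by one.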


Homology
GenBank top hits (e value, % identity, alignment)
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis] | e value: 6.3e-136 | % identity: 31.72
Query:  MYWKCRSREDWLKWGDRNTKVTNCKIS--------------------------------------------------------KIPDHLQWGMNNPFSKE
        +YWK RSR DWLK GD+NTK  + K S                                                        K+   +   +  PF+ E
Subjt:  MYWKCRSREDWLKWGDRNTKVTNCKIS--------------------------------------------------------KIPDHLQWGMNNPFSKE

Query:  DIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNE
        DI   +  M P KAPGPDG+ A F+QK+W IVG+   + CL ILNE   +  LN T I+LIPKV+KP+ + EFRPISLCNV Y+I+AK + NRLK  LN 
Subjt:  DIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNE

Query:  IIAPTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQR
        II+P QSAF+  R+ITDN I+G++CLH I   +  ++G VALKL +SKAYD VEW FL   + ++G    W+ ++M C+ T    VLIN  P     P+R
Subjt:  IIAPTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQR

Query:  GLRQGDPLSPYLFLICTEGLSALLHR------------------------EESLSLIHASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNVRQQLA
        GLRQG PLSPYLF++C E  S LL++                        ++SL    AS+ + K ++ +   Y + SGQ+ N +KS   FS     +  
Subjt:  GLRQGDPLSPYLFLICTEGLSALLHR------------------------EESLSLIHASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNVRQQLA

Query:  IEINNVLGVRKTDSLGIYLGM-----------LKE--------------KLFPLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDLNR------------
          I ++  ++       YLG+            KE              KLF   GKE LIKA+AQA+  Y MS FKLP  +C+D+ +            
Subjt:  IEINNVLGVRKTDSLGIYLGM-----------LKE--------------KLFPLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDLNR------------

Query:  ------ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSWRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYRGRWEKGIVL
              A   S    + RGGLG RDL  FNQA++AKQ WR+V+  NSL+ARV+K RY+K+  F NA +GSNPS  WR ILWG ++  KG R R   G  +
Subjt:  ------ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSWRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYRGRWEKGIVL

Query:  VDHR----------TPICINGNLALMRVRDIINSDGSWNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLA------------
        + ++           PI          V D+I+S+  W  + +   F+  D ++I+ +        DE++W FD    +SVKS Y LA            
Subjt:  VDHR----------TPICINGNLALMRVRDIINSDGSWNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLA------------

Query:  ------------------------KSITDSLPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIW----LSIFPSL---PGFLSNFRDCWSPA
                                +++ + LPT  N+ K+     P+C  C+  VE++SHVL  CK  + IW    L + PS      F S  ++ WS +
Subjt:  ------------------------KSITDSLPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIW----LSIFPSL---PGFLSNFRDCWSPA

Query:  GS---------CLDVWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGL--EVWVPPVLGSWKINTYASWSRLKGMGGISWVVRD
         +         C  +WS RN  +  G       L  K  + ++ + R +  + G++      G+  + W PP     K+N  A+ S      G+  +VRD
Subjt:  GS---------CLDVWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGL--EVWVPPVLGSWKINTYASWSRLKGMGGISWVVRD

Query:  SSGSIILAGCEKRYFAQ----GSLPNVSW---------------ESDCMNLINAINHKTSDLSELLSFVEEIGSLADSAHVVSFRWCIWSTNGLAHRLAR
        + G I+  G ++  F +         + W               ESDC  ++  +N+     +E+   + ++   +     V F +   + N  AH LA+
Subjt:  SSGSIILAGCEKRYFAQ----GSLPNVSW---------------ESDCMNLINAINHKTSDLSELLSFVEEIGSLADSAHVVSFRWCIWSTNGLAHRLAR

Query:  AAVWSGD---WKGFF
         A+ +     W G F
Subjt:  AAVWSGD---WKGFF

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber] | e value: 1.8e-130 | % identity: 32.97
Query:  YWKCRSREDWLKWGDRNTKVTNCKIS------------------------------------------------------KIPDHLQWGMNNPFSKEDIE
        YW  R++  WLK GDRNTK  + + S                                                      K+ + +   +   F+KE++ 
Subjt:  YWKCRSREDWLKWGDRNTKVTNCKIS------------------------------------------------------KIPDHLQWGMNNPFSKEDIE

Query:  MVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNEIIA
        + +K + P KAPGPDG+ A+F+QKYW IVG +   + L +LN    +  LNKT ISLIPK   PK +++FRPISLCNV YK+I+K L NRLK  L  II+
Subjt:  MVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNEIIA

Query:  PTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQRGLR
          QSAF   R+ITDN ++ F+ +H +  K  GK+G++A+KL MSKA+D VEW F+  ++  +G    W  ++M+C+ +V   +LIN   H    P RGLR
Subjt:  PTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQRGLR

Query:  QGDPLSPYLFLICTEGLSALLHR-------------------------EESLSLIHASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNVRQQLAIE
        QGDPLSP LFL+C EGLSAL+++                         ++S+    A+  E   +R +L +YEE SGQ IN DKS   FS N  Q+   E
Subjt:  QGDPLSPYLFLICTEGLSALLHR-------------------------EESLSLIHASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNVRQQLAIE

Query:  INNVLGVRKTDSLGIYLG--------------MLKE-----------KLFPLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDLNR--------------
        I N+LG  +      YLG              MLKE           KL  + GKE LIKA+AQAI  Y MSCF LP  +CDD+ R              
Subjt:  INNVLGVRKTDSLGIYLG--------------MLKE-----------KLFPLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDLNR--------------

Query:  ----ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSWRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYRGRWEKG--IVL
             S       +  GGLG R+L+ FN AMLAKQ+WRI+ N NSL+ RVLK RYF  GD  NA LGS+PS +WR I    E+  +G R R   G  I +
Subjt:  ----ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSWRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYRGRWEKG--IVL

Query:  VDHR---TPICIN------GNLALMRVRDIINSDGS-WNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLAKSI---------
         + R   TP           N     V  +I+ D   W  E + +IFLP + ++I+ +P       D++IW  +    FSVKSAY++A SI         
Subjt:  VDHR---TPICIN------GNLALMRVRDIINSDGS-WNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLAKSI---------

Query:  ------------------------------TDSLPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIWL--SIFPSLP-GFLSNFRD-----C
                                       D LPT  NISK+GI  +  C  C    E ++H L  C+   ++W   S +P  P     +F D     C
Subjt:  ------------------------------TDSLPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIWL--SIFPSLP-GFLSNFRD-----C

Query:  WSPAGSCLD--------VWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVE----QGSLWRELTGGLEVWVPPVLGSWKINTYASWSRLKGMGGIS
         S A   L+        +W  RN ++HN +P   + +       +E+F +  S++    + S  R        W  P LG +K+N   + S       I 
Subjt:  WSPAGSCLD--------VWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVE----QGSLWRELTGGLEVWVPPVLGSWKINTYASWSRLKGMGGIS

Query:  WVVRDSSGSIILA------------GCEKRYFAQG-------SLPNVSWESDCMNLINAINHKTSDLSELLSFVEEIGSLADSAHVVSFRWCIWSTNGLA
         ++RDS+G ++ A              E     QG        L  V  E D + +I A+N  ++  +EL   ++ I S+++S    +F+    + N +A
Subjt:  WVVRDSSGSIILA------------GCEKRYFAQG-------SLPNVSWESDCMNLINAINHKTSDLSELLSFVEEIGSLADSAHVVSFRWCIWSTNGLA

Query:  HRLARAA
        H LA+ A
Subjt:  HRLARAA

XP_023890148.1 uncharacterized protein LOC112002224 [Quercus suber] | e value: 9.4e-124 | % identity: 33.26
Query:  WKCRSREDWLKWGDRNTKVTNCKIS-----------KIPDHLQWGMNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEG
        W  R+   WLK GDRNT   + K S           K+  H+   +   F  E++   +K M P  APGPDG+  +FYQ+YW  VG+      L  LN G
Subjt:  WKCRSREDWLKWGDRNTKVTNCKIS-----------KIPDHLQWGMNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEG

Query:  ENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNEIIAPTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMS
              N+T I LIPKVK PKH+ +FRPISLCNV+YK+ +K L NRLK  L  ++   QSAFV  R+ITDN ++  + +  IS KR GK G +ALKL MS
Subjt:  ENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNEIIAPTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMS

Query:  KAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHR---------------------
        KAYD VEW  L  +++ +G    WV ++M+C+ TV   + IN +P    +P RGLRQGDP+SPYLFLIC EGLSALLH+                     
Subjt:  KAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHR---------------------

Query:  ----EESLSLIHASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNVRQQLAIEINNVLGVRKTDSLGIYLGM-------------------------
            ++SL    A++ E   I ++L+ YEE SGQ +N  K+   FS+N   +    I  +LG +       YLG+                         
Subjt:  ----EESLSLIHASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNVRQQLAIEINNVLGVRKTDSLGIYLGM-------------------------

Query:  LKEKLFPLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDLNR------------------ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSWRIVKNSN
         KEKL   +GKE LIKA+AQA+  Y MSCFKLP  +CDDL                     S       +  GG+G +DL+ FN A+LAKQ WR+    +
Subjt:  LKEKLFPLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDLNR------------------ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSWRIVKNSN

Query:  SLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKG-------------YRGRWEKGIVLVDHRTPICINGNLALMRVRDIINSDGSWNFELI
        SL  RV + +YF  G+F NA +G +PS  WR I+  +++  KG             +R RW          TP  +  + A  +V+D+I   G W+  LI
Subjt:  SLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKG-------------YRGRWEKGIVLVDHRTPICINGNLALMRVRDIINSDGSWNFELI

Query:  HNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLA---------------------------------------KSITDSLPTLSNISKK
          +F P DA+ I+S+P       D+ IW    N  F+V SAY L                                        ++  D L + +N+ K+
Subjt:  HNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLA---------------------------------------KSITDSLPTLSNISKK

Query:  GIATNPLCFFCRKFVESLSHVLWGCKFTKIIWLS---IFPSLPGFLSNFRD-CW-----SPAGS---------CLDVWSYRNLLLHNGTPHVRNVLVDKI
         I  + LC  C K  E+  H+ W C+  K +W S    FP       NF D  W     SP  S         C ++W  RN + H G     + ++ K 
Subjt:  GIATNPLCFFCRKFVESLSHVLWGCKFTKIIWLS---IFPSLPGFLSNFRD-CW-----SPAGS---------CLDVWSYRNLLLHNGTPHVRNVLVDKI

Query:  FAHVEEFCRGNSVEQGSLWRELTGGLEVWVPPVLGSWKINTYASWSRLKGMGGISWVVRDSSGSIILAGCEK
          +VEE+   +      +  +       W  P  G +K N   +        G+  V+R++ G ++ A   K
Subjt:  FAHVEEFCRGNSVEQGSLWRELTGGLEVWVPPVLGSWKINTYASWSRLKGMGGISWVVRDSSGSIILAGCEK

XP_024950112.1 uncharacterized protein LOC112496847 [Citrus sinensis] | e value: 7.9e-123 | % identity: 31.21
Query:  MYWKCRSREDWLKWGDRNTKVTNCKIS-KIPDHLQWGMNN-------------------------------------------------------PFSKE
        ++ K RSR DWL+ GD+NTK  + K S +   +  WG+ N                                                       PF+ E
Subjt:  MYWKCRSREDWLKWGDRNTKVTNCKIS-KIPDHLQWGMNN-------------------------------------------------------PFSKE

Query:  DIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNE
        D+E  +  M P KAPGPDG+ A F+QK+W  V    +  CL +LNE  N   LN T I+LIPK+  P+ +S++RPISLCNV Y+++AK + NR+K  L++
Subjt:  DIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNE

Query:  IIAPTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQR
        II+P QSAF+  R+ITDN I+G++CLH I   +  K+G VALKL +SKAYD VEWPFL   +L +G     V ++MRCV +    VLIN  P     P+R
Subjt:  IIAPTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQR

Query:  GLRQGDPLSPYLFLICTEGLSALLHREESLSLIHA-----------SIFESKNIRKVLQEYEEVSGQMINLDKSEC--LFSKNV--RQQLAIEINNVLGV
        GLRQG PLSPYLF++C E LS LL   E   LI              +F   ++        E SG++    ++    +F+ NV  + +  + + +++G 
Subjt:  GLRQGDPLSPYLFLICTEGLSALLHREESLSLIHA-----------SIFESKNIRKVLQEYEEVSGQMINLDKSEC--LFSKNV--RQQLAIEINNVLGV

Query:  RKTDSLG-IYLGML------KEKLFPLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDLNRA------------------SGTSFVLVRMRGGLGLRDLQ
        +K      I L +L      ++K     GKE LIKA AQAI  Y MS FKLP   CDD+ RA                          ++RGGLG R+  
Subjt:  RKTDSLG-IYLGML------KEKLFPLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDLNRA------------------SGTSFVLVRMRGGLGLRDLQ

Query:  LFNQAMLAKQSWRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYR---GRWEKGIVLVDH-------RTPICINGNLALMR
         FNQA++AKQ+WR+++  NSL++RVL+ RYF++  F  A  G+N S  WR I+WGR++  KG R   G  +K  +  D+         PI          
Subjt:  LFNQAMLAKQSWRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYR---GRWEKGIVLVDH-------RTPICINGNLALMR

Query:  VRDIINSDGSWNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLA------------------------------------KSI
        V D+I +D  W+   +   FL  D   I+ +P       DE++W +D    +SVKS Y LA                                    ++ 
Subjt:  VRDIINSDGSWNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLA------------------------------------KSI

Query:  TDSLPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIWLSIFPSLPGFLSNFRDCWSP----------------AGSCLDVWSYRNLLLHNGT
         + LP+  N+ K+ +   P C  C+  VE++SH L  CK  + IWL    S P   +N +D +S                    C   W  RN  + +G 
Subjt:  TDSLPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIWLSIFPSLPGFLSNFRDCWSP----------------AGSCLDVWSYRNLLLHNGT

Query:  PHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEVWVPPVLGSWKINTYASWSRLKGMGGISWVVRDSSGSIILAGCEKRYF-AQGSLPN---VSW
             +   K  + +  F R    +Q  +   +    + W+PP    +K+N  A+++      G+  V+RDS+G I+ AG  +       SL     V W
Subjt:  PHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEVWVPPVLGSWKINTYASWSRLKGMGGISWVVRDSSGSIILAGCEKRYF-AQGSLPN---VSW

Query:  ---------------ESDCMNLINAINHKTSDLSELLSFVEEIGSLADSAHVVSFRWCIWSTNGLAHRLARAAV
                       ESDC+ ++  +N+     SE+   +  I +       V         N  AH LA+ A+
Subjt:  ---------------ESDCMNLINAINHKTSDLSELLSFVEEIGSLADSAHVVSFRWCIWSTNGLAHRLARAAV

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata] | e value: 1.6e-123 | % identity: 32.57
Query:  MYWKCRSREDWLKWGDRNTKVTNCKIS------------------------------------------------------KIPDHLQWGMNNPFSKEDI
        +YW  RSR +WL+ GDRNTK  + K S                                                      K+ + ++  ++N F+ E++
Subjt:  MYWKCRSREDWLKWGDRNTKVTNCKIS------------------------------------------------------KIPDHLQWGMNNPFSKEDI

Query:  EMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNEII
        +  +  MGP KAPGPDG++A+FYQK+W IVG   +   L  LN G  +  +N T I LIPKV+ P+ +SEFRPISLCNV YKII+K L NRLK+ L +II
Subjt:  EMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNEII

Query:  APTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQRGL
        + TQSAFV GR+ITDN ++ ++ LH + +++ GK G VALKL +SKAYD VEW FL++++  +G    W+  +M CV T    +L+N KP+    P RG+
Subjt:  APTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQRGL

Query:  RQGDPLSPYLFLICTEGLSALLHR-------------------------EESLSLIHASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNVRQQLAI
        RQGDP+SPYLFL+C EGL+ALL++                         ++SL    A+  E + I ++LQ YE  SGQ INL+KS   FS N  +    
Subjt:  RQGDPLSPYLFLICTEGLSALLHR-------------------------EESLSLIHASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNVRQQLAI

Query:  EINNVLGVRKTDSLGIYLGM--------------LKEK-----------LFPLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDLNR-------------
        +I  +LGV++ D    YLG+              LK++           L   +GKE LIKA+AQAI  Y MS F++P  +C +L               
Subjt:  EINNVLGVRKTDSLGIYLGM--------------LKEK-----------LFPLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDLNR-------------

Query:  -----ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSWRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYRGRWEKGI---
              S       +  GG+G RDL+ FN AMLAKQ WR+V+  +SLL R  K RYF    F  A    N S  WR ++  + +   GY  R   G    
Subjt:  -----ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSWRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYRGRWEKGI---

Query:  VLVDHRTPICINGNLALMRVRDIINSDGS--------------WNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLAKSI-TD
         + D   P     N    +V + +  DGS              WN+E I  IF   +A++I  +P       D I W + P   FSVKSAY++A+ I TD
Subjt:  VLVDHRTPICINGNLALMRVRDIINSDGS--------------WNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLAKSI-TD

Query:  S--------------------------------------LPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIWLSIFPSLPGFLSNFRDCWS
        +                                      LPT  N++ + I  +  C  C +  ES  H LW C   + IW      L        D   
Subjt:  S--------------------------------------LPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIWLSIFPSLPGFLSNFRDCWS

Query:  PAGSCLD----------------VWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEVWVPPVLGSWKINTYASWSRLKGMGG
             L+                VW  RN LLH G   V + L  +   ++ EF   N+  +  + R      ++W PP  G +K+N  A+     G  G
Subjt:  PAGSCLD----------------VWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEVWVPPVLGSWKINTYASWSRLKGMGG

Query:  ISWVVRDSSGSIILA
           ++R+  G ++ A
Subjt:  ISWVVRDSSGSIILA

TrEMBL top hits (e value, % identity, alignment)
A0A2N9FN47 Uncharacterized protein | e value: 2.4e-125 | % identity: 35.53
Query:  KVTNCKISKIPDHLQWGMNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRP
        +VT    +++   +   + +PFS E+I+  +  M P KAPGPDG+ A+F+QKYW++VG       L  LN G  +G +N T I LIPKVK P +++ FRP
Subjt:  KVTNCKISKIPDHLQWGMNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRP

Query:  ISLCNVSYKIIAKYLVNRLKRALNEIIAPTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRIL
        ISLCNV YKII+K LVNR+K  L+++I+ +QSAFV GRMITDN ++ F+ LH + +KR GK G +A KL MSKAYD VEW +LRA+LL +G    WV ++
Subjt:  ISLCNVSYKIIAKYLVNRLKRALNEIIAPTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRIL

Query:  MRCVVTVQCPVLINCKPHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREESLSLIH-------------------------ASIFESKNIRKVLQEY
        M CV +V   VL+N +      P RGLRQGDPLSPYLFLIC EGLSALL + E   LIH                         A+  + + +  +L  Y
Subjt:  MRCVVTVQCPVLINCKPHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREESLSLIH-------------------------ASIFESKNIRKVLQEY

Query:  EEVSGQMINLDKSECLFSKNVRQQLAIEINNVLGVRKTDSLGIYLGM-------------------------LKEKLFPLSGKETLIKAIAQAILIYMMS
        E+ SGQ +N  K+   FS N  Q     I  + G   T     YLG+                          KEKL   +G+E LIKA+ QAI  Y MS
Subjt:  EEVSGQMINLDKSECLFSKNVRQQLAIEINNVLGVRKTDSLGIYLGM-------------------------LKEKLFPLSGKETLIKAIAQAILIYMMS

Query:  CFKLPYTVCDDLNR------------------ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSWRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSL
        CFKLP  +C +++                    S       + RGG+G R+L LFN AMLA+Q WR+++  NSLL RVLK +YF +  F  A +  NPSL
Subjt:  CFKLPYTVCDDLNR------------------ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSWRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSL

Query:  TWRGILWGRELFLKGYRGR---------WEKGIVLVDHRTPICINGNLAL---MRVRDIINSD-GSWNFELIHNIFLPSDAKSIISVPRRGIGGADEIIW
        TWR I   +++ + G   R         W K   L+    P  ++    L     V ++IN + G WN  LI  IFLPSDA+ I  +P       D++IW
Subjt:  TWRGILWGRELFLKGYRGR---------WEKGIVLVDHRTPICINGNLAL---MRVRDIINSD-GSWNFELIHNIFLPSDAKSIISVPRRGIGGADEIIW

Query:  GFDPNVCFSVKSAYNL----AKSITDS---------------------------------LPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKI
          +    FSVK+AY L    A  IT+S                                 LPT + +  K I+++  C +C    E+  H+LW C F + 
Subjt:  GFDPNVCFSVKSAYNL----AKSITDS---------------------------------LPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKI

Query:  IWLSIFPSLP------GFLSNF-----RDCWSPAGSCL-----DVWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEVWVPP
        +W +    +P      G  S+F     RD  SP    +      +W  RN L+  G     + +  +      EF      E  S+  EL    E W PP
Subjt:  IWLSIFPSLP------GFLSNF-----RDCWSPAGSCL-----DVWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEVWVPP

Query:  VLGSWKINTYASWSRLKGMGGISWVVRDSSGSIILA
         +GS+K++    +       G+  ++RD  G +++A
Subjt:  VLGSWKINTYASWSRLKGMGGISWVVRDSSGSIILA

A0A2N9FNH6 Reverse transcriptase domain-containing protein | e value: 4.8e-126 | % identity: 33.15
Query:  PFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLK
        PF+ E+I   +  M P KAPGPDG++AMFYQK+W IVG D     L  L+ G+ +  +N T I+LIPK+  P+ +++FRPISLCNV YKII+K L NRLK
Subjt:  PFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLK

Query:  RALNEIIAPTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHE
          L+ II+  QSAFV GR+ITDN ++ F+ LH + +KR G+   +A+KL MSKAYD VEW FL  M++ +G D+ WV ++M+C+ +V   V++N +P   
Subjt:  RALNEIIAPTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHE

Query:  FVPQRGLRQGDPLSPYLFLICTEGLSALLHR-------------------------EESLSLIHASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKN
          P RG+RQGDPLSPYLFLIC EGL+ALL +                         ++SL    A++ E +N+  +L  YE+ SGQ +N +K+   FS N
Subjt:  FVPQRGLRQGDPLSPYLFLICTEGLSALLHR-------------------------EESLSLIHASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKN

Query:  VRQQLAIEINNVLGVRKTDSLGIYLGM-------------------------LKEKLFPLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDLNR------
            L   I  +L    T  LG YLG+                          K KL   +G+E LIK++AQAI +Y MSCF++P T+C ++N       
Subjt:  VRQQLAIEINNVLGVRKTDSLGIYLGM-------------------------LKEKLFPLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDLNR------

Query:  ------------ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSWRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYRGRW
                       ++    +  GG+G RDL LFNQA+LAKQ WR++++ N+LL R+LK +YF +  F  A +  + S  WR I   R +  KG R R 
Subjt:  ------------ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSWRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYRGRW

Query:  EKGIVL----------------VDHRTPICINGNLALMRVRDIINSD-GSWNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNL
          G  +                V  R  +  N       V D+I+ +   WN  LI +IF P +A  I ++P R +   D ++W   PN  F+ +SAY L
Subjt:  EKGIVL----------------VDHRTPICINGNLALMRVRDIINSD-GSWNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNL

Query:  A--------------------------------------KSITDSLPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIWLS---------IF
                                               ++ T  LPT +N+ ++G+  +  C  C    E++ H LW C++ +  WL+         + 
Subjt:  A--------------------------------------KSITDSLPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIWLS---------IF

Query:  PSLPGFLSNF--RDCWSPAGSCL-----DVWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEVWVPPVLGSWKINTYASWSR
        PS    L ++  R   +P           +W+YRN    N        L  K  ++VEEF   N+         +T     W PP    +K+N   SW R
Subjt:  PSLPGFLSNF--RDCWSPAGSCL-----DVWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEVWVPPVLGSWKINTYASWSR

Query:  LKGMG--GISWVVRDSSGSIILAGCEKRYFAQGSLPNVS----------WESDCMNLINAINHK------TSD---LSELLSFVEEIGSLADSAHVVSFR
         K     GI  V+RDS G+++ A CE+   A   L N +           E+   ++I   +H       T+D   LSEL   + +I   ++  H ++F 
Subjt:  LKGMG--GISWVVRDSSGSIILAGCEKRYFAQGSLPNVS----------WESDCMNLINAINHK------TSD---LSELLSFVEEIGSLADSAHVVSFR

Query:  WCIWSTNGLAHRLA
            S N  A  LA
Subjt:  WCIWSTNGLAHRLA

A0A2N9GM07 Reverse transcriptase domain-containing protein | e value: 1.3e-126 | % identity: 31.55
Query:  WKCRSREDWLKWGDRNTKVTNCKISK---------------------------------------IPDHLQWG-----------MNNP----FSKEDIEM
        WK R+R  WL  GD+NT+  + K S+                                       +P H+Q G           MN+     F+ E++  
Subjt:  WKCRSREDWLKWGDRNTKVTNCKISK---------------------------------------IPDHLQWG-----------MNNP----FSKEDIEM

Query:  VVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNEIIAP
         ++ M P KAPGPDG+ A+F+QKYW IVG++     L++LN   +    NKT I+LIPK K P+ ++EFRPISLCNV+YK+I+K + NRLK  L+++I+ 
Subjt:  VVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNEIIAP

Query:  TQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQRGLRQ
        TQSAFV GR ITDNA++ F+ +H    KR GKD ++ALKL MSKAYD VEW F+  ++  +G    W+ ++M C+ TVQ  V +N       +P RGLRQ
Subjt:  TQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQRGLRQ

Query:  GDPLSPYLFLICTEGLSALLHREESLSLIH-------------------------ASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNVRQQLAIEI
        GDPLSPYLFL+C EG S+LL   E   LIH                         A+  + + +  + + YE+ SGQ IN+DKS   FS+N       EI
Subjt:  GDPLSPYLFLICTEGLSALLHREESLSLIH-------------------------ASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNVRQQLAIEI

Query:  NNVLGVRKTDSLGIYLGM-------------------------LKEKLFPLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDL----------NRASGTS
               +      YLG+                          KEKL    G+E LIK++AQAI  Y MSCF+LP T+C ++           R   + 
Subjt:  NNVLGVRKTDSLGIYLGM-------------------------LKEKLFPLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDL----------NRASGTS

Query:  FVLV--------RMRGGLGLRDLQLFNQAMLAKQSWRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYRGRWEKGI---VL
          LV        ++RGG+G RDL  FN A+LAKQ WR++ N NS+L R+ K +YF  G+   A +G NPS  WR I    ++  +G R R   GI   + 
Subjt:  FVLV--------RMRGGLGLRDLQLFNQAMLAKQSWRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYRGRWEKGI---VL

Query:  VDHRTP-----------ICINGNLALMRVRDIINSDGSWNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLAKSITDS-----
         D   P           + I     +  + D I     W  E +H  FLP D  +I+S+P   I   D+ +W  + N  F+VKSAY++A ++  S     
Subjt:  VDHRTP-----------ICINGNLALMRVRDIINSDGSWNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLAKSITDS-----

Query:  ----------------------------------LPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIW-----LSIFP-------------S
                                          LPT+  + ++G+  NP C  C +  ES+SH +W C   + IW     L I P              
Subjt:  ----------------------------------LPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIW-----LSIFP-------------S

Query:  LPGFLSNFRDCWSPAGSCLDVWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEV----WVPPVLGSWKINTYASWSRLKGMG
          G +      W  A    +VW  RN  +HN     +N +  ++F   ++       E   L+ + +G   +    W  P  G +K+NT  +        
Subjt:  LPGFLSNFRDCWSPAGSCLDVWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEV----WVPPVLGSWKINTYASWSRLKGMG

Query:  GISWVVRDSSGSIILAGCEKRYFAQGS--------------------LPNVSWESDCMNLINAINHKTSDLSELLSFVEEIGSLADSAHVVSFRWCIWST
        GI  V+RD  GS  LAG   R     S                    L ++  ESD  N+++AIN +  D   +   +  IG L  S       +    +
Subjt:  GISWVVRDSSGSIILAGCEKRYFAQGS--------------------LPNVSWESDCMNLINAINHKTSDLSELLSFVEEIGSLADSAHVVSFRWCIWST

Query:  NGLAHRLARAA
        N +AH LA+ A
Subjt:  NGLAHRLARAA

A0A2N9GWG4 Uncharacterized protein | e value: 2.4e-125 | % identity: 35.53
Query:  KVTNCKISKIPDHLQWGMNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRP
        +VT    +++   +   + +PFS E+I+  +  M P KAPGPDG+ A+F+QKYW++VG       L  LN G  +G +N T I LIPKVK P +++ FRP
Subjt:  KVTNCKISKIPDHLQWGMNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRP

Query:  ISLCNVSYKIIAKYLVNRLKRALNEIIAPTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRIL
        ISLCNV YKII+K LVNR+K  L+++I+ +QSAFV GRMITDN ++ F+ LH + +KR GK G +A KL MSKAYD VEW +LRA+LL +G    WV ++
Subjt:  ISLCNVSYKIIAKYLVNRLKRALNEIIAPTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRIL

Query:  MRCVVTVQCPVLINCKPHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREESLSLIH-------------------------ASIFESKNIRKVLQEY
        M CV +V   VL+N +      P RGLRQGDPLSPYLFLIC EGLSALL + E   LIH                         A+  + + +  +L  Y
Subjt:  MRCVVTVQCPVLINCKPHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREESLSLIH-------------------------ASIFESKNIRKVLQEY

Query:  EEVSGQMINLDKSECLFSKNVRQQLAIEINNVLGVRKTDSLGIYLGM-------------------------LKEKLFPLSGKETLIKAIAQAILIYMMS
        E+ SGQ +N  K+   FS N  Q     I  + G   T     YLG+                          KEKL   +G+E LIKA+ QAI  Y MS
Subjt:  EEVSGQMINLDKSECLFSKNVRQQLAIEINNVLGVRKTDSLGIYLGM-------------------------LKEKLFPLSGKETLIKAIAQAILIYMMS

Query:  CFKLPYTVCDDLNR------------------ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSWRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSL
        CFKLP  +C +++                    S       + RGG+G R+L LFN AMLA+Q WR+++  NSLL RVLK +YF +  F  A +  NPSL
Subjt:  CFKLPYTVCDDLNR------------------ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSWRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSL

Query:  TWRGILWGRELFLKGYRGR---------WEKGIVLVDHRTPICINGNLAL---MRVRDIINSD-GSWNFELIHNIFLPSDAKSIISVPRRGIGGADEIIW
        TWR I   +++ + G   R         W K   L+    P  ++    L     V ++IN + G WN  LI  IFLPSDA+ I  +P       D++IW
Subjt:  TWRGILWGRELFLKGYRGR---------WEKGIVLVDHRTPICINGNLAL---MRVRDIINSD-GSWNFELIHNIFLPSDAKSIISVPRRGIGGADEIIW

Query:  GFDPNVCFSVKSAYNL----AKSITDS---------------------------------LPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKI
          +    FSVK+AY L    A  IT+S                                 LPT + +  K I+++  C +C    E+  H+LW C F + 
Subjt:  GFDPNVCFSVKSAYNL----AKSITDS---------------------------------LPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKI

Query:  IWLSIFPSLP------GFLSNF-----RDCWSPAGSCL-----DVWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEVWVPP
        +W +    +P      G  S+F     RD  SP    +      +W  RN L+  G     + +  +      EF      E  S+  EL    E W PP
Subjt:  IWLSIFPSLP------GFLSNF-----RDCWSPAGSCL-----DVWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEVWVPP

Query:  VLGSWKINTYASWSRLKGMGGISWVVRDSSGSIILA
         +GS+K++    +       G+  ++RD  G +++A
Subjt:  VLGSWKINTYASWSRLKGMGGISWVVRDSSGSIILA

A0A7N2L6Z9 Reverse transcriptase domain-containing protein | e value: 1.1e-125 | % identity: 32.74
Query:  MYWKCRSREDWLKWGDRNTKVTNCKISK------------------------------------------------------IPDHLQWGMNNPFSKEDI
        ++W  RS+  WLK GDRNTK  + + S+                                                      I + +   ++  F++E+I
Subjt:  MYWKCRSREDWLKWGDRNTKVTNCKISK------------------------------------------------------IPDHLQWGMNNPFSKEDI

Query:  EMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNEII
           +K + P K+PGPDG+ A+F+QKYWDIVG +   + L +LN G ++  +NKT I LIPK   PK +++FRPISLCNV YK+I+K L NRLK  L  II
Subjt:  EMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNEII

Query:  APTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQRGL
           QSAF   R+ITDN ++ ++ +H +  K+ GKD ++A KL MSKA+D VEW F+  ++  +G +  W+ ++MRC+ +V   V+IN +     VP RGL
Subjt:  APTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQRGL

Query:  RQGDPLSPYLFLICTEGLSALLH-------------------------REESLSLIHASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNVRQQLAI
        RQGDPLSPYLFL+C EGLSALLH                          ++SL    A+  E + ++++L++YE  SGQ +N DKS   FS N   +L  
Subjt:  RQGDPLSPYLFLICTEGLSALLH-------------------------REESLSLIHASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNVRQQLAI

Query:  EINNVLGVRKTDSLGIYLGM-------------------------LKEKLFPLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDL------------NRA
         I N+LG  +      YLG+                          K KL    GKE LIKA+AQAI  Y MSCF LP ++CD+L            N+ 
Subjt:  EINNVLGVRKTDSLGIYLGM-------------------------LKEKLFPLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDL------------NRA

Query:  SGTSFVLVRMR------GGLGLRDLQLFNQAMLAKQSWRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYRGRWEKG--IV
        S  +++  R        GGLG R+L  FN A+LAKQ+WRI+ N  SL AR+LK +YF  GD  NA LGSNPS TWR I    E+  KG R R   G  I 
Subjt:  SGTSFVLVRMR------GGLGLRDLQLFNQAMLAKQSWRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYRGRWEKG--IV

Query:  LVDHR-----------TPICINGNLALMRVRDIINSDGS-WNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLAKSITDS---
        + D +           TP  I  +  +  V  +I+ D   W  + I  +FLP DA++I+ +P       D IIW  +    FSVKSAY +A ++ +S   
Subjt:  LVDHR-----------TPICINGNLALMRVRDIINSDGS-WNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLAKSITDS---

Query:  ------------------------------------LPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIWLSIFPSLP-GFLSNFRD-----
                                            LPT++N+  +G+  N  C  C + VE L+H L  C F   +W +++   P G L   RD     
Subjt:  ------------------------------------LPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIWLSIFPSLP-GFLSNFRD-----

Query:  ----CWSPAGSCL-------DVWSYRNLLLHNG---TPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEVWVPPVLGSWKINTYASWS-RLKGM
              SP    L        +W  RNL +H+    +P     +  ++   ++++     ++   +   L G    W  P  G +K+N   + S    G 
Subjt:  ----CWSPAGSCL-------DVWSYRNLLLHNG---TPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEVWVPPVLGSWKINTYASWS-RLKGM

Query:  GGISWVVRDSSGSIILAGCE--KRYF----------AQG-------SLPNVSWESDCMNLINAINHKTSDLSELLSFVEEIGSLADSAHVVSFRWCIWST
         G+  V+RD SG +I A C+    YF           QG        LP +  ESD ++ I AIN    +  E    VE I          SF +     
Subjt:  GGISWVVRDSSGSIILAGCE--KRYF----------AQG-------SLPNVSWESDCMNLINAINHKTSDLSELLSFVEEIGSLADSAHVVSFRWCIWST

Query:  NGLAHRLARAA
        N +AH+LA+ A
Subjt:  NGLAHRLARAA

SwissProt top hits | e value | %identity | Alignment
O00370 LINE-1 retrotransposable element ORF2 protein | 1.4e-29 | 25.67
Query:  MNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSE-FRPISLCNVSYKIIAKYLV
        +N P +  +I  ++  +   K+PGPDG  A FYQ+Y + +    +++   I  EG       +  I LIPK  +     E FRPISL N+  KI+ K L 
Subjt:  MNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSE-FRPISLCNVSYKIIAKYLV

Query:  NRLKRALNEIIAPTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCK
        NR+++ + ++I   Q  F+ G     N       +  I+  R      V + +   KA+D ++ PF+   L  +G+D  +++I+          +++N +
Subjt:  NRLKRALNEIIAPTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCK

Query:  PHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREESLSLIHAS--------------------IFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNV
            F  + G RQG PLSP LF I  E L+  + +E+ +  I                       I  ++N+ K++  + +VSG  IN+ KS+     N 
Subjt:  PHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREESLSLIHAS--------------------IFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNV

Query:  RQ---QLAIEINNVLGVRKTDSLGI-------------YLGMLKE--------KLFPLS--GKETLIKAIAQAILIYMMSC--FKLPYTVCDDLNRASGT
        RQ   Q+  E+   +  ++   LGI             Y  +LKE        K  P S  G+  ++K      +IY  +    KLP T   +L + +  
Subjt:  RQ---QLAIEINNVLGVRKTDSLGI-------------YLGMLKE--------KLFPLS--GKETLIKAIAQAILIYMMSC--FKLPYTVCDDLNRASGT

Query:  SFVLVRMR--------------GGLGLRDLQLFNQAMLAKQSWRIVKN
         F+  + R              GG+ L D +L+ +A + K +W   +N
Subjt:  SFVLVRMR--------------GGLGLRDLQLFNQAMLAKQSWRIVKN

P08548 LINE-1 reverse transcriptase homolog | 2.2e-27 | 27.92
Query:  MNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKV-KKPKHLSEFRPISLCNVSYKIIAKYLV
        +N P S  +I   ++ +   K+PGPDG  + FYQ + + +    + +   I  EG       +  I+LIPK  K P     +RPISL N+  KI+ K L 
Subjt:  MNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKV-KKPKHLSEFRPISLCNVSYKIIAKYLV

Query:  NRLKRALNEIIAPTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCK
        NR+++ + +II   Q  F+ G     N       +  I +K   KD  + L +   KA+D ++ PF+   L  IG++  +++++          +++N  
Subjt:  NRLKRALNEIIAPTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCK

Query:  PHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREESLSLIHASIFESK--------------------NIRKVLQEYEEVSGQMINLDKSEC-LFSKN
            F  + G RQG PLSP LF I  E L+  +  E+++  IH    E K                     + +V++EY  VSG  IN  KS   +++ N
Subjt:  PHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREESLSLIHASIFESK--------------------NIRKVLQEYEEVSGQMINLDKSEC-LFSKN

Query:  VRQQLAIE--INNVLGVRKTDSLGIYLGMLKEKLFPLSGKETLIKAIAQAI
         + +  ++  I   +  +K   LG+YL    + L+     ETL K IA+ +
Subjt:  VRQQLAIE--INNVLGVRKTDSLGIYLGMLKEKLFPLSGKETLIKAIAQAI

P11369 LINE-1 retrotransposable element ORF2 protein | 2.1e-25 | 28.43
Query:  DHLQWGMNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKK-PKHLSEFRPISLCNVSYKI
        DHL    N+P S ++IE V+  +   K+PGPDG  A FYQ + + +     ++  +I  EG       +  I+LIPK +K P  +  FRPISL N+  KI
Subjt:  DHLQWGMNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKK-PKHLSEFRPISLCNVSYKI

Query:  IAKYLVNRLKRALNEIIAPTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCP
        + K L NR++  +  II P Q  F+ G     N       +H I+  ++     + + L   KA+D ++ PF+  +L   G+   ++ ++          
Subjt:  IAKYLVNRLKRALNEIIAPTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCP

Query:  VLINCKPHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREESLSLIHAS----------------IFESKN----IRKVLQEYEEVSGQMINLDKSEC
        + +N +       + G RQG PLSPYLF I  E L+  + +++ +  I                   I + KN    +  ++  + EV G  IN +KS  
Subjt:  VLINCKPHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREESLSLIHAS----------------IFESKN----IRKVLQEYEEVSGQMINLDKSEC

Query:  -LFSKNVRQQLAI
         L++KN + +  I
Subjt:  -LFSKNVRQQLAI

P14381 Transposon TX1 uncharacterized 149 kDa protein | 7.2e-26 | 29.35
Query:  MNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVN
        +  P + +++   ++ M   K+PG DG+   F+Q +WD +G D  R+      +GE      + ++SL+PK    + +  +RP+SL +  YKI+AK +  
Subjt:  MNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVN

Query:  RLKRALNEIIAPTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKP
        RLK  L E+I P QS  V GR I DN  L    LH   ++R G      L L   KA+D V+  +L   L +      +V  L     + +C V IN   
Subjt:  RLKRALNEIIAPTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKP

Query:  HHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREESLSLIHASIFESKNIRKVLQEYEE----VSGQMINLDKSE
               RG+RQG PLS  L+ +  E    LL +      +   + +  ++R VL  Y +    V+  +++L++++
Subjt:  HHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREESLSLIHASIFESKNIRKVLQEYEE----VSGQMINLDKSE

P93295 Uncharacterized mitochondrial protein AtMg00310 | 1.3e-14 | 39.2
Query:  AILIYMMSCFKLPYTVCDDLNRASGTSF-----------------VLVRMR---GGLGLRDLQLFNQAMLAKQSWRIVKNSNSLLARVLKGRYFKDGDFC
        A+ +Y MSCF+L   +C  L  A  T F                  L + +   GGLG RDL  FNQA+LAKQS+RI+   ++LL+R+L+ RYF      
Subjt:  AILIYMMSCFKLPYTVCDDLNRASGTSF-----------------VLVRMR---GGLGLRDLQLFNQAMLAKQSWRIVKNSNSLLARVLKGRYFKDGDFC

Query:  NALLGSNPSLTWRGILWGRELFLKG
           +G+ PS  WR I+ GREL  +G
Subjt:  NALLGSNPSLTWRGILWGRELFLKG

Arabidopsis top hits | e value | %identity | Alignment
AT1G43760.1 DNAse I-like superfamily protein | 3.3e-10 | 40.45
Query:  SKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKII
        S ++I   V  M   KAPGPD   A F+ + W +V   T+         G  +   N T I+LIPKV     LS FRP+S C V YKII
Subjt:  SKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKII

AT4G20520.1 RNA binding;RNA-directed DNA polymerases | 4.6e-12 | 36.36
Query:  LVNRLKRALNEIIAPTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMR
        +V RLK  +  +I P Q++F+ GR+ TDN +   + +H++  K+ G  GW+ LKL + KAYD + W +L   L+S G    W+  + R
Subjt:  LVNRLKRALNEIIAPTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMR
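
The %identity value of a hit can be cross-checked against the middle line of its alignment. The following is a minimal sketch, assuming the column is the BLAST-style ratio of identical columns to alignment length, and that a letter on the middle line marks an identical column while `+` marks a conservative substitution. The function name `midline_identity` is hypothetical and not part of the database.

```python
def midline_identity(query: str, midline: str) -> tuple[int, float]:
    """Return (identical columns, percent identity) for one alignment.

    Assumes a BLAST-style middle line: a letter marks an identical
    column, '+' a conservative substitution, a space a mismatch or gap.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    # Alignment length is the number of columns in the aligned query row,
    # gap characters included.
    return identical, 100.0 * identical / len(query)


# Query row and middle line of the AT4G20520.1 hit above.
QUERY = "LVNRLKRALNEIIAPTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMR"
MIDLINE = "+V RLK  +  +I P Q++F+ GR+ TDN +   + +H++  K+ G  GW+ LKL + KAYD + W +L   L+S G    W+  + R"

identical, percent = midline_identity(QUERY, MIDLINE)
print(f"{identical} identical of {len(QUERY)} columns = {percent:.2f}%")
```

For this hit the count is 32 identical columns out of 88, which agrees with the listed 36.36. For alignments that span several blocks, the query rows and middle lines would need to be concatenated first.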

AT4G29090.1 Ribonuclease H-like superfamily protein | 7.6e-23 | 24.53
Query:  AILIYMMSCFKLPYTVCDDL------------NRASGTSF------VLVRMRGGLGLRDLQLFNQAMLAKQSWRIVKNSNSLLARVLKGRYFKDGDFCNA
        A+  Y M+CF LP TVC  +              A G  +         +  GG+G +D++ FN A+L KQ WR++    SL+A+V K RYF   D  NA
Subjt:  AILIYMMSCFKLPYTVCDDL------------NRASGTSF------VLVRMRGGLGLRDLQLFNQAMLAKQSWRIVKNSNSLLARVLKGRYFKDGDFCNA

Query:  LLGSNPSLTWRGILWGRELFLKGYRGRWEKG--IVLVDHR----------------TPICINGNLALMRVRDIINSDG-SWNFELIHNIFLPSDAKSIIS
         LGS PS  W+ I   +E+  +G R     G  I++  H+                 P       ++++V D+I+  G  W  ++I  +F   + K I  
Subjt:  LLGSNPSLTWRGILWGRELFLKGYRGRWEKG--IVLVDHR----------------TPICINGNLALMRVRDIINSDG-SWNFELIHNIFLPSDAKSIIS

Query:  VPRRGIGGADEIIWGFDPNVCFSVKSAY---------------------------------------NLAKSITDSLPTLSNISKKGIATNPLCFFCRKF
        +   G    D   W +  +  ++VKS Y                                        L K +++SLP    ++ + ++    C  C   
Subjt:  VPRRGIGGADEIIWGFDPNVCFSVKSAY---------------------------------------NLAKSITDSLPTLSNISKKGIATNPLCFFCRKF

Query:  VESLSHVLWGCKFTKIIW
         E+++H+L+ C F ++ W
Subjt:  VESLSHVLWGCKFTKIIW

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 9.0e-16 | 39.2
Query:  AILIYMMSCFKLPYTVCDDLNRASGTSF-----------------VLVRMR---GGLGLRDLQLFNQAMLAKQSWRIVKNSNSLLARVLKGRYFKDGDFC
        A+ +Y MSCF+L   +C  L  A  T F                  L + +   GGLG RDL  FNQA+LAKQS+RI+   ++LL+R+L+ RYF      
Subjt:  AILIYMMSCFKLPYTVCDDLNRASGTSF-----------------VLVRMR---GGLGLRDLQLFNQAMLAKQSWRIVKNSNSLLARVLKGRYFKDGDFC

Query:  NALLGSNPSLTWRGILWGRELFLKG
           +G+ PS  WR I+ GREL  +G
Subjt:  NALLGSNPSLTWRGILWGRELFLKG

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) | 1.2e-07 | 59.52
Query:  LINCKPHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREE
        +IN  P     P RGLRQGDPLSPYLF++CTE LS L  R +
Subjt:  LINCKPHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREE


Sequences
CDS sequence
ATGTATTGGAAGTGTAGATCGCGGGAAGATTGGCTTAAATGGGGGGATAGGAACACAAAGGTGACCAACTGCAAAATTAGTAAAATCCCAGATCATCTCCAGTGGGGTAT
GAACAATCCTTTCTCGAAAGAGGACATTGAGATGGTTGTCAAAGGTATGGGTCCCTATAAAGCCCCAGGCCCTGATGGGGTGCATGCTATGTTCTATCAAAAGTACTGGG
ACATTGTGGGGCAGGACACAATGAGAATATGCCTGAGGATCCTGAATGAAGGGGAGAATGTAGGCCCCCTTAACAAGACCTTAATTTCTTTGATTCCCAAAGTTAAGAAG
CCAAAGCATTTGTCTGAGTTTCGGCCCATCAGCTTGTGCAATGTGAGCTATAAAATTATTGCTAAATATTTGGTCAATAGACTGAAAAGAGCGCTCAATGAGATTATAGC
ACCAACTCAATCTGCCTTCGTTCTGGGGAGAATGATTACTGATAATGCTATTTTGGGTTTTAAATGCTTACATGCTATTAGTAGCAAACGAGTTGGGAAAGATGGGTGGG
TGGCTCTTAAATTGGGCATGAGCAAGGCCTATGACACGGTCGAATGGCCGTTTTTGCGAGCTATGCTCCTTTCCATTGGCCTTGACAGGGCGTGGGTAAGGATTCTCATG
AGGTGTGTGGTAACAGTCCAATGTCCTGTCCTCATCAATTGTAAGCCCCATCACGAGTTTGTTCCTCAAAGAGGTCTTCGCCAAGGGGACCCTCTCTCTCCTTATCTGTT
CCTCATTTGCACGGAAGGTTTGTCGGCTCTTCTACACAGGGAAGAATCTCTGTCTCTTATTCATGCGTCGATTTTCGAGAGCAAGAATATTCGCAAAGTTTTGCAAGAGT
ATGAAGAAGTGTCGGGCCAAATGATAAACCTGGATAAGTCTGAATGTCTCTTTAGTAAGAATGTTAGGCAGCAGTTGGCGATTGAAATCAATAATGTTCTAGGAGTGCGA
AAAACTGACTCTTTAGGGATATATTTGGGGATGCTGAAAGAGAAGCTCTTCCCCCTTAGTGGCAAAGAGACTTTGATCAAGGCTATAGCCCAAGCAATCCTCATCTATAT
GATGAGTTGTTTCAAGCTTCCCTACACAGTTTGTGATGATCTAAATAGGGCCTCTGGAACAAGCTTTGTGTTAGTAAGGATGCGGGGGGGTCTTGGTCTTCGAGATCTCC
AGTTATTTAACCAGGCTATGCTTGCTAAACAAAGCTGGCGAATAGTTAAAAATTCTAACAGCCTGCTTGCGCGTGTTCTCAAAGGCCGATATTTTAAGGATGGTGATTTT
TGCAATGCCCTGTTGGGGAGCAATCCCTCCCTCACTTGGAGGGGTATTTTGTGGGGGAGGGAGTTGTTCCTCAAAGGTTATAGAGGAAGGTGGGAAAAGGGAATTGTGTT
AGTTGACCATAGGACTCCTATTTGCATTAATGGAAATTTGGCCTTGATGAGGGTCCGAGATATTATTAACAGTGATGGAAGTTGGAACTTTGAGTTGATTCATAACATTT
TCCTCCCTAGTGATGCAAAGTCCATTATTAGTGTGCCCCGCCGGGGTATCGGGGGAGCGGATGAGATTATTTGGGGTTTCGACCCAAATGTTTGTTTCTCTGTTAAAAGT
GCGTACAACTTAGCGAAGTCCATTACCGATTCCTTGCCCACGCTGAGCAACATATCAAAGAAAGGAATTGCTACTAACCCACTTTGTTTTTTTTGCAGGAAATTTGTGGA
ATCATTGAGCCATGTGTTGTGGGGGTGTAAATTTACTAAAATCATTTGGCTTTCTATTTTTCCTTCCTTACCTGGTTTTTTATCTAATTTCAGGGACTGTTGGAGTCCAG
CGGGAAGCTGCTTGGATGTCTGGAGCTACAGGAACCTTTTGCTGCATAATGGAACCCCCCATGTGCGCAATGTTTTGGTTGACAAGATCTTCGCGCATGTCGAGGAGTTT
TGTAGGGGAAATAGCGTTGAGCAAGGGTCTCTTTGGAGAGAGTTGACAGGGGGCCTAGAGGTGTGGGTGCCGCCTGTTTTGGGCTCTTGGAAAATCAACACTTATGCCTC
GTGGTCTAGGCTTAAAGGAATGGGTGGGATTAGTTGGGTGGTTCGTGACTCATCTGGCTCGATCATCCTAGCTGGATGTGAGAAAAGGTATTTCGCTCAAGGGTCCCTTC
CCAATGTTTCATGGGAGTCTGATTGTATGAACCTTATCAACGCCATAAATCATAAGACTTCGGATCTATCGGAATTGTTGAGTTTTGTTGAGGAGATTGGGTCTTTGGCC
GATTCTGCTCATGTGGTTTCCTTTCGTTGGTGTATCTGGTCGACAAATGGGTTGGCTCATCGCTTGGCTCGAGCAGCAGTGTGGAGTGGTGACTGGAAGGGGTTTTTTGT
TTCTGGTTCTTCTTCTAGCTCTGAAGAAGTAGTTAGGATACCTTGTATTATTCCAAACTCTTTCTCCTCGGTCTTTGAAGAGGAGGGTTGTGGTTGTGGATTTTCACAAA
GGCCTAGTATTGATTATGAGGAGACATATGCTCTAGTGGTGGATGCAATTACATTAAGATATTTAATTGGTCTGACTGTGTATGAAAATCTGGACATGCATCTTACGGAT
GTAGTCACAACATATTTATATGAATCTCTTGATAAATGTCGAAACAATTAG
mRNA sequence
ATGTATTGGAAGTGTAGATCGCGGGAAGATTGGCTTAAATGGGGGGATAGGAACACAAAGGTGACCAACTGCAAAATTAGTAAAATCCCAGATCATCTCCAGTGGGGTAT
GAACAATCCTTTCTCGAAAGAGGACATTGAGATGGTTGTCAAAGGTATGGGTCCCTATAAAGCCCCAGGCCCTGATGGGGTGCATGCTATGTTCTATCAAAAGTACTGGG
ACATTGTGGGGCAGGACACAATGAGAATATGCCTGAGGATCCTGAATGAAGGGGAGAATGTAGGCCCCCTTAACAAGACCTTAATTTCTTTGATTCCCAAAGTTAAGAAG
CCAAAGCATTTGTCTGAGTTTCGGCCCATCAGCTTGTGCAATGTGAGCTATAAAATTATTGCTAAATATTTGGTCAATAGACTGAAAAGAGCGCTCAATGAGATTATAGC
ACCAACTCAATCTGCCTTCGTTCTGGGGAGAATGATTACTGATAATGCTATTTTGGGTTTTAAATGCTTACATGCTATTAGTAGCAAACGAGTTGGGAAAGATGGGTGGG
TGGCTCTTAAATTGGGCATGAGCAAGGCCTATGACACGGTCGAATGGCCGTTTTTGCGAGCTATGCTCCTTTCCATTGGCCTTGACAGGGCGTGGGTAAGGATTCTCATG
AGGTGTGTGGTAACAGTCCAATGTCCTGTCCTCATCAATTGTAAGCCCCATCACGAGTTTGTTCCTCAAAGAGGTCTTCGCCAAGGGGACCCTCTCTCTCCTTATCTGTT
CCTCATTTGCACGGAAGGTTTGTCGGCTCTTCTACACAGGGAAGAATCTCTGTCTCTTATTCATGCGTCGATTTTCGAGAGCAAGAATATTCGCAAAGTTTTGCAAGAGT
ATGAAGAAGTGTCGGGCCAAATGATAAACCTGGATAAGTCTGAATGTCTCTTTAGTAAGAATGTTAGGCAGCAGTTGGCGATTGAAATCAATAATGTTCTAGGAGTGCGA
AAAACTGACTCTTTAGGGATATATTTGGGGATGCTGAAAGAGAAGCTCTTCCCCCTTAGTGGCAAAGAGACTTTGATCAAGGCTATAGCCCAAGCAATCCTCATCTATAT
GATGAGTTGTTTCAAGCTTCCCTACACAGTTTGTGATGATCTAAATAGGGCCTCTGGAACAAGCTTTGTGTTAGTAAGGATGCGGGGGGGTCTTGGTCTTCGAGATCTCC
AGTTATTTAACCAGGCTATGCTTGCTAAACAAAGCTGGCGAATAGTTAAAAATTCTAACAGCCTGCTTGCGCGTGTTCTCAAAGGCCGATATTTTAAGGATGGTGATTTT
TGCAATGCCCTGTTGGGGAGCAATCCCTCCCTCACTTGGAGGGGTATTTTGTGGGGGAGGGAGTTGTTCCTCAAAGGTTATAGAGGAAGGTGGGAAAAGGGAATTGTGTT
AGTTGACCATAGGACTCCTATTTGCATTAATGGAAATTTGGCCTTGATGAGGGTCCGAGATATTATTAACAGTGATGGAAGTTGGAACTTTGAGTTGATTCATAACATTT
TCCTCCCTAGTGATGCAAAGTCCATTATTAGTGTGCCCCGCCGGGGTATCGGGGGAGCGGATGAGATTATTTGGGGTTTCGACCCAAATGTTTGTTTCTCTGTTAAAAGT
GCGTACAACTTAGCGAAGTCCATTACCGATTCCTTGCCCACGCTGAGCAACATATCAAAGAAAGGAATTGCTACTAACCCACTTTGTTTTTTTTGCAGGAAATTTGTGGA
ATCATTGAGCCATGTGTTGTGGGGGTGTAAATTTACTAAAATCATTTGGCTTTCTATTTTTCCTTCCTTACCTGGTTTTTTATCTAATTTCAGGGACTGTTGGAGTCCAG
CGGGAAGCTGCTTGGATGTCTGGAGCTACAGGAACCTTTTGCTGCATAATGGAACCCCCCATGTGCGCAATGTTTTGGTTGACAAGATCTTCGCGCATGTCGAGGAGTTT
TGTAGGGGAAATAGCGTTGAGCAAGGGTCTCTTTGGAGAGAGTTGACAGGGGGCCTAGAGGTGTGGGTGCCGCCTGTTTTGGGCTCTTGGAAAATCAACACTTATGCCTC
GTGGTCTAGGCTTAAAGGAATGGGTGGGATTAGTTGGGTGGTTCGTGACTCATCTGGCTCGATCATCCTAGCTGGATGTGAGAAAAGGTATTTCGCTCAAGGGTCCCTTC
CCAATGTTTCATGGGAGTCTGATTGTATGAACCTTATCAACGCCATAAATCATAAGACTTCGGATCTATCGGAATTGTTGAGTTTTGTTGAGGAGATTGGGTCTTTGGCC
GATTCTGCTCATGTGGTTTCCTTTCGTTGGTGTATCTGGTCGACAAATGGGTTGGCTCATCGCTTGGCTCGAGCAGCAGTGTGGAGTGGTGACTGGAAGGGGTTTTTTGT
TTCTGGTTCTTCTTCTAGCTCTGAAGAAGTAGTTAGGATACCTTGTATTATTCCAAACTCTTTCTCCTCGGTCTTTGAAGAGGAGGGTTGTGGTTGTGGATTTTCACAAA
GGCCTAGTATTGATTATGAGGAGACATATGCTCTAGTGGTGGATGCAATTACATTAAGATATTTAATTGGTCTGACTGTGTATGAAAATCTGGACATGCATCTTACGGAT
GTAGTCACAACATATTTATATGAATCTCTTGATAAATGTCGAAACAATTAG
Protein sequence
MYWKCRSREDWLKWGDRNTKVTNCKISKIPDHLQWGMNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKK
PKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNEIIAPTQSAFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILM
RCVVTVQCPVLINCKPHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREESLSLIHASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNVRQQLAIEINNVLGVR
KTDSLGIYLGMLKEKLFPLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDLNRASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSWRIVKNSNSLLARVLKGRYFKDGDF
CNALLGSNPSLTWRGILWGRELFLKGYRGRWEKGIVLVDHRTPICINGNLALMRVRDIINSDGSWNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKS
AYNLAKSITDSLPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIWLSIFPSLPGFLSNFRDCWSPAGSCLDVWSYRNLLLHNGTPHVRNVLVDKIFAHVEEF
CRGNSVEQGSLWRELTGGLEVWVPPVLGSWKINTYASWSRLKGMGGISWVVRDSSGSIILAGCEKRYFAQGSLPNVSWESDCMNLINAINHKTSDLSELLSFVEEIGSLA
DSAHVVSFRWCIWSTNGLAHRLARAAVWSGDWKGFFVSGSSSSSEEVVRIPCIIPNSFSSVFEEEGCGCGFSQRPSIDYEETYALVVDAITLRYLIGLTVYENLDMHLTD
VVTTYLYESLDKCRNN
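
The protein sequence corresponds to the CDS sequence read in frame from its first base. As a minimal sketch, assuming the standard nuclear genetic code and a CDS that contains only A, C, G and T, the translation can be reproduced as follows. The names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varying fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence
    lines can be passed in as a single multi-line string. Codons with
    characters other than A, C, G, T raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 and last 18 bases of the CDS sequence listed above.
print(translate("ATGTATTGGAAGTGTAGATCG"))  # MYWKCRS
print(translate("AAATGTCGAAACAATTAG"))     # KCRNN
```

Both fragments match the start (MYWKCRS) and the end (KCRNN) of the listed protein sequence. Passing the full CDS text should yield the complete protein under the same assumptions.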