; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G03990 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G03990
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationClcChr04:13660725..13664075
RNA-Seq ExpressionClc04G03990
SyntenyClc04G03990
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]7.6e-13431.63Show/hide
Query:  MYWKCRSREDWLKWGDRNTKVTNCKIS--------------------------------------------------------KIPDHLQWGMNNPFSKE
        +YWK RSR DWLK GD+NTK  + K S                                                        K+   +   +  PF+ E
Subjt:  MYWKCRSREDWLKWGDRNTKVTNCKIS--------------------------------------------------------KIPDHLQWGMNNPFSKE

Query:  DIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNE
        DI   +  M P KAPGPDG+ A F+QK+W IVG+   + CL ILNE   +  LN T I+LIPKV+KP+ + EFRPISLCNV Y+I+AK + NRLK  LN 
Subjt:  DIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNE

Query:  IIAPTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQR
        II+P QS F+  R+ITDN I+G++CLH I   +  ++G VALKL +SKAYD VEW FL   + ++G    W+ ++M C+ T    VLIN  P     P+R
Subjt:  IIAPTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQR

Query:  GLRQGDPLSPYLFLICTEGLSALLHR------------------------EESLSLIHASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNVRQQLA
        GLRQG PLSPYLF++C E  S LL++                        ++SL    AS+ + K ++ +   Y + SGQ+ N +KS   FS     +  
Subjt:  GLRQGDPLSPYLFLICTEGLSALLHR------------------------EESLSLIHASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNVRQQLA

Query:  IEINNVLGVRKTDSLGIYLGM-----------LKE--------------KLFSLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDLNR------------
          I ++  ++       YLG+            KE              KLFS  GKE LIKA+AQA+  Y MS FKLP  +C+D+ +            
Subjt:  IEINNVLGVRKTDSLGIYLGM-----------LKE--------------KLFSLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDLNR------------

Query:  ------ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSSRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYRGRWEKGIVL
              A   S    + RGGLG RDL  FNQA++AKQ  R+V+  NSL+ARV+K RY+K+  F NA +GSNPS  WR ILWG ++  KG R R   G  +
Subjt:  ------ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSSRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYRGRWEKGIVL

Query:  VDHR----------TPICINGNLALMRVRDIINSDGSWNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLA------------
        + ++           PI          V D+I+S+  W  + +   F+  D ++I+ +        DE++W FD    +SVKS Y LA            
Subjt:  VDHR----------TPICINGNLALMRVRDIINSDGSWNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLA------------

Query:  ------------------------KSITDSLPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIW----LSIFPSL---PGFLSNFRDCWSPA
                                +++ + LPT  N+ K+     P+C  C+  VE++SHVL  CK  + IW    L + PS      F S  ++ WS +
Subjt:  ------------------------KSITDSLPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIW----LSIFPSL---PGFLSNFRDCWSPA

Query:  GS---------CLDVWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGL--EVWVPPVLGSWKINTYASWSKLKGTGGISWVVRD
         +         C  +WS RN  +  G       L  K  + ++ + R +  + G++      G+  + W PP     K+N  A+ S      G+  +VRD
Subjt:  GS---------CLDVWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGL--EVWVPPVLGSWKINTYASWSKLKGTGGISWVVRD

Query:  SSGSIILAGCEKRYFAQ----GSLPNVSW---------------ESDCMNLINAINHKTSDLSELLSFVEEIGSLADSAHVVSFRWCIWSTNGLAHRLAR
        + G I+  G ++  F +         + W               ESDC  ++  +N+     +E+   + ++   +     V F +   + N  AH LA+
Subjt:  SSGSIILAGCEKRYFAQ----GSLPNVSW---------------ESDCMNLINAINHKTSDLSELLSFVEEIGSLADSAHVVSFRWCIWSTNGLAHRLAR

Query:  AAVWSGD---WKGFF
         A+ +     W G F
Subjt:  AAVWSGD---WKGFF

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]9.7e-12932.87Show/hide
Query:  YWKCRSREDWLKWGDRNTKVTNCKIS------------------------------------------------------KIPDHLQWGMNNPFSKEDIE
        YW  R++  WLK GDRNTK  + + S                                                      K+ + +   +   F+KE++ 
Subjt:  YWKCRSREDWLKWGDRNTKVTNCKIS------------------------------------------------------KIPDHLQWGMNNPFSKEDIE

Query:  MVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNEIIA
        + +K + P KAPGPDG+ A+F+QKYW IVG +   + L +LN    +  LNKT ISLIPK   PK +++FRPISLCNV YK+I+K L NRLK  L  II+
Subjt:  MVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNEIIA

Query:  PTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQRGLR
          QS F   R+ITDN ++ F+ +H +  K  GK+G++A+KL MSKA+D VEW F+  ++  +G    W  ++M+C+ +V   +LIN   H    P RGLR
Subjt:  PTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQRGLR

Query:  QGDPLSPYLFLICTEGLSALLHR-------------------------EESLSLIHASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNVRQQLAIE
        QGDPLSP LFL+C EGLSAL+++                         ++S+    A+  E   +R +L +YEE SGQ IN DKS   FS N  Q+   E
Subjt:  QGDPLSPYLFLICTEGLSALLHR-------------------------EESLSLIHASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNVRQQLAIE

Query:  INNVLGVRKTDSLGIYLG--------------MLKE-----------KLFSLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDLNR--------------
        I N+LG  +      YLG              MLKE           KL S+ GKE LIKA+AQAI  Y MSCF LP  +CDD+ R              
Subjt:  INNVLGVRKTDSLGIYLG--------------MLKE-----------KLFSLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDLNR--------------

Query:  ----ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSSRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYRGRWEKG--IVL
             S       +  GGLG R+L+ FN AMLAKQ+ RI+ N NSL+ RVLK RYF  GD  NA LGS+PS +WR I    E+  +G R R   G  I +
Subjt:  ----ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSSRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYRGRWEKG--IVL

Query:  VDHR---TPICIN------GNLALMRVRDIINSDGS-WNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLAKSI---------
         + R   TP           N     V  +I+ D   W  E + +IFLP + ++I+ +P       D++IW  +    FSVKSAY++A SI         
Subjt:  VDHR---TPICIN------GNLALMRVRDIINSDGS-WNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLAKSI---------

Query:  ------------------------------TDSLPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIWL--SIFPSLP-GFLSNFRD-----C
                                       D LPT  NISK+GI  +  C  C    E ++H L  C+   ++W   S +P  P     +F D     C
Subjt:  ------------------------------TDSLPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIWL--SIFPSLP-GFLSNFRD-----C

Query:  WSPAGSCLD--------VWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVE----QGSLWRELTGGLEVWVPPVLGSWKINTYASWSKLKGTGGIS
         S A   L+        +W  RN ++HN +P   + +       +E+F +  S++    + S  R        W  P LG +K+N   + S       I 
Subjt:  WSPAGSCLD--------VWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVE----QGSLWRELTGGLEVWVPPVLGSWKINTYASWSKLKGTGGIS

Query:  WVVRDSSGSIILA------------GCEKRYFAQG-------SLPNVSWESDCMNLINAINHKTSDLSELLSFVEEIGSLADSAHVVSFRWCIWSTNGLA
         ++RDS+G ++ A              E     QG        L  V  E D + +I A+N  ++  +EL   ++ I S+++S    +F+    + N +A
Subjt:  WVVRDSSGSIILA------------GCEKRYFAQG-------SLPNVSWESDCMNLINAINHKTSDLSELLSFVEEIGSLADSAHVVSFRWCIWSTNGLA

Query:  HRLARAA
        H LA+ A
Subjt:  HRLARAA

XP_023890148.1 uncharacterized protein LOC112002224 [Quercus suber]8.8e-12233.14Show/hide
Query:  WKCRSREDWLKWGDRNTKVTNCKIS-----------KIPDHLQWGMNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEG
        W  R+   WLK GDRNT   + K S           K+  H+   +   F  E++   +K M P  APGPDG+  +FYQ+YW  VG+      L  LN G
Subjt:  WKCRSREDWLKWGDRNTKVTNCKIS-----------KIPDHLQWGMNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEG

Query:  ENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNEIIAPTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMS
              N+T I LIPKVK PKH+ +FRPISLCNV+YK+ +K L NRLK  L  ++   QS FV  R+ITDN ++  + +  IS KR GK G +ALKL MS
Subjt:  ENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNEIIAPTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMS

Query:  KAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHR---------------------
        KAYD VEW  L  +++ +G    WV ++M+C+ TV   + IN +P    +P RGLRQGDP+SPYLFLIC EGLSALLH+                     
Subjt:  KAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHR---------------------

Query:  ----EESLSLIHASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNVRQQLAIEINNVLGVRKTDSLGIYLGM-------------------------
            ++SL    A++ E   I ++L+ YEE SGQ +N  K+   FS+N   +    I  +LG +       YLG+                         
Subjt:  ----EESLSLIHASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNVRQQLAIEINNVLGVRKTDSLGIYLGM-------------------------

Query:  LKEKLFSLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDLNR------------------ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSSRIVKNSN
         KEKL S +GKE LIKA+AQA+  Y MSCFKLP  +CDDL                     S       +  GG+G +DL+ FN A+LAKQ  R+    +
Subjt:  LKEKLFSLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDLNR------------------ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSSRIVKNSN

Query:  SLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKG-------------YRGRWEKGIVLVDHRTPICINGNLALMRVRDIINSDGSWNFELI
        SL  RV + +YF  G+F NA +G +PS  WR I+  +++  KG             +R RW          TP  +  + A  +V+D+I   G W+  LI
Subjt:  SLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKG-------------YRGRWEKGIVLVDHRTPICINGNLALMRVRDIINSDGSWNFELI

Query:  HNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLA---------------------------------------KSITDSLPTLSNISKK
          +F P DA+ I+S+P       D+ IW    N  F+V SAY L                                        ++  D L + +N+ K+
Subjt:  HNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLA---------------------------------------KSITDSLPTLSNISKK

Query:  GIATNPLCFFCRKFVESLSHVLWGCKFTKIIWLS---IFPSLPGFLSNFRD-CW-----SPAGS---------CLDVWSYRNLLLHNGTPHVRNVLVDKI
         I  + LC  C K  E+  H+ W C+  K +W S    FP       NF D  W     SP  S         C ++W  RN + H G     + ++ K 
Subjt:  GIATNPLCFFCRKFVESLSHVLWGCKFTKIIWLS---IFPSLPGFLSNFRD-CW-----SPAGS---------CLDVWSYRNLLLHNGTPHVRNVLVDKI

Query:  FAHVEEFCRGNSVEQGSLWRELTGGLEVWVPPVLGSWKINTYASWSKLKGTGGISWVVRDSSGSIILAGCEK
          +VEE+   +      +  +       W  P  G +K N   +        G+  V+R++ G ++ A   K
Subjt:  FAHVEEFCRGNSVEQGSLWRELTGGLEVWVPPVLGSWKINTYASWSKLKGTGGISWVVRDSSGSIILAGCEK

XP_024950112.1 uncharacterized protein LOC112496847 [Citrus sinensis]3.3e-12131.11Show/hide
Query:  MYWKCRSREDWLKWGDRNTKVTNCKIS-KIPDHLQWGMNN-------------------------------------------------------PFSKE
        ++ K RSR DWL+ GD+NTK  + K S +   +  WG+ N                                                       PF+ E
Subjt:  MYWKCRSREDWLKWGDRNTKVTNCKIS-KIPDHLQWGMNN-------------------------------------------------------PFSKE

Query:  DIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNE
        D+E  +  M P KAPGPDG+ A F+QK+W  V    +  CL +LNE  N   LN T I+LIPK+  P+ +S++RPISLCNV Y+++AK + NR+K  L++
Subjt:  DIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNE

Query:  IIAPTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQR
        II+P QS F+  R+ITDN I+G++CLH I   +  K+G VALKL +SKAYD VEWPFL   +L +G     V ++MRCV +    VLIN  P     P+R
Subjt:  IIAPTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQR

Query:  GLRQGDPLSPYLFLICTEGLSALLHREESLSLIHA-----------SIFESKNIRKVLQEYEEVSGQMINLDKSEC--LFSKNV--RQQLAIEINNVLGV
        GLRQG PLSPYLF++C E LS LL   E   LI              +F   ++        E SG++    ++    +F+ NV  + +  + + +++G 
Subjt:  GLRQGDPLSPYLFLICTEGLSALLHREESLSLIHA-----------SIFESKNIRKVLQEYEEVSGQMINLDKSEC--LFSKNV--RQQLAIEINNVLGV

Query:  RKTDSLG-IYLGML------KEKLFSLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDLNRA------------------SGTSFVLVRMRGGLGLRDLQ
        +K      I L +L      ++K  S  GKE LIKA AQAI  Y MS FKLP   CDD+ RA                          ++RGGLG R+  
Subjt:  RKTDSLG-IYLGML------KEKLFSLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDLNRA------------------SGTSFVLVRMRGGLGLRDLQ

Query:  LFNQAMLAKQSSRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYR---GRWEKGIVLVDH-------RTPICINGNLALMR
         FNQA++AKQ+ R+++  NSL++RVL+ RYF++  F  A  G+N S  WR I+WGR++  KG R   G  +K  +  D+         PI          
Subjt:  LFNQAMLAKQSSRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYR---GRWEKGIVLVDH-------RTPICINGNLALMR

Query:  VRDIINSDGSWNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLA------------------------------------KSI
        V D+I +D  W+   +   FL  D   I+ +P       DE++W +D    +SVKS Y LA                                    ++ 
Subjt:  VRDIINSDGSWNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLA------------------------------------KSI

Query:  TDSLPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIWLSIFPSLPGFLSNFRDCWSP----------------AGSCLDVWSYRNLLLHNGT
         + LP+  N+ K+ +   P C  C+  VE++SH L  CK  + IWL    S P   +N +D +S                    C   W  RN  + +G 
Subjt:  TDSLPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIWLSIFPSLPGFLSNFRDCWSP----------------AGSCLDVWSYRNLLLHNGT

Query:  PHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEVWVPPVLGSWKINTYASWSKLKGTGGISWVVRDSSGSIILAGCEKRYF-AQGSLPN---VSW
             +   K  + +  F R    +Q  +   +    + W+PP    +K+N  A+++    + G+  V+RDS+G I+ AG  +       SL     V W
Subjt:  PHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEVWVPPVLGSWKINTYASWSKLKGTGGISWVVRDSSGSIILAGCEKRYF-AQGSLPN---VSW

Query:  ---------------ESDCMNLINAINHKTSDLSELLSFVEEIGSLADSAHVVSFRWCIWSTNGLAHRLARAAV
                       ESDC+ ++  +N+     SE+   +  I +       V         N  AH LA+ A+
Subjt:  ---------------ESDCMNLINAINHKTSDLSELLSFVEEIGSLADSAHVVSFRWCIWSTNGLAHRLARAAV

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]1.5e-12132.46Show/hide
Query:  MYWKCRSREDWLKWGDRNTKVTNCKIS------------------------------------------------------KIPDHLQWGMNNPFSKEDI
        +YW  RSR +WL+ GDRNTK  + K S                                                      K+ + ++  ++N F+ E++
Subjt:  MYWKCRSREDWLKWGDRNTKVTNCKIS------------------------------------------------------KIPDHLQWGMNNPFSKEDI

Query:  EMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNEII
        +  +  MGP KAPGPDG++A+FYQK+W IVG   +   L  LN G  +  +N T I LIPKV+ P+ +SEFRPISLCNV YKII+K L NRLK+ L +II
Subjt:  EMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNEII

Query:  APTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQRGL
        + TQS FV GR+ITDN ++ ++ LH + +++ GK G VALKL +SKAYD VEW FL++++  +G    W+  +M CV T    +L+N KP+    P RG+
Subjt:  APTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQRGL

Query:  RQGDPLSPYLFLICTEGLSALLHR-------------------------EESLSLIHASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNVRQQLAI
        RQGDP+SPYLFL+C EGL+ALL++                         ++SL    A+  E + I ++LQ YE  SGQ INL+KS   FS N  +    
Subjt:  RQGDPLSPYLFLICTEGLSALLHR-------------------------EESLSLIHASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNVRQQLAI

Query:  EINNVLGVRKTDSLGIYLGM--------------LKEK-----------LFSLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDLNR-------------
        +I  +LGV++ D    YLG+              LK++           L S +GKE LIKA+AQAI  Y MS F++P  +C +L               
Subjt:  EINNVLGVRKTDSLGIYLGM--------------LKEK-----------LFSLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDLNR-------------

Query:  -----ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSSRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYRGRWEKGI---
              S       +  GG+G RDL+ FN AMLAKQ  R+V+  +SLL R  K RYF    F  A    N S  WR ++  + +   GY  R   G    
Subjt:  -----ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSSRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYRGRWEKGI---

Query:  VLVDHRTPICINGNLALMRVRDIINSDGS--------------WNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLAKSI-TD
         + D   P     N    +V + +  DGS              WN+E I  IF   +A++I  +P       D I W + P   FSVKSAY++A+ I TD
Subjt:  VLVDHRTPICINGNLALMRVRDIINSDGS--------------WNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLAKSI-TD

Query:  S--------------------------------------LPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIWLSIFPSLPGFLSNFRDCWS
        +                                      LPT  N++ + I  +  C  C +  ES  H LW C   + IW      L        D   
Subjt:  S--------------------------------------LPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIWLSIFPSLPGFLSNFRDCWS

Query:  PAGSCLD----------------VWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEVWVPPVLGSWKINTYASWSKLKGTGG
             L+                VW  RN LLH G   V + L  +   ++ EF   N+  +  + R      ++W PP  G +K+N  A+     G  G
Subjt:  PAGSCLD----------------VWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEVWVPPVLGSWKINTYASWSKLKGTGG

Query:  ISWVVRDSSGSIILA
           ++R+  G ++ A
Subjt:  ISWVVRDSSGSIILA

TrEMBL top hitse value%identityAlignment
A0A2N9FN47 Uncharacterized protein2.3e-12335.41Show/hide
Query:  KVTNCKISKIPDHLQWGMNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRP
        +VT    +++   +   + +PFS E+I+  +  M P KAPGPDG+ A+F+QKYW++VG       L  LN G  +G +N T I LIPKVK P +++ FRP
Subjt:  KVTNCKISKIPDHLQWGMNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRP

Query:  ISLCNVSYKIIAKYLVNRLKRALNEIIAPTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRIL
        ISLCNV YKII+K LVNR+K  L+++I+ +QS FV GRMITDN ++ F+ LH + +KR GK G +A KL MSKAYD VEW +LRA+LL +G    WV ++
Subjt:  ISLCNVSYKIIAKYLVNRLKRALNEIIAPTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRIL

Query:  MRCVVTVQCPVLINCKPHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREESLSLIH-------------------------ASIFESKNIRKVLQEY
        M CV +V   VL+N +      P RGLRQGDPLSPYLFLIC EGLSALL + E   LIH                         A+  + + +  +L  Y
Subjt:  MRCVVTVQCPVLINCKPHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREESLSLIH-------------------------ASIFESKNIRKVLQEY

Query:  EEVSGQMINLDKSECLFSKNVRQQLAIEINNVLGVRKTDSLGIYLGM-------------------------LKEKLFSLSGKETLIKAIAQAILIYMMS
        E+ SGQ +N  K+   FS N  Q     I  + G   T     YLG+                          KEKL S +G+E LIKA+ QAI  Y MS
Subjt:  EEVSGQMINLDKSECLFSKNVRQQLAIEINNVLGVRKTDSLGIYLGM-------------------------LKEKLFSLSGKETLIKAIAQAILIYMMS

Query:  CFKLPYTVCDDLNR------------------ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSSRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSL
        CFKLP  +C +++                    S       + RGG+G R+L LFN AMLA+Q  R+++  NSLL RVLK +YF +  F  A +  NPSL
Subjt:  CFKLPYTVCDDLNR------------------ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSSRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSL

Query:  TWRGILWGRELFLKGYRGR---------WEKGIVLVDHRTPICINGNLAL---MRVRDIINSD-GSWNFELIHNIFLPSDAKSIISVPRRGIGGADEIIW
        TWR I   +++ + G   R         W K   L+    P  ++    L     V ++IN + G WN  LI  IFLPSDA+ I  +P       D++IW
Subjt:  TWRGILWGRELFLKGYRGR---------WEKGIVLVDHRTPICINGNLAL---MRVRDIINSD-GSWNFELIHNIFLPSDAKSIISVPRRGIGGADEIIW

Query:  GFDPNVCFSVKSAYNL----AKSITDS---------------------------------LPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKI
          +    FSVK+AY L    A  IT+S                                 LPT + +  K I+++  C +C    E+  H+LW C F + 
Subjt:  GFDPNVCFSVKSAYNL----AKSITDS---------------------------------LPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKI

Query:  IWLSIFPSLP------GFLSNF-----RDCWSPAGSCL-----DVWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEVWVPP
        +W +    +P      G  S+F     RD  SP    +      +W  RN L+  G     + +  +      EF      E  S+  EL    E W PP
Subjt:  IWLSIFPSLP------GFLSNF-----RDCWSPAGSCL-----DVWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEVWVPP

Query:  VLGSWKINTYASWSKLKGTGGISWVVRDSSGSIILA
         +GS+K++    +       G+  ++RD  G +++A
Subjt:  VLGSWKINTYASWSKLKGTGGISWVVRDSSGSIILA

A0A2N9FNH6 Reverse transcriptase domain-containing protein1.3e-12332.79Show/hide
Query:  PFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLK
        PF+ E+I   +  M P KAPGPDG++AMFYQK+W IVG D     L  L+ G+ +  +N T I+LIPK+  P+ +++FRPISLCNV YKII+K L NRLK
Subjt:  PFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLK

Query:  RALNEIIAPTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHE
          L+ II+  QS FV GR+ITDN ++ F+ LH + +KR G+   +A+KL MSKAYD VEW FL  M++ +G D+ WV ++M+C+ +V   V++N +P   
Subjt:  RALNEIIAPTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHE

Query:  FVPQRGLRQGDPLSPYLFLICTEGLSALLHR-------------------------EESLSLIHASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKN
          P RG+RQGDPLSPYLFLIC EGL+ALL +                         ++SL    A++ E +N+  +L  YE+ SGQ +N +K+   FS N
Subjt:  FVPQRGLRQGDPLSPYLFLICTEGLSALLHR-------------------------EESLSLIHASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKN

Query:  VRQQLAIEINNVLGVRKTDSLGIYLGM-------------------------LKEKLFSLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDLNR------
            L   I  +L    T  LG YLG+                          K KL S +G+E LIK++AQAI +Y MSCF++P T+C ++N       
Subjt:  VRQQLAIEINNVLGVRKTDSLGIYLGM-------------------------LKEKLFSLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDLNR------

Query:  ------------ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSSRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYRGRW
                       ++    +  GG+G RDL LFNQA+LAKQ  R++++ N+LL R+LK +YF +  F  A +  + S  WR I   R +  KG R R 
Subjt:  ------------ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSSRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYRGRW

Query:  EKGIVL----------------VDHRTPICINGNLALMRVRDIINSD-GSWNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNL
          G  +                V  R  +  N       V D+I+ +   WN  LI +IF P +A  I ++P R +   D ++W   PN  F+ +SAY L
Subjt:  EKGIVL----------------VDHRTPICINGNLALMRVRDIINSD-GSWNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNL

Query:  A--------------------------------------KSITDSLPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIWLS---------IF
                                               ++ T  LPT +N+ ++G+  +  C  C    E++ H LW C++ +  WL+         + 
Subjt:  A--------------------------------------KSITDSLPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIWLS---------IF

Query:  PSLPGFLSNF--RDCWSPAGSCL-----DVWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEVWVPPVLGSWKINTYASWSK
        PS    L ++  R   +P           +W+YRN    N        L  K  ++VEEF   N+         +T     W PP    +K+N      K
Subjt:  PSLPGFLSNF--RDCWSPAGSCL-----DVWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEVWVPPVLGSWKINTYASWSK

Query:  LKGTGGISWVVRDSSGSIILAGCEKRYFAQGSLPNVS----------WESDCMNLINAINHK------TSD---LSELLSFVEEIGSLADSAHVVSFRWC
         + + GI  V+RDS G+++ A CE+   A   L N +           E+   ++I   +H       T+D   LSEL   + +I   ++  H ++F   
Subjt:  LKGTGGISWVVRDSSGSIILAGCEKRYFAQGSLPNVS----------WESDCMNLINAINHK------TSD---LSELLSFVEEIGSLADSAHVVSFRWC

Query:  IWSTNGLAHRLA
          S N  A  LA
Subjt:  IWSTNGLAHRLA

A0A2N9GM07 Reverse transcriptase domain-containing protein9.1e-12531.45Show/hide
Query:  WKCRSREDWLKWGDRNTKVTNCKISK---------------------------------------IPDHLQWG-----------MNNP----FSKEDIEM
        WK R+R  WL  GD+NT+  + K S+                                       +P H+Q G           MN+     F+ E++  
Subjt:  WKCRSREDWLKWGDRNTKVTNCKISK---------------------------------------IPDHLQWG-----------MNNP----FSKEDIEM

Query:  VVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNEIIAP
         ++ M P KAPGPDG+ A+F+QKYW IVG++     L++LN   +    NKT I+LIPK K P+ ++EFRPISLCNV+YK+I+K + NRLK  L+++I+ 
Subjt:  VVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNEIIAP

Query:  TQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQRGLRQ
        TQS FV GR ITDNA++ F+ +H    KR GKD ++ALKL MSKAYD VEW F+  ++  +G    W+ ++M C+ TVQ  V +N       +P RGLRQ
Subjt:  TQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQRGLRQ

Query:  GDPLSPYLFLICTEGLSALLHREESLSLIH-------------------------ASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNVRQQLAIEI
        GDPLSPYLFL+C EG S+LL   E   LIH                         A+  + + +  + + YE+ SGQ IN+DKS   FS+N       EI
Subjt:  GDPLSPYLFLICTEGLSALLHREESLSLIH-------------------------ASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNVRQQLAIEI

Query:  NNVLGVRKTDSLGIYLGM-------------------------LKEKLFSLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDL----------NRASGTS
               +      YLG+                          KEKL S  G+E LIK++AQAI  Y MSCF+LP T+C ++           R   + 
Subjt:  NNVLGVRKTDSLGIYLGM-------------------------LKEKLFSLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDL----------NRASGTS

Query:  FVLV--------RMRGGLGLRDLQLFNQAMLAKQSSRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYRGRWEKGI---VL
          LV        ++RGG+G RDL  FN A+LAKQ  R++ N NS+L R+ K +YF  G+   A +G NPS  WR I    ++  +G R R   GI   + 
Subjt:  FVLV--------RMRGGLGLRDLQLFNQAMLAKQSSRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYRGRWEKGI---VL

Query:  VDHRTP-----------ICINGNLALMRVRDIINSDGSWNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLAKSITDS-----
         D   P           + I     +  + D I     W  E +H  FLP D  +I+S+P   I   D+ +W  + N  F+VKSAY++A ++  S     
Subjt:  VDHRTP-----------ICINGNLALMRVRDIINSDGSWNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLAKSITDS-----

Query:  ----------------------------------LPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIW-----LSIFP-------------S
                                          LPT+  + ++G+  NP C  C +  ES+SH +W C   + IW     L I P              
Subjt:  ----------------------------------LPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIW-----LSIFP-------------S

Query:  LPGFLSNFRDCWSPAGSCLDVWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEV----WVPPVLGSWKINTYASWSKLKGTG
          G +      W  A    +VW  RN  +HN     +N +  ++F   ++       E   L+ + +G   +    W  P  G +K+NT  +        
Subjt:  LPGFLSNFRDCWSPAGSCLDVWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEV----WVPPVLGSWKINTYASWSKLKGTG

Query:  GISWVVRDSSGSIILAGCEKRYFAQGS--------------------LPNVSWESDCMNLINAINHKTSDLSELLSFVEEIGSLADSAHVVSFRWCIWST
        GI  V+RD  GS  LAG   R     S                    L ++  ESD  N+++AIN +  D   +   +  IG L  S       +    +
Subjt:  GISWVVRDSSGSIILAGCEKRYFAQGS--------------------LPNVSWESDCMNLINAINHKTSDLSELLSFVEEIGSLADSAHVVSFRWCIWST

Query:  NGLAHRLARAA
        N +AH LA+ A
Subjt:  NGLAHRLARAA

A0A2N9GWG4 Uncharacterized protein2.3e-12335.41Show/hide
Query:  KVTNCKISKIPDHLQWGMNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRP
        +VT    +++   +   + +PFS E+I+  +  M P KAPGPDG+ A+F+QKYW++VG       L  LN G  +G +N T I LIPKVK P +++ FRP
Subjt:  KVTNCKISKIPDHLQWGMNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRP

Query:  ISLCNVSYKIIAKYLVNRLKRALNEIIAPTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRIL
        ISLCNV YKII+K LVNR+K  L+++I+ +QS FV GRMITDN ++ F+ LH + +KR GK G +A KL MSKAYD VEW +LRA+LL +G    WV ++
Subjt:  ISLCNVSYKIIAKYLVNRLKRALNEIIAPTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRIL

Query:  MRCVVTVQCPVLINCKPHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREESLSLIH-------------------------ASIFESKNIRKVLQEY
        M CV +V   VL+N +      P RGLRQGDPLSPYLFLIC EGLSALL + E   LIH                         A+  + + +  +L  Y
Subjt:  MRCVVTVQCPVLINCKPHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREESLSLIH-------------------------ASIFESKNIRKVLQEY

Query:  EEVSGQMINLDKSECLFSKNVRQQLAIEINNVLGVRKTDSLGIYLGM-------------------------LKEKLFSLSGKETLIKAIAQAILIYMMS
        E+ SGQ +N  K+   FS N  Q     I  + G   T     YLG+                          KEKL S +G+E LIKA+ QAI  Y MS
Subjt:  EEVSGQMINLDKSECLFSKNVRQQLAIEINNVLGVRKTDSLGIYLGM-------------------------LKEKLFSLSGKETLIKAIAQAILIYMMS

Query:  CFKLPYTVCDDLNR------------------ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSSRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSL
        CFKLP  +C +++                    S       + RGG+G R+L LFN AMLA+Q  R+++  NSLL RVLK +YF +  F  A +  NPSL
Subjt:  CFKLPYTVCDDLNR------------------ASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSSRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSL

Query:  TWRGILWGRELFLKGYRGR---------WEKGIVLVDHRTPICINGNLAL---MRVRDIINSD-GSWNFELIHNIFLPSDAKSIISVPRRGIGGADEIIW
        TWR I   +++ + G   R         W K   L+    P  ++    L     V ++IN + G WN  LI  IFLPSDA+ I  +P       D++IW
Subjt:  TWRGILWGRELFLKGYRGR---------WEKGIVLVDHRTPICINGNLAL---MRVRDIINSD-GSWNFELIHNIFLPSDAKSIISVPRRGIGGADEIIW

Query:  GFDPNVCFSVKSAYNL----AKSITDS---------------------------------LPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKI
          +    FSVK+AY L    A  IT+S                                 LPT + +  K I+++  C +C    E+  H+LW C F + 
Subjt:  GFDPNVCFSVKSAYNL----AKSITDS---------------------------------LPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKI

Query:  IWLSIFPSLP------GFLSNF-----RDCWSPAGSCL-----DVWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEVWVPP
        +W +    +P      G  S+F     RD  SP    +      +W  RN L+  G     + +  +      EF      E  S+  EL    E W PP
Subjt:  IWLSIFPSLP------GFLSNF-----RDCWSPAGSCL-----DVWSYRNLLLHNGTPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEVWVPP

Query:  VLGSWKINTYASWSKLKGTGGISWVVRDSSGSIILA
         +GS+K++    +       G+  ++RD  G +++A
Subjt:  VLGSWKINTYASWSKLKGTGGISWVVRDSSGSIILA

A0A7N2L6Z9 Reverse transcriptase domain-containing protein4.5e-12432.64Show/hide
Query:  MYWKCRSREDWLKWGDRNTKVTNCKISK------------------------------------------------------IPDHLQWGMNNPFSKEDI
        ++W  RS+  WLK GDRNTK  + + S+                                                      I + +   ++  F++E+I
Subjt:  MYWKCRSREDWLKWGDRNTKVTNCKISK------------------------------------------------------IPDHLQWGMNNPFSKEDI

Query:  EMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNEII
           +K + P K+PGPDG+ A+F+QKYWDIVG +   + L +LN G ++  +NKT I LIPK   PK +++FRPISLCNV YK+I+K L NRLK  L  II
Subjt:  EMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNEII

Query:  APTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQRGL
           QS F   R+ITDN ++ ++ +H +  K+ GKD ++A KL MSKA+D VEW F+  ++  +G +  W+ ++MRC+ +V   V+IN +     VP RGL
Subjt:  APTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKPHHEFVPQRGL

Query:  RQGDPLSPYLFLICTEGLSALLH-------------------------REESLSLIHASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNVRQQLAI
        RQGDPLSPYLFL+C EGLSALLH                          ++SL    A+  E + ++++L++YE  SGQ +N DKS   FS N   +L  
Subjt:  RQGDPLSPYLFLICTEGLSALLH-------------------------REESLSLIHASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNVRQQLAI

Query:  EINNVLGVRKTDSLGIYLGM-------------------------LKEKLFSLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDL------------NRA
         I N+LG  +      YLG+                          K KL S  GKE LIKA+AQAI  Y MSCF LP ++CD+L            N+ 
Subjt:  EINNVLGVRKTDSLGIYLGM-------------------------LKEKLFSLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDL------------NRA

Query:  SGTSFVLVRMR------GGLGLRDLQLFNQAMLAKQSSRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYRGRWEKG--IV
        S  +++  R        GGLG R+L  FN A+LAKQ+ RI+ N  SL AR+LK +YF  GD  NA LGSNPS TWR I    E+  KG R R   G  I 
Subjt:  SGTSFVLVRMR------GGLGLRDLQLFNQAMLAKQSSRIVKNSNSLLARVLKGRYFKDGDFCNALLGSNPSLTWRGILWGRELFLKGYRGRWEKG--IV

Query:  LVDHR-----------TPICINGNLALMRVRDIINSDGS-WNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLAKSITDS---
        + D +           TP  I  +  +  V  +I+ D   W  + I  +FLP DA++I+ +P       D IIW  +    FSVKSAY +A ++ +S   
Subjt:  LVDHR-----------TPICINGNLALMRVRDIINSDGS-WNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKSAYNLAKSITDS---

Query:  ------------------------------------LPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIWLSIFPSLP-GFLSNFRD-----
                                            LPT++N+  +G+  N  C  C + VE L+H L  C F   +W +++   P G L   RD     
Subjt:  ------------------------------------LPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIWLSIFPSLP-GFLSNFRD-----

Query:  ----CWSPAGSCL-------DVWSYRNLLLHNG---TPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEVWVPPVLGSWKINTYASWS-KLKGT
              SP    L        +W  RNL +H+    +P     +  ++   ++++     ++   +   L G    W  P  G +K+N   + S    G+
Subjt:  ----CWSPAGSCL-------DVWSYRNLLLHNG---TPHVRNVLVDKIFAHVEEFCRGNSVEQGSLWRELTGGLEVWVPPVLGSWKINTYASWS-KLKGT

Query:  GGISWVVRDSSGSIILAGCE--KRYF----------AQG-------SLPNVSWESDCMNLINAINHKTSDLSELLSFVEEIGSLADSAHVVSFRWCIWST
         G+  V+RD SG +I A C+    YF           QG        LP +  ESD ++ I AIN    +  E    VE I          SF +     
Subjt:  GGISWVVRDSSGSIILAGCE--KRYF----------AQG-------SLPNVSWESDCMNLINAINHKTSDLSELLSFVEEIGSLADSAHVVSFRWCIWST

Query:  NGLAHRLARAA
        N +AH+LA+ A
Subjt:  NGLAHRLARAA

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein2.9e-2725.68Show/hide
Query:  MNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSE-FRPISLCNVSYKIIAKYLV
        +N P +  +I  ++  +   K+PGPDG  A FYQ+Y + +    +++   I  EG       +  I LIPK  +     E FRPISL N+  KI+ K L 
Subjt:  MNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSE-FRPISLCNVSYKIIAKYLV

Query:  NRLKRALNEIIAPTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCK
        NR+++ + ++I   Q  F+ G     N       +  I+  R      V + +   KA+D ++ PF+   L  +G+D  +++I+          +++N +
Subjt:  NRLKRALNEIIAPTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCK

Query:  PHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREESLSLIHAS--------------------IFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNV
            F  + G RQG PLSP LF I  E L+  + +E+ +  I                       I  ++N+ K++  + +VSG  IN+ KS+     N 
Subjt:  PHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREESLSLIHAS--------------------IFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNV

Query:  RQ---QLAIEINNVLGVRKTDSLGIYLGMLKEKLFSLSGKETL--------------------IKAIAQAIL---IYMMSC--FKLPYTVCDDLNRASGT
        RQ   Q+  E+   +  ++   LGI L    + LF  + K  L                    I  +  AIL   IY  +    KLP T   +L + +  
Subjt:  RQ---QLAIEINNVLGVRKTDSLGIYLGMLKEKLFSLSGKETL--------------------IKAIAQAIL---IYMMSC--FKLPYTVCDDLNRASGT

Query:  SFVLVRMR--------------GGLGLRDLQLFNQAMLAK
         F+  + R              GG+ L D +L+ +A + K
Subjt:  SFVLVRMR--------------GGLGLRDLQLFNQAMLAK

P08548 LINE-1 reverse transcriptase homolog5.0e-2727.92Show/hide
Query:  MNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKV-KKPKHLSEFRPISLCNVSYKIIAKYLV
        +N P S  +I   ++ +   K+PGPDG  + FYQ + + +    + +   I  EG       +  I+LIPK  K P     +RPISL N+  KI+ K L 
Subjt:  MNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKV-KKPKHLSEFRPISLCNVSYKIIAKYLV

Query:  NRLKRALNEIIAPTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCK
        NR+++ + +II   Q  F+ G     N       +  I +K   KD  + L +   KA+D ++ PF+   L  IG++  +++++          +++N  
Subjt:  NRLKRALNEIIAPTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCK

Query:  PHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREESLSLIHASIFESK--------------------NIRKVLQEYEEVSGQMINLDKSEC-LFSKN
            F  + G RQG PLSP LF I  E L+  +  E+++  IH    E K                     + +V++EY  VSG  IN  KS   +++ N
Subjt:  PHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREESLSLIHASIFESK--------------------NIRKVLQEYEEVSGQMINLDKSEC-LFSKN

Query:  VRQQLAIE--INNVLGVRKTDSLGIYLGMLKEKLFSLSGKETLIKAIAQAI
         + +  ++  I   +  +K   LG+YL    + L+     ETL K IA+ +
Subjt:  VRQQLAIE--INNVLGVRKTDSLGIYLGMLKEKLFSLSGKETLIKAIAQAI

P11369 LINE-1 retrotransposable element ORF2 protein2.7e-2528.43Show/hide
Query:  DHLQWGMNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKK-PKHLSEFRPISLCNVSYKI
        DHL    N+P S ++IE V+  +   K+PGPDG  A FYQ + + +     ++  +I  EG       +  I+LIPK +K P  +  FRPISL N+  KI
Subjt:  DHLQWGMNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKK-PKHLSEFRPISLCNVSYKI

Query:  IAKYLVNRLKRALNEIIAPTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCP
        + K L NR++  +  II P Q  F+ G     N       +H I+  ++     + + L   KA+D ++ PF+  +L   G+   ++ ++          
Subjt:  IAKYLVNRLKRALNEIIAPTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCP

Query:  VLINCKPHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREESLSLIHAS----------------IFESKN----IRKVLQEYEEVSGQMINLDKSEC
        + +N +       + G RQG PLSPYLF I  E L+  + +++ +  I                   I + KN    +  ++  + EV G  IN +KS  
Subjt:  VLINCKPHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREESLSLIHAS----------------IFESKN----IRKVLQEYEEVSGQMINLDKSEC

Query:  -LFSKNVRQQLAI
         L++KN + +  I
Subjt:  -LFSKNVRQQLAI

P14381 Transposon TX1 uncharacterized 149 kDa protein7.2e-2629.35Show/hide
Query:  MNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVN
        +  P + +++   ++ M   K+PG DG+   F+Q +WD +G D  R+      +GE      + ++SL+PK    + +  +RP+SL +  YKI+AK +  
Subjt:  MNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKIIAKYLVN

Query:  RLKRALNEIIAPTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKP
        RLK  L E+I P QS  V GR I DN  L    LH   ++R G      L L   KA+D V+  +L   L +      +V  L     + +C V IN   
Subjt:  RLKRALNEIIAPTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMRCVVTVQCPVLINCKP

Query:  HHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREESLSLIHASIFESKNIRKVLQEYEE----VSGQMINLDKSE
               RG+RQG PLS  L+ +  E    LL +      +   + +  ++R VL  Y +    V+  +++L++++
Subjt:  HHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREESLSLIHASIFESKNIRKVLQEYEE----VSGQMINLDKSE

P93295 Uncharacterized mitochondrial protein AtMg003104.8e-1439.2Show/hide
Query:  AILIYMMSCFKLPYTVCDDLNRASGTSF-----------------VLVRMR---GGLGLRDLQLFNQAMLAKQSSRIVKNSNSLLARVLKGRYFKDGDFC
        A+ +Y MSCF+L   +C  L  A  T F                  L + +   GGLG RDL  FNQA+LAKQS RI+   ++LL+R+L+ RYF      
Subjt:  AILIYMMSCFKLPYTVCDDLNRASGTSF-----------------VLVRMR---GGLGLRDLQLFNQAMLAKQSSRIVKNSNSLLARVLKGRYFKDGDFC

Query:  NALLGSNPSLTWRGILWGRELFLKG
           +G+ PS  WR I+ GREL  +G
Subjt:  NALLGSNPSLTWRGILWGRELFLKG

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein3.3e-1040.45Show/hide
Query:  SKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKII
        S ++I   V  M   KAPGPD   A F+ + W +V   T+         G  +   N T I+LIPKV     LS FRP+S C V YKII
Subjt:  SKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKKPKHLSEFRPISLCNVSYKII

AT4G20520.1 RNA binding;RNA-directed DNA polymerases4.6e-1236.36Show/hide
Query:  LVNRLKRALNEIIAPTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMR
        +V RLK  +  +I P Q++F+ GR+ TDN +   + +H++  K+ G  GW+ LKL + KAYD + W +L   L+S G    W+  + R
Subjt:  LVNRLKRALNEIIAPTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILMR

AT4G29090.1 Ribonuclease H-like superfamily protein5.5e-2124.21Show/hide
Query:  AILIYMMSCFKLPYTVCDDL------------NRASGTSF------VLVRMRGGLGLRDLQLFNQAMLAKQSSRIVKNSNSLLARVLKGRYFKDGDFCNA
        A+  Y M+CF LP TVC  +              A G  +         +  GG+G +D++ FN A+L KQ  R++    SL+A+V K RYF   D  NA
Subjt:  AILIYMMSCFKLPYTVCDDL------------NRASGTSF------VLVRMRGGLGLRDLQLFNQAMLAKQSSRIVKNSNSLLARVLKGRYFKDGDFCNA

Query:  LLGSNPSLTWRGILWGRELFLKGYRGRWEKG--IVLVDHR----------------TPICINGNLALMRVRDIINSDG-SWNFELIHNIFLPSDAKSIIS
         LGS PS  W+ I   +E+  +G R     G  I++  H+                 P       ++++V D+I+  G  W  ++I  +F   + K I  
Subjt:  LLGSNPSLTWRGILWGRELFLKGYRGRWEKG--IVLVDHR----------------TPICINGNLALMRVRDIINSDG-SWNFELIHNIFLPSDAKSIIS

Query:  VPRRGIGGADEIIWGFDPNVCFSVKSAY---------------------------------------NLAKSITDSLPTLSNISKKGIATNPLCFFCRKF
        +   G    D   W +  +  ++VKS Y                                        L K +++SLP    ++ + ++    C  C   
Subjt:  VPRRGIGGADEIIWGFDPNVCFSVKSAY---------------------------------------NLAKSITDSLPTLSNISKKGIATNPLCFFCRKF

Query:  VESLSHVLWGCKFTKIIW
         E+++H+L+ C F ++ W
Subjt:  VESLSHVLWGCKFTKIIW

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.4e-1539.2Show/hide
Query:  AILIYMMSCFKLPYTVCDDLNRASGTSF-----------------VLVRMR---GGLGLRDLQLFNQAMLAKQSSRIVKNSNSLLARVLKGRYFKDGDFC
        A+ +Y MSCF+L   +C  L  A  T F                  L + +   GGLG RDL  FNQA+LAKQS RI+   ++LL+R+L+ RYF      
Subjt:  AILIYMMSCFKLPYTVCDDLNRASGTSF-----------------VLVRMR---GGLGLRDLQLFNQAMLAKQSSRIVKNSNSLLARVLKGRYFKDGDFC

Query:  NALLGSNPSLTWRGILWGRELFLKG
           +G+ PS  WR I+ GREL  +G
Subjt:  NALLGSNPSLTWRGILWGRELFLKG

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.2e-0759.52Show/hide
Query:  LINCKPHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREE
        +IN  P     P RGLRQGDPLSPYLF++CTE LS L  R +
Subjt:  LINCKPHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATTGGAAGTGTAGATCGCGGGAAGATTGGCTTAAATGGGGGGATAGGAACACAAAGGTGACCAACTGCAAAATTAGTAAAATCCCAGATCATCTCCAGTGGGGTAT
GAACAATCCTTTCTCGAAAGAGGACATTGAGATGGTTGTCAAAGGTATGGGTCCCTATAAAGCCCCAGGCCCTGATGGGGTGCATGCTATGTTCTATCAAAAGTACTGGG
ACATTGTGGGGCAGGACACAATGAGAATATGCCTGAGGATCCTGAATGAAGGGGAGAATGTAGGCCCCCTTAACAAGACCTTAATTTCTTTGATTCCCAAAGTTAAGAAG
CCAAAGCATTTGTCTGAGTTTCGGCCCATCAGCTTGTGCAATGTGAGCTATAAAATTATTGCTAAATATTTGGTCAATAGACTGAAAAGAGCGCTCAATGAGATTATAGC
ACCAACTCAATCTACCTTCGTTCTGGGGAGAATGATTACTGATAATGCTATTTTGGGTTTTAAATGCTTACATGCTATTAGTAGCAAACGAGTTGGGAAAGATGGGTGGG
TGGCTCTTAAATTGGGCATGAGCAAGGCCTATGACACGGTCGAATGGCCGTTTTTGCGAGCTATGCTCCTTTCCATTGGCCTTGACAGGGCGTGGGTAAGGATTCTCATG
AGGTGTGTGGTAACAGTCCAATGTCCTGTCCTCATCAATTGTAAGCCCCATCACGAGTTTGTTCCTCAAAGAGGTCTTCGCCAAGGGGACCCTCTCTCTCCTTATCTGTT
CCTCATTTGCACGGAAGGTTTGTCGGCTCTTCTACACAGGGAAGAATCTCTGTCTCTTATTCATGCGTCGATTTTCGAGAGCAAGAATATTCGCAAAGTTTTGCAAGAGT
ATGAAGAAGTGTCGGGCCAAATGATAAACCTGGATAAGTCTGAATGTCTCTTTAGTAAGAATGTTAGGCAGCAGTTGGCGATTGAAATCAATAATGTTCTAGGAGTGCGA
AAAACTGACTCTTTAGGGATATATTTGGGGATGCTGAAAGAGAAGCTCTTCTCCCTTAGTGGCAAAGAGACTTTGATCAAGGCTATAGCCCAAGCAATCCTCATCTATAT
GATGAGTTGTTTCAAGCTTCCCTACACAGTTTGTGATGATCTAAATAGGGCCTCTGGAACAAGCTTTGTGTTAGTAAGGATGCGGGGGGGTCTTGGTCTTCGAGATCTCC
AGTTATTTAACCAGGCTATGCTTGCTAAACAAAGCTCGCGAATAGTTAAAAATTCTAACAGCCTGCTTGCGCGTGTTCTCAAAGGCCGATATTTTAAGGATGGTGATTTT
TGCAATGCCCTGTTGGGGAGCAATCCCTCCCTCACTTGGAGGGGTATTTTGTGGGGGAGGGAGTTGTTCCTCAAAGGTTATAGAGGAAGGTGGGAAAAGGGAATTGTGTT
AGTTGACCATAGGACTCCTATTTGCATTAATGGAAATTTGGCCTTGATGAGGGTCCGAGATATTATTAACAGTGATGGAAGTTGGAACTTTGAGTTGATTCATAACATTT
TCCTCCCTAGTGATGCAAAGTCCATTATTAGTGTGCCCCGCCGGGGTATCGGGGGAGCGGATGAGATTATTTGGGGTTTCGACCCAAATGTTTGTTTCTCTGTTAAAAGT
GCGTACAACTTAGCGAAGTCCATTACCGATTCCTTGCCCACGCTGAGCAACATATCAAAGAAAGGAATTGCTACTAACCCACTTTGTTTTTTTTGCAGGAAATTTGTGGA
ATCATTGAGCCATGTGTTGTGGGGGTGTAAGTTTACTAAAATCATTTGGCTTTCTATTTTTCCTTCCTTACCTGGTTTTTTATCTAATTTCAGGGACTGTTGGAGTCCAG
CGGGAAGCTGCTTGGATGTCTGGAGCTACAGGAACCTTTTGCTGCATAATGGAACCCCCCATGTGCGCAATGTTTTGGTTGACAAGATCTTCGCGCATGTCGAGGAGTTT
TGTAGGGGAAATAGCGTTGAGCAAGGGTCTCTTTGGAGAGAGTTGACGGGGGGCCTAGAGGTGTGGGTGCCGCCTGTTTTGGGCTCTTGGAAAATCAACACTTATGCCTC
GTGGTCTAAGCTTAAAGGAACGGGTGGGATTAGTTGGGTGGTTCGTGACTCATCTGGCTCGATCATCCTAGCTGGATGTGAGAAAAGGTATTTCGCTCAAGGGTCCCTTC
CCAATGTTTCATGGGAGTCTGATTGTATGAACCTTATCAACGCCATAAATCATAAGACTTCGGATCTATCGGAATTGTTGAGTTTTGTTGAGGAGATTGGGTCTTTGGCC
GATTCTGCTCATGTGGTTTCCTTTCGTTGGTGTATCTGGTCGACAAATGGGTTGGCTCATCGCTTGGCTCGAGCAGCAGTGTGGAGTGGTGACTGGAAGGGGTTTTTTGT
TTCTGGTTCTTCTTCTAGCTCTGAAGAAGTAGTTAGGATACCTTGTATTATTCCAGACTCTTTCTCCTCGGTCTTTGAAGAGGAGGGTTGTGGTTGTGGATTTTCACAAA
GGCCTAGTATTGATTATGAGGAGACATATGCTCTAGTGGTGGATGCAATTACATTAAGATATTTAATTGGTCTGACTGTGTATGAAAATCTGGACATGCATCTTACGGAT
GTAGTCACAACATATTTATATGAATCTCTTGATAAATGTCGAAACAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGTATTGGAAGTGTAGATCGCGGGAAGATTGGCTTAAATGGGGGGATAGGAACACAAAGGTGACCAACTGCAAAATTAGTAAAATCCCAGATCATCTCCAGTGGGGTAT
GAACAATCCTTTCTCGAAAGAGGACATTGAGATGGTTGTCAAAGGTATGGGTCCCTATAAAGCCCCAGGCCCTGATGGGGTGCATGCTATGTTCTATCAAAAGTACTGGG
ACATTGTGGGGCAGGACACAATGAGAATATGCCTGAGGATCCTGAATGAAGGGGAGAATGTAGGCCCCCTTAACAAGACCTTAATTTCTTTGATTCCCAAAGTTAAGAAG
CCAAAGCATTTGTCTGAGTTTCGGCCCATCAGCTTGTGCAATGTGAGCTATAAAATTATTGCTAAATATTTGGTCAATAGACTGAAAAGAGCGCTCAATGAGATTATAGC
ACCAACTCAATCTACCTTCGTTCTGGGGAGAATGATTACTGATAATGCTATTTTGGGTTTTAAATGCTTACATGCTATTAGTAGCAAACGAGTTGGGAAAGATGGGTGGG
TGGCTCTTAAATTGGGCATGAGCAAGGCCTATGACACGGTCGAATGGCCGTTTTTGCGAGCTATGCTCCTTTCCATTGGCCTTGACAGGGCGTGGGTAAGGATTCTCATG
AGGTGTGTGGTAACAGTCCAATGTCCTGTCCTCATCAATTGTAAGCCCCATCACGAGTTTGTTCCTCAAAGAGGTCTTCGCCAAGGGGACCCTCTCTCTCCTTATCTGTT
CCTCATTTGCACGGAAGGTTTGTCGGCTCTTCTACACAGGGAAGAATCTCTGTCTCTTATTCATGCGTCGATTTTCGAGAGCAAGAATATTCGCAAAGTTTTGCAAGAGT
ATGAAGAAGTGTCGGGCCAAATGATAAACCTGGATAAGTCTGAATGTCTCTTTAGTAAGAATGTTAGGCAGCAGTTGGCGATTGAAATCAATAATGTTCTAGGAGTGCGA
AAAACTGACTCTTTAGGGATATATTTGGGGATGCTGAAAGAGAAGCTCTTCTCCCTTAGTGGCAAAGAGACTTTGATCAAGGCTATAGCCCAAGCAATCCTCATCTATAT
GATGAGTTGTTTCAAGCTTCCCTACACAGTTTGTGATGATCTAAATAGGGCCTCTGGAACAAGCTTTGTGTTAGTAAGGATGCGGGGGGGTCTTGGTCTTCGAGATCTCC
AGTTATTTAACCAGGCTATGCTTGCTAAACAAAGCTCGCGAATAGTTAAAAATTCTAACAGCCTGCTTGCGCGTGTTCTCAAAGGCCGATATTTTAAGGATGGTGATTTT
TGCAATGCCCTGTTGGGGAGCAATCCCTCCCTCACTTGGAGGGGTATTTTGTGGGGGAGGGAGTTGTTCCTCAAAGGTTATAGAGGAAGGTGGGAAAAGGGAATTGTGTT
AGTTGACCATAGGACTCCTATTTGCATTAATGGAAATTTGGCCTTGATGAGGGTCCGAGATATTATTAACAGTGATGGAAGTTGGAACTTTGAGTTGATTCATAACATTT
TCCTCCCTAGTGATGCAAAGTCCATTATTAGTGTGCCCCGCCGGGGTATCGGGGGAGCGGATGAGATTATTTGGGGTTTCGACCCAAATGTTTGTTTCTCTGTTAAAAGT
GCGTACAACTTAGCGAAGTCCATTACCGATTCCTTGCCCACGCTGAGCAACATATCAAAGAAAGGAATTGCTACTAACCCACTTTGTTTTTTTTGCAGGAAATTTGTGGA
ATCATTGAGCCATGTGTTGTGGGGGTGTAAGTTTACTAAAATCATTTGGCTTTCTATTTTTCCTTCCTTACCTGGTTTTTTATCTAATTTCAGGGACTGTTGGAGTCCAG
CGGGAAGCTGCTTGGATGTCTGGAGCTACAGGAACCTTTTGCTGCATAATGGAACCCCCCATGTGCGCAATGTTTTGGTTGACAAGATCTTCGCGCATGTCGAGGAGTTT
TGTAGGGGAAATAGCGTTGAGCAAGGGTCTCTTTGGAGAGAGTTGACGGGGGGCCTAGAGGTGTGGGTGCCGCCTGTTTTGGGCTCTTGGAAAATCAACACTTATGCCTC
GTGGTCTAAGCTTAAAGGAACGGGTGGGATTAGTTGGGTGGTTCGTGACTCATCTGGCTCGATCATCCTAGCTGGATGTGAGAAAAGGTATTTCGCTCAAGGGTCCCTTC
CCAATGTTTCATGGGAGTCTGATTGTATGAACCTTATCAACGCCATAAATCATAAGACTTCGGATCTATCGGAATTGTTGAGTTTTGTTGAGGAGATTGGGTCTTTGGCC
GATTCTGCTCATGTGGTTTCCTTTCGTTGGTGTATCTGGTCGACAAATGGGTTGGCTCATCGCTTGGCTCGAGCAGCAGTGTGGAGTGGTGACTGGAAGGGGTTTTTTGT
TTCTGGTTCTTCTTCTAGCTCTGAAGAAGTAGTTAGGATACCTTGTATTATTCCAGACTCTTTCTCCTCGGTCTTTGAAGAGGAGGGTTGTGGTTGTGGATTTTCACAAA
GGCCTAGTATTGATTATGAGGAGACATATGCTCTAGTGGTGGATGCAATTACATTAAGATATTTAATTGGTCTGACTGTGTATGAAAATCTGGACATGCATCTTACGGAT
GTAGTCACAACATATTTATATGAATCTCTTGATAAATGTCGAAACAATTAG
Protein sequenceShow/hide protein sequence
MYWKCRSREDWLKWGDRNTKVTNCKISKIPDHLQWGMNNPFSKEDIEMVVKGMGPYKAPGPDGVHAMFYQKYWDIVGQDTMRICLRILNEGENVGPLNKTLISLIPKVKK
PKHLSEFRPISLCNVSYKIIAKYLVNRLKRALNEIIAPTQSTFVLGRMITDNAILGFKCLHAISSKRVGKDGWVALKLGMSKAYDTVEWPFLRAMLLSIGLDRAWVRILM
RCVVTVQCPVLINCKPHHEFVPQRGLRQGDPLSPYLFLICTEGLSALLHREESLSLIHASIFESKNIRKVLQEYEEVSGQMINLDKSECLFSKNVRQQLAIEINNVLGVR
KTDSLGIYLGMLKEKLFSLSGKETLIKAIAQAILIYMMSCFKLPYTVCDDLNRASGTSFVLVRMRGGLGLRDLQLFNQAMLAKQSSRIVKNSNSLLARVLKGRYFKDGDF
CNALLGSNPSLTWRGILWGRELFLKGYRGRWEKGIVLVDHRTPICINGNLALMRVRDIINSDGSWNFELIHNIFLPSDAKSIISVPRRGIGGADEIIWGFDPNVCFSVKS
AYNLAKSITDSLPTLSNISKKGIATNPLCFFCRKFVESLSHVLWGCKFTKIIWLSIFPSLPGFLSNFRDCWSPAGSCLDVWSYRNLLLHNGTPHVRNVLVDKIFAHVEEF
CRGNSVEQGSLWRELTGGLEVWVPPVLGSWKINTYASWSKLKGTGGISWVVRDSSGSIILAGCEKRYFAQGSLPNVSWESDCMNLINAINHKTSDLSELLSFVEEIGSLA
DSAHVVSFRWCIWSTNGLAHRLARAAVWSGDWKGFFVSGSSSSSEEVVRIPCIIPDSFSSVFEEEGCGCGFSQRPSIDYEETYALVVDAITLRYLIGLTVYENLDMHLTD
VVTTYLYESLDKCRNN