CuGenDBv2

Lag0011780 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0011780
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr1:32651134..32655186
RNA-Seq Expression: Lag0011780
Synteny: Lag0011780
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
    IPR000477 - Reverse transcriptase domain
    IPR002156 - Ribonuclease H domain
    IPR026960 - Reverse transcriptase zinc-binding domain
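
As a rough illustration of how the genome location string above can be read, the following sketch splits chr1:32651134..32655186 into chromosome, start, and end, and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; parse_location is a hypothetical helper name and not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split 'chrom:start..end' into (chrom, start, end, length).

    Assumption: coordinates are 1-based and inclusive.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    return chrom, start, end, end - start + 1

print(parse_location("chr1:32651134..32655186"))
```

Under that assumption the gene spans 4053 bp on chr1.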


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022149385.1 uncharacterized protein LOC111017816 [Momordica charantia]; e-value: 1.4e-68; % identity: 31.25
Query:  NQALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKTKAPRRVLDFQPISLCNVSYKLISKV
        NQ LL+ F+ +++ +A +Q  PHKASGPD  SG FYR+ W +V   V H CL VLN   SP  LNET+I LIPK + P R+ +F+PISLCNV YK+I K 
Subjt:  NQALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKTKAPRRVLDFQPISLCNVSYKLISKV

Query:  LVNRMKHILTRIISENQSTFIPGRCVVDNAYWGLNVFMS-------------------SGNELEETQNGQGKAMV-------------------------
        +VNRMK +L  IIS NQS F+PGRCVVDNA  G     S                    G  L   + G  +A +                         
Subjt:  LVNRMKHILTRIISENQSTFIPGRCVVDNAYWGLNVFMS-------------------SGNELEETQNGQGKAMV-------------------------

Query:  --------VRELLLVYERTTGQTINYEKSVVAFSPNTEDDSQQYISLVLSVSCSP----------CHAQYLELPSF------------------------
                V+ +L  Y++ +GQTIN+EKSV +FSPNT       +   + +   P            A    +PS+                        
Subjt:  --------VRELLLVYERTTGQTINYEKSVVAFSPNTEDDSQQYISLVLSVSCSP----------CHAQYLELPSF------------------------

Query:  --------------MPRNRPGTLKF-IKDRIWKQ----IQELGSRPSFIWRSLLWGWDLLARGCRWRIGNGRSIPIYGSNWVPGNFSLQIQYVPSLSLIS
                      + R+    L   +K R +K       ++GS PS+IWRSLL G  +L  G RWR+G+G ++PIYGSNW+P + +L +   P L L S
Subjt:  --------------MPRNRPGTLKF-IKDRIWKQ----IQELGSRPSFIWRSLLWGWDLLARGCRWRIGNGRSIPIYGSNWVPGNFSLQIQYVPSLSLIS

Query:  KVSDLFTVSGQWDEGKIRGHFMSSDCEAILKIPLRYGLFDDQLIWHYEKHGSFTVMSGY-----RLALLMVTQTRPSSSNSDCMCVWW--KSLWKLNVPS
         VS L + SG W+EG IRG F   +   IL+IP+      D  IWHY+  G FT+ S Y     RL   + + +     + D +   W  K + ++   S
Subjt:  KVSDLFTVSGQWDEGKIRGHFMSSDCEAILKIPLRYGLFDDQLIWHYEKHGSFTVMSGY-----RLALLMVTQTRPSSSNSDCMCVWW--KSLWKLNVPS

Query:  KMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSVMEDSLHLFWNCSVVRNMWLCSKFAPLYHSLCCSSFEEIMWSMKDRLNLLDFELAAANCRLARP
        ++++ L ++    + T +   +         VL    M +   + W    +RN W+ +K +        S     M   + R N    +  + +   +  
Subjt:  KMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSVMEDSLHLFWNCSVVRNMWLCSKFAPLYHSLCCSSFEEIMWSMKDRLNLLDFELAAANCRLARP

Query:  CLQVISDRTVRRGGWMPPVGVLMAACCVLPKCWSVDIAEGWALIRGIEVAQQMGFS
        CL   S     RG  M     L AA  VL    +VD+AE  A   G+++A+ MG +
Subjt:  CLQVISDRTVRRGGWMPPVGVLMAACCVLPKCWSVDIAEGWALIRGIEVAQQMGFS

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]; e-value: 1.7e-69; % identity: 29.8
Query:  NQALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKTKAPRRVLDFQPISLCNVSYKLISKV
        N  LL+PF E+E+I+AL Q HPHKA GPD  S  FY+N W IV   V+  CL +LN     ++L+E +                                
Subjt:  NQALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKTKAPRRVLDFQPISLCNVSYKLISKV

Query:  LVNRMKHILTRIISENQSTFIPGRCVVDNAYWG---LNVFMSSG------NELEETQNGQG-------------------------------------KA
            +K  L  +IS++QS FIP R + DNA  G   L + MS        N LE      G                                       
Subjt:  LVNRMKHILTRIISENQSTFIPGRCVVDNAYWG---LNVFMSSG------NELEETQNGQG-------------------------------------KA

Query:  MVVRELL-LVYERTTGQTINYEK---SVVAFSPNTEDDSQQYISLVLSVSCSPCHAQYLELPSFMPRNRPGTLKFIKDRIWKQIQ---------------
        M+V   L LV  + +   I Y K     +   P  +D     I  +LSV+   C  QYL LP+FMPRNR     +IKDR+WK +Q               
Subjt:  MVVRELL-LVYERTTGQTINYEK---SVVAFSPNTEDDSQQYISLVLSVSCSPCHAQYLELPSFMPRNRPGTLKFIKDRIWKQIQ---------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------ELGSRPSFIWRSLLWGWDLLARGCRWRIGNGRSIPIYGSNWVPGNFSLQIQYVPSLSLISKVSDLFT-VSGQWDEGKIRGHFMSSDCEAILKIP
              ++   PS+IWRS+LWG DLL +G RWRIGNG S+ IYG NWVP   +L+I   P L L+S+VS L     G W    +R  F   + + IL IP
Subjt:  ------ELGSRPSFIWRSLLWGWDLLARGCRWRIGNGRSIPIYGSNWVPGNFSLQIQYVPSLSLISKVSDLFT-VSGQWDEGKIRGHFMSSDCEAILKIP

Query:  LRYGLFDDQLIWHYEKHGSFTVMSGYRLALLMVTQTR-PSSSNSDCMCVWWKSLWKLNVPSKMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSVME
        +  G  +D+LIW+YEK G ++V SGY++ALL     + PSSS+S+ +  WW   WK+++P+K+K FLWRL  D L T  NL +RGV I+  C  C    E
Subjt:  LRYGLFDDQLIWHYEKHGSFTVMSGYRLALLMVTQTR-PSSSNSDCMCVWWKSLWKLNVPSKMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSVME

Query:  DSLHLFWNCSVVRNMWLCSKFAPL---------YHSLCCSSFEE---IMWSMKDRLNLLDF
        DS+HLFW C     +W+ SKF  L         + SL  + FEE   ++W + ++ N   F
Subjt:  DSLHLFWNCSVVRNMWLCSKFAPL---------YHSLCCSSFEE---IMWSMKDRLNLLDF

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]; e-value: 2.0e-70; % identity: 27.53
Query:  MNQALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKTKAPRRVLDFQPISLCNVSYKLISK
        +N+ LL P+T++EI +A++Q  P KA GPD     FY+ +W +V P    +CL  LN G   +  N T I LIPK K PR + DF+PISLCNVSYK+ISK
Subjt:  MNQALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKTKAPRRVLDFQPISLCNVSYKLISK

Query:  VLVNRMKHILTRIISENQSTFIPGRCVVDNAY--------------------------------------------------W-----------------
         + NR+K+++  +IS+ QS F+P R + DN                                                    W                 
Subjt:  VLVNRMKHILTRIISENQSTFIPGRCVVDNAY--------------------------------------------------W-----------------

Query:  ------------------------------GLNVFMS--------SGNELEETQNG-----------------QGKAMVVRELLLVYERTTGQTINYEKS
                                      GL+  ++        +G   EE                     + + + +R LL  Y R +GQ IN+ KS
Subjt:  ------------------------------GLNVFMS--------SGNELEETQNG-----------------QGKAMVVRELLLVYERTTGQTINYEKS

Query:  VVAFSPNTEDDSQQYISLVLSVSCSPCHAQYLELPSFMPRNRP-------------------GTLKF----------IKDRIWKQIQ-------------
         + FSPN   + QQY+  +L+V        YL LPS   R R                    G L F          +   +W+ +Q             
Subjt:  VVAFSPNTEDDSQQYISLVLSVSCSPCHAQYLELPSFMPRNRP-------------------GTLKF----------IKDRIWKQIQ-------------

Query:  ----------ELGSRPSFIWRSLLWGWDLLARGCRWRIGNGRSIPIYGSNWVPGNFSLQIQYVPSLSLISKVSDLFTVSGQWDEGKIRGHFMSSDCEAIL
                     S+ S+ W+  LWG DLL +G R R+GNG +I  +   W+P   + +     + +L + V+   T  G WD   I   F + D + IL
Subjt:  ----------ELGSRPSFIWRSLLWGWDLLARGCRWRIGNGRSIPIYGSNWVPGNFSLQIQYVPSLSLISKVSDLFTVSGQWDEGKIRGHFMSSDCEAIL

Query:  KIPLRYGLFDDQLIWHYEKHGSFTVMSGYRLALLMVTQTRPSSSNSDCMCVWWKSLWKLNVPSKMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSV
         +P+      D  +WHY+K G+++V SGY+  L M  +   +S++++     W S+WKL VP+K+K F+WR  H+H+ T  NL+ RG+     C +C   
Subjt:  KIPLRYGLFDDQLIWHYEKHGSFTVMSGYRLALLMVTQTRPSSSNSDCMCVWWKSLWKLNVPSKMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSV

Query:  MEDSLHLFWNCSVVRNMWLCSKFAPLYHSLCC------SSFEEIMWSMKDRLNLLDFELAA
         E  +H F++C   R +W       L+  L C       SF E+  S+ ++L   D  LAA
Subjt:  MEDSLHLFWNCSVVRNMWLCSKFAPLYHSLCC------SSFEEIMWSMKDRLNLLDFELAA

XP_024035599.1 uncharacterized protein LOC112096407 [Citrus clementina]; e-value: 3.7e-69; % identity: 32.51
Query:  MNQALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKTKAPRRVLDFQPISLCNVSYKLISK
        MN  L +PFT +EI  AL Q  P KA GPD L   F++ HW+ V   V  +CL VLNQ  +   LN T IVLIPK   PRRV +++PISLCNV Y L++K
Subjt:  MNQALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKTKAPRRVLDFQPISLCNVSYKLISK

Query:  VLVNRMKHILTRIISENQSTFIPGRCVVDNAYWGLNVFMSSGNELEETQNGQGKAMVVR-ELLLVYERTTGQTINYEKSVVAFSPNTEDDSQQYISLVLS
         + NR+K  L +IIS  QS F+P R + DN   G        +++  ++  +  ++ ++ ++   Y+R     + +    + F       S ++I+L+++
Subjt:  VLVNRMKHILTRIISENQSTFIPGRCVVDNAYWGLNVFMSSGNELEETQNGQGKAMVVR-ELLLVYERTTGQTINYEKSVVAFSPNTEDDSQQYISLVLS

Query:  VSCSP-------------CHAQ-----------YLEL------------------------------------------PSFMPRNRPGTL--KFIKDRI
           +P              H Q           YL L                                            +     P +L  K IK R 
Subjt:  VSCSP-------------CHAQ-----------YLEL------------------------------------------PSFMPRNRPGTL--KFIKDRI

Query:  WKQIQ----ELGSRPSFIWRSLLWGWDLLARGCRWRIGNGRSIPIYGSNWVPGNFSLQIQYVPSLSLISKVSDLFTVSGQWDEGKIRGHFMSSDCEAILK
        +K       ++GS PSFIWRS+LWG  +L +G RWRIGNG  I I  SNW+P   + ++   PSL   +KVS+L   + QW+E  I   F   D + I  
Subjt:  WKQIQ----ELGSRPSFIWRSLLWGWDLLARGCRWRIGNGRSIPIYGSNWVPGNFSLQIQYVPSLSLISKVSDLFTVSGQWDEGKIRGHFMSSDCEAILK

Query:  IPLRYGLFDDQLIWHYEKHGSFTVMSGYRLALLMVTQTRPSSSNSDCMCVWWKSLWKLNVPSKMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSVM
        I L     +D++IWHY++ G ++V SGY+LAL +     P+SS        W++LWKLN+P K+K F+W+     L T  NL RR +    +C +C    
Subjt:  IPLRYGLFDDQLIWHYEKHGSFTVMSGYRLALLMVTQTRPSSSNSDCMCVWWKSLWKLNVPSKMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSVM

Query:  EDSLHLFWNCSVVRNMWLCSKFAPLYHSL
        ED  H    C + R +W C+       S+
Subjt:  EDSLHLFWNCSVVRNMWLCSKFAPLYHSL

XP_042965938.1 uncharacterized protein LOC122299618 [Carya illinoinensis]; e-value: 2.6e-67; % identity: 26.36
Query:  MNQALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKTKAPRRVLDFQPISLCNVSYKLISK
        MN  L + FT  E+  A+KQ  P K+ GPD     F++ +WQ+V   V+ + L  LN      S+N T I LIPK K PR   D++PISLCNV YK++SK
Subjt:  MNQALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKTKAPRRVLDFQPISLCNVSYKLISK

Query:  VLVNRMKHILTRIISENQSTFIPGRCVVDNAYWGLNVFMSSGNELEETQNGQGKAMVVRELLLVYERTTGQTINYEKSVVAFSPNTEDDSQQYISLVLSV
         L NR+K  +  ++S NQS FI GR + DN      +      EL++   G+     ++ELLL YE+  GQ +N EK+ V FS N+  + Q+ I  +   
Subjt:  VLVNRMKHILTRIISENQSTFIPGRCVVDNAYWGLNVFMSSGNELEETQNGQGKAMVVRELLLVYERTTGQTINYEKSVVAFSPNTEDDSQQYISLVLSV

Query:  SCSPCHAQYLELPSFMPRNRPGTLKFIKDRIWKQIQ----------------------------------------------------------------
             + +YL LP  + + +  + + IK R+WK+I                                                                 
Subjt:  SCSPCHAQYLELPSFMPRNRPGTLKFIKDRIWKQIQ----------------------------------------------------------------

Query:  ---------------------------------------------------------ELGSRPSFIWRSLLWGWDLLARGCRWRIGNGRSIPIYGSNWV-
                                                                 +LG RPS+IWRS+    +LL  G RWR+GN +SI I+   W+ 
Subjt:  ---------------------------------------------------------ELGSRPSFIWRSLLWGWDLLARGCRWRIGNGRSIPIYGSNWV-

Query:  -PGNFSLQIQYVPSLSLISKVSDLFTVSGQWDEGKIRGHFMSSDCEAILKIPLRYGLFDDQLIWHYEKHGSFTVMSGYRLALLMVTQTRPSSSNSDCMCV
         P    +Q   +  L   ++VS+L + +G+WD   ++  F   + E I  IP+     +D+LIW     G FT+ S Y+L +      +  +S       
Subjt:  -PGNFSLQIQYVPSLSLISKVSDLFTVSGQWDEGKIRGHFMSSDCEAILKIPLRYGLFDDQLIWHYEKHGSFTVMSGYRLALLMVTQTRPSSSNSDCMCV

Query:  WWKSLWKLNVPSKMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSVMEDSLHLFWNCSVVRNMWLCSKFAPLYHSLCCSSFEEIMWSMKDRLNLLDF
         W+S+W LN+  K+K F+WR     L+T+ NL+ R +  +  C +C +  E + H  W+C    ++W                 E +  + K +    D 
Subjt:  WWKSLWKLNVPSKMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSVMEDSLHLFWNCSVVRNMWLCSKFAPLYHSLCCSSFEEIMWSMKDRLNLLDF

Query:  ELAAANCRLARPCLQVIS---DRTVRRGGWMPPVG---------VLMAACCVLPKCWSVDIAEGWALIRGIEVAQQMGFSGFSVELDSLRLINVLRNEVT
             N R +RP  Q +    D  V +      +G         VL+A    +       IAE +AL + +EV + + F+    E D+  ++N +  E  
Subjt:  ELAAANCRLARPCLQVIS---DRTVRRGGWMPPVG---------VLMAACCVLPKCWSVDIAEGWALIRGIEVAQQMGFSGFSVELDSLRLINVLRNEVT

Query:  DLSEVGFLMVEVRQLLQ
        D+S  G ++ +V+ LL+
Subjt:  DLSEVGFLMVEVRQLLQ

TrEMBL top hits (e-value, % identity, alignment)
A0A2N9G219 RNase H domain-containing protein; e-value: 1.2e-76; % identity: 33.65
Query:  MNQALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKTKAPRRVLDFQPISLCNVSYKLISK
        MNQ+L  PFTE+E+++A+KQ  P KA GPD +   FY+++W +V   +T + L  L  G    +LN T + LIPKTK+P  V +++PISLCNV YKLISK
Subjt:  MNQALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKTKAPRRVLDFQPISLCNVSYKLISK

Query:  VLVNRMKHILTRIISENQSTFIPGRCVVDNAYWGLNVFMSSGN-------------ELEETQNGQGKAMVVRELLLVYERTTGQTINYEKSVVAFSPNTE
        VL NR+K IL  IISE QS F+PGR + DN            N             ++ +  +      V +E+L +YE+ +GQ +N  K+ + FS NT 
Subjt:  VLVNRMKHILTRIISENQSTFIPGRCVVDNAYWGLNVFMSSGN-------------ELEETQNGQGKAMVVRELLLVYERTTGQTINYEKSVVAFSPNTE

Query:  DDSQQYISLVLSVSCSPCHAQYLELPSFMPRNRPGTLKFIKDRIW--------KQIQELG----------------------------------------
          +Q+ I  +L V     + +YL LPS + + +      IK+R+W        K + + G                                        
Subjt:  DDSQQYISLVLSVSCSPCHAQYLELPSFMPRNRPGTLKFIKDRIW--------KQIQELG----------------------------------------

Query:  -------------------------SRPSFIWRSLLWGWDLLARGCRWRIGNGRSIPIYGSNWVPGNFSLQI-QYVPSLSLISKVSDLFTVS-GQWDEGK
                                 +R SF WRS+L    L+  G  WR+G+G  IPI GSNW+      +I   + +L + +KV +L   S   W+  K
Subjt:  -------------------------SRPSFIWRSLLWGWDLLARGCRWRIGNGRSIPIYGSNWVPGNFSLQI-QYVPSLSLISKVSDLFTVS-GQWDEGK

Query:  IRGHFMSSDCEAILKIPLRYGLFDDQLIWHYEKHGSFTVMSGYRLALLMVTQTRPSSSNSDCMCVWWKSLWKLNVPSKMKFFLWRLFHDHLSTKVNLMRR
        I   F+  D EAILKIPL     +D+L W   ++G ++V SGY+L         P SS        WK +W+  VP+K++ FLWR  HD L TK+ L +R
Subjt:  IRGHFMSSDCEAILKIPLRYGLFDDQLIWHYEKHGSFTVMSGYRLALLMVTQTRPSSSNSDCMCVWWKSLWKLNVPSKMKFFLWRLFHDHLSTKVNLMRR

Query:  GVNISGLCVLCNSVMEDSLHLFWNCSVVRNMW
         V  + LC  C +  EDSLH  W C  V  +W
Subjt:  GVNISGLCVLCNSVMEDSLHLFWNCSVVRNMW

A0A6J1D5K1 uncharacterized protein LOC111017816; e-value: 6.8e-69; % identity: 31.25
Query:  NQALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKTKAPRRVLDFQPISLCNVSYKLISKV
        NQ LL+ F+ +++ +A +Q  PHKASGPD  SG FYR+ W +V   V H CL VLN   SP  LNET+I LIPK + P R+ +F+PISLCNV YK+I K 
Subjt:  NQALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKTKAPRRVLDFQPISLCNVSYKLISKV

Query:  LVNRMKHILTRIISENQSTFIPGRCVVDNAYWGLNVFMS-------------------SGNELEETQNGQGKAMV-------------------------
        +VNRMK +L  IIS NQS F+PGRCVVDNA  G     S                    G  L   + G  +A +                         
Subjt:  LVNRMKHILTRIISENQSTFIPGRCVVDNAYWGLNVFMS-------------------SGNELEETQNGQGKAMV-------------------------

Query:  --------VRELLLVYERTTGQTINYEKSVVAFSPNTEDDSQQYISLVLSVSCSP----------CHAQYLELPSF------------------------
                V+ +L  Y++ +GQTIN+EKSV +FSPNT       +   + +   P            A    +PS+                        
Subjt:  --------VRELLLVYERTTGQTINYEKSVVAFSPNTEDDSQQYISLVLSVSCSP----------CHAQYLELPSF------------------------

Query:  --------------MPRNRPGTLKF-IKDRIWKQ----IQELGSRPSFIWRSLLWGWDLLARGCRWRIGNGRSIPIYGSNWVPGNFSLQIQYVPSLSLIS
                      + R+    L   +K R +K       ++GS PS+IWRSLL G  +L  G RWR+G+G ++PIYGSNW+P + +L +   P L L S
Subjt:  --------------MPRNRPGTLKF-IKDRIWKQ----IQELGSRPSFIWRSLLWGWDLLARGCRWRIGNGRSIPIYGSNWVPGNFSLQIQYVPSLSLIS

Query:  KVSDLFTVSGQWDEGKIRGHFMSSDCEAILKIPLRYGLFDDQLIWHYEKHGSFTVMSGY-----RLALLMVTQTRPSSSNSDCMCVWW--KSLWKLNVPS
         VS L + SG W+EG IRG F   +   IL+IP+      D  IWHY+  G FT+ S Y     RL   + + +     + D +   W  K + ++   S
Subjt:  KVSDLFTVSGQWDEGKIRGHFMSSDCEAILKIPLRYGLFDDQLIWHYEKHGSFTVMSGY-----RLALLMVTQTRPSSSNSDCMCVWW--KSLWKLNVPS

Query:  KMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSVMEDSLHLFWNCSVVRNMWLCSKFAPLYHSLCCSSFEEIMWSMKDRLNLLDFELAAANCRLARP
        ++++ L ++    + T +   +         VL    M +   + W    +RN W+ +K +        S     M   + R N    +  + +   +  
Subjt:  KMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSVMEDSLHLFWNCSVVRNMWLCSKFAPLYHSLCCSSFEEIMWSMKDRLNLLDFELAAANCRLARP

Query:  CLQVISDRTVRRGGWMPPVGVLMAACCVLPKCWSVDIAEGWALIRGIEVAQQMGFS
        CL   S     RG  M     L AA  VL    +VD+AE  A   G+++A+ MG +
Subjt:  CLQVISDRTVRRGGWMPPVGVLMAACCVLPKCWSVDIAEGWALIRGIEVAQQMGFS

A0A6J1DAR4 uncharacterized protein LOC111018954; e-value: 8.0e-70; % identity: 29.8
Query:  NQALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKTKAPRRVLDFQPISLCNVSYKLISKV
        N  LL+PF E+E+I+AL Q HPHKA GPD  S  FY+N W IV   V+  CL +LN     ++L+E +                                
Subjt:  NQALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKTKAPRRVLDFQPISLCNVSYKLISKV

Query:  LVNRMKHILTRIISENQSTFIPGRCVVDNAYWG---LNVFMSSG------NELEETQNGQG-------------------------------------KA
            +K  L  +IS++QS FIP R + DNA  G   L + MS        N LE      G                                       
Subjt:  LVNRMKHILTRIISENQSTFIPGRCVVDNAYWG---LNVFMSSG------NELEETQNGQG-------------------------------------KA

Query:  MVVRELL-LVYERTTGQTINYEK---SVVAFSPNTEDDSQQYISLVLSVSCSPCHAQYLELPSFMPRNRPGTLKFIKDRIWKQIQ---------------
        M+V   L LV  + +   I Y K     +   P  +D     I  +LSV+   C  QYL LP+FMPRNR     +IKDR+WK +Q               
Subjt:  MVVRELL-LVYERTTGQTINYEK---SVVAFSPNTEDDSQQYISLVLSVSCSPCHAQYLELPSFMPRNRPGTLKFIKDRIWKQIQ---------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------ELGSRPSFIWRSLLWGWDLLARGCRWRIGNGRSIPIYGSNWVPGNFSLQIQYVPSLSLISKVSDLFT-VSGQWDEGKIRGHFMSSDCEAILKIP
              ++   PS+IWRS+LWG DLL +G RWRIGNG S+ IYG NWVP   +L+I   P L L+S+VS L     G W    +R  F   + + IL IP
Subjt:  ------ELGSRPSFIWRSLLWGWDLLARGCRWRIGNGRSIPIYGSNWVPGNFSLQIQYVPSLSLISKVSDLFT-VSGQWDEGKIRGHFMSSDCEAILKIP

Query:  LRYGLFDDQLIWHYEKHGSFTVMSGYRLALLMVTQTR-PSSSNSDCMCVWWKSLWKLNVPSKMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSVME
        +  G  +D+LIW+YEK G ++V SGY++ALL     + PSSS+S+ +  WW   WK+++P+K+K FLWRL  D L T  NL +RGV I+  C  C    E
Subjt:  LRYGLFDDQLIWHYEKHGSFTVMSGYRLALLMVTQTR-PSSSNSDCMCVWWKSLWKLNVPSKMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSVME

Query:  DSLHLFWNCSVVRNMWLCSKFAPL---------YHSLCCSSFEE---IMWSMKDRLNLLDF
        DS+HLFW C     +W+ SKF  L         + SL  + FEE   ++W + ++ N   F
Subjt:  DSLHLFWNCSVVRNMWLCSKFAPL---------YHSLCCSSFEE---IMWSMKDRLNLLDF

A0A6J1DX30 uncharacterized protein LOC111024874; e-value: 9.5e-71; % identity: 27.53
Query:  MNQALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKTKAPRRVLDFQPISLCNVSYKLISK
        +N+ LL P+T++EI +A++Q  P KA GPD     FY+ +W +V P    +CL  LN G   +  N T I LIPK K PR + DF+PISLCNVSYK+ISK
Subjt:  MNQALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKTKAPRRVLDFQPISLCNVSYKLISK

Query:  VLVNRMKHILTRIISENQSTFIPGRCVVDNAY--------------------------------------------------W-----------------
         + NR+K+++  +IS+ QS F+P R + DN                                                    W                 
Subjt:  VLVNRMKHILTRIISENQSTFIPGRCVVDNAY--------------------------------------------------W-----------------

Query:  ------------------------------GLNVFMS--------SGNELEETQNG-----------------QGKAMVVRELLLVYERTTGQTINYEKS
                                      GL+  ++        +G   EE                     + + + +R LL  Y R +GQ IN+ KS
Subjt:  ------------------------------GLNVFMS--------SGNELEETQNG-----------------QGKAMVVRELLLVYERTTGQTINYEKS

Query:  VVAFSPNTEDDSQQYISLVLSVSCSPCHAQYLELPSFMPRNRP-------------------GTLKF----------IKDRIWKQIQ-------------
         + FSPN   + QQY+  +L+V        YL LPS   R R                    G L F          +   +W+ +Q             
Subjt:  VVAFSPNTEDDSQQYISLVLSVSCSPCHAQYLELPSFMPRNRP-------------------GTLKF----------IKDRIWKQIQ-------------

Query:  ----------ELGSRPSFIWRSLLWGWDLLARGCRWRIGNGRSIPIYGSNWVPGNFSLQIQYVPSLSLISKVSDLFTVSGQWDEGKIRGHFMSSDCEAIL
                     S+ S+ W+  LWG DLL +G R R+GNG +I  +   W+P   + +     + +L + V+   T  G WD   I   F + D + IL
Subjt:  ----------ELGSRPSFIWRSLLWGWDLLARGCRWRIGNGRSIPIYGSNWVPGNFSLQIQYVPSLSLISKVSDLFTVSGQWDEGKIRGHFMSSDCEAIL

Query:  KIPLRYGLFDDQLIWHYEKHGSFTVMSGYRLALLMVTQTRPSSSNSDCMCVWWKSLWKLNVPSKMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSV
         +P+      D  +WHY+K G+++V SGY+  L M  +   +S++++     W S+WKL VP+K+K F+WR  H+H+ T  NL+ RG+     C +C   
Subjt:  KIPLRYGLFDDQLIWHYEKHGSFTVMSGYRLALLMVTQTRPSSSNSDCMCVWWKSLWKLNVPSKMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSV

Query:  MEDSLHLFWNCSVVRNMWLCSKFAPLYHSLCC------SSFEEIMWSMKDRLNLLDFELAA
         E  +H F++C   R +W       L+  L C       SF E+  S+ ++L   D  LAA
Subjt:  MEDSLHLFWNCSVVRNMWLCSKFAPLYHSLCC------SSFEEIMWSMKDRLNLLDFELAA

A0A803Q7Q3 Uncharacterized protein; e-value: 5.2e-69; % identity: 26.38
Query:  MNQALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKTKAPRRVLDFQPISLCNVSYKLISK
        +N+ L+ PFT+D+++ A++  HPHKA G D + G FYR  W I+   VT  CL +LN+G S +++N+T+I LIPK   P ++ +F+PISLCNV YK+++K
Subjt:  MNQALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKTKAPRRVLDFQPISLCNVSYKLISK

Query:  VLVNRMKHILTRIISENQSTFIPGRCVVDNAYWGLNVFMS----------SGNELEET------------QNGQGKAMVVRELLLVYERTTGQTINYEKS
         L   MKH L + ISE QS F+ GR + DNA  G     S            N+++ +            +  + +   + ++   Y R +GQ IN EKS
Subjt:  VLVNRMKHILTRIISENQSTFIPGRCVVDNAYWGLNVFMS----------SGNELEET------------QNGQGKAMVVRELLLVYERTTGQTINYEKS

Query:  VVAFSPNTEDDSQQYISLVLSVSCSPCHAQYLELPSFMPRNRP----------------GTLKFIK-------DRIWKQIQELG----------------
         V+   +      Q+++  L V   P HA YL LPS++ R +                 G  K  K       D+++K  +E G                
Subjt:  VVAFSPNTEDDSQQYISLVLSVSCSPCHAQYLELPSFMPRNRP----------------GTLKFIK-------DRIWKQIQELG----------------

Query:  ---------------------------------SRPSFIWRSLLWGWDLLARGCRWRIGNGRSIPIYGSNWVPGNFSLQIQYVPSLSLISKVSDLFTVSG
                                            S IW+ ++WG D++  G  WR+GNGR+I ++   W+P      I      +  + VS LF    
Subjt:  ---------------------------------SRPSFIWRSLLWGWDLLARGCRWRIGNGRSIPIYGSNWVPGNFSLQIQYVPSLSLISKVSDLFTVSG

Query:  QWDEGKIRGHFMSSDCEAILKIPLRYGLFDDQLIWHYEKHGSFTVMSGYRLAL-LMVTQTRPSSSNSDCMCVWWKSLWKLNVPSKMKFFLWRLFHDHLST
         W+E  +  +F   D   IL IP+     +D L+W + K G + V  GYR+A  + +  TR   SN D    WWK  W LN+P +MK F W++  + L  
Subjt:  QWDEGKIRGHFMSSDCEAILKIPLRYGLFDDQLIWHYEKHGSFTVMSGYRLAL-LMVTQTRPSSSNSDCMCVWWKSLWKLNVPSKMKFFLWRLFHDHLST

Query:  KVNLMRRGVNISGLCVLCNSVMEDSLHLFWNCSVVRNMWLCSKFAPLYHSLCCSSFEEIMWSMKDRLNLLDFE------LAAANCRLARPCLQVISDRTV
        K NL  RG  I   C  C    E   H  W C  V+ +W    +  L       S  +++   + +L   +FE       +       +  L  +     
Subjt:  KVNLMRRGVNISGLCVLCNSVMEDSLHLFWNCSVVRNMWLCSKFAPLYHSLCCSSFEEIMWSMKDRLNLLDFE------LAAANCRLARPCLQVISDRTV

Query:  RRGGWMPPVG-----------------------------VLMAACCVLPKCWSVDIAEGWALIRGIEVAQQMGFSGFSVELDSLRLINVLRNEVTDLSEV
        +     PPVG                             ++ A    LP C +V +AE  A++  ++   +   + + ++LD  +L++ +    + L ++
Subjt:  RRGGWMPPVG-----------------------------VLMAACCVLPKCWSVDIAEGWALIRGIEVAQQMGFSGFSVELDSLRLINVLRNEVTDLSEV

Query:  GFLMVEVRQ
          ++ ++++
Subjt:  GFLMVEVRQ

SwissProt top hits (e-value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein; e-value: 1.5e-12; % identity: 31.71
Query:  QALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKT-KAPRRVLDFQPISLCNVSYKLISKV
        ++L RP T  EI+  +      K+ GPD  +  FY+ + + + P +      +  +G  P S  E  I+LIPK  +   +  +F+PISL N+  K+++K+
Subjt:  QALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKT-KAPRRVLDFQPISLCNVSYKLISKV

Query:  LVNRMKHILTRIISENQSTFIPG
        L NR++  + ++I  +Q  FIPG
Subjt:  LVNRMKHILTRIISENQSTFIPG

P08548 LINE-1 reverse transcriptase homolog; e-value: 3.3e-12; % identity: 32.52
Query:  QALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKT-KAPRRVLDFQPISLCNVSYKLISKV
        + L RP +  EI   ++     K+ GPD  +  FY+   + + P + +    +  +G  P +  E  I LIPK  K P R  +++PISL N+  K+++K+
Subjt:  QALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKT-KAPRRVLDFQPISLCNVSYKLISKV

Query:  LVNRMKHILTRIISENQSTFIPG
        L NR++  + +II  +Q  FIPG
Subjt:  LVNRMKHILTRIISENQSTFIPG

P0C2F6 Putative ribonuclease H protein At1g65750; e-value: 7.1e-15; % identity: 26.89
Query:  SFIWRSLLWGW-DLLARGCRWRIGNGRSIPIYGSNWVPGNFSLQIQ--YVPSLSLISKVSDLFTVSGQWDEGKIRGHFMSSDCEAILKIPLRYGLFD---
        S  WRS+  G  D+++ G  W  G+G+ I  +   WV G   L++     P+        DL+     WD  KI  +  ++      ++ LR  + D   
Subjt:  SFIWRSLLWGW-DLLARGCRWRIGNGRSIPIYGSNWVPGNFSLQIQ--YVPSLSLISKVSDLFTVSGQWDEGKIRGHFMSSDCEAILKIPLRYGLFD---

Query:  ---DQLIWHYEKHGSFTVMSGYRLALLMVTQTRPSSSNSDCMCVWWKSLWKLNVPSKMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSVMEDSLHL
           D+L W + + G F+V S Y + L +    RP+      M  ++  LWK+ VP ++K FLW + +  + T+    RR ++ S +C +C   +E  LH+
Subjt:  ---DQLIWHYEKHGSFTVMSGYRLALLMVTQTRPSSSNSDCMCVWWKSLWKLNVPSKMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSVMEDSLHL

Query:  FWNCSVVRNMWL
          +C     +W+
Subjt:  FWNCSVVRNMWL

P11369 LINE-1 retrotransposable element ORF2 protein; e-value: 3.6e-11; % identity: 33.9
Query:  PFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPK-TKAPRRVLDFQPISLCNVSYKLISKVLVNRM
        P +  EI   +      K+ GPD  S  FY+   + + P +      +  +G  P S  E  I LIPK  K P ++ +F+PISL N+  K+++K+L NR+
Subjt:  PFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPK-TKAPRRVLDFQPISLCNVSYKLISKVLVNRM

Query:  KHILTRIISENQSTFIPG
        +  +  II  +Q  FIPG
Subjt:  KHILTRIISENQSTFIPG

P14381 Transposon TX1 uncharacterized 149 kDa protein; e-value: 1.9e-12; % identity: 29.23
Query:  QALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKTKAPRRVLDFQPISLCNVSYKLISKVL
        + L  P T DE+  AL+    +K+ G D L+  F++  W  + P           +G  P S    ++ L+PK    R + +++P+SL +  YK+++K +
Subjt:  QALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKTKAPRRVLDFQPISLCNVSYKLISKVL

Query:  VNRMKHILTRIISENQSTFIPGRCVVDNAY
          R+K +L  +I  +QS  +PGR + DN +
Subjt:  VNRMKHILTRIISENQSTFIPGRCVVDNAY

Arabidopsis top hits (e-value, % identity, alignment)
AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein; e-value: 3.0e-08; % identity: 28.44
Query:  DDQLIWHYEKHGSFTVMSGYRLALLMVTQTRPSSSNSDCMCVWWKSLWKLNVPSKMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSVMEDSLHLFW
        DD  IW  + H    + S  + +L +  Q          +  W+K++W  N   K  F  W +  + L T+  L   G++I  +C+LCNS  E   HLF+
Subjt:  DDQLIWHYEKHGSFTVMSGYRLALLMVTQTRPSSSNSDCMCVWWKSLWKLNVPSKMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSVMEDSLHLFW

Query:  NCSVVRNMW
         C     +W
Subjt:  NCSVVRNMW

AT2G02650.1 Ribonuclease H-like superfamily protein; e-value: 8.1e-06; % identity: 32.31
Query:  KSLWKLNVPSKMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSVMEDSLHLFWNCSVVRNMW
        +++WKL+V  K+K FLWR     L+T   L  R ++   +C  C    E   H+ +NC   +++W
Subjt:  KSLWKLNVPSKMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSVMEDSLHLFWNCSVVRNMW

AT3G25270.1 Ribonuclease H-like superfamily protein; e-value: 3.6e-06; % identity: 37.88
Query:  LWKLNVPSKMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSVMEDSLHLFWNCSVVRNMWLCS
        +WKL    K+K FLW+L    L+T  NL RR +     C  C    E S HLF++C   + +W  S
Subjt:  LWKLNVPSKMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSVMEDSLHLFWNCSVVRNMWLCS

AT4G29090.1 Ribonuclease H-like superfamily protein; e-value: 3.3e-23; % identity: 29.58
Query:  LGSRPSFIWRSLLWGWDLLARGCRWRIGNGRSIPIYGSNWV---PGNFSLQIQYVP-----SLSLISKVSDLFTVSG-QWDEGKIRGHFMSSDCEAILKI
        LGSRPSF+W+S+    ++L +G R  +GNG  I I+   W+   P + +L++Q VP     S+S I KVSDL   SG +W +  I   F   + + I ++
Subjt:  LGSRPSFIWRSLLWGWDLLARGCRWRIGNGRSIPIYGSNWV---PGNFSLQIQYVP-----SLSLISKVSDLFTVSG-QWDEGKIRGHFMSSDCEAILKI

Query:  PLRYGLFDDQLIWHYEKHGSFTVMSGY-RLALLMVTQTRPSSSNSDCMCVWWKSLWKLNVPSKMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSVM
                D   W Y   G +TV SGY  L  ++  ++ P   +   +   ++ +WK     K++ FLW+   + L     L  R ++    C+ C S  
Subjt:  PLRYGLFDDQLIWHYEKHGSFTVMSGY-RLALLMVTQTRPSSSNSDCMCVWWKSLWKLNVPSKMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSVM

Query:  EDSLHLFWNCSVVRNMWLCSKF-APLYHSLCCSSFEEIMW
        E   HL + C+  R  W  S    PL      S +  + W
Subjt:  EDSLHLFWNCSVVRNMWLCSKF-APLYHSLCCSSFEEIMW

AT5G16486.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein; e-value: 2.8e-06; % identity: 21.46
Query:  SFIWRSLLWGWDLLARGCRWRIGNGRSIPIYGSNW--------VPGNFSLQIQYVP-SLSLISKVSD-LFTVSGQWDEGKIRGHFMSSDCEAILKIPLRY
        S+IW+S+     +       ++G+G +   +  NW        + G+   ++  +P + S+   + D ++ ++G      I    + +       + L+ 
Subjt:  SFIWRSLLWGWDLLARGCRWRIGNGRSIPIYGSNW--------VPGNFSLQIQYVP-SLSLISKVSD-LFTVSGQWDEGKIRGHFMSSDCEAILKIPLRY

Query:  GLFDDQLIWHYEKHGSFTVMSGYRLALLMVTQTRPSSSNSDCMCVWWKSLWKLNVPSKMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSVMEDSLH
           DD  +W   K G      G+  A   +    P     D    W K++W      K  F  W      L T+  L+  G+++  LC+LCN+  E   H
Subjt:  GLFDDQLIWHYEKHGSFTVMSGYRLALLMVTQTRPSSSNSDCMCVWWKSLWKLNVPSKMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSVMEDSLH

Query:  LFWNC
        LF++C
Subjt:  LFWNC
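
The % identity values listed with the hits above can be related to the alignment blocks with a small sketch such as the one below. It assumes the usual BLAST-style convention that a letter on the middle line marks an identical residue, a + marks a conservative substitution, and a space marks a mismatch or gap, and that the denominator is the number of aligned columns; the page itself does not state how its figures were computed. The function name percent_identity is hypothetical. The example uses the single-block AT2G02650.1 alignment.

```python
def percent_identity(query, midline):
    """Return (identities, percent identity) for one alignment block.

    Assumption: letters in the midline are identical residues; '+' and
    spaces are not counted. The denominator is the aligned length,
    taken here as the length of the query line (gaps included).
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    return identities, round(100.0 * identities / len(query), 2)

query = "KSLWKLNVPSKMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSVMEDSLHLFWNCSVVRNMW"
midline = "+++WKL+V  K+K FLWR     L+T   L  R ++   +C  C    E   H+ +NC   +++W"
print(percent_identity(query, midline))
```

For this block the sketch counts 21 identities over 65 aligned columns, which rounds to the 32.31 reported for that hit. For multi-block alignments the counts would need to be summed over all blocks before dividing.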


Sequences
CDS sequence
ATGAATCAGGCGTTGTTGCGTCCTTTCACGGAGGATGAGATCATTGTGGCATTAAAGCAAACACATCCACATAAGGCGTCGGGTCCGGATGACTTGTCTGGCAACTTCTA
TAGGAATCACTGGCAGATTGTTGATCCCACTGTTACTCATAGTTGCCTGGTTGTCCTGAATCAGGGTTGCTCCCCCCAATCCTTGAATGAGACCATGATAGTCCTCATTC
CGAAGACCAAAGCTCCTCGGCGGGTTTTAGATTTCCAACCGATTTCCCTGTGCAACGTGAGCTATAAACTGATTTCGAAGGTGTTGGTTAATCGCATGAAACACATCCTT
ACGAGAATCATCTCGGAAAATCAGAGCACTTTTATCCCCGGGCGATGTGTAGTGGATAATGCATATTGGGGTTTGAATGTATTCATGAGCTCAGGAAACGAACTGGAGGA
AACTCAAAATGGGCAGGGCAAGGCTATGGTTGTTCGAGAGCTGCTATTAGTCTACGAACGAACGACAGGTCAAACTATCAACTATGAGAAGTCTGTGGTGGCGTTCAGTC
CTAACACTGAGGATGACTCTCAGCAGTATATCAGTTTGGTGCTCTCAGTCTCTTGCAGCCCTTGTCATGCTCAGTATCTTGAGCTCCCGTCATTCATGCCTCGCAATCGG
CCAGGGACGTTGAAATTTATTAAGGATCGCATTTGGAAGCAAATTCAGGAGTTGGGATCTCGTCCATCTTTCATTTGGCGTAGCTTGTTATGGGGTTGGGACTTGTTGGC
ACGTGGGTGTCGTTGGCGGATTGGCAATGGACGATCCATACCTATTTATGGTTCTAATTGGGTCCCAGGAAACTTTTCCCTCCAGATACAGTATGTTCCATCCTTGTCAT
TGATTAGTAAGGTCAGTGATTTGTTTACTGTGTCGGGACAGTGGGATGAGGGGAAGATTAGAGGTCACTTTATGTCGTCAGATTGTGAGGCTATTCTGAAAATTCCTTTG
CGTTATGGTTTGTTTGATGATCAACTAATTTGGCATTATGAAAAACATGGAAGTTTCACTGTCATGAGTGGGTACCGGTTGGCTCTATTGATGGTTACTCAAACACGTCC
CTCCTCATCTAACTCTGATTGTATGTGTGTTTGGTGGAAGAGCTTGTGGAAGCTGAATGTTCCAAGCAAGATGAAGTTTTTTCTATGGCGGTTGTTCCATGATCACTTGT
CGACGAAGGTAAATCTTATGAGACGTGGTGTTAACATTTCAGGTTTGTGTGTTCTTTGTAATTCTGTTATGGAGGATTCTCTTCATCTTTTCTGGAATTGCTCGGTTGTT
AGAAATATGTGGCTTTGTTCGAAGTTTGCACCTCTCTATCATTCCTTATGTTGTTCATCGTTTGAGGAAATCATGTGGTCAATGAAGGATAGACTGAATCTGCTGGATTT
TGAACTTGCTGCTGCCAATTGTCGTCTTGCTCGGCCTTGTTTGCAGGTGATTTCGGATAGGACAGTTCGGCGTGGTGGTTGGATGCCGCCAGTGGGTGTGTTGATGGCTG
CTTGCTGCGTGTTGCCGAAATGTTGGAGTGTGGACATTGCAGAGGGTTGGGCGTTGATTCGTGGCATCGAGGTTGCGCAACAGATGGGTTTTTCTGGCTTTAGTGTGGAG
TTGGATTCGTTGAGGTTGATTAATGTACTGCGCAACGAGGTGACTGATTTGTCTGAAGTTGGGTTCTTGATGGTTGAGGTCCGACAGTTGCTGCAGGTTGGAGCTGATGC
ACATGAGCATCAGGCTACCTTTTATGCATATTATGATTTTTTGTTTTGCTCTGTAACCATTATTCAACCCACTACAGTAACTATTTCCTCACTTCTTCCATGCGACTCCT
TGACTTTTCCGACGACCGTCGGCCGCCGTCAGCCGGCGAAGTTTCCGATGAATTTTCCGACGACCGCCGACGACTTTTCCGGCGATCGCCGATGA
mRNA sequence
ATGAATCAGGCGTTGTTGCGTCCTTTCACGGAGGATGAGATCATTGTGGCATTAAAGCAAACACATCCACATAAGGCGTCGGGTCCGGATGACTTGTCTGGCAACTTCTA
TAGGAATCACTGGCAGATTGTTGATCCCACTGTTACTCATAGTTGCCTGGTTGTCCTGAATCAGGGTTGCTCCCCCCAATCCTTGAATGAGACCATGATAGTCCTCATTC
CGAAGACCAAAGCTCCTCGGCGGGTTTTAGATTTCCAACCGATTTCCCTGTGCAACGTGAGCTATAAACTGATTTCGAAGGTGTTGGTTAATCGCATGAAACACATCCTT
ACGAGAATCATCTCGGAAAATCAGAGCACTTTTATCCCCGGGCGATGTGTAGTGGATAATGCATATTGGGGTTTGAATGTATTCATGAGCTCAGGAAACGAACTGGAGGA
AACTCAAAATGGGCAGGGCAAGGCTATGGTTGTTCGAGAGCTGCTATTAGTCTACGAACGAACGACAGGTCAAACTATCAACTATGAGAAGTCTGTGGTGGCGTTCAGTC
CTAACACTGAGGATGACTCTCAGCAGTATATCAGTTTGGTGCTCTCAGTCTCTTGCAGCCCTTGTCATGCTCAGTATCTTGAGCTCCCGTCATTCATGCCTCGCAATCGG
CCAGGGACGTTGAAATTTATTAAGGATCGCATTTGGAAGCAAATTCAGGAGTTGGGATCTCGTCCATCTTTCATTTGGCGTAGCTTGTTATGGGGTTGGGACTTGTTGGC
ACGTGGGTGTCGTTGGCGGATTGGCAATGGACGATCCATACCTATTTATGGTTCTAATTGGGTCCCAGGAAACTTTTCCCTCCAGATACAGTATGTTCCATCCTTGTCAT
TGATTAGTAAGGTCAGTGATTTGTTTACTGTGTCGGGACAGTGGGATGAGGGGAAGATTAGAGGTCACTTTATGTCGTCAGATTGTGAGGCTATTCTGAAAATTCCTTTG
CGTTATGGTTTGTTTGATGATCAACTAATTTGGCATTATGAAAAACATGGAAGTTTCACTGTCATGAGTGGGTACCGGTTGGCTCTATTGATGGTTACTCAAACACGTCC
CTCCTCATCTAACTCTGATTGTATGTGTGTTTGGTGGAAGAGCTTGTGGAAGCTGAATGTTCCAAGCAAGATGAAGTTTTTTCTATGGCGGTTGTTCCATGATCACTTGT
CGACGAAGGTAAATCTTATGAGACGTGGTGTTAACATTTCAGGTTTGTGTGTTCTTTGTAATTCTGTTATGGAGGATTCTCTTCATCTTTTCTGGAATTGCTCGGTTGTT
AGAAATATGTGGCTTTGTTCGAAGTTTGCACCTCTCTATCATTCCTTATGTTGTTCATCGTTTGAGGAAATCATGTGGTCAATGAAGGATAGACTGAATCTGCTGGATTT
TGAACTTGCTGCTGCCAATTGTCGTCTTGCTCGGCCTTGTTTGCAGGTGATTTCGGATAGGACAGTTCGGCGTGGTGGTTGGATGCCGCCAGTGGGTGTGTTGATGGCTG
CTTGCTGCGTGTTGCCGAAATGTTGGAGTGTGGACATTGCAGAGGGTTGGGCGTTGATTCGTGGCATCGAGGTTGCGCAACAGATGGGTTTTTCTGGCTTTAGTGTGGAG
TTGGATTCGTTGAGGTTGATTAATGTACTGCGCAACGAGGTGACTGATTTGTCTGAAGTTGGGTTCTTGATGGTTGAGGTCCGACAGTTGCTGCAGGTTGGAGCTGATGC
ACATGAGCATCAGGCTACCTTTTATGCATATTATGATTTTTTGTTTTGCTCTGTAACCATTATTCAACCCACTACAGTAACTATTTCCTCACTTCTTCCATGCGACTCCT
TGACTTTTCCGACGACCGTCGGCCGCCGTCAGCCGGCGAAGTTTCCGATGAATTTTCCGACGACCGCCGACGACTTTTCCGGCGATCGCCGATGA
Protein sequence
MNQALLRPFTEDEIIVALKQTHPHKASGPDDLSGNFYRNHWQIVDPTVTHSCLVVLNQGCSPQSLNETMIVLIPKTKAPRRVLDFQPISLCNVSYKLISKVLVNRMKHIL
TRIISENQSTFIPGRCVVDNAYWGLNVFMSSGNELEETQNGQGKAMVVRELLLVYERTTGQTINYEKSVVAFSPNTEDDSQQYISLVLSVSCSPCHAQYLELPSFMPRNR
PGTLKFIKDRIWKQIQELGSRPSFIWRSLLWGWDLLARGCRWRIGNGRSIPIYGSNWVPGNFSLQIQYVPSLSLISKVSDLFTVSGQWDEGKIRGHFMSSDCEAILKIPL
RYGLFDDQLIWHYEKHGSFTVMSGYRLALLMVTQTRPSSSNSDCMCVWWKSLWKLNVPSKMKFFLWRLFHDHLSTKVNLMRRGVNISGLCVLCNSVMEDSLHLFWNCSVV
RNMWLCSKFAPLYHSLCCSSFEEIMWSMKDRLNLLDFELAAANCRLARPCLQVISDRTVRRGGWMPPVGVLMAACCVLPKCWSVDIAEGWALIRGIEVAQQMGFSGFSVE
LDSLRLINVLRNEVTDLSEVGFLMVEVRQLLQVGADAHEHQATFYAYYDFLFCSVTIIQPTTVTISSLLPCDSLTFPTTVGRRQPAKFPMNFPTTADDFSGDRR
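
The relationship between the CDS and protein sequences above can be checked with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1) and reading frame 0; translate is a hypothetical helper written for illustration, not a CuGenDB tool. Only the first 21 and last 15 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGAATCAGGCGTTGTTGCGT"))  # first 21 nt of the CDS
print(translate("GGCGATCGCCGATGA"))        # last 15 nt, ending in the TGA stop
```

The two fragments translate to MNQALLR and GDRR, matching the start and end of the protein sequence listed above.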