; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021719 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021719
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr7:11140461..11144861
RNA-Seq ExpressionLag0021719
SyntenyLag0021719
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_030479133.1 uncharacterized protein LOC115696372 [Cannabis sativa]2.2e-17032.09Show/hide
Query:  MKTISWNVQWLGNPRSLRALRHLVRSHHPQLVFLMETKLQQSGREKVRRDLQFDCCLAVPSSGQSGGLMLLWKNETNVSIRSFSKGHIDAMV-SEDGTGW
        MK +SWN + LGN  + R LR LV    PQ++FLMETKL      + +    F   L V   G  GG+MLLWK+  +V++ S +  H D  V  +DG  W
Subjt:  MKTISWNVQWLGNPRSLRALRHLVRSHHPQLVFLMETKLQQSGREKVRRDLQFDCCLAVPSSGQSGGLMLLWKNETNVSIRSFSKGHIDAMV-SEDGTGW

Query:  RFTGIYGNPNREHHHETWNLIKRLKE------------------------GPKQ----------------------------------------------
            IYG P   +   TW LI+RL +                        GP +                                              
Subjt:  RFTGIYGNPNREHHHETWNLIKRLKE------------------------GPKQ----------------------------------------------

Query:  IVDHVWRDRLSMGHVSILE-------------KLNECLSHLKTWSHRQYGGSIRGAIDKKEKEIQSLFS--RLDDQSLLEVVEKERELENLLEDDEIYWH
         +++VW+D L    ++ L+               ++ L  L+ W  +++ G ++  I K ++E+  L S     D  L  V   E+ LE LL ++E YW 
Subjt:  IVDHVWRDRLSMGHVSILE-------------KLNECLSHLKTWSHRQYGGSIRGAIDKKEKEIQSLFS--RLDDQSLLEVVEKERELENLLEDDEIYWH

Query:  QRAREDWLKWGDKNTKWFHMQANRRRKSSS------------PNMEDI----------------------EYILRIIPTTITEAQNSELTKCFTRDEIYG
        QR+R DWLK GD+NTK+FH +A+ R  ++              + +DI                       ++L  IPTTI+  QN  L + FTR ++Y 
Subjt:  QRAREDWLKWGDKNTKWFHMQANRRRKSSS------------PNMEDI----------------------EYILRIIPTTITEAQNSELTKCFTRDEIYG

Query:  VIKKMHPSKAPGPDGIHAVFYQKYWDIVGD-------------------------------------------------EVIAKTLANRMKLVLDTIISP
         +K M   K+PG DG+ A+FYQ YW IVGD                                                 ++I+K LA R+K VL ++IS 
Subjt:  VIKKMHPSKAPGPDGIHAVFYQKYWDIVGD-------------------------------------------------EVIAKTLANRMKLVLDTIISP

Query:  TQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRVEWIYVKKIMEKMGFSARWLNLIMRCVESVILQVILNGSPRAEFIPKRGLRQ
        TQS F+  RLI++N ++ FE IHS+K R++G  G AALK DMSKA+DRVEW ++  +M KMGF+ RW+ LIM C+ + +    +NG       P RGLRQ
Subjt:  TQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRVEWIYVKKIMEKMGFSARWLNLIMRCVESVILQVILNGSPRAEFIPKRGLRQ

Query:  GDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADDSLLFFKASETECRTIHRILGTYERASGQTINFEKSNFMVSPNTSEVQSNII
        GDPLSPYLFLIC+EGLS LL + +    L GL ++RH P ++HLF+ADDSLLF +A++  C  I R L  Y RASGQ +N +KS    SPNT     N  
Subjt:  GDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADDSLLFFKASETECRTIHRILGTYERASGQTINFEKSNFMVSPNTSEVQSNII

Query:  KNILNVQHKDSLGQYLGLPSQNAQSKQEIFDNIKDRVWKALQGWKGKLFSAAGKEILIKSVAQAIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERK
        + IL +   +    YLGLP+ + + K ++F NIK+++WK +  W  K+FS  GKE+L+K+V Q+IP YAMS F+  V LCNE+ +  A+FWWG+S + +K
Subjt:  KNILNVQHKDSLGQYLGLPSQNAQSKQEIFDNIKDRVWKALQGWKGKLFSAAGKEILIKSVAQAIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERK

Query:  IHWRSWKRLCIHKNHGGLGFRDLSLFNQAILAKQSWRLIRYPDSLMARVLRGRYFKTGHFLKAQVGNNPSYVWRSIIWGRELFKRGYRWRIGKGFNIEAA
        IHW+ W+ LC  K  GG+GFR    FNQA+LAKQ+WR+ + P SL++RVL+G YF    F+ A+ G   S  W+ I+WGREL  +G R ++G G NI  A
Subjt:  IHWRSWKRLCIHKNHGGLGFRDLSLFNQAILAKQSWRLIRYPDSLMARVLRGRYFKTGHFLKAQVGNNPSYVWRSIIWGRELFKRGYRWRIGKGFNIEAA

Query:  TDPWIPKEGSCKPI------TIH----------------------PDVQQYSETPGRDHP----FVW-----GVY--------------------LNNRQ
         D WIP     KP       T H                      PDV    + P    P    ++W     G Y                     + ++
Subjt:  TDPWIPKEGSCKPI------TIH----------------------PDVQQYSETPGRDHP----FVW-----GVY--------------------LNNRQ

Query:  GW----------TTTDYFTW----------------------------------------------TWKNSKGEDLD----------DRRMAVS------
         W          +    F W                                               WK S G  +D          D  M +S      
Subjt:  GW----------TTTDYFTW----------------------------------------------TWKNSKGEDLD----------DRRMAVS------

Query:  -----LVMVWLIWSHRNEVIHSRKQPDMEILKAQIHKYSAELIHNKDSHLDQNHSSIVDHVC-----NTPMAPLNPWNPIPTGTWRLSCDATWSDGKSRG
             L ++W IWS RN  IH +K      +K  +  ++  +     +++DQ + SI   V       T  A +  W P P  T++L+ DA     +S+ 
Subjt:  -----LVMVWLIWSHRNEVIHSRKQPDMEILKAQIHKYSAELIHNKDSHLDQNHSSIVDHVC-----NTPMAPLNPWNPIPTGTWRLSCDATWSDGKSRG

Query:  GIGWVVRDWCGNMLRTGYKCVLRAWKISWLEAFAICEGLKSLPT-EKPQLRNETDCLQV
        GIG +VR+  G +        +  +K   +EA A+  GL    T + P    ETDCL +
Subjt:  GIGWVVRDWCGNMLRTGYKCVLRAWKISWLEAFAICEGLKSLPT-EKPQLRNETDCLQV

XP_030483769.1 uncharacterized protein LOC115700339 [Cannabis sativa]3.1e-16435.65Show/hide
Query:  MKTISWNVQWLGNPRSLRALRHLVRSHHPQLVFLMETKLQQSGREKVRRDLQFDCCLAVPSSGQSGGLMLLWKNETNVSIRSFSKGHIDAMV-SEDGTGW
        MK +SWN + LG+P + R LR L+    PQ++FLMETKL      + +  L +   L VP  G  GG+MLLW++  +V++ S +  H D  V  EDG  W
Subjt:  MKTISWNVQWLGNPRSLRALRHLVRSHHPQLVFLMETKLQQSGREKVRRDLQFDCCLAVPSSGQSGGLMLLWKNETNVSIRSFSKGHIDAMV-SEDGTGW

Query:  RFTGIYGNPNREHHHETWNLIKRLKE------------------------GPKQ----------------------------------------------
         F+ IYG P   +   TW LI+RL +                        GP +                                              
Subjt:  RFTGIYGNPNREHHHETWNLIKRLKE------------------------GPKQ----------------------------------------------

Query:  IVDHVWRDRL------------------------SMGHVSILEK---------------------------------------LNECLSHLKTWSHRQYG
         V+ VW+D L                        S  H  ++ K                                       L +C S L+ W  R+Y 
Subjt:  IVDHVWRDRL------------------------SMGHVSILEK---------------------------------------LNECLSHLKTWSHRQYG

Query:  GSIRGAIDKKEKEIQSLFS--RLDDQSLLEVVEKERELENLLEDDEIYWHQRAREDWLKWGDKNTKWFHMQANRRRKSS---------------------
        G ++  I   +K +  L S          EV   E+ LE LL ++E YW QR+R +WL+ GD+NTK+FH +A+ R+ ++                     
Subjt:  GSIRGAIDKKEKEIQSLFS--RLDDQSLLEVVEKERELENLLEDDEIYWHQRAREDWLKWGDKNTKWFHMQANRRRKSS---------------------

Query:  ----------SPNMED---IEYILRIIPTTITEAQNSELTKCFTRDEIYGVIKKMHPSKAPGPDGIHAVFYQKYWDIVGDEVIAKTLANRM---------
                  + + ED   + ++L  IPTTI+  QN  L   FT  ++   +  M   K+PG DG+ A+FYQ YW IVGD V  K + N +         
Subjt:  ----------SPNMED---IEYILRIIPTTITEAQNSELTKCFTRDEIYGVIKKMHPSKAPGPDGIHAVFYQKYWDIVGDEVIAKTLANRM---------

Query:  -----KLVLDTIISPTQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRVEWIYVKKIMEKMGFSARWLNLIMRCVESVILQVILN
             K VL ++IS TQS F+  RLI++N ++ FE +HS+K R++G  G AALKLDMSKA+DRVEW ++  +M KMGF+ RW+NLIM C+ +      +N
Subjt:  -----KLVLDTIISPTQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRVEWIYVKKIMEKMGFSARWLNLIMRCVESVILQVILN

Query:  GSPRAEFIPKRGLRQGDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADDSLLFFKASETECRTIHRILGTYERASGQTINFEKSN
        G      IP+RGLRQGDPLSPYLFLIC+EGLS LL + +    L GL ++RH P ++HL +ADDSLLF +A++  C  I R L  Y RASGQ +N +KS 
Subjt:  GSPRAEFIPKRGLRQGDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADDSLLFFKASETECRTIHRILGTYERASGQTINFEKSN

Query:  FMVSPNTSEVQSNIIKNILNVQHKDSLGQYLGLPSQNAQSKQEIFDNIKDRVWKALQGWKGKLFSAAGKEILIKSVAQAIPNYAMSYFKFLVSLCNELNS
           SPNT     N  + IL +   +    YLGLP+ + + K ++F +IK+++WK +  W  K+FS  GKE+L+K+V Q+IP YAMS F+     CNE+ +
Subjt:  FMVSPNTSEVQSNIIKNILNVQHKDSLGQYLGLPSQNAQSKQEIFDNIKDRVWKALQGWKGKLFSAAGKEILIKSVAQAIPNYAMSYFKFLVSLCNELNS

Query:  FCARFWWGASENERKIHWRSWKRLCIHKNHGGLGFRDLSLFNQAILAKQSWRLIRYPDSLMARVLRGRYFKTGHFLKAQVGNNPSYVWRSIIWGRELFKR
          ARFWWG++ + +KIHW+ WK LC  K  GG+GFR    FNQA+LAKQ+WR+ + P+SL++RVL+GRY+    F+ A      S  W+ I+WGREL  +
Subjt:  FCARFWWGASENERKIHWRSWKRLCIHKNHGGLGFRDLSLFNQAILAKQSWRLIRYPDSLMARVLRGRYFKTGHFLKAQVGNNPSYVWRSIIWGRELFKR

Query:  GYRWRIGKGFNIEAATDPWIPKEGSCKPIT-IHPDVQQYSE--TPGRD
        G R ++G G +I   TD WIP   + K    + P     S+  TP R+
Subjt:  GYRWRIGKGFNIEAATDPWIPKEGSCKPIT-IHPDVQQYSE--TPGRD

XP_030498122.1 uncharacterized protein LOC115713779 [Cannabis sativa]8.0e-16535.03Show/hide
Query:  MKTISWNVQWLGNPRSLRALRHLVRSHHPQLVFLMETKLQQSGREKVRRDLQFDCCLAVPSSGQSGGLMLLWKNETNVSIRSFSKGHIDAMV-SEDGTGW
        M  +SWN + LGNP + R LR LV    P ++F+METKLQ    +K R+ L F   + VP  GQ GGLMLLWK+   +SI ++S  HID  V S DG   
Subjt:  MKTISWNVQWLGNPRSLRALRHLVRSHHPQLVFLMETKLQQSGREKVRRDLQFDCCLAVPSSGQSGGLMLLWKNETNVSIRSFSKGHIDAMV-SEDGTGW

Query:  RFTGIYGNPNREHHHETWNLIKRLKE------------------------GPKQ----------IVD------------------------HVWRDRLSM
         FTG YG+P+    H TW L++R  +                        GP++          +VD                        HV ++RL  
Subjt:  RFTGIYGNPNREHHHETWNLIKRLKE------------------------GPKQ----------IVD------------------------HVWRDRLSM

Query:  GH---------------------------------------------------------------------------VSILEKLNECLSHLKTWSHRQYG
        G                                                                            V +   +++C S+L+ W +  + 
Subjt:  GH---------------------------------------------------------------------------VSILEKLNECLSHLKTWSHRQYG

Query:  GSIRGAIDKKEKEIQSL--FSRLDDQSLLEVVEKERELENLLEDDEIYWHQRAREDWLKWGDKNTKWFHMQANRRRKS----------------------
        G++R  I    K + +L   S    +    +   E+ L++LL  +E YWHQRAR  W+K GD NTK+FH +AN R  +                      
Subjt:  GSIRGAIDKKEKEIQSL--FSRLDDQSLLEVVEKERELENLLEDDEIYWHQRAREDWLKWGDKNTKWFHMQANRRRKS----------------------

Query:  ----------SSPNMED--IEYILRIIPTTITEAQNSELTKCFTRDEIYGVIKKMHPSKAPGPDGIHAVFYQKYWDIVGD--------------------
                  +S  ++D  I  IL ++PT + E     ++  FT  E+Y  +  M   K+PG DG+  +F+  YW+IVG                     
Subjt:  ----------SSPNMED--IEYILRIIPTTITEAQNSELTKCFTRDEIYGVIKKMHPSKAPGPDGIHAVFYQKYWDIVGD--------------------

Query:  -----------------------------EVIAKTLANRMKLVLDTIISPTQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRVE
                                     ++++K++  R++  L  +IS  QS F+  RLI++N +I FE +HS+K+R++G  G AA+KLDMSKA+DRVE
Subjt:  -----------------------------EVIAKTLANRMKLVLDTIISPTQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRVE

Query:  WIYVKKIMEKMGFSARWLNLIMRCVESVILQVILNGSPRAEFIPKRGLRQGDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADDS
        W Y+ +IM KMGF    +NLI+RC++SV    +LNG    +  P RG+RQGDPLSPYLFLICAEGLS LL   + N  L GL ++R  P ++HLF+ADDS
Subjt:  WIYVKKIMEKMGFSARWLNLIMRCVESVILQVILNGSPRAEFIPKRGLRQGDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADDS

Query:  LLFFKASETECRTIHRILGTYERASGQTINFEKSNFMVSPNTSEVQSNIIKNILNVQHKDSLGQYLGLPSQNAQSKQEIFDNIKDRVWKALQGWKGKLFS
        +LF +A+    R+I ++L  Y RASGQ +N +K     SPNT +   N  + +LN+  +    +YLGLPS + + K  +F  I D++WK L  WK +LFS
Subjt:  LLFFKASETECRTIHRILGTYERASGQTINFEKSNFMVSPNTSEVQSNIIKNILNVQHKDSLGQYLGLPSQNAQSKQEIFDNIKDRVWKALQGWKGKLFS

Query:  AAGKEILIKSVAQAIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERKIHWRSWKRLCIHKNHGGLGFRDLSLFNQAILAKQSWRLIRYPDSLMARVL
        A GKE+L+K+V QAIP YAMS F+   +LC+++ S  A FWWG++ +   IHW++W  LC  K  GGLGFR+   FNQA+LAKQ+WRL+  P+SL++R+L
Subjt:  AAGKEILIKSVAQAIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERKIHWRSWKRLCIHKNHGGLGFRDLSLFNQAILAKQSWRLIRYPDSLMARVL

Query:  RGRYFKTGHFLKAQVGNNPSYVWRSIIWGRELFKRGYRWRIGKGFNIEAATDPWIPKEGSCKPIT
          RYF  G  L A +GN PS  WRSI+WG+EL  +G RWR+G G  I   TDPW+P      P +
Subjt:  RGRYFKTGHFLKAQVGNNPSYVWRSIIWGRELFKRGYRWRIGKGFNIEAATDPWIPKEGSCKPIT

XP_030509050.1 uncharacterized protein LOC115723712 [Cannabis sativa]6.2e-16534.88Show/hide
Query:  MKTISWNVQWLGNPRSLRALRHLVRSHHPQLVFLMETKLQQSGREKVRRDLQFDCCLAVPSSGQSGGLMLLWKNETNVSIRSFSKGHIDAMVS-EDGTGW
        M  +SWN + LGNP + R LR L+    P ++FLMETKLQ    +K R  L F   + VP  G  GGLMLLWK E +V+I +FS  HID  +  +D   +
Subjt:  MKTISWNVQWLGNPRSLRALRHLVRSHHPQLVFLMETKLQQSGREKVRRDLQFDCCLAVPSSGQSGGLMLLWKNETNVSIRSFSKGHIDAMVS-EDGTGW

Query:  RFTGIYGNPNREHHHETWNLIKRLKE------------------------GPKQIVDHV-----------------------W----------RDRLSMG
         FTG YG+P       TW L+KR  +                        GP +  D +                       W          ++RL  G
Subjt:  RFTGIYGNPNREHHHETWNLIKRLKE------------------------GPKQIVDHV-----------------------W----------RDRLSMG

Query:  ----------HVSILEKLN-----------------------------------------------------------------ECLSHLKTWSHRQYGG
                  H  IL+ L+                                                                  C S L +W H ++ G
Subjt:  ----------HVSILEKLN-----------------------------------------------------------------ECLSHLKTWSHRQYGG

Query:  SIRGAIDKKEKEIQSL--FSRLDDQSLLEVVEKERELENLLEDDEIYWHQRAREDWLKWGDKNTKWFHMQANRRRK------------------------
         ++  I    K ++ L   S    +    +   E+ L+ LL  +E YWHQR+R  WLK GD NTK+FH +AN R                          
Subjt:  SIRGAIDKKEKEIQSL--FSRLDDQSLLEVVEKERELENLLEDDEIYWHQRAREDWLKWGDKNTKWFHMQANRRRK------------------------

Query:  ----------SSSPNMEDIEYILRIIPTTITEAQNSELTKCFTRDEIYGVIKKMHPSKAPGPDGIHAVFYQKYWDIVGD---------------------
                  S   + + I+ +L  IP  I+      L+  FT  E+Y  +K M    +PG DG+  +FY  YW IVG                      
Subjt:  ----------SSSPNMEDIEYILRIIPTTITEAQNSELTKCFTRDEIYGVIKKMHPSKAPGPDGIHAVFYQKYWDIVGD---------------------

Query:  ----------------------------EVIAKTLANRMKLVLDTIISPTQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRVEW
                                    ++++KT+  R++  +  +IS  QS F+  RLI++N ++ FE +HS+K+R++G+ G AA+KLDMSKA+DRVEW
Subjt:  ----------------------------EVIAKTLANRMKLVLDTIISPTQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRVEW

Query:  IYVKKIMEKMGFSARWLNLIMRCVESVILQVILNGSPRAEFIPKRGLRQGDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADDSL
         +++++M K+GF    + LI+RC++SV    +LNGS +    P+RG+RQGDPLSPYLFLIC+EG S LL   +    L GL ++R  P +THL +ADDS+
Subjt:  IYVKKIMEKMGFSARWLNLIMRCVESVILQVILNGSPRAEFIPKRGLRQGDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADDSL

Query:  LFFKASETECRTIHRILGTYERASGQTINFEKSNFMVSPNTSEVQSNIIKNILNVQHKDSLGQYLGLPSQNAQSKQEIFDNIKDRVWKALQGWKGKLFSA
        LF +AS +  R IH  L  Y RASGQ +N EKS    SPNT   + +  +++LN+  +    QYLGLPS + + K ++F  I D++WK +  W+ +LFS 
Subjt:  LFFKASETECRTIHRILGTYERASGQTINFEKSNFMVSPNTSEVQSNIIKNILNVQHKDSLGQYLGLPSQNAQSKQEIFDNIKDRVWKALQGWKGKLFSA

Query:  AGKEILIKSVAQAIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERKIHWRSWKRLCIHKNHGGLGFRDLSLFNQAILAKQSWRLIRYPDSLMARVLR
         GKE+L+K+V QAIP YAMS F+  V LCN++    +RFWWG S N   IHW++WK LC  K HGG+GFR+   FNQA+LAKQ+WR++  P SL+ARVL+
Subjt:  AGKEILIKSVAQAIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERKIHWRSWKRLCIHKNHGGLGFRDLSLFNQAILAKQSWRLIRYPDSLMARVLR

Query:  GRYFKTGHFLKAQVGNNPSYVWRSIIWGRELFKRGYRWRIGKGFNIEAATDPWIPKEGSCKPIT-IHPD
         RYF TG FL+A  G  PS  W+SI+WG+EL  +G RWRIG G ++   + PWIP   + KP+  + PD
Subjt:  GRYFKTGHFLKAQVGNNPSYVWRSIIWGRELFKRGYRWRIGKGFNIEAATDPWIPKEGSCKPIT-IHPD

XP_030931246.1 uncharacterized protein LOC115957168 [Quercus lobata]2.3e-16437.8Show/hide
Query:  MKTISWNVQWLGNPRSLRALRHLVRSHHPQLVFLMETKLQQSGREKVRRDLQFDCCLAVPSSGQSGGLMLLWKNETNVSIRSFSKGHIDAMV-SEDGTGW
        MK +SWN Q LGNP ++R+L  +V+   P + FLMET+L + G +K  +DL F   + V +    GGL L+WK+  +V + +F++ H  A V  EDG  W
Subjt:  MKTISWNVQWLGNPRSLRALRHLVRSHHPQLVFLMETKLQQSGREKVRRDLQFDCCLAVPSSGQSGGLMLLWKNETNVSIRSFSKGHIDAMV-SEDGTGW

Query:  RFTGIYGNPNREHHHETWNLIKRLK---EGPKQIVD------HVWRDRLSMG-HVSILEKLNECLS--HLK---------TWSHRQYG-GSIRGAIDK--
          TG YG P+     ++W L+  L    EGP   +       H+   + +     S +++  E L   HL          TW++++ G  + R  +D+  
Subjt:  RFTGIYGNPNREHHHETWNLIKRLK---EGPKQIVD------HVWRDRLSMG-HVSILEKLNECLS--HLK---------TWSHRQYG-GSIRGAIDK--

Query:  -----------------------------------------------KEKEIQSLFSRLD--------DQSLLEVVEKERELENLLEDDEIYWHQRARED
                                                         + I++L  R++        D S +E +   +EL++LL   EIYW Q +R  
Subjt:  -----------------------------------------------KEKEIQSLFSRLD--------DQSLLEVVEKERELENLLEDDEIYWHQRARED

Query:  WLKWGDKNTKWFHMQA-NRRRKS-------------------------------SSPNMEDIEYILRIIPTTITEAQNSELTKCFTRDEIYGVIKKMHPS
        WLK GDKNTK+FH +A NRRR++                               ++   + +E  L  +   ITE     LT+ ++ DEI   + +M P+
Subjt:  WLKWGDKNTKWFHMQA-NRRRKS-------------------------------SSPNMEDIEYILRIIPTTITEAQNSELTKCFTRDEIYGVIKKMHPS

Query:  KAPGPDGIHAVFYQKYWDIVGDEV-------------------------------------------------IAKTLANRMKLVLDTIISPTQSVFVPG
        KAPGPDG++A+FYQK+W+IVGD+V                                                 I+K LAN++K +L  IIS TQS FVP 
Subjt:  KAPGPDGIHAVFYQKYWDIVGDEV-------------------------------------------------IAKTLANRMKLVLDTIISPTQSVFVPG

Query:  RLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRVEWIYVKKIMEKMGFSARWLNLIMRCVESVILQVILNGSPRAEFIPKRGLRQGDPLSPYL
        RLI++N ++ +EC+H++  R+KGK G  ALKLD+SKAYDRVEW ++K IMEKMGF   W++ +M CV +    V +NG P     P RG+RQGDPLSPYL
Subjt:  RLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRVEWIYVKKIMEKMGFSARWLNLIMRCVESVILQVILNGSPRAEFIPKRGLRQGDPLSPYL

Query:  FLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADDSLLFFKASETECRTIHRILGTYERASGQTINFEKSNFMVSPNTSEVQSNIIKNILNVQH
        FL+CAEG + LL  +++  ++ G+ I R  P +++L +ADDSL+F +A+  E + +  IL  Y  ASGQ IN EKS+   S NT       IKN+L VQ 
Subjt:  FLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADDSLLFFKASETECRTIHRILGTYERASGQTINFEKSNFMVSPNTSEVQSNIIKNILNVQH

Query:  KDSLGQYLGLPSQNAQSKQEIFDNIKDRVWKALQGWKGKLFSAAGKEILIKSVAQAIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERKIHWRSWKR
              YLGLP+   +SK + F  +KDR+WK LQGWKGKL S AGKE+LIK+VAQ+IP Y M  F   + LCNELN+ CARFWWG   +ERKIHW+SW  
Subjt:  KDSLGQYLGLPSQNAQSKQEIFDNIKDRVWKALQGWKGKLFSAAGKEILIKSVAQAIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERKIHWRSWKR

Query:  LCIHKNHGGLGFRDLSLFNQAILAKQSWRLIRYPDSLMARVLRGRYFKTGHFLKAQVGNNPSYVWRSIIWGRELFKRGYRWRIGKGFNIEAATDPWIPKE
        +   K  GG+GFRD+  FN A+LAKQ WRLI+   SL+    + RYF    FL+A+  +N SYVW+SI+  +E+ K G  WR+G G +I    + WIP  
Subjt:  LCIHKNHGGLGFRDLSLFNQAILAKQSWRLIRYPDSLMARVLRGRYFKTGHFLKAQVGNNPSYVWRSIIWGRELFKRGYRWRIGKGFNIEAATDPWIPKE

Query:  GS
         S
Subjt:  GS

TrEMBL top hitse value%identityAlignment
A0A2N9G497 Reverse transcriptase domain-containing protein6.9e-17039.54Show/hide
Query:  LGNPRSLRALRHLVRSHHPQLVFLMETKLQQSGREKVRRDLQFDCCLAVPSSGQSGGLMLLWKNETNVSIRSFSKGHIDAMV-SEDGTGWRFTGIYGNPN
        LGNP ++RAL H+V+   P+++FLMETKL     E +R  L FD    VPS G+SGGL LLWK +  V I+++S+ HIDA V S+    WR TG YG P 
Subjt:  LGNPRSLRALRHLVRSHHPQLVFLMETKLQQSGREKVRRDLQFDCCLAVPSSGQSGGLMLLWKNETNVSIRSFSKGHIDAMV-SEDGTGWRFTGIYGNPN

Query:  REHHHETWNLIKRLK-----------------------------EGP--KQIVDHV-----WRD---RLSMGHV--SILEKLNECLSHLKTWSHRQYGGS
        +    E+W L+K L                              + P  K+ +D       W D   R S+ H+  S+ + L   +S +   S  +    
Subjt:  REHHHETWNLIKRLK-----------------------------EGP--KQIVDHV-----WRD---RLSMGHV--SILEKLNECLSHLKTWSHRQYGGS

Query:  IRGAIDK------KEKEIQSLFSRLDDQSL--LEVVEKER--------------ELENLLEDDEIYWHQRAREDWLKWGDKNTKWFHMQANRRRKS----
        +R   +K       EK IQ  + +  +  L  LE +  +               E+  LL  DE++W QR+RE WL  GDKNT++FH +A +RR      
Subjt:  IRGAIDK------KEKEIQSLFSRLDDQSL--LEVVEKER--------------ELENLLEDDEIYWHQRAREDWLKWGDKNTKWFHMQANRRRKS----

Query:  ----------------------------SSPNMEDIEYILRIIPTTITEAQNSELTKCFTRDEIYGVIKKMHPSKAPGPDGIHAVFYQKYWDIVGDEVI-
                                    S+  + ++E  +R IP+ +T A N +L   FT  EI     +MHPSKAPGPDG+ + F+QKYW IVG +V+ 
Subjt:  ----------------------------SSPNMEDIEYILRIIPTTITEAQNSELTKCFTRDEIYGVIKKMHPSKAPGPDGIHAVFYQKYWDIVGDEVI-

Query:  ------------------------------------------------AKTLANRMKLVLDTIISPTQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDG
                                                        +K LANR+K VL  IIS +QS FVPGR I++N  + FE +H +++RRKGK  
Subjt:  ------------------------------------------------AKTLANRMKLVLDTIISPTQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDG

Query:  VAALKLDMSKAYDRVEWIYVKKIMEKMGFSARWLNLIMRCVESVILQVILNGSPRAEFIPKRGLRQGDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLI
          ALKLDMSKAYDRVEW +++++ME+MGF++RW+ L+M CV++    V+LNG P     P RG+RQGDPLSPYLFL+CAEGLS LL  ++  +++ G+ +
Subjt:  VAALKLDMSKAYDRVEWIYVKKIMEKMGFSARWLNLIMRCVESVILQVILNGSPRAEFIPKRGLRQGDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLI

Query:  NRHCPILTHLFYADDSLLFFKASETECRTIHRILGTYERASGQTINFEKSNFMVSPNTSEVQSNIIKNILNVQHKDSLGQYLGLPSQNAQSKQEIFDNIK
         R+ P ++HL +ADDSLLF +A+E EC  +  +L  YERASGQ +N EK+    S NT E     I+ +  VQ   +  +YLGLP+   +SKQ  F+N+K
Subjt:  NRHCPILTHLFYADDSLLFFKASETECRTIHRILGTYERASGQTINFEKSNFMVSPNTSEVQSNIIKNILNVQHKDSLGQYLGLPSQNAQSKQEIFDNIK

Query:  DRVWKALQGWKGKLFSAAGKEILIKSVAQAIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERKIHWRSWKRLCIHKNHGGLGFRDLSLFNQAILAKQ
        +R+ + LQGWK +L S AG+ ILIK++AQAIP Y MS FK   + C ++NS  + +WWG    E KIHW +W RLC  K  GG+GFRD+  FN A+LAKQ
Subjt:  DRVWKALQGWKGKLFSAAGKEILIKSVAQAIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERKIHWRSWKRLCIHKNHGGLGFRDLSLFNQAILAKQ

Query:  SWRLIRYPDSLMARVLRGRYFKTGHFLKAQVGNNPSYVWRSIIWGRELFKRGYRWRIGKGFNIEAATDPW
         WRL+  P SL A+  + +YF    FLKA++G+NPSY+WRSI+  REL ++G RW+IG G       D W
Subjt:  SWRLIRYPDSLMARVLRGRYFKTGHFLKAQVGNNPSYVWRSIIWGRELFKRGYRWRIGKGFNIEAATDPW

A0A2N9G8I6 Reverse transcriptase domain-containing protein1.9e-16736.34Show/hide
Query:  APSGTMKTISWNVQWLGNPRSLRALRHLVRSHHPQLVFLMETKLQQSGREKVRRDLQFDCCLAVPSSGQSGGLMLLWKNETNVSIRSFSKGHIDAMV-SE
        AP  TM  +SWN Q LGNP ++RAL H+V+   P+++FLMETKL     E +R  L FD    VPS G+SGGL LLWK +  V I+++S+ HIDA V S+
Subjt:  APSGTMKTISWNVQWLGNPRSLRALRHLVRSHHPQLVFLMETKLQQSGREKVRRDLQFDCCLAVPSSGQSGGLMLLWKNETNVSIRSFSKGHIDAMV-SE

Query:  DGTGWRFTGIYGNPNREHHHETWNLIKRL----------------------KEG----------------------------------------------
            WR TG YG P +    E+W L+K L                      K G                                              
Subjt:  DGTGWRFTGIYGNPNREHHHETWNLIKRL----------------------KEG----------------------------------------------

Query:  ----------------------PKQIVDHV---------------------------------------WRDRLSMGH--VSILEKLNECLSHLKTWSHR
                              P  + DH+                                       WR  +S+G     + +K++ C   L  WS  
Subjt:  ----------------------PKQIVDHV---------------------------------------WRDRLSMGH--VSILEKLNECLSHLKTWSHR

Query:  QYG-GSIRGAIDKKEKEIQSLFS-RLDDQSLLEVVEKERELENLLEDDEIYWHQRAREDWLKWGDKNTKWFHMQANRRRKS-------------------
         +   S++  ++ K + +++L +     +   ++     E+  LL  DE++W QR+RE WL  GDKNT++FH +A +RR                     
Subjt:  QYG-GSIRGAIDKKEKEIQSLFS-RLDDQSLLEVVEKERELENLLEDDEIYWHQRAREDWLKWGDKNTKWFHMQANRRRKS-------------------

Query:  -------------SSPNMEDIEYILRIIPTTITEAQNSELTKCFTRDEIYGVIKKMHPSKAPGPDGIHAVFYQKYWDIVGDEVI----------------
                     S+  + ++E  +R IP+ +T   N +L   FT  EI     +MHPSKAPGPDG+ + F+QKYW IVG +V+                
Subjt:  -------------SSPNMEDIEYILRIIPTTITEAQNSELTKCFTRDEIYGVIKKMHPSKAPGPDGIHAVFYQKYWDIVGDEVI----------------

Query:  ---------------------------------AKTLANRMKLVLDTIISPTQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRV
                                         +K LANR+K VL  IIS +QS FVPGR I++N  + FE +H +++RRKGK    ALKLDMSKAYDRV
Subjt:  ---------------------------------AKTLANRMKLVLDTIISPTQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRV

Query:  EWIYVKKIMEKMGFSARWLNLIMRCVESVILQVILNGSPRAEFIPKRGLRQGDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADD
        EW +++ +ME+MGF++RW+ L+M CV++    V+LNG P     P RG+RQGDPLSPYLFL+CAEGLS LL  ++  +++ G+ + R+ P ++HL +ADD
Subjt:  EWIYVKKIMEKMGFSARWLNLIMRCVESVILQVILNGSPRAEFIPKRGLRQGDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADD

Query:  SLLFFKASETECRTIHRILGTYERASGQTINFEKSNFMVSPNTSEVQSNIIKNILNVQHKDSLGQYLGLPSQNAQSKQEIFDNIKDRVWKALQGWKGKLF
        SLLF +A+E EC  +  +L  YERASGQ +N EK+    S NT E     I+ +  VQ   +  +YLGLP+   +SKQ  F+N+K+R+ + LQGWK +L 
Subjt:  SLLFFKASETECRTIHRILGTYERASGQTINFEKSNFMVSPNTSEVQSNIIKNILNVQHKDSLGQYLGLPSQNAQSKQEIFDNIKDRVWKALQGWKGKLF

Query:  SAAGKEILIKSVAQAIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERKIHWRSWKRLCIHKNHGGLGFRDLSLFNQAILAKQSWRLIRYPDSLMARV
        S AG+ ILIK++AQAIP Y MS FK   + C ++NS  + +WWG    E KIHW +W RLC  K  GG+GFRD+  FN A+LAKQ WRL+  P SL A+ 
Subjt:  SAAGKEILIKSVAQAIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERKIHWRSWKRLCIHKNHGGLGFRDLSLFNQAILAKQSWRLIRYPDSLMARV

Query:  LRGRYFKTGHFLKAQVGNNPSYVWRSIIWGRELFKRGYRWRIGKGFNIEAATDPW
         + +YF    FLKA++G+NPSY+WRSI+  REL ++G RW+IG G       D W
Subjt:  LRGRYFKTGHFLKAQVGNNPSYVWRSIIWGRELFKRGYRWRIGKGFNIEAATDPW

A0A2N9IBI9 Reverse transcriptase domain-containing protein6.4e-16836.34Show/hide
Query:  APSGTMKTISWNVQWLGNPRSLRALRHLVRSHHPQLVFLMETKLQQSGREKVRRDLQFDCCLAVPSSGQSGGLMLLWKNETNVSIRSFSKGHIDAMV-SE
        AP  TM  +SWN Q LGNP ++RAL H+V+   P+++FLMETKL     E +R  L FD    VPS G+SGGL LLWK +  V I+++S+ HIDA V S+
Subjt:  APSGTMKTISWNVQWLGNPRSLRALRHLVRSHHPQLVFLMETKLQQSGREKVRRDLQFDCCLAVPSSGQSGGLMLLWKNETNVSIRSFSKGHIDAMV-SE

Query:  DGTGWRFTGIYGNPNREHHHETWNLIKRL----------------------KEG----------------------------------------------
            WR TG YG P +    E+W L+K L                      K G                                              
Subjt:  DGTGWRFTGIYGNPNREHHHETWNLIKRL----------------------KEG----------------------------------------------

Query:  ----------------------PKQIVDHV---------------------------------------WRDRLSMGH--VSILEKLNECLSHLKTWSHR
                              P  + DH+                                       WR  +S+G     + +K++ C   L  WS  
Subjt:  ----------------------PKQIVDHV---------------------------------------WRDRLSMGH--VSILEKLNECLSHLKTWSHR

Query:  QYG-GSIRGAIDKKEKEIQSLFS-RLDDQSLLEVVEKERELENLLEDDEIYWHQRAREDWLKWGDKNTKWFHMQANRRRKS-------------------
         +   S++  ++ K + +++L +     +   ++     E+  LL  DE++W QR+RE WL  GDKNT++FH +A +RR                     
Subjt:  QYG-GSIRGAIDKKEKEIQSLFS-RLDDQSLLEVVEKERELENLLEDDEIYWHQRAREDWLKWGDKNTKWFHMQANRRRKS-------------------

Query:  -------------SSPNMEDIEYILRIIPTTITEAQNSELTKCFTRDEIYGVIKKMHPSKAPGPDGIHAVFYQKYWDIVGDEVI----------------
                     S+  + ++E  +R IP+ +T A N +L   FT  EI     +MHPSKAPGPDG+ + F+QKYW IVG +V+                
Subjt:  -------------SSPNMEDIEYILRIIPTTITEAQNSELTKCFTRDEIYGVIKKMHPSKAPGPDGIHAVFYQKYWDIVGDEVI----------------

Query:  ---------------------------------AKTLANRMKLVLDTIISPTQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRV
                                         +K LANR+K VL  IIS +QS FVPGR I++N  + FE +H +++RRKGK    ALKLDMSKAYDRV
Subjt:  ---------------------------------AKTLANRMKLVLDTIISPTQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRV

Query:  EWIYVKKIMEKMGFSARWLNLIMRCVESVILQVILNGSPRAEFIPKRGLRQGDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADD
        EW +++++ME+MGF++RW++L+M CV++    V+LNG P     P RG+RQGDPLSPYLFL+CAEGLS LL  ++  +++ G+ + R+ P ++HL +ADD
Subjt:  EWIYVKKIMEKMGFSARWLNLIMRCVESVILQVILNGSPRAEFIPKRGLRQGDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADD

Query:  SLLFFKASETECRTIHRILGTYERASGQTINFEKSNFMVSPNTSEVQSNIIKNILNVQHKDSLGQYLGLPSQNAQSKQEIFDNIKDRVWKALQGWKGKLF
        SLLF +A++ EC  +  +L  YERASGQ +N EK+    S NT E     I+ +  VQ   +  +YLGLP+   +SKQ  F+N+K+R+ + LQGWK +L 
Subjt:  SLLFFKASETECRTIHRILGTYERASGQTINFEKSNFMVSPNTSEVQSNIIKNILNVQHKDSLGQYLGLPSQNAQSKQEIFDNIKDRVWKALQGWKGKLF

Query:  SAAGKEILIKSVAQAIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERKIHWRSWKRLCIHKNHGGLGFRDLSLFNQAILAKQSWRLIRYPDSLMARV
        S AG+ ILIK++AQAIP Y MS FK   + C ++NS  + +WWG    E KIHW +W RLC  K  GG+GFRD+  FN A+LAKQ WRL+  P SL A+ 
Subjt:  SAAGKEILIKSVAQAIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERKIHWRSWKRLCIHKNHGGLGFRDLSLFNQAILAKQSWRLIRYPDSLMARV

Query:  LRGRYFKTGHFLKAQVGNNPSYVWRSIIWGRELFKRGYRWRIGKGFNIEAATDPW
         + +YF    FLKA++G+NPSY+WRSI+  REL ++G RW+IG G       D W
Subjt:  LRGRYFKTGHFLKAQVGNNPSYVWRSIIWGRELFKRGYRWRIGKGFNIEAATDPW

A0A2N9IPS8 Reverse transcriptase domain-containing protein3.5e-17435.92Show/hide
Query:  APSGTMKTISWNVQWLGNPRSLRALRHLVRSHHPQLVFLMETKLQQSGREKVRRDLQFDCCLAVPSSGQSGGLMLLWKNETNVSIRSFSKGHIDAMV--S
        AP   M+ +SWN Q LGN  ++R L  L++   P ++FL ET+L + G E++R  ++FD    VP  G  GGL +LW  + +V + ++S+ HIDA +   
Subjt:  APSGTMKTISWNVQWLGNPRSLRALRHLVRSHHPQLVFLMETKLQQSGREKVRRDLQFDCCLAVPSSGQSGGLMLLWKNETNVSIRSFSKGHIDAMV--S

Query:  EDGTGWRFTGIYGNPNREHHHETWNLIKRLK---------------------------------------------------------------------
        E G G+R TG YGNP      E+W L+K L                                                                      
Subjt:  EDGTGWRFTGIYGNPNREHHHETWNLIKRLK---------------------------------------------------------------------

Query:  --------------------------------------EGP---------------------KQIVDHVWRDRLSMGH--VSILEKLNECLSHLKTWSHR
                                              +GP                     ++++DH W D ++ G     ++EK+  C + L  WS  
Subjt:  --------------------------------------EGP---------------------KQIVDHVWRDRLSMGH--VSILEKLNECLSHLKTWSHR

Query:  QYGGSIRGAIDKKEKEIQSLFSRLDDQSLLEVVEKERELENLLEDDEIYWHQRAREDWLKWGDKNTKWFHMQANRRRK----------------------
        ++ GS+  +I +K +++Q L +         ++E + +L  LLE +EI+W QR+R  W+  GDKNTK+FH Q N RR+                      
Subjt:  QYGGSIRGAIDKKEKEIQSLFSRLDDQSLLEVVEKERELENLLEDDEIYWHQRAREDWLKWGDKNTKWFHMQANRRRK----------------------

Query:  ------------SSSPNMEDIEYILRIIPTTITEAQNSELTKCFTRDEIYGVIKKMHPSKAPGPDGIHAVFYQKYWDIVGDEV-----------------
                    SS+P+ E I  +L+ + + +T A N +L   FT+DE+   +K+M+P+KAPGPDG+ A+FYQ YWDIVG EV                 
Subjt:  ------------SSSPNMEDIEYILRIIPTTITEAQNSELTKCFTRDEIYGVIKKMHPSKAPGPDGIHAVFYQKYWDIVGDEV-----------------

Query:  --------------------------------IAKTLANRMKLVLDTIISPTQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRV
                                        ++K LANR+K VL  +IS  QS FVPGRLI++N ++ FE +HS+  +RKGK G  ALKLDMSKAYDRV
Subjt:  --------------------------------IAKTLANRMKLVLDTIISPTQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRV

Query:  EWIYVKKIMEKMGFSARWLNLIMRCVESVILQVILNGSPRAEFIPKRGLRQGDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADD
        EW++++ IM  MGF+  W+ L+M C+ SV   V++NG     F   RG+RQGD LSPYLFLICAEGLS LL  + ++K LTG+  +R  P LTHLF+ADD
Subjt:  EWIYVKKIMEKMGFSARWLNLIMRCVESVILQVILNGSPRAEFIPKRGLRQGDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADD

Query:  SLLFFKASETECRTIHRILGTYERASGQTINFEKSNFMVSPNTSEVQSNIIKNILNVQHKDSLGQYLGLPSQNAQSKQEIFDNIKDRVWKALQGWKGKLF
        SLLF +A+   C  +  IL  YE ASGQ +N  K++   + +TS      I++   V    S  +YLGLPS   +SK   F  IK RVW+ + GWK K  
Subjt:  SLLFFKASETECRTIHRILGTYERASGQTINFEKSNFMVSPNTSEVQSNIIKNILNVQHKDSLGQYLGLPSQNAQSKQEIFDNIKDRVWKALQGWKGKLF

Query:  SAAGKEILIKSVAQAIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERKIHWRSWKRLCIHKNHGGLGFRDLSLFNQAILAKQSWRLIRYPDSLMARV
        S AG+E+LIK+VAQ+IP Y+MS FK   SLCN+LN+  + FWWG  +  +K HW  W +LC  K  GGLGFRDL  FN A+LAKQ WR +++ +SL+ RV
Subjt:  SAAGKEILIKSVAQAIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERKIHWRSWKRLCIHKNHGGLGFRDLSLFNQAILAKQSWRLIRYPDSLMARV

Query:  LRGRYFKTGHFLKAQVGNNPSYVWRSIIWGRELFKRGYRWRIGKGFNIEAATDPWIPKEGSCKPIT
         + +YF  G F+ A +GN PSY WRSI   R++ + G +W IG G +++ + DPW+P   S K ++
Subjt:  LRGRYFKTGHFLKAQVGNNPSYVWRSIIWGRELFKRGYRWRIGKGFNIEAATDPWIPKEGSCKPIT

A0A803QGT2 Uncharacterized protein1.4e-17032.14Show/hide
Query:  MKTISWNVQWLGNPRSLRALRHLVRSHHPQLVFLMETKLQQSGREKVRRDLQFDCCLAVPSSGQSGGLMLLWKNETNVSIRSFSKGHIDAMV-SEDGTGW
        MK +SWN + LGN  + R LR LV    PQ++FLMETKL      + +    F   L V   G  GG+MLLWK+  +V++ S +  H D  V  +DG  W
Subjt:  MKTISWNVQWLGNPRSLRALRHLVRSHHPQLVFLMETKLQQSGREKVRRDLQFDCCLAVPSSGQSGGLMLLWKNETNVSIRSFSKGHIDAMV-SEDGTGW

Query:  RFTGIYGNPNREHHHETWNLIKRLKE------------------------GPKQ----------------------------------------------
            IYG P   +   TW LI+RL +                        GP +                                              
Subjt:  RFTGIYGNPNREHHHETWNLIKRLKE------------------------GPKQ----------------------------------------------

Query:  IVDHVWRDRL---SMGHVSILEKLNECLSH-----------LKTWSHRQYGGSIRGAIDKKEKEIQSLFS--RLDDQSLLEVVEKERELENLLEDDEIYW
         +++VW+D L   ++ H+      +  LS            L+ W  +++ G ++  I K ++E+  L S     D  L  V   E+ LE LL ++E YW
Subjt:  IVDHVWRDRL---SMGHVSILEKLNECLSH-----------LKTWSHRQYGGSIRGAIDKKEKEIQSLFS--RLDDQSLLEVVEKERELENLLEDDEIYW

Query:  HQRAREDWLKWGDKNTKWFHMQANRRRKSSS------------PNMEDI----------------------EYILRIIPTTITEAQNSELTKCFTRDEIY
         QR+R DWLK GD+NTK+FH +A+ R  ++              + +DI                       ++L  IPTTI+  QN  L + FTR ++Y
Subjt:  HQRAREDWLKWGDKNTKWFHMQANRRRKSSS------------PNMEDI----------------------EYILRIIPTTITEAQNSELTKCFTRDEIY

Query:  GVIKKMHPSKAPGPDGIHAVFYQKYWDIVGD-------------------------------------------------EVIAKTLANRMKLVLDTIIS
          +K M   K+PG DG+ A+FYQ YW IVGD                                                 ++I+K LA R+K VL ++IS
Subjt:  GVIKKMHPSKAPGPDGIHAVFYQKYWDIVGD-------------------------------------------------EVIAKTLANRMKLVLDTIIS

Query:  PTQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRVEWIYVKKIMEKMGFSARWLNLIMRCVESVILQVILNGSPRAEFIPKRGLR
         TQS F+  RLI++N ++ FE IHS+K R++G  G AALK DMSKA+DRVEW ++  +M KMGF+ RW+ LIM C+ + +    +NG       P RGLR
Subjt:  PTQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRVEWIYVKKIMEKMGFSARWLNLIMRCVESVILQVILNGSPRAEFIPKRGLR

Query:  QGDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADDSLLFFKASETECRTIHRILGTYERASGQTINFEKSNFMVSPNTSEVQSNI
        QGDPLSPYLFLIC+EGLS LL + +    L GL ++RH P ++HLF+ADDSLLF +A++  C  I R L  Y RASGQ +N +KS    SPNT     N 
Subjt:  QGDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADDSLLFFKASETECRTIHRILGTYERASGQTINFEKSNFMVSPNTSEVQSNI

Query:  IKNILNVQHKDSLGQYLGLPSQNAQSKQEIFDNIKDRVWKALQGWKGKLFSAAGKEILIKSVAQAIPNYAMSYFKFLVSLCNELNSFCARFWWGASENER
         + IL +   +    YLGLP+ + + K ++F NIK+++WK +  W  K+FS  GKE+L+K+V Q+IP YAMS F+  V LCNE+ +  A+FWWG+S + +
Subjt:  IKNILNVQHKDSLGQYLGLPSQNAQSKQEIFDNIKDRVWKALQGWKGKLFSAAGKEILIKSVAQAIPNYAMSYFKFLVSLCNELNSFCARFWWGASENER

Query:  KIHWRSWKRLCIHKNHGGLGFRDLSLFNQAILAKQSWRLIRYPDSLMARVLRGRYFKTGHFLKAQVGNNPSYVWRSIIWGRELFKRGYRWRIGKGFNIEA
        KIHW+ W+ LC  K  GG+GFR    FNQA+LAKQ+WR+ + P SL++RVL+G YF    F+ A+ G   S  W+ I+WGREL  +G R ++G G NI  
Subjt:  KIHWRSWKRLCIHKNHGGLGFRDLSLFNQAILAKQSWRLIRYPDSLMARVLRGRYFKTGHFLKAQVGNNPSYVWRSIIWGRELFKRGYRWRIGKGFNIEA

Query:  ATDPWIPKEGSCKPI------TIH----------------------PDVQQYSETPGRDHP----FVW-----GVY--------------------LNNR
        A D WIP     KP       T H                      PDV    + P    P    ++W     G Y                     + +
Subjt:  ATDPWIPKEGSCKPI------TIH----------------------PDVQQYSETPGRDHP----FVW-----GVY--------------------LNNR

Query:  QGW----------TTTDYFTW----------------------------------------------TWKNSKGEDLD----------DRRMAVS-----
        + W          +    F W                                               WK S G  +D          D  M +S     
Subjt:  QGW----------TTTDYFTW----------------------------------------------TWKNSKGEDLD----------DRRMAVS-----

Query:  ------LVMVWLIWSHRNEVIHSRKQPDMEILKAQIHKYSAELIHNKDSHLDQNHSSIVDHVC-----NTPMAPLNPWNPIPTGTWRLSCDATWSDGKSR
              L ++W IWS RN  IH +K      +K  +  ++  +     +++DQ + SI   V       T  A +  W P P  T++L+ DA     +S+
Subjt:  ------LVMVWLIWSHRNEVIHSRKQPDMEILKAQIHKYSAELIHNKDSHLDQNHSSIVDHVC-----NTPMAPLNPWNPIPTGTWRLSCDATWSDGKSR

Query:  GGIGWVVRDWCGNMLRTGYKCVLRAWKISWLEAFAICEGLKSLPT-EKPQLRNETDCLQV
         GIG +VR+  G +        +  +K   +EA A+  GL    T + P    ETDCL +
Subjt:  GGIGWVVRDWCGNMLRTGYKCVLRAWKISWLEAFAICEGLKSLPT-EKPQLRNETDCLQV

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein8.4e-2422.72Show/hide
Query:  HMQANRRRKSSSPNMEDIEYILRIIP-TTITEAQNSELTKCFTRDEIYGVIKKMHPSKAPGPDGIHAVFYQKYWD-------------------------
        H+ AN+       N+E+++  L       + + +   L +  T  EI  +I  +   K+PGPDG  A FYQ+Y +                         
Subjt:  HMQANRRRKSSSPNMEDIEYILRIIP-TTITEAQNSELTKCFTRDEIYGVIKKMHPSKAPGPDGIHAVFYQKYWD-------------------------

Query:  -------------------------IVGDEVIAKTLANRMKLVLDTIISPTQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRVE
                                  +  +++ K LANR++  +  +I   Q  F+PG     N       I  + +R K K+ V  + +D  KA+D+++
Subjt:  -------------------------IVGDEVIAKTLANRMKLVLDTIISPTQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRVE

Query:  WIYVKKIMEKMGFSARWLNLIMRCVESVILQVILNGSPRAEFIPKRGLRQGDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADDS
          ++ K + K+G    +L +I    +     +ILNG     F  K G RQG PLSP LF I  E L+  +   +  KE+ G+ + +    L+   +ADD 
Subjt:  WIYVKKIMEKMGFSARWLNLIMRCVESVILQVILNGSPRAEFIPKRGLRQGDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADDS

Query:  LLFFKASETECRTIHRILGTYERASGQTINFEKSNFMVSPNTSEVQSNIIKNILNVQHKDSLGQYLGLPSQNAQSKQEIF-DNIK---DRVWKALQGWKG
        +++ +      + + +++  + + SG  IN +KS   +  N  + +S I+  +        + +YLG+  Q  +  +++F +N K     + +    WK 
Subjt:  LLFFKASETECRTIHRILGTYERASGQTINFEKSNFMVSPNTSEVQSNIIKNILNVQHKDSLGQYLGLPSQNAQSKQEIF-DNIK---DRVWKALQGWKG

Query:  KLFSAAGKEILIKS--VAQAIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERKIHWRSWKRLCIHKNH-GGLGFRDLSLFNQAILAKQSW
           S  G+  ++K   + + I  +     K  ++   EL     +F W    N+++   R  K +   KN  GG+   D  L+ +A + K +W
Subjt:  KLFSAAGKEILIKS--VAQAIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERKIHWRSWKRLCIHKNH-GGLGFRDLSLFNQAILAKQSW

P0C2F6 Putative ribonuclease H protein At1g657503.0e-2131.09Show/hide
Query:  LPSQNAQSKQEIFDNIKDRVWKALQGWKGKLFSAAGKEILIKSVAQAIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERKIHWRSWKRLCIHKNHGG
        +P    +  ++ F  I +RV   + GW+ K  S AG+  L K+V  ++P ++MS      S+ N L+     F WG++  ++K H   W ++C  K  GG
Subjt:  LPSQNAQSKQEIFDNIKDRVWKALQGWKGKLFSAAGKEILIKSVAQAIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERKIHWRSWKRLCIHKNHGG

Query:  LGFRDLSLFNQAILAKQSWRLIRYPDSLMARVLRGRY----FKTGHFLKAQVGNNPSYVWRSIIWG-RELFKRGYRWRIGKGFNIEAATDPWI
        LG R     N+A+++K  WRL++  +SL   VL+ +Y     +   +L  +   + S  WRSI  G R++   G  W  G G  I   TD W+
Subjt:  LGFRDLSLFNQAILAKQSWRLIRYPDSLMARVLRGRY----FKTGHFLKAQVGNNPSYVWRSIIWG-RELFKRGYRWRIGKGFNIEAATDPWI

P11369 LINE-1 retrotransposable element ORF2 protein3.8e-2424.43Show/hide
Query:  NMEDIEYIL-RIIPTTITEAQNSELTKCFTRDEIYGVIKKMHPSKAPGPDGIHAVFYQKYWD--------------------------------------
        N+++++  L R     + + Q   L    +  EI  VI  +   K+PGPDG  A FYQ + +                                      
Subjt:  NMEDIEYIL-RIIPTTITEAQNSELTKCFTRDEIYGVIKKMHPSKAPGPDGIHAVFYQKYWD--------------------------------------

Query:  ------------IVGDEVIAKTLANRMKLVLDTIISPTQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRVEWIYVKKIMEKMGF
                     +  +++ K LANR++  +  II P Q  F+PG     N       IH + ++ K K+ +  + LD  KA+D+++  ++ K++E+ G 
Subjt:  ------------IVGDEVIAKTLANRMKLVLDTIISPTQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRVEWIYVKKIMEKMGF

Query:  SARWLNLIMRCVESVILQVILNGSPRAEFIP-KRGLRQGDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADDSLLFFKASETECR
           +LN+I       +  + +NG  + E IP K G RQG PLSPYLF I  E L+  +   +  KE+ G+ I +    ++ L  ADD +++    +   R
Subjt:  SARWLNLIMRCVESVILQVILNGSPRAEFIP-KRGLRQGDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADDSLLFFKASETECR

Query:  TIHRILGTYERASGQTINFEKSNFMVSPNTSEVQSNIIKNILNVQHKDSLGQYLG--LPSQNAQSKQEIFDNIKDRVWKALQGWKGKLFSAAGKEILIKS
         +  ++ ++    G  IN  KS   +     + +  I +        +++ +YLG  L  +      + F ++K  + + L+ WK    S  G+  ++K 
Subjt:  TIHRILGTYERASGQTINFEKSNFMVSPNTSEVQSNIIKNILNVQHKDSLGQYLG--LPSQNAQSKQEIFDNIKDRVWKALQGWKGKLFSAAGKEILIKS

Query:  --VAQAIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERKIHWRSWKRLCIHK-NHGGLGFRDLSLFNQAILAKQSW
          + +AI  +     K      NEL     +F W    N +K   R  K L   K   GG+   DL L+ +AI+ K +W
Subjt:  --VAQAIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERKIHWRSWKRLCIHK-NHGGLGFRDLSLFNQAILAKQSW

P14381 Transposon TX1 uncharacterized 149 kDa protein9.6e-2024.94Show/hide
Query:  ITEAQNSELTKCFTRDEIYGVIKKMHPSKAPGPDGIHAVFYQKYWDIVGDE-------------------------------------------------
        ++E +   L    T DE+   ++ M  +K+PG DG+   F+Q +WD +G +                                                 
Subjt:  ITEAQNSELTKCFTRDEIYGVIKKMHPSKAPGPDGIHAVFYQKYWDIVGDE-------------------------------------------------

Query:  VIAKTLANRMKLVLDTIISPTQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRVEWIYVKKIMEKMGFSARWLNLIMRCVESVIL
        ++AK ++ R+K VL  +I P QS  VPGR I +N  +  + +H   +RR G   +A L LD  KA+DRV+  Y+   ++   F  +++  +     S   
Subjt:  VIAKTLANRMKLVLDTIISPTQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRVEWIYVKKIMEKMGFSARWLNLIMRCVESVIL

Query:  QVILNGSPRAEFIPKRGLRQGDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADDSLLFFK-----ASETECRTIHRILGTYERAS
         V +N S  A     RG+RQG PLS  L+ +  E    L     L K LTGL++      +    YADD +L  +         EC+ +      Y  AS
Subjt:  QVILNGSPRAEFIPKRGLRQGDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADDSLLFFK-----ASETECRTIHRILGTYERAS

Query:  GQTINFEKSNFMVSPNTSEVQSNIIKNILNVQHKDSLGQYLGL-PSQNAQSKQEIFDNIKDRVWKALQGWKG--KLFSAAGKEILIKSV
           IN+ KS+ ++  +       +     ++  +  + +YLG+  S       + F  +++ V   L  WKG  K+ S  G+ ++I  +
Subjt:  GQTINFEKSNFMVSPNTSEVQSNIIKNILNVQHKDSLGQYLGL-PSQNAQSKQEIFDNIKDRVWKALQGWKG--KLFSAAGKEILIKSV

P93295 Uncharacterized mitochondrial protein AtMg003102.5e-3649.34Show/hide
Query:  AIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERKIHWRSWKRLCIHK-NHGGLGFRDLSLFNQAILAKQSWRLIRYPDSLMARVLRGRYFKTGHFLK
        A+P YAMS F+    LC +L S    FWW + EN+RKI W +W++LC  K + GGLGFRDL  FNQA+LAKQS+R+I  P +L++R+LR RYF     ++
Subjt:  AIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERKIHWRSWKRLCIHK-NHGGLGFRDLSLFNQAILAKQSWRLIRYPDSLMARVLRGRYFKTGHFLK

Query:  AQVGNNPSYVWRSIIWGRELFKRGYRWRIGKGFNIEAATDPWIPKEGSCKPI
          VG  PSY WRSII GREL  RG    IG G + +   D WI  E    P+
Subjt:  AQVGNNPSYVWRSIIWGRELFKRGYRWRIGKGFNIEAATDPWIPKEGSCKPI

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein4.5e-0423.87Show/hide
Query:  WRDRLSMGH--VSILEKLNECLSHLKTWSHRQYGG---SIRGAIDKKEKEIQSLFSRLDDQSLLEVVEKERELENLLEDD-EIYWHQRAREDWLKWGDKN
        W +++ +G    S+ E L       K  + + +G      + A+D  E  IQS        SL  V    R+  N      E ++ Q++R  WL+ GD N
Subjt:  WRDRLSMGH--VSILEKLNECLSHLKTWSHRQYGG---SIRGAIDKKEKEIQSLFSRLDDQSLLEVVEKERELENLLEDD-EIYWHQRAREDWLKWGDKN

Query:  TKWFH--MQANRRRK----------------------------------SSSPNMEDIEYILRIIPTTITEAQNSELTKCFTRDEIYGVIKKMHPSKAPG
        T++FH  + AN+ +                                   S     + ++ I  I P    +   S L+   +  EI   +  M  +KAPG
Subjt:  TKWFH--MQANRRRK----------------------------------SSSPNMEDIEYILRIIPTTITEAQNSELTKCFTRDEIYGVIKKMHPSKAPG

Query:  PDGIHAVFYQKYWDIVGDEVIA
        PD   A F+ + W +V D  IA
Subjt:  PDGIHAVFYQKYWDIVGDEVIA

AT4G20520.1 RNA binding;RNA-directed DNA polymerases3.4e-1238.64Show/hide
Query:  LANRMKLVLDTIISPTQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRVEWIYVKKIMEKMGFSARWLNLIMR
        +  R+K ++  +I P Q+ F+PGR+ ++N +   E +HS++ R+KG  G   LKLD+ KAYDR+ W Y++  +   GF   WL  I R
Subjt:  LANRMKLVLDTIISPTQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRVEWIYVKKIMEKMGFSARWLNLIMR

AT4G29090.1 Ribonuclease H-like superfamily protein6.8e-2938.03Show/hide
Query:  AIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERKIHWRSWKRLCIHKNHGGLGFRDLSLFNQAILAKQSWRLIRYPDSLMARVLRGRYFKTGHFLKA
        A+P Y M+ F    ++C ++ S  A FWW   +  + +HW++W  L  +K  GG+GF+D+  FN A+L KQ WR++  P+SLMA+V + RYF     L A
Subjt:  AIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERKIHWRSWKRLCIHKNHGGLGFRDLSLFNQAILAKQSWRLIRYPDSLMARVLRGRYFKTGHFLKA

Query:  QVGNNPSYVWRSIIWGRELFKRGYRWRIGKGFNIEAATDPWI
         +G+ PS+VW+SI   +E+ ++G R  +G G +I      W+
Subjt:  QVGNNPSYVWRSIIWGRELFKRGYRWRIGKGFNIEAATDPWI

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.8e-3749.34Show/hide
Query:  AIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERKIHWRSWKRLCIHK-NHGGLGFRDLSLFNQAILAKQSWRLIRYPDSLMARVLRGRYFKTGHFLK
        A+P YAMS F+    LC +L S    FWW + EN+RKI W +W++LC  K + GGLGFRDL  FNQA+LAKQS+R+I  P +L++R+LR RYF     ++
Subjt:  AIPNYAMSYFKFLVSLCNELNSFCARFWWGASENERKIHWRSWKRLCIHK-NHGGLGFRDLSLFNQAILAKQSWRLIRYPDSLMARVLRGRYFKTGHFLK

Query:  AQVGNNPSYVWRSIIWGRELFKRGYRWRIGKGFNIEAATDPWIPKEGSCKPI
          VG  PSY WRSII GREL  RG    IG G + +   D WI  E    P+
Subjt:  AQVGNNPSYVWRSIIWGRELFKRGYRWRIGKGFNIEAATDPWIPKEGSCKPI

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.6e-1447.89Show/hide
Query:  LQVILNGSPRAEFIPKRGLRQGDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADDS
        L  I+NG+P+    P RGLRQGDPLSPYLF++C E LSGL   ++    L G+ ++ + P + HL +ADD+
Subjt:  LQVILNGSPRAEFIPKRGLRQGDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAATGGAGGAAGGGGGAGCTGGAAAAATTATGAAAATGTAGAAGAAGGAACACCGAACAATCAGGTGGGTGGAGATAGATCAGTTCAGGCTAAGGAGCGGCCAAA
GGAGTCTGATGGTGCAACCACAACGGCGGAGGAAAAACGGTCAGCAACGATTTCTGGCGGCCCGGCAACGGCAACTCTGATGACAGGCTCAACGGTCGAAAAAATGAATA
TGGAAAGAGACAACGACTCAATTAAATCCAGTATTAAAGAAGGCGAAAATTCAGGACTAAGGAAGCAAAATAGTATTAACGGGGATTCTGAATTAAAGGAGGAGAATATC
TCAAATGGGAACTTTACAGACATTTCATTTATGGAGTTGGATACCAAGCTGGAGGGACCGACAGAGCATTTAAAAGTGGTGAATATTCAGGACTTAACGGCACCAGCTGA
GACAAAATTCACCTCGGACGAGGGGGTGGGAGGCTGCGGGACAGCCCCGTCGGGCACAATGAAAACTATTAGTTGGAATGTCCAATGGTTGGGGAACCCGAGGTCATTAA
GAGCACTACGACATCTAGTGCGTAGTCATCATCCCCAGTTGGTTTTCTTGATGGAAACCAAGTTACAACAGAGTGGTAGGGAGAAAGTTCGAAGAGATCTTCAGTTTGAT
TGTTGTCTTGCAGTTCCAAGCAGTGGTCAAAGTGGAGGGCTCATGTTATTATGGAAAAATGAGACAAATGTTTCAATAAGATCTTTTTCTAAAGGGCATATCGATGCCAT
GGTATCTGAGGATGGAACAGGGTGGAGATTTACTGGTATTTATGGTAACCCCAACAGGGAACATCATCATGAGACGTGGAACCTTATTAAGAGGTTGAAGGAGGGTCCCA
AACAGATCGTTGATCATGTATGGCGAGATCGGTTGAGTATGGGACATGTGTCTATTCTTGAAAAGTTGAATGAATGCTTGAGCCATCTAAAAACATGGAGCCATAGACAA
TATGGAGGTTCCATTCGGGGAGCCATCGACAAGAAAGAAAAAGAGATTCAATCTCTCTTCAGCCGATTGGATGATCAAAGCTTGCTGGAGGTGGTAGAGAAAGAAAGGGA
ACTTGAAAATCTGTTGGAAGATGATGAGATATACTGGCACCAGAGAGCTCGTGAGGATTGGCTAAAATGGGGGGATAAAAATACCAAATGGTTTCATATGCAGGCCAACC
GCCGGAGAAAGTCATCTAGTCCGAACATGGAGGATATTGAGTATATCTTGAGAATTATTCCTACTACGATCACAGAAGCACAGAACTCTGAGCTCACAAAATGCTTTACT
AGGGATGAGATTTACGGTGTGATAAAGAAGATGCATCCTTCTAAAGCTCCTGGGCCAGATGGAATTCATGCGGTTTTCTACCAAAAGTACTGGGATATAGTGGGAGATGA
AGTGATTGCCAAAACCCTTGCCAATAGAATGAAATTGGTTTTAGATACGATTATATCTCCCACCCAATCTGTGTTTGTACCTGGGAGATTAATTTCTAATAACACTATTA
TTGGCTTTGAGTGCATTCATTCGGTTAAGAGTAGAAGAAAAGGAAAGGATGGGGTCGCCGCCTTAAAGTTGGACATGAGCAAAGCGTACGATCGGGTGGAATGGATCTAT
GTCAAAAAGATCATGGAAAAAATGGGATTTAGTGCTAGATGGTTAAATTTGATAATGAGATGTGTGGAGTCAGTTATCTTACAAGTTATTCTTAATGGGTCGCCACGCGC
GGAGTTCATTCCTAAACGTGGTCTTCGACAAGGAGATCCCTTATCACCATATCTATTTCTGATATGTGCGGAGGGTCTATCAGGTCTTCTTAATCATTCTAAACTCAATA
AGGAGTTGACAGGTTTGCTTATCAATAGACATTGTCCTATTTTAACTCATTTGTTTTATGCTGACGATAGTCTCTTGTTCTTTAAAGCTTCTGAAACAGAGTGTAGAACC
ATTCACAGAATCCTAGGCACTTACGAAAGGGCGTCAGGACAAACGATCAATTTTGAGAAGTCTAACTTTATGGTTAGCCCAAATACAAGTGAGGTTCAGAGCAACATTAT
AAAAAACATCCTAAACGTTCAACATAAAGACAGTTTGGGCCAATACCTAGGGTTGCCTTCTCAAAATGCTCAAAGTAAACAGGAGATATTTGACAACATCAAGGATCGAG
TTTGGAAAGCTTTACAAGGATGGAAAGGGAAGTTATTCTCAGCTGCTGGCAAAGAAATTCTTATTAAATCCGTAGCACAAGCAATTCCGAACTACGCTATGAGCTATTTT
AAATTTCTTGTATCCTTGTGTAACGAGCTAAACTCTTTTTGTGCCAGGTTTTGGTGGGGTGCATCAGAAAATGAAAGGAAAATCCATTGGCGGAGTTGGAAAAGACTATG
CATCCATAAGAATCACGGTGGCTTAGGCTTTCGGGATCTCAGCTTGTTTAACCAAGCTATTTTGGCAAAACAGAGCTGGAGATTAATTCGCTATCCAGACAGTCTAATGG
CCAGAGTTCTTCGGGGTAGATATTTTAAAACAGGGCACTTCCTTAAAGCTCAGGTGGGGAATAACCCGTCTTATGTCTGGAGAAGCATCATATGGGGAAGAGAACTCTTC
AAAAGAGGATATCGATGGAGGATTGGAAAAGGTTTTAATATTGAGGCGGCTACAGACCCTTGGATCCCCAAAGAAGGATCGTGTAAACCCATCACTATACATCCCGATGT
CCAACAATATTCAGAAACTCCCGGAAGGGACCACCCATTTGTTTGGGGAGTGTATTTAAATAATAGGCAAGGCTGGACAACGACAGATTACTTCACTTGGACATGGAAAA
ACAGCAAGGGGGAGGATCTAGATGACAGACGAATGGCGGTTAGCTTGGTGATGGTTTGGCTCATTTGGTCCCACAGGAATGAGGTTATCCACAGCAGAAAACAACCAGAC
ATGGAGATATTAAAGGCCCAGATTCATAAATACAGTGCTGAACTTATTCACAATAAGGATTCTCACTTGGACCAGAATCATTCGAGCATCGTCGACCATGTCTGCAACAC
TCCGATGGCTCCTTTGAACCCATGGAACCCGATTCCGACTGGGACGTGGCGATTAAGCTGCGATGCCACCTGGAGTGATGGAAAATCGAGAGGAGGTATTGGCTGGGTCG
TGAGAGACTGGTGCGGAAATATGCTTCGAACAGGTTACAAATGCGTGTTGCGAGCGTGGAAGATTAGTTGGTTGGAAGCGTTCGCGATCTGTGAAGGATTGAAGTCACTG
CCTACCGAGAAACCCCAACTTCGAAATGAAACAGACTGCTTACAAGTTGCAAAATTTGTCACCTCAAGAATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAATGGAGGAAGGGGGAGCTGGAAAAATTATGAAAATGTAGAAGAAGGAACACCGAACAATCAGGTGGGTGGAGATAGATCAGTTCAGGCTAAGGAGCGGCCAAA
GGAGTCTGATGGTGCAACCACAACGGCGGAGGAAAAACGGTCAGCAACGATTTCTGGCGGCCCGGCAACGGCAACTCTGATGACAGGCTCAACGGTCGAAAAAATGAATA
TGGAAAGAGACAACGACTCAATTAAATCCAGTATTAAAGAAGGCGAAAATTCAGGACTAAGGAAGCAAAATAGTATTAACGGGGATTCTGAATTAAAGGAGGAGAATATC
TCAAATGGGAACTTTACAGACATTTCATTTATGGAGTTGGATACCAAGCTGGAGGGACCGACAGAGCATTTAAAAGTGGTGAATATTCAGGACTTAACGGCACCAGCTGA
GACAAAATTCACCTCGGACGAGGGGGTGGGAGGCTGCGGGACAGCCCCGTCGGGCACAATGAAAACTATTAGTTGGAATGTCCAATGGTTGGGGAACCCGAGGTCATTAA
GAGCACTACGACATCTAGTGCGTAGTCATCATCCCCAGTTGGTTTTCTTGATGGAAACCAAGTTACAACAGAGTGGTAGGGAGAAAGTTCGAAGAGATCTTCAGTTTGAT
TGTTGTCTTGCAGTTCCAAGCAGTGGTCAAAGTGGAGGGCTCATGTTATTATGGAAAAATGAGACAAATGTTTCAATAAGATCTTTTTCTAAAGGGCATATCGATGCCAT
GGTATCTGAGGATGGAACAGGGTGGAGATTTACTGGTATTTATGGTAACCCCAACAGGGAACATCATCATGAGACGTGGAACCTTATTAAGAGGTTGAAGGAGGGTCCCA
AACAGATCGTTGATCATGTATGGCGAGATCGGTTGAGTATGGGACATGTGTCTATTCTTGAAAAGTTGAATGAATGCTTGAGCCATCTAAAAACATGGAGCCATAGACAA
TATGGAGGTTCCATTCGGGGAGCCATCGACAAGAAAGAAAAAGAGATTCAATCTCTCTTCAGCCGATTGGATGATCAAAGCTTGCTGGAGGTGGTAGAGAAAGAAAGGGA
ACTTGAAAATCTGTTGGAAGATGATGAGATATACTGGCACCAGAGAGCTCGTGAGGATTGGCTAAAATGGGGGGATAAAAATACCAAATGGTTTCATATGCAGGCCAACC
GCCGGAGAAAGTCATCTAGTCCGAACATGGAGGATATTGAGTATATCTTGAGAATTATTCCTACTACGATCACAGAAGCACAGAACTCTGAGCTCACAAAATGCTTTACT
AGGGATGAGATTTACGGTGTGATAAAGAAGATGCATCCTTCTAAAGCTCCTGGGCCAGATGGAATTCATGCGGTTTTCTACCAAAAGTACTGGGATATAGTGGGAGATGA
AGTGATTGCCAAAACCCTTGCCAATAGAATGAAATTGGTTTTAGATACGATTATATCTCCCACCCAATCTGTGTTTGTACCTGGGAGATTAATTTCTAATAACACTATTA
TTGGCTTTGAGTGCATTCATTCGGTTAAGAGTAGAAGAAAAGGAAAGGATGGGGTCGCCGCCTTAAAGTTGGACATGAGCAAAGCGTACGATCGGGTGGAATGGATCTAT
GTCAAAAAGATCATGGAAAAAATGGGATTTAGTGCTAGATGGTTAAATTTGATAATGAGATGTGTGGAGTCAGTTATCTTACAAGTTATTCTTAATGGGTCGCCACGCGC
GGAGTTCATTCCTAAACGTGGTCTTCGACAAGGAGATCCCTTATCACCATATCTATTTCTGATATGTGCGGAGGGTCTATCAGGTCTTCTTAATCATTCTAAACTCAATA
AGGAGTTGACAGGTTTGCTTATCAATAGACATTGTCCTATTTTAACTCATTTGTTTTATGCTGACGATAGTCTCTTGTTCTTTAAAGCTTCTGAAACAGAGTGTAGAACC
ATTCACAGAATCCTAGGCACTTACGAAAGGGCGTCAGGACAAACGATCAATTTTGAGAAGTCTAACTTTATGGTTAGCCCAAATACAAGTGAGGTTCAGAGCAACATTAT
AAAAAACATCCTAAACGTTCAACATAAAGACAGTTTGGGCCAATACCTAGGGTTGCCTTCTCAAAATGCTCAAAGTAAACAGGAGATATTTGACAACATCAAGGATCGAG
TTTGGAAAGCTTTACAAGGATGGAAAGGGAAGTTATTCTCAGCTGCTGGCAAAGAAATTCTTATTAAATCCGTAGCACAAGCAATTCCGAACTACGCTATGAGCTATTTT
AAATTTCTTGTATCCTTGTGTAACGAGCTAAACTCTTTTTGTGCCAGGTTTTGGTGGGGTGCATCAGAAAATGAAAGGAAAATCCATTGGCGGAGTTGGAAAAGACTATG
CATCCATAAGAATCACGGTGGCTTAGGCTTTCGGGATCTCAGCTTGTTTAACCAAGCTATTTTGGCAAAACAGAGCTGGAGATTAATTCGCTATCCAGACAGTCTAATGG
CCAGAGTTCTTCGGGGTAGATATTTTAAAACAGGGCACTTCCTTAAAGCTCAGGTGGGGAATAACCCGTCTTATGTCTGGAGAAGCATCATATGGGGAAGAGAACTCTTC
AAAAGAGGATATCGATGGAGGATTGGAAAAGGTTTTAATATTGAGGCGGCTACAGACCCTTGGATCCCCAAAGAAGGATCGTGTAAACCCATCACTATACATCCCGATGT
CCAACAATATTCAGAAACTCCCGGAAGGGACCACCCATTTGTTTGGGGAGTGTATTTAAATAATAGGCAAGGCTGGACAACGACAGATTACTTCACTTGGACATGGAAAA
ACAGCAAGGGGGAGGATCTAGATGACAGACGAATGGCGGTTAGCTTGGTGATGGTTTGGCTCATTTGGTCCCACAGGAATGAGGTTATCCACAGCAGAAAACAACCAGAC
ATGGAGATATTAAAGGCCCAGATTCATAAATACAGTGCTGAACTTATTCACAATAAGGATTCTCACTTGGACCAGAATCATTCGAGCATCGTCGACCATGTCTGCAACAC
TCCGATGGCTCCTTTGAACCCATGGAACCCGATTCCGACTGGGACGTGGCGATTAAGCTGCGATGCCACCTGGAGTGATGGAAAATCGAGAGGAGGTATTGGCTGGGTCG
TGAGAGACTGGTGCGGAAATATGCTTCGAACAGGTTACAAATGCGTGTTGCGAGCGTGGAAGATTAGTTGGTTGGAAGCGTTCGCGATCTGTGAAGGATTGAAGTCACTG
CCTACCGAGAAACCCCAACTTCGAAATGAAACAGACTGCTTACAAGTTGCAAAATTTGTCACCTCAAGAATTTGA
Protein sequenceShow/hide protein sequence
MGNGGRGSWKNYENVEEGTPNNQVGGDRSVQAKERPKESDGATTTAEEKRSATISGGPATATLMTGSTVEKMNMERDNDSIKSSIKEGENSGLRKQNSINGDSELKEENI
SNGNFTDISFMELDTKLEGPTEHLKVVNIQDLTAPAETKFTSDEGVGGCGTAPSGTMKTISWNVQWLGNPRSLRALRHLVRSHHPQLVFLMETKLQQSGREKVRRDLQFD
CCLAVPSSGQSGGLMLLWKNETNVSIRSFSKGHIDAMVSEDGTGWRFTGIYGNPNREHHHETWNLIKRLKEGPKQIVDHVWRDRLSMGHVSILEKLNECLSHLKTWSHRQ
YGGSIRGAIDKKEKEIQSLFSRLDDQSLLEVVEKERELENLLEDDEIYWHQRAREDWLKWGDKNTKWFHMQANRRRKSSSPNMEDIEYILRIIPTTITEAQNSELTKCFT
RDEIYGVIKKMHPSKAPGPDGIHAVFYQKYWDIVGDEVIAKTLANRMKLVLDTIISPTQSVFVPGRLISNNTIIGFECIHSVKSRRKGKDGVAALKLDMSKAYDRVEWIY
VKKIMEKMGFSARWLNLIMRCVESVILQVILNGSPRAEFIPKRGLRQGDPLSPYLFLICAEGLSGLLNHSKLNKELTGLLINRHCPILTHLFYADDSLLFFKASETECRT
IHRILGTYERASGQTINFEKSNFMVSPNTSEVQSNIIKNILNVQHKDSLGQYLGLPSQNAQSKQEIFDNIKDRVWKALQGWKGKLFSAAGKEILIKSVAQAIPNYAMSYF
KFLVSLCNELNSFCARFWWGASENERKIHWRSWKRLCIHKNHGGLGFRDLSLFNQAILAKQSWRLIRYPDSLMARVLRGRYFKTGHFLKAQVGNNPSYVWRSIIWGRELF
KRGYRWRIGKGFNIEAATDPWIPKEGSCKPITIHPDVQQYSETPGRDHPFVWGVYLNNRQGWTTTDYFTWTWKNSKGEDLDDRRMAVSLVMVWLIWSHRNEVIHSRKQPD
MEILKAQIHKYSAELIHNKDSHLDQNHSSIVDHVCNTPMAPLNPWNPIPTGTWRLSCDATWSDGKSRGGIGWVVRDWCGNMLRTGYKCVLRAWKISWLEAFAICEGLKSL
PTEKPQLRNETDCLQVAKFVTSRI