CuGenDBv2

Lcy06g010790 (gene) of Sponge gourd (P93075) v1 genome

Gene ID: Lcy06g010790
Organism: Luffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: Chr06:12048046..12051228
RNA-Seq Expression: Lcy06g010790
Synteny: Lcy06g010790
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains: IPR000477 - Reverse transcriptase domain
                  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
                  IPR043502 - DNA/RNA polymerase superfamily
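
The genome location above is given as a chromosome name followed by a start..end range. As a minimal sketch, the string can be split into its parts and the span length computed as follows. The function names are hypothetical, and the length assumes 1-based inclusive coordinates, which this page does not state explicitly.

```python
import re


def parse_location(text):
    """Split a 'Chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


chrom, start, end = parse_location("Chr06:12048046..12051228")
print(chrom, start, end, span_length(start, end))
```

Under that coordinate assumption the gene spans 3183 bp on Chr06; with 0-based half-open coordinates the figure would be one lower.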


Homology
GenBank top hits (e-value, % identity, alignment)
XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber] (e-value: 2.7e-165; identity: 41.34%)
Query:  WHFTGLYGQPKDHLRFQTWELLRLLHNFDDSSWIVGGDLNEIMWDSEKQGGAARAFELMSSFRIALEDCGLRDLGFRGDVFTW-----------------
        W  TG YG P+ H R+ +W LL  L++     W+  GD NEI+  +EK GGA R+   M  FR  +  CG  DLG+ G  +TW                 
Subjt:  WHFTGLYGQPKDHLRFQTWELLRLLHNFDDSSWIVGGDLNEIMWDSEKQGGAARAFELMSSFRIALEDCGLRDLGFRGDVFTW-----------------

Query:  LSNLDWAC-----SNHRPVELSLEPLS-----RPVGFRARQPGFKFNAQWIHHAECREIIANCGDWSDRNGSTNGLFHNLSSCSSRLRHWGREANRFLMK
        L+  DW+        H  V+ + +  +       +  R R   F F AQW    +C+ II     +     +  G+  NL  C+  L  W       + K
Subjt:  LSNLDWAC-----SNHRPVELSLEPLS-----RPVGFRARQPGFKFNAQWIHHAECREIIANCGDWSDRNGSTNGLFHNLSSCSSRLRHWGREANRFLMK

Query:  LILQKKQAIKD-AYSVTPVDFSI-IHSLETDLARLLEEEEIYWHQRSKENWLKWGDRNTRWFHHQASERKKHNDIHGIHRDDGTWVTSESEVESIFLDYF
         I  K+  +   A      D S+ I+ L  ++  LL++EE YW QR+K +WLK GDRNT++FH QASER+K N I GI  + G W  +E  +    + YF
Subjt:  LILQKKQAIKD-AYSVTPVDFSI-IHSLETDLARLLEEEEIYWHQRSKENWLKWGDRNTRWFHHQASERKKHNDIHGIHRDDGTWVTSESEVESIFLDYF

Query:  QNIFTSSELSVDQQLAILDNIPPVISLDMNAKLTASFCQAEIERAISQMFPTKALGPDGFPALFYQTYWDIVGAQTVANCLEVLNNRKSIREWNKTNIAL
         NI++SS  S  ++  + + IP  ++ +MN  L   F + E+  A+ Q+ P KA GPDG  A+F+Q YW IVG       L VLN+   I E NKTNI+L
Subjt:  QNIFTSSELSVDQQLAILDNIPPVISLDMNAKLTASFCQAEIERAISQMFPTKALGPDGFPALFYQTYWDIVGAQTVANCLEVLNNRKSIREWNKTNIAL

Query:  IPKINTPSVVGDFRPISLCNVSHKIIAKVLANRLKHVLCSVVSDAQSAFVSGRAITDNVIIGHECLHYIQHRKKGRDGFAALKLDMSKAYDRVEWSFLER
        IPK N P  + DFRPISLCNV +K+I+K+LANRLK +L  ++S+ QSAF S R ITDNV++  E +HY+ H+  G++GF A+KLDMSKA+DRVEW F+ +
Subjt:  IPKINTPSVVGDFRPISLCNVSHKIIAKVLANRLKHVLCSVVSDAQSAFVSGRAITDNVIIGHECLHYIQHRKKGRDGFAALKLDMSKAYDRVEWSFLER

Query:  LMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLRQGDPLSPYLFVLCAEGLSHALSAAHVSRLISSVQIGTYCPSVSHLFFSDDSLVFFKA
        +M +MGF + W  L+M CIT V +++LIN VA G+I PSRGLRQGDPLSP LF+LCAEGLS  ++ A  ++LI+ + I   CP V+HLFF+DDS++F KA
Subjt:  LMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLRQGDPLSPYLFVLCAEGLSHALSAAHVSRLISSVQIGTYCPSVSHLFFSDDSLVFFKA

Query:  NMVEGWHIKRILNEYESASGQCVNFLKSALLVSSNLSADGRVALGSVLGVPLVDDLGT-YLGLPSRFPRSKGLCFRKTLERVKKVVQGWKRSFFSTGGKE
           E   ++ IL +YE ASGQ +N  KS++  S N + + R  + ++LG P+ +   T YLGLPS   RSK   F    E+V   + GWK    S GGKE
Subjt:  NMVEGWHIKRILNEYESASGQCVNFLKSALLVSSNLSADGRVALGSVLGVPLVDDLGT-YLGLPSRFPRSKGLCFRKTLERVKKVVQGWKRSFFSTGGKE

Query:  TLIKSVVQAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRMKMHWKSWERCAYRRRLGSSDYALAKVSDFVTSSRE
         LIK+V QAIPTY MS F LP+ +C ++ R +  FWWG      KM W SW+R    +  G   +   K  +    +++
Subjt:  TLIKSVVQAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRMKMHWKSWERCAYRRRLGSSDYALAKVSDFVTSSRE
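
In the alignment blocks on this page, the unlabeled middle line marks each column: a letter for an identical residue, "+" for a conservative substitution, and a space for a mismatch or gap. A rough sketch of how a per-segment identity could be tallied from that middle line is shown below. The function name is hypothetical, and the result will not necessarily match the % identity reported for the hit, which is computed by the search tool over the full alignment.

```python
def midline_stats(midline):
    """Count identical and positive columns in a BLAST-style midline.

    Letters mark identical residues, '+' marks conservative substitutions,
    and spaces mark mismatches or gaps. The caller must pass the midline
    with its leading label padding already removed.
    """
    columns = len(midline)
    identical = sum(1 for ch in midline if ch.isalpha())
    positives = identical + sum(1 for ch in midline if ch == "+")
    percent = 100.0 * identical / columns if columns else 0.0
    return {
        "columns": columns,
        "identical": identical,
        "positives": positives,
        "percent_identity": percent,
    }


# First 20 columns of the midline of the first alignment segment above.
print(midline_stats("W  TG YG P+ H R+ +W "))
```

Trailing spaces are significant here because they are alignment columns, so lines should not be stripped before counting.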

XP_024172304.2 uncharacterized protein LOC112178381 [Rosa chinensis] (e-value: 4.1e-161; identity: 34.83%)
Query:  LCLFRRRLVFSSAENYENDMLECLGDGDPSFIPGLWKGKTWHFTGLYGQPKDHLRFQTWELLRLLHNFDDSSWIVGGDLNEIMWDSEKQGGAARAFELMS
        LCL  +  +  +   + ++ ++ L       I G+     W FTG+YG  K  LR  TW L+  +   +   W++GGD NEI+   EK+GG  R    M 
Subjt:  LCLFRRRLVFSSAENYENDMLECLGDGDPSFIPGLWKGKTWHFTGLYGQPKDHLRFQTWELLRLLHNFDDSSWIVGGDLNEIMWDSEKQGGAARAFELMS

Query:  SFRIALEDCGLRDLGFRGDVFTW------------------------------LSNLDWACSNHRPVELSLEPLSRPVGFRARQPGFKFNAQWIHHAECR
        +FR  +E C L DL F G  FTW                              +++L  + S+H P+   +E  S     R R+  F+F   W+H AEC 
Subjt:  SFRIALEDCGLRDLGFRGDVFTW------------------------------LSNLDWACSNHRPVELSLEPLSRPVGFRARQPGFKFNAQWIHHAECR

Query:  EIIANCGDWSDRNGSTNGLFHNLSSCSSRLRHWGREANRFLMKLI--LQKKQAIKDAYSVTPVDFSIIHSLETDLARLLEEEEIYWHQRSKENWLKWGDR
         ++ +  +    N     +   +      L  W  +    L   I  ++ K A+    S++         LET L  LL  E  YW QRS+  WL  GD 
Subjt:  EIIANCGDWSDRNGSTNGLFHNLSSCSSRLRHWGREANRFLMKLI--LQKKQAIKDAYSVTPVDFSIIHSLETDLARLLEEEEIYWHQRSKENWLKWGDR

Query:  NTRWFHHQASERKKHNDIHGIHRDDGTWVTSESEVESIFLDYFQNIFTSSELSVDQQLAILDNI-PPVISLDMNAKLTASFCQAEIERAISQMFPTKALG
        NTR+FHH+AS RKK N I G+  +DG W T +S++E+I LDYF  +F++S     + + +  N+ P V++  MN++L   F + EI +A++QM P KA G
Subjt:  NTRWFHHQASERKKHNDIHGIHRDDGTWVTSESEVESIFLDYFQNIFTSSELSVDQQLAILDNI-PPVISLDMNAKLTASFCQAEIERAISQMFPTKALG

Query:  PDGFPALFYQTYWDIVGAQTVANCLEVLNNRKSIREWNKTNIALIPKINTPSVVGDFRPISLCNVSHKIIAKVLANRLKHVLCSVVSDAQSAFVSGRAIT
        PDGF  +FYQ YW +VG   +A     +N+   +RE N T + LIPK+     +   RPISLCNV +K+ +KVLANRLK +L  +++  QSAFV GR I+
Subjt:  PDGFPALFYQTYWDIVGAQTVANCLEVLNNRKSIREWNKTNIALIPKINTPSVVGDFRPISLCNVSHKIIAKVLANRLKHVLCSVVSDAQSAFVSGRAIT

Query:  DNVIIGHECLHYIQHRKKGRDGFAALKLDMSKAYDRVEWSFLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLRQGDPLSPYLFVLC
        DN ++  E  H+++ R  G  G+ ALKLDMSKAYDRVEW F+E +MR MGF  +WI  IM C+T V ++ L+N    G ++P+RGLRQGD +SPYLF+LC
Subjt:  DNVIIGHECLHYIQHRKKGRDGFAALKLDMSKAYDRVEWSFLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLRQGDPLSPYLFVLC

Query:  AEGLSHALSAAHVSRLISSVQIGTYCPSVSHLFFSDDSLVFFKANMVEGWHIKRILNEYESASGQCVNFLKSALLVSSNLSADGRVALGSVLGVPLVDDL
        AEGLS  LS       +  + I    PS++HLFF+DDS VF KA   E   +K IL  YE ASGQ VNF KS +  S N+    +  L  V GV  VD  
Subjt:  AEGLSHALSAAHVSRLISSVQIGTYCPSVSHLFFSDDSLVFFKANMVEGWHIKRILNEYESASGQCVNFLKSALLVSSNLSADGRVALGSVLGVPLVDDL

Query:  GTYLGLPSRFPRSKGLCFRKTLERVKKVVQGWKRSFFSTGGKETLIKSVVQAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRMKMHWKSWERCAYR
          YLGLP+    SK   F+  +E+ +  ++ WK    S  GKE +IKSVVQ++PTY MS F LPK +CQE+ R ++ FWWG SE   K+HW +W++    
Subjt:  GTYLGLPSRFPRSKGLCFRKTLERVKKVVQGWKRSFFSTGGKETLIKSVVQAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRMKMHWKSWERCAYR

Query:  RRLGS------------------------SDYALAKV--------SDFV---------------------------------------------------
        +  G                          D  L K         +DF+                                                   
Subjt:  RRLGS------------------------SDYALAKV--------SDFV---------------------------------------------------

Query:  -------------------TSSREWDVEKLADLLTLEDLQLVQCLPIGPSGAEDIWLWHYDKRGVYTVKSGYKLCMLQGQVS
                             S++W V+ L +L   +++ L++ +P+     ED  +WH+DKRG+Y+VKSGY +      +S
Subjt:  -------------------TSSREWDVEKLADLLTLEDLQLVQCLPIGPSGAEDIWLWHYDKRGVYTVKSGYKLCMLQGQVS

XP_030497600.1 uncharacterized protein LOC115713257 [Cannabis sativa] (e-value: 1.8e-161; identity: 36.55%)
Query:  GKTWHFTGLYGQPKDHLRFQTWELLRLLHNFDD-SSWIVGGDLNEIMWDSEKQGGAARAFELMSSFRIALEDCGLRDLGFRGDVFTW-------------
        G  WHF+ LYG P+   +  TW+L+R L +      W++ GD+NEI  +  K GG  R    M +FR  L+ C L ++   GD FTW             
Subjt:  GKTWHFTGLYGQPKDHLRFQTWELLRLLHNFDD-SSWIVGGDLNEIMWDSEKQGGAARAFELMSSFRIALEDCGLRDLGFRGDVFTW-------------

Query:  ------------------LSNLDWACSNHRPVELSLEPLSRPVGFRARQPGFKFNAQWIHHAECREIIANCGDWSDRNGSTNGLFHNLSSCSSRLRHWG-
                          LS+LD+  S+HR +   ++    P     R+  F+F   W+   EC EII+N    S      + L  +L  C++ L  W  
Subjt:  ------------------LSNLDWACSNHRPVELSLEPLSRPVGFRARQPGFKFNAQWIHHAECREIIANCGDWSDRNGSTNGLFHNLSSCSSRLRHWG-

Query:  REANRFLMKLILQKK--QAIKDAYSVTPVDFSI-IHSLETDLARLLEEEEIYWHQRSKENWLKWGDRNTRWFHHQASERKKHNDIHGIHRDDGTWVTSES
        R+  +    + L +K    +  + S  P DFS  +HS E+ L  LL  EE YW QRS+ +WL+ GDRNT++FH +AS R  +N I  +  D G  VT++ 
Subjt:  REANRFLMKLILQKK--QAIKDAYSVTPVDFSI-IHSLETDLARLLEEEEIYWHQRSKENWLKWGDRNTRWFHHQASERKKHNDIHGIHRDDGTWVTSES

Query:  EVESIFLDYFQNIFTSSELSVDQQLAILDNIPPVISLDMNAKLTASFCQAEIERAISQMFPTKALGPDGFPALFYQTYWDIVGAQTVANCLEVLNNRKSI
         +  +  DYFQ +FT+S         +L  IP  IS + N  L   F  +E+  A+  +   K+ G DG  A+FY   W+IVG       L+VLNN  + 
Subjt:  EVESIFLDYFQNIFTSSELSVDQQLAILDNIPPVISLDMNAKLTASFCQAEIERAISQMFPTKALGPDGFPALFYQTYWDIVGAQTVANCLEVLNNRKSI

Query:  REWNKTNIALIPKINTPSVVGDFRPISLCNVSHKIIAKVLANRLKHVLCSVVSDAQSAFVSGRAITDNVIIGHECLHYIQHRKKGRDGFAALKLDMSKAY
          +NKT I LIPKI  P  + DFRPISLCNV++KII+K+LA R K VL SV+S+ QSAF+S R ITDN+++  E +H ++HR +G  GFAALKLDMSKA+
Subjt:  REWNKTNIALIPKINTPSVVGDFRPISLCNVSHKIIAKVLANRLKHVLCSVVSDAQSAFVSGRAITDNVIIGHECLHYIQHRKKGRDGFAALKLDMSKAY

Query:  DRVEWSFLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLRQGDPLSPYLFVLCAEGLSHALSAAHVSRLISSVQIGTYCPSVSHLFF
        DRVEWSFL  +M +MGF    I LIM C+    F+ LIN    GS++P RGLRQGDPLSPYLF++C+EGLS  L        +  + +  + PS++HL F
Subjt:  DRVEWSFLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLRQGDPLSPYLFVLCAEGLSHALSAAHVSRLISSVQIGTYCPSVSHLFF

Query:  SDDSLVFFKANMVEGWHIKRILNEYESASGQCVNFLKSALLVSSNLSADGRVALGSVLGVPLVDDLGTYLGLPSRFPRSKGLCFRKTLERVKKVVQGWKR
        +DDSL+F +AN      IKR L+ Y  ASGQ +N  KS +  S N     +++   +LG+P+     +YLGLP+   R K   F    ER+ K++  W  
Subjt:  SDDSLVFFKANMVEGWHIKRILNEYESASGQCVNFLKSALLVSSNLSADGRVALGSVLGVPLVDDLGTYLGLPSRFPRSKGLCFRKTLERVKKVVQGWKR

Query:  SFFSTGGKETLIKSVVQAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRMKMHWKSWERCAYRRRLG------------------------------
          FS GGKE L+K+VVQAIPTYAMS FRL    C++I   ++RFWWGSS    K+HWK+W+     +R G                              
Subjt:  SFFSTGGKETLIKSVVQAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRMKMHWKSWERCAYRRRLG------------------------------

Query:  ----------SSDYALAKVS-----------------------------------------------------------DFVTSSREWDVEKLADLLTLE
                   +D+  AKVS                                                           D++T +REWD+E L +  +  
Subjt:  ----------SSDYALAKVS-----------------------------------------------------------DFVTSSREWDVEKLADLLTLE

Query:  DLQLVQCLPIGPSGAEDIWLWHYDKRGVYTVKSGYKL-CMLQGQ
        D+  +  +P+  +   D W WHYD  G YTVKSGY L C L+ +
Subjt:  DLQLVQCLPIGPSGAEDIWLWHYDKRGVYTVKSGYKL-CMLQGQ

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata] (e-value: 1.4e-161; identity: 40.9%)
Query:  GKTWHFTGLYGQPKDHLRFQTWELLRLLHNFDDSSWIVGGDLNEIMWDSEKQGGAARAFELMSSFRIALEDCGLRDLGFRGDVFTW--------------
        G  W+ TG YG P    +  +W LL+ L  F    W+V GD N  +  SEK       F  + +FR AL  C L DLGF+G  +TW              
Subjt:  GKTWHFTGLYGQPKDHLRFQTWELLRLLHNFDDSSWIVGGDLNEIMWDSEKQGGAARAFELMSSFRIALEDCGLRDLGFRGDVFTW--------------

Query:  ---LSNLDWA--------------CSNHRPVELSLEPLSRPVGFRARQPGFKFNAQWIHHAECREIIANCGDWSDRNGSTNGLF---HNLSSCSSRLRHW
           ++N +W                S+H P+ L ++  S+P     R   FKF   W+   EC  +I     W + +G+ +GL      + +C   L  W
Subjt:  ---LSNLDWA--------------CSNHRPVELSLEPLSRPVGFRARQPGFKFNAQWIHHAECREIIANCGDWSDRNGSTNGLF---HNLSSCSSRLRHW

Query:  G---REANRFLMKLILQKKQAIKDAYSVTPVDFSIIHSLETDLARLLEEEEIYWHQRSKENWLKWGDRNTRWFHHQASERKKHNDIHGIHRDDGTWVTSE
        G    + +   +K I ++   + +   +T    +   +L   +  LL+++EIYW QRS+ NWL+ GDRNT++FH +AS+R++ N I GI    G WV + 
Subjt:  G---REANRFLMKLILQKKQAIKDAYSVTPVDFSIIHSLETDLARLLEEEEIYWHQRSKENWLKWGDRNTRWFHHQASERKKHNDIHGIHRDDGTWVTSE

Query:  SEVESIFLDYFQNIFTSSELSVDQQLAILDNIPPVISLDMNAKLTASFCQAEIERAISQMFPTKALGPDGFPALFYQTYWDIVGAQTVANCLEVLNNRKS
         EV  +  DYF N+F +   + DQ    LD +   ++ DM   L+  F   E++ A+ QM PTKA GPDG  ALFYQ +W IVG   V+  L+ LNN   
Subjt:  SEVESIFLDYFQNIFTSSELSVDQQLAILDNIPPVISLDMNAKLTASFCQAEIERAISQMFPTKALGPDGFPALFYQTYWDIVGAQTVANCLEVLNNRKS

Query:  IREWNKTNIALIPKINTPSVVGDFRPISLCNVSHKIIAKVLANRLKHVLCSVVSDAQSAFVSGRAITDNVIIGHECLHYIQHRKKGRDGFAALKLDMSKA
        + E N TNI LIPK+  P  + +FRPISLCNV +KII+KVLANRLK VL  ++S  QSAFV GR ITDNV++ +E LH +  RKKG+ G  ALKLD+SKA
Subjt:  IREWNKTNIALIPKINTPSVVGDFRPISLCNVSHKIIAKVLANRLKHVLCSVVSDAQSAFVSGRAITDNVIIGHECLHYIQHRKKGRDGFAALKLDMSKA

Query:  YDRVEWSFLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLRQGDPLSPYLFVLCAEGLSHALSAAHVSRLISSVQIGTYCPSVSHLF
        YDRVEW FL+ +M +MGF   WI  +M C+T   F++L+N      I PSRG+RQGDP+SPYLF+LCAEGL+  L+ A ++ +I+ V I    P +++L 
Subjt:  YDRVEWSFLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLRQGDPLSPYLFVLCAEGLSHALSAAHVSRLISSVQIGTYCPSVSHLF

Query:  FSDDSLVFFKANMVEGWHIKRILNEYESASGQCVNFLKSALLVSSNLSADGRVALGSVLGVPLVDDLGTYLGLPSRFPRSKGLCFRKTLERVKKVVQGWK
        F+DDSL+F +A   EG  I  IL  YE ASGQ +N  KS+   S+N S   +  +  +LGV  VD    YLGLP+   R+K   F +  +RV K +QGWK
Subjt:  FSDDSLVFFKANMVEGWHIKRILNEYESASGQCVNFLKSALLVSSNLSADGRVALGSVLGVPLVDDLGTYLGLPSRFPRSKGLCFRKTLERVKKVVQGWK

Query:  RSFFSTGGKETLIKSVVQAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRMKMHWKSWERCAYRRRLGSSDY
            S  GKE LIK+V QAIPTY MS F++P  +C E+    +RFWWG   +  K+HWKSW++    ++ G   +
Subjt:  RSFFSTGGKETLIKSVVQAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRMKMHWKSWERCAYRRRLGSSDY

XP_030969743.1 uncharacterized protein LOC115990020 [Quercus lobata] (e-value: 3.0e-164; identity: 39.57%)
Query:  WHFTGLYGQPKDHLRFQTWELLRLLHNFDDSSWIVGGDLNEIMWDSEKQGGAARAFELMSSFRIALEDCGLRDLGFRGDVFTW----------------L
        W  TG YG+P+   R + W +LR+L +     W   GD NE++  S+K GG  R+   M SFR AL+ CG  DLGF G  FTW                +
Subjt:  WHFTGLYGQPKDHLRFQTWELLRLLHNFDDSSWIVGGDLNEIMWDSEKQGGAARAFELMSSFRIALEDCGLRDLGFRGDVFTW----------------L

Query:  SNLDWAC--------------SNHRPVELSLEPLSRPVGFRARQPGFKFNAQWIHHAECREIIANCGDWSDRNGSTNGLFHNLSSCSSRLRHWGRE--AN
        +N +W                S+HRP+ LSL+  S     R R+  F+F + W+ +  C+  +A       R          +  C  RL+ W +E   N
Subjt:  SNLDWAC--------------SNHRPVELSLEPLSRPVGFRARQPGFKFNAQWIHHAECREIIANCGDWSDRNGSTNGLFHNLSSCSSRLRHWGRE--AN

Query:  RFLMKLILQKKQAIKDAYSVTPVDFSIIHSLETDLARLLEEEEIYWHQRSKENWLKWGDRNTRWFHHQASERKKHNDIHGIHRDDGTWVTSESEVESIFL
              ++++K  + +  SV   D   + +L+ +L  LLE+EE  WHQRS+  WL+ GD+NTR+FH  A+ RK+ N I G+  ++G W + E     +  
Subjt:  RFLMKLILQKKQAIKDAYSVTPVDFSIIHSLETDLARLLEEEEIYWHQRSKENWLKWGDRNTRWFHHQASERKKHNDIHGIHRDDGTWVTSESEVESIFL

Query:  DYFQNIFTSSE-LSVDQQLAILDNIPPVISLDMNAKLTASFCQAEIERAISQMFPTKALGPDGFPALFYQTYWDIVGAQTVANCLEVLNNRKSIREWNKT
        D+++ +F SS   ++D+   ++D +  V++  MNA L   +   E+ERAI  M P KA GPDG P LFYQTYW  V        L  LN+   ++  N T
Subjt:  DYFQNIFTSSE-LSVDQQLAILDNIPPVISLDMNAKLTASFCQAEIERAISQMFPTKALGPDGFPALFYQTYWDIVGAQTVANCLEVLNNRKSIREWNKT

Query:  NIALIPKINTPSVVGDFRPISLCNVSHKIIAKVLANRLKHVLCSVVSDAQSAFVSGRAITDNVIIGHECLHYIQHRKKGRDGFAALKLDMSKAYDRVEWS
         I LIPK+  P  V +FRPISLCNV +KI++K +ANRLK +L S++SD QSAF++ R ITDNV+I  E LH++++   G+ GF ALKLDMSKAYDRVEWS
Subjt:  NIALIPKINTPSVVGDFRPISLCNVSHKIIAKVLANRLKHVLCSVVSDAQSAFVSGRAITDNVIIGHECLHYIQHRKKGRDGFAALKLDMSKAYDRVEWS

Query:  FLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLRQGDPLSPYLFVLCAEGLSHALSAAHVSRLISSVQIGTYCPSVSHLFFSDDSLV
        FLE+++ ++GF + W+ LIM+CIT V +++L+N    G I P+RGLRQGDPLSPYLF+ CAEGL+     A V   I    I    P ++HLFF+DD L+
Subjt:  FLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLRQGDPLSPYLFVLCAEGLSHALSAAHVSRLISSVQIGTYCPSVSHLFFSDDSLV

Query:  FFKANMVEGWHIKRILNEYESASGQCVNFLKSALLVSSNLSADGRVALGSVLGVPLVDDLGTYLGLPSRFPRSKGLCFRKTLERVKKVVQGWKRSFFSTG
        F ++++ E   IK +L  YE ASGQ VN  K+ L  S N     + A+ + LGVP +     YLGLPS   R+K  CF +  ER+   +QGWK    S  
Subjt:  FFKANMVEGWHIKRILNEYESASGQCVNFLKSALLVSSNLSADGRVALGSVLGVPLVDDLGTYLGLPSRFPRSKGLCFRKTLERVKKVVQGWKRSFFSTG

Query:  GKETLIKSVVQAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRMKMHWKSWERCAYRRRLGSSDYA-LAKVSDFVTSSREWDVEKLADLLTLE
        GKE +IK+VVQ+IPTY+MS F+LP  +C++I   + +FWWG  E+R K+HW +W      + +G   +  + + ++ + + + W +    D L  +
Subjt:  GKETLIKSVVQAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRMKMHWKSWERCAYRRRLGSSDYA-LAKVSDFVTSSREWDVEKLADLLTLE

TrEMBL top hits (e-value, % identity, alignment)
A0A2N9F5W1 Reverse transcriptase domain-containing protein (e-value: 1.6e-166; identity: 39.44%)
Query:  LCLFRRRLVFSSAENYENDMLECLGDGDPSFIPGLWKGKTWHFTGLYGQPKDHLRFQTWELLRLLHNFDDSSWIVGGDLNEIMWDSEKQGGAARAFELMS
        L LF RR +  S  +Y +  ++ L D D           TW FTG YG P    +   W+LLR+  +     W  GGD NE++   EK G  AR    M 
Subjt:  LCLFRRRLVFSSAENYENDMLECLGDGDPSFIPGLWKGKTWHFTGLYGQPKDHLRFQTWELLRLLHNFDDSSWIVGGDLNEIMWDSEKQGGAARAFELMS

Query:  SFRIALEDCGLRDLGFRGDVFTW-----------------LSNLDW--------------ACSNHRPVELSLEPLSRPVGFRARQPGFKFNAQWIHHAEC
         FR  ++DCG  DLGF G  +TW                 L+  DW                S+HRP+ + L    R    R  +  F+F   W  H  C
Subjt:  SFRIALEDCGLRDLGFRGDVFTW-----------------LSNLDW--------------ACSNHRPVELSLEPLSRPVGFRARQPGFKFNAQWIHHAEC

Query:  REIIANCGDWSDRNGSTNGLFHNLSSCSSRLRHWGREANRFLMKLILQKKQAIKDAYSVTPV--DFSIIHSLETDLARLLEEEEIYWHQRSKENWLKWGD
         E I    +          +   + +    L+ W       +   I  K + ++    ++PV  + S+I  L  +LA L  +EE  W QRS+  WL+ GD
Subjt:  REIIANCGDWSDRNGSTNGLFHNLSSCSSRLRHWGREANRFLMKLILQKKQAIKDAYSVTPV--DFSIIHSLETDLARLLEEEEIYWHQRSKENWLKWGD

Query:  RNTRWFHHQASERKKHNDIHGIHRDDGTWVTSESEVESIFLDYFQNIFTSSELSVDQQLAILDNIPPVISLDMNAKLTASFCQAEIERAISQMFPTKALG
        RNT++FH QA+ RK+ N IHGI    G W +   EVE   ++Y++++FT+S+     +  IL  +  +I++DMN +L A F  AE+E A++QM P KALG
Subjt:  RNTRWFHHQASERKKHNDIHGIHRDDGTWVTSESEVESIFLDYFQNIFTSSELSVDQQLAILDNIPPVISLDMNAKLTASFCQAEIERAISQMFPTKALG

Query:  PDGFPALFYQTYWDIVGAQTVANCLEVLNNRKSIREWNKTNIALIPKINTPSVVGDFRPISLCNVSHKIIAKVLANRLKHVLCSVVSDAQSAFVSGRAIT
         DG   +FYQ YW+IVG    A+ L  L +   +++ N T+I LIPK+  P  V DFRPISLCNV +KIIAKVLANRLK +L  ++S++QSAFV GR I+
Subjt:  PDGFPALFYQTYWDIVGAQTVANCLEVLNNRKSIREWNKTNIALIPKINTPSVVGDFRPISLCNVSHKIIAKVLANRLKHVLCSVVSDAQSAFVSGRAIT

Query:  DNVIIGHECLHYIQHRKKGRDGFAALKLDMSKAYDRVEWSFLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLRQGDPLSPYLFVLC
        DN++I  E LH+++H K  + G+ ALKLDMSKAYDRVEW FLER+M +MGF++ W+ +IM+C+  V ++VLIN    G   P+RGLRQGDP+SPYLF+LC
Subjt:  DNVIIGHECLHYIQHRKKGRDGFAALKLDMSKAYDRVEWSFLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLRQGDPLSPYLFVLC

Query:  AEGLSHALSAAHVSRLISSVQIGTYCPSVSHLFFSDDSLVFFKANMVEGWHIKRILNEYESASGQCVNFLKSALLVSSNLSADGRVALGSVLGVPLVDDL
        AEGL+  L+ A +S+ I  + I    P +SHLFF+DDS++F +A++ E   I+ IL+ YE AS Q +N  K+ L  SS+   + +  +   L +P++   
Subjt:  AEGLSHALSAAHVSRLISSVQIGTYCPSVSHLFFSDDSLVFFKANMVEGWHIKRILNEYESASGQCVNFLKSALLVSSNLSADGRVALGSVLGVPLVDDL

Query:  GTYLGLPSRFPRSKGLCFRKTLERVKKVVQGWKRSFFSTGGKETLIKSVVQAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRMKMHWKSWERCAYR
          YLGLPS   R+K   F +  +RV   +QGWK    S  G+E LIK+VVQAIP Y M+ F+LPK +  ++ R V  FWWG S    K+HW +W      
Subjt:  GTYLGLPSRFPRSKGLCFRKTLERVKKVVQGWKRSFFSTGGKETLIKSVVQAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRMKMHWKSWERCAYR

Query:  RRLGSSDY-ALAKVSDFVTSSREW
        +++G   +  L+K +D + + + W
Subjt:  RRLGSSDY-ALAKVSDFVTSSREW

A0A2N9I2P8 Reverse transcriptase domain-containing protein (e-value: 1.0e-165; identity: 37.49%)
Query:  GKTWHFTGLYGQPKDHLRFQTWELLRLLHNFDDSSWIVGGDLNEIMWDSEKQGGAARAFELMSSFRIALEDCGLRDLGFRGDVFTW--------------
        GK++  TG YG P+ H R ++W LL+ L + + S W+  GD NEI+ +SE+ G   R    +  FR A+    L DLGF G  FTW              
Subjt:  GKTWHFTGLYGQPKDHLRFQTWELLRLLHNFDDSSWIVGGDLNEIMWDSEKQGGAARAFELMSSFRIALEDCGLRDLGFRGDVFTW--------------

Query:  ---LSNLDWAC--------------SNHRPVELSLEPLSRPVGFRARQPGFKFNAQWIHHAECREIIANCGDWSDRNGSTN-GLFHNLSSCSSRLRHWGR
           L++  W                S+H P+ L +   S  V    R+  F+F A W    +CR +I        R GS    +   L  C   L  W +
Subjt:  ---LSNLDWAC--------------SNHRPVELSLEPLSRPVGFRARQPGFKFNAQWIHHAECREIIANCGDWSDRNGSTN-GLFHNLSSCSSRLRHWGR

Query:  EANRFLMKLILQKKQAIKDAYSVTPVDFSI-IHSLETDLARLLEEEEIYWHQRSKENWLKWGDRNTRWFHHQASERKKHNDIHGIHRDDGTWVTSESEVE
        E    L   I  K++ ++   +++    S  +  L+T+L  LLE+EEI+W QRS+ +W+  GD+NT++FH   ++R++ N I G++  D  W T ++++ 
Subjt:  EANRFLMKLILQKKQAIKDAYSVTPVDFSI-IHSLETDLARLLEEEEIYWHQRSKENWLKWGDRNTRWFHHQASERKKHNDIHGIHRDDGTWVTSESEVE

Query:  SIFLDYFQNIFTSSELSVDQQLAILDNIPPVISLDMNAKLTASFCQAEIERAISQMFPTKALGPDGFPALFYQTYWDIVGAQTVANCLEVLNNRKSIREW
         I + YFQNIFTSS+  VD   + L+ +  V++ DMNA L A F + E+  A+ QM+PTKA GPDG  A+FYQTYW++VG +     L ++++   + + 
Subjt:  SIFLDYFQNIFTSSELSVDQQLAILDNIPPVISLDMNAKLTASFCQAEIERAISQMFPTKALGPDGFPALFYQTYWDIVGAQTVANCLEVLNNRKSIREW

Query:  NKTNIALIPKINTPSVVGDFRPISLCNVSHKIIAKVLANRLKHVLCSVVSDAQSAFVSGRAITDNVIIGHECLHYIQHRKKGRDGFAALKLDMSKAYDRV
        N T+IAL+PKI +   + DFRPI+LCNV +KII+KVLANRLK +L  +VS++QSAFV GR ITDNV++  E +H +  ++ GR G  ALKLDMSKAYDRV
Subjt:  NKTNIALIPKINTPSVVGDFRPISLCNVSHKIIAKVLANRLKHVLCSVVSDAQSAFVSGRAITDNVIIGHECLHYIQHRKKGRDGFAALKLDMSKAYDRV

Query:  EWSFLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLRQGDPLSPYLFVLCAEGLSHALSAAHVSRLISSVQIGTYCPSVSHLFFSDD
        EWSFLE +MRR+GFA+ WI LIM CI  V ++VLIN   CG    SRG+RQGD LSPYLF+LCAEGLS  L  A     I+ V      P ++HLFF+DD
Subjt:  EWSFLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLRQGDPLSPYLFVLCAEGLSHALSAAHVSRLISSVQIGTYCPSVSHLFFSDD

Query:  SLVFFKANMVEGWHIKRILNEYESASGQCVNFLKSALLVSSNLSADGRVALGSVLGVPLVDDLGTYLGLPSRFPRSKGLCFRKTLERVKKVVQGWKRSFF
        SL+F +ANM     +  IL +YE ASGQ +N  K+++  + N +   R  + ++  VP +     YLGLPS   RSK + F     RV + + GWK  F 
Subjt:  SLVFFKANMVEGWHIKRILNEYESASGQCVNFLKSALLVSSNLSADGRVALGSVLGVPLVDDLGTYLGLPSRFPRSKGLCFRKTLERVKKVVQGWKRSFF

Query:  STGGKETLIKSVVQAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRMKMHWKSWER-----------------------------------------
        S+ G+E L+K+V Q+IPTY MS F+LP+S+C ++    S FWWG  +   K HW  W +                                         
Subjt:  STGGKETLIKSVVQAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRMKMHWKSWER-----------------------------------------

Query:  --------CAY-----RRRLGSSDYALAKVSDFVT----------SSREWDVEKLADLLTLEDLQLVQCLPIGPSGAEDIWLWHYDKRGVYTVKSGYKL
                C++     R R   +  +L    D +T            REW VE +  L +  +  ++  +P+ P    D   W   K G++TV+S Y +
Subjt:  --------CAY-----RRRLGSSDYALAKVSDFVT----------SSREWDVEKLADLLTLEDLQLVQCLPIGPSGAEDIWLWHYDKRGVYTVKSGYKL

A0A2N9I6L8 Reverse transcriptase domain-containing protein (e-value: 6.0e-166; identity: 41.76%)
Query:  WHFTGLYGQPKDHLRFQTWELLRLLHNFDDSSWIVGGDLNEIMWDSEKQGGAARAFELMSSFRIALEDCGLRDLGFRGDVFTWLS---------------
        W FTG YG P+   R  +W +LR LH      W   GD NE++   EK+GG  R    M +FR  L+DCG +DLGF G  FTW +               
Subjt:  WHFTGLYGQPKDHLRFQTWELLRLLHNFDDSSWIVGGDLNEIMWDSEKQGGAARAFELMSSFRIALEDCGLRDLGFRGDVFTWLS---------------

Query:  --NLDW--------------ACSNHRPVELSLEPLSRPVGFRARQPGFKFNAQWIHHAECREIIANCGDWSDRNGSTN--GLFHNLSSCSSRLRHWGREA
          N +W              + S+H P+ LS       VG   R+  F+F + W+    C+  + +   W + +  ++   L++ +  C  RLR W R +
Subjt:  --NLDW--------------ACSNHRPVELSLEPLSRPVGFRARQPGFKFNAQWIHHAECREIIANCGDWSDRNGSTN--GLFHNLSSCSSRLRHWGREA

Query:  NRFLMKLILQKKQAIKDA--YSVTPVDFSIIHSLETDLARLLEEEEIYWHQRSKENWLKWGDRNTRWFHHQASERKKHNDIHGIHRDDGTWVTSESEVES
           + + + +K++ ++ A   S+   D S + +L ++L  LLE EE  W QRS+ +WL+ GDRNTR+FH +AS+R++ N I G+  D+G+W    ++V  
Subjt:  NRFLMKLILQKKQAIKDA--YSVTPVDFSIIHSLETDLARLLEEEEIYWHQRSKENWLKWGDRNTRWFHHQASERKKHNDIHGIHRDDGTWVTSESEVES

Query:  IFLDYFQNIF-TSSELSVDQQLAILDNIPPVISLDMNAKLTASFCQAEIERAISQMFPTKALGPDGFPALFYQTYWDIVGAQTVANCLEVLNNRKSIREW
        I LDYFQN+F T     VD+   +LD+IP VI+ DM++ L+  +  +E+ERAI QM P  A GPDG P LFYQ++W ++G    A  L  LN+   ++  
Subjt:  IFLDYFQNIF-TSSELSVDQQLAILDNIPPVISLDMNAKLTASFCQAEIERAISQMFPTKALGPDGFPALFYQTYWDIVGAQTVANCLEVLNNRKSIREW

Query:  NKTNIALIPKINTPSVVGDFRPISLCNVSHKIIAKVLANRLKHVLCSVVSDAQSAFVSGRAITDNVIIGHECLHYIQHRKKGRDGFAALKLDMSKAYDRV
        N T I LIPK+ +P+ + +FRPISLCNV +KI++KVLANRLK +L  ++S+ QSAFV GR ITDN+++  E LH+++H   G+ G  ALKLDMSKAYDRV
Subjt:  NKTNIALIPKINTPSVVGDFRPISLCNVSHKIIAKVLANRLKHVLCSVVSDAQSAFVSGRAITDNVIIGHECLHYIQHRKKGRDGFAALKLDMSKAYDRV

Query:  EWSFLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLRQGDPLSPYLFVLCAEGLSHALSAAHVSRLISSVQIGTYCPSVSHLFFSDD
        EW++L  +M +MGF    I LIM+C+  V ++VL+N    G   P+RGLRQGDPLSPYLF+LC EG    L AA  S  I  V I  Y P +SHLFF+DD
Subjt:  EWSFLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLRQGDPLSPYLFVLCAEGLSHALSAAHVSRLISSVQIGTYCPSVSHLFFSDD

Query:  SLVFFKANMVEGWHIKRILNEYESASGQCVNFLKSALLVSSNLSADGRVALGSVLGVPLVDDLGTYLGLPSRFPRSKGLCFRKTLERVKKVVQGWKRSFF
        SL+F KAN+ E   +  IL+ YE ASGQ +N  K+ L  S +  A  +  +   LGVP++     YLGLPS   RSK   F    +RV   +QGWK    
Subjt:  SLVFFKANMVEGWHIKRILNEYESASGQCVNFLKSALLVSSNLSADGRVALGSVLGVPLVDDLGTYLGLPSRFPRSKGLCFRKTLERVKKVVQGWKRSFF

Query:  STGGKETLIKSVVQAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRMKMHWKSWERCAYRRRLGSSDY
        S  G+E LIK+VVQAIPTY+MS FRLP  +C E+   + RFWWG  +++ K+ W  W+     +  G   +
Subjt:  STGGKETLIKSVVQAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRMKMHWKSWERCAYRRRLGSSDY

A0A2N9IPS8 Reverse transcriptase domain-containing protein (e-value: 1.2e-166; identity: 41.73%)
Query:  KGKTWHFTGLYGQPKDHLRFQTWELLRLLHNFDDSSWIVGGDLNEIMWDSEKQGGAARAFELMSSFRIALEDCGLRDLGFRGDVFTW-------------
        KGK +  TG YG P+ H R ++W LL+ L +   S W+  GD NEI+ ++E+ G   R    +  FR A+  CGL DLG+ G+ +TW             
Subjt:  KGKTWHFTGLYGQPKDHLRFQTWELLRLLHNFDDSSWIVGGDLNEIMWDSEKQGGAARAFELMSSFRIALEDCGLRDLGFRGDVFTW-------------

Query:  ----LSNLDWAC--------------SNHRPVELSLEPLSRPVG--FRARQPGFKFNAQWIHHAECREIIANCGDWSDRNGSTNG-----LFHNLSSCSS
            ++++ W                S+H P+ L +     P G   + ++  F+F A WI   +CRE+I +   W D  G T G     +   +  C +
Subjt:  ----LSNLDWAC--------------SNHRPVELSLEPLSRPVG--FRARQPGFKFNAQWIHHAECREIIANCGDWSDRNGSTNG-----LFHNLSSCSS

Query:  RLRHWGREANRFLMKLILQKKQAIKDAYSVTPVDFSI-IHSLETDLARLLEEEEIYWHQRSKENWLKWGDRNTRWFHHQASERKKHNDIHGIHRDDGTWV
         L  W RE    L   I +K++ ++   + TP  FS  I  L+ DL  LLE+EEI+W QRS+  W+  GD+NT++FH Q +ER++ N I G+   DG W 
Subjt:  RLRHWGREANRFLMKLILQKKQAIKDAYSVTPVDFSI-IHSLETDLARLLEEEEIYWHQRSKENWLKWGDRNTRWFHHQASERKKHNDIHGIHRDDGTWV

Query:  TSESEVESIFLDYFQNIFTSSELSVDQQLAILDNIPPVISLDMNAKLTASFCQAEIERAISQMFPTKALGPDGFPALFYQTYWDIVGAQTVANCLEVLNN
        T ++++  I +DYFQ IFTSS  S +    +L  +  V++  MN +L A F + E+  A+ QM+PTKA GPDG  A+FYQTYWDIVG +     L +L++
Subjt:  TSESEVESIFLDYFQNIFTSSELSVDQQLAILDNIPPVISLDMNAKLTASFCQAEIERAISQMFPTKALGPDGFPALFYQTYWDIVGAQTVANCLEVLNN

Query:  RKSIREWNKTNIALIPKINTPSVVGDFRPISLCNVSHKIIAKVLANRLKHVLCSVVSDAQSAFVSGRAITDNVIIGHECLHYIQHRKKGRDGFAALKLDM
           +R+ N T+IALIPK+  P  + DFRPISLCNV +KI++KVLANRLK VL  V+S+AQSAFV GR ITDNV++  E +H +  ++KG+ G  ALKLDM
Subjt:  RKSIREWNKTNIALIPKINTPSVVGDFRPISLCNVSHKIIAKVLANRLKHVLCSVVSDAQSAFVSGRAITDNVIIGHECLHYIQHRKKGRDGFAALKLDM

Query:  SKAYDRVEWSFLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLRQGDPLSPYLFVLCAEGLSHALSAAHVSRLISSVQIGTYCPSVS
        SKAYDRVEW FLE +MR MGFA  WI L+M C+  V ++VLIN   CG    SRG+RQGD LSPYLF++CAEGLS  L  A + + ++ V      P ++
Subjt:  SKAYDRVEWSFLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLRQGDPLSPYLFVLCAEGLSHALSAAHVSRLISSVQIGTYCPSVS

Query:  HLFFSDDSLVFFKANMVEGWHIKRILNEYESASGQCVNFLKSALLVSSNLSADGRVALGSVLGVPLVDDLGTYLGLPSRFPRSKGLCFRKTLERVKKVVQ
        HLFF+DDSL+F +A +     +  IL +YE ASGQ +N  K+++  + + S   R  +     VP +     YLGLPS   RSK   F +   RV + + 
Subjt:  HLFFSDDSLVFFKANMVEGWHIKRILNEYESASGQCVNFLKSALLVSSNLSADGRVALGSVLGVPLVDDLGTYLGLPSRFPRSKGLCFRKTLERVKKVVQ

Query:  GWKRSFFSTGGKETLIKSVVQAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRMKMHWKSWERCAYRRRLG
        GWK  F S  G+E LIK+V Q+IPTY+MS F+LP+S+C ++    S FWWG  +   K HW  W +    +  G
Subjt:  GWKRSFFSTGGKETLIKSVVQAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRMKMHWKSWERCAYRRRLG

A0A2N9ITS3 Reverse transcriptase domain-containing protein (e-value: 7.5e-169; identity: 41.56%)
Query:  WHFTGLYGQPKDHLRFQTWELLRLLHNFDDSSWIVGGDLNEIMWDSEKQGGAARAFELMSSFRIALEDCGLRDLGFRGDVFTWLSN--------------
        W  T +YG P+ HLR +TW L+R L       W   GD NEI+  SE QG   R    M  FR  L+DCG+ DLGFRG  FTW +N              
Subjt:  WHFTGLYGQPKDHLRFQTWELLRLLHNFDDSSWIVGGDLNEIMWDSEKQGGAARAFELMSSFRIALEDCGLRDLGFRGDVFTWLSN--------------

Query:  -----------------LDWACSNHRPVELSLEPLSRPVGFRARQPGFKFNAQWIHHAECREIIANCGDWSDRNGSTN--GLFHNLSSCSSRLRHWGREA
                         LD   S+H+ + L LEP ++P   + R+P F+F   W   + C   I    +W  R   T    +++ L +C   L +W R++
Subjt:  -----------------LDWACSNHRPVELSLEPLSRPVGFRARQPGFKFNAQWIHHAECREIIANCGDWSDRNGSTN--GLFHNLSSCSSRLRHWGREA

Query:  NRFLMKLILQKKQAIK--DAYSVTPVDFSIIHSLETDLARLLEEEEIYWHQRSKENWLKWGDRNTRWFHHQASERKKHNDIHGIHRDDGTWVTSESEVES
           + + + QK+Q +K  +A ++       + SL++++  LLE+EE  W QRS+ +WLK GDRNTR+FH QAS+R++ N I GI  + G W   + EV +
Subjt:  NRFLMKLILQKKQAIK--DAYSVTPVDFSIIHSLETDLARLLEEEEIYWHQRSKENWLKWGDRNTRWFHHQASERKKHNDIHGIHRDDGTWVTSESEVES

Query:  IFLDYFQNIFTSSELSVDQQLAILDNIPPVISLDMNAKLTASFCQAEIERAISQMFPTKALGPDGFPALFYQTYWDIVGAQTVANCLEVLNNRKSIREWN
        I +DY+++IF +S  S+ ++   + ++P V++  MN  LT  F   E+E A+ QM P KA GPDG P LFYQ +W +VG       L  LN+ + +   N
Subjt:  IFLDYFQNIFTSSELSVDQQLAILDNIPPVISLDMNAKLTASFCQAEIERAISQMFPTKALGPDGFPALFYQTYWDIVGAQTVANCLEVLNNRKSIREWN

Query:  KTNIALIPKINTPSVVGDFRPISLCNVSHKIIAKVLANRLKHVLCSVVSDAQSAFVSGRAITDNVIIGHECLHYIQHRKKGRDGFAALKLDMSKAYDRVE
         T I LIPK+  P  V +FRPISLCNV +K+++KV+ANRLK +L  ++SD+QSAFV GR ITDNV++  E LH++   K GRDG  ALKLDMSKAYDRVE
Subjt:  KTNIALIPKINTPSVVGDFRPISLCNVSHKIIAKVLANRLKHVLCSVVSDAQSAFVSGRAITDNVIIGHECLHYIQHRKKGRDGFAALKLDMSKAYDRVE

Query:  WSFLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLRQGDPLSPYLFVLCAEGLSHALSAAHVSRLISSVQIGTYCPSVSHLFFSDDS
        W++LE +MR+MGF   W+ +IM CI+ V +++L+N    G + PSRGLRQGDPLSPYLF+LCAEGL   +S A V   +  V +    P ++HLFF+DDS
Subjt:  WSFLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLRQGDPLSPYLFVLCAEGLSHALSAAHVSRLISSVQIGTYCPSVSHLFFSDDS

Query:  LVFFKANMVEGWHIKRILNEYESASGQCVNFLKSALLVSSNLSADGRVALGSVLGVPLVDDLGTYLGLPSRFPRSKGLCFRKTLERVKKVVQGWKRSFFS
        L+F KA   +   ++ IL  YE ASGQ +N  K+ +  S       + A+ + LGVP++     YLGLPS   +++  CF +  ERV   + GWK    S
Subjt:  LVFFKANMVEGWHIKRILNEYESASGQCVNFLKSALLVSSNLSADGRVALGSVLGVPLVDDLGTYLGLPSRFPRSKGLCFRKTLERVKKVVQGWKRSFFS

Query:  TGGKETLIKSVVQAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRMKMHWKSWERCAYRRRLGSSDY
          G+E LIKSV QAIPTY MS FRLP  +CQ++   + +FWWG    + K+ W  W     ++ LG   +
Subjt:  TGGKETLIKSVVQAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRMKMHWKSWERCAYRRRLGSSDY

SwissProt top hits (e-value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein (e-value: 4.9e-32; identity: 22.84%)
Query:  ERKKHNDIHGIHRDDGTWVTSESEVESIFLDYFQNIFTSSELSVDQQLAILDNIP-PVISLDMNAKLTASFCQAEIERAISQMFPTKALGPDGFPALFYQ
        ++++ N I  I  D G   T  +E+++   +Y+++++ +   ++++    LD    P ++ +    L      +EI   I+ +   K+ GPDGF A FYQ
Subjt:  ERKKHNDIHGIHRDDGTWVTSESEVESIFLDYFQNIFTSSELSVDQQLAILDNIP-PVISLDMNAKLTASFCQAEIERAISQMFPTKALGPDGFPALFYQ

Query:  TYWDIVGAQTVANCLEVLNNRKSIREWNKTNIALIPKINTPSVVGD-FRPISLCNVSHKIIAKVLANRLKHVLCSVVSDAQSAFVSGRAITDNVIIGHEC
         Y + +    +     +         + + +I LIPK    +   + FRPISL N+  KI+ K+LANR++  +  ++   Q  F+ G     N+    + 
Subjt:  TYWDIVGAQTVANCLEVLNNRKSIREWNKTNIALIPKINTPSVVGD-FRPISLCNVSHKIIAKVLANRLKHVLCSVVSDAQSAFVSGRAITDNVIIGHEC

Query:  LHYIQHRKKGRD-GFAALKLDMSKAYDRVEWSFLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLRQGDPLSPYLFVLCAEGLSHAL
        ++ IQH  + +D     + +D  KA+D+++  F+ + + ++G   +++ +I          +++N     +     G RQG PLSP LF +  E L+ A+
Subjt:  LHYIQHRKKGRD-GFAALKLDMSKAYDRVEWSFLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLRQGDPLSPYLFVLCAEGLSHAL

Query:  SAAHVSRLISSVQIGTYCPSVSHLFFSDDSLVFFKANMVEGWHIKRILNEYESASGQCVNFLKS-ALLVSSNLSADGRVALGSVLGVPLVDDLGTYLGLP
              + I  +Q+G     V    F+DD +V+ +  +V   ++ ++++ +   SG  +N  KS A L ++N   + ++     L   +      YLG+ 
Subjt:  SAAHVSRLISSVQIGTYCPSVSHLFFSDDSLVFFKANMVEGWHIKRILNEYESASGQCVNFLKS-ALLVSSNLSADGRVALGSVLGVPLVDDLGTYLGLP

Query:  SRFPRSKGLCFRKT----LERVKKVVQGWKRSFFSTGGKETLIKSVV--QAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRM
         +  R     F++     L+ +K+    WK    S  G+  ++K  +  + I  +     +LP +   E+ +   +F W    +R+
Subjt:  SRFPRSKGLCFRKT----LERVKKVVQGWKRSFFSTGGKETLIKSVV--QAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRM

P08548 LINE-1 reverse transcriptase homolog (e-value: 4.3e-28; identity: 24.17%)
Query:  IHGIHRDDGTWVTSESEVESIFLDYFQNIFT---SSELSVDQQLAILDNIPPVISLDMNAKLTASFCQAEIERAISQMFPTKALGPDGFPALFYQTYWDI
        I  I   +    T  SE++ I  +Y++ +++    +   +DQ L       P +S      L      +EI   I  +   K+ GPDGF + FYQT+ + 
Subjt:  IHGIHRDDGTWVTSESEVESIFLDYFQNIFT---SSELSVDQQLAILDNIPPVISLDMNAKLTASFCQAEIERAISQMFPTKALGPDGFPALFYQTYWDI

Query:  VGAQTVANCLEVLNN--RKSI--REWNKTNIALIPKI-NTPSVVGDFRPISLCNVSHKIIAKVLANRLKHVLCSVVSDAQSAFVSGRAITDNVIIGHECL
           + V   L +  N  ++ I    + + NI LIPK    P+   ++RPISL N+  KI+ K+L NR++  +  ++   Q  F+ G     N+    + +
Subjt:  VGAQTVANCLEVLNN--RKSI--REWNKTNIALIPKI-NTPSVVGDFRPISLCNVSHKIIAKVLANRLKHVLCSVVSDAQSAFVSGRAITDNVIIGHECL

Query:  HYIQHRKKGRD-GFAALKLDMSKAYDRVEWSFLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLRQGDPLSPYLFVLCAEGLSHALS
        + IQH  K ++     L +D  KA+D ++  F+ R ++++G    ++ LI    +     +++N V   S     G RQG PLSP LF +  E L+ A+ 
Subjt:  HYIQHRKKGRD-GFAALKLDMSKAYDRVEWSFLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLRQGDPLSPYLFVLCAEGLSHALS

Query:  AAHVSRLISSVQIGTYCPSVSHLFFSDDSLVFFKANMVEGWHIKRILNEYESASGQCVNFLKSALLVSSNLSADGRVALGSVLGVPLVDDLGTYLGLPSR
             + I  + IG+    +    F+DD +V+ +        +  ++ EY + SG  +N  KS   + +N +   +    S+    +V     YLG+   
Subjt:  AAHVSRLISSVQIGTYCPSVSHLFFSDDSLVFFKANMVEGWHIKRILNEYESASGQCVNFLKSALLVSSNLSADGRVALGSVLGVPLVDDLGTYLGLPSR

Query:  FPRSKGLCFRKTLERVKKV----VQGWKRSFFSTGGKETLIKSVV--QAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRM
          +     +++  E ++K     V  WK    S  G+  ++K  +  +AI  +     + P S  +++ + +  F W   + ++
Subjt:  FPRSKGLCFRKTLERVKKV----VQGWKRSFFSTGGKETLIKSVV--QAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRM

P11369 LINE-1 retrotransposable element ORF2 protein | e value: 9.8e-33 | %identity: 25.1
Query:  KKHND---IHGIHRDDGTWVTSESEVESIFLDYFQNIFTSSELSVDQQLAILDNIP-PVISLDMNAKLTASFCQAEIERAISQMFPTKALGPDGFPALFY
        K H D   I+ I  + G   T   E+++    +++ ++++   ++D+    LD    P ++ D    L +     EIE  I+ +   K+ GPDGF A FY
Subjt:  KKHND---IHGIHRDDGTWVTSESEVESIFLDYFQNIFTSSELSVDQQLAILDNIP-PVISLDMNAKLTASFCQAEIERAISQMFPTKALGPDGFPALFY

Query:  QTYWD--IVGAQTVANCLEVLNNRKSIREWNKTNIALIPK-INTPSVVGDFRPISLCNVSHKIIAKVLANRLKHVLCSVVSDAQSAFVSGRAITDNVIIG
        QT+ +  I     + + +EV     +   + +  I LIPK    P+ + +FRPISL N+  KI+ K+LANR++  + +++   Q  F+ G     N+   
Subjt:  QTYWD--IVGAQTVANCLEVLNNRKSIREWNKTNIALIPK-INTPSVVGDFRPISLCNVSHKIIAKVLANRLKHVLCSVVSDAQSAFVSGRAITDNVIIG

Query:  HECLHYIQHRKKGRDGFAALKLDMSKAYDRVEWSFLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLRQGDPLSPYLFVLCAEGLSH
           +HYI   K        + LD  KA+D+++  F+ +++ R G    ++ +I    +     + +N     +I    G RQG PLSPYLF +  E L+ 
Subjt:  HECLHYIQHRKKGRDGFAALKLDMSKAYDRVEWSFLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLRQGDPLSPYLFVLCAEGLSH

Query:  ALSAAHVSRLISSVQIGTYCPSVSHLFFSDDSLVFFKANMVEGWHIKRILNEYESASGQCVNFLKS-ALLVSSNLSADGRVALGSVLGVPLVDDLGTYLG
        A+      + I  +QIG     +S L  +DD +V+          +  ++N +    G  +N  KS A L + N  A+  +         +V +   YLG
Subjt:  ALSAAHVSRLISSVQIGTYCPSVSHLFFSDDSLVFFKANMVEGWHIKRILNEYESASGQCVNFLKS-ALLVSSNLSADGRVALGSVLGVPLVDDLGTYLG

Query:  --LPSRFPRSKGLCFRKTLERVKKVVQGWKRSFFSTGGKETLIKSVV--QAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRM
          L           F+   + +K+ ++ WK    S  G+  ++K  +  +AI  +     ++P     E+   + +F W + + R+
Subjt:  --LPSRFPRSKGLCFRKTLERVKKVVQGWKRSFFSTGGKETLIKSVV--QAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRM

P14381 Transposon TX1 uncharacterized 149 kDa protein | e value: 1.4e-31 | %identity: 25.69
Query:  RSKENWLKWGDRNTRWFHHQASERKKHN--DIHGIHRDDGTWVTSESEVESIFLDYFQNIFTSSELSVDQQLAILDNIPPVISLDMNAKLTASFCQAEIE
        RS+   L   DR +R+F+  A E+KK N   I  +  +DGT +     +      ++QN+F+   +S D    + D + PV+S     +L       E+ 
Subjt:  RSKENWLKWGDRNTRWFHHQASERKKHN--DIHGIHRDDGTWVTSESEVESIFLDYFQNIFTSSELSVDQQLAILDNIPPVISLDMNAKLTASFCQAEIE

Query:  RAISQMFPTKALGPDGFPALFYQTYWDIVGAQTVANCLEVLNNRKSIREWNKTNIALIPKINTPSVVGDFRPISLCNVSHKIIAKVLANRLKHVLCSVVS
        +A+  M   K+ G DG    F+Q +WD +G        E     +      +  ++L+PK     ++ ++RP+SL +  +KI+AK ++ RLK VL  V+ 
Subjt:  RAISQMFPTKALGPDGFPALFYQTYWDIVGAQTVANCLEVLNNRKSIREWNKTNIALIPKINTPSVVGDFRPISLCNVSHKIIAKVLANRLKHVLCSVVS

Query:  DAQSAFVSGRAITDNVIIGHECLHYIQHRKKGRDGFAALKLDMSKAYDRVEWSFLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLR
          QS  V GR I DNV +  + LH+   R+ G    A L LD  KA+DRV+  +L   ++   F   ++G +       E  V IN      +   RG+R
Subjt:  DAQSAFVSGRAITDNVIIGHECLHYIQHRKKGRDGFAALKLDMSKAYDRVEWSFLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVACGSIVPSRGLR

Query:  QGDPLSPYLFVLCAEGLSHALSAAHVSRLISSVQIGTYCPSVSHLFFSDDSLVFFKANMVEGWHIKRILNEYESASGQCVNFLKSALLVSSNLSADGRVA
        QG PLS  L+ L  E     L       ++    +     + +      D ++    ++V+    +     Y +AS   +N+ KS+ L+  +L  D    
Subjt:  QGDPLSPYLFVLCAEGLSHALSAAHVSRLISSVQIGTYCPSVSHLFFSDDSLVFFKANMVEGWHIKRILNEYESASGQCVNFLKSALLVSSNLSADGRVA

Query:  LGSVLG--VPLVDDLGTYLGLPSRFPRSKGLCFRKTLERVKKVVQGWK--RSFFSTGGKETLIKSVVQAIPTYAMSYFRLPKSICQEIIREVSRFWWGSS
            +     ++  LG YL     +P S+   F +  E V   +  WK      S  G+  +I  +V +   Y +      +    +I R +  F W   
Subjt:  LGSVLG--VPLVDDLGTYLGLPSRFPRSKGLCFRKTLERVKKVVQGWK--RSFFSTGGKETLIKSVVQAIPTYAMSYFRLPKSICQEIIREVSRFWWGSS

Query:  ESRMKMHWKS
           +  HW S
Subjt:  ESRMKMHWKS

P92555 Uncharacterized mitochondrial protein AtMg01250 | e value: 4.0e-10 | %identity: 45.59
Query:  LINRVACGSIVPSRGLRQGDPLSPYLFVLCAEGLSHALSAAHVSRLISSVQIGTYCPSVSHLFFSDDS
        +IN    G + PSRGLRQGDPLSPYLF+LC E LS     A     +  +++    P ++HL F+DD+
Subjt:  LINRVACGSIVPSRGLRQGDPLSPYLFVLCAEGLSHALSAAHVSRLISSVQIGTYCPSVSHLFFSDDS

Arabidopsis top hits | e value | %identity | Alignment
AT1G43760.1 DNAse I-like superfamily protein | e value: 2.6e-20 | %identity: 24.21
Query:  DSSWIVGGDLNEIMWDSEKQGGAARAFEL--MSSFRIALEDCGLRDLGFRGDVFTW----------------LSNLDW--------------ACSNHRPV
        D   I+ GD ++I   S+       +  +  +  F+  L D  L D+  RG  +TW                ++N DW                S+H P 
Subjt:  DSSWIVGGDLNEIMWDSEKQGGAARAFEL--MSSFRIALEDCGLRDLGFRGDVFTW----------------LSNLDW--------------ACSNHRPV

Query:  ELSLEPLS-------RPVGFRARQPGF--KFNAQWIHHAECREIIANCGDWSDRNGSTNGLFHNLSSCSSRLRHWGREANRFLMKLILQKKQAIKDAYSV
         + LE L        R   F +  P F       W         + + G+                 C   L   G    +   K  L   ++I+     
Subjt:  ELSLEPLS-------RPVGFRARQPGF--KFNAQWIHHAECREIIANCGDWSDRNGSTNGLFHNLSSCSSRLRHWGREANRFLMKLILQKKQAIKDAYSV

Query:  TPVD--FSIIHSLETDLARLLEEEEIYWHQRSKENWLKWGDRNTRWFHHQASERKKHNDIHGIHRDDGTWVTSESEVESIFLDYFQNIFTSSE--LSVDQ
         P D  F + H             E ++ Q+S+  WL+ GD NTR+FH      +  N I  +  DD   V + ++V+ + + Y+ ++  S    L+ D 
Subjt:  TPVD--FSIIHSLETDLARLLEEEEIYWHQRSKENWLKWGDRNTRWFHHQASERKKHNDIHGIHRDDGTWVTSESEVESIFLDYFQNIFTSSE--LSVDQ

Query:  QLAILDNIPPVISLDMNAKLTASFCQAEIERAISQMFPTKALGPDGFPALFYQTYWDIVGAQTVANCLEVLNNRKSIREWNKTNIALIPKINTPSVVGDF
           I D  P   +  + ++L+A     EI  A+  M   KA GPD F A F+   W +V   T+A   E       ++ +N T I LIPK+     +  F
Subjt:  QLAILDNIPPVISLDMNAKLTASFCQAEIERAISQMFPTKALGPDGFPALFYQTYWDIVGAQTVANCLEVLNNRKSIREWNKTNIALIPKINTPSVVGDF

Query:  RPISLCNVSHKII
        RP+S C V +KII
Subjt:  RPISLCNVSHKII

AT4G20520.1 RNA binding; RNA-directed DNA polymerases | e value: 3.6e-14 | %identity: 40.96
Query:  LANRLKHVLCSVVSDAQSAFVSGRAITDNVIIGHECLHYIQHRKKGRDGFAALKLDMSKAYDRVEWSFLERLMRRMGFADVWI
        +  RLK ++ +++  AQ++F+ GR  TDN++   E +H ++ RKKG  G+  LKLD+ KAYDR+ W +LE  +   GF +VW+
Subjt:  LANRLKHVLCSVVSDAQSAFVSGRAITDNVIIGHECLHYIQHRKKGRDGFAALKLDMSKAYDRVEWSFLERLMRRMGFADVWI

AT4G29090.1 Ribonuclease H-like superfamily protein | e value: 8.6e-08 | %identity: 45.45
Query:  AIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRMKMHWKSWE
        A+PTY M+ F LPK++C++II  ++ FWW + +    MHWK+W+
Subjt:  AIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRMKMHWKSWE

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 7.3e-07 | %identity: 42.22
Query:  AIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRMKMHWKSWER
        A+P YAMS FRL K +C+++   ++ FWW S E++ K+ W +W++
Subjt:  AIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRMKMHWKSWER

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) | e value: 2.9e-11 | %identity: 45.59
Query:  LINRVACGSIVPSRGLRQGDPLSPYLFVLCAEGLSHALSAAHVSRLISSVQIGTYCPSVSHLFFSDDS
        +IN    G + PSRGLRQGDPLSPYLF+LC E LS     A     +  +++    P ++HL F+DD+
Subjt:  LINRVACGSIVPSRGLRQGDPLSPYLFVLCAEGLSHALSAAHVSRLISSVQIGTYCPSVSHLFFSDDS


Sequences
CDS sequence
ATGTTGTTGTTCCGAGTTCTGTGTCTGTTTCGGCGGAGGCTGGTGTTCAGCTCCGCTGAGAATTATGAAAATGATATGTTGGAATGTTTGGGGGATGGGGACCCTTCGTT
CATTCCGGGCTTGTGGAAAGGGAAAACGTGGCATTTCACGGGGCTCTATGGCCAACCAAAGGACCATCTTCGTTTCCAAACGTGGGAATTGTTGCGACTTCTGCATAATT
TTGATGATTCCTCATGGATTGTTGGGGGCGATTTGAATGAGATTATGTGGGATTCTGAAAAACAAGGTGGTGCGGCACGAGCTTTTGAATTAATGTCGAGCTTTCGTATT
GCTTTGGAGGATTGTGGCCTTCGTGATCTTGGTTTTCGTGGTGATGTCTTCACGTGGTTATCGAATTTGGACTGGGCTTGCTCTAACCACCGTCCAGTGGAGCTTTCTTT
GGAACCACTTTCTCGGCCAGTGGGCTTTAGGGCTCGTCAACCTGGATTTAAGTTCAATGCCCAGTGGATTCATCACGCCGAATGTCGAGAAATTATTGCTAATTGTGGTG
ATTGGTCTGATAGAAATGGGTCTACGAATGGGTTATTCCATAATTTGTCCTCTTGCTCTTCTCGATTACGGCATTGGGGCAGGGAAGCGAATCGGTTTCTTATGAAATTG
ATTCTGCAAAAGAAACAGGCTATCAAAGATGCTTATTCGGTGACACCTGTGGACTTTTCGATTATTCACTCCCTAGAGACAGATTTGGCGAGGCTTTTGGAGGAAGAAGA
AATATATTGGCATCAACGATCTAAGGAAAATTGGCTCAAATGGGGTGATAGAAATACACGATGGTTCCATCATCAGGCATCAGAACGGAAAAAGCATAATGATATTCATG
GGATTCATAGGGATGATGGTACCTGGGTCACTTCCGAGTCGGAGGTGGAGTCGATTTTTCTGGACTATTTTCAGAACATTTTTACGTCGTCAGAGCTGTCTGTTGATCAA
CAACTTGCTATTTTGGATAATATTCCTCCTGTTATCTCCCTGGATATGAATGCAAAGCTGACTGCGTCGTTCTGTCAAGCAGAAATTGAGCGGGCTATTTCTCAAATGTT
TCCAACGAAGGCCCTGGGTCCGGATGGTTTTCCTGCTCTTTTTTACCAGACTTATTGGGATATTGTTGGAGCTCAAACTGTTGCCAACTGTCTTGAGGTGCTGAATAATA
GGAAATCTATTCGCGAATGGAACAAAACAAATATTGCTCTTATTCCAAAGATTAATACCCCATCTGTGGTGGGTGATTTTCGCCCAATCAGCCTGTGCAATGTATCGCAC
AAGATTATAGCCAAGGTTTTGGCGAACAGATTGAAACATGTCCTGTGCTCTGTGGTTTCAGATGCTCAATCGGCTTTTGTGTCTGGTCGTGCAATTACTGATAATGTTAT
CATTGGTCATGAATGTCTACACTATATTCAACATCGAAAAAAAGGGCGTGATGGTTTTGCTGCATTGAAATTAGATATGAGCAAAGCCTATGATCGAGTTGAATGGTCTT
TCCTGGAGCGACTCATGCGTCGTATGGGGTTTGCTGATGTGTGGATTGGTCTTATTATGGATTGTATTACGATGGTTGAATTTGCAGTTCTTATTAACCGTGTGGCTTGT
GGGAGTATTGTTCCGAGTCGGGGTCTTCGCCAGGGGGACCCTCTTTCGCCGTATTTGTTTGTGTTGTGTGCCGAAGGCTTGTCTCATGCTTTATCAGCTGCTCATGTGTC
ACGCCTTATTTCCAGCGTCCAGATTGGTACGTACTGTCCCTCTGTTTCCCATTTATTTTTTTCAGATGATAGTTTGGTCTTTTTTAAGGCCAATATGGTGGAGGGTTGGC
ATATTAAACGAATTTTGAATGAGTACGAGTCAGCCTCGGGCCAATGTGTGAATTTTTTAAAATCGGCTTTACTAGTTTCGTCAAATTTGTCTGCCGATGGTAGGGTGGCT
TTGGGATCGGTACTGGGCGTTCCTCTTGTGGATGATTTAGGTACGTATTTGGGGTTGCCATCTCGCTTCCCGCGTTCGAAAGGGTTGTGTTTCAGGAAGACTCTTGAGAG
GGTTAAGAAGGTGGTACAAGGATGGAAGCGTTCTTTCTTTTCTACAGGTGGGAAGGAGACTCTTATTAAAAGTGTGGTGCAGGCGATCCCGACCTATGCAATGAGCTATT
TCCGACTCCCTAAGTCCATTTGTCAAGAAATCATTAGGGAGGTTTCACGCTTTTGGTGGGGTTCGTCTGAGTCTCGTATGAAAATGCATTGGAAATCGTGGGAAAGATGT
GCTTACCGAAGGAGGTTGGGGTCTTCTGATTATGCTTTGGCCAAGGTGAGTGATTTTGTTACTTCTTCTAGAGAGTGGGATGTTGAGAAGTTGGCTGATTTGCTTACTCT
AGAGGATCTCCAGCTGGTTCAATGTCTCCCTATTGGTCCGTCAGGGGCGGAGGATATTTGGCTTTGGCACTACGATAAACGAGGGGTGTATACGGTTAAGAGCGGGTACA
AACTTTGTATGCTTCAGGGTCAAGTGTCGTAG
mRNA sequence
ATGTTGTTGTTCCGAGTTCTGTGTCTGTTTCGGCGGAGGCTGGTGTTCAGCTCCGCTGAGAATTATGAAAATGATATGTTGGAATGTTTGGGGGATGGGGACCCTTCGTT
CATTCCGGGCTTGTGGAAAGGGAAAACGTGGCATTTCACGGGGCTCTATGGCCAACCAAAGGACCATCTTCGTTTCCAAACGTGGGAATTGTTGCGACTTCTGCATAATT
TTGATGATTCCTCATGGATTGTTGGGGGCGATTTGAATGAGATTATGTGGGATTCTGAAAAACAAGGTGGTGCGGCACGAGCTTTTGAATTAATGTCGAGCTTTCGTATT
GCTTTGGAGGATTGTGGCCTTCGTGATCTTGGTTTTCGTGGTGATGTCTTCACGTGGTTATCGAATTTGGACTGGGCTTGCTCTAACCACCGTCCAGTGGAGCTTTCTTT
GGAACCACTTTCTCGGCCAGTGGGCTTTAGGGCTCGTCAACCTGGATTTAAGTTCAATGCCCAGTGGATTCATCACGCCGAATGTCGAGAAATTATTGCTAATTGTGGTG
ATTGGTCTGATAGAAATGGGTCTACGAATGGGTTATTCCATAATTTGTCCTCTTGCTCTTCTCGATTACGGCATTGGGGCAGGGAAGCGAATCGGTTTCTTATGAAATTG
ATTCTGCAAAAGAAACAGGCTATCAAAGATGCTTATTCGGTGACACCTGTGGACTTTTCGATTATTCACTCCCTAGAGACAGATTTGGCGAGGCTTTTGGAGGAAGAAGA
AATATATTGGCATCAACGATCTAAGGAAAATTGGCTCAAATGGGGTGATAGAAATACACGATGGTTCCATCATCAGGCATCAGAACGGAAAAAGCATAATGATATTCATG
GGATTCATAGGGATGATGGTACCTGGGTCACTTCCGAGTCGGAGGTGGAGTCGATTTTTCTGGACTATTTTCAGAACATTTTTACGTCGTCAGAGCTGTCTGTTGATCAA
CAACTTGCTATTTTGGATAATATTCCTCCTGTTATCTCCCTGGATATGAATGCAAAGCTGACTGCGTCGTTCTGTCAAGCAGAAATTGAGCGGGCTATTTCTCAAATGTT
TCCAACGAAGGCCCTGGGTCCGGATGGTTTTCCTGCTCTTTTTTACCAGACTTATTGGGATATTGTTGGAGCTCAAACTGTTGCCAACTGTCTTGAGGTGCTGAATAATA
GGAAATCTATTCGCGAATGGAACAAAACAAATATTGCTCTTATTCCAAAGATTAATACCCCATCTGTGGTGGGTGATTTTCGCCCAATCAGCCTGTGCAATGTATCGCAC
AAGATTATAGCCAAGGTTTTGGCGAACAGATTGAAACATGTCCTGTGCTCTGTGGTTTCAGATGCTCAATCGGCTTTTGTGTCTGGTCGTGCAATTACTGATAATGTTAT
CATTGGTCATGAATGTCTACACTATATTCAACATCGAAAAAAAGGGCGTGATGGTTTTGCTGCATTGAAATTAGATATGAGCAAAGCCTATGATCGAGTTGAATGGTCTT
TCCTGGAGCGACTCATGCGTCGTATGGGGTTTGCTGATGTGTGGATTGGTCTTATTATGGATTGTATTACGATGGTTGAATTTGCAGTTCTTATTAACCGTGTGGCTTGT
GGGAGTATTGTTCCGAGTCGGGGTCTTCGCCAGGGGGACCCTCTTTCGCCGTATTTGTTTGTGTTGTGTGCCGAAGGCTTGTCTCATGCTTTATCAGCTGCTCATGTGTC
ACGCCTTATTTCCAGCGTCCAGATTGGTACGTACTGTCCCTCTGTTTCCCATTTATTTTTTTCAGATGATAGTTTGGTCTTTTTTAAGGCCAATATGGTGGAGGGTTGGC
ATATTAAACGAATTTTGAATGAGTACGAGTCAGCCTCGGGCCAATGTGTGAATTTTTTAAAATCGGCTTTACTAGTTTCGTCAAATTTGTCTGCCGATGGTAGGGTGGCT
TTGGGATCGGTACTGGGCGTTCCTCTTGTGGATGATTTAGGTACGTATTTGGGGTTGCCATCTCGCTTCCCGCGTTCGAAAGGGTTGTGTTTCAGGAAGACTCTTGAGAG
GGTTAAGAAGGTGGTACAAGGATGGAAGCGTTCTTTCTTTTCTACAGGTGGGAAGGAGACTCTTATTAAAAGTGTGGTGCAGGCGATCCCGACCTATGCAATGAGCTATT
TCCGACTCCCTAAGTCCATTTGTCAAGAAATCATTAGGGAGGTTTCACGCTTTTGGTGGGGTTCGTCTGAGTCTCGTATGAAAATGCATTGGAAATCGTGGGAAAGATGT
GCTTACCGAAGGAGGTTGGGGTCTTCTGATTATGCTTTGGCCAAGGTGAGTGATTTTGTTACTTCTTCTAGAGAGTGGGATGTTGAGAAGTTGGCTGATTTGCTTACTCT
AGAGGATCTCCAGCTGGTTCAATGTCTCCCTATTGGTCCGTCAGGGGCGGAGGATATTTGGCTTTGGCACTACGATAAACGAGGGGTGTATACGGTTAAGAGCGGGTACA
AACTTTGTATGCTTCAGGGTCAAGTGTCGTAG
Protein sequence
MLLFRVLCLFRRRLVFSSAENYENDMLECLGDGDPSFIPGLWKGKTWHFTGLYGQPKDHLRFQTWELLRLLHNFDDSSWIVGGDLNEIMWDSEKQGGAARAFELMSSFRI
ALEDCGLRDLGFRGDVFTWLSNLDWACSNHRPVELSLEPLSRPVGFRARQPGFKFNAQWIHHAECREIIANCGDWSDRNGSTNGLFHNLSSCSSRLRHWGREANRFLMKL
ILQKKQAIKDAYSVTPVDFSIIHSLETDLARLLEEEEIYWHQRSKENWLKWGDRNTRWFHHQASERKKHNDIHGIHRDDGTWVTSESEVESIFLDYFQNIFTSSELSVDQ
QLAILDNIPPVISLDMNAKLTASFCQAEIERAISQMFPTKALGPDGFPALFYQTYWDIVGAQTVANCLEVLNNRKSIREWNKTNIALIPKINTPSVVGDFRPISLCNVSH
KIIAKVLANRLKHVLCSVVSDAQSAFVSGRAITDNVIIGHECLHYIQHRKKGRDGFAALKLDMSKAYDRVEWSFLERLMRRMGFADVWIGLIMDCITMVEFAVLINRVAC
GSIVPSRGLRQGDPLSPYLFVLCAEGLSHALSAAHVSRLISSVQIGTYCPSVSHLFFSDDSLVFFKANMVEGWHIKRILNEYESASGQCVNFLKSALLVSSNLSADGRVA
LGSVLGVPLVDDLGTYLGLPSRFPRSKGLCFRKTLERVKKVVQGWKRSFFSTGGKETLIKSVVQAIPTYAMSYFRLPKSICQEIIREVSRFWWGSSESRMKMHWKSWERC
AYRRRLGSSDYALAKVSDFVTSSREWDVEKLADLLTLEDLQLVQCLPIGPSGAEDIWLWHYDKRGVYTVKSGYKLCMLQGQVS
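
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: the `translate` helper and the `CODON_TABLE` name are hypothetical, and the code assumes the standard (nuclear) codon table and an in-frame CDS that starts at the first base.

```python
# Standard genetic code, built from the conventional T, C, A, G codon ordering.
# Assumption: the standard nuclear code applies to this gene.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence block
    can be pasted in directly. Codons containing characters other than
    A, C, G, T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(seq), 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":  # stop codon ends the translation
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 18 bases and last 12 bases of the CDS listed above.
    print(translate("ATGTTGTTGTTCCGAGTT"))  # MLLFRV
    print(translate("CAAGTGTCGTAG"))        # QVS
```

The two short fragments reproduce the first six residues (MLLFRV) and the last three residues (QVS) of the protein sequence listed above. Passing the full CDS block to `translate` and comparing the result with the full protein block would extend the same check to the whole gene.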