; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy06g010760 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy06g010760
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr06:11943924..11946571
RNA-Seq ExpressionLcy06g010760
SyntenyLcy06g010760
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_012435028.1 PREDICTED: uncharacterized protein LOC105761689 [Gossypium raimondii]3.6e-10447.01Show/hide
Query:  NKGDVQGARVQIENLLGEEEIYWRQRAWVDWMKWGDRNSKWFHQRATQRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFFR
        N  ++   ++Q+   + ++E YW QRA ++W++ GDRN+ +FH++ATQRR+ N++  L   DG+  ++ +  E+I   YFQ LF ++GQ    R+     
Subjt:  NKGDVQGARVQIENLLGEEEIYWRQRAWVDWMKWGDRNSKWFHQRATQRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFFR

Query:  HIQQCISQEMNSHLTSNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISLC
         I+ CI++E N+ L + YT++E+L +L +MGP KAPGED  PALFYQ+ W +VG++V+N CL  LN G  + ++N+T IVLIPK     R++ +RPISLC
Subjt:  HIQQCISQEMNSHLTSNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISLC

Query:  NVIYKLISKTIANRLKGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCV
        NVIYKLI+K IA R++GV+   I   QSAFVPGR I DN +L +E LH LK +R GK G++A+KLDMSKAYDRVEW F+E+ M  +GFD   V  I+ CV
Subjt:  NVIYKLISKTIANRLKGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCV

Query:  STVSYSFMLNGQRKGNITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSIIT
        STVSY  ++NG R   I P RG+R+GDPLSP+LFLFC EGLS ++   + + ++ G + ++   +ISHL F DDC LF +A  R    +  +L  Y + +
Subjt:  STVSYSFMLNGQRKGNITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSIIT

Query:  GQ
        GQ
Subjt:  GQ

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]5.6e-10547.38Show/hide
Query:  DVQGARVQIENLLGEEEIYWRQRAWVDWMKWGDRNSKWFHQRATQRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFFRHIQ
        ++   R +I  LL +EE YW QRA   W+K GDRN+K+FH +A++RR+ N + G+WD  G+W ++EE + Q    YF N++ S+      ++ E    I 
Subjt:  DVQGARVQIENLLGEEEIYWRQRAWVDWMKWGDRNSKWFHQRATQRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFFRHIQ

Query:  QCISQEMNSHLTSNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISLCNVI
          +++EMN  L   +T++E+  +LKQ+ P KAPG D + A+F+Q++W +VG  VT++ L+VLNH   +  LN+T I LIPK    +R++D+RPISLCNV+
Subjt:  QCISQEMNSHLTSNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISLCNVI

Query:  YKLISKTIANRLKGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVSTV
        YKLISK +ANRLK +L HIIS  QSAF   R I DN ++ FE +HYL  +  GK+G++A+KLDMSKA+DRVEW FI K M  +GF  +   L+M C+++V
Subjt:  YKLISKTIANRLKGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVSTV

Query:  SYSFMLNGQRKGNITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSIITGQK
        SYS ++NG   GNI P RGLR+GDPLSP LFL C EGLS ++    R+ LI+G  I R C  ++HLFF DD  LF KA   E  ++ ++L  Y   +GQK
Subjt:  SYSFMLNGQRKGNITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSIITGQK

Query:  I
        I
Subjt:  I

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]3.9e-10646.53Show/hide
Query:  GDVQGARVQIENLLGEEEIYWRQRAWVDWMKWGDRNSKWFHQRATQRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFFRHI
        G++   R +I  LL  EEI W+QR+ V W+  GDRN+K+FH +A+ RRR N + G+ D +G W +  EG+ ++   YFQ ++ S+      R+ E    I
Subjt:  GDVQGARVQIENLLGEEEIYWRQRAWVDWMKWGDRNSKWFHQRATQRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFFRHI

Query:  QQCISQEMNSHLTSNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISLCNV
           +++EMN  L   +T +E+  +L QM P KAPG D + A+F+Q++W +VG ++  + L VLN   S+  +N+T I L+PK +   ++SD+RPISLCNV
Subjt:  QQCISQEMNSHLTSNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISLCNV

Query:  IYKLISKTIANRLKGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVST
        +YKLISK +ANRLK +L  IIS  QSAF+ GR I DN ++ FE +HYL+ +++GK+G+ A+KLDMSKAYDRVEW FI++ M  +GF EK +KL+M C+++
Subjt:  IYKLISKTIANRLKGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVST

Query:  VSYSFMLNGQRKGNITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSIITGQ
        VSYS ++NG   G+ITP RGLR+GDP+SPY+FL C +G S +L  + R   ISG  I R C  I+HLFF DD  LF KA  +E   + ++L  Y   +GQ
Subjt:  VSYSFMLNGQRKGNITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSIITGQ

Query:  KIKL
        KI +
Subjt:  KIKL

XP_023901742.1 uncharacterized protein LOC112013579 [Quercus suber]2.4e-10346.97Show/hide
Query:  RVQIENLLGEEEIYWRQRAWVDWMKWGDRNSKWFHQRATQRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFFRHIQQCISQ
        R ++ +LL  EEI WRQR+ V W + GDRN+K+FH RA++RR+ N +  LW+ DG W + +E +      YF+N++ S+   G   ++E    I + ++ 
Subjt:  RVQIENLLGEEEIYWRQRAWVDWMKWGDRNSKWFHQRATQRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFFRHIQQCISQ

Query:  EMNSHLTSNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISLCNVIYKLIS
        EMNS L+  +T +E+L +LKQ+ P KAPG D + A F+  +W +VG  +TN+ L+VLN    +  +N+T I LIPK     R++++RPISLCN  YK+IS
Subjt:  EMNSHLTSNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISLCNVIYKLIS

Query:  KTIANRLKGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVSTVSYSFM
        K +ANR K +L +IIS  QSAF P R I DN ++ FE +HYL  + +GK+ ++++KLDMSKA+DRVEW FI+  M  +GF EK + LIM CVS+VSYS +
Subjt:  KTIANRLKGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVSTVSYSFM

Query:  LNGQRKGNITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSIITGQKI
        +NG+  GNITP RG+R+GDPLSP LFL C EGLS ++    R+  I+G  I R C  I+HLFF DD  LF KA+ +E   + ++L  Y   +GQKI
Subjt:  LNGQRKGNITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSIITGQKI

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]1.4e-10347.17Show/hide
Query:  NKGDVQGARVQIENLLGEEEIYWRQRAWVDWMKWGDRNSKWFHQRATQRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQQGSG-RLHEFF
        +K +      ++++LL ++EIYW QR+ ++W++ GDRN+K+FH +A+QRRR N + G+ +  GQW+E+ E + Q+ + YF NLF    Q G+G ++ E  
Subjt:  NKGDVQGARVQIENLLGEEEIYWRQRAWVDWMKWGDRNSKWFHQRATQRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQQGSG-RLHEFF

Query:  RHIQQCISQEMNSHLTSNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISL
          +   ++++M   L++ +T +E+ A+L QMGP KAPG D + ALFYQ+FW +VG  V +  L  LN+G  +  +N T IVLIPK +  +R+S++RPISL
Subjt:  RHIQQCISQEMNSHLTSNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISL

Query:  CNVIYKLISKTIANRLKGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGC
        CNVIYK+ISK +ANRLK VL  IIS TQSAFVPGR I DN ++ +ETLH +  R+KGK G VALKLD+SKAYDRVEW F++  M  +GF    ++ +M C
Subjt:  CNVIYKLISKTIANRLKGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGC

Query:  VSTVSYSFMLNGQRKGNITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSII
        V+T S+S ++NG+    I P RG+R+GDP+SPYLFL C EGL+ +L   E +G+I+G  I R    I++L F DD  LF +A   E   IA +L  Y   
Subjt:  VSTVSYSFMLNGQRKGNITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSII

Query:  TGQKIKL
        +GQ I L
Subjt:  TGQKIKL

TrEMBL top hitse value%identityAlignment
A0A2N9E9A1 Reverse transcriptase domain-containing protein2.2e-10747.98Show/hide
Query:  RVQIENLLGEEEIYWRQRAWVDWMKWGDRNSKWFHQRATQRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFFRHIQQCISQ
        + ++  LLG+EE  WRQR+ ++W++ GD+N+++FH RATQRRR NR+  L D  G W+  +  + Q+   ++ +LF S       ++ +    I + ++ 
Subjt:  RVQIENLLGEEEIYWRQRAWVDWMKWGDRNSKWFHQRATQRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFFRHIQQCISQ

Query:  EMNSHLTSNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISLCNVIYKLIS
        EMN HLT ++   E+L ++KQM P+K+PG D  P +FYQ++W ++G++V+   L  LN G  ++A+N T I LIPK +  + V D+RPISLCNVIYK+IS
Subjt:  EMNSHLTSNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISLCNVIYKLIS

Query:  KTIANRLKGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVSTVSYSFM
        K + NRLK +L  I+S +QSAFVPGR I DN ++ FETLH++ Q+R+GK G VALKLDMSKAYDRVEW ++E+ M  +GF EK VK++M C+STVSYS +
Subjt:  KTIANRLKGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVSTVSYSFM

Query:  LNGQRKGNITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSIITGQKI
        +NG+  G I P RGLR+GDPLSPYLFLFC EGL  +L   + +G + G  I+R    ++HLFF DD  LF KA   E   I ++L  Y   +GQ+I
Subjt:  LNGQRKGNITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSIITGQKI

A0A2N9EDY7 Reverse transcriptase domain-containing protein9.9e-10849.75Show/hide
Query:  QIENLLGEEEIYWRQRAWVDWMKWGDRNSKWFHQRATQRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFFRHIQQCISQEM
        ++  LL +EE  WRQR+ V W++ GDRN+++FH RA+QRRR NR+ GL D  G W E++     ++  +F+++F ++  +    + E   H+   ISQE+
Subjt:  QIENLLGEEEIYWRQRAWVDWMKWGDRNSKWFHQRATQRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFFRHIQQCISQEM

Query:  NSHLTSNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISLCNVIYKLISKT
        N+ LTS +T  E+  +LKQM PLKAPG D +P LF+Q++WK+VG EVT   L  LN G  +  +N T I LIPK +  +R++++RPISLCNV YKLISK 
Subjt:  NSHLTSNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISLCNVIYKLISKT

Query:  IANRLKGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVSTVSYSFMLN
        IANRLKG+L  IIS  QSAFVPGR I DN ++ FETLH++   + GKDG +A+KLDMSKAYDRVEW F+EK M  +GF  + V LIM C+STVSYS ++N
Subjt:  IANRLKGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVSTVSYSFMLN

Query:  GQRKGNITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSIITGQKI
        G+  G + P RG+R+GDPLSPYLFL C EGL  +++  +  G + G  + R    I+HLFF DD  LF KA +RE  +I  +L  Y   +GQ++
Subjt:  GQRKGNITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSIITGQKI

A0A2N9GM07 Reverse transcriptase domain-containing protein9.9e-10848.88Show/hide
Query:  DVQGARVQIENLLGEEEIYWRQRAWVDWMKWGDRNSKWFHQRATQRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFFRHIQ
        ++   R  +  L+  EE  W+QRA V W+  GD+N+++FH +A+QRR+ N ++GL+   G W+ ++  ++  V  YF+ +F ++       + E  R IQ
Subjt:  DVQGARVQIENLLGEEEIYWRQRAWVDWMKWGDRNSKWFHQRATQRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFFRHIQ

Query:  QCISQEMNSHLTSNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISLCNVI
          +++ MN  L  N+T +E+  +L+QM P KAPG D + A+F+Q++W +VGKEVT   L VLN   S  A N+T I LIPK +  QR++++RPISLCNV 
Subjt:  QCISQEMNSHLTSNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISLCNVI

Query:  YKLISKTIANRLKGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVSTV
        YKLISK IANRLK VL  +IS TQSAFVPGRNI DNA++ FE +HY +Q+R GKD ++ALKLDMSKAYDRVEW FIE+ M  +GF EK + LIM C++TV
Subjt:  YKLISKTIANRLKGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVSTV

Query:  SYSFMLNGQRKGNITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSIITGQK
         YS  +NG   GNI P RGLR+GDPLSPYLFL C EG S +L   E + LI G  + R    ++HLFF DD  LF KA   +   + N+  TY   +GQK
Subjt:  SYSFMLNGQRKGNITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSIITGQK

Query:  IKL
        I +
Subjt:  IKL

A0A2N9HPU0 Uncharacterized protein1.3e-10750Show/hide
Query:  QIENLLGEEEIYWRQRAWVDWMKWGDRNSKWFHQRATQRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFFRHIQQCISQEM
        ++  LL +EE  WRQR+ V W++ GDRN+++FH RA+Q RR NR+ GL D  G W E++  +  ++  +F+++F ++  +    + E   H+   ISQE+
Subjt:  QIENLLGEEEIYWRQRAWVDWMKWGDRNSKWFHQRATQRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFFRHIQQCISQEM

Query:  NSHLTSNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISLCNVIYKLISKT
        N+ LTS +T  E+  +LKQM PLKAPG D +P LF+Q +WKVVG EVT   L  LN G  +  +N T I LIPK +  +R++++RPISLCNV YKLISK 
Subjt:  NSHLTSNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISLCNVIYKLISKT

Query:  IANRLKGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVSTVSYSFMLN
        IANRLKG+L  IIS  QSAFVPGR I DN ++ FETLH++   + GKDG +A+KLDMSKAYDRVEW F+EK M  +GF  + V LIM C+STVSYS ++N
Subjt:  IANRLKGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVSTVSYSFMLN

Query:  GQRKGNITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSIITGQKI
        G+  G + P RG+R+GDPLSPYLFL C EGL  +++  +  G + G  + R    I+HLFF DD  LF KA +RE  +I  +L TY   +GQ++
Subjt:  GQRKGNITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSIITGQKI

A0A7N2L6Z9 Reverse transcriptase domain-containing protein9.9e-10847.63Show/hide
Query:  DVQGARVQIENLLGEEEIYWRQRAWVDWMKWGDRNSKWFHQRATQRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFFRHIQ
        ++   R ++  LL +EEI+W QR+ V W+K GDRN+K+FH RA++RR+ N + G+WD  G+W ED + +      YF++++ ++       + E    I 
Subjt:  DVQGARVQIENLLGEEEIYWRQRAWVDWMKWGDRNSKWFHQRATQRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFFRHIQ

Query:  QCISQEMNSHLTSNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISLCNVI
          I++EMN+ L+  +T +E++ +LKQ+ P K+PG D + A+F+Q++W +VG  V+N+ L+VLN+G S++ +N+T IVLIPK    +R++D+RPISLCNVI
Subjt:  QCISQEMNSHLTSNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISLCNVI

Query:  YKLISKTIANRLKGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVSTV
        YKLISKT+ANRLK  L  II+  QSAF   R I DN ++ +E +HYLK ++ GKD ++A KLDMSKA+DRVEW FIE+ M  +GF+E  + LIM C+S+V
Subjt:  YKLISKTIANRLKGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVSTV

Query:  SYSFMLNGQRKGNITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSIITGQK
        SYS ++NG+  GNI P RGLR+GDPLSPYLFL C EGLS +L    R+ L++G  + R C  I+HLFF DD  LF KA   E   +  +L  Y   +GQK
Subjt:  SYSFMLNGQRKGNITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSIITGQK

Query:  I
        +
Subjt:  I

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein7.0e-2623.18Show/hide
Query:  DRNSKWFHQRAT-----------QRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFF-RHIQQCISQEMNSHLTSNYTEDEL
        + +  WF +R             ++R  N+++ + +  G    D   ++  +  Y+++L+ +N  +    +  F   +    ++QE    L    T  E+
Subjt:  DRNSKWFHQRAT-----------QRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFF-RHIQQCISQEMNSHLTSNYTEDEL

Query:  LASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPK-CRGAQRVSDYRPISLCNVIYKLISKTIANRLKGVLDHI
        +A +  +   K+PG D   A FYQR+ + +   +  +   +   G    +  +  I+LIPK  R   +  ++RPISL N+  K+++K +ANR++  +  +
Subjt:  LASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPK-CRGAQRVSDYRPISLCNVIYKLISKTIANRLKGVLDHI

Query:  ISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVSTVSYSFMLNGQRKGNITPFRG
        I   Q  F+PG     N       + ++  R K K+  V + +D  KA+D+++  F+ K +  +G D   +K+I       + + +LNGQ+        G
Subjt:  ISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVSTVSYSFMLNGQRKGNITPFRG

Query:  LREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSIITGQKIKL
         R+G PLSP LF   +E L+R    + ++  I G ++ +    +    F DD  ++ +  +     +  ++  +S ++G KI +
Subjt:  LREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSIITGQKIKL

P08548 LINE-1 reverse transcriptase homolog1.6e-2525.15Show/hide
Query:  DEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFFRHIQQC----ISQEMNSHLTSNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLH
        D   +++I++ Y++ L+    +     L E  ++++ C    +SQ+    L    +  E+ ++++ +   K+PG D   + FYQ F + +   + N+  +
Subjt:  DEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFFRHIQQC----ISQEMNSHLTSNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLH

Query:  VLNHGGSVEALNQTVIVLIPK-CRGAQRVSDYRPISLCNVIYKLISKTIANRLKGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVA
        +   G       +  I LIPK  +   R  +YRPISL N+  K+++K + NR++  +  II   Q  F+PG     N       + ++  + K KD  + 
Subjt:  VLNHGGSVEALNQTVIVLIPK-CRGAQRVSDYRPISLCNVIYKLISKTIANRLKGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVA

Query:  LKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVSTVSYSFMLNGQRKGNITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARA
        L +D  KA+D ++  F+ + +  +G +   +KLI    S  + + +LNG +  +     G R+G PLSP LF   +E L+     +  +  I G  I   
Subjt:  LKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVSTVSYSFMLNGQRKGNITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARA

Query:  CHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSIITGQKI
           I    F DD  ++ +     T  +  ++  YS ++G KI
Subjt:  CHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSIITGQKI

P11369 LINE-1 retrotransposable element ORF2 protein2.6e-2827.79Show/hide
Query:  GQWLEDEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFF-RHIQQCISQEMNSHLTSNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVC
        G    D E ++  +  +++ L+ S   +    + +F  R+    ++Q+   HL S  +  E+ A +  +   K+PG D   A FYQ F     KE     
Subjt:  GQWLEDEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFF-RHIQQCISQEMNSHLTSNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVC

Query:  LHVLNHGGSVE-----ALNQTVIVLIPK-CRGAQRVSDYRPISLCNVIYKLISKTIANRLKGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRK
        LH L H   VE     +  +  I LIPK  +   ++ ++RPISL N+  K+++K +ANR++  +  II P Q  F+PG     N       +HY+  + K
Subjt:  LHVLNHGGSVE-----ALNQTVIVLIPK-CRGAQRVSDYRPISLCNVIYKLISKTIANRLKGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRK

Query:  GKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVSTVSYSFMLNGQRKGNITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLIS
         K+  + + LD  KA+D+++  F+ K +   G     + +I    S    +  +NG++   I    G R+G PLSPYLF   +E L+R    + +   I 
Subjt:  GKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVSTVSYSFMLNGQRKGNITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLIS

Query:  GTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSIITGQKI
        G +I +    IS L   DD  ++       T  + N++ ++  + G KI
Subjt:  GTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSIITGQKI

P14381 Transposon TX1 uncharacterized 149 kDa protein1.0e-2929.93Show/hide
Query:  DRNSKWFHQRATQRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFFRHIQQCISQEMNSHLTSNYTEDELLASLKQMGPLKA
        DR S++F+    ++    ++  L+  DG  LED E +      ++QNLF S          E +  +   +S+     L +  T DEL  +L+ M   K+
Subjt:  DRNSKWFHQRATQRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFFRHIQQCISQEMNSHLTSNYTEDELLASLKQMGPLKA

Query:  PGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISLCNVIYKLISKTIANRLKGVLDHIISPTQSAFVPGRN
        PG D L   F+Q FW  +G +   V       G    +  + V+ L+PK    + + ++RP+SL +  YK+++K I+ RLK VL  +I P QS  VPGR 
Subjt:  PGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISLCNVIYKLISKTIANRLKGVLDHIISPTQSAFVPGRN

Query:  ICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVSTVSYSFMLNGQRKGNITPFRGLREGDPLSPYLFL
        I DN  L  + LH+   RR G      L LD  KA+DRV+  ++   +    F  + V  +    ++      +N      +   RG+R+G PLS  L+ 
Subjt:  ICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVSTVSYSFMLNGQRKGNITPFRGLREGDPLSPYLFL

Query:  FCVE
          +E
Subjt:  FCVE

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM1.2e-1426.41Show/hide
Query:  SNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISLCNVIYKLISKTIANRL
        S  TE +L AS   +    +PG D +     +     +   + N+   +L  G    ++     V IPK   A+R  D+RPIS+ +V+ + ++  +A RL
Subjt:  SNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISLCNVIYKLISKTIANRL

Query:  KGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVSTVSYSFMLNGQRKG
           ++    P Q  F+P     DNA +    L +    +  +  ++A  LD+SKA+D +    I   +   G  +  V  +         S   +G    
Subjt:  KGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVSTVSYSFMLNGQRKG

Query:  NITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSII
           P RG+++GDPLSP LF   ++ L R L      G   G  I  A        F DD  LF + RM    ++   L   SI+
Subjt:  NITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSII

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein2.0e-1527.46Show/hide
Query:  EIYWRQRAWVDWMKWGDRNSKWFHQRATQRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQ----QGSGRLHEFFRHIQQCISQEMNSHLT
        E ++RQ++ + W++ GD N+++FH+     +  N ++ L   D   +E+   +++++  Y+ +L  S+          R+ +   H  +C +  + S L+
Subjt:  EIYWRQRAWVDWMKWGDRNSKWFHQRATQRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQ----QGSGRLHEFFRHIQQCISQEMNSHLT

Query:  SNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISLCNVIYKLIS
        +  ++ E+ A++  M   KAPG DS  A F+   W VV              G  ++  N T I LIPK  G  ++S +RP+S C V+YK+I+
Subjt:  SNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISLCNVIYKLIS

AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.4e-1341.25Show/hide
Query:  IANRLKGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDE
        +  RLK ++ ++I P Q++F+PGR   DN +   E +H ++ R+KG  GW+ LKLD+ KAYDR+ W ++E  +   GF E
Subjt:  IANRLKGVLDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDE

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.4e-1347.06Show/hide
Query:  FMLNGQRKGNITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDD
        F++NG  +G +TP RGLR+GDPLSPYLF+ C E LS +    +  G + G R++     I+HL F DD
Subjt:  FMLNGQRKGNITPFRGLREGDPLSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTAACAAGGGAGACGTTCAAGGAGCAAGGGTTCAGATTGAGAATCTACTGGGGGAGGAGGAAATTTATTGGAGGCAAAGAGCATGGGTGGATTGGATGAAATGGGG
GGACCGTAATTCGAAATGGTTTCATCAGAGGGCGACCCAACGACGAAGATGTAATAGGATGGAGGGCCTATGGGATATTGATGGGCAGTGGTTGGAGGATGAGGAAGGGA
TGGAACAGATTGTATCTGGGTACTTCCAGAACTTGTTTGTCTCTAATGGGCAGCAAGGTAGTGGGAGGTTGCATGAATTTTTTAGGCATATTCAGCAGTGTATCAGTCAA
GAGATGAATTCCCATTTAACGAGTAACTATACTGAAGATGAGTTATTGGCCTCGTTAAAGCAGATGGGTCCTTTGAAGGCTCCGGGGGAGGACAGTCTCCCAGCTTTATT
CTACCAAAGGTTCTGGAAGGTTGTGGGTAAGGAGGTAACAAATGTGTGTCTGCATGTACTGAATCATGGTGGTTCTGTGGAGGCCCTCAATCAAACGGTGATTGTGTTGA
TTCCAAAATGTAGGGGTGCTCAGAGAGTTTCAGATTATAGGCCCATTAGCTTGTGTAACGTGATATACAAGCTTATCTCGAAGACAATTGCAAACAGGTTGAAGGGGGTC
TTGGACCATATTATCTCTCCCACCCAGAGTGCTTTTGTTCCAGGGAGGAATATTTGTGATAATGCAATGTTGGGGTTTGAGACTCTGCATTATTTAAAACAAAGGAGGAA
GGGGAAAGATGGTTGGGTCGCACTGAAGCTCGATATGAGCAAGGCCTATGATAGGGTGGAGTGGTTCTTCATTGAGAAGTTCATGACTGTTGTGGGGTTTGATGAGAAGG
TGGTGAAGCTCATAATGGGTTGTGTTTCGACAGTCTCCTATTCCTTTATGTTAAATGGGCAGAGGAAAGGTAATATTACACCTTTTAGGGGCCTTCGAGAAGGGGACCCC
CTTTCTCCATATTTGTTTTTATTTTGTGTTGAGGGTTTGTCGAGAATATTAACATGGTTGGAACGAGATGGGCTGATTTCTGGAACTCGTATTGCTCGGGCTTGTCACTC
AATTTCTCATTTGTTCTTTGGAGATGACTGTTTCTTATTCTTTAAAGCTAGGATGAGGGAAACAGGCATGATTGCAAACATGCTAGTAACATACTCTATCATTACAGGCC
AGAAGATAAAATTATGGGCCCTTCTGGCGAAACAAGGGTGGAGGTTAATTGATAATCCAGATTTCCTTTTGGGCAGAGTTCTTAAGGGTAGCATTCTGGGGATTCAAATT
GGAGTTTTTTGGGAAACACAGCAGGTTGGAGTTTTTTTGAGAAAGAAGAAAGAAAAAGAAGAGGAGGAGTCGTCGCGAGAAACGGAGGAGGAAGGCAACTCAGATCTTCA
CCTCCAACGAGTATCTTCTTCACCTCCGAGCAACATCTATCTCCTTCATTTCAGAAATGTAGATGTGAGGTTTTCGCTGTCGCTCGATTCAAACTTTGCGCCGTCAATTC
AGATATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTAACAAGGGAGACGTTCAAGGAGCAAGGGTTCAGATTGAGAATCTACTGGGGGAGGAGGAAATTTATTGGAGGCAAAGAGCATGGGTGGATTGGATGAAATGGGG
GGACCGTAATTCGAAATGGTTTCATCAGAGGGCGACCCAACGACGAAGATGTAATAGGATGGAGGGCCTATGGGATATTGATGGGCAGTGGTTGGAGGATGAGGAAGGGA
TGGAACAGATTGTATCTGGGTACTTCCAGAACTTGTTTGTCTCTAATGGGCAGCAAGGTAGTGGGAGGTTGCATGAATTTTTTAGGCATATTCAGCAGTGTATCAGTCAA
GAGATGAATTCCCATTTAACGAGTAACTATACTGAAGATGAGTTATTGGCCTCGTTAAAGCAGATGGGTCCTTTGAAGGCTCCGGGGGAGGACAGTCTCCCAGCTTTATT
CTACCAAAGGTTCTGGAAGGTTGTGGGTAAGGAGGTAACAAATGTGTGTCTGCATGTACTGAATCATGGTGGTTCTGTGGAGGCCCTCAATCAAACGGTGATTGTGTTGA
TTCCAAAATGTAGGGGTGCTCAGAGAGTTTCAGATTATAGGCCCATTAGCTTGTGTAACGTGATATACAAGCTTATCTCGAAGACAATTGCAAACAGGTTGAAGGGGGTC
TTGGACCATATTATCTCTCCCACCCAGAGTGCTTTTGTTCCAGGGAGGAATATTTGTGATAATGCAATGTTGGGGTTTGAGACTCTGCATTATTTAAAACAAAGGAGGAA
GGGGAAAGATGGTTGGGTCGCACTGAAGCTCGATATGAGCAAGGCCTATGATAGGGTGGAGTGGTTCTTCATTGAGAAGTTCATGACTGTTGTGGGGTTTGATGAGAAGG
TGGTGAAGCTCATAATGGGTTGTGTTTCGACAGTCTCCTATTCCTTTATGTTAAATGGGCAGAGGAAAGGTAATATTACACCTTTTAGGGGCCTTCGAGAAGGGGACCCC
CTTTCTCCATATTTGTTTTTATTTTGTGTTGAGGGTTTGTCGAGAATATTAACATGGTTGGAACGAGATGGGCTGATTTCTGGAACTCGTATTGCTCGGGCTTGTCACTC
AATTTCTCATTTGTTCTTTGGAGATGACTGTTTCTTATTCTTTAAAGCTAGGATGAGGGAAACAGGCATGATTGCAAACATGCTAGTAACATACTCTATCATTACAGGCC
AGAAGATAAAATTATGGGCCCTTCTGGCGAAACAAGGGTGGAGGTTAATTGATAATCCAGATTTCCTTTTGGGCAGAGTTCTTAAGGGTAGCATTCTGGGGATTCAAATT
GGAGTTTTTTGGGAAACACAGCAGGTTGGAGTTTTTTTGAGAAAGAAGAAAGAAAAAGAAGAGGAGGAGTCGTCGCGAGAAACGGAGGAGGAAGGCAACTCAGATCTTCA
CCTCCAACGAGTATCTTCTTCACCTCCGAGCAACATCTATCTCCTTCATTTCAGAAATGTAGATGTGAGGTTTTCGCTGTCGCTCGATTCAAACTTTGCGCCGTCAATTC
AGATATGA
Protein sequenceShow/hide protein sequence
MVNKGDVQGARVQIENLLGEEEIYWRQRAWVDWMKWGDRNSKWFHQRATQRRRCNRMEGLWDIDGQWLEDEEGMEQIVSGYFQNLFVSNGQQGSGRLHEFFRHIQQCISQ
EMNSHLTSNYTEDELLASLKQMGPLKAPGEDSLPALFYQRFWKVVGKEVTNVCLHVLNHGGSVEALNQTVIVLIPKCRGAQRVSDYRPISLCNVIYKLISKTIANRLKGV
LDHIISPTQSAFVPGRNICDNAMLGFETLHYLKQRRKGKDGWVALKLDMSKAYDRVEWFFIEKFMTVVGFDEKVVKLIMGCVSTVSYSFMLNGQRKGNITPFRGLREGDP
LSPYLFLFCVEGLSRILTWLERDGLISGTRIARACHSISHLFFGDDCFLFFKARMRETGMIANMLVTYSIITGQKIKLWALLAKQGWRLIDNPDFLLGRVLKGSILGIQI
GVFWETQQVGVFLRKKKEKEEEESSRETEEEGNSDLHLQRVSSSPPSNIYLLHFRNVDVRFSLSLDSNFAPSIQI