; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032884 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032884
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr11:38515420..38517606
RNA-Seq ExpressionLag0032884
SyntenyLag0032884
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039770.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.1e-17847.09Show/hide
Query:  KKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQIFDAILLANEI
        K +W+ LK  I+++F +F  + +IN+ VN + IALI KK      ++YRPISLTT +Y+LIAK +AER+K TLP T+AE+Q AFVKG QI DAIL+ANE 
Subjt:  KKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQIFDAILLANEI

Query:  VDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAE
        +D WRV    GF+IKLDIEKAFDK++W FI+ ML  KG+P  W  WI+ACISSV YSI++NG+PRGKIQ SRGIRQGDPISPFIFVLAMDY+SRL+ +  
Subjt:  VDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAE

Query:  HKGLIEGCSI-NEIFVSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPLGAKPSN
         K  I+G  +   I ++HLLFADDILLFV D++  ++N   II  F+ ASGL+IN +KS+IS INV   R   IAS+WG + + LPI+YLG PLG K   
Subjt:  HKGLIEGCSI-NEIFVSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPLGAKPSN

Query:  ESFWSPVVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRV
        ++FW  V EKI + L  W+YS +SKGG++TL++S+L S   Y LS+FKAP S C  I++  RNFLWK    + HK+ LV W K+T+    GGLG  + + 
Subjt:  ESFWSPVVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRV

Query:  SNAALLSKWIWRFFNEENSLWKCFINAKYISLEESTIPSSSRFSSSRAPWLSISKQIPFFERNSSWQLRDGKKIRFWKDSWSPFGQLSRVFPRLFALSTM
        +N ALL+KW+WR+ +E++ LWK  INAKY SL +  IP     SSSR+PW SI K + +F+R+ SW++++G+   FW   W     LS  +PRL+ALST 
Subjt:  SNAALLSKWIWRFFNEENSLWKCFINAKYISLEESTIPSSSRFSSSRAPWLSISKQIPFFERNSSWQLRDGKKIRFWKDSWSPFGQLSRVFPRLFALSTM

Query:  HNNLSVFEAWDLANSSWSFFPRRPLLDRELDSWTSFSSALPRPSSSDNKDLLRWDPNSHGLYTVASARRKLWEISHSSPPQQSHME---PGIMSNVWKAT
        +   S+ + W+     W   PRR L + E   W    ++L      +  D   W  NS+GLYTVAS ++ L +      P+Q+ ++        N+WK +
Subjt:  HNNLSVFEAWDLANSSWSFFPRRPLLDRELDSWTSFSSALPRPSSSDNKDLLRWDPNSHGLYTVASARRKLWEISHSSPPQQSHME---PGIMSNVWKAT

Query:  LPKKCRFFICTIFHRSLNTDDRLQRIFKQSSLLPSRCSMCCRNSECLDHLFVQCPIASGIW----TPFFQNIGISSPMPLQAISLCALIFWIKVNSQKAI
        +PKKC FFI T+ + S+NT ++L +        PS C MC RN E   HLF+ CPIA  IW    +    N+   SP  L  I++C+   W +   +  I
Subjt:  LPKKCRFFICTIFHRSLNTDDRLQRIFKQSSLLPSRCSMCCRNSECLDHLFVQCPIASGIW----TPFFQNIGISSPMPLQAISLCALIFWIKVNSQKAI

Query:  GFKLF
         F  +
Subjt:  GFKLF

KAA0056839.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.3e-17948.8Show/hide
Query:  KKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQIFDAILLANEI
        KK W  LK  +++VF +F + G++N NVN ++IALI KK    + S+YRPISLTT LY+L+AK+LA R+K  LP TIAE+Q AF+KG QI DAIL+ANE 
Subjt:  KKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQIFDAILLANEI

Query:  VDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAE
        +D W+     GF++KLD+EKAFDKISW FI+ ML  K FP+ W  WIKACIS+V YSILLNG P+G+I+A RGIRQGDP+SPFIFVLAMDY+SRL+   E
Subjt:  VDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAE

Query:  HKGLIEGCSINEIF-VSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPLGAKPSN
         KG I+G S N    +SHLLFADD+L+FV DN+ +L N  M +  FE+ASGL  N SKS+IS IN+S  R   IAS +G   + LP++YLG PLG  P +
Subjt:  HKGLIEGCSINEIF-VSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPLGAKPSN

Query:  ESFWSPVVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRV
         SFWS  +E I++ L+ W+YS ISKGGRLTLL+++L+S   Y LS FKAP S+   I++  R+FLW G+E   +   L+ WN  T+P   GGLG  K + 
Subjt:  ESFWSPVVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRV

Query:  SNAALLSKWIWRFFNEENSLWKCFINAKYISLEESTIPSSSRFSSSRAPWLSISKQIPFFERNSSWQLRDGKKIRFWKDSWSPFGQLSRVFPRLFALSTM
        +N ALL KW+WR+ NE NSLWK  I+AKY    +  IP   R SS+ +PW +I K   ++E   SW   DG  + FW   W     LS  FPRL+ALS M
Subjt:  SNAALLSKWIWRFFNEENSLWKCFINAKYISLEESTIPSSSRFSSSRAPWLSISKQIPFFERNSSWQLRDGKKIRFWKDSWSPFGQLSRVFPRLFALSTM

Query:  HNNLSVFEAWDLANSSWSFFPRRPLLDRELDSWTSFSSALPRPSSSDNKDLLRWDPNSHGLYTVASARRKLWEISHSSPPQQSHMEPGIMSNVWKATLPK
          + +V E WD  +  W+  PRRPL +RE  +W S   +LPR  ++       W+P+    YTVASA+   ++   SS P++++ E   + ++W++ +P+
Subjt:  HNNLSVFEAWDLANSSWSFFPRRPLLDRELDSWTSFSSALPRPSSSDNKDLLRWDPNSHGLYTVASARRKLWEISHSSPPQQSHMEPGIMSNVWKATLPK

Query:  KCRFFICTIFHRSLNTDDRLQRIFKQSSLLPSRCSMCCRNSECLDHLFVQCPIASGIWTPFFQNIG
        KC+FFI T+ H+ LNT D++Q+     SL PS C  C  ++E ++HLF+ CP A  +W  +    G
Subjt:  KCRFFICTIFHRSLNTDDRLQRIFKQSSLLPSRCSMCCRNSECLDHLFVQCPIASGIWTPFFQNIG

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.2e-17748.5Show/hide
Query:  KKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQIFDAILLANEI
        KK W  LK  +++VF +F + G++N NVN ++IALI KK    + S+YRPISLTT LY+++AK+LA R+K  LP TIAE+Q AF+KG QI DAIL+ANE 
Subjt:  KKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQIFDAILLANEI

Query:  VDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAE
        +D W+     GF++KLDIEKAFDKISW FI+ ML  K FP+ W  WIKACIS+V YSILLNG P+G+I+A RGIRQGDP+SPFIFVLAMDY+SRL+   E
Subjt:  VDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAE

Query:  HKGLIEGCSINEIF-VSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPLGAKPSN
         KG I+G S N    +SHLLFADD+L+FV DN+ +L N  M +  FE+ASGL  N SKS+IS IN+S  R   IAS +G   + LP++YLG PLG  P +
Subjt:  HKGLIEGCSINEIF-VSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPLGAKPSN

Query:  ESFWSPVVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRV
         SFW   +E I++ L+ W+YS ISKGGRLTLL+++L+S   Y LS FKAP S+   I++  R+FLW G+E   +   L+ WN  T+P   GGLG  K + 
Subjt:  ESFWSPVVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRV

Query:  SNAALLSKWIWRFFNEENSLWKCFINAKYISLEESTIPSSSRFSSSRAPWLSISKQIPFFERNSSWQLRDGKKIRFWKDSWSPFGQLSRVFPRLFALSTM
        +N ALL KW+WR+ NE NSLWK  I+AKY    +  IP   R SS+ +PW +I K   ++E   SW   DG  + FW   W     LS   PRL+ALS M
Subjt:  SNAALLSKWIWRFFNEENSLWKCFINAKYISLEESTIPSSSRFSSSRAPWLSISKQIPFFERNSSWQLRDGKKIRFWKDSWSPFGQLSRVFPRLFALSTM

Query:  HNNLSVFEAWDLANSSWSFFPRRPLLDRELDSWTSFSSALPRPSSSDNKDLLRWDPNSHGLYTVASARRKLWEISHSSPPQQSHMEPGIMSNVWKATLPK
          + +V E WD  +  W+  PRRPL +RE  +W S   +LPR  ++       W+P+    YTVASA+   ++   SS P++++ E   + ++W++ +P+
Subjt:  HNNLSVFEAWDLANSSWSFFPRRPLLDRELDSWTSFSSALPRPSSSDNKDLLRWDPNSHGLYTVASARRKLWEISHSSPPQQSHMEPGIMSNVWKATLPK

Query:  KCRFFICTIFHRSLNTDDRLQRIFKQSSLLPSRCSMCCRNSECLDHLFVQCPIASGIWTPFFQNIG
        KC+FFI T+ H+ LNT D +Q+     SL PS C  C  ++E ++HLF+ CP A  +W  +    G
Subjt:  KCRFFICTIFHRSLNTDDRLQRIFKQSSLLPSRCSMCCRNSECLDHLFVQCPIASGIWTPFFQNIG

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]4.1e-17848.5Show/hide
Query:  KKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQIFDAILLANEI
        KK W  LK  +++VF +F + G++N NVN ++IALI KK    + S+YRPISLTT LY+++AK+LA R+K  LP TIAE+Q AF+KG QI DAIL+ANE+
Subjt:  KKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQIFDAILLANEI

Query:  VDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAE
        +D W+     GF++KLDIEKAFDKISW FI+ ML  K FP+ W  WIKACIS+V YSILLNG P+G+I+A RGIRQGDP+SPFIFVLAMDY+SRL+   E
Subjt:  VDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAE

Query:  HKGLIEGCSINEIF-VSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPLGAKPSN
         KG I+G S N    +SHLLFADD+L+FV DN+ +L N  M +  FE+ASGL  N SKS+IS IN+S  R   IAS +G   + LP++YLG PLG  P +
Subjt:  HKGLIEGCSINEIF-VSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPLGAKPSN

Query:  ESFWSPVVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRV
         SFW   +E I++ L+ W+YS ISKGGRLTLL+++L+S   Y LS FKAP S+   I++  R+FLW G+E   +   L+ WN  T+P   GGLG  K + 
Subjt:  ESFWSPVVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRV

Query:  SNAALLSKWIWRFFNEENSLWKCFINAKYISLEESTIPSSSRFSSSRAPWLSISKQIPFFERNSSWQLRDGKKIRFWKDSWSPFGQLSRVFPRLFALSTM
        +N ALL KW+WR+ NE NSLWK  I+AKY    +  IP   R SS+ +PW +I K   ++E   SW   DG  + FW   W     LS   PRL+ALS M
Subjt:  SNAALLSKWIWRFFNEENSLWKCFINAKYISLEESTIPSSSRFSSSRAPWLSISKQIPFFERNSSWQLRDGKKIRFWKDSWSPFGQLSRVFPRLFALSTM

Query:  HNNLSVFEAWDLANSSWSFFPRRPLLDRELDSWTSFSSALPRPSSSDNKDLLRWDPNSHGLYTVASARRKLWEISHSSPPQQSHMEPGIMSNVWKATLPK
          + +V E WD  +  W+  PRRPL +RE  +W S   +LPR  ++       W+P+    YTVASA+   ++   SS P++++ E   + ++W++ +P+
Subjt:  HNNLSVFEAWDLANSSWSFFPRRPLLDRELDSWTSFSSALPRPSSSDNKDLLRWDPNSHGLYTVASARRKLWEISHSSPPQQSHMEPGIMSNVWKATLPK

Query:  KCRFFICTIFHRSLNTDDRLQRIFKQSSLLPSRCSMCCRNSECLDHLFVQCPIASGIWTPFFQNIG
        KC+FFI T+ H+ LNT D +Q+     SL PS C  C  ++E ++HLF+ CP A  +W  +    G
Subjt:  KCRFFICTIFHRSLNTDDRLQRIFKQSSLLPSRCSMCCRNSECLDHLFVQCPIASGIWTPFFQNIG

XP_016902461.1 PREDICTED: LINE-1 retrotransposable element ORF2 protein [Cucumis melo]3.1e-17846.95Show/hide
Query:  KKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQIFDAILLANEI
        K +W+ LK  I+++F +F  + +IN+ VN + IALI KK      ++YRPISLTT +Y+LIAK +AER+K TLP T+AE+Q AFVKG QI DAIL+ANE 
Subjt:  KKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQIFDAILLANEI

Query:  VDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAE
        +D WRV    GF+IKLDIEKAFDK++W FI+ ML  KG+P  W  WI+ACISSV YSI++NG+PRGKIQ SRGIRQGDPISPFIFVLAMDY+SRL+ +  
Subjt:  VDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAE

Query:  HKGLIEGCSI-NEIFVSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPLGAKPSN
         K  I+G  +   I ++HLLFADDILLFV D++  ++N   II  F+ ASGL+IN +KS+IS INV   R   IAS+WG + + LPI+YLG PLG K   
Subjt:  HKGLIEGCSI-NEIFVSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPLGAKPSN

Query:  ESFWSPVVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRV
        ++FW  V EKI + L  W+YS +SKGG++TL++S+L S   Y LS+FK P S C  I++  RNFLWK    + HK+ LV W K+T+    GGLG  + + 
Subjt:  ESFWSPVVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRV

Query:  SNAALLSKWIWRFFNEENSLWKCFINAKYISLEESTIPSSSRFSSSRAPWLSISKQIPFFERNSSWQLRDGKKIRFWKDSWSPFGQLSRVFPRLFALSTM
        +N ALL+KW+WR+ +E++ LWK  INAKY SL +  IP     SSSR+PW SI K + +F+R+ SW++++G+   FW   W     LS  +PRL+ALST 
Subjt:  SNAALLSKWIWRFFNEENSLWKCFINAKYISLEESTIPSSSRFSSSRAPWLSISKQIPFFERNSSWQLRDGKKIRFWKDSWSPFGQLSRVFPRLFALSTM

Query:  HNNLSVFEAWDLANSSWSFFPRRPLLDRELDSWTSFSSALPRPSSSDNKDLLRWDPNSHGLYTVASARRKLWEISHSSPPQQSHME---PGIMSNVWKAT
        +   S+ + W+     W   PRR L + E   W    ++L      +  D   W  NS+GLYTVAS ++ L +      P+Q+ ++        N+WK +
Subjt:  HNNLSVFEAWDLANSSWSFFPRRPLLDRELDSWTSFSSALPRPSSSDNKDLLRWDPNSHGLYTVASARRKLWEISHSSPPQQSHME---PGIMSNVWKAT

Query:  LPKKCRFFICTIFHRSLNTDDRLQRIFKQSSLLPSRCSMCCRNSECLDHLFVQCPIASGIW----TPFFQNIGISSPMPLQAISLCALIFWIKVNSQKAI
        +PKKC FFI T+ + S+NT ++L +        PS C MC RN E   HLF+ CPIA  IW    +    N+   SP  L  I++C+   W +   +  I
Subjt:  LPKKCRFFICTIFHRSLNTDDRLQRIFKQSSLLPSRCSMCCRNSECLDHLFVQCPIASGIW----TPFFQNIGISSPMPLQAISLCALIFWIKVNSQKAI

Query:  GFKLF
         F  +
Subjt:  GFKLF

TrEMBL top hitse value%identityAlignment
A0A1S4E2K5 LINE-1 retrotransposable element ORF2 protein1.5e-17846.95Show/hide
Query:  KKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQIFDAILLANEI
        K +W+ LK  I+++F +F  + +IN+ VN + IALI KK      ++YRPISLTT +Y+LIAK +AER+K TLP T+AE+Q AFVKG QI DAIL+ANE 
Subjt:  KKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQIFDAILLANEI

Query:  VDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAE
        +D WRV    GF+IKLDIEKAFDK++W FI+ ML  KG+P  W  WI+ACISSV YSI++NG+PRGKIQ SRGIRQGDPISPFIFVLAMDY+SRL+ +  
Subjt:  VDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAE

Query:  HKGLIEGCSI-NEIFVSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPLGAKPSN
         K  I+G  +   I ++HLLFADDILLFV D++  ++N   II  F+ ASGL+IN +KS+IS INV   R   IAS+WG + + LPI+YLG PLG K   
Subjt:  HKGLIEGCSI-NEIFVSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPLGAKPSN

Query:  ESFWSPVVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRV
        ++FW  V EKI + L  W+YS +SKGG++TL++S+L S   Y LS+FK P S C  I++  RNFLWK    + HK+ LV W K+T+    GGLG  + + 
Subjt:  ESFWSPVVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRV

Query:  SNAALLSKWIWRFFNEENSLWKCFINAKYISLEESTIPSSSRFSSSRAPWLSISKQIPFFERNSSWQLRDGKKIRFWKDSWSPFGQLSRVFPRLFALSTM
        +N ALL+KW+WR+ +E++ LWK  INAKY SL +  IP     SSSR+PW SI K + +F+R+ SW++++G+   FW   W     LS  +PRL+ALST 
Subjt:  SNAALLSKWIWRFFNEENSLWKCFINAKYISLEESTIPSSSRFSSSRAPWLSISKQIPFFERNSSWQLRDGKKIRFWKDSWSPFGQLSRVFPRLFALSTM

Query:  HNNLSVFEAWDLANSSWSFFPRRPLLDRELDSWTSFSSALPRPSSSDNKDLLRWDPNSHGLYTVASARRKLWEISHSSPPQQSHME---PGIMSNVWKAT
        +   S+ + W+     W   PRR L + E   W    ++L      +  D   W  NS+GLYTVAS ++ L +      P+Q+ ++        N+WK +
Subjt:  HNNLSVFEAWDLANSSWSFFPRRPLLDRELDSWTSFSSALPRPSSSDNKDLLRWDPNSHGLYTVASARRKLWEISHSSPPQQSHME---PGIMSNVWKAT

Query:  LPKKCRFFICTIFHRSLNTDDRLQRIFKQSSLLPSRCSMCCRNSECLDHLFVQCPIASGIW----TPFFQNIGISSPMPLQAISLCALIFWIKVNSQKAI
        +PKKC FFI T+ + S+NT ++L +        PS C MC RN E   HLF+ CPIA  IW    +    N+   SP  L  I++C+   W +   +  I
Subjt:  LPKKCRFFICTIFHRSLNTDDRLQRIFKQSSLLPSRCSMCCRNSECLDHLFVQCPIASGIW----TPFFQNIGISSPMPLQAISLCALIFWIKVNSQKAI

Query:  GFKLF
         F  +
Subjt:  GFKLF

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein5.8e-17848.5Show/hide
Query:  KKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQIFDAILLANEI
        KK W  LK  +++VF +F + G++N NVN ++IALI KK    + S+YRPISLTT LY+++AK+LA R+K  LP TIAE+Q AF+KG QI DAIL+ANE 
Subjt:  KKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQIFDAILLANEI

Query:  VDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAE
        +D W+     GF++KLDIEKAFDKISW FI+ ML  K FP+ W  WIKACIS+V YSILLNG P+G+I+A RGIRQGDP+SPFIFVLAMDY+SRL+   E
Subjt:  VDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAE

Query:  HKGLIEGCSINEIF-VSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPLGAKPSN
         KG I+G S N    +SHLLFADD+L+FV DN+ +L N  M +  FE+ASGL  N SKS+IS IN+S  R   IAS +G   + LP++YLG PLG  P +
Subjt:  HKGLIEGCSINEIF-VSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPLGAKPSN

Query:  ESFWSPVVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRV
         SFW   +E I++ L+ W+YS ISKGGRLTLL+++L+S   Y LS FKAP S+   I++  R+FLW G+E   +   L+ WN  T+P   GGLG  K + 
Subjt:  ESFWSPVVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRV

Query:  SNAALLSKWIWRFFNEENSLWKCFINAKYISLEESTIPSSSRFSSSRAPWLSISKQIPFFERNSSWQLRDGKKIRFWKDSWSPFGQLSRVFPRLFALSTM
        +N ALL KW+WR+ NE NSLWK  I+AKY    +  IP   R SS+ +PW +I K   ++E   SW   DG  + FW   W     LS   PRL+ALS M
Subjt:  SNAALLSKWIWRFFNEENSLWKCFINAKYISLEESTIPSSSRFSSSRAPWLSISKQIPFFERNSSWQLRDGKKIRFWKDSWSPFGQLSRVFPRLFALSTM

Query:  HNNLSVFEAWDLANSSWSFFPRRPLLDRELDSWTSFSSALPRPSSSDNKDLLRWDPNSHGLYTVASARRKLWEISHSSPPQQSHMEPGIMSNVWKATLPK
          + +V E WD  +  W+  PRRPL +RE  +W S   +LPR  ++       W+P+    YTVASA+   ++   SS P++++ E   + ++W++ +P+
Subjt:  HNNLSVFEAWDLANSSWSFFPRRPLLDRELDSWTSFSSALPRPSSSDNKDLLRWDPNSHGLYTVASARRKLWEISHSSPPQQSHMEPGIMSNVWKATLPK

Query:  KCRFFICTIFHRSLNTDDRLQRIFKQSSLLPSRCSMCCRNSECLDHLFVQCPIASGIWTPFFQNIG
        KC+FFI T+ H+ LNT D +Q+     SL PS C  C  ++E ++HLF+ CP A  +W  +    G
Subjt:  KCRFFICTIFHRSLNTDDRLQRIFKQSSLLPSRCSMCCRNSECLDHLFVQCPIASGIWTPFFQNIG

A0A5A7UTI6 LINE-1 retrotransposable element ORF2 protein6.2e-18048.8Show/hide
Query:  KKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQIFDAILLANEI
        KK W  LK  +++VF +F + G++N NVN ++IALI KK    + S+YRPISLTT LY+L+AK+LA R+K  LP TIAE+Q AF+KG QI DAIL+ANE 
Subjt:  KKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQIFDAILLANEI

Query:  VDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAE
        +D W+     GF++KLD+EKAFDKISW FI+ ML  K FP+ W  WIKACIS+V YSILLNG P+G+I+A RGIRQGDP+SPFIFVLAMDY+SRL+   E
Subjt:  VDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAE

Query:  HKGLIEGCSINEIF-VSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPLGAKPSN
         KG I+G S N    +SHLLFADD+L+FV DN+ +L N  M +  FE+ASGL  N SKS+IS IN+S  R   IAS +G   + LP++YLG PLG  P +
Subjt:  HKGLIEGCSINEIF-VSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPLGAKPSN

Query:  ESFWSPVVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRV
         SFWS  +E I++ L+ W+YS ISKGGRLTLL+++L+S   Y LS FKAP S+   I++  R+FLW G+E   +   L+ WN  T+P   GGLG  K + 
Subjt:  ESFWSPVVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRV

Query:  SNAALLSKWIWRFFNEENSLWKCFINAKYISLEESTIPSSSRFSSSRAPWLSISKQIPFFERNSSWQLRDGKKIRFWKDSWSPFGQLSRVFPRLFALSTM
        +N ALL KW+WR+ NE NSLWK  I+AKY    +  IP   R SS+ +PW +I K   ++E   SW   DG  + FW   W     LS  FPRL+ALS M
Subjt:  SNAALLSKWIWRFFNEENSLWKCFINAKYISLEESTIPSSSRFSSSRAPWLSISKQIPFFERNSSWQLRDGKKIRFWKDSWSPFGQLSRVFPRLFALSTM

Query:  HNNLSVFEAWDLANSSWSFFPRRPLLDRELDSWTSFSSALPRPSSSDNKDLLRWDPNSHGLYTVASARRKLWEISHSSPPQQSHMEPGIMSNVWKATLPK
          + +V E WD  +  W+  PRRPL +RE  +W S   +LPR  ++       W+P+    YTVASA+   ++   SS P++++ E   + ++W++ +P+
Subjt:  HNNLSVFEAWDLANSSWSFFPRRPLLDRELDSWTSFSSALPRPSSSDNKDLLRWDPNSHGLYTVASARRKLWEISHSSPPQQSHMEPGIMSNVWKATLPK

Query:  KCRFFICTIFHRSLNTDDRLQRIFKQSSLLPSRCSMCCRNSECLDHLFVQCPIASGIWTPFFQNIG
        KC+FFI T+ H+ LNT D++Q+     SL PS C  C  ++E ++HLF+ CP A  +W  +    G
Subjt:  KCRFFICTIFHRSLNTDDRLQRIFKQSSLLPSRCSMCCRNSECLDHLFVQCPIASGIWTPFFQNIG

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein2.0e-17848.5Show/hide
Query:  KKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQIFDAILLANEI
        KK W  LK  +++VF +F + G++N NVN ++IALI KK    + S+YRPISLTT LY+++AK+LA R+K  LP TIAE+Q AF+KG QI DAIL+ANE+
Subjt:  KKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQIFDAILLANEI

Query:  VDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAE
        +D W+     GF++KLDIEKAFDKISW FI+ ML  K FP+ W  WIKACIS+V YSILLNG P+G+I+A RGIRQGDP+SPFIFVLAMDY+SRL+   E
Subjt:  VDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAE

Query:  HKGLIEGCSINEIF-VSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPLGAKPSN
         KG I+G S N    +SHLLFADD+L+FV DN+ +L N  M +  FE+ASGL  N SKS+IS IN+S  R   IAS +G   + LP++YLG PLG  P +
Subjt:  HKGLIEGCSINEIF-VSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPLGAKPSN

Query:  ESFWSPVVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRV
         SFW   +E I++ L+ W+YS ISKGGRLTLL+++L+S   Y LS FKAP S+   I++  R+FLW G+E   +   L+ WN  T+P   GGLG  K + 
Subjt:  ESFWSPVVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRV

Query:  SNAALLSKWIWRFFNEENSLWKCFINAKYISLEESTIPSSSRFSSSRAPWLSISKQIPFFERNSSWQLRDGKKIRFWKDSWSPFGQLSRVFPRLFALSTM
        +N ALL KW+WR+ NE NSLWK  I+AKY    +  IP   R SS+ +PW +I K   ++E   SW   DG  + FW   W     LS   PRL+ALS M
Subjt:  SNAALLSKWIWRFFNEENSLWKCFINAKYISLEESTIPSSSRFSSSRAPWLSISKQIPFFERNSSWQLRDGKKIRFWKDSWSPFGQLSRVFPRLFALSTM

Query:  HNNLSVFEAWDLANSSWSFFPRRPLLDRELDSWTSFSSALPRPSSSDNKDLLRWDPNSHGLYTVASARRKLWEISHSSPPQQSHMEPGIMSNVWKATLPK
          + +V E WD  +  W+  PRRPL +RE  +W S   +LPR  ++       W+P+    YTVASA+   ++   SS P++++ E   + ++W++ +P+
Subjt:  HNNLSVFEAWDLANSSWSFFPRRPLLDRELDSWTSFSSALPRPSSSDNKDLLRWDPNSHGLYTVASARRKLWEISHSSPPQQSHMEPGIMSNVWKATLPK

Query:  KCRFFICTIFHRSLNTDDRLQRIFKQSSLLPSRCSMCCRNSECLDHLFVQCPIASGIWTPFFQNIG
        KC+FFI T+ H+ LNT D +Q+     SL PS C  C  ++E ++HLF+ CP A  +W  +    G
Subjt:  KCRFFICTIFHRSLNTDDRLQRIFKQSSLLPSRCSMCCRNSECLDHLFVQCPIASGIWTPFFQNIG

A0A5D3DM72 LINE-1 retrotransposable element ORF2 protein5.2e-17947.09Show/hide
Query:  KKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQIFDAILLANEI
        K +W+ LK  I+++F +F  + +IN+ VN + IALI KK      ++YRPISLTT +Y+LIAK +AER+K TLP T+AE+Q AFVKG QI DAIL+ANE 
Subjt:  KKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQIFDAILLANEI

Query:  VDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAE
        +D WRV    GF+IKLDIEKAFDK++W FI+ ML  KG+P  W  WI+ACISSV YSI++NG+PRGKIQ SRGIRQGDPISPFIFVLAMDY+SRL+ +  
Subjt:  VDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAE

Query:  HKGLIEGCSI-NEIFVSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPLGAKPSN
         K  I+G  +   I ++HLLFADDILLFV D++  ++N   II  F+ ASGL+IN +KS+IS INV   R   IAS+WG + + LPI+YLG PLG K   
Subjt:  HKGLIEGCSI-NEIFVSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPLGAKPSN

Query:  ESFWSPVVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRV
        ++FW  V EKI + L  W+YS +SKGG++TL++S+L S   Y LS+FKAP S C  I++  RNFLWK    + HK+ LV W K+T+    GGLG  + + 
Subjt:  ESFWSPVVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRV

Query:  SNAALLSKWIWRFFNEENSLWKCFINAKYISLEESTIPSSSRFSSSRAPWLSISKQIPFFERNSSWQLRDGKKIRFWKDSWSPFGQLSRVFPRLFALSTM
        +N ALL+KW+WR+ +E++ LWK  INAKY SL +  IP     SSSR+PW SI K + +F+R+ SW++++G+   FW   W     LS  +PRL+ALST 
Subjt:  SNAALLSKWIWRFFNEENSLWKCFINAKYISLEESTIPSSSRFSSSRAPWLSISKQIPFFERNSSWQLRDGKKIRFWKDSWSPFGQLSRVFPRLFALSTM

Query:  HNNLSVFEAWDLANSSWSFFPRRPLLDRELDSWTSFSSALPRPSSSDNKDLLRWDPNSHGLYTVASARRKLWEISHSSPPQQSHME---PGIMSNVWKAT
        +   S+ + W+     W   PRR L + E   W    ++L      +  D   W  NS+GLYTVAS ++ L +      P+Q+ ++        N+WK +
Subjt:  HNNLSVFEAWDLANSSWSFFPRRPLLDRELDSWTSFSSALPRPSSSDNKDLLRWDPNSHGLYTVASARRKLWEISHSSPPQQSHME---PGIMSNVWKAT

Query:  LPKKCRFFICTIFHRSLNTDDRLQRIFKQSSLLPSRCSMCCRNSECLDHLFVQCPIASGIW----TPFFQNIGISSPMPLQAISLCALIFWIKVNSQKAI
        +PKKC FFI T+ + S+NT ++L +        PS C MC RN E   HLF+ CPIA  IW    +    N+   SP  L  I++C+   W +   +  I
Subjt:  LPKKCRFFICTIFHRSLNTDDRLQRIFKQSSLLPSRCSMCCRNSECLDHLFVQCPIASGIW----TPFFQNIGISSPMPLQAISLCALIFWIKVNSQKAI

Query:  GFKLF
         F  +
Subjt:  GFKLF

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein9.9e-3426.44Show/hide
Query:  IMSVFHEFWEHGVINRNVNESYIALIPKKA-NSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQ-IFDAILLANEIVDLWRVSH
        ++ +F    + G++  +  E+ I LIPK   ++ +   +RPISL  +  +++ K LA RI+  +   I   Q  F+ G+Q  F+     N I  + R   
Subjt:  IMSVFHEFWEHGVINRNVNESYIALIPKKA-NSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQ-IFDAILLANEIVDLWRVSH

Query:  TSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAEHKGLIEGC
         +  II +D EKAFDKI   F+   L   G   ++   I+A     + +I+LNG+         G RQG P+SP +F + ++ ++R I+  +    I+G 
Subjt:  TSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAEHKGLIEGC

Query:  SINEIFVSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPL--GAKPSNESFWSPV
         + +  V   LFADD+++++ +  V  +N   +I  F + SG  IN  KS     N +    S I      T  +  I YLG  L    K   +  + P+
Subjt:  SINEIFVSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPL--GAKPSNESFWSPV

Query:  VEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSV--FKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRVSNAAL
        +++I    +KW+    S  GR+ +++  +   +IY  +    K P +  T +++    F+W    +   K  L   NK           +YK  V+  A 
Subjt:  VEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSV--FKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRVSNAAL

Query:  LSKWIWRFFNEENSLW
           W W + N +   W
Subjt:  LSKWIWRFFNEENSLW

P08548 LINE-1 reverse transcriptase homolog4.8e-2824.94Show/hide
Query:  IMSVFHEFWEHGVINRNVNESYIALIPKKA-NSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQ-IFDAILLANEIVDLWRVSH
        ++++F    + G++     E+ I LIPK   +  R   YRPISL  +  +++ K L  RI+  +   I   Q  F+ G Q  F+     N I  + ++ +
Subjt:  IMSVFHEFWEHGVINRNVNESYIALIPKKA-NSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQ-IFDAILLANEIVDLWRVSH

Query:  TSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAEHKGLIEGC
            I+ +D EKAFD I   F+   L+  G    +   I+A  S  + +I+LNG          G RQG P+SP +F + M+ ++  I   E K  I+G 
Subjt:  TSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAEHKGLIEGC

Query:  SINEIFVSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPL--GAKPSNESFWSPV
         I    +   LFADD+++++ +          +IK +   SG  IN  KS       +      +      T     + YLG  L    K   +  +  +
Subjt:  SINEIFVSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPL--GAKPSNESFWSPV

Query:  VEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSV--FKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRVSNAAL
         ++I   ++KW+    S  GR+ +++ ++    IY  +    KAP S    +++I+ +F+W      + K P +    ++    +GG+     R+   ++
Subjt:  VEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSV--FKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRVSNAAL

Query:  LSKWIWRFF-NEENSLW
        + K  W +  N E  +W
Subjt:  LSKWIWRFF-NEENSLW

P0C2F6 Putative ribonuclease H protein At1g657501.1e-2926.46Show/hide
Query:  PLGAKPSNESFWSPVVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGG
        P+  K  N+  +  ++E++   +  W+   +S  GRLTL ++ L+S  ++ +S    PQSI  R+D++ R FLW G+ +   K  LV W+KV +P   GG
Subjt:  PLGAKPSNESFWSPVVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGG

Query:  LGFYKTRVSNAALLSKWIWRFFNEENSLWKCFINAKY----ISLEESTIPSSSRFSSSRAPWLSISKQIPFFERNSSWQLRDGKKIRFWKDSW---SPFG
        LG    +  N AL+SK  WR   E+NSLW   +  KY    I      IP  S  S+ R+  + +   +        W   DG++IRFW D W    P  
Subjt:  LGFYKTRVSNAALLSKWIWRFFNEENSLWKCFINAKY----ISLEESTIPSSSRFSSSRAPWLSISKQIPFFERNSSWQLRDGKKIRFWKDSW---SPFG

Query:  QLSRVFPRLFALSTMHNNLSVFEAWDLANSSWSFFPRRPLLDRELDSWTSFSSALPRPS-----SSDNKDLLRWDPNSHGLYTVASARRKLWEISHSSPP
        +L           T  + +   + W +    W F         ++D +T+ ++ L   +      +  +D L W  +  G ++V SA   L       P 
Subjt:  QLSRVFPRLFALSTMHNNLSVFEAWDLANSSWSFFPRRPLLDRELDSWTSFSSALPRPS-----SSDNKDLLRWDPNSHGLYTVASARRKLWEISHSSPP

Query:  QQSHMEPGIMSNVWKATLPKKCRFFICTIFHRSLNTDDRLQRIFKQSSLLPSRCSMCCRNSECLDHLFVQCPIASGIW
          S       + +WK  +P++ + F+  + ++++ T++   R    +S   + C +C    E + H+   CP   GIW
Subjt:  QQSHMEPGIMSNVWKATLPKKCRFFICTIFHRSLNTDDRLQRIFKQSSLLPSRCSMCCRNSECLDHLFVQCPIASGIW

P11369 LINE-1 retrotransposable element ORF2 protein8.7e-3025.67Show/hide
Query:  PIM-SVFHEFWEHGVINRNVNESYIALIPK-KANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQ-IFDAILLANEIVDLWRV
        PI+  +FH+    G +  +  E+ I LIPK + +  +I  +RPISL  +  +++ K LA RI+  +   I   Q  F+ G+Q  F+     N I  + ++
Subjt:  PIM-SVFHEFWEHGVINRNVNESYIALIPK-KANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQ-IFDAILLANEIVDLWRV

Query:  SHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAEHKGLIE
           +  II LD EKAFDKI   F+  +L   G    +   IKA  S    +I +NG+    I    G RQG P+SP++F + ++ ++R I+  +    I+
Subjt:  SHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAEHKGLIE

Query:  GCSINEIFVSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPL--GAKPSNESFWS
        G  I +  V   L ADD+++++ D          +I +F +  G  IN +KS       ++     I      +  T  I YLG  L    K   +  + 
Subjt:  GCSINEIFVSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPL--GAKPSNESFWS

Query:  PVVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSV--FKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRVSNA
         + ++I   L +W+    S  GR+ +++  +    IY  +    K P      ++  +  F+W      ++K P +  + +     SGG+     ++   
Subjt:  PVVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSV--FKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRVSNA

Query:  ALLSKWIWRFFNE
        A++ K  W ++ +
Subjt:  ALLSKWIWRFFNE

P14381 Transposon TX1 uncharacterized 149 kDa protein4.8e-2825.69Show/hide
Query:  WTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQIFDAILLANEIVDL
        W TL      V  E ++ G +  +   + ++L+PKK +   I  +RP+SL +  Y+++AK+++ R+K  L   I   Q   V G  IFD + L  +++  
Subjt:  WTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQIFDAILLANEIVDL

Query:  WRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAEHKG
         R +  S   + LD EKAFD++   ++   L+   F   + G++K   +S    + +N      +   RG+RQG P+S  ++ LA++    L++      
Subjt:  WRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAEHKG

Query:  LIEGCSINEIFVSHLL--FADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRW-GCTAQTLPISYLGTPLGAK--PS
         + G  + E  +  +L  +ADD++L  +D  V LE      + +  AS   IN+SKS  SG+     +V  +   +   + ++  I YLG  L A+  P 
Subjt:  LIEGCSINEIFVSHLL--FADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRW-GCTAQTLPISYLGTPLGAK--PS

Query:  NESFWSPVVEKIYRHLDKWQ--YSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYK
        +++F   + E +   L KW+     +S  GR  ++   + S + Y L      Q    +I R L +FLW G          V     + P+  GG G   
Subjt:  NESFWSPVVEKIYRHLDKWQ--YSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYK

Query:  TRVSNAALLSKWIWRFFNEENSLWKCFINAKY
         R        + I R+   + S   C + + +
Subjt:  TRVSNAALLSKWIWRFFNEENSLWKCFINAKY

Arabidopsis top hitse value%identityAlignment
AT3G25720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein5.3e-0632.73Show/hide
Query:  VTTPIASGGLGFYKTRVSNAALLSKWIWRFFNEENSLWKCFINAKYISLEESTIPSSSRFSSSRA------PWLSISKQIPFFERNSSWQLRDGKKIRFW
        V  P A GGLG       N  L  K +WR F+   SLW  +   +Y  L+     S S+F +S+        W  + +  P  ER     + +G   RFW
Subjt:  VTTPIASGGLGFYKTRVSNAALLSKWIWRFFNEENSLWKCFINAKYISLEESTIPSSSRFSSSRA------PWLSISKQIPFFERNSSWQLRDGKKIRFW

Query:  KDSWSPFGQL
         D+W+PFG L
Subjt:  KDSWSPFGQL

AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.6e-0735.8Show/hide
Query:  LAERIKCTLPCTIAESQFAFVKGLQIFDAILLANEIV-DLWRVSHTSGF-IIKLDIEKAFDKISWDFIESMLRFKGFPNIW
        + ER+K  +   I  +Q +F+ G    D I+   E V  + R     G+ ++KLD+EKA+D+I WD++E  L   GFP +W
Subjt:  LAERIKCTLPCTIAESQFAFVKGLQIFDAILLANEIV-DLWRVSHTSGF-IIKLDIEKAFDKISWDFIESMLRFKGFPNIW

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.4e-0625.35Show/hide
Query:  IYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKV-TTPIASGGLGFYKTRVSNAALLSKWIWRFFNEENSLWKCFINAKYISLEESTIPS
        +Y +S F+  + +C ++   +  F W   E+   KI  V W K+  +    GGLGF      N ALL+K  +R  ++ ++L    + ++Y     S++  
Subjt:  IYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKV-TTPIASGGLGFYKTRVSNAALLSKWIWRFFNEENSLWKCFINAKYISLEESTIPS

Query:  SSRFSSSRAPWLSISKQIPFFERNSSWQLRDGKKIRFWKDSW
         S  +     W SI        R     + DG   + W D W
Subjt:  SSRFSSSRAPWLSISKQIPFFERNSSWQLRDGKKIRFWKDSW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.4e-1143.28Show/hide
Query:  LLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAEHKGLIEGCSI--NEIFVSHLLFADD
        ++NG P+G +  SRG+RQGDP+SP++F+L  + +S L + A+ +G + G  +  N   ++HLLFADD
Subjt:  LLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAEHKGLIEGCSI--NEIFVSHLLFADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTACGGAGGAGTTCTTTAAAAAAATCATGGACCACTCTAAAATCACCCATCATGTCTGTATTCCATGAGTTTTGGGAGCATGGAGTGATCAACCGAAATGTCAA
TGAATCATACATTGCTTTGATTCCTAAAAAGGCAAACTCCTTGAGAATTTCTGAATATCGCCCGATTAGCTTAACAACGGTCCTTTATCGGTTGATAGCAAAATCCCTCG
CTGAAAGGATAAAATGCACTCTCCCTTGCACCATTGCTGAAAGTCAATTTGCTTTTGTCAAAGGCCTCCAGATTTTTGACGCCATTTTATTAGCTAACGAGATTGTAGAT
CTTTGGAGAGTCTCGCACACAAGTGGCTTTATCATTAAGCTTGATATTGAAAAGGCCTTTGACAAGATTAGTTGGGACTTCATAGAAAGTATGCTTCGTTTTAAAGGTTT
CCCAAACATTTGGTGTGGCTGGATAAAAGCTTGCATCTCTTCAGTTTCATATTCCATATTGCTCAATGGCAAGCCGAGGGGTAAAATTCAGGCTTCCAGAGGAATTCGCC
AAGGAGATCCTATTTCTCCTTTCATTTTTGTCCTTGCCATGGATTACATTAGTAGACTCATCCAAACAGCCGAGCATAAGGGCCTCATTGAGGGATGTTCGATTAATGAA
ATATTCGTCTCTCATCTTCTTTTTGCAGACGATATTCTTCTATTCGTTAGAGATAACGATGTTTTCTTGGAGAACTACTTTATGATCATTAAGGCCTTTGAGCAAGCCTC
GGGTCTAAACATCAATTTTTCCAAATCTTCCATCTCCGGTATAAATGTTTCAGAGGATAGAGTTTCATTGATTGCCTCTAGATGGGGTTGTACTGCTCAAACTCTTCCAA
TCTCCTATTTGGGTACTCCTTTAGGAGCTAAACCCTCCAACGAATCGTTCTGGAGTCCGGTTGTGGAGAAAATCTACAGACATCTTGATAAATGGCAATATTCATACATT
TCAAAAGGAGGCCGCTTAACCCTCTTGCAGTCCACTCTGAATAGTTCTCTTATCTACCCCTTGTCGGTTTTTAAAGCGCCGCAGTCTATTTGTACCCGCATTGACCGAAT
CTTACGAAACTTTCTTTGGAAGGGGACCGAAAGTTCTGATCACAAGATCCCTCTGGTGGGCTGGAATAAAGTGACGACTCCTATTGCTTCTGGTGGCCTGGGTTTCTACA
AAACAAGGGTTTCAAATGCCGCGCTTCTCTCCAAGTGGATTTGGCGATTCTTCAATGAAGAAAATTCTCTGTGGAAATGCTTTATTAATGCAAAATATATATCCTTGGAA
GAAAGCACTATTCCCTCAAGCTCTCGATTCTCAAGTTCGAGAGCCCCATGGCTCTCTATATCCAAACAAATTCCCTTTTTTGAGCGGAACTCTTCTTGGCAGCTTAGGGA
TGGAAAAAAAATTCGCTTTTGGAAGGATTCGTGGTCTCCTTTTGGGCAATTATCTCGTGTTTTCCCTCGCCTTTTTGCTCTTTCAACCATGCATAATAATTTATCGGTCT
TTGAGGCTTGGGACCTGGCAAATTCCTCTTGGTCCTTTTTCCCGCGCAGACCTTTGCTAGACAGGGAGCTGGACTCTTGGACATCTTTTTCGTCTGCTCTGCCAAGACCT
TCATCTTCAGACAACAAGGATCTTCTCCGATGGGATCCTAACTCTCACGGGCTTTATACGGTGGCGTCTGCTCGTCGAAAGCTATGGGAAATCTCTCACTCTTCTCCTCC
TCAACAGTCTCATATGGAGCCAGGAATCATGTCCAACGTATGGAAAGCTACATTACCTAAAAAATGTAGATTCTTCATATGTACAATCTTTCACCGCAGCCTCAACACGG
ATGACCGCCTTCAAAGAATATTCAAACAATCCTCTTTGCTTCCGAGTCGATGCTCGATGTGTTGTCGAAACTCCGAATGTTTGGATCATCTCTTCGTCCAGTGTCCCATT
GCATCAGGCATTTGGACTCCTTTCTTCCAGAATATTGGTATCTCCTCGCCCATGCCTCTTCAAGCTATTAGTCTTTGTGCTCTTATTTTTTGGATCAAGGTCAATTCGCA
AAAAGCTATTGGATTCAAGCTTTTTCGGCAGCAACGCTATGGATTTTATGGAATGAACGTAATAGTCGTATTTTCAGAAATCAGTCTCGTTGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTTACGGAGGAGTTCTTTAAAAAAATCATGGACCACTCTAAAATCACCCATCATGTCTGTATTCCATGAGTTTTGGGAGCATGGAGTGATCAACCGAAATGTCAA
TGAATCATACATTGCTTTGATTCCTAAAAAGGCAAACTCCTTGAGAATTTCTGAATATCGCCCGATTAGCTTAACAACGGTCCTTTATCGGTTGATAGCAAAATCCCTCG
CTGAAAGGATAAAATGCACTCTCCCTTGCACCATTGCTGAAAGTCAATTTGCTTTTGTCAAAGGCCTCCAGATTTTTGACGCCATTTTATTAGCTAACGAGATTGTAGAT
CTTTGGAGAGTCTCGCACACAAGTGGCTTTATCATTAAGCTTGATATTGAAAAGGCCTTTGACAAGATTAGTTGGGACTTCATAGAAAGTATGCTTCGTTTTAAAGGTTT
CCCAAACATTTGGTGTGGCTGGATAAAAGCTTGCATCTCTTCAGTTTCATATTCCATATTGCTCAATGGCAAGCCGAGGGGTAAAATTCAGGCTTCCAGAGGAATTCGCC
AAGGAGATCCTATTTCTCCTTTCATTTTTGTCCTTGCCATGGATTACATTAGTAGACTCATCCAAACAGCCGAGCATAAGGGCCTCATTGAGGGATGTTCGATTAATGAA
ATATTCGTCTCTCATCTTCTTTTTGCAGACGATATTCTTCTATTCGTTAGAGATAACGATGTTTTCTTGGAGAACTACTTTATGATCATTAAGGCCTTTGAGCAAGCCTC
GGGTCTAAACATCAATTTTTCCAAATCTTCCATCTCCGGTATAAATGTTTCAGAGGATAGAGTTTCATTGATTGCCTCTAGATGGGGTTGTACTGCTCAAACTCTTCCAA
TCTCCTATTTGGGTACTCCTTTAGGAGCTAAACCCTCCAACGAATCGTTCTGGAGTCCGGTTGTGGAGAAAATCTACAGACATCTTGATAAATGGCAATATTCATACATT
TCAAAAGGAGGCCGCTTAACCCTCTTGCAGTCCACTCTGAATAGTTCTCTTATCTACCCCTTGTCGGTTTTTAAAGCGCCGCAGTCTATTTGTACCCGCATTGACCGAAT
CTTACGAAACTTTCTTTGGAAGGGGACCGAAAGTTCTGATCACAAGATCCCTCTGGTGGGCTGGAATAAAGTGACGACTCCTATTGCTTCTGGTGGCCTGGGTTTCTACA
AAACAAGGGTTTCAAATGCCGCGCTTCTCTCCAAGTGGATTTGGCGATTCTTCAATGAAGAAAATTCTCTGTGGAAATGCTTTATTAATGCAAAATATATATCCTTGGAA
GAAAGCACTATTCCCTCAAGCTCTCGATTCTCAAGTTCGAGAGCCCCATGGCTCTCTATATCCAAACAAATTCCCTTTTTTGAGCGGAACTCTTCTTGGCAGCTTAGGGA
TGGAAAAAAAATTCGCTTTTGGAAGGATTCGTGGTCTCCTTTTGGGCAATTATCTCGTGTTTTCCCTCGCCTTTTTGCTCTTTCAACCATGCATAATAATTTATCGGTCT
TTGAGGCTTGGGACCTGGCAAATTCCTCTTGGTCCTTTTTCCCGCGCAGACCTTTGCTAGACAGGGAGCTGGACTCTTGGACATCTTTTTCGTCTGCTCTGCCAAGACCT
TCATCTTCAGACAACAAGGATCTTCTCCGATGGGATCCTAACTCTCACGGGCTTTATACGGTGGCGTCTGCTCGTCGAAAGCTATGGGAAATCTCTCACTCTTCTCCTCC
TCAACAGTCTCATATGGAGCCAGGAATCATGTCCAACGTATGGAAAGCTACATTACCTAAAAAATGTAGATTCTTCATATGTACAATCTTTCACCGCAGCCTCAACACGG
ATGACCGCCTTCAAAGAATATTCAAACAATCCTCTTTGCTTCCGAGTCGATGCTCGATGTGTTGTCGAAACTCCGAATGTTTGGATCATCTCTTCGTCCAGTGTCCCATT
GCATCAGGCATTTGGACTCCTTTCTTCCAGAATATTGGTATCTCCTCGCCCATGCCTCTTCAAGCTATTAGTCTTTGTGCTCTTATTTTTTGGATCAAGGTCAATTCGCA
AAAAGCTATTGGATTCAAGCTTTTTCGGCAGCAACGCTATGGATTTTATGGAATGAACGTAATAGTCGTATTTTCAGAAATCAGTCTCGTTGTTTAG
Protein sequenceShow/hide protein sequence
MALRRSSLKKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGLQIFDAILLANEIVD
LWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPNIWCGWIKACISSVSYSILLNGKPRGKIQASRGIRQGDPISPFIFVLAMDYISRLIQTAEHKGLIEGCSINE
IFVSHLLFADDILLFVRDNDVFLENYFMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCTAQTLPISYLGTPLGAKPSNESFWSPVVEKIYRHLDKWQYSYI
SKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRILRNFLWKGTESSDHKIPLVGWNKVTTPIASGGLGFYKTRVSNAALLSKWIWRFFNEENSLWKCFINAKYISLE
ESTIPSSSRFSSSRAPWLSISKQIPFFERNSSWQLRDGKKIRFWKDSWSPFGQLSRVFPRLFALSTMHNNLSVFEAWDLANSSWSFFPRRPLLDRELDSWTSFSSALPRP
SSSDNKDLLRWDPNSHGLYTVASARRKLWEISHSSPPQQSHMEPGIMSNVWKATLPKKCRFFICTIFHRSLNTDDRLQRIFKQSSLLPSRCSMCCRNSECLDHLFVQCPI
ASGIWTPFFQNIGISSPMPLQAISLCALIFWIKVNSQKAIGFKLFRQQRYGFYGMNVIVVFSEISLVV