; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0001339 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0001339
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr04:30554473..30557426
RNA-Seq ExpressionPay0001339
SyntenyPay0001339
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039770.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.0e+0093.19Show/hide
Query:  MISWGPSPFKLINVHLKESWFKKNVTYWWKNLRQEGHPGFSFMRKLKQLSAIIRNEQRKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKA
        +ISWGPSPFKLINVHLKE WFK NVT WWKNLRQEGHPGFSFMRKLKQLS IIRNEQRKNKCYSDEDKNAWIKEID+IDRLEAEGNLSEELSLRRTRLKA
Subjt:  MISWGPSPFKLINVHLKESWFKKNVTYWWKNLRQEGHPGFSFMRKLKQLSAIIRNEQRKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKA

Query:  DVLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWCPISTTQAQN
        DVLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNIN VDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNL+W PIST QAQN
Subjt:  DVLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWCPISTTQAQN

Query:  LCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAE
        LCS+FTEEEIH ALTAFSNNKSPGPDGFTMEFYKSTWS LKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAE
Subjt:  LCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAE

Query:  RLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQN----------
        RLK+TLP TVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPF+WR WIRACISSVQ +          
Subjt:  RLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQN----------

Query:  -STSRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAA
           SRGIRQGDPISPFIFVLAMDY+SRLLNSVGEKIKGVK+EGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDA+
Subjt:  -STSRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAA

Query:  RTEQIASQWGISTKFLPINYLGVPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNP
        RTEQIASQWGISTKFLPINYLGVPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNP
Subjt:  RTEQIASQWGISTKFLPINYLGVPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNP

Query:  PETHKLHLVNWAKITSPKERGGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLE------------
        PETHKLHLVNWAKITS KE+GGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLE            
Subjt:  PETHKLHLVNWAKITSPKERGGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLE------------

Query:  --CFSFWHSHWHQNSPLSFHYPRLFALSTNKDSSIRDMWNNTLMDWDLNPRRQLREWEYPLWAELKNSLNASFCENGKDSPTWTLNSNGLYTVASVKKAL
           FSFWHSHWHQNSPLS HYPRL+ALSTNK+SSIRDMWNNTLMDWDLNPRRQLREWE+PLWAELKNSLNASFCENG DSP WTLNSNGLYTVASVKK L
Subjt:  --CFSFWHSHWHQNSPLSFHYPRLFALSTNKDSSIRDMWNNTLMDWDLNPRRQLREWEYPLWAELKNSLNASFCENGKDSPTWTLNSNGLYTVASVKKAL

Query:  QQPDQSLLDLQSQNTFKNLWKTSIPKKCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDKDRIHLFILCPIAKSIWRLISSHLNSNVNCLSP
        QQP+Q+LLD QSQNTFKNLWKTSIPKKCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRND+DRIHLFILCPIAKSIWR ISSHL+SNVNCLSP
Subjt:  QQPDQSLLDLQSQNTFKNLWKTSIPKKCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDKDRIHLFILCPIAKSIWRLISSHLNSNVNCLSP

Query:  KDLCITMCSWKQKTKKNVILYNTYASALWNIWLERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNAFS
        K+LCITMCSWKQKTKKNVIL+NTYASALWNIWLERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNAFS
Subjt:  KDLCITMCSWKQKTKKNVILYNTYASALWNIWLERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNAFS

KAA0041367.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.0e+0093.99Show/hide
Query:  MSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWCPISTTQAQNLCS
        MSGFKEAQIWYQKSKRLWITEGDENTSFFHKIC+ARQRRSIISNI  VDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSW PISTTQAQNLCS
Subjt:  MSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWCPISTTQAQNLCS

Query:  LFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLK
         FTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLK
Subjt:  LFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLK

Query:  ETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQN-----------ST
        ETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQ +             
Subjt:  ETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQN-----------ST

Query:  SRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAARTE
        SRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAARTE
Subjt:  SRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAARTE

Query:  QIASQWGISTKFLPINYLGVPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPPET
        QIASQWGISTKFLPINYLGVPLGGKQ TKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVS CKNIEKTWRNFLWKNPPET
Subjt:  QIASQWGISTKFLPINYLGVPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPPET

Query:  HKLHLVNWAKITSPKERGGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLE--------------C
        HKLHLVNWAKITSPKERGGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKG+E               
Subjt:  HKLHLVNWAKITSPKERGGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLE--------------C

Query:  FSFWHSHWHQNSPLSFHYPRLFALSTNKDSSIRDMWNNTLMDWDLNPRRQLREWEYPLWAELKNSLNASFCENGKDSPTWTLNSNGLYTVASVKKALQQP
        FSFWHSHWHQNSPLS HYPRLFALSTNKDSSIRDMWNNTLMDWDLNPRRQLREWEYPLWAELKNSLNASFCENGKDSP WTLNSNGLYTVASVKKALQQP
Subjt:  FSFWHSHWHQNSPLSFHYPRLFALSTNKDSSIRDMWNNTLMDWDLNPRRQLREWEYPLWAELKNSLNASFCENGKDSPTWTLNSNGLYTVASVKKALQQP

Query:  DQSLLDLQSQNTFKNL
        DQSL+DLQSQNTF  L
Subjt:  DQSLLDLQSQNTFKNL

TYK06777.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.0e+0088.82Show/hide
Query:  MISWGPSPFKLINVHLKESWFKKNVTYWWKNLRQEGHPGFSFMRKLKQLSAIIRNEQRKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKA
        +ISWGPSPFKLINVHLKE WFK N+T WWKNLRQEGHPGFSFMRKLKQLS IIRNEQRKNKCYSDEDKNAWIKEID+IDRLEAEGNLSEELSLRRTRLKA
Subjt:  MISWGPSPFKLINVHLKESWFKKNVTYWWKNLRQEGHPGFSFMRKLKQLSAIIRNEQRKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKA

Query:  DVLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWCPISTTQAQN
        DVL SGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNIN  DGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNL+W PIST QAQ 
Subjt:  DVLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWCPISTTQAQN

Query:  LCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAE
        LCS+FTEEEIH ALTAFS+NKSP                             S   ++  +NITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAE
Subjt:  LCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAE

Query:  RLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQN----------
        RLK+TLP TVAENQMAFVK RQIIDAILVANEAIDYWR KKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPF+WR WIRACISSVQ +          
Subjt:  RLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQN----------

Query:  -STSRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAA
           SRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKGVK+EGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDA+
Subjt:  -STSRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAA

Query:  RTEQIASQWGISTKFLPINYLGVPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNP
        RTEQIASQWGISTKFLPINYLGVPLGGKQ TK FWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNP
Subjt:  RTEQIASQWGISTKFLPINYLGVPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNP

Query:  PETHKLHLVNWAKITSPKERGGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLE------------
        PETHKLHLVNWAKITS KE+GGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLE            
Subjt:  PETHKLHLVNWAKITSPKERGGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLE------------

Query:  --CFSFWHSHWHQNSPLSFHYPRLFALSTNKDSSIRDMWNNTLMDWDLNPRRQLREWEYPLWAELKNSLNASFCENGKDSPTWTLNSNGLYTVASVKKAL
           FSFWH HWHQNSPLS HYPRL+ALSTNK+SSIRDMWNNTLMDWDLNPRRQLREWE PLWAELKNS+NASFCENGKDSP W LNSNGLYTVASVKKAL
Subjt:  --CFSFWHSHWHQNSPLSFHYPRLFALSTNKDSSIRDMWNNTLMDWDLNPRRQLREWEYPLWAELKNSLNASFCENGKDSPTWTLNSNGLYTVASVKKAL

Query:  QQPDQSLLDLQSQNTFKNLWKTSIPKKCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDKDRIHLFILCPIAKSIWRLISSHLNSNVNCLSP
        QQP+QSLLDLQSQNTFKNLWKTSIPKKCIFFIWTLLYDSVNTAEQLMKRLP LCSRPSWCVMCKRND+DRIHLFILCPIAKSIWR ISSHLNSNVNCLSP
Subjt:  QQPDQSLLDLQSQNTFKNLWKTSIPKKCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDKDRIHLFILCPIAKSIWRLISSHLNSNVNCLSP

Query:  KDLCITMCSWKQKTKKNVILYNTYASALWNIWLERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNAFS
        K+LCITMCSWKQKTKKNVIL+NTYASALWNIWLERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNAFS
Subjt:  KDLCITMCSWKQKTKKNVILYNTYASALWNIWLERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNAFS

TYK10356.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]0.0e+0071.61Show/hide
Query:  ISWGPSPFKLINVHLKESWFKKNVTYWWKNLRQEGHPGFSFMRKLKQLSAIIRNEQRKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKAD
        ISWGP PFKLINVHLKE WFK N+  WW NLR EGHPGFSFM+KLK LS IIR+EQ+KN  ++DE K AW+KE+DNIDRLEAEGNL EELSLRRT+ KAD
Subjt:  ISWGPSPFKLINVHLKESWFKKNVTYWWKNLRQEGHPGFSFMRKLKQLSAIIRNEQRKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKAD

Query:  VLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWCPISTTQAQNL
        +L+S FK AQIWYQKSKRLW TEGDENTSFFHKICSARQRRSIISNIN  DGVPC+TNE+IAK FLDHFE IY GGG E+PWLI+NLSW PIST+QAQNL
Subjt:  VLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWCPISTTQAQNL

Query:  CSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAER
        CS F+EEEIH+ALTAFSNNKSPGPDGFTMEF+K+ W  LK++I NIFRDFH+NCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAER
Subjt:  CSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAER

Query:  LKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQNSTSRGIRQGDP
        LKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGF                         GYP +WRRWI+ACISSVQ +    G  +   
Subjt:  LKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQNSTSRGIRQGDP

Query:  ISPFIFVLAMDYISRLLNSVGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAARTEQIASQWGIS
                                                        EDDEHS+QNLKNIINLFQLASGL+INLNKSTISPIN+DAART+QIASQWGI+
Subjt:  ISPFIFVLAMDYISRLLNSVGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAARTEQIASQWGIS

Query:  TKFLPINYLGVPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPPETHKLHLVNWA
        TKF PINYLGVPLGGK  TK FWKN++EKI+KKLASWKYSMLSKGGKITLIKS+LASLPTYQLSIFKAPVST K+IEK+WRNF WKN  ETHKLHLV+W 
Subjt:  TKFLPINYLGVPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPPETHKLHLVNWA

Query:  KITSPKERGGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLECFSFWHSHWHQNSPLSFHYPRLFA
                                                KI N +                               FSFWHSHWHQNSPLS HYPRLFA
Subjt:  KITSPKERGGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLECFSFWHSHWHQNSPLSFHYPRLFA

Query:  LSTNKDSSIRDMWNNTLMDWDLNPRRQLREWEYPLWAELKNSLNASFCENGKDSPTWTLNSNGLYTVASVKKALQQPDQSLLDLQSQNTFKNLWKTSIPK
        LSTN+D+SI+DMWN TLMDWDL PRRQLR+WE+PLWAELKNSLNASFCENG+DSPTW LNS+G Y+VASVKKAL QPDQ +L LQ+QNTFKNLWK+SIPK
Subjt:  LSTNKDSSIRDMWNNTLMDWDLNPRRQLREWEYPLWAELKNSLNASFCENGKDSPTWTLNSNGLYTVASVKKALQQPDQSLLDLQSQNTFKNLWKTSIPK

Query:  KCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDKDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKKNVILYNTYAS
        KC FFIWTLLYDSVNTA+QL KR+PNLCSRPSWCVMCKRN++DRIHLFILCPIAKSIW LISSHL SNVNCLSPKDLCITMCSWKQKTKKN+IL+NTYAS
Subjt:  KCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDKDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKKNVILYNTYAS

Query:  ALWNIWLERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNAFS
        ALWNIWLERNARIFNGKEKTVA++WEDIKALAGLWTSR  LF+NYQA+SIALNLNAF+
Subjt:  ALWNIWLERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNAFS

XP_016902461.1 PREDICTED: LINE-1 retrotransposable element ORF2 protein [Cucumis melo]0.0e+0093.09Show/hide
Query:  MISWGPSPFKLINVHLKESWFKKNVTYWWKNLRQEGHPGFSFMRKLKQLSAIIRNEQRKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKA
        +ISWGPSPFKLINVHLKE WFK NVT WWKNLRQEGHPGFSFMRKLKQLS IIRNEQRKNKCYSDEDKNAWIKEID+IDRLEAEGNLSEELSLRRTRLKA
Subjt:  MISWGPSPFKLINVHLKESWFKKNVTYWWKNLRQEGHPGFSFMRKLKQLSAIIRNEQRKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKA

Query:  DVLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWCPISTTQAQN
        DVLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNIN VDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNL+W PIST QAQN
Subjt:  DVLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWCPISTTQAQN

Query:  LCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAE
        LCS+FTEEEIH ALTAFSNNKSPGPDGFTMEFYKSTWS LKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAE
Subjt:  LCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAE

Query:  RLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQN----------
        RLK+TLP TVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPF+WR WIRACISSVQ +          
Subjt:  RLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQN----------

Query:  -STSRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAA
           SRGIRQGDPISPFIFVLAMDY+SRLLNSVGEKIKGVK+EGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDA+
Subjt:  -STSRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAA

Query:  RTEQIASQWGISTKFLPINYLGVPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNP
        RTEQIASQWGISTKFLPINYLGVPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFK PVSTCKNIEKTWRNFLWKNP
Subjt:  RTEQIASQWGISTKFLPINYLGVPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNP

Query:  PETHKLHLVNWAKITSPKERGGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLE------------
        PETHKLHLVNWAKITS KE+GGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLE            
Subjt:  PETHKLHLVNWAKITSPKERGGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLE------------

Query:  --CFSFWHSHWHQNSPLSFHYPRLFALSTNKDSSIRDMWNNTLMDWDLNPRRQLREWEYPLWAELKNSLNASFCENGKDSPTWTLNSNGLYTVASVKKAL
           FSFWHSHWHQNSPLS HYPRL+ALSTNK+SSIRDMWNNTLMDWDLNPRRQLREWE+PLWAELKNSLNASFCENG DSP WTLNSNGLYTVASVKK L
Subjt:  --CFSFWHSHWHQNSPLSFHYPRLFALSTNKDSSIRDMWNNTLMDWDLNPRRQLREWEYPLWAELKNSLNASFCENGKDSPTWTLNSNGLYTVASVKKAL

Query:  QQPDQSLLDLQSQNTFKNLWKTSIPKKCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDKDRIHLFILCPIAKSIWRLISSHLNSNVNCLSP
        QQP+Q+LLD QSQNTFKNLWKTSIPKKCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRND+DRIHLFILCPIAKSIWR ISSHL+SNVNCLSP
Subjt:  QQPDQSLLDLQSQNTFKNLWKTSIPKKCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDKDRIHLFILCPIAKSIWRLISSHLNSNVNCLSP

Query:  KDLCITMCSWKQKTKKNVILYNTYASALWNIWLERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNAFS
        K+LCITMCSWKQKTKKNVIL+NTYASALWNIWLERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNAFS
Subjt:  KDLCITMCSWKQKTKKNVILYNTYASALWNIWLERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNAFS

TrEMBL top hitse value%identityAlignment
A0A1S4E2K5 LINE-1 retrotransposable element ORF2 protein0.0e+0093.09Show/hide
Query:  MISWGPSPFKLINVHLKESWFKKNVTYWWKNLRQEGHPGFSFMRKLKQLSAIIRNEQRKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKA
        +ISWGPSPFKLINVHLKE WFK NVT WWKNLRQEGHPGFSFMRKLKQLS IIRNEQRKNKCYSDEDKNAWIKEID+IDRLEAEGNLSEELSLRRTRLKA
Subjt:  MISWGPSPFKLINVHLKESWFKKNVTYWWKNLRQEGHPGFSFMRKLKQLSAIIRNEQRKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKA

Query:  DVLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWCPISTTQAQN
        DVLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNIN VDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNL+W PIST QAQN
Subjt:  DVLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWCPISTTQAQN

Query:  LCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAE
        LCS+FTEEEIH ALTAFSNNKSPGPDGFTMEFYKSTWS LKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAE
Subjt:  LCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAE

Query:  RLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQN----------
        RLK+TLP TVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPF+WR WIRACISSVQ +          
Subjt:  RLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQN----------

Query:  -STSRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAA
           SRGIRQGDPISPFIFVLAMDY+SRLLNSVGEKIKGVK+EGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDA+
Subjt:  -STSRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAA

Query:  RTEQIASQWGISTKFLPINYLGVPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNP
        RTEQIASQWGISTKFLPINYLGVPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFK PVSTCKNIEKTWRNFLWKNP
Subjt:  RTEQIASQWGISTKFLPINYLGVPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNP

Query:  PETHKLHLVNWAKITSPKERGGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLE------------
        PETHKLHLVNWAKITS KE+GGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLE            
Subjt:  PETHKLHLVNWAKITSPKERGGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLE------------

Query:  --CFSFWHSHWHQNSPLSFHYPRLFALSTNKDSSIRDMWNNTLMDWDLNPRRQLREWEYPLWAELKNSLNASFCENGKDSPTWTLNSNGLYTVASVKKAL
           FSFWHSHWHQNSPLS HYPRL+ALSTNK+SSIRDMWNNTLMDWDLNPRRQLREWE+PLWAELKNSLNASFCENG DSP WTLNSNGLYTVASVKK L
Subjt:  --CFSFWHSHWHQNSPLSFHYPRLFALSTNKDSSIRDMWNNTLMDWDLNPRRQLREWEYPLWAELKNSLNASFCENGKDSPTWTLNSNGLYTVASVKKAL

Query:  QQPDQSLLDLQSQNTFKNLWKTSIPKKCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDKDRIHLFILCPIAKSIWRLISSHLNSNVNCLSP
        QQP+Q+LLD QSQNTFKNLWKTSIPKKCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRND+DRIHLFILCPIAKSIWR ISSHL+SNVNCLSP
Subjt:  QQPDQSLLDLQSQNTFKNLWKTSIPKKCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDKDRIHLFILCPIAKSIWRLISSHLNSNVNCLSP

Query:  KDLCITMCSWKQKTKKNVILYNTYASALWNIWLERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNAFS
        K+LCITMCSWKQKTKKNVIL+NTYASALWNIWLERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNAFS
Subjt:  KDLCITMCSWKQKTKKNVILYNTYASALWNIWLERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNAFS

A0A5A7TI93 LINE-1 retrotransposable element ORF2 protein0.0e+0093.99Show/hide
Query:  MSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWCPISTTQAQNLCS
        MSGFKEAQIWYQKSKRLWITEGDENTSFFHKIC+ARQRRSIISNI  VDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSW PISTTQAQNLCS
Subjt:  MSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWCPISTTQAQNLCS

Query:  LFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLK
         FTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLK
Subjt:  LFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLK

Query:  ETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQN-----------ST
        ETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQ +             
Subjt:  ETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQN-----------ST

Query:  SRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAARTE
        SRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAARTE
Subjt:  SRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAARTE

Query:  QIASQWGISTKFLPINYLGVPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPPET
        QIASQWGISTKFLPINYLGVPLGGKQ TKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVS CKNIEKTWRNFLWKNPPET
Subjt:  QIASQWGISTKFLPINYLGVPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPPET

Query:  HKLHLVNWAKITSPKERGGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLE--------------C
        HKLHLVNWAKITSPKERGGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKG+E               
Subjt:  HKLHLVNWAKITSPKERGGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLE--------------C

Query:  FSFWHSHWHQNSPLSFHYPRLFALSTNKDSSIRDMWNNTLMDWDLNPRRQLREWEYPLWAELKNSLNASFCENGKDSPTWTLNSNGLYTVASVKKALQQP
        FSFWHSHWHQNSPLS HYPRLFALSTNKDSSIRDMWNNTLMDWDLNPRRQLREWEYPLWAELKNSLNASFCENGKDSP WTLNSNGLYTVASVKKALQQP
Subjt:  FSFWHSHWHQNSPLSFHYPRLFALSTNKDSSIRDMWNNTLMDWDLNPRRQLREWEYPLWAELKNSLNASFCENGKDSPTWTLNSNGLYTVASVKKALQQP

Query:  DQSLLDLQSQNTFKNL
        DQSL+DLQSQNTF  L
Subjt:  DQSLLDLQSQNTFKNL

A0A5D3C4J1 LINE-1 retrotransposable element ORF2 protein0.0e+0088.82Show/hide
Query:  MISWGPSPFKLINVHLKESWFKKNVTYWWKNLRQEGHPGFSFMRKLKQLSAIIRNEQRKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKA
        +ISWGPSPFKLINVHLKE WFK N+T WWKNLRQEGHPGFSFMRKLKQLS IIRNEQRKNKCYSDEDKNAWIKEID+IDRLEAEGNLSEELSLRRTRLKA
Subjt:  MISWGPSPFKLINVHLKESWFKKNVTYWWKNLRQEGHPGFSFMRKLKQLSAIIRNEQRKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKA

Query:  DVLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWCPISTTQAQN
        DVL SGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNIN  DGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNL+W PIST QAQ 
Subjt:  DVLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWCPISTTQAQN

Query:  LCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAE
        LCS+FTEEEIH ALTAFS+NKSP                             S   ++  +NITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAE
Subjt:  LCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAE

Query:  RLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQN----------
        RLK+TLP TVAENQMAFVK RQIIDAILVANEAIDYWR KKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPF+WR WIRACISSVQ +          
Subjt:  RLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQN----------

Query:  -STSRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAA
           SRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKGVK+EGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDA+
Subjt:  -STSRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAA

Query:  RTEQIASQWGISTKFLPINYLGVPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNP
        RTEQIASQWGISTKFLPINYLGVPLGGKQ TK FWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNP
Subjt:  RTEQIASQWGISTKFLPINYLGVPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNP

Query:  PETHKLHLVNWAKITSPKERGGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLE------------
        PETHKLHLVNWAKITS KE+GGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLE            
Subjt:  PETHKLHLVNWAKITSPKERGGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLE------------

Query:  --CFSFWHSHWHQNSPLSFHYPRLFALSTNKDSSIRDMWNNTLMDWDLNPRRQLREWEYPLWAELKNSLNASFCENGKDSPTWTLNSNGLYTVASVKKAL
           FSFWH HWHQNSPLS HYPRL+ALSTNK+SSIRDMWNNTLMDWDLNPRRQLREWE PLWAELKNS+NASFCENGKDSP W LNSNGLYTVASVKKAL
Subjt:  --CFSFWHSHWHQNSPLSFHYPRLFALSTNKDSSIRDMWNNTLMDWDLNPRRQLREWEYPLWAELKNSLNASFCENGKDSPTWTLNSNGLYTVASVKKAL

Query:  QQPDQSLLDLQSQNTFKNLWKTSIPKKCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDKDRIHLFILCPIAKSIWRLISSHLNSNVNCLSP
        QQP+QSLLDLQSQNTFKNLWKTSIPKKCIFFIWTLLYDSVNTAEQLMKRLP LCSRPSWCVMCKRND+DRIHLFILCPIAKSIWR ISSHLNSNVNCLSP
Subjt:  QQPDQSLLDLQSQNTFKNLWKTSIPKKCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDKDRIHLFILCPIAKSIWRLISSHLNSNVNCLSP

Query:  KDLCITMCSWKQKTKKNVILYNTYASALWNIWLERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNAFS
        K+LCITMCSWKQKTKKNVIL+NTYASALWNIWLERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNAFS
Subjt:  KDLCITMCSWKQKTKKNVILYNTYASALWNIWLERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNAFS

A0A5D3CJ08 LINE-1 retrotransposable element ORF2 protein0.0e+0071.61Show/hide
Query:  ISWGPSPFKLINVHLKESWFKKNVTYWWKNLRQEGHPGFSFMRKLKQLSAIIRNEQRKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKAD
        ISWGP PFKLINVHLKE WFK N+  WW NLR EGHPGFSFM+KLK LS IIR+EQ+KN  ++DE K AW+KE+DNIDRLEAEGNL EELSLRRT+ KAD
Subjt:  ISWGPSPFKLINVHLKESWFKKNVTYWWKNLRQEGHPGFSFMRKLKQLSAIIRNEQRKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKAD

Query:  VLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWCPISTTQAQNL
        +L+S FK AQIWYQKSKRLW TEGDENTSFFHKICSARQRRSIISNIN  DGVPC+TNE+IAK FLDHFE IY GGG E+PWLI+NLSW PIST+QAQNL
Subjt:  VLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWCPISTTQAQNL

Query:  CSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAER
        CS F+EEEIH+ALTAFSNNKSPGPDGFTMEF+K+ W  LK++I NIFRDFH+NCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAER
Subjt:  CSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAER

Query:  LKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQNSTSRGIRQGDP
        LKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGF                         GYP +WRRWI+ACISSVQ +    G  +   
Subjt:  LKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQNSTSRGIRQGDP

Query:  ISPFIFVLAMDYISRLLNSVGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAARTEQIASQWGIS
                                                        EDDEHS+QNLKNIINLFQLASGL+INLNKSTISPIN+DAART+QIASQWGI+
Subjt:  ISPFIFVLAMDYISRLLNSVGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAARTEQIASQWGIS

Query:  TKFLPINYLGVPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPPETHKLHLVNWA
        TKF PINYLGVPLGGK  TK FWKN++EKI+KKLASWKYSMLSKGGKITLIKS+LASLPTYQLSIFKAPVST K+IEK+WRNF WKN  ETHKLHLV+W 
Subjt:  TKFLPINYLGVPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPPETHKLHLVNWA

Query:  KITSPKERGGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLECFSFWHSHWHQNSPLSFHYPRLFA
                                                KI N +                               FSFWHSHWHQNSPLS HYPRLFA
Subjt:  KITSPKERGGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLECFSFWHSHWHQNSPLSFHYPRLFA

Query:  LSTNKDSSIRDMWNNTLMDWDLNPRRQLREWEYPLWAELKNSLNASFCENGKDSPTWTLNSNGLYTVASVKKALQQPDQSLLDLQSQNTFKNLWKTSIPK
        LSTN+D+SI+DMWN TLMDWDL PRRQLR+WE+PLWAELKNSLNASFCENG+DSPTW LNS+G Y+VASVKKAL QPDQ +L LQ+QNTFKNLWK+SIPK
Subjt:  LSTNKDSSIRDMWNNTLMDWDLNPRRQLREWEYPLWAELKNSLNASFCENGKDSPTWTLNSNGLYTVASVKKALQQPDQSLLDLQSQNTFKNLWKTSIPK

Query:  KCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDKDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKKNVILYNTYAS
        KC FFIWTLLYDSVNTA+QL KR+PNLCSRPSWCVMCKRN++DRIHLFILCPIAKSIW LISSHL SNVNCLSPKDLCITMCSWKQKTKKN+IL+NTYAS
Subjt:  KCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDKDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKKNVILYNTYAS

Query:  ALWNIWLERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNAFS
        ALWNIWLERNARIFNGKEKTVA++WEDIKALAGLWTSR  LF+NYQA+SIALNLNAF+
Subjt:  ALWNIWLERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNAFS

A0A5D3DM72 LINE-1 retrotransposable element ORF2 protein0.0e+0093.19Show/hide
Query:  MISWGPSPFKLINVHLKESWFKKNVTYWWKNLRQEGHPGFSFMRKLKQLSAIIRNEQRKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKA
        +ISWGPSPFKLINVHLKE WFK NVT WWKNLRQEGHPGFSFMRKLKQLS IIRNEQRKNKCYSDEDKNAWIKEID+IDRLEAEGNLSEELSLRRTRLKA
Subjt:  MISWGPSPFKLINVHLKESWFKKNVTYWWKNLRQEGHPGFSFMRKLKQLSAIIRNEQRKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKA

Query:  DVLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWCPISTTQAQN
        DVLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNIN VDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNL+W PIST QAQN
Subjt:  DVLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWCPISTTQAQN

Query:  LCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAE
        LCS+FTEEEIH ALTAFSNNKSPGPDGFTMEFYKSTWS LKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAE
Subjt:  LCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAE

Query:  RLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQN----------
        RLK+TLP TVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPF+WR WIRACISSVQ +          
Subjt:  RLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQN----------

Query:  -STSRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAA
           SRGIRQGDPISPFIFVLAMDY+SRLLNSVGEKIKGVK+EGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDA+
Subjt:  -STSRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAA

Query:  RTEQIASQWGISTKFLPINYLGVPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNP
        RTEQIASQWGISTKFLPINYLGVPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNP
Subjt:  RTEQIASQWGISTKFLPINYLGVPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNP

Query:  PETHKLHLVNWAKITSPKERGGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLE------------
        PETHKLHLVNWAKITS KE+GGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLE            
Subjt:  PETHKLHLVNWAKITSPKERGGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLE------------

Query:  --CFSFWHSHWHQNSPLSFHYPRLFALSTNKDSSIRDMWNNTLMDWDLNPRRQLREWEYPLWAELKNSLNASFCENGKDSPTWTLNSNGLYTVASVKKAL
           FSFWHSHWHQNSPLS HYPRL+ALSTNK+SSIRDMWNNTLMDWDLNPRRQLREWE+PLWAELKNSLNASFCENG DSP WTLNSNGLYTVASVKK L
Subjt:  --CFSFWHSHWHQNSPLSFHYPRLFALSTNKDSSIRDMWNNTLMDWDLNPRRQLREWEYPLWAELKNSLNASFCENGKDSPTWTLNSNGLYTVASVKKAL

Query:  QQPDQSLLDLQSQNTFKNLWKTSIPKKCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDKDRIHLFILCPIAKSIWRLISSHLNSNVNCLSP
        QQP+Q+LLD QSQNTFKNLWKTSIPKKCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRND+DRIHLFILCPIAKSIWR ISSHL+SNVNCLSP
Subjt:  QQPDQSLLDLQSQNTFKNLWKTSIPKKCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDKDRIHLFILCPIAKSIWRLISSHLNSNVNCLSP

Query:  KDLCITMCSWKQKTKKNVILYNTYASALWNIWLERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNAFS
        K+LCITMCSWKQKTKKNVIL+NTYASALWNIWLERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNAFS
Subjt:  KDLCITMCSWKQKTKKNVILYNTYASALWNIWLERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNAFS

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein2.5e-4024.22Show/hide
Query:  QRKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLK---ADVLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDG
        +RK +    +   + +KE++  ++  ++ +  +E++  R  LK       +    E++ W+ +     I + D   +   ++   ++ ++ I  I    G
Subjt:  QRKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLK---ADVLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDG

Query:  VPCSTNESIAKAFLDHFEDIYKG---GGEESPWLIDNLSWCPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRD
           +    I     ++++ +Y       EE    +D  +   ++  + ++L    T  EI A + +    KSPGPDGFT EFY+    +L   +L +F+ 
Subjt:  VPCSTNESIAKAFLDHFEDIYKG---GGEESPWLIDNLSWCPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRD

Query:  FHSNCIINKAVNITNIALIAKKEK-CAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYW-RVKKIQGFVIK
             I+  +    +I LI K  +   +  ++RPISL     K++ K++A R+++ +   +  +Q+ F+ G Q    I  +   I +  R K     +I 
Subjt:  FHSNCIINKAVNITNIALIAKKEK-CAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYW-RVKKIQGFVIK

Query:  LDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQN-----------STSRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKGVKMEGNINLT
        +D EKAFDK+   F+   L K G    + + IRA       N               G RQG P+SP +F + ++ ++R +    E IKG+++ G   + 
Subjt:  LDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQN-----------STSRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKGVKMEGNINLT

Query:  HLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAARTEQIASQWGISTKFLPINYLGVPL--GGKQITKTFWKNVEEKINKKL
          LFADD+++++E+   S QNL  +I+ F   SG  IN+ KS     N +     QI  +   +     I YLG+ L    K + K  +K + ++I +  
Subjt:  HLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAARTEQIASQWGISTKFLPINYLGVPL--GGKQITKTFWKNVEEKINKKL

Query:  ASWKYSMLSKGGKITLIKSSLASLPTYQLSI--FKAPVSTCKNIEKTWRNFLWKNPPETHKLHLVNWAKITSPKERGGLGISRLKDTNFALLTK--WLWR
          WK    S  G+I ++K ++     Y+ +    K P++    +EKT   F+W       K   +  + ++   + GG+ +   K    A +TK  W W 
Subjt:  ASWKYSMLSKGGKITLIKSSLASLPTYQLSI--FKAPVSTCKNIEKTWRNFLWKNPPETHKLHLVNWAKITSPKERGGLGISRLKDTNFALLTK--WLWR

Query:  YIHEDSPLWKK
        Y + D   W +
Subjt:  YIHEDSPLWKK

P08548 LINE-1 reverse transcriptase homolog7.6e-4224.51Show/hide
Query:  FKLINVHLKESW----FKKNVTYWWKNLRQEGHPGFSFMRKLKQLSAIIRNEQRKNKCYSDEDKNAWIKEI-DNIDRLEAEGNLSEELSLRR--TRLKAD
        +KL N+ LK++W     KK +T   K L Q  +   ++        A++R +    + +  + +   +  +  ++ +LE E + + + S R+  T+++A+
Subjt:  FKLINVHLKESW----FKKNVTYWWKNLRQEGHPGFSFMRKLKQLSAIIRNEQRKNKCYSDEDKNAWIKEI-DNIDRLEAEGNLSEELSLRR--TRLKAD

Query:  VLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWC---PISTTQA
        +     K       KSK  +  + ++       +   ++ +S+IS+I   +    +    I K   ++++ +Y    E    +   L  C    +S  + 
Subjt:  VLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWC---PISTTQA

Query:  QNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEK-CAEPADYRPISLTTSIYKLIAKV
        + L    +  EI + +      KSPGPDGFT EFY++   +L   +LN+F++     I+       NI LI K  K      +YRPISL     K++ K+
Subjt:  QNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEK-CAEPADYRPISLTTSIYKLIAKV

Query:  IAERLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYW-RVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQNSTSRGI
        +  R+++ +   +  +Q+ F+ G Q    I  +   I +  ++K     ++ +D EKAFD +   F+   L K G    + + I A  S    N    G+
Subjt:  IAERLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYW-RVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQNSTSRGI

Query:  -----------RQGDPISPFIFVLAMDYISRLLNSVGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKST--ISP
                   RQG P+SP +F + M+ ++  +    + IKG+ + G+  +   LFADD+++++E+   S   L  +I  +   SG  IN +KS   I  
Subjt:  -----------RQGDPISPFIFVLAMDYISRLLNSVGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKST--ISP

Query:  INVDAARTEQIASQWGISTKFLPINYLGVPL--GGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSI--FKAPVSTCKNIEK
         N  A +T + +  + +  K   + YLGV L    K + K  ++ + ++I + +  WK    S  G+I ++K S+     Y  +    KAP+S  K++EK
Subjt:  INVDAARTEQIASQWGISTKFLPINYLGVPL--GGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSI--FKAPVSTCKNIEK

Query:  TWRNFLW-KNPPETHKLHLVNWAKITSPKERGGLGIS--RLKDTNFALLTKWLWRYIHEDSPLWKKIIN
           +F+W +  P+  K  L N        + GG+ +   RL   +  + T W W + + +  +W +I N
Subjt:  TWRNFLW-KNPPETHKLHLVNWAKITSPKERGGLGIS--RLKDTNFALLTKWLWRYIHEDSPLWKKIIN

P0C2F6 Putative ribonuclease H protein At1g657503.6e-3126.26Show/hide
Query:  VPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPPETHKLHLVNWAKITSPKERGG
        +P+  K+I K  +  + E+++ +++ W+   LS  G++TL K+ L+S+P + +S    P S    +++  R FLW +  E  K HLV W+K+ SPK+ GG
Subjt:  VPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPPETHKLHLVNWAKITSPKERGG

Query:  LGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGL---------------ECFSFWHSHWHQNSPL----
        LG+   K  N AL++K  WR + E + LW  ++  KY      D   +    S  S W SI  GL               +   FW   W    PL    
Subjt:  LGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGL---------------ECFSFWHSHWHQNSPL----

Query:  SFHYPRLFALSTNKDSSI-RDMWNNTLMDWDL---------NPRRQLREWEYPLWAELKNSLNASFCENGKDSPTWTLNSNGLYTVASVKKALQQPDQSL
        +   P      T+ D+ + +D+W      WD          N R +LR            ++        +D  +W  + +G ++V S  + L   +   
Subjt:  SFHYPRLFALSTNKDSSI-RDMWNNTLMDWDL---------NPRRQLREWEYPLWAELKNSLNASFCENGKDSPTWTLNSNGLYTVASVKKALQQPDQSL

Query:  LDLQSQNTFKNLWKTSIPKKCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDKDRIHLFILCPIAKSIW
         ++ S   F  LWK  +P++   F+W +   +V T E+  +R     S  + C +CK   +  +H+   CP    IW
Subjt:  LDLQSQNTFKNLWKTSIPKKCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDKDRIHLFILCPIAKSIW

P11369 LINE-1 retrotransposable element ORF2 protein6.7e-3825.56Show/hide
Query:  KICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFEDIYKGGGE---ESPWLIDNLSWCPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTM
        ++    + + +I+ I    G   +  E I       ++ +Y    E   E    +D      ++  Q  +L S  + +EI A + +    KSPGPDGF+ 
Subjt:  KICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFEDIYKGGGE---ESPWLIDNLSWCPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTM

Query:  EFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEK-CAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILV
        EFY++   DL   +  +F        +  +     I LI K +K   +  ++RPISL     K++ K++A R++E + + +  +Q+ F+ G Q    I  
Subjt:  EFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEK-CAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILV

Query:  ANEAIDYW-RVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQN-----------STSRGIRQGDPISPFIFVLAMDYISRL
        +   I Y  ++K     +I LD EKAFDK+   F+  +L + G    +   I+A  S    N               G RQG P+SP++F + ++ ++R 
Subjt:  ANEAIDYW-RVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQN-----------STSRGIRQGDPISPFIFVLAMDYISRL

Query:  LNSVGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAARTEQIASQWGISTKFLPINYLGVPLGG-
        +    E IKG+++ G   +   L ADD+++++ D ++S + L N+IN F    G  IN NKS       +    ++I      S     I YLGV L   
Subjt:  LNSVGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAARTEQIASQWGISTKFLPINYLGVPLGG-

Query:  -KQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSI--FKAPVSTCKNIEKTWRNFLWKNPPETHKLHLVNWAKITSPKERGGLG
         K +    +K+++++I + L  WK    S  G+I ++K ++     Y+ +    K P      +E     F+W N     K   +  + +   +  GG+ 
Subjt:  -KQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSI--FKAPVSTCKNIEKTWRNFLWKNPPETHKLHLVNWAKITSPKERGGLG

Query:  ISRLKDTNFALL--TKWLWRYIHEDSPLWKKI
        +  LK    A++  T W W Y       W +I
Subjt:  ISRLKDTNFALL--TKWLWRYIHEDSPLWKKI

P14381 Transposon TX1 uncharacterized 149 kDa protein5.9e-3423.65Show/hide
Query:  KNAWIKEIDNIDRLEAEGNL--SEELSLRRTRLKADVLMSGFKEAQI--WYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIA
        +NA I+ + N + L+ E  L  SE+ +L+   L+    +   ++ Q    + +S+   + + D  + FF+ +   +  R  I+ +   DG P    E+I 
Subjt:  KNAWIKEIDNIDRLEAEGNL--SEELSLRRTRLKADVLMSGFKEAQI--WYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIA

Query:  KAFLDHFEDIYKG---GGEESPWLIDNLSWCPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKA
              +++++       +    L D L    +S  + + L +  T +E+  AL    +NKSPG DG T+EF++  W  L  +   +  +      +  +
Subjt:  KAFLDHFEDIYKG---GGEESPWLIDNLSWCPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKA

Query:  VNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNW
             ++L+ KK       ++RP+SL ++ YK++AK I+ RLK  L   +  +Q   V GR I D + +  + + + R   +    + LD EKAFD+++ 
Subjt:  VNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNW

Query:  RFIDFMLMKKGYPFRWRRWIRA------CISSVQQNSTS-----RGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKG-VKMEGNINLTHLLFADDILLF
        +++   L    +  ++  +++       C+  +  + T+     RG+RQG P+S  ++ LA++    LL    +++ G V  E ++ +    +ADD++L 
Subjt:  RFIDFMLMKKGYPFRWRRWIRA------CISSVQQNSTS-----RGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKG-VKMEGNINLTHLLFADDILLF

Query:  VEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAARTEQIASQW-GISTKFLPINYLGVPLGGKQ--ITKTFWKNVEEKINKKLASWK--YSML
        V  D   ++  +    ++  AS   IN +KS  S +   + + + +   +  IS +   I YLGV L  ++  +++ F + +EE +  +L  WK    +L
Subjt:  VEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAARTEQIASQW-GISTKFLPINYLGVPLGGKQ--ITKTFWKNVEEKINKKLASWK--YSML

Query:  SKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPPETHKLHLVNWAKITSPKERGGLGISRLKDTNFALLTKWLWRYIHED-SPLWKK
        S  G+  +I   +AS   Y+L            I++   +FLW         H V+    + P + GG G+  ++        + + RY++ D SP W  
Subjt:  SKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPPETHKLHLVNWAKITSPKERGGLGISRLKDTNFALLTKWLWRYIHED-SPLWKK

Query:  IINAKYRSL
        + ++ YR +
Subjt:  IINAKYRSL

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein8.4e-2027.43Show/hide
Query:  IDNIDRLEAEGNLSEELSLRRTRLKADVLMSGFKEA--QIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFED
        +D+++ ++++   +   SL R    A    + F  A    + QKS+  W+ +GD NT FFHK+  A Q +++I  +   D V       + +  + ++  
Subjt:  IDNIDRLEAEGNLSEELSLRRTRLKADVLMSGFKEA--QIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFED

Query:  IYKGGGE----ESPWLIDNLSWCPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIAL
        +     +    +S   I ++     + T A  L +L +++EI AA+ A   NK+PGPD FT EF+  +W  +K+  +   ++F     + K  N T I L
Subjt:  IYKGGGE----ESPWLIDNLSWCPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIAL

Query:  IAKKEKCAEPADYRPISLTTSIYKLI
        I K     + + +RP+S  T +YK+I
Subjt:  IAKKEKCAEPADYRPISLTTSIYKLI

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.0e-2224.23Show/hide
Query:  LPINYLGVPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPPETHKLHLVNWAKIT
        LP+ YLG+PL  K++T + +  + EKI  ++  W    LS  G++ LI S + SL  + +S F+ P +  K I+    +FLW  P    K   V W+ + 
Subjt:  LPINYLGVPLGGKQITKTFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPPETHKLHLVNWAKIT

Query:  SPKERGGLGISRLKDTNFALLTKWLWRYIHE---DSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLECFSFWHSHWHQNSPL--SFHYPRL
        +PK+ GGLGI  LK+ N        W         S +WKKI+  K+R+L+ G +    ++ S+              SFW  +W +   L     +   
Subjt:  SPKERGGLGISRLKDTNFALLTKWLWRYIHE---DSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLECFSFWHSHWHQNSPL--SFHYPRL

Query:  FALSTNKDSSIRDMWNNTLMDWDLNPRRQLREWEYPLWAELKNSLNASFCENGKDSPTWTLNSNGLYTVASVKKALQQPDQSLLDLQSQNTFKNLWKTSI
          +     +S+ +   N        PRR   +    +  ++   +      +G+D+  W  N +      + K+      +  L +   N +K +W +  
Subjt:  FALSTNKDSSIRDMWNNTLMDWDLNPRRQLREWEYPLWAELKNSLNASFCENGKDSPTWTLNSNGLYTVASVKKALQQPDQSLLDLQSQNTFKNLWKTSI

Query:  PKKCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDKDRIHLFILCPIAKSI
          K     W  + + + T +++   L       S CV+C    + R HLF  CP +  +
Subjt:  PKKCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDKDRIHLFILCPIAKSI

AT4G05095.1 BEST Arabidopsis thaliana protein match is: RNA-directed DNA polymerase (reverse transcriptase)-related family protein (TAIR:AT4G04650.1)3.8e-0424.3Show/hide
Query:  SRPSWCVMCKRNDKDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKK----NVILYNTYASALWNIWLERNARIFNGKEKTVAEI
        S PS CV+C  N + R HLF  C ++ ++W       +     L+P  L +   +W     +    ++I+   + ++++ +W ERN R+ +   ++   I
Subjt:  SRPSWCVMCKRNDKDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKK----NVILYNTYASALWNIWLERNARIFNGKEKTVAEI

Query:  WEDIKAL
         ++IK +
Subjt:  WEDIKAL

AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.3e-0734.57Show/hide
Query:  IAERLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKK-IQGF-VIKLDIEKAFDKLNWRFIDFMLMKKGYPFRW
        + ERLK  + + +   Q +F+ GR   D I+   EA+   R KK ++G+ ++KLD+EKA+D++ W +++  L+  G+P  W
Subjt:  IAERLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKK-IQGF-VIKLDIEKAFDKLNWRFIDFMLMKKGYPFRW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.4e-0643.64Show/hide
Query:  SRGIRQGDPISPFIFVLAMDYISRLLNSVGE--KIKGVKMEGNI-NLTHLLFADD
        SRG+RQGDP+SP++F+L  + +S L     E  ++ G+++  N   + HLLFADD
Subjt:  SRGIRQGDPISPFIFVLAMDYISRLLNSVGE--KIKGVKMEGNI-NLTHLLFADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCAGTTGGGGCCCTTCTCCCTTTAAGCTTATAAATGTACATCTGAAAGAATCATGGTTCAAAAAAAATGTCACATATTGGTGGAAAAATTTGAGACAGGAG
GGGCACCCAGGCTTTTCTTTTATGAGAAAGCTAAAGCAACTATCTGCCATTATAAGAAATGAGCAAAGAAAGAACAAATGCTACAGTGATGAAGATAAAAATGCT
TGGATAAAAGAAATCGACAACATAGACAGATTAGAAGCTGAAGGAAACTTATCTGAAGAGCTTAGCCTCCGTAGGACTAGATTAAAAGCTGATGTCCTTATGTCC
GGTTTCAAAGAAGCTCAAATATGGTACCAAAAAAGCAAGAGATTGTGGATCACTGAAGGAGATGAAAATACATCTTTCTTTCACAAAATCTGCTCTGCCAGACAA
AGAAGAAGCATTATATCAAATATTAACTTCGTCGATGGTGTTCCTTGTTCAACAAATGAGAGCATTGCAAAAGCCTTCTTAGATCACTTTGAAGATATTTACAAA
GGGGGTGGAGAGGAAAGCCCTTGGCTTATTGATAATCTTAGTTGGTGTCCTATATCAACCACCCAAGCACAAAATTTATGTTCTTTGTTCACGGAGGAGGAAATT
CATGCAGCCCTTACTGCTTTCTCAAACAATAAAAGCCCGGGTCCAGATGGCTTTACCATGGAATTCTATAAATCAACTTGGTCTGACCTCAAGGAAGAAATTCTC
AACATATTCAGAGACTTCCACTCAAACTGTATCATTAACAAAGCAGTAAACATCACAAATATTGCTCTAATTGCCAAAAAGGAAAAGTGTGCGGAGCCTGCGGAT
TACAGACCTATAAGTTTAACGACCTCCATTTACAAACTTATTGCCAAAGTCATTGCGGAAAGACTAAAAGAAACTCTTCCATCCACAGTGGCAGAGAACCAAATG
GCCTTTGTAAAAGGCAGACAAATCATAGATGCTATCTTAGTTGCAAATGAAGCCATCGACTATTGGAGAGTTAAAAAAATTCAAGGCTTTGTTATAAAGCTGGAT
ATTGAAAAAGCATTTGACAAACTAAACTGGAGATTCATTGATTTCATGCTTATGAAAAAGGGGTACCCCTTCAGATGGAGGAGGTGGATAAGAGCTTGTATTAGC
AGTGTACAGCAAAATTCAACCTCCCGTGGCATAAGGCAAGGAGACCCTATTTCCCCTTTTATTTTTGTCCTAGCAATGGATTATATAAGCAGGCTGCTGAACTCT
GTGGGTGAAAAAATCAAAGGGGTGAAAATGGAGGGCAACATAAATCTGACACACTTACTCTTTGCAGATGATATTCTGCTTTTTGTAGAAGATGATGAGCACTCA
ATTCAAAATTTAAAGAATATCATCAACCTCTTCCAGCTTGCTTCGGGGCTGAGTATCAATCTAAATAAATCCACCATTTCTCCTATAAATGTTGACGCTGCAAGA
ACTGAACAGATAGCTTCACAATGGGGAATTTCTACAAAATTTCTTCCAATCAACTACCTTGGAGTTCCTCTCGGAGGTAAACAAATCACAAAGACTTTTTGGAAG
AACGTTGAAGAAAAGATAAACAAAAAACTTGCCAGCTGGAAATATTCTATGTTATCTAAAGGAGGTAAAATTACTCTGATTAAATCTTCTTTGGCTAGCCTTCCT
ACTTATCAGCTATCAATCTTCAAAGCCCCTGTATCAACCTGCAAAAACATTGAAAAAACCTGGAGAAATTTCTTATGGAAGAACCCACCAGAGACCCATAAACTA
CACCTAGTTAATTGGGCGAAGATTACTTCTCCAAAAGAGAGAGGAGGGCTGGGCATAAGTCGACTGAAAGACACAAACTTTGCTCTTCTAACAAAATGGCTCTGG
AGATACATCCATGAAGATTCCCCCCTATGGAAGAAAATTATAAATGCAAAATATAGAAGCCTATCCAAAGGGGACATTCCTTGTGTTTGCAATCATAGCAGCAGC
CGTTCCCCATGGTTTTCCATTTGCAAAGGCTTGGAGTGCTTCTCCTTTTGGCATAGTCACTGGCATCAAAATAGTCCTCTGTCTTTTCACTACCCCAGATTATTT
GCTCTATCTACAAATAAGGACAGCTCCATAAGAGACATGTGGAATAATACCTTGATGGATTGGGATCTAAATCCAAGAAGACAGTTAAGAGAGTGGGAATATCCT
CTGTGGGCTGAGTTAAAAAACTCTCTAAATGCAAGCTTTTGCGAAAATGGAAAAGACTCTCCAACGTGGACCCTAAACTCTAATGGCTTATACACAGTGGCCTCG
GTTAAAAAAGCTCTCCAACAGCCTGATCAAAGCCTCTTAGACCTCCAAAGCCAAAACACTTTCAAGAATCTTTGGAAGACAAGCATCCCAAAGAAATGCATCTTT
TTCATATGGACTCTGCTCTATGATAGTGTGAACACTGCTGAACAACTTATGAAGAGATTACCAAATCTCTGTTCTAGACCGAGCTGGTGTGTCATGTGTAAGAGG
AATGACAAAGACAGAATACACCTCTTTATCCTATGCCCCATTGCAAAGTCTATTTGGAGATTAATATCATCACACTTAAACAGCAATGTAAACTGCCTCAGTCCA
AAGGATCTTTGTATTACCATGTGCAGCTGGAAACAGAAAACAAAGAAAAATGTCATTCTCTATAACACCTATGCCTCTGCCCTCTGGAACATTTGGTTGGAAAGG
AATGCCCGTATCTTCAATGGGAAAGAAAAAACAGTTGCAGAAATTTGGGAAGATATAAAAGCTCTTGCAGGACTATGGACCAGTAGATCTTCTCTTTTCTCAAAT
TATCAAGCTTCTTCCATAGCACTAAACCTTAATGCTTTTAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGATCAGTTGGGGCCCTTCTCCCTTTAAGCTTATAAATGTACATCTGAAAGAATCATGGTTCAAAAAAAATGTCACATATTGGTGGAAAAATTTGAGACAGGAG
GGGCACCCAGGCTTTTCTTTTATGAGAAAGCTAAAGCAACTATCTGCCATTATAAGAAATGAGCAAAGAAAGAACAAATGCTACAGTGATGAAGATAAAAATGCT
TGGATAAAAGAAATCGACAACATAGACAGATTAGAAGCTGAAGGAAACTTATCTGAAGAGCTTAGCCTCCGTAGGACTAGATTAAAAGCTGATGTCCTTATGTCC
GGTTTCAAAGAAGCTCAAATATGGTACCAAAAAAGCAAGAGATTGTGGATCACTGAAGGAGATGAAAATACATCTTTCTTTCACAAAATCTGCTCTGCCAGACAA
AGAAGAAGCATTATATCAAATATTAACTTCGTCGATGGTGTTCCTTGTTCAACAAATGAGAGCATTGCAAAAGCCTTCTTAGATCACTTTGAAGATATTTACAAA
GGGGGTGGAGAGGAAAGCCCTTGGCTTATTGATAATCTTAGTTGGTGTCCTATATCAACCACCCAAGCACAAAATTTATGTTCTTTGTTCACGGAGGAGGAAATT
CATGCAGCCCTTACTGCTTTCTCAAACAATAAAAGCCCGGGTCCAGATGGCTTTACCATGGAATTCTATAAATCAACTTGGTCTGACCTCAAGGAAGAAATTCTC
AACATATTCAGAGACTTCCACTCAAACTGTATCATTAACAAAGCAGTAAACATCACAAATATTGCTCTAATTGCCAAAAAGGAAAAGTGTGCGGAGCCTGCGGAT
TACAGACCTATAAGTTTAACGACCTCCATTTACAAACTTATTGCCAAAGTCATTGCGGAAAGACTAAAAGAAACTCTTCCATCCACAGTGGCAGAGAACCAAATG
GCCTTTGTAAAAGGCAGACAAATCATAGATGCTATCTTAGTTGCAAATGAAGCCATCGACTATTGGAGAGTTAAAAAAATTCAAGGCTTTGTTATAAAGCTGGAT
ATTGAAAAAGCATTTGACAAACTAAACTGGAGATTCATTGATTTCATGCTTATGAAAAAGGGGTACCCCTTCAGATGGAGGAGGTGGATAAGAGCTTGTATTAGC
AGTGTACAGCAAAATTCAACCTCCCGTGGCATAAGGCAAGGAGACCCTATTTCCCCTTTTATTTTTGTCCTAGCAATGGATTATATAAGCAGGCTGCTGAACTCT
GTGGGTGAAAAAATCAAAGGGGTGAAAATGGAGGGCAACATAAATCTGACACACTTACTCTTTGCAGATGATATTCTGCTTTTTGTAGAAGATGATGAGCACTCA
ATTCAAAATTTAAAGAATATCATCAACCTCTTCCAGCTTGCTTCGGGGCTGAGTATCAATCTAAATAAATCCACCATTTCTCCTATAAATGTTGACGCTGCAAGA
ACTGAACAGATAGCTTCACAATGGGGAATTTCTACAAAATTTCTTCCAATCAACTACCTTGGAGTTCCTCTCGGAGGTAAACAAATCACAAAGACTTTTTGGAAG
AACGTTGAAGAAAAGATAAACAAAAAACTTGCCAGCTGGAAATATTCTATGTTATCTAAAGGAGGTAAAATTACTCTGATTAAATCTTCTTTGGCTAGCCTTCCT
ACTTATCAGCTATCAATCTTCAAAGCCCCTGTATCAACCTGCAAAAACATTGAAAAAACCTGGAGAAATTTCTTATGGAAGAACCCACCAGAGACCCATAAACTA
CACCTAGTTAATTGGGCGAAGATTACTTCTCCAAAAGAGAGAGGAGGGCTGGGCATAAGTCGACTGAAAGACACAAACTTTGCTCTTCTAACAAAATGGCTCTGG
AGATACATCCATGAAGATTCCCCCCTATGGAAGAAAATTATAAATGCAAAATATAGAAGCCTATCCAAAGGGGACATTCCTTGTGTTTGCAATCATAGCAGCAGC
CGTTCCCCATGGTTTTCCATTTGCAAAGGCTTGGAGTGCTTCTCCTTTTGGCATAGTCACTGGCATCAAAATAGTCCTCTGTCTTTTCACTACCCCAGATTATTT
GCTCTATCTACAAATAAGGACAGCTCCATAAGAGACATGTGGAATAATACCTTGATGGATTGGGATCTAAATCCAAGAAGACAGTTAAGAGAGTGGGAATATCCT
CTGTGGGCTGAGTTAAAAAACTCTCTAAATGCAAGCTTTTGCGAAAATGGAAAAGACTCTCCAACGTGGACCCTAAACTCTAATGGCTTATACACAGTGGCCTCG
GTTAAAAAAGCTCTCCAACAGCCTGATCAAAGCCTCTTAGACCTCCAAAGCCAAAACACTTTCAAGAATCTTTGGAAGACAAGCATCCCAAAGAAATGCATCTTT
TTCATATGGACTCTGCTCTATGATAGTGTGAACACTGCTGAACAACTTATGAAGAGATTACCAAATCTCTGTTCTAGACCGAGCTGGTGTGTCATGTGTAAGAGG
AATGACAAAGACAGAATACACCTCTTTATCCTATGCCCCATTGCAAAGTCTATTTGGAGATTAATATCATCACACTTAAACAGCAATGTAAACTGCCTCAGTCCA
AAGGATCTTTGTATTACCATGTGCAGCTGGAAACAGAAAACAAAGAAAAATGTCATTCTCTATAACACCTATGCCTCTGCCCTCTGGAACATTTGGTTGGAAAGG
AATGCCCGTATCTTCAATGGGAAAGAAAAAACAGTTGCAGAAATTTGGGAAGATATAAAAGCTCTTGCAGGACTATGGACCAGTAGATCTTCTCTTTTCTCAAAT
TATCAAGCTTCTTCCATAGCACTAAACCTTAATGCTTTTAGTTAG
Protein sequenceShow/hide protein sequence
MISWGPSPFKLINVHLKESWFKKNVTYWWKNLRQEGHPGFSFMRKLKQLSAIIRNEQRKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKADVLMS
GFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINFVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWCPISTTQAQNLCSLFTEEEI
HAALTAFSNNKSPGPDGFTMEFYKSTWSDLKEEILNIFRDFHSNCIINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQM
AFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQQNSTSRGIRQGDPISPFIFVLAMDYISRLLNS
VGEKIKGVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVDAARTEQIASQWGISTKFLPINYLGVPLGGKQITKTFWK
NVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPPETHKLHLVNWAKITSPKERGGLGISRLKDTNFALLTKWLW
RYIHEDSPLWKKIINAKYRSLSKGDIPCVCNHSSSRSPWFSICKGLECFSFWHSHWHQNSPLSFHYPRLFALSTNKDSSIRDMWNNTLMDWDLNPRRQLREWEYP
LWAELKNSLNASFCENGKDSPTWTLNSNGLYTVASVKKALQQPDQSLLDLQSQNTFKNLWKTSIPKKCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKR
NDKDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKKNVILYNTYASALWNIWLERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSN
YQASSIALNLNAFS