CuGenDBv2

Pay0015640 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0015640
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr12:23978388..23981569
RNA-Seq Expression: Pay0015640
Synteny: Pay0015640
Gene Ontology terms: NA
InterPro domains:
    IPR000477 - Reverse transcriptase domain
    IPR026960 - Reverse transcriptase zinc-binding domain
    IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
    IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits    e value    % identity    Alignment
KAA0039770.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]    0.0e+00    92.96
Query:  KFISDSNLIDPPLSNAKFTWSNLRVQPVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPIALESSMISWGPSPFKFINVHLKEPWFKNN--------
        KFI+DSNLIDPPLSNAKFTWSNLRV PVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPI LESS+ISWGPSPFK INVHLKEPWFKNN        
Subjt:  KFISDSNLIDPPLSNAKFTWSNLRVQPVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPIALESSMISWGPSPFKFINVHLKEPWFKNN--------

Query:  -------------LKQLSAIIRDEQKKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFF
                     LKQLS IIR+EQ+KNKCYSDEDKNAWIKEID+IDRLEAEGNLSEELSLRRTRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFF
Subjt:  -------------LKQLSAIIRDEQKKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFF

Query:  HKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEF
        HKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNL+WSPIST QAQNLCS+FTEEEIH ALTAFSNNKSPGPDGFTMEF
Subjt:  HKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEF

Query:  YKSTWSVLKEEIFNIFRDFHSNCVINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANE
        YKSTWSVLKEEI NIFRDFHSNC+INKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLK+TLP TVAENQMAFVKGRQIIDAILVANE
Subjt:  YKSTWSVLKEEIFNIFRDFHSNCVINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANE

Query:  AIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSV
        AIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPF+WR WIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDY+SRLLNSV
Subjt:  AIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSV

Query:  GEKIKRVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQWGISTKFLPINYLGVPLGGKQTTK
        GEKIK VK+EGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINV+A+RTEQIASQWGISTKFLPINYLGVPLGGKQ TK
Subjt:  GEKIKRVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQWGISTKFLPINYLGVPLGGKQTTK

Query:  SFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLGISRLKDTN
        +FWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNP ETHKLHLVNWAKITS KE+GGLGISRLKDTN
Subjt:  SFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLGISRLKDTN

Query:  FALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPWFSICKGLDWFQRHVSWKIKNGRSFSFWHSHWHQNSPLSFHYPRLFALSTNKD
        FALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDI CV NHSSSRSPWFSICKGL+WFQRHVSWKIKNGRSFSFWHSHWHQNSPLS HYPRL+ALSTNK+
Subjt:  FALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPWFSICKGLDWFQRHVSWKIKNGRSFSFWHSHWHQNSPLSFHYPRLFALSTNKD

Query:  SSIRDMWNNTLMDWDLNPRRQLRDWEYPLWAELKNSLNASFCENGKYSPTWALNSDGFYTVASVKKALQQPDQSFLDLQSQNTFKNLWKTSIPKKCIFFI
        SSIRDMWNNTLMDWDLNPRRQLR+WE+PLWAELKNSLNASFCENG  SP W LNS+G YTVASVKK LQQP+Q+ LD QSQNTFKNLWKTSIPKKCIFFI
Subjt:  SSIRDMWNNTLMDWDLNPRRQLRDWEYPLWAELKNSLNASFCENGKYSPTWALNSDGFYTVASVKKALQQPDQSFLDLQSQNTFKNLWKTSIPKKCIFFI

Query:  WTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKKNVILYNTYASALWNIW
        WTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWR ISSHL+SNVNCLSPK+LCITMCSWKQKTKKNVIL+NTYASALWNIW
Subjt:  WTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKKNVILYNTYASALWNIW

Query:  LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNGF
        LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLN F
Subjt:  LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNGF

TYK06777.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]    0.0e+00    89.53
Query:  KFISDSNLIDPPLSNAKFTWSNLRVQPVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPIALESSMISWGPSPFKFINVHLKEPWFKNN--------
        KFI+DSNLIDPPLSNAKFTWSNLRV PVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPI LESS+ISWGPSPFK INVHLKEPWFKNN        
Subjt:  KFISDSNLIDPPLSNAKFTWSNLRVQPVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPIALESSMISWGPSPFKFINVHLKEPWFKNN--------

Query:  -------------LKQLSAIIRDEQKKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFF
                     LKQLS IIR+EQ+KNKCYSDEDKNAWIKEID+IDRLEAEGNLSEELSLRRTRLKADVL SGFKEAQIWYQKSKRLWITEGDENTSFF
Subjt:  -------------LKQLSAIIRDEQKKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFF

Query:  HKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEF
        HKICSARQRRSIISNINS DGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNL+WSPIST QAQ LCS+FTEEEIH ALTAFS+NKSP         
Subjt:  HKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEF

Query:  YKSTWSVLKEEIFNIFRDFHSNCVINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANE
                            S   ++  +NITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLK+TLP TVAENQMAFVK RQIIDAILVANE
Subjt:  YKSTWSVLKEEIFNIFRDFHSNCVINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANE

Query:  AIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSV
        AIDYWR KKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPF+WR WIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSV
Subjt:  AIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSV

Query:  GEKIKRVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQWGISTKFLPINYLGVPLGGKQTTK
        GEKIK VK+EGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINV+A+RTEQIASQWGISTKFLPINYLGVPLGGKQTTK
Subjt:  GEKIKRVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQWGISTKFLPINYLGVPLGGKQTTK

Query:  SFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLGISRLKDTN
        +FWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNP ETHKLHLVNWAKITS KE+GGLGISRLKDTN
Subjt:  SFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLGISRLKDTN

Query:  FALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPWFSICKGLDWFQRHVSWKIKNGRSFSFWHSHWHQNSPLSFHYPRLFALSTNKD
        FALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDI CV NHSSSRSPWFSICKGL+WFQRHVSWKIKNGRSFSFWH HWHQNSPLS HYPRL+ALSTNK+
Subjt:  FALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPWFSICKGLDWFQRHVSWKIKNGRSFSFWHSHWHQNSPLSFHYPRLFALSTNKD

Query:  SSIRDMWNNTLMDWDLNPRRQLRDWEYPLWAELKNSLNASFCENGKYSPTWALNSDGFYTVASVKKALQQPDQSFLDLQSQNTFKNLWKTSIPKKCIFFI
        SSIRDMWNNTLMDWDLNPRRQLR+WE PLWAELKNS+NASFCENGK SP WALNS+G YTVASVKKALQQP+QS LDLQSQNTFKNLWKTSIPKKCIFFI
Subjt:  SSIRDMWNNTLMDWDLNPRRQLRDWEYPLWAELKNSLNASFCENGKYSPTWALNSDGFYTVASVKKALQQPDQSFLDLQSQNTFKNLWKTSIPKKCIFFI

Query:  WTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKKNVILYNTYASALWNIW
        WTLLYDSVNTAEQLMKRLP LCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWR ISSHLNSNVNCLSPK+LCITMCSWKQKTKKNVIL+NTYASALWNIW
Subjt:  WTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKKNVILYNTYASALWNIW

Query:  LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNGF
        LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLN F
Subjt:  LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNGF

TYK10356.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]    0.0e+00    71.74
Query:  KFISDSNLIDPPLSNAKFTWSNLRVQPVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPIALESSMISWGPSPFKFINVHLKEPWFKNN--------
        KFISDSNLIDPPLSNAK+TWSNL+ QP+LSRIDRFLYT +WENLFTAHYSKTLSRVTSDHFPI LESS ISWGP PFK INVHLKEPWFKNN        
Subjt:  KFISDSNLIDPPLSNAKFTWSNLRVQPVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPIALESSMISWGPSPFKFINVHLKEPWFKNN--------

Query:  -------------LKQLSAIIRDEQKKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFF
                     LK LS IIRDEQKKN  ++DE K AW+KE+DNIDRLEAEGNL EELSLRRT+ KAD+L+S FK AQIWYQKSKRLW TEGDENTSFF
Subjt:  -------------LKQLSAIIRDEQKKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFF

Query:  HKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEF
        HKICSARQRRSIISNINS DGVPC+TNE+IAK FLDHFE IY GGG E+PWLI+NLSWSPIST+QAQNLCS F+EEEIH+ALTAFSNNKSPGPDGFTMEF
Subjt:  HKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEF

Query:  YKSTWSVLKEEIFNIFRDFHSNCVINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANE
        +K+ W VLK++IFNIFRDFH+NC+INKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANE
Subjt:  YKSTWSVLKEEIFNIFRDFHSNCVINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANE

Query:  AIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSV
        AIDYWRVKKIQGF                         GYP +WRRWI+ACISSVQYSIIINGRPR                                  
Subjt:  AIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSV

Query:  GEKIKRVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQWGISTKFLPINYLGVPLGGKQTTK
                                    EDDEHS+QNLKNIINLFQLASGL+INLNKSTISPIN++AART+QIASQWGI+TKF PINYLGVPLGGK TTK
Subjt:  GEKIKRVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQWGISTKFLPINYLGVPLGGKQTTK

Query:  SFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLGISRLKDTN
        +FWKN++EKI+KKLASWKYSMLSKGGKITLIKS+LASLPTYQLSIFKAPVST K+IEK+WRNF WKN SETHKLHL                        
Subjt:  SFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLGISRLKDTN

Query:  FALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPWFSICKGLDWFQRHVSWKIKNGRSFSFWHSHWHQNSPLSFHYPRLFALSTNKD
                                                                     VSWKIKNGR+FSFWHSHWHQNSPLS HYPRLFALSTN+D
Subjt:  FALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPWFSICKGLDWFQRHVSWKIKNGRSFSFWHSHWHQNSPLSFHYPRLFALSTNKD

Query:  SSIRDMWNNTLMDWDLNPRRQLRDWEYPLWAELKNSLNASFCENGKYSPTWALNSDGFYTVASVKKALQQPDQSFLDLQSQNTFKNLWKTSIPKKCIFFI
        +SI+DMWN TLMDWDL PRRQLRDWE+PLWAELKNSLNASFCENG+ SPTW LNSDGFY+VASVKKAL QPDQ  L LQ+QNTFKNLWK+SIPKKC FFI
Subjt:  SSIRDMWNNTLMDWDLNPRRQLRDWEYPLWAELKNSLNASFCENGKYSPTWALNSDGFYTVASVKKALQQPDQSFLDLQSQNTFKNLWKTSIPKKCIFFI

Query:  WTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKKNVILYNTYASALWNIW
        WTLLYDSVNTA+QL KR+PNLCSRPSWCVMCKRN+EDRIHLFILCPIAKSIW LISSHL SNVNCLSPKDLCITMCSWKQKTKKN+IL+NTYASALWNIW
Subjt:  WTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKKNVILYNTYASALWNIW

Query:  LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNGF
        LERNARIFNGKEKTVA++WEDIKALAGLWTSR  LF+NYQA+SIALNLN F
Subjt:  LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNGF

TYK24536.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]    0.0e+00    72.88
Query:  KFISDSNLIDPPLSNAKFTWSNLRVQPVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPIALESSMISWGPSPFKFINVHLKEPWFKNN--------
        KFISDSNLIDPPLSNAKFTWSNLRVQPVLSRIDRFLYT NWENLFTAHYSKTLSRVTSDHFPIALESSMISWGPSPFKFINVHLKEPWFK N        
Subjt:  KFISDSNLIDPPLSNAKFTWSNLRVQPVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPIALESSMISWGPSPFKFINVHLKEPWFKNN--------

Query:  -------------LKQLSAIIRDEQKKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFF
                     LKQLSAIIRDEQKKNKCY+DE+K AWIKEIDNIDRLEAEGN SEELSLRRT+LKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFF
Subjt:  -------------LKQLSAIIRDEQKKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFF

Query:  HKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEF
        HKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEF
Subjt:  HKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEF

Query:  YKSTWSVLKEEIFNIFRDFHSNCVINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANE
        YKSTWSVLKEEIFNIFRDFHSNC+INKAVN+TNIALIAKKEKCAEPADYRPI                         +AENQMAFVKGRQIIDAILVANE
Subjt:  YKSTWSVLKEEIFNIFRDFHSNCVINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANE

Query:  AIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSV
        AIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWR+WIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSV
Subjt:  AIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSV

Query:  GEKIKRVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQWGISTKFLPINYLGVPLGGKQTTK
        GEKIK VKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINV AARTEQIASQWGISTKFLPINYLGVPLGGKQTTK
Subjt:  GEKIKRVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQWGISTKFLPINYLGVPLGGKQTTK

Query:  SFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLGISRLKDTN
        SFWKNVEEKINKKL SWKYSMLSKG                                           ++ H                            
Subjt:  SFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLGISRLKDTN

Query:  FALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPWFSICKGLDWFQRHVSWKIKNGRSFSFWHSHWHQNSPLSFHYPRLFALSTNKD
                       SP  K                                   DW                               +  +F   +  D
Subjt:  FALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPWFSICKGLDWFQRHVSWKIKNGRSFSFWHSHWHQNSPLSFHYPRLFALSTNKD

Query:  SSIRDMWNNTLMDWDLNPRRQLRDWEYPLWAELKNSLNASFCENGKYSPTWALNSDGFYTVASVKKALQQPDQSFLDLQSQNTFKNLWKTSIPKKCIFFI
        SSIRDMWNNTLMDWDLNPRRQ+RDWEYPLWAELKNSLN SFCENGK SPTWALNSDGFYTVASVKKALQQPDQSFLDLQSQNTFKNLWKTSIPKKCIFFI
Subjt:  SSIRDMWNNTLMDWDLNPRRQLRDWEYPLWAELKNSLNASFCENGKYSPTWALNSDGFYTVASVKKALQQPDQSFLDLQSQNTFKNLWKTSIPKKCIFFI

Query:  WTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKKNVILYNTYASALWNIW
        WTLLYDSVNTAEQL KR+PNLCSRPS                                                   WKQKTKKNVILYNTYASALWNIW
Subjt:  WTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKKNVILYNTYASALWNIW

Query:  LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNGF
        LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLF+NYQASSIALNLN F
Subjt:  LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNGF

XP_016902461.1 PREDICTED: LINE-1 retrotransposable element ORF2 protein [Cucumis melo]    0.0e+00    92.86
Query:  KFISDSNLIDPPLSNAKFTWSNLRVQPVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPIALESSMISWGPSPFKFINVHLKEPWFKNN--------
        KFI+DSNLIDPPLSNAKFTWSNLRV PVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPI LESS+ISWGPSPFK INVHLKEPWFKNN        
Subjt:  KFISDSNLIDPPLSNAKFTWSNLRVQPVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPIALESSMISWGPSPFKFINVHLKEPWFKNN--------

Query:  -------------LKQLSAIIRDEQKKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFF
                     LKQLS IIR+EQ+KNKCYSDEDKNAWIKEID+IDRLEAEGNLSEELSLRRTRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFF
Subjt:  -------------LKQLSAIIRDEQKKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFF

Query:  HKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEF
        HKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNL+WSPIST QAQNLCS+FTEEEIH ALTAFSNNKSPGPDGFTMEF
Subjt:  HKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEF

Query:  YKSTWSVLKEEIFNIFRDFHSNCVINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANE
        YKSTWSVLKEEI NIFRDFHSNC+INKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLK+TLP TVAENQMAFVKGRQIIDAILVANE
Subjt:  YKSTWSVLKEEIFNIFRDFHSNCVINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANE

Query:  AIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSV
        AIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPF+WR WIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDY+SRLLNSV
Subjt:  AIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSV

Query:  GEKIKRVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQWGISTKFLPINYLGVPLGGKQTTK
        GEKIK VK+EGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINV+A+RTEQIASQWGISTKFLPINYLGVPLGGKQ TK
Subjt:  GEKIKRVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQWGISTKFLPINYLGVPLGGKQTTK

Query:  SFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLGISRLKDTN
        +FWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFK PVSTCKNIEKTWRNFLWKNP ETHKLHLVNWAKITS KE+GGLGISRLKDTN
Subjt:  SFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLGISRLKDTN

Query:  FALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPWFSICKGLDWFQRHVSWKIKNGRSFSFWHSHWHQNSPLSFHYPRLFALSTNKD
        FALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDI CV NHSSSRSPWFSICKGL+WFQRHVSWKIKNGRSFSFWHSHWHQNSPLS HYPRL+ALSTNK+
Subjt:  FALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPWFSICKGLDWFQRHVSWKIKNGRSFSFWHSHWHQNSPLSFHYPRLFALSTNKD

Query:  SSIRDMWNNTLMDWDLNPRRQLRDWEYPLWAELKNSLNASFCENGKYSPTWALNSDGFYTVASVKKALQQPDQSFLDLQSQNTFKNLWKTSIPKKCIFFI
        SSIRDMWNNTLMDWDLNPRRQLR+WE+PLWAELKNSLNASFCENG  SP W LNS+G YTVASVKK LQQP+Q+ LD QSQNTFKNLWKTSIPKKCIFFI
Subjt:  SSIRDMWNNTLMDWDLNPRRQLRDWEYPLWAELKNSLNASFCENGKYSPTWALNSDGFYTVASVKKALQQPDQSFLDLQSQNTFKNLWKTSIPKKCIFFI

Query:  WTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKKNVILYNTYASALWNIW
        WTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWR ISSHL+SNVNCLSPK+LCITMCSWKQKTKKNVIL+NTYASALWNIW
Subjt:  WTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKKNVILYNTYASALWNIW

Query:  LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNGF
        LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLN F
Subjt:  LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNGF

TrEMBL top hits    e value    % identity    Alignment
A0A1S4E2K5 LINE-1 retrotransposable element ORF2 protein    0.0e+00    92.86
Query:  KFISDSNLIDPPLSNAKFTWSNLRVQPVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPIALESSMISWGPSPFKFINVHLKEPWFKNN--------
        KFI+DSNLIDPPLSNAKFTWSNLRV PVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPI LESS+ISWGPSPFK INVHLKEPWFKNN        
Subjt:  KFISDSNLIDPPLSNAKFTWSNLRVQPVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPIALESSMISWGPSPFKFINVHLKEPWFKNN--------

Query:  -------------LKQLSAIIRDEQKKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFF
                     LKQLS IIR+EQ+KNKCYSDEDKNAWIKEID+IDRLEAEGNLSEELSLRRTRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFF
Subjt:  -------------LKQLSAIIRDEQKKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFF

Query:  HKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEF
        HKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNL+WSPIST QAQNLCS+FTEEEIH ALTAFSNNKSPGPDGFTMEF
Subjt:  HKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEF

Query:  YKSTWSVLKEEIFNIFRDFHSNCVINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANE
        YKSTWSVLKEEI NIFRDFHSNC+INKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLK+TLP TVAENQMAFVKGRQIIDAILVANE
Subjt:  YKSTWSVLKEEIFNIFRDFHSNCVINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANE

Query:  AIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSV
        AIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPF+WR WIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDY+SRLLNSV
Subjt:  AIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSV

Query:  GEKIKRVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQWGISTKFLPINYLGVPLGGKQTTK
        GEKIK VK+EGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINV+A+RTEQIASQWGISTKFLPINYLGVPLGGKQ TK
Subjt:  GEKIKRVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQWGISTKFLPINYLGVPLGGKQTTK

Query:  SFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLGISRLKDTN
        +FWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFK PVSTCKNIEKTWRNFLWKNP ETHKLHLVNWAKITS KE+GGLGISRLKDTN
Subjt:  SFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLGISRLKDTN

Query:  FALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPWFSICKGLDWFQRHVSWKIKNGRSFSFWHSHWHQNSPLSFHYPRLFALSTNKD
        FALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDI CV NHSSSRSPWFSICKGL+WFQRHVSWKIKNGRSFSFWHSHWHQNSPLS HYPRL+ALSTNK+
Subjt:  FALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPWFSICKGLDWFQRHVSWKIKNGRSFSFWHSHWHQNSPLSFHYPRLFALSTNKD

Query:  SSIRDMWNNTLMDWDLNPRRQLRDWEYPLWAELKNSLNASFCENGKYSPTWALNSDGFYTVASVKKALQQPDQSFLDLQSQNTFKNLWKTSIPKKCIFFI
        SSIRDMWNNTLMDWDLNPRRQLR+WE+PLWAELKNSLNASFCENG  SP W LNS+G YTVASVKK LQQP+Q+ LD QSQNTFKNLWKTSIPKKCIFFI
Subjt:  SSIRDMWNNTLMDWDLNPRRQLRDWEYPLWAELKNSLNASFCENGKYSPTWALNSDGFYTVASVKKALQQPDQSFLDLQSQNTFKNLWKTSIPKKCIFFI

Query:  WTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKKNVILYNTYASALWNIW
        WTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWR ISSHL+SNVNCLSPK+LCITMCSWKQKTKKNVIL+NTYASALWNIW
Subjt:  WTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKKNVILYNTYASALWNIW

Query:  LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNGF
        LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLN F
Subjt:  LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNGF

A0A5D3C4J1 LINE-1 retrotransposable element ORF2 protein    0.0e+00    89.53
Query:  KFISDSNLIDPPLSNAKFTWSNLRVQPVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPIALESSMISWGPSPFKFINVHLKEPWFKNN--------
        KFI+DSNLIDPPLSNAKFTWSNLRV PVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPI LESS+ISWGPSPFK INVHLKEPWFKNN        
Subjt:  KFISDSNLIDPPLSNAKFTWSNLRVQPVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPIALESSMISWGPSPFKFINVHLKEPWFKNN--------

Query:  -------------LKQLSAIIRDEQKKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFF
                     LKQLS IIR+EQ+KNKCYSDEDKNAWIKEID+IDRLEAEGNLSEELSLRRTRLKADVL SGFKEAQIWYQKSKRLWITEGDENTSFF
Subjt:  -------------LKQLSAIIRDEQKKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFF

Query:  HKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEF
        HKICSARQRRSIISNINS DGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNL+WSPIST QAQ LCS+FTEEEIH ALTAFS+NKSP         
Subjt:  HKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEF

Query:  YKSTWSVLKEEIFNIFRDFHSNCVINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANE
                            S   ++  +NITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLK+TLP TVAENQMAFVK RQIIDAILVANE
Subjt:  YKSTWSVLKEEIFNIFRDFHSNCVINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANE

Query:  AIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSV
        AIDYWR KKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPF+WR WIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSV
Subjt:  AIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSV

Query:  GEKIKRVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQWGISTKFLPINYLGVPLGGKQTTK
        GEKIK VK+EGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINV+A+RTEQIASQWGISTKFLPINYLGVPLGGKQTTK
Subjt:  GEKIKRVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQWGISTKFLPINYLGVPLGGKQTTK

Query:  SFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLGISRLKDTN
        +FWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNP ETHKLHLVNWAKITS KE+GGLGISRLKDTN
Subjt:  SFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLGISRLKDTN

Query:  FALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPWFSICKGLDWFQRHVSWKIKNGRSFSFWHSHWHQNSPLSFHYPRLFALSTNKD
        FALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDI CV NHSSSRSPWFSICKGL+WFQRHVSWKIKNGRSFSFWH HWHQNSPLS HYPRL+ALSTNK+
Subjt:  FALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPWFSICKGLDWFQRHVSWKIKNGRSFSFWHSHWHQNSPLSFHYPRLFALSTNKD

Query:  SSIRDMWNNTLMDWDLNPRRQLRDWEYPLWAELKNSLNASFCENGKYSPTWALNSDGFYTVASVKKALQQPDQSFLDLQSQNTFKNLWKTSIPKKCIFFI
        SSIRDMWNNTLMDWDLNPRRQLR+WE PLWAELKNS+NASFCENGK SP WALNS+G YTVASVKKALQQP+QS LDLQSQNTFKNLWKTSIPKKCIFFI
Subjt:  SSIRDMWNNTLMDWDLNPRRQLRDWEYPLWAELKNSLNASFCENGKYSPTWALNSDGFYTVASVKKALQQPDQSFLDLQSQNTFKNLWKTSIPKKCIFFI

Query:  WTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKKNVILYNTYASALWNIW
        WTLLYDSVNTAEQLMKRLP LCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWR ISSHLNSNVNCLSPK+LCITMCSWKQKTKKNVIL+NTYASALWNIW
Subjt:  WTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKKNVILYNTYASALWNIW

Query:  LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNGF
        LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLN F
Subjt:  LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNGF

A0A5D3CJ08 LINE-1 retrotransposable element ORF2 protein    0.0e+00    71.74
Query:  KFISDSNLIDPPLSNAKFTWSNLRVQPVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPIALESSMISWGPSPFKFINVHLKEPWFKNN--------
        KFISDSNLIDPPLSNAK+TWSNL+ QP+LSRIDRFLYT +WENLFTAHYSKTLSRVTSDHFPI LESS ISWGP PFK INVHLKEPWFKNN        
Subjt:  KFISDSNLIDPPLSNAKFTWSNLRVQPVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPIALESSMISWGPSPFKFINVHLKEPWFKNN--------

Query:  -------------LKQLSAIIRDEQKKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFF
                     LK LS IIRDEQKKN  ++DE K AW+KE+DNIDRLEAEGNL EELSLRRT+ KAD+L+S FK AQIWYQKSKRLW TEGDENTSFF
Subjt:  -------------LKQLSAIIRDEQKKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFF

Query:  HKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEF
        HKICSARQRRSIISNINS DGVPC+TNE+IAK FLDHFE IY GGG E+PWLI+NLSWSPIST+QAQNLCS F+EEEIH+ALTAFSNNKSPGPDGFTMEF
Subjt:  HKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEF

Query:  YKSTWSVLKEEIFNIFRDFHSNCVINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANE
        +K+ W VLK++IFNIFRDFH+NC+INKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANE
Subjt:  YKSTWSVLKEEIFNIFRDFHSNCVINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANE

Query:  AIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSV
        AIDYWRVKKIQGF                         GYP +WRRWI+ACISSVQYSIIINGRPR                                  
Subjt:  AIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSV

Query:  GEKIKRVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQWGISTKFLPINYLGVPLGGKQTTK
                                    EDDEHS+QNLKNIINLFQLASGL+INLNKSTISPIN++AART+QIASQWGI+TKF PINYLGVPLGGK TTK
Subjt:  GEKIKRVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQWGISTKFLPINYLGVPLGGKQTTK

Query:  SFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLGISRLKDTN
        +FWKN++EKI+KKLASWKYSMLSKGGKITLIKS+LASLPTYQLSIFKAPVST K+IEK+WRNF WKN SETHKLHL                        
Subjt:  SFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLGISRLKDTN

Query:  FALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPWFSICKGLDWFQRHVSWKIKNGRSFSFWHSHWHQNSPLSFHYPRLFALSTNKD
                                                                     VSWKIKNGR+FSFWHSHWHQNSPLS HYPRLFALSTN+D
Subjt:  FALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPWFSICKGLDWFQRHVSWKIKNGRSFSFWHSHWHQNSPLSFHYPRLFALSTNKD

Query:  SSIRDMWNNTLMDWDLNPRRQLRDWEYPLWAELKNSLNASFCENGKYSPTWALNSDGFYTVASVKKALQQPDQSFLDLQSQNTFKNLWKTSIPKKCIFFI
        +SI+DMWN TLMDWDL PRRQLRDWE+PLWAELKNSLNASFCENG+ SPTW LNSDGFY+VASVKKAL QPDQ  L LQ+QNTFKNLWK+SIPKKC FFI
Subjt:  SSIRDMWNNTLMDWDLNPRRQLRDWEYPLWAELKNSLNASFCENGKYSPTWALNSDGFYTVASVKKALQQPDQSFLDLQSQNTFKNLWKTSIPKKCIFFI

Query:  WTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKKNVILYNTYASALWNIW
        WTLLYDSVNTA+QL KR+PNLCSRPSWCVMCKRN+EDRIHLFILCPIAKSIW LISSHL SNVNCLSPKDLCITMCSWKQKTKKN+IL+NTYASALWNIW
Subjt:  WTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKKNVILYNTYASALWNIW

Query:  LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNGF
        LERNARIFNGKEKTVA++WEDIKALAGLWTSR  LF+NYQA+SIALNLN F
Subjt:  LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNGF

A0A5D3DLM2 LINE-1 retrotransposable element ORF2 protein    0.0e+00    72.88
Query:  KFISDSNLIDPPLSNAKFTWSNLRVQPVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPIALESSMISWGPSPFKFINVHLKEPWFKNN--------
        KFISDSNLIDPPLSNAKFTWSNLRVQPVLSRIDRFLYT NWENLFTAHYSKTLSRVTSDHFPIALESSMISWGPSPFKFINVHLKEPWFK N        
Subjt:  KFISDSNLIDPPLSNAKFTWSNLRVQPVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPIALESSMISWGPSPFKFINVHLKEPWFKNN--------

Query:  -------------LKQLSAIIRDEQKKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFF
                     LKQLSAIIRDEQKKNKCY+DE+K AWIKEIDNIDRLEAEGN SEELSLRRT+LKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFF
Subjt:  -------------LKQLSAIIRDEQKKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFF

Query:  HKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEF
        HKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEF
Subjt:  HKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEF

Query:  YKSTWSVLKEEIFNIFRDFHSNCVINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANE
        YKSTWSVLKEEIFNIFRDFHSNC+INKAVN+TNIALIAKKEKCAEPADYRPI                         +AENQMAFVKGRQIIDAILVANE
Subjt:  YKSTWSVLKEEIFNIFRDFHSNCVINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANE

Query:  AIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSV
        AIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWR+WIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSV
Subjt:  AIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSV

Query:  GEKIKRVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQWGISTKFLPINYLGVPLGGKQTTK
        GEKIK VKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINV AARTEQIASQWGISTKFLPINYLGVPLGGKQTTK
Subjt:  GEKIKRVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQWGISTKFLPINYLGVPLGGKQTTK

Query:  SFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLGISRLKDTN
        SFWKNVEEKINKKL SWKYSMLSKG                                           ++ H                            
Subjt:  SFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLGISRLKDTN

Query:  FALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPWFSICKGLDWFQRHVSWKIKNGRSFSFWHSHWHQNSPLSFHYPRLFALSTNKD
                       SP  K                                   DW                               +  +F   +  D
Subjt:  FALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPWFSICKGLDWFQRHVSWKIKNGRSFSFWHSHWHQNSPLSFHYPRLFALSTNKD

Query:  SSIRDMWNNTLMDWDLNPRRQLRDWEYPLWAELKNSLNASFCENGKYSPTWALNSDGFYTVASVKKALQQPDQSFLDLQSQNTFKNLWKTSIPKKCIFFI
        SSIRDMWNNTLMDWDLNPRRQ+RDWEYPLWAELKNSLN SFCENGK SPTWALNSDGFYTVASVKKALQQPDQSFLDLQSQNTFKNLWKTSIPKKCIFFI
Subjt:  SSIRDMWNNTLMDWDLNPRRQLRDWEYPLWAELKNSLNASFCENGKYSPTWALNSDGFYTVASVKKALQQPDQSFLDLQSQNTFKNLWKTSIPKKCIFFI

Query:  WTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKKNVILYNTYASALWNIW
        WTLLYDSVNTAEQL KR+PNLCSRPS                                                   WKQKTKKNVILYNTYASALWNIW
Subjt:  WTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKKNVILYNTYASALWNIW

Query:  LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNGF
        LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLF+NYQASSIALNLN F
Subjt:  LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNGF

A0A5D3DM72 LINE-1 retrotransposable element ORF2 protein | 0.0e+00 | 92.96
Query:  KFISDSNLIDPPLSNAKFTWSNLRVQPVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPIALESSMISWGPSPFKFINVHLKEPWFKNN--------
        KFI+DSNLIDPPLSNAKFTWSNLRV PVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPI LESS+ISWGPSPFK INVHLKEPWFKNN        
Subjt:  KFISDSNLIDPPLSNAKFTWSNLRVQPVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPIALESSMISWGPSPFKFINVHLKEPWFKNN--------

Query:  -------------LKQLSAIIRDEQKKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFF
                     LKQLS IIR+EQ+KNKCYSDEDKNAWIKEID+IDRLEAEGNLSEELSLRRTRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFF
Subjt:  -------------LKQLSAIIRDEQKKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFF

Query:  HKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEF
        HKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNL+WSPIST QAQNLCS+FTEEEIH ALTAFSNNKSPGPDGFTMEF
Subjt:  HKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEF

Query:  YKSTWSVLKEEIFNIFRDFHSNCVINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANE
        YKSTWSVLKEEI NIFRDFHSNC+INKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLK+TLP TVAENQMAFVKGRQIIDAILVANE
Subjt:  YKSTWSVLKEEIFNIFRDFHSNCVINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANE

Query:  AIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSV
        AIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPF+WR WIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDY+SRLLNSV
Subjt:  AIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSV

Query:  GEKIKRVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQWGISTKFLPINYLGVPLGGKQTTK
        GEKIK VK+EGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINV+A+RTEQIASQWGISTKFLPINYLGVPLGGKQ TK
Subjt:  GEKIKRVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQWGISTKFLPINYLGVPLGGKQTTK

Query:  SFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLGISRLKDTN
        +FWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNP ETHKLHLVNWAKITS KE+GGLGISRLKDTN
Subjt:  SFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLGISRLKDTN

Query:  FALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPWFSICKGLDWFQRHVSWKIKNGRSFSFWHSHWHQNSPLSFHYPRLFALSTNKD
        FALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDI CV NHSSSRSPWFSICKGL+WFQRHVSWKIKNGRSFSFWHSHWHQNSPLS HYPRL+ALSTNK+
Subjt:  FALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPWFSICKGLDWFQRHVSWKIKNGRSFSFWHSHWHQNSPLSFHYPRLFALSTNKD

Query:  SSIRDMWNNTLMDWDLNPRRQLRDWEYPLWAELKNSLNASFCENGKYSPTWALNSDGFYTVASVKKALQQPDQSFLDLQSQNTFKNLWKTSIPKKCIFFI
        SSIRDMWNNTLMDWDLNPRRQLR+WE+PLWAELKNSLNASFCENG  SP W LNS+G YTVASVKK LQQP+Q+ LD QSQNTFKNLWKTSIPKKCIFFI
Subjt:  SSIRDMWNNTLMDWDLNPRRQLRDWEYPLWAELKNSLNASFCENGKYSPTWALNSDGFYTVASVKKALQQPDQSFLDLQSQNTFKNLWKTSIPKKCIFFI

Query:  WTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKKNVILYNTYASALWNIW
        WTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWR ISSHL+SNVNCLSPK+LCITMCSWKQKTKKNVIL+NTYASALWNIW
Subjt:  WTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWRLISSHLNSNVNCLSPKDLCITMCSWKQKTKKNVILYNTYASALWNIW

Query:  LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNGF
        LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLN F
Subjt:  LERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNGF

SwissProt top hits | e value | % identity | Alignment
O00370 LINE-1 retrotransposable element ORF2 protein | 4.9e-42 | 24.67
Query:  IKEIDNIDRLEAEGNLSEELSLRRTRLK---ADVLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINSVDGVPCSTNESIAKAFLD
        +KE++  ++  ++ +  +E++  R  LK       +    E++ W+ +     I + D   +   ++   ++ ++ I  I +  G   +    I     +
Subjt:  IKEIDNIDRLEAEGNLSEELSLRRTRLK---ADVLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINSVDGVPCSTNESIAKAFLD

Query:  HFEDIYKG---GGEESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSVLKEEIFNIFRDFHSNCVINKAVNITN
        +++ +Y       EE    +D  +   ++  + ++L    T  EI A + +    KSPGPDGFT EFY+     L   +  +F+      ++  +    +
Subjt:  HFEDIYKG---GGEESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSVLKEEIFNIFRDFHSNCVINKAVNITN

Query:  IALIAKKEK-CAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYW-RVKKIQGFVIKLDIEKAFDKLNWRFI
        I LI K  +   +  ++RPISL     K++ K++A R+++ +   +  +Q+ F+ G Q    I  +   I +  R K     +I +D EKAFDK+   F+
Subjt:  IALIAKKEK-CAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYW-RVKKIQGFVIKLDIEKAFDKLNWRFI

Query:  DFMLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKRVKMEGNINLTHLLFADDILLFVEDD
           L K G    + + IRA       +II+NG+         G RQG P+SP +F + ++ ++R +    E IK +++ G   +   LFADD+++++E+ 
Subjt:  DFMLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKRVKMEGNINLTHLLFADDILLFVEDD

Query:  EHSIQNLKNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQWGISTKFLPINYLGVPLGG------KQTTKSFWKNVEEKINKKLASWKYSMLSKG
          S QNL  +I+ F   SG  IN+ KS     N       QI  +   +     I YLG+ L        K+  K   K ++E  NK    WK    S  
Subjt:  EHSIQNLKNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQWGISTKFLPINYLGVPLGG------KQTTKSFWKNVEEKINKKLASWKYSMLSKG

Query:  GKITLIKSSLASLPTYQLSI--FKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLGISRLKDTNFALLTK--WLWRYIHEDSPLWKK
        G+I ++K ++     Y+ +    K P++    +EKT   F+W       K   +  + ++   + GG+ +   K    A +TK  W W Y + D   W +
Subjt:  GKITLIKSSLASLPTYQLSI--FKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLGISRLKDTNFALLTK--WLWRYIHEDSPLWKK

P08548 LINE-1 reverse transcriptase homolog | 1.7e-39 | 25.04
Query:  NIDRLEAEGNLSEELSLRR--TRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIY
        ++ +LE E + + + S R+  T+++A++     K       KSK  +  + ++       +   ++ +S+IS+I + +    +    I K   ++++ +Y
Subjt:  NIDRLEAEGNLSEELSLRR--TRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIY

Query:  KGGGE---ESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSVLKEEIFNIFRDFHSNCVINKAVNITNIALIAK
            E   E    ++      +S  + + L    +  EI + +      KSPGPDGFT EFY++    L   + N+F++     ++       NI LI K
Subjt:  KGGGE---ESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSVLKEEIFNIFRDFHSNCVINKAVNITNIALIAK

Query:  KEK-CAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYW-RVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMK
          K      +YRPISL     K++ K++  R+++ +   +  +Q+ F+ G Q    I  +   I +  ++K     ++ +D EKAFD +   F+   L K
Subjt:  KEK-CAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYW-RVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMK

Query:  KGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKRVKMEGNINLTHLLFADDILLFVEDDEHSIQN
         G    + + I A  S    +II+NG          G RQG P+SP +F + M+ ++  +    EK  +    G+  +   LFADD+++++E+   S   
Subjt:  KGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKRVKMEGNINLTHLLFADDILLFVEDDEHSIQN

Query:  LKNIINLFQLASGLSINLNKST--ISPINVEAARTEQIASQWGISTKFLPINYLGVPLGGKQTTKSFWKNVEEKINKKLA----SWKYSMLSKGGKITLI
        L  +I  +   SG  IN +KS   I   N +A +T + +  + +  K   + YLGV L   +  K  +K   E + K++A     WK    S  G+I ++
Subjt:  LKNIINLFQLASGLSINLNKST--ISPINVEAARTEQIASQWGISTKFLPINYLGVPLGGKQTTKSFWKNVEEKINKKLA----SWKYSMLSKGGKITLI

Query:  KSSLASLPTYQLSI--FKAPVSTCKNIEKTWRNFLW--KNPSETHKLHLVNWAKITSPKERGGLGIS--RLKDTNFALLTKWLWRYIHEDSPLWKKIIN
        K S+     Y  +    KAP+S  K++EK   +F+W  K P     L       +++  + GG+ +   RL   +  + T W W + + +  +W +I N
Subjt:  KSSLASLPTYQLSI--FKAPVSTCKNIEKTWRNFLW--KNPSETHKLHLVNWAKITSPKERGGLGIS--RLKDTNFALLTKWLWRYIHEDSPLWKKIIN

P0C2F6 Putative ribonuclease H protein At1g65750 | 3.4e-35 | 28.12
Query:  VPLGGKQTTKSFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGG
        +P+  K+  K  +  + E+++ +++ W+   LS  G++TL K+ L+S+P + +S    P S    +++  R FLW + +E  K HLV W+K+ SPK+ GG
Subjt:  VPLGGKQTTKSFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGG

Query:  LGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPWFSICKGL-DWFQRHVSWKIKNGRSFSFWHSHWHQNSPL----
        LG+   K  N AL++K  WR + E + LW  ++  KY      D   +    S  S W SI  GL D     V W   +G+   FW   W    PL    
Subjt:  LGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPWFSICKGL-DWFQRHVSWKIKNGRSFSFWHSHWHQNSPL----

Query:  SFHYPRLFALSTNKDSSI-RDMWNNTLMDWDL---------NPRRQLRDWEYPLWAELKNSLNASFCENGKYSPTWALNSDGFYTVASVKKALQQPDQSF
        +   P      T+ D+ + +D+W      WD          N R +LR     L    ++ L+  F ++G++S   A      Y + +V +  +    SF
Subjt:  SFHYPRLFALSTNKDSSI-RDMWNNTLMDWDL---------NPRRQLRDWEYPLWAELKNSLNASFCENGKYSPTWALNSDGFYTVASVKKALQQPDQSF

Query:  LDLQSQNTFKNLWKTSIPKKCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIW
                F  LWK  +P++   F+W +   +V T E+  +R     S  + C +CK   E  +H+   CP    IW
Subjt:  LDLQSQNTFKNLWKTSIPKKCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIW

P11369 LINE-1 retrotransposable element ORF2 protein | 4.5e-40 | 25.09
Query:  KICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGE---ESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTM
        ++    + + +I+ I +  G   +  E I       ++ +Y    E   E    +D      ++  Q  +L S  + +EI A + +    KSPGPDGF+ 
Subjt:  KICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGE---ESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTM

Query:  EFYKSTWSVLKEEIFNIFRDFHSNCVINKAVNITNIALIAKKEK-CAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILV
        EFY++    L   +  +F        +  +     I LI K +K   +  ++RPISL     K++ K++A R++E + + +  +Q+ F+ G Q    I  
Subjt:  EFYKSTWSVLKEEIFNIFRDFHSNCVINKAVNITNIALIAKKEK-CAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILV

Query:  ANEAIDYW-RVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRL
        +   I Y  ++K     +I LD EKAFDK+   F+  +L + G    +   I+A  S    +I +NG     I    G RQG P+SP++F + ++ ++R 
Subjt:  ANEAIDYW-RVKKIQGFVIKLDIEKAFDKLNWRFIDFMLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRL

Query:  LNSVGEKIKRVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQWGISTKFLPINYLGVPLGG-
        +    E IK +++ G   +   L ADD+++++ D ++S + L N+IN F    G  IN NKS            ++I      S     I YLGV L   
Subjt:  LNSVGEKIKRVKMEGNINLTHLLFADDILLFVEDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQWGISTKFLPINYLGVPLGG-

Query:  -KQTTKSFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSI--FKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLG
         K      +K+++++I + L  WK    S  G+I ++K ++     Y+ +    K P      +E     F+W N     K   +  + +   +  GG+ 
Subjt:  -KQTTKSFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSI--FKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLG

Query:  ISRLKDTNFALL--TKWLWRYIHEDSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPW
        +  LK    A++  T W W Y       W +I + +    + G  H +++  +    W
Subjt:  ISRLKDTNFALL--TKWLWRYIHEDSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPW

P14381 Transposon TX1 uncharacterized 149 kDa protein | 8.9e-36 | 23.72
Query:  KNAWIKEIDNIDRLEAEGNL--SEELSLRRTRLKADVLMSGFKEAQI--WYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINSVDGVPCSTNESIA
        +NA I+ + N + L+ E  L  SE+ +L+   L+    +   ++ Q    + +S+   + + D  + FF+ +   +  R  I+ + + DG P    E+I 
Subjt:  KNAWIKEIDNIDRLEAEGNL--SEELSLRRTRLKADVLMSGFKEAQI--WYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINSVDGVPCSTNESIA

Query:  KAFLDHFEDIYKGGGEESPWLIDNLSWS---PISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSVLKEEIFNIFRDFHSNCVINKA
              +++++      SP   + L W     +S  + + L +  T +E+  AL    +NKSPG DG T+EF++  W  L  +   +  +      +  +
Subjt:  KAFLDHFEDIYKGGGEESPWLIDNLSWS---PISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSVLKEEIFNIFRDFHSNCVINKA

Query:  VNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNW
             ++L+ KK       ++RP+SL ++ YK++AK I+ RLK  L   +  +Q   V GR I D + +  + + + R   +    + LD EKAFD+++ 
Subjt:  VNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNW

Query:  RFIDFMLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKRVKMEGNINLTHLLFADDILLFV
        +++   L    +  ++  +++   +S +  + IN      +   RG+RQG P+S  ++ LA++    LL         V  E ++ +    +ADD++L V
Subjt:  RFIDFMLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKRVKMEGNINLTHLLFADDILLFV

Query:  EDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQW-GISTKFLPINYLGVPLGGKQ-TTKSFWKNVEEKINKKLASWK--YSMLSK
          D   ++  +    ++  AS   IN +KS  S +   + + + +   +  IS +   I YLGV L  ++      +  +EE +  +L  WK    +LS 
Subjt:  EDDEHSIQNLKNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQW-GISTKFLPINYLGVPLGGKQ-TTKSFWKNVEEKINKKLASWK--YSMLSK

Query:  GGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLGISRLKDTNFALLTKWLWRYIHED-SPLWKKII
         G+  +I   +AS   Y+L            I++   +FLW         H V+    + P + GG G+  ++        + + RY++ D SP W  + 
Subjt:  GGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLGISRLKDTNFALLTKWLWRYIHED-SPLWKKII

Query:  NAKYRSL
        ++ YR +
Subjt:  NAKYRSL

Arabidopsis top hits | e value | % identity | Alignment
AT1G43760.1 DNAse I-like superfamily protein | 2.1e-24 | 24.66
Query:  ISDSNLIDPPLSNAKFTWSNLR-VQPVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFP-IALESSMISWGPSPFKFINVHLKEPWFKNNL-------
        + DS+L+D P     +TWSN +   P++ ++DR +   +W + F +  +       SDH P I +  ++       F++ +     P F  +L       
Subjt:  ISDSNLIDPPLSNAKFTWSNLR-VQPVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFP-IALESSMISWGPSPFKFINVHLKEPWFKNNL-------

Query:  ----KQLSAIIRDEQKKNKCYSDEDKNAW------IKE-IDNIDRLEAEGNLSEELSLRRTRLKADVLMSGFKEA--QIWYQKSKRLWITEGDENTSFFH
              + ++    +   KC    ++  +       KE +D+++ ++++   +   SL R    A    + F  A    + QKS+  W+ +GD NT FFH
Subjt:  ----KQLSAIIRDEQKKNKCYSDEDKNAW------IKE-IDNIDRLEAEGNLSEELSLRRTRLKADVLMSGFKEA--QIWYQKSKRLWITEGDENTSFFH

Query:  KICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGE----ESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFT
        K+  A Q +++I  +   D V       + +  + ++  +     +    +S   I ++     + T A  L +L +++EI AA+ A   NK+PGPD FT
Subjt:  KICSARQRRSIISNINSVDGVPCSTNESIAKAFLDHFEDIYKGGGE----ESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFT

Query:  MEFYKSTWSVLKEEIFNIFRDFHSNCVINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLI
         EF+  +W V+K+      ++F     + K  N T I LI K     + + +RP+S  T +YK+I
Subjt:  MEFYKSTWSVLKEEIFNIFRDFHSNCVINKAVNITNIALIAKKEKCAEPADYRPISLTTSIYKLI

AT1G45063.1 copper ion binding;electron carriers | 2.0e-06 | 28.93
Query:  TLLYDSVNTAEQLMKRLPNLCSR--------PSWCVMCKRNDEDRIHLFILCPIAKSIWRLISSHLNSNVNCLSP---KDLCITMCSWKQKTKKNVILYN
        TL    ++  E+L +   ++C R        PS C++C   DE R H+F  CP +  +W    S   SN     P   KD    +    +  K   IL  
Subjt:  TLLYDSVNTAEQLMKRLPNLCSR--------PSWCVMCKRNDEDRIHLFILCPIAKSIWRLISSHLNSNVNCLSP---KDLCITMCSWKQKTKKNVILYN

Query:  TYASALWNIWLERNARIFNGK
         Y +++++IW ERN R+ + K
Subjt:  TYASALWNIWLERNARIFNGK

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 1.0e-23 | 25.47
Query:  LPINYLGVPLGGKQTTKSFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKIT
        LP+ YLG+PL  K+ T S +  + EKI  ++  W    LS  G++ LI S + SL  + +S F+ P +  K I+    +FLW  P    K   V W+ + 
Subjt:  LPINYLGVPLGGKQTTKSFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKIT

Query:  SPKERGGLGISRLKDTNFALLTKWLWRYIHE---DSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPWFSICKGLDWFQRHVSWKIKNGRSFSFWHSHWH
        +PK+ GGLGI  LK+ N        W         S +WKKI+  K+R+L+ G                        F +H    I NG + SFW  +W 
Subjt:  SPKERGGLGISRLKDTNFALLTKWLWRYIHE---DSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPWFSICKGLDWFQRHVSWKIKNGRSFSFWHSHWH

Query:  QNSPL--SFHYPRLFALSTNKDSSIRDMWNNTLMDWDLNPRRQLRDWEYPLWAELKNSLNASFCENGKYSPTWALNSDGFYTVASVKKALQQPDQSFLDL
        +   L     +     +     +S+ +   N        PRR   D    +  ++   +      +G+ +  W  N D F    + K+      +  L +
Subjt:  QNSPL--SFHYPRLFALSTNKDSSIRDMWNNTLMDWDLNPRRQLRDWEYPLWAELKNSLNASFCENGKYSPTWALNSDGFYTVASVKKALQQPDQSFLDL

Query:  QSQNTFKNLWKTSIPKKCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSI
           N +K +W +    K     W  + + + T +++   L       S CV+C    E R HLF  CP +  +
Subjt:  QSQNTFKNLWKTSIPKKCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSI

AT4G20520.1 RNA binding;RNA-directed DNA polymerases | 1.4e-07 | 34.57
Query:  IAERLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKK-IQGF-VIKLDIEKAFDKLNWRFIDFMLMKKGYPFRW
        + ERLK  + + +   Q +F+ GR   D I+   EA+   R KK ++G+ ++KLD+EKA+D++ W +++  L+  G+P  W
Subjt:  IAERLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKK-IQGF-VIKLDIEKAFDKLNWRFIDFMLMKKGYPFRW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) | 3.8e-10 | 44.78
Query:  IINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGE--KIKRVKMEGNI-NLTHLLFADD
        IING P+G + PSRG+RQGDP+SP++F+L  + +S L     E  ++  +++  N   + HLLFADD
Subjt:  IINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGE--KIKRVKMEGNI-NLTHLLFADD


Sequences
CDS sequence
ATGAATTTTTATCAAGCCCAGATCAAATTCATTTCCGACAGCAATCTTATTGATCCTCCCCTCTCAAATGCAAAATTTACTTGGTCTAATCTCAGAGTTCAACCA
GTTCTCTCCAGGATCGACAGATTCCTATATACAACAAATTGGGAAAATTTATTCACTGCCCACTATTCAAAGACCCTCTCGCGAGTTACTTCAGATCATTTCCCG
ATTGCACTGGAATCCTCCATGATTAGTTGGGGCCCTTCTCCCTTCAAGTTTATAAATGTACATCTGAAAGAACCATGGTTCAAAAATAATCTAAAGCAACTATCT
GCCATAATAAGAGATGAGCAAAAAAAGAACAAATGCTACAGTGATGAAGATAAGAATGCTTGGATAAAAGAAATCGACAACATAGACAGACTAGAAGCTGAAGGA
AACTTATCTGAAGAGCTTAGCCTCCGTAGGACTAGATTAAAAGCCGATGTCCTTATGTCCGGATTCAAAGAAGCTCAAATATGGTACCAAAAAAGCAAGAGATTG
TGGATCACTGAAGGAGATGAAAATACATCCTTCTTTCACAAAATCTGCTCTGCCAGACAAAGAAGGAGTATTATATCAAATATTAACTCTGTCGATGGTGTTCCT
TGTTCGACAAATGAGAGCATTGCAAAAGCCTTCTTAGATCACTTTGAAGATATTTATAAAGGGGGTGGAGAGGAAAGCCCTTGGCTTATTGATAATCTTAGTTGG
TCTCCTATATCAACCACCCAAGCACAAAATTTATGTTCTTTGTTCACGGAGGAGGAAATTCATGCAGCCCTTACTGCTTTCTCAAACAATAAAAGCCCGGGTCCA
GATGGCTTTACCATGGAATTCTACAAATCAACTTGGTCTGTCCTCAAGGAGGAAATTTTCAACATATTCAGAGACTTCCACTCGAACTGTGTCATTAACAAAGCA
GTAAACATTACAAATATTGCTCTAATTGCCAAAAAAGAGAAGTGTGCGGAGCCTGCGGATTACAGACCTATAAGTCTAACGACTTCCATTTACAAACTTATTGCC
AAAGTTATTGCGGAAAGACTAAAAGAAACTCTTCCCTCCACAGTGGCAGAGAATCAAATGGCCTTTGTAAAAGGCAGACAAATCATTGATGCTATCTTAGTTGCA
AATGAAGCCATCGACTATTGGAGAGTTAAAAAAATTCAAGGCTTTGTTATAAAGCTGGATATTGAAAAAGCATTTGACAAACTAAACTGGAGATTCATTGATTTC
ATGCTTATGAAAAAGGGTTACCCCTTCAGATGGAGGAGGTGGATAAGAGCTTGTATTAGTAGTGTTCAGTACTCTATTATCATCAACGGCAGACCTAGAGGAAAA
ATTCAACCTTCCCGTGGCATAAGGCAAGGAGACCCTATTTCCCCTTTTATTTTTGTCCTAGCAATGGATTATATAAGCAGGCTGCTGAACTCTGTGGGCGAAAAA
ATCAAAAGGGTGAAAATGGAGGGCAACATAAATCTGACACACTTACTCTTTGCAGATGATATTCTGCTTTTTGTAGAAGATGATGAGCACTCAATTCAAAATTTA
AAGAATATCATCAACCTCTTCCAGCTTGCATCGGGGCTGAGTATCAACCTAAATAAATCCACCATATCCCCTATAAATGTTGAAGCTGCAAGAACTGAACAGATA
GCTTCACAATGGGGAATTTCTACAAAATTTCTTCCAATCAACTACCTTGGAGTTCCTCTCGGAGGCAAACAAACCACAAAGTCTTTTTGGAAGAACGTCGAAGAA
AAGATAAACAAAAAACTTGCCAGCTGGAAATATTCTATGTTATCTAAAGGAGGTAAAATTACTCTGATTAAATCTTCTTTGGCTAGCCTTCCTACTTATCAACTA
TCAATCTTCAAAGCCCCTGTATCAACCTGCAAAAACATTGAAAAAACTTGGAGAAATTTCTTATGGAAGAACCCATCAGAGACCCACAAACTACACCTAGTTAAT
TGGGCGAAGATTACTTCTCCAAAAGAGAGAGGAGGGCTGGGCATAAGTCGACTGAAAGACACAAACTTTGCTCTTCTAACAAAATGGCTCTGGAGATACATCCAT
GAAGATTCCCCCCTATGGAAGAAAATTATAAATGCAAAATATAGAAGCCTATCCAAAGGGGACATTCATTGTGTTTACAATCATAGCAGCAGCCGTTCCCCATGG
TTCTCCATTTGCAAAGGATTGGATTGGTTTCAAAGACATGTTTCCTGGAAAATTAAAAATGGTAGAAGCTTCTCCTTTTGGCATAGCCACTGGCATCAAAATAGT
CCTCTGTCATTTCACTATCCCAGATTATTTGCTCTATCTACAAATAAGGACAGCTCCATAAGAGACATGTGGAATAATACCTTGATGGATTGGGATCTAAATCCA
AGAAGACAGTTAAGAGATTGGGAATATCCTCTGTGGGCTGAGTTAAAAAACTCTCTAAATGCTAGCTTTTGCGAAAATGGAAAATACTCTCCAACGTGGGCCCTA
AACTCTGATGGCTTTTACACTGTGGCCTCGGTTAAAAAAGCTCTCCAACAGCCTGATCAAAGCTTCTTAGACCTCCAAAGCCAAAACACTTTCAAGAATCTTTGG
AAGACAAGCATCCCAAAGAAATGCATCTTTTTCATATGGACTCTGCTCTATGATAGTGTGAACACTGCTGAACAACTTATGAAGAGATTGCCAAATCTCTGTTCT
AGACCGAGCTGGTGTGTAATGTGTAAGAGGAATGACGAAGACAGAATACACCTCTTTATCCTATGCCCCATTGCAAAGTCTATTTGGAGATTAATATCATCACAC
TTAAACAGCAATGTAAACTGCCTCAGTCCAAAGGATCTATGTATTACCATGTGCAGCTGGAAACAGAAAACAAAGAAGAATGTCATTCTCTACAACACCTATGCC
TCTGCCCTTTGGAACATTTGGTTGGAGAGGAATGCCCGTATCTTCAATGGGAAAGAAAAAACAGTTGCAGAAATTTGGGAAGATATAAAAGCTCTTGCAGGACTA
TGGACCAGTAGATCTTCTCTTTTTTCAAATTATCAAGCTTCTTCCATAGCACTAAACCTTAATGGATTTAGGTAG
mRNA sequence
ATGAATTTTTATCAAGCCCAGATCAAATTCATTTCCGACAGCAATCTTATTGATCCTCCCCTCTCAAATGCAAAATTTACTTGGTCTAATCTCAGAGTTCAACCA
GTTCTCTCCAGGATCGACAGATTCCTATATACAACAAATTGGGAAAATTTATTCACTGCCCACTATTCAAAGACCCTCTCGCGAGTTACTTCAGATCATTTCCCG
ATTGCACTGGAATCCTCCATGATTAGTTGGGGCCCTTCTCCCTTCAAGTTTATAAATGTACATCTGAAAGAACCATGGTTCAAAAATAATCTAAAGCAACTATCT
GCCATAATAAGAGATGAGCAAAAAAAGAACAAATGCTACAGTGATGAAGATAAGAATGCTTGGATAAAAGAAATCGACAACATAGACAGACTAGAAGCTGAAGGA
AACTTATCTGAAGAGCTTAGCCTCCGTAGGACTAGATTAAAAGCCGATGTCCTTATGTCCGGATTCAAAGAAGCTCAAATATGGTACCAAAAAAGCAAGAGATTG
TGGATCACTGAAGGAGATGAAAATACATCCTTCTTTCACAAAATCTGCTCTGCCAGACAAAGAAGGAGTATTATATCAAATATTAACTCTGTCGATGGTGTTCCT
TGTTCGACAAATGAGAGCATTGCAAAAGCCTTCTTAGATCACTTTGAAGATATTTATAAAGGGGGTGGAGAGGAAAGCCCTTGGCTTATTGATAATCTTAGTTGG
TCTCCTATATCAACCACCCAAGCACAAAATTTATGTTCTTTGTTCACGGAGGAGGAAATTCATGCAGCCCTTACTGCTTTCTCAAACAATAAAAGCCCGGGTCCA
GATGGCTTTACCATGGAATTCTACAAATCAACTTGGTCTGTCCTCAAGGAGGAAATTTTCAACATATTCAGAGACTTCCACTCGAACTGTGTCATTAACAAAGCA
GTAAACATTACAAATATTGCTCTAATTGCCAAAAAAGAGAAGTGTGCGGAGCCTGCGGATTACAGACCTATAAGTCTAACGACTTCCATTTACAAACTTATTGCC
AAAGTTATTGCGGAAAGACTAAAAGAAACTCTTCCCTCCACAGTGGCAGAGAATCAAATGGCCTTTGTAAAAGGCAGACAAATCATTGATGCTATCTTAGTTGCA
AATGAAGCCATCGACTATTGGAGAGTTAAAAAAATTCAAGGCTTTGTTATAAAGCTGGATATTGAAAAAGCATTTGACAAACTAAACTGGAGATTCATTGATTTC
ATGCTTATGAAAAAGGGTTACCCCTTCAGATGGAGGAGGTGGATAAGAGCTTGTATTAGTAGTGTTCAGTACTCTATTATCATCAACGGCAGACCTAGAGGAAAA
ATTCAACCTTCCCGTGGCATAAGGCAAGGAGACCCTATTTCCCCTTTTATTTTTGTCCTAGCAATGGATTATATAAGCAGGCTGCTGAACTCTGTGGGCGAAAAA
ATCAAAAGGGTGAAAATGGAGGGCAACATAAATCTGACACACTTACTCTTTGCAGATGATATTCTGCTTTTTGTAGAAGATGATGAGCACTCAATTCAAAATTTA
AAGAATATCATCAACCTCTTCCAGCTTGCATCGGGGCTGAGTATCAACCTAAATAAATCCACCATATCCCCTATAAATGTTGAAGCTGCAAGAACTGAACAGATA
GCTTCACAATGGGGAATTTCTACAAAATTTCTTCCAATCAACTACCTTGGAGTTCCTCTCGGAGGCAAACAAACCACAAAGTCTTTTTGGAAGAACGTCGAAGAA
AAGATAAACAAAAAACTTGCCAGCTGGAAATATTCTATGTTATCTAAAGGAGGTAAAATTACTCTGATTAAATCTTCTTTGGCTAGCCTTCCTACTTATCAACTA
TCAATCTTCAAAGCCCCTGTATCAACCTGCAAAAACATTGAAAAAACTTGGAGAAATTTCTTATGGAAGAACCCATCAGAGACCCACAAACTACACCTAGTTAAT
TGGGCGAAGATTACTTCTCCAAAAGAGAGAGGAGGGCTGGGCATAAGTCGACTGAAAGACACAAACTTTGCTCTTCTAACAAAATGGCTCTGGAGATACATCCAT
GAAGATTCCCCCCTATGGAAGAAAATTATAAATGCAAAATATAGAAGCCTATCCAAAGGGGACATTCATTGTGTTTACAATCATAGCAGCAGCCGTTCCCCATGG
TTCTCCATTTGCAAAGGATTGGATTGGTTTCAAAGACATGTTTCCTGGAAAATTAAAAATGGTAGAAGCTTCTCCTTTTGGCATAGCCACTGGCATCAAAATAGT
CCTCTGTCATTTCACTATCCCAGATTATTTGCTCTATCTACAAATAAGGACAGCTCCATAAGAGACATGTGGAATAATACCTTGATGGATTGGGATCTAAATCCA
AGAAGACAGTTAAGAGATTGGGAATATCCTCTGTGGGCTGAGTTAAAAAACTCTCTAAATGCTAGCTTTTGCGAAAATGGAAAATACTCTCCAACGTGGGCCCTA
AACTCTGATGGCTTTTACACTGTGGCCTCGGTTAAAAAAGCTCTCCAACAGCCTGATCAAAGCTTCTTAGACCTCCAAAGCCAAAACACTTTCAAGAATCTTTGG
AAGACAAGCATCCCAAAGAAATGCATCTTTTTCATATGGACTCTGCTCTATGATAGTGTGAACACTGCTGAACAACTTATGAAGAGATTGCCAAATCTCTGTTCT
AGACCGAGCTGGTGTGTAATGTGTAAGAGGAATGACGAAGACAGAATACACCTCTTTATCCTATGCCCCATTGCAAAGTCTATTTGGAGATTAATATCATCACAC
TTAAACAGCAATGTAAACTGCCTCAGTCCAAAGGATCTATGTATTACCATGTGCAGCTGGAAACAGAAAACAAAGAAGAATGTCATTCTCTACAACACCTATGCC
TCTGCCCTTTGGAACATTTGGTTGGAGAGGAATGCCCGTATCTTCAATGGGAAAGAAAAAACAGTTGCAGAAATTTGGGAAGATATAAAAGCTCTTGCAGGACTA
TGGACCAGTAGATCTTCTCTTTTTTCAAATTATCAAGCTTCTTCCATAGCACTAAACCTTAATGGATTTAGGTAG
Protein sequence
MNFYQAQIKFISDSNLIDPPLSNAKFTWSNLRVQPVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPIALESSMISWGPSPFKFINVHLKEPWFKNNLKQLS
AIIRDEQKKNKCYSDEDKNAWIKEIDNIDRLEAEGNLSEELSLRRTRLKADVLMSGFKEAQIWYQKSKRLWITEGDENTSFFHKICSARQRRSIISNINSVDGVP
CSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLSWSPISTTQAQNLCSLFTEEEIHAALTAFSNNKSPGPDGFTMEFYKSTWSVLKEEIFNIFRDFHSNCVINKA
VNITNIALIAKKEKCAEPADYRPISLTTSIYKLIAKVIAERLKETLPSTVAENQMAFVKGRQIIDAILVANEAIDYWRVKKIQGFVIKLDIEKAFDKLNWRFIDF
MLMKKGYPFRWRRWIRACISSVQYSIIINGRPRGKIQPSRGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKRVKMEGNINLTHLLFADDILLFVEDDEHSIQNL
KNIINLFQLASGLSINLNKSTISPINVEAARTEQIASQWGISTKFLPINYLGVPLGGKQTTKSFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQL
SIFKAPVSTCKNIEKTWRNFLWKNPSETHKLHLVNWAKITSPKERGGLGISRLKDTNFALLTKWLWRYIHEDSPLWKKIINAKYRSLSKGDIHCVYNHSSSRSPW
FSICKGLDWFQRHVSWKIKNGRSFSFWHSHWHQNSPLSFHYPRLFALSTNKDSSIRDMWNNTLMDWDLNPRRQLRDWEYPLWAELKNSLNASFCENGKYSPTWAL
NSDGFYTVASVKKALQQPDQSFLDLQSQNTFKNLWKTSIPKKCIFFIWTLLYDSVNTAEQLMKRLPNLCSRPSWCVMCKRNDEDRIHLFILCPIAKSIWRLISSH
LNSNVNCLSPKDLCITMCSWKQKTKKNVILYNTYASALWNIWLERNARIFNGKEKTVAEIWEDIKALAGLWTSRSSLFSNYQASSIALNLNGFR