CuGenDBv2

ClCG04G001860 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG04G001860
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Ty3/gypsy retrotransposon protein
Genome location: CG_Chr04:6489654..6494542
RNA-Seq Expression: ClCG04G001860
Synteny: ClCG04G001860
Gene Ontology terms: NA
InterPro domains:
  IPR007021 - Domain of unknown function DUF659
  IPR012337 - Ribonuclease H-like superfamily
  IPR021109 - Aspartic peptidase domain superfamily
  IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
  IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
  IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits | e value | % identity | Alignment
KAA0025132.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] | 8.0e-200 | 51.69
Query:  MKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHRCKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKL
        M+TITLR V+T  N +EGP KRLTDAEFQA REKGL FRC E+Y AGHRCK ++ +ELR+L+V E+ +ELE+ +++    E  E++  + K    +  +L
Subjt:  MKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHRCKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKL

Query:  LINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSPIVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVIL
         INSVVGLT  G MKVKGR+ +EEV +LIDC ATHN+IAE+LV  LQ P+ ETPNYGVILGSG++I GKG+ + V+L +G   + DSFLPLE  GVD IL
Subjt:  LINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSPIVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVIL

Query:  GMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK-----------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTR
        GMQ LH++G T+VD  NL +T  + G  + +KGD SLTK + SLK            +L+ECR +                 EGG   A      +V+ R
Subjt:  GMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK-----------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTR

Query:  SESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----
           +  +   EH     L++G DPVNVR YRYA+ QK EME+L+DEML+SG+IRPSTSPYSSP+LLV+K+DG WRFCVDYRALNNVT+PD F IP     
Subjt:  SESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----

Query:  ------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSV
                    IDLK+GYHQI  HP DIEK AF THEGH EF+V+PF L+NAPSTFQALMN +FKP+LR             SK +EEH +HLE V  +
Subjt:  ------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSV

Query:  LRENELYANKNKCHYARV---------------------------ANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFER
        LRE+ELYAN +KCH+A+                              P N++E+RG  GLTGYY+RFVQ+YGS++A L+QLLK  GA++W EE   AFE+
Subjt:  LRENELYANKNKCHYARV---------------------------ANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFER

Query:  LKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VI
        LK+AMMTLP          FE+E+D SG+G+ AVL Q ++P+AY+S  LS +DR +PVYERELMAVV A QRWRPYL                     VI
Subjt:  LKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VI

Query:  QPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTIVDVEVIKFEVEAETKLKDV
        QPQ+Q+WIAKLLGY F+V+Y+PGLEN+AADALS I P+ HL  L+AP ++DVEVI+ EV  +  L+++
Subjt:  QPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTIVDVEVIKFEVEAETKLKDV

KAA0049630.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] | 1.6e-200 | 51.69
Query:  MKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHRCKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKL
        M+TITLR V+T  N +EGP KRLTDAEFQA REKGL FRC E+Y AGHRCK ++ +ELR+L+V E+ +ELE+ +++    E  E++  + K    +  +L
Subjt:  MKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHRCKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKL

Query:  LINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSPIVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVIL
         INSVVGLT  G MKVKGR+ +EEV +LIDC ATHN+IAE+LV  LQ P+ ETPNYGVILGSG++I GKG+ + V+L +G   + DSFLPLE  GVD IL
Subjt:  LINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSPIVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVIL

Query:  GMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK-----------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTR
        GMQ LH++G T+VD  NLT+T  + G  + +KGD SLTK + SLK            +L+ECR +                 EGG + A      +V+ R
Subjt:  GMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK-----------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTR

Query:  SESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----
           +  +   EH     L++G DP+NVR YRYA+ QK EME+L+DEML+SG+IRPSTSPYSSP+LLV+KKDG WRFCVDYRALNNVT+PD F IP     
Subjt:  SESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----

Query:  ------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSV
                    IDLK+GYHQI  HP DIEK AF THEGH EF+V+PF L+NAPSTFQALMN +FKP+LR             SK +EEH +HLE V  +
Subjt:  ------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSV

Query:  LRENELYANKNKCHYARV---------------------------ANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFER
        LRE+ELYAN +KCH+A+                              P N++E+RG  GLTGYY+RFVQ+YGS++A L+QLLK  GA++W EE   AFE+
Subjt:  LRENELYANKNKCHYARV---------------------------ANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFER

Query:  LKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VI
        LK+AMMTLP          FE+E+D SG+G+ AVL Q ++P+AY+S  LS +DR +PVYERELMAVV A QRWRPYL                     VI
Subjt:  LKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VI

Query:  QPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTIVDVEVIKFEVEAETKLKDV
        QPQ+Q+WI KLLGY F+V+Y+PGLEN+AADALS I P+ HL  L+AP ++DVEVI+ EV  +  L+++
Subjt:  QPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTIVDVEVIKFEVEAETKLKDV

KAA0049776.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] | 3.6e-200 | 51.82
Query:  MKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHRCKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKL
        M+TITLR V+T  N +EGP KRLTDAEFQA REKGL FRC E+Y AGHRCK ++ +ELR+L+V E+ +ELE+ +++    E  E++  + K    +  +L
Subjt:  MKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHRCKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKL

Query:  LINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSPIVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVIL
         INSVVGLT  G MKVKGR+ +EEV +LIDC ATHN+IAE+LV  LQ P+ ETPNYGVILGSG++I GKG+ + V+L +G   + DSFLPLE  GVD IL
Subjt:  LINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSPIVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVIL

Query:  GMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK-----------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTR
        GMQ LH++G T+VD  NL +T  + G  + +KGD SLTK + SLK            +L+ECR +                 EGG   A      +V+ R
Subjt:  GMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK-----------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTR

Query:  SESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----
           +  +   EH     L++G DPVNVR YRYA+ QK EME+L+DEML+SG+IRPSTSPYSSP+LLV+KKDG WRFCVDYRALNNVT+PD F IP     
Subjt:  SESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----

Query:  ------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSV
                    IDLK+GYHQI  HP DIEK AF THEGH EF+V+PF L+NAPSTFQALMN +FKP+LR             SK +EEH +HLE V  +
Subjt:  ------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSV

Query:  LRENELYANKNKCHYARV---------------------------ANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFER
        LRE+ELYAN +KCH+A+                              P N++E+RG  GLTGYY+RFVQ+YGS++A L+QLLK  GA++W EE   AFE+
Subjt:  LRENELYANKNKCHYARV---------------------------ANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFER

Query:  LKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VI
        LK+AMMTLP          FE+E+D SG+G+ AVL Q ++P+AY+S  LS +DR +PVYERELMAVV A QRWRPYL                     VI
Subjt:  LKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VI

Query:  QPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTIVDVEVIKFEVEAETKLKDV
        QPQ+Q+WIAKLLGY F+V+Y+PGLEN+AADALS I P+ HL  L+AP ++DVEVI+ EV  +  L+++
Subjt:  QPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTIVDVEVIKFEVEAETKLKDV

TYK15990.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] | 3.6e-200 | 51.82
Query:  MKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHRCKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKL
        M+TITLR V+T  N +EGP KRLTDAEFQA REKGL FRC E+Y AGHRCK ++ +ELR+L+V E+ +ELE+ +++    E  E++  + K    +  +L
Subjt:  MKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHRCKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKL

Query:  LINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSPIVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVIL
         INSVVGLT  G MKVKGR+ +EEV +LIDC ATHN+IAE+LV  LQ P+ ETPNYGVILGSG++I GKG+ + V+L +G   + DSFLPLE  GVD IL
Subjt:  LINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSPIVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVIL

Query:  GMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK-----------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTR
        GMQ LH++G T+VD  NL +T  + G  + +KGD SLTK + SLK            +L+ECR +                 EGG   A      +V+ R
Subjt:  GMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK-----------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTR

Query:  SESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----
           +  +   EH     L++G DPVNVR YRYA+ QK EME+L+DEML+SG+IRPSTSPYSSP+LLV+KKDG WRFCVDYRALNNVT+PD F IP     
Subjt:  SESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----

Query:  ------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSV
                    IDLK+GYHQI  HP DIEK AF THEGH EF+V+PF L+NAPSTFQALMN +FKP+LR             SK +EEH +HLE V  +
Subjt:  ------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSV

Query:  LRENELYANKNKCHYARV---------------------------ANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFER
        LRE+ELYAN +KCH+A+                              P N++E+RG  GLTGYY+RFVQ+YGS++A L+QLLK  GA++W EE   AFE+
Subjt:  LRENELYANKNKCHYARV---------------------------ANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFER

Query:  LKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VI
        LK+AMMTLP          FE+E+D SG+G+ AVL Q ++P+AY+S  LS +DR +PVYERELMAVV A QRWRPYL                     VI
Subjt:  LKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VI

Query:  QPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTIVDVEVIKFEVEAETKLKDV
        QPQ+Q+WIAKLLGY F+V+Y+PGLEN+AADALS I P+ HL  L+AP ++DVEVI+ EV  +  L+++
Subjt:  QPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTIVDVEVIKFEVEAETKLKDV

TYK23090.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] | 4.7e-200 | 51.82
Query:  MKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHRCKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKL
        M+TITLR V+T  N +EGP KRLTDAEFQA REKGL FRC E+Y AGHRCK ++ +ELR+L+V E+ +ELE+ +++    E  E++  + K    +  +L
Subjt:  MKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHRCKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKL

Query:  LINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSPIVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVIL
         INSVVGLT  G MKVKGR+ +EEV +LIDC ATHN+IAE+LV  LQ P+ ETPNYGVILGSG++I GKG+ + V+L +G   + DSFLPLE  GVD IL
Subjt:  LINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSPIVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVIL

Query:  GMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK-----------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTR
        GMQ LH++G T+VD  NL +T  + G  + +KGD SLTK + SLK            +L+ECR +                 EGG   A      +V+ R
Subjt:  GMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK-----------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTR

Query:  SESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----
           +  +   EH     L++G DPVNVR YRYA+ QK EME+L+DEML+SG+IRPSTSPYSSP+LLV+KKDG WRFCVDYRALNNVT+PD F IP     
Subjt:  SESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----

Query:  ------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSV
                    IDLK+GYHQI  HP DIEK AF THEGH EF+V+PF L+NAPSTFQALMN +FKP+LR             SK +EEH +HLE V  +
Subjt:  ------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSV

Query:  LRENELYANKNKCHYARV---------------------------ANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFER
        LRE+ELYAN +KCH+A+                              P N++E+RG  GLTGYY+RFVQ+YGS++A L+QLLK  GA++W EE   AFE+
Subjt:  LRENELYANKNKCHYARV---------------------------ANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFER

Query:  LKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VI
        LK+AMMTLP          FE+E+D SG+G+ AVL Q ++P+AY+S  LS +DR +PVYERELMAVV A QRWRPYL                     VI
Subjt:  LKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VI

Query:  QPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTIVDVEVIKFEVEAETKLKDV
        QPQ+Q+WIAKLLGY F+V+Y+PGLEN+AADALS I P+ HL  L+AP ++DVEVI+ EV  +  L+++
Subjt:  QPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTIVDVEVIKFEVEAETKLKDV

TrEMBL top hits | e value | % identity | Alignment
A0A5A7SIV7 Ty3/gypsy retrotransposon protein | 3.9e-200 | 51.69
Query:  MKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHRCKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKL
        M+TITLR V+T  N +EGP KRLTDAEFQA REKGL FRC E+Y AGHRCK ++ +ELR+L+V E+ +ELE+ +++    E  E++  + K    +  +L
Subjt:  MKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHRCKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKL

Query:  LINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSPIVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVIL
         INSVVGLT  G MKVKGR+ +EEV +LIDC ATHN+IAE+LV  LQ P+ ETPNYGVILGSG++I GKG+ + V+L +G   + DSFLPLE  GVD IL
Subjt:  LINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSPIVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVIL

Query:  GMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK-----------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTR
        GMQ LH++G T+VD  NL +T  + G  + +KGD SLTK + SLK            +L+ECR +                 EGG   A      +V+ R
Subjt:  GMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK-----------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTR

Query:  SESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----
           +  +   EH     L++G DPVNVR YRYA+ QK EME+L+DEML+SG+IRPSTSPYSSP+LLV+K+DG WRFCVDYRALNNVT+PD F IP     
Subjt:  SESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----

Query:  ------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSV
                    IDLK+GYHQI  HP DIEK AF THEGH EF+V+PF L+NAPSTFQALMN +FKP+LR             SK +EEH +HLE V  +
Subjt:  ------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSV

Query:  LRENELYANKNKCHYARV---------------------------ANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFER
        LRE+ELYAN +KCH+A+                              P N++E+RG  GLTGYY+RFVQ+YGS++A L+QLLK  GA++W EE   AFE+
Subjt:  LRENELYANKNKCHYARV---------------------------ANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFER

Query:  LKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VI
        LK+AMMTLP          FE+E+D SG+G+ AVL Q ++P+AY+S  LS +DR +PVYERELMAVV A QRWRPYL                     VI
Subjt:  LKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VI

Query:  QPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTIVDVEVIKFEVEAETKLKDV
        QPQ+Q+WIAKLLGY F+V+Y+PGLEN+AADALS I P+ HL  L+AP ++DVEVI+ EV  +  L+++
Subjt:  QPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTIVDVEVIKFEVEAETKLKDV

A0A5A7U2S1 Ty3/gypsy retrotransposon protein | 7.8e-201 | 51.69
Query:  MKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHRCKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKL
        M+TITLR V+T  N +EGP KRLTDAEFQA REKGL FRC E+Y AGHRCK ++ +ELR+L+V E+ +ELE+ +++    E  E++  + K    +  +L
Subjt:  MKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHRCKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKL

Query:  LINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSPIVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVIL
         INSVVGLT  G MKVKGR+ +EEV +LIDC ATHN+IAE+LV  LQ P+ ETPNYGVILGSG++I GKG+ + V+L +G   + DSFLPLE  GVD IL
Subjt:  LINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSPIVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVIL

Query:  GMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK-----------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTR
        GMQ LH++G T+VD  NLT+T  + G  + +KGD SLTK + SLK            +L+ECR +                 EGG + A      +V+ R
Subjt:  GMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK-----------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTR

Query:  SESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----
           +  +   EH     L++G DP+NVR YRYA+ QK EME+L+DEML+SG+IRPSTSPYSSP+LLV+KKDG WRFCVDYRALNNVT+PD F IP     
Subjt:  SESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----

Query:  ------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSV
                    IDLK+GYHQI  HP DIEK AF THEGH EF+V+PF L+NAPSTFQALMN +FKP+LR             SK +EEH +HLE V  +
Subjt:  ------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSV

Query:  LRENELYANKNKCHYARV---------------------------ANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFER
        LRE+ELYAN +KCH+A+                              P N++E+RG  GLTGYY+RFVQ+YGS++A L+QLLK  GA++W EE   AFE+
Subjt:  LRENELYANKNKCHYARV---------------------------ANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFER

Query:  LKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VI
        LK+AMMTLP          FE+E+D SG+G+ AVL Q ++P+AY+S  LS +DR +PVYERELMAVV A QRWRPYL                     VI
Subjt:  LKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VI

Query:  QPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTIVDVEVIKFEVEAETKLKDV
        QPQ+Q+WI KLLGY F+V+Y+PGLEN+AADALS I P+ HL  L+AP ++DVEVI+ EV  +  L+++
Subjt:  QPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTIVDVEVIKFEVEAETKLKDV

A0A5A7U6J3 Ty3/gypsy retrotransposon protein | 1.7e-200 | 51.82
Query:  MKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHRCKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKL
        M+TITLR V+T  N +EGP KRLTDAEFQA REKGL FRC E+Y AGHRCK ++ +ELR+L+V E+ +ELE+ +++    E  E++  + K    +  +L
Subjt:  MKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHRCKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKL

Query:  LINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSPIVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVIL
         INSVVGLT  G MKVKGR+ +EEV +LIDC ATHN+IAE+LV  LQ P+ ETPNYGVILGSG++I GKG+ + V+L +G   + DSFLPLE  GVD IL
Subjt:  LINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSPIVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVIL

Query:  GMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK-----------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTR
        GMQ LH++G T+VD  NL +T  + G  + +KGD SLTK + SLK            +L+ECR +                 EGG   A      +V+ R
Subjt:  GMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK-----------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTR

Query:  SESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----
           +  +   EH     L++G DPVNVR YRYA+ QK EME+L+DEML+SG+IRPSTSPYSSP+LLV+KKDG WRFCVDYRALNNVT+PD F IP     
Subjt:  SESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----

Query:  ------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSV
                    IDLK+GYHQI  HP DIEK AF THEGH EF+V+PF L+NAPSTFQALMN +FKP+LR             SK +EEH +HLE V  +
Subjt:  ------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSV

Query:  LRENELYANKNKCHYARV---------------------------ANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFER
        LRE+ELYAN +KCH+A+                              P N++E+RG  GLTGYY+RFVQ+YGS++A L+QLLK  GA++W EE   AFE+
Subjt:  LRENELYANKNKCHYARV---------------------------ANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFER

Query:  LKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VI
        LK+AMMTLP          FE+E+D SG+G+ AVL Q ++P+AY+S  LS +DR +PVYERELMAVV A QRWRPYL                     VI
Subjt:  LKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VI

Query:  QPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTIVDVEVIKFEVEAETKLKDV
        QPQ+Q+WIAKLLGY F+V+Y+PGLEN+AADALS I P+ HL  L+AP ++DVEVI+ EV  +  L+++
Subjt:  QPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTIVDVEVIKFEVEAETKLKDV

A0A5D3CXB1 Ty3/gypsy retrotransposon protein | 1.7e-200 | 51.82
Query:  MKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHRCKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKL
        M+TITLR V+T  N +EGP KRLTDAEFQA REKGL FRC E+Y AGHRCK ++ +ELR+L+V E+ +ELE+ +++    E  E++  + K    +  +L
Subjt:  MKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHRCKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKL

Query:  LINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSPIVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVIL
         INSVVGLT  G MKVKGR+ +EEV +LIDC ATHN+IAE+LV  LQ P+ ETPNYGVILGSG++I GKG+ + V+L +G   + DSFLPLE  GVD IL
Subjt:  LINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSPIVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVIL

Query:  GMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK-----------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTR
        GMQ LH++G T+VD  NL +T  + G  + +KGD SLTK + SLK            +L+ECR +                 EGG   A      +V+ R
Subjt:  GMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK-----------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTR

Query:  SESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----
           +  +   EH     L++G DPVNVR YRYA+ QK EME+L+DEML+SG+IRPSTSPYSSP+LLV+KKDG WRFCVDYRALNNVT+PD F IP     
Subjt:  SESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----

Query:  ------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSV
                    IDLK+GYHQI  HP DIEK AF THEGH EF+V+PF L+NAPSTFQALMN +FKP+LR             SK +EEH +HLE V  +
Subjt:  ------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSV

Query:  LRENELYANKNKCHYARV---------------------------ANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFER
        LRE+ELYAN +KCH+A+                              P N++E+RG  GLTGYY+RFVQ+YGS++A L+QLLK  GA++W EE   AFE+
Subjt:  LRENELYANKNKCHYARV---------------------------ANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFER

Query:  LKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VI
        LK+AMMTLP          FE+E+D SG+G+ AVL Q ++P+AY+S  LS +DR +PVYERELMAVV A QRWRPYL                     VI
Subjt:  LKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VI

Query:  QPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTIVDVEVIKFEVEAETKLKDV
        QPQ+Q+WIAKLLGY F+V+Y+PGLEN+AADALS I P+ HL  L+AP ++DVEVI+ EV  +  L+++
Subjt:  QPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTIVDVEVIKFEVEAETKLKDV

A0A5D3DI73 Ty3/gypsy retrotransposon protein | 2.3e-200 | 51.82
Query:  MKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHRCKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKL
        M+TITLR V+T  N +EGP KRLTDAEFQA REKGL FRC E+Y AGHRCK ++ +ELR+L+V E+ +ELE+ +++    E  E++  + K    +  +L
Subjt:  MKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHRCKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKL

Query:  LINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSPIVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVIL
         INSVVGLT  G MKVKGR+ +EEV +LIDC ATHN+IAE+LV  LQ P+ ETPNYGVILGSG++I GKG+ + V+L +G   + DSFLPLE  GVD IL
Subjt:  LINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSPIVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVIL

Query:  GMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK-----------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTR
        GMQ LH++G T+VD  NL +T  + G  + +KGD SLTK + SLK            +L+ECR +                 EGG   A      +V+ R
Subjt:  GMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK-----------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTR

Query:  SESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----
           +  +   EH     L++G DPVNVR YRYA+ QK EME+L+DEML+SG+IRPSTSPYSSP+LLV+KKDG WRFCVDYRALNNVT+PD F IP     
Subjt:  SESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----

Query:  ------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSV
                    IDLK+GYHQI  HP DIEK AF THEGH EF+V+PF L+NAPSTFQALMN +FKP+LR             SK +EEH +HLE V  +
Subjt:  ------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSV

Query:  LRENELYANKNKCHYARV---------------------------ANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFER
        LRE+ELYAN +KCH+A+                              P N++E+RG  GLTGYY+RFVQ+YGS++A L+QLLK  GA++W EE   AFE+
Subjt:  LRENELYANKNKCHYARV---------------------------ANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFER

Query:  LKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VI
        LK+AMMTLP          FE+E+D SG+G+ AVL Q ++P+AY+S  LS +DR +PVYERELMAVV A QRWRPYL                     VI
Subjt:  LKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VI

Query:  QPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTIVDVEVIKFEVEAETKLKDV
        QPQ+Q+WIAKLLGY F+V+Y+PGLEN+AADALS I P+ HL  L+AP ++DVEVI+ EV  +  L+++
Subjt:  QPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTIVDVEVIKFEVEAETKLKDV

SwissProt top hits | e value | % identity | Alignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 | 3.3e-47 | 30.9
Query:  VNVRH-----YRYAY---HQKEMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGG-----WRFCVDYRALNNVTIPD-----------------NFLIP
        +N +H      +Y+Y   +++E+E  + +ML  G+IR S SPY+SPI +V KK        +R  +DYR LN +T+ D                 N+   
Subjt:  VNVRH-----YRYAY---HQKEMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGG-----WRFCVDYRALNNVTIPD-----------------NFLIP

Query:  IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNK
        IDL  G+HQI   P  + K AF T  GH E+L +PF L NAP+TFQ  MN I +P L              S +++EH++ L  VF  L +  L    +K
Subjt:  IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNK

Query:  CHYARVAN---------------------------PTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYE-AFERLKRAMMTLP--
        C + +                              PT  KE++   GLTGYY++F+ ++  +A  +++ LK     +     Y+ AF++LK  +   P  
Subjt:  CHYARVAN---------------------------PTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYE-AFERLKRAMMTLP--

Query:  -------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYLV-----IQPQHQ----------------KWIAKL
                F + TD S   + AVLSQ   P++Y S TL+  +      E+EL+A+V A + +R YL+     I   HQ                +W  KL
Subjt:  -------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYLV-----IQPQHQ----------------KWIAKL

Query:  LGYDFQVVYRPGLENEAADALSCI
          +DF + Y  G EN  ADALS I
Subjt:  LGYDFQVVYRPGLENEAADALSCI

P20825 Retrovirus-related Pol polyprotein from transposon 297 | 3.5e-49 | 33.49
Query:  PVNVRHYRYAY-HQKEMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGG-----WRFCVDYRALNNVTIPDNFLIP-----------------IDLKSG
        P+  + Y  A  H+ E+E  + EML  G+IR S SPY+SP  +V KK        +R  +DYR LN +TIPD + IP                 IDL  G
Subjt:  PVNVRHYRYAY-HQKEMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGG-----WRFCVDYRALNNVTIPDNFLIP-----------------IDLKSG

Query:  YHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHY-AR
        +HQI      I K AF T  GH E+L +PF L NAP+TFQ  MN I +P L              S ++ EH+  ++ VF+ L +  L    +KC +  +
Subjt:  YHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHY-AR

Query:  VAN--------------------------PTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAY-EAFERLKR-----AMMTLPSFE
         AN                          PT  KE+R   GLTGYY++F+ +Y  +A  ++  LK     +  +  Y EAFE+LK       ++ LP FE
Subjt:  VAN--------------------------PTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAY-EAFERLKR-----AMMTLPSFE

Query:  ----VETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYL-----VIQPQHQ----------------KWIAKLLGYDFQ
            + TD S   + AVLSQ   PI++ S TL+  +      E+EL+A+V A + +R YL     +I   HQ                +W  +L  Y F+
Subjt:  ----VETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYL-----VIQPQHQ----------------KWIAKLLGYDFQ

Query:  VVYRPGLENEAADALSCI
        + Y  G EN  ADALS I
Subjt:  VVYRPGLENEAADALSCI

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | 3.2e-42 | 29.85
Query:  HQKEMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFC
        +++E+ K++ ++L +  I PS SP SSP++LV KKDG +R CVDYR LN  TI D F +P                 +DL SGYHQI   P D  K AF 
Subjt:  HQKEMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFC

Query:  THEGHCEFLVIPFALSNAPSTFQALMNAIFKP-----------YLRSKTVEEHVKHLESVFSVLRENELYANKNKCHYA--------------RVA----
        T  G  E+ V+PF L NAPSTF   M   F+             + S++ EEH KHL++V   L+   L   K KC +A              ++A    
Subjt:  THEGHCEFLVIPFALSNAPSTFQALMNAIFKP-----------YLRSKTVEEHVKHLESVFSVLRENELYANKNKCHYA--------------RVA----

Query:  ---------NPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQ
                  P  +K+ +   G+  YY+RF+ +   +A  +   +      +W E+  +A E+LK A+   P         ++ + TD S  GI AVL +
Subjt:  ---------NPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQ

Query:  QRRP------IAYYSNTLSLKDRGKPVYERELMAVVMAAQRWR-----PYLVIQPQH----------------QKWIAKLLGYDFQVVYRPGLENEAADA
                  + Y+S +L    +  P  E EL+ ++ A   +R      +  ++  H                Q+W+  L  YDF + Y  G +N  ADA
Subjt:  QRRP------IAYYSNTLSLKDRGKPVYERELMAVVMAAQRWR-----PYLVIQPQH----------------QKWIAKLLGYDFQVVYRPGLENEAADA

Query:  LS
        +S
Subjt:  LS

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus | 1.1e-47 | 30.07
Query:  DPVNVRHYRYAYHQK-EMEKLMDEMLTSGVIRPSTSPYSSPILLVKKK-----DGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKS
        DP+  + Y Y  + + E+E+ +DE+L  G+IRPS SPY+SPI +V KK     +  +R  VD++ LN VTIPD + IP                 +DL S
Subjt:  DPVNVRHYRYAYHQK-EMEKLMDEMLTSGVIRPSTSPYSSPILLVKKK-----DGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKS

Query:  GYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHY--
        G+HQI     DI K AF T  G  EFL +PF L NAP+ FQ +++ I + ++              S+  + H K+L  V + L +  L  N  K H+  
Subjt:  GYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHY--

Query:  -------------------------ARVANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGA-----------FEWNEEAYEAFERLKRAMM
                                 + +  PT++KE++   G+T YY++F+Q Y  +A  L+ L +G  A              +E A ++F  LK  + 
Subjt:  -------------------------ARVANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGA-----------FEWNEEAYEAFERLKRAMM

Query:  T---------LPSFEVETDTSGYGIRAVLSQ----QRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYLV------IQPQHQ------------
        +            F + TD S + I AVLSQ    + RPIAY S +L+  +      E+E++A++ +    R YL       +   HQ            
Subjt:  T---------LPSFEVETDTSGYGIRAVLSQ----QRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYLV------IQPQHQ------------

Query:  ----KWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMH
            +W A++  Y+ +++Y+PG  N  ADALS IPP ++
Subjt:  ----KWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMH

Q99315 Transposon Ty3-G Gag-Pol polyprotein | e value: 9.3e-42 | % identity: 29.6
Query:  HQKEMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFC
        +++E+ K++ ++L +  I PS SP SSP++LV KKDG +R CVDYR LN  TI D F +P                 +DL SGYHQI   P D  K AF 
Subjt:  HQKEMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFC

Query:  THEGHCEFLVIPFALSNAPSTFQALMNAIFKP-----------YLRSKTVEEHVKHLESVFSVLRENELYANKNKCHYA--------------RVA----
        T  G  E+ V+PF L NAPSTF   M   F+             + S++ EEH KHL++V   L+   L   K KC +A              ++A    
Subjt:  THEGHCEFLVIPFALSNAPSTFQALMNAIFKP-----------YLRSKTVEEHVKHLESVFSVLRENELYANKNKCHYA--------------RVA----

Query:  ---------NPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQ
                  P  +K+ +   G+  YY+RF+ +   +A  +   +      +W E+  +A ++LK A+   P         ++ + TD S  GI AVL +
Subjt:  ---------NPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQ

Query:  QRRP------IAYYSNTLSLKDRGKPVYERELMAVVMAAQRWR-----PYLVIQPQH----------------QKWIAKLLGYDFQVVYRPGLENEAADA
                  + Y+S +L    +  P  E EL+ ++ A   +R      +  ++  H                Q+W+  L  YDF + Y  G +N  ADA
Subjt:  QRRP------IAYYSNTLSLKDRGKPVYERELMAVVMAAQRWR-----PYLVIQPQH----------------QKWIAKLLGYDFQVVYRPGLENEAADA

Query:  LS
        +S
Subjt:  LS

Arabidopsis top hits (e value, % identity, alignment)
AT1G79740.1 hAT transposon superfamily | e value: 1.8e-08 | % identity: 25.6
Query:  IVIDGWSDSQRRPLINFMEITEGGPMFLKAIVCSSEIKDKYFIANLMKEVINEVGHENMIQIITDNA------ANFLTLNLA--LKNICAARNIASNQHV
        I+ + W+D++ R LINF   +     F K++  SS  K+   +A+L   VI ++G E+++QII DN+      +N L  N A    + CA++ +      
Subjt:  IVIDGWSDSQRRPLINFMEITEGGPMFLKAIVCSSEIKDKYFIANLMKEVINEVGHENMIQIITDNA------ANFLTLNLA--LKNICAARNIASNQHV

Query:  FAECSWISEISDDVMFVKNFIMNHSMRLAIFDEFV-HLKLLSVAETRFASTIIMLKRFKLIKGGLQIM
        F++  W+++       +  F+ N+S  L +  +      ++    TR  S  + L+     K  L+ M
Subjt:  FAECSWISEISDDVMFVKNFIMNHSMRLAIFDEFV-HLKLLSVAETRFASTIIMLKRFKLIKGGLQIM

AT4G08267.1 hAT transposon superfamily protein | e value: 2.4e-08 | % identity: 36.46
Query:  IITDNAANFL---------------------TLNLALKNICA-ARNIASNQHVFAECSWISEISDDVMFVKNFIMNHSMRLAIFDEFVHLKLLSVA
        ++T+NA+N++                     TLNLALKN CA + +  +N+ V+  C WI  IS++V ++KN IMN+ +RL +F E   LKLL+++
Subjt:  IITDNAANFL---------------------TLNLALKNICA-ARNIASNQHVFAECSWISEISDDVMFVKNFIMNHSMRLAIFDEFVHLKLLSVA

AT4G15020.1 hAT transposon superfamily | e value: 4.1e-08 | % identity: 25.84
Query:  IVIDGWSDSQRRPLINFMEITEGGPMFLKAIVCSSEIKDKYFIANLMKEVINEVGHENMIQIITDNAANFLTLNLALKNI--------CAARNIASNQHV
        I+++  +  +   ++NF+       +FLK++  S  +     +  L+ E++ EVG  N++Q+IT     ++     L  +        CAA  I      
Subjt:  IVIDGWSDSQRRPLINFMEITEGGPMFLKAIVCSSEIKDKYFIANLMKEVINEVGHENMIQIITDNAANFLTLNLALKNI--------CAARNIASNQHV

Query:  FAECSWISEISDDVMFVKNFIMNHSMRLAIFDEF-----VHLKLLSVAETRFASTIIMLKRFKLIKGGLQIMVISDKW
        F +  WISE  +    +  F+ NHS  L +  +F     + L   S + T FA+    L R   +K  LQ MV S +W
Subjt:  FAECSWISEISDDVMFVKNFIMNHSMRLAIFDEF-----VHLKLLSVAETRFASTIIMLKRFKLIKGGLQIMVISDKW

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | e value: 9.3e-13 | % identity: 43.75
Query:  NLVTWHSKKQNVMARNSAEAEYRAMTNRVCEILWIHRILKESRMEVKAPMKLFHDNKAALNIAQNPIQHDSSRTKHIKVD
        +L++W SKKQ V++++SAEAEYRA++    E++W+ +  +E ++ +  P  LF DN AA++IA N + H+  RTKHI+ D
Subjt:  NLVTWHSKKQNVMARNSAEAEYRAMTNRVCEILWIHRILKESRMEVKAPMKLFHDNKAALNIAQNPIQHDSSRTKHIKVD

ATMG00860.1 DNA/RNA polymerases superfamily protein | e value: 1.5e-10 | % identity: 32.23
Query:  VKHLESVFSVLRENELYANKNKCHYARV-----------------------------ANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAF
        + HL  V  +  +++ YAN+ KC + +                                P N  E+RG  GLTGYY+RFV++YG +   L++LLK   + 
Subjt:  VKHLESVFSVLRENELYANKNKCHYARV-----------------------------ANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAF

Query:  EWNEEAYEAFERLKRAMMTLP
        +W E A  AF+ LK A+ TLP
Subjt:  EWNEEAYEAFERLKRAMMTLP


Sequences
CDS sequence
ATGACTGCCGAAGATGATACAAAACTGTTATGGCAACATGCAACAAAGAATGAAAGATTGAATGGAGGAGGTGGGAACATTGCTTGGCAAGTTGGGGCACACCTC
CTAAAATTGGGAGGATATGGGATTGGAGTATGTAAAAAGGCCACCTCTAAAGATCTTGCAAAAATGCAAAAGTTGGAAGATGAAGCAAAAGCTCAGATTTTAAAG
AATGCTCCTAAACAAGTTCCTTTACCACCTTCACAACATATGCAAACTGAAACTCATTCTTTTCGAACAGGAAGTACAGGTCTCGCATTTGCTTTGGCTGCGAAT
AATGCATTGTCAGTATGTATTGTCATTGACGGATGGAGCGATTCGCAGAGGAGACCCTTGATTAACTTTATGGAAATTACAGAAGGAGGACCAATGTTTCTTAAA
GCTATAGTTTGCTCAAGTGAGATTAAAGATAAATACTTCATAGCAAATCTGATGAAAGAAGTCATAAATGAGGTGGGACACGAGAATATGATTCAAATAATCACT
GACAATGCTGCAAATTTTCTTACACTCAATCTTGCTTTGAAAAACATTTGTGCTGCAAGAAATATTGCTAGTAATCAACATGTATTTGCAGAGTGTAGTTGGATT
TCTGAAATTTCTGATGATGTTATGTTTGTCAAGAACTTTATCATGAATCATTCCATGAGACTTGCTATATTCGACGAGTTTGTGCATTTGAAGTTGCTTTCAGTA
GCAGAAACACGTTTTGCATCTACCATCATTATGCTTAAAAGATTTAAACTCATTAAGGGTGGTTTGCAAATAATGGTTATCAGTGACAAATGGGGATGCTACAGA
GAAGATGATGTGGAGAAAGCAAAACATGTAAAGGAATTGTTGGTGGGGCATCCTGTTAGGCCACCCATAGTGCTAACATATAGTAGAAAGGGTAATAAGGGGAAA
GGTATTGTGTGTGGGAGTGTAGCTGCTGAGGGGAGAGGGGGACTTTTTGGGTTCCCTAGTGTTCTTGTTTTCGCAGTAGTTCCTTATCAAATTGGCATCAGAGTG
TTCAACTTAGGGAAGATTGGGAAGATGGCGCAAAAGAAATTTGAGGAACGGATAGATGCAATGGACCAGGAGGTGTCGGAAATTCGGGTAGAGATTCGACGGTTA
CCGGAGATTGAAGAGACTTTAGTATCGTTGACGAAGACACGAGGACAGAAGACGACGATTAAAGCAAGTGCAGAATCTGGAAGTAAGAGTGCTTCAAGTGAGACG
ATGGGACCCGCTGACACCAACACAATGAAGACGATTACTTTAAGAGGAGTATCGACGTTAGGAAACTGGCAAGAAGGACCGGCGAAGAGGTTGACAGATGCTGAA
TTCCAAGCAGAACGAGAAAAGGGTCTGTACTTCAGGTGTGATGAACGATATTCGGCCGGCCATAGATGCAAAAATCAAGAGCAGAGGGAATTAAGAATACTGGTA
GTTATGGAAGATGATGAATTGGAGGTATTCGACGACAAAGATACCAAGGGAGAGCAAGTCGAATTACAAATGATGAAAGCGAAAGGGGAGATACAAATAGGGGCG
AAATTATTGATTAACTCGGTCGTGGGTTTAACTTATTCGGGGATGATGAAGGTGAAAGGAAGAATCCAGGAAGAAGAAGTGACTATGTTGATCGACTGCGCAACA
CATAATTACATTGCCGAGAGGTTAGTTTCAACTTTGCAATCACCAATTGTAGAGACCCCCAACTATGGTGTAATTTTGGGCTCGGGGTCAGCGATAATAGGGAAG
GGCATTCGTAATGTTGTTAAATTGACAATTGGTGAGCTGATTCTGCGGGATAGTTTCTTACCCTTGGAGTTCGAGGGGGTAGATGTGATTTTGGGTATGCAACGG
TTACATACTTTAGGAGTCACGAAGGTCGATTTGTGGAATTTGACCATGACAATTAATCAAGGAGGTAAAACTATTGTGTTGAAAGGGGACCTGAGTCTGACGAAG
TTGCGAGGGAGCTTAAAGAGATTCTTGATAGAGTGTCGAGCTATGGAAGGAGGAATGTCGTTGGCCAAACGGTATGGAGTGGATGAAGTATACACCAGATCCGAG
TCAATACAAGCTGAGGAGCATAGAACATCATATTCACTTGAAGAAGGGACAGATCCGGTCAATGTAAGGCATTATAGATATGCATACCACCAAAAAGAGATGGAA
AAATTGATGGATGAGATGTTGACCTCAGGGGTTATCCGGCCTAGCACCAGTCCATACTCGAGCCCTATCTTGTTGGTGAAGAAGAAGGATGGGGGATGGAGGTTT
TGTGTGGATTATAGAGCGTTGAACAATGTTACTATACCAGACAATTTTCTGATTCCCATTGACTTGAAGTCGGGATACCACCAAATATGGACGCACCCTGGTGAC
ATTGAGAAAATTGCATTCTGTACGCATGAAGGCCACTGTGAATTTTTAGTCATACCCTTCGCATTGTCTAATGCTCCGTCCACCTTCCAGGCGCTTATGAATGCA
ATTTTCAAGCCTTATTTGAGGAGTAAAACTGTGGAGGAGCATGTGAAGCATCTAGAGTCAGTATTTTCTGTGCTTCGGGAGAATGAGTTGTATGCTAATAAGAAT
AAGTGCCACTATGCTAGAGTGGCTAACCCCACTAATATTAAGGAGGTACGTGGGTTGACGGGGTTGACAGGATATTATAAACGTTTTGTCCAACATTATGGAAGC
ATGGCAGCCCTCTTAAGCCAATTGTTAAAAGGAGGAGGAGCATTTGAATGGAATGAAGAGGCATATGAGGCTTTCGAACGATTGAAGAGGGCTATGATGACCTTA
CCGTCCTTTGAGGTGGAAACTGACACATCGGGGTATGGAATTAGGGCCGTTTTGTCTCAACAGAGGAGGCCAATTGCGTATTATAGCAACACTCTGTCACTGAAG
GACAGAGGAAAACCAGTTTATGAGAGGGAACTGATGGCCGTGGTTATGGCGGCGCAACGATGGAGACCTTATTTGGTGATACAACCACAACATCAGAAGTGGATT
GCTAAGCTGTTGGGGTATGATTTCCAAGTTGTTTACAGACCAGGATTGGAGAACGAAGCTGCTGATGCTTTGTCGTGCATACCTCCCTCTATGCATTTGGCCCAT
TTGTCAGCCCCTACTATTGTCGATGTGGAAGTCATCAAATTTGAGGTAGAGGCTGAAACAAAATTGAAGGACGTGAAAAGACTTCAACTCGAGCTGATATTGTAC
ATATGTATGGGAAACTTGGTGACATGGCACAGTAAGAAGCAAAATGTAATGGCTCGAAATAGTGCTGAGGCAGAATATAGGGCCATGACTAACAGAGTATGTGAG
ATTTTGTGGATTCATAGAATACTTAAAGAGTCAAGAATGGAGGTCAAAGCTCCAATGAAGCTCTTTCATGATAACAAAGCTGCTTTAAACATTGCACAAAATCCA
ATCCAACATGATAGCAGTCGCACAAAGCACATAAAGGTGGATTGA
mRNA sequence
ATGACTGCCGAAGATGATACAAAACTGTTATGGCAACATGCAACAAAGAATGAAAGATTGAATGGAGGAGGTGGGAACATTGCTTGGCAAGTTGGGGCACACCTC
CTAAAATTGGGAGGATATGGGATTGGAGTATGTAAAAAGGCCACCTCTAAAGATCTTGCAAAAATGCAAAAGTTGGAAGATGAAGCAAAAGCTCAGATTTTAAAG
AATGCTCCTAAACAAGTTCCTTTACCACCTTCACAACATATGCAAACTGAAACTCATTCTTTTCGAACAGGAAGTACAGGTCTCGCATTTGCTTTGGCTGCGAAT
AATGCATTGTCAGTATGTATTGTCATTGACGGATGGAGCGATTCGCAGAGGAGACCCTTGATTAACTTTATGGAAATTACAGAAGGAGGACCAATGTTTCTTAAA
GCTATAGTTTGCTCAAGTGAGATTAAAGATAAATACTTCATAGCAAATCTGATGAAAGAAGTCATAAATGAGGTGGGACACGAGAATATGATTCAAATAATCACT
GACAATGCTGCAAATTTTCTTACACTCAATCTTGCTTTGAAAAACATTTGTGCTGCAAGAAATATTGCTAGTAATCAACATGTATTTGCAGAGTGTAGTTGGATT
TCTGAAATTTCTGATGATGTTATGTTTGTCAAGAACTTTATCATGAATCATTCCATGAGACTTGCTATATTCGACGAGTTTGTGCATTTGAAGTTGCTTTCAGTA
GCAGAAACACGTTTTGCATCTACCATCATTATGCTTAAAAGATTTAAACTCATTAAGGGTGGTTTGCAAATAATGGTTATCAGTGACAAATGGGGATGCTACAGA
GAAGATGATGTGGAGAAAGCAAAACATGTAAAGGAATTGTTGGTGGGGCATCCTGTTAGGCCACCCATAGTGCTAACATATAGTAGAAAGGGTAATAAGGGGAAA
GGTATTGTGTGTGGGAGTGTAGCTGCTGAGGGGAGAGGGGGACTTTTTGGGTTCCCTAGTGTTCTTGTTTTCGCAGTAGTTCCTTATCAAATTGGCATCAGAGTG
TTCAACTTAGGGAAGATTGGGAAGATGGCGCAAAAGAAATTTGAGGAACGGATAGATGCAATGGACCAGGAGGTGTCGGAAATTCGGGTAGAGATTCGACGGTTA
CCGGAGATTGAAGAGACTTTAGTATCGTTGACGAAGACACGAGGACAGAAGACGACGATTAAAGCAAGTGCAGAATCTGGAAGTAAGAGTGCTTCAAGTGAGACG
ATGGGACCCGCTGACACCAACACAATGAAGACGATTACTTTAAGAGGAGTATCGACGTTAGGAAACTGGCAAGAAGGACCGGCGAAGAGGTTGACAGATGCTGAA
TTCCAAGCAGAACGAGAAAAGGGTCTGTACTTCAGGTGTGATGAACGATATTCGGCCGGCCATAGATGCAAAAATCAAGAGCAGAGGGAATTAAGAATACTGGTA
GTTATGGAAGATGATGAATTGGAGGTATTCGACGACAAAGATACCAAGGGAGAGCAAGTCGAATTACAAATGATGAAAGCGAAAGGGGAGATACAAATAGGGGCG
AAATTATTGATTAACTCGGTCGTGGGTTTAACTTATTCGGGGATGATGAAGGTGAAAGGAAGAATCCAGGAAGAAGAAGTGACTATGTTGATCGACTGCGCAACA
CATAATTACATTGCCGAGAGGTTAGTTTCAACTTTGCAATCACCAATTGTAGAGACCCCCAACTATGGTGTAATTTTGGGCTCGGGGTCAGCGATAATAGGGAAG
GGCATTCGTAATGTTGTTAAATTGACAATTGGTGAGCTGATTCTGCGGGATAGTTTCTTACCCTTGGAGTTCGAGGGGGTAGATGTGATTTTGGGTATGCAACGG
TTACATACTTTAGGAGTCACGAAGGTCGATTTGTGGAATTTGACCATGACAATTAATCAAGGAGGTAAAACTATTGTGTTGAAAGGGGACCTGAGTCTGACGAAG
TTGCGAGGGAGCTTAAAGAGATTCTTGATAGAGTGTCGAGCTATGGAAGGAGGAATGTCGTTGGCCAAACGGTATGGAGTGGATGAAGTATACACCAGATCCGAG
TCAATACAAGCTGAGGAGCATAGAACATCATATTCACTTGAAGAAGGGACAGATCCGGTCAATGTAAGGCATTATAGATATGCATACCACCAAAAAGAGATGGAA
AAATTGATGGATGAGATGTTGACCTCAGGGGTTATCCGGCCTAGCACCAGTCCATACTCGAGCCCTATCTTGTTGGTGAAGAAGAAGGATGGGGGATGGAGGTTT
TGTGTGGATTATAGAGCGTTGAACAATGTTACTATACCAGACAATTTTCTGATTCCCATTGACTTGAAGTCGGGATACCACCAAATATGGACGCACCCTGGTGAC
ATTGAGAAAATTGCATTCTGTACGCATGAAGGCCACTGTGAATTTTTAGTCATACCCTTCGCATTGTCTAATGCTCCGTCCACCTTCCAGGCGCTTATGAATGCA
ATTTTCAAGCCTTATTTGAGGAGTAAAACTGTGGAGGAGCATGTGAAGCATCTAGAGTCAGTATTTTCTGTGCTTCGGGAGAATGAGTTGTATGCTAATAAGAAT
AAGTGCCACTATGCTAGAGTGGCTAACCCCACTAATATTAAGGAGGTACGTGGGTTGACGGGGTTGACAGGATATTATAAACGTTTTGTCCAACATTATGGAAGC
ATGGCAGCCCTCTTAAGCCAATTGTTAAAAGGAGGAGGAGCATTTGAATGGAATGAAGAGGCATATGAGGCTTTCGAACGATTGAAGAGGGCTATGATGACCTTA
CCGTCCTTTGAGGTGGAAACTGACACATCGGGGTATGGAATTAGGGCCGTTTTGTCTCAACAGAGGAGGCCAATTGCGTATTATAGCAACACTCTGTCACTGAAG
GACAGAGGAAAACCAGTTTATGAGAGGGAACTGATGGCCGTGGTTATGGCGGCGCAACGATGGAGACCTTATTTGGTGATACAACCACAACATCAGAAGTGGATT
GCTAAGCTGTTGGGGTATGATTTCCAAGTTGTTTACAGACCAGGATTGGAGAACGAAGCTGCTGATGCTTTGTCGTGCATACCTCCCTCTATGCATTTGGCCCAT
TTGTCAGCCCCTACTATTGTCGATGTGGAAGTCATCAAATTTGAGGTAGAGGCTGAAACAAAATTGAAGGACGTGAAAAGACTTCAACTCGAGCTGATATTGTAC
ATATGTATGGGAAACTTGGTGACATGGCACAGTAAGAAGCAAAATGTAATGGCTCGAAATAGTGCTGAGGCAGAATATAGGGCCATGACTAACAGAGTATGTGAG
ATTTTGTGGATTCATAGAATACTTAAAGAGTCAAGAATGGAGGTCAAAGCTCCAATGAAGCTCTTTCATGATAACAAAGCTGCTTTAAACATTGCACAAAATCCA
ATCCAACATGATAGCAGTCGCACAAAGCACATAAAGGTGGATTGA
Protein sequence
MTAEDDTKLLWQHATKNERLNGGGGNIAWQVGAHLLKLGGYGIGVCKKATSKDLAKMQKLEDEAKAQILKNAPKQVPLPPSQHMQTETHSFRTGSTGLAFALAAN
NALSVCIVIDGWSDSQRRPLINFMEITEGGPMFLKAIVCSSEIKDKYFIANLMKEVINEVGHENMIQIITDNAANFLTLNLALKNICAARNIASNQHVFAECSWI
SEISDDVMFVKNFIMNHSMRLAIFDEFVHLKLLSVAETRFASTIIMLKRFKLIKGGLQIMVISDKWGCYREDDVEKAKHVKELLVGHPVRPPIVLTYSRKGNKGK
GIVCGSVAAEGRGGLFGFPSVLVFAVVPYQIGIRVFNLGKIGKMAQKKFEERIDAMDQEVSEIRVEIRRLPEIEETLVSLTKTRGQKTTIKASAESGSKSASSET
MGPADTNTMKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHRCKNQEQRELRILVVMEDDELEVFDDKDTKGEQVELQMMKAKGEIQIGA
KLLINSVVGLTYSGMMKVKGRIQEEEVTMLIDCATHNYIAERLVSTLQSPIVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVILGMQR
LHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLKRFLIECRAMEGGMSLAKRYGVDEVYTRSESIQAEEHRTSYSLEEGTDPVNVRHYRYAYHQKEME
KLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIPIDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNA
IFKPYLRSKTVEEHVKHLESVFSVLRENELYANKNKCHYARVANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTL
PSFEVETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYLVIQPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAH
LSAPTIVDVEVIKFEVEAETKLKDVKRLQLELILYICMGNLVTWHSKKQNVMARNSAEAEYRAMTNRVCEILWIHRILKESRMEVKAPMKLFHDNKAALNIAQNP
IQHDSSRTKHIKVD