; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G01910 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G01910
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionTy3/gypsy retrotransposon protein
Genome locationClcChr04:6171776..6176665
RNA-Seq ExpressionClc04G01910
SyntenyClc04G01910
Gene Ontology termsNA
InterPro domainsIPR007021 - Domain of unknown function DUF659
IPR012337 - Ribonuclease H-like superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025132.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]5.7e-19849.45Show/hide
Query:  IRVEIRRLPEIEETLASLTKTRGQKTTIKASAESGSKSASSETMGPADTNTMKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHR
        +++E R +   E  L+S   ++  +        + S    + T G      M+TITLR V+T  N +EGP KRLTDAEFQA REKGL FRC E+Y AGHR
Subjt:  IRVEIRRLPEIEETLASLTKTRGQKTTIKASAESGSKSASSETMGPADTNTMKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHR

Query:  CKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKLLINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSP
        CK ++ +ELR+L+V E+ +ELE+ +++    E  E++  + K    +  +L INSVVGLT  G MKVKGR+ +EEV +LIDC ATHN+IAE+LV  LQ P
Subjt:  CKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKLLINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSP

Query:  IVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVILGMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK----
        + ETPNYGVILGSG++I GKG+ + V+L +G   + DSFLPLE  GVD ILGMQ LH++G T+VD  NL +T  + G  + +KGD SLTK + SLK    
Subjt:  IVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVILGMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK----

Query:  -------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTRSESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLT
                +L+ECR +                 EGG   A      +V+ R   +  +   EH     L++G DPVNVR YRYA+ QK EME+L+DEML+
Subjt:  -------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTRSESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLT

Query:  SGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFA
        SG+IRPSTSPYSSP+LLV+K+DG WRFCVDYRALNNVT+PD F IP                 IDLK+GYHQI  HP DIEK AF THEGH EF+V+PF 
Subjt:  SGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFA

Query:  LSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHYARV---------------------------ANPT
        L+NAPSTFQALMN +FKP+LR             SK +EEH +HLE V  +LRE+ELYAN +KCH+A+                              P 
Subjt:  LSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHYARV---------------------------ANPT

Query:  NIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTL
        N++E+RG  GLTGYY+RFVQ+YGS++A L+QLLK  GA++W EE   AFE+LK+AMMTLP          FE+E+D SG+G+ AVL Q ++P+AY+S  L
Subjt:  NIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTL

Query:  SLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VIQPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTI
        S +DR +PVYERELMAVV A QRWRPYL                     VIQPQ+Q+WIAKLLGY F+V+Y+PGLEN+AADALS I P+ HL  L+AP +
Subjt:  SLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VIQPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTI

Query:  VDVEVIKFEVEAETKLKDV
        +DVEVI+ EV  +  L+++
Subjt:  VDVEVIKFEVEAETKLKDV

KAA0049630.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.2e-19849.45Show/hide
Query:  IRVEIRRLPEIEETLASLTKTRGQKTTIKASAESGSKSASSETMGPADTNTMKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHR
        +++E R +   E  L+S   ++  +        + S    + T G      M+TITLR V+T  N +EGP KRLTDAEFQA REKGL FRC E+Y AGHR
Subjt:  IRVEIRRLPEIEETLASLTKTRGQKTTIKASAESGSKSASSETMGPADTNTMKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHR

Query:  CKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKLLINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSP
        CK ++ +ELR+L+V E+ +ELE+ +++    E  E++  + K    +  +L INSVVGLT  G MKVKGR+ +EEV +LIDC ATHN+IAE+LV  LQ P
Subjt:  CKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKLLINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSP

Query:  IVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVILGMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK----
        + ETPNYGVILGSG++I GKG+ + V+L +G   + DSFLPLE  GVD ILGMQ LH++G T+VD  NLT+T  + G  + +KGD SLTK + SLK    
Subjt:  IVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVILGMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK----

Query:  -------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTRSESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLT
                +L+ECR +                 EGG + A      +V+ R   +  +   EH     L++G DP+NVR YRYA+ QK EME+L+DEML+
Subjt:  -------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTRSESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLT

Query:  SGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFA
        SG+IRPSTSPYSSP+LLV+KKDG WRFCVDYRALNNVT+PD F IP                 IDLK+GYHQI  HP DIEK AF THEGH EF+V+PF 
Subjt:  SGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFA

Query:  LSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHYARV---------------------------ANPT
        L+NAPSTFQALMN +FKP+LR             SK +EEH +HLE V  +LRE+ELYAN +KCH+A+                              P 
Subjt:  LSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHYARV---------------------------ANPT

Query:  NIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTL
        N++E+RG  GLTGYY+RFVQ+YGS++A L+QLLK  GA++W EE   AFE+LK+AMMTLP          FE+E+D SG+G+ AVL Q ++P+AY+S  L
Subjt:  NIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTL

Query:  SLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VIQPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTI
        S +DR +PVYERELMAVV A QRWRPYL                     VIQPQ+Q+WI KLLGY F+V+Y+PGLEN+AADALS I P+ HL  L+AP +
Subjt:  SLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VIQPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTI

Query:  VDVEVIKFEVEAETKLKDV
        +DVEVI+ EV  +  L+++
Subjt:  VDVEVIKFEVEAETKLKDV

KAA0049776.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.6e-19849.57Show/hide
Query:  IRVEIRRLPEIEETLASLTKTRGQKTTIKASAESGSKSASSETMGPADTNTMKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHR
        +++E R +   E  L+S   ++  +        + S    + T G      M+TITLR V+T  N +EGP KRLTDAEFQA REKGL FRC E+Y AGHR
Subjt:  IRVEIRRLPEIEETLASLTKTRGQKTTIKASAESGSKSASSETMGPADTNTMKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHR

Query:  CKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKLLINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSP
        CK ++ +ELR+L+V E+ +ELE+ +++    E  E++  + K    +  +L INSVVGLT  G MKVKGR+ +EEV +LIDC ATHN+IAE+LV  LQ P
Subjt:  CKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKLLINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSP

Query:  IVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVILGMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK----
        + ETPNYGVILGSG++I GKG+ + V+L +G   + DSFLPLE  GVD ILGMQ LH++G T+VD  NL +T  + G  + +KGD SLTK + SLK    
Subjt:  IVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVILGMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK----

Query:  -------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTRSESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLT
                +L+ECR +                 EGG   A      +V+ R   +  +   EH     L++G DPVNVR YRYA+ QK EME+L+DEML+
Subjt:  -------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTRSESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLT

Query:  SGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFA
        SG+IRPSTSPYSSP+LLV+KKDG WRFCVDYRALNNVT+PD F IP                 IDLK+GYHQI  HP DIEK AF THEGH EF+V+PF 
Subjt:  SGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFA

Query:  LSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHYARV---------------------------ANPT
        L+NAPSTFQALMN +FKP+LR             SK +EEH +HLE V  +LRE+ELYAN +KCH+A+                              P 
Subjt:  LSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHYARV---------------------------ANPT

Query:  NIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTL
        N++E+RG  GLTGYY+RFVQ+YGS++A L+QLLK  GA++W EE   AFE+LK+AMMTLP          FE+E+D SG+G+ AVL Q ++P+AY+S  L
Subjt:  NIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTL

Query:  SLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VIQPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTI
        S +DR +PVYERELMAVV A QRWRPYL                     VIQPQ+Q+WIAKLLGY F+V+Y+PGLEN+AADALS I P+ HL  L+AP +
Subjt:  SLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VIQPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTI

Query:  VDVEVIKFEVEAETKLKDV
        +DVEVI+ EV  +  L+++
Subjt:  VDVEVIKFEVEAETKLKDV

TYK15990.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.6e-19849.57Show/hide
Query:  IRVEIRRLPEIEETLASLTKTRGQKTTIKASAESGSKSASSETMGPADTNTMKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHR
        +++E R +   E  L+S   ++  +        + S    + T G      M+TITLR V+T  N +EGP KRLTDAEFQA REKGL FRC E+Y AGHR
Subjt:  IRVEIRRLPEIEETLASLTKTRGQKTTIKASAESGSKSASSETMGPADTNTMKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHR

Query:  CKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKLLINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSP
        CK ++ +ELR+L+V E+ +ELE+ +++    E  E++  + K    +  +L INSVVGLT  G MKVKGR+ +EEV +LIDC ATHN+IAE+LV  LQ P
Subjt:  CKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKLLINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSP

Query:  IVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVILGMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK----
        + ETPNYGVILGSG++I GKG+ + V+L +G   + DSFLPLE  GVD ILGMQ LH++G T+VD  NL +T  + G  + +KGD SLTK + SLK    
Subjt:  IVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVILGMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK----

Query:  -------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTRSESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLT
                +L+ECR +                 EGG   A      +V+ R   +  +   EH     L++G DPVNVR YRYA+ QK EME+L+DEML+
Subjt:  -------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTRSESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLT

Query:  SGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFA
        SG+IRPSTSPYSSP+LLV+KKDG WRFCVDYRALNNVT+PD F IP                 IDLK+GYHQI  HP DIEK AF THEGH EF+V+PF 
Subjt:  SGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFA

Query:  LSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHYARV---------------------------ANPT
        L+NAPSTFQALMN +FKP+LR             SK +EEH +HLE V  +LRE+ELYAN +KCH+A+                              P 
Subjt:  LSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHYARV---------------------------ANPT

Query:  NIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTL
        N++E+RG  GLTGYY+RFVQ+YGS++A L+QLLK  GA++W EE   AFE+LK+AMMTLP          FE+E+D SG+G+ AVL Q ++P+AY+S  L
Subjt:  NIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTL

Query:  SLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VIQPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTI
        S +DR +PVYERELMAVV A QRWRPYL                     VIQPQ+Q+WIAKLLGY F+V+Y+PGLEN+AADALS I P+ HL  L+AP +
Subjt:  SLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VIQPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTI

Query:  VDVEVIKFEVEAETKLKDV
        +DVEVI+ EV  +  L+++
Subjt:  VDVEVIKFEVEAETKLKDV

TYK23090.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.4e-19849.57Show/hide
Query:  IRVEIRRLPEIEETLASLTKTRGQKTTIKASAESGSKSASSETMGPADTNTMKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHR
        +++E R +   E  L+S   ++  +        + S    + T G      M+TITLR V+T  N +EGP KRLTDAEFQA REKGL FRC E+Y AGHR
Subjt:  IRVEIRRLPEIEETLASLTKTRGQKTTIKASAESGSKSASSETMGPADTNTMKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHR

Query:  CKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKLLINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSP
        CK ++ +ELR+L+V E+ +ELE+ +++    E  E++  + K    +  +L INSVVGLT  G MKVKGR+ +EEV +LIDC ATHN+IAE+LV  LQ P
Subjt:  CKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKLLINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSP

Query:  IVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVILGMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK----
        + ETPNYGVILGSG++I GKG+ + V+L +G   + DSFLPLE  GVD ILGMQ LH++G T+VD  NL +T  + G  + +KGD SLTK + SLK    
Subjt:  IVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVILGMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK----

Query:  -------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTRSESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLT
                +L+ECR +                 EGG   A      +V+ R   +  +   EH     L++G DPVNVR YRYA+ QK EME+L+DEML+
Subjt:  -------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTRSESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLT

Query:  SGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFA
        SG+IRPSTSPYSSP+LLV+KKDG WRFCVDYRALNNVT+PD F IP                 IDLK+GYHQI  HP DIEK AF THEGH EF+V+PF 
Subjt:  SGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFA

Query:  LSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHYARV---------------------------ANPT
        L+NAPSTFQALMN +FKP+LR             SK +EEH +HLE V  +LRE+ELYAN +KCH+A+                              P 
Subjt:  LSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHYARV---------------------------ANPT

Query:  NIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTL
        N++E+RG  GLTGYY+RFVQ+YGS++A L+QLLK  GA++W EE   AFE+LK+AMMTLP          FE+E+D SG+G+ AVL Q ++P+AY+S  L
Subjt:  NIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTL

Query:  SLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VIQPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTI
        S +DR +PVYERELMAVV A QRWRPYL                     VIQPQ+Q+WIAKLLGY F+V+Y+PGLEN+AADALS I P+ HL  L+AP +
Subjt:  SLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VIQPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTI

Query:  VDVEVIKFEVEAETKLKDV
        +DVEVI+ EV  +  L+++
Subjt:  VDVEVIKFEVEAETKLKDV

TrEMBL top hitse value%identityAlignment
A0A5A7SIV7 Ty3/gypsy retrotransposon protein2.8e-19849.45Show/hide
Query:  IRVEIRRLPEIEETLASLTKTRGQKTTIKASAESGSKSASSETMGPADTNTMKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHR
        +++E R +   E  L+S   ++  +        + S    + T G      M+TITLR V+T  N +EGP KRLTDAEFQA REKGL FRC E+Y AGHR
Subjt:  IRVEIRRLPEIEETLASLTKTRGQKTTIKASAESGSKSASSETMGPADTNTMKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHR

Query:  CKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKLLINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSP
        CK ++ +ELR+L+V E+ +ELE+ +++    E  E++  + K    +  +L INSVVGLT  G MKVKGR+ +EEV +LIDC ATHN+IAE+LV  LQ P
Subjt:  CKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKLLINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSP

Query:  IVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVILGMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK----
        + ETPNYGVILGSG++I GKG+ + V+L +G   + DSFLPLE  GVD ILGMQ LH++G T+VD  NL +T  + G  + +KGD SLTK + SLK    
Subjt:  IVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVILGMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK----

Query:  -------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTRSESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLT
                +L+ECR +                 EGG   A      +V+ R   +  +   EH     L++G DPVNVR YRYA+ QK EME+L+DEML+
Subjt:  -------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTRSESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLT

Query:  SGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFA
        SG+IRPSTSPYSSP+LLV+K+DG WRFCVDYRALNNVT+PD F IP                 IDLK+GYHQI  HP DIEK AF THEGH EF+V+PF 
Subjt:  SGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFA

Query:  LSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHYARV---------------------------ANPT
        L+NAPSTFQALMN +FKP+LR             SK +EEH +HLE V  +LRE+ELYAN +KCH+A+                              P 
Subjt:  LSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHYARV---------------------------ANPT

Query:  NIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTL
        N++E+RG  GLTGYY+RFVQ+YGS++A L+QLLK  GA++W EE   AFE+LK+AMMTLP          FE+E+D SG+G+ AVL Q ++P+AY+S  L
Subjt:  NIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTL

Query:  SLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VIQPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTI
        S +DR +PVYERELMAVV A QRWRPYL                     VIQPQ+Q+WIAKLLGY F+V+Y+PGLEN+AADALS I P+ HL  L+AP +
Subjt:  SLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VIQPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTI

Query:  VDVEVIKFEVEAETKLKDV
        +DVEVI+ EV  +  L+++
Subjt:  VDVEVIKFEVEAETKLKDV

A0A5A7U2S1 Ty3/gypsy retrotransposon protein5.6e-19949.45Show/hide
Query:  IRVEIRRLPEIEETLASLTKTRGQKTTIKASAESGSKSASSETMGPADTNTMKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHR
        +++E R +   E  L+S   ++  +        + S    + T G      M+TITLR V+T  N +EGP KRLTDAEFQA REKGL FRC E+Y AGHR
Subjt:  IRVEIRRLPEIEETLASLTKTRGQKTTIKASAESGSKSASSETMGPADTNTMKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHR

Query:  CKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKLLINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSP
        CK ++ +ELR+L+V E+ +ELE+ +++    E  E++  + K    +  +L INSVVGLT  G MKVKGR+ +EEV +LIDC ATHN+IAE+LV  LQ P
Subjt:  CKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKLLINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSP

Query:  IVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVILGMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK----
        + ETPNYGVILGSG++I GKG+ + V+L +G   + DSFLPLE  GVD ILGMQ LH++G T+VD  NLT+T  + G  + +KGD SLTK + SLK    
Subjt:  IVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVILGMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK----

Query:  -------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTRSESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLT
                +L+ECR +                 EGG + A      +V+ R   +  +   EH     L++G DP+NVR YRYA+ QK EME+L+DEML+
Subjt:  -------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTRSESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLT

Query:  SGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFA
        SG+IRPSTSPYSSP+LLV+KKDG WRFCVDYRALNNVT+PD F IP                 IDLK+GYHQI  HP DIEK AF THEGH EF+V+PF 
Subjt:  SGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFA

Query:  LSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHYARV---------------------------ANPT
        L+NAPSTFQALMN +FKP+LR             SK +EEH +HLE V  +LRE+ELYAN +KCH+A+                              P 
Subjt:  LSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHYARV---------------------------ANPT

Query:  NIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTL
        N++E+RG  GLTGYY+RFVQ+YGS++A L+QLLK  GA++W EE   AFE+LK+AMMTLP          FE+E+D SG+G+ AVL Q ++P+AY+S  L
Subjt:  NIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTL

Query:  SLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VIQPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTI
        S +DR +PVYERELMAVV A QRWRPYL                     VIQPQ+Q+WI KLLGY F+V+Y+PGLEN+AADALS I P+ HL  L+AP +
Subjt:  SLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VIQPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTI

Query:  VDVEVIKFEVEAETKLKDV
        +DVEVI+ EV  +  L+++
Subjt:  VDVEVIKFEVEAETKLKDV

A0A5A7U6J3 Ty3/gypsy retrotransposon protein1.2e-19849.57Show/hide
Query:  IRVEIRRLPEIEETLASLTKTRGQKTTIKASAESGSKSASSETMGPADTNTMKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHR
        +++E R +   E  L+S   ++  +        + S    + T G      M+TITLR V+T  N +EGP KRLTDAEFQA REKGL FRC E+Y AGHR
Subjt:  IRVEIRRLPEIEETLASLTKTRGQKTTIKASAESGSKSASSETMGPADTNTMKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHR

Query:  CKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKLLINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSP
        CK ++ +ELR+L+V E+ +ELE+ +++    E  E++  + K    +  +L INSVVGLT  G MKVKGR+ +EEV +LIDC ATHN+IAE+LV  LQ P
Subjt:  CKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKLLINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSP

Query:  IVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVILGMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK----
        + ETPNYGVILGSG++I GKG+ + V+L +G   + DSFLPLE  GVD ILGMQ LH++G T+VD  NL +T  + G  + +KGD SLTK + SLK    
Subjt:  IVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVILGMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK----

Query:  -------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTRSESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLT
                +L+ECR +                 EGG   A      +V+ R   +  +   EH     L++G DPVNVR YRYA+ QK EME+L+DEML+
Subjt:  -------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTRSESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLT

Query:  SGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFA
        SG+IRPSTSPYSSP+LLV+KKDG WRFCVDYRALNNVT+PD F IP                 IDLK+GYHQI  HP DIEK AF THEGH EF+V+PF 
Subjt:  SGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFA

Query:  LSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHYARV---------------------------ANPT
        L+NAPSTFQALMN +FKP+LR             SK +EEH +HLE V  +LRE+ELYAN +KCH+A+                              P 
Subjt:  LSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHYARV---------------------------ANPT

Query:  NIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTL
        N++E+RG  GLTGYY+RFVQ+YGS++A L+QLLK  GA++W EE   AFE+LK+AMMTLP          FE+E+D SG+G+ AVL Q ++P+AY+S  L
Subjt:  NIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTL

Query:  SLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VIQPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTI
        S +DR +PVYERELMAVV A QRWRPYL                     VIQPQ+Q+WIAKLLGY F+V+Y+PGLEN+AADALS I P+ HL  L+AP +
Subjt:  SLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VIQPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTI

Query:  VDVEVIKFEVEAETKLKDV
        +DVEVI+ EV  +  L+++
Subjt:  VDVEVIKFEVEAETKLKDV

A0A5D3CXB1 Ty3/gypsy retrotransposon protein1.2e-19849.57Show/hide
Query:  IRVEIRRLPEIEETLASLTKTRGQKTTIKASAESGSKSASSETMGPADTNTMKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHR
        +++E R +   E  L+S   ++  +        + S    + T G      M+TITLR V+T  N +EGP KRLTDAEFQA REKGL FRC E+Y AGHR
Subjt:  IRVEIRRLPEIEETLASLTKTRGQKTTIKASAESGSKSASSETMGPADTNTMKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHR

Query:  CKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKLLINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSP
        CK ++ +ELR+L+V E+ +ELE+ +++    E  E++  + K    +  +L INSVVGLT  G MKVKGR+ +EEV +LIDC ATHN+IAE+LV  LQ P
Subjt:  CKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKLLINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSP

Query:  IVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVILGMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK----
        + ETPNYGVILGSG++I GKG+ + V+L +G   + DSFLPLE  GVD ILGMQ LH++G T+VD  NL +T  + G  + +KGD SLTK + SLK    
Subjt:  IVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVILGMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK----

Query:  -------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTRSESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLT
                +L+ECR +                 EGG   A      +V+ R   +  +   EH     L++G DPVNVR YRYA+ QK EME+L+DEML+
Subjt:  -------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTRSESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLT

Query:  SGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFA
        SG+IRPSTSPYSSP+LLV+KKDG WRFCVDYRALNNVT+PD F IP                 IDLK+GYHQI  HP DIEK AF THEGH EF+V+PF 
Subjt:  SGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFA

Query:  LSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHYARV---------------------------ANPT
        L+NAPSTFQALMN +FKP+LR             SK +EEH +HLE V  +LRE+ELYAN +KCH+A+                              P 
Subjt:  LSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHYARV---------------------------ANPT

Query:  NIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTL
        N++E+RG  GLTGYY+RFVQ+YGS++A L+QLLK  GA++W EE   AFE+LK+AMMTLP          FE+E+D SG+G+ AVL Q ++P+AY+S  L
Subjt:  NIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTL

Query:  SLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VIQPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTI
        S +DR +PVYERELMAVV A QRWRPYL                     VIQPQ+Q+WIAKLLGY F+V+Y+PGLEN+AADALS I P+ HL  L+AP +
Subjt:  SLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VIQPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTI

Query:  VDVEVIKFEVEAETKLKDV
        +DVEVI+ EV  +  L+++
Subjt:  VDVEVIKFEVEAETKLKDV

A0A5D3DI73 Ty3/gypsy retrotransposon protein1.6e-19849.57Show/hide
Query:  IRVEIRRLPEIEETLASLTKTRGQKTTIKASAESGSKSASSETMGPADTNTMKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHR
        +++E R +   E  L+S   ++  +        + S    + T G      M+TITLR V+T  N +EGP KRLTDAEFQA REKGL FRC E+Y AGHR
Subjt:  IRVEIRRLPEIEETLASLTKTRGQKTTIKASAESGSKSASSETMGPADTNTMKTITLRGVSTLGNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHR

Query:  CKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKLLINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSP
        CK ++ +ELR+L+V E+ +ELE+ +++    E  E++  + K    +  +L INSVVGLT  G MKVKGR+ +EEV +LIDC ATHN+IAE+LV  LQ P
Subjt:  CKNQEQRELRILVVMED-DELEVFDDKDTKGEQVELQMMKAKGEIQIGAKLLINSVVGLTYSGMMKVKGRIQEEEVTMLIDC-ATHNYIAERLVSTLQSP

Query:  IVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVILGMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK----
        + ETPNYGVILGSG++I GKG+ + V+L +G   + DSFLPLE  GVD ILGMQ LH++G T+VD  NL +T  + G  + +KGD SLTK + SLK    
Subjt:  IVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVILGMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGDLSLTKLRGSLK----

Query:  -------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTRSESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLT
                +L+ECR +                 EGG   A      +V+ R   +  +   EH     L++G DPVNVR YRYA+ QK EME+L+DEML+
Subjt:  -------RFLIECRAM-----------------EGGMSLAKRYGVDEVYTRSESIQAE---EHRTSYSLEEGTDPVNVRHYRYAYHQK-EMEKLMDEMLT

Query:  SGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFA
        SG+IRPSTSPYSSP+LLV+KKDG WRFCVDYRALNNVT+PD F IP                 IDLK+GYHQI  HP DIEK AF THEGH EF+V+PF 
Subjt:  SGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFA

Query:  LSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHYARV---------------------------ANPT
        L+NAPSTFQALMN +FKP+LR             SK +EEH +HLE V  +LRE+ELYAN +KCH+A+                              P 
Subjt:  LSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHYARV---------------------------ANPT

Query:  NIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTL
        N++E+RG  GLTGYY+RFVQ+YGS++A L+QLLK  GA++W EE   AFE+LK+AMMTLP          FE+E+D SG+G+ AVL Q ++P+AY+S  L
Subjt:  NIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTL

Query:  SLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VIQPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTI
        S +DR +PVYERELMAVV A QRWRPYL                     VIQPQ+Q+WIAKLLGY F+V+Y+PGLEN+AADALS I P+ HL  L+AP +
Subjt:  SLKDRGKPVYERELMAVVMAAQRWRPYL---------------------VIQPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTI

Query:  VDVEVIKFEVEAETKLKDV
        +DVEVI+ EV  +  L+++
Subjt:  VDVEVIKFEVEAETKLKDV

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.63.3e-4730.9Show/hide
Query:  VNVRH-----YRYAY---HQKEMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGG-----WRFCVDYRALNNVTIPD-----------------NFLIP
        +N +H      +Y+Y   +++E+E  + +ML  G+IR S SPY+SPI +V KK        +R  +DYR LN +T+ D                 N+   
Subjt:  VNVRH-----YRYAY---HQKEMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGG-----WRFCVDYRALNNVTIPD-----------------NFLIP

Query:  IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNK
        IDL  G+HQI   P  + K AF T  GH E+L +PF L NAP+TFQ  MN I +P L              S +++EH++ L  VF  L +  L    +K
Subjt:  IDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNK

Query:  CHYARVAN---------------------------PTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYE-AFERLKRAMMTLP--
        C + +                              PT  KE++   GLTGYY++F+ ++  +A  +++ LK     +     Y+ AF++LK  +   P  
Subjt:  CHYARVAN---------------------------PTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYE-AFERLKRAMMTLP--

Query:  -------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYLV-----IQPQHQ----------------KWIAKL
                F + TD S   + AVLSQ   P++Y S TL+  +      E+EL+A+V A + +R YL+     I   HQ                +W  KL
Subjt:  -------SFEVETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYLV-----IQPQHQ----------------KWIAKL

Query:  LGYDFQVVYRPGLENEAADALSCI
          +DF + Y  G EN  ADALS I
Subjt:  LGYDFQVVYRPGLENEAADALSCI

P20825 Retrovirus-related Pol polyprotein from transposon 2973.5e-4933.49Show/hide
Query:  PVNVRHYRYAY-HQKEMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGG-----WRFCVDYRALNNVTIPDNFLIP-----------------IDLKSG
        P+  + Y  A  H+ E+E  + EML  G+IR S SPY+SP  +V KK        +R  +DYR LN +TIPD + IP                 IDL  G
Subjt:  PVNVRHYRYAY-HQKEMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGG-----WRFCVDYRALNNVTIPDNFLIP-----------------IDLKSG

Query:  YHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHY-AR
        +HQI      I K AF T  GH E+L +PF L NAP+TFQ  MN I +P L              S ++ EH+  ++ VF+ L +  L    +KC +  +
Subjt:  YHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHY-AR

Query:  VAN--------------------------PTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAY-EAFERLKR-----AMMTLPSFE
         AN                          PT  KE+R   GLTGYY++F+ +Y  +A  ++  LK     +  +  Y EAFE+LK       ++ LP FE
Subjt:  VAN--------------------------PTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAY-EAFERLKR-----AMMTLPSFE

Query:  ----VETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYL-----VIQPQHQ----------------KWIAKLLGYDFQ
            + TD S   + AVLSQ   PI++ S TL+  +      E+EL+A+V A + +R YL     +I   HQ                +W  +L  Y F+
Subjt:  ----VETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYL-----VIQPQHQ----------------KWIAKLLGYDFQ

Query:  VVYRPGLENEAADALSCI
        + Y  G EN  ADALS I
Subjt:  VVYRPGLENEAADALSCI

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein3.2e-4229.85Show/hide
Query:  HQKEMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFC
        +++E+ K++ ++L +  I PS SP SSP++LV KKDG +R CVDYR LN  TI D F +P                 +DL SGYHQI   P D  K AF 
Subjt:  HQKEMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFC

Query:  THEGHCEFLVIPFALSNAPSTFQALMNAIFKP-----------YLRSKTVEEHVKHLESVFSVLRENELYANKNKCHYA--------------RVA----
        T  G  E+ V+PF L NAPSTF   M   F+             + S++ EEH KHL++V   L+   L   K KC +A              ++A    
Subjt:  THEGHCEFLVIPFALSNAPSTFQALMNAIFKP-----------YLRSKTVEEHVKHLESVFSVLRENELYANKNKCHYA--------------RVA----

Query:  ---------NPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQ
                  P  +K+ +   G+  YY+RF+ +   +A  +   +      +W E+  +A E+LK A+   P         ++ + TD S  GI AVL +
Subjt:  ---------NPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQ

Query:  QRRP------IAYYSNTLSLKDRGKPVYERELMAVVMAAQRWR-----PYLVIQPQH----------------QKWIAKLLGYDFQVVYRPGLENEAADA
                  + Y+S +L    +  P  E EL+ ++ A   +R      +  ++  H                Q+W+  L  YDF + Y  G +N  ADA
Subjt:  QRRP------IAYYSNTLSLKDRGKPVYERELMAVVMAAQRWR-----PYLVIQPQH----------------QKWIAKLLGYDFQVVYRPGLENEAADA

Query:  LS
        +S
Subjt:  LS

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.1e-4730.07Show/hide
Query:  DPVNVRHYRYAYHQK-EMEKLMDEMLTSGVIRPSTSPYSSPILLVKKK-----DGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKS
        DP+  + Y Y  + + E+E+ +DE+L  G+IRPS SPY+SPI +V KK     +  +R  VD++ LN VTIPD + IP                 +DL S
Subjt:  DPVNVRHYRYAYHQK-EMEKLMDEMLTSGVIRPSTSPYSSPILLVKKK-----DGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKS

Query:  GYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHY--
        G+HQI     DI K AF T  G  EFL +PF L NAP+ FQ +++ I + ++              S+  + H K+L  V + L +  L  N  K H+  
Subjt:  GYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLR-------------SKTVEEHVKHLESVFSVLRENELYANKNKCHY--

Query:  -------------------------ARVANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGA-----------FEWNEEAYEAFERLKRAMM
                                 + +  PT++KE++   G+T YY++F+Q Y  +A  L+ L +G  A              +E A ++F  LK  + 
Subjt:  -------------------------ARVANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGA-----------FEWNEEAYEAFERLKRAMM

Query:  T---------LPSFEVETDTSGYGIRAVLSQ----QRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYLV------IQPQHQ------------
        +            F + TD S + I AVLSQ    + RPIAY S +L+  +      E+E++A++ +    R YL       +   HQ            
Subjt:  T---------LPSFEVETDTSGYGIRAVLSQ----QRRPIAYYSNTLSLKDRGKPVYERELMAVVMAAQRWRPYLV------IQPQHQ------------

Query:  ----KWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMH
            +W A++  Y+ +++Y+PG  N  ADALS IPP ++
Subjt:  ----KWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMH

Q99315 Transposon Ty3-G Gag-Pol polyprotein9.3e-4229.6Show/hide
Query:  HQKEMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFC
        +++E+ K++ ++L +  I PS SP SSP++LV KKDG +R CVDYR LN  TI D F +P                 +DL SGYHQI   P D  K AF 
Subjt:  HQKEMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRFCVDYRALNNVTIPDNFLIP-----------------IDLKSGYHQIWTHPGDIEKIAFC

Query:  THEGHCEFLVIPFALSNAPSTFQALMNAIFKP-----------YLRSKTVEEHVKHLESVFSVLRENELYANKNKCHYA--------------RVA----
        T  G  E+ V+PF L NAPSTF   M   F+             + S++ EEH KHL++V   L+   L   K KC +A              ++A    
Subjt:  THEGHCEFLVIPFALSNAPSTFQALMNAIFKP-----------YLRSKTVEEHVKHLESVFSVLRENELYANKNKCHYA--------------RVA----

Query:  ---------NPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQ
                  P  +K+ +   G+  YY+RF+ +   +A  +   +      +W E+  +A ++LK A+   P         ++ + TD S  GI AVL +
Subjt:  ---------NPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLP---------SFEVETDTSGYGIRAVLSQ

Query:  QRRP------IAYYSNTLSLKDRGKPVYERELMAVVMAAQRWR-----PYLVIQPQH----------------QKWIAKLLGYDFQVVYRPGLENEAADA
                  + Y+S +L    +  P  E EL+ ++ A   +R      +  ++  H                Q+W+  L  YDF + Y  G +N  ADA
Subjt:  QRRP------IAYYSNTLSLKDRGKPVYERELMAVVMAAQRWR-----PYLVIQPQH----------------QKWIAKLLGYDFQVVYRPGLENEAADA

Query:  LS
        +S
Subjt:  LS

Arabidopsis top hitse value%identityAlignment
AT1G79740.1 hAT transposon superfamily1.8e-0825.6Show/hide
Query:  IVIDGWSDSQRRPLINFMEITEGGPMFLKAIVCSSEIKDKYFIANLMKEVINEVGHENMIQIITDNA------ANFLTLNLA--LKNICAARNIASNQHV
        I+ + W+D++ R LINF   +     F K++  SS  K+   +A+L   VI ++G E+++QII DN+      +N L  N A    + CA++ +      
Subjt:  IVIDGWSDSQRRPLINFMEITEGGPMFLKAIVCSSEIKDKYFIANLMKEVINEVGHENMIQIITDNA------ANFLTLNLA--LKNICAARNIASNQHV

Query:  FAECSWISEISDDVMFVKNFIMNHSMRLAIFDEFV-HLKLLSVAETRFASTIIMLKRFKLIKGGLQIM
        F++  W+++       +  F+ N+S  L +  +      ++    TR  S  + L+     K  L+ M
Subjt:  FAECSWISEISDDVMFVKNFIMNHSMRLAIFDEFV-HLKLLSVAETRFASTIIMLKRFKLIKGGLQIM

AT4G08267.1 hAT transposon superfamily protein2.4e-0836.46Show/hide
Query:  IITDNAANFL---------------------TLNLALKNICA-ARNIASNQHVFAECSWISEISDDVMFVKNFIMNHSMRLAIFDEFVHLKLLSVA
        ++T+NA+N++                     TLNLALKN CA + +  +N+ V+  C WI  IS++V ++KN IMN+ +RL +F E   LKLL+++
Subjt:  IITDNAANFL---------------------TLNLALKNICA-ARNIASNQHVFAECSWISEISDDVMFVKNFIMNHSMRLAIFDEFVHLKLLSVA

AT4G15020.1 hAT transposon superfamily4.1e-0825.84Show/hide
Query:  IVIDGWSDSQRRPLINFMEITEGGPMFLKAIVCSSEIKDKYFIANLMKEVINEVGHENMIQIITDNAANFLTLNLALKNI--------CAARNIASNQHV
        I+++  +  +   ++NF+       +FLK++  S  +     +  L+ E++ EVG  N++Q+IT     ++     L  +        CAA  I      
Subjt:  IVIDGWSDSQRRPLINFMEITEGGPMFLKAIVCSSEIKDKYFIANLMKEVINEVGHENMIQIITDNAANFLTLNLALKNI--------CAARNIASNQHV

Query:  FAECSWISEISDDVMFVKNFIMNHSMRLAIFDEF-----VHLKLLSVAETRFASTIIMLKRFKLIKGGLQIMVISDKW
        F +  WISE  +    +  F+ NHS  L +  +F     + L   S + T FA+    L R   +K  LQ MV S +W
Subjt:  FAECSWISEISDDVMFVKNFIMNHSMRLAIFDEF-----VHLKLLSVAETRFASTIIMLKRFKLIKGGLQIMVISDKW

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 89.3e-1343.75Show/hide
Query:  NLVTWHSKKQNVMARNSAEAEYRAMTNRVCEILWIHRILKESRMEVKAPMKLFHDNKAALNIAQNPIQHDSSRTKHIKVD
        +L++W SKKQ V++++SAEAEYRA++    E++W+ +  +E ++ +  P  LF DN AA++IA N + H+  RTKHI+ D
Subjt:  NLVTWHSKKQNVMARNSAEAEYRAMTNRVCEILWIHRILKESRMEVKAPMKLFHDNKAALNIAQNPIQHDSSRTKHIKVD

ATMG00860.1 DNA/RNA polymerases superfamily protein1.5e-1032.23Show/hide
Query:  VKHLESVFSVLRENELYANKNKCHYARV-----------------------------ANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAF
        + HL  V  +  +++ YAN+ KC + +                                P N  E+RG  GLTGYY+RFV++YG +   L++LLK   + 
Subjt:  VKHLESVFSVLRENELYANKNKCHYARV-----------------------------ANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAF

Query:  EWNEEAYEAFERLKRAMMTLP
        +W E A  AF+ LK A+ TLP
Subjt:  EWNEEAYEAFERLKRAMMTLP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGCCGAAGATGATACAAAACTGTTATGGCAACATGCAACAAAGAATGAAAGATTGAATGGAGGAGGTGGGAACATTGCTTGGCAAGTTGGGGCACACCTCCTAAA
ATTGGGAGGATATGGGATTGGAGTATGTAAAAAGGCCACCTCTAAAGATCTTGCAAAAATGCAAAAGTTGGAAGATGAAGCAAAAGCTCAGATTTTAAAGAATGCTCCTA
AACAAGTTCCTTTACCACCTTCACAACATATGCAAACTGAAACTCATTCTTTTCGAACAGGAAGTACAGGTCTCGCATTTGCTTTGGCTGCGAATAATGCATTGTCAGTA
TGTATTGTCATTGACGGATGGAGCGATTCGCAGAGGAGACCCTTGATTAACTTTATGGAAATTACAGAAGGAGGACCAATGTTTCTTAAAGCTATAGTTTGCTCAAGTGA
GATTAAAGATAAATACTTCATAGCAAATCTGATGAAAGAAGTCATAAATGAGGTGGGACACGAGAATATGATTCAAATAATCACTGACAATGCTGCAAATTTTCTTACAC
TCAATCTTGCTTTGAAAAACATTTGTGCTGCAAGAAATATTGCTAGTAATCAACATGTATTTGCAGAGTGTAGTTGGATTTCTGAAATTTCTGATGATGTTATGTTTGTC
AAGAACTTTATCATGAATCATTCCATGAGACTTGCTATATTCGACGAGTTTGTGCATTTGAAGTTGCTTTCAGTAGCAGAAACACGTTTTGCATCTACCATCATTATGCT
TAAAAGATTTAAACTCATTAAGGGTGGTTTGCAAATAATGGTTATCAGTGACAAATGGGGATGCTACAGAGAAGATGATGTGGAGAAAGCAAAACATGTAAAGGAATTGT
TGGTGGGGCATCCTGTTAGGCCACCCATAGTGCTAACATATAGTAGAAAGGGTAATAAGGGGAAAGGTATTGTGTGTGGAAGTGTAGCTGCTGAGGGGAGAGGGGGACTT
TTTGGGTTCCCTAGTGTTCTTGTTTTCGCAGTAGTTCCTTATCAAATTGGCATCAGAGTGTTCAACTTAGGGAAGATTGGGAAGATGGCGCAAAAGAAATTTGAGGAACG
GATAGATGCAATGGACCAGGAGGTGTCGGAAATTCGGGTAGAGATTCGACGGTTACCGGAGATTGAAGAGACTTTAGCATCGTTGACGAAGACACGAGGACAGAAGACGA
CGATTAAAGCAAGTGCAGAATCTGGAAGTAAGAGTGCTTCAAGTGAGACGATGGGACCCGCTGACACCAACACAATGAAGACGATTACTTTAAGAGGAGTATCGACGTTA
GGAAACTGGCAAGAAGGACCGGCGAAGAGGTTGACAGATGCTGAATTCCAAGCAGAACGAGAAAAGGGTCTGTACTTCAGGTGTGATGAACGATATTCGGCCGGCCATAG
ATGCAAAAATCAAGAGCAGAGGGAATTAAGAATACTGGTAGTTATGGAAGATGATGAATTGGAGGTATTCGACGACAAAGATACCAAGGGAGAGCAAGTCGAATTACAAA
TGATGAAAGCGAAAGGGGAGATACAAATAGGGGCGAAATTATTGATTAACTCGGTCGTGGGTTTAACTTATTCGGGGATGATGAAGGTGAAAGGAAGAATCCAGGAAGAA
GAAGTGACTATGTTGATCGACTGCGCAACACATAATTACATTGCCGAGAGGTTAGTTTCAACTTTGCAATCACCAATTGTAGAGACCCCCAACTATGGTGTAATTTTGGG
CTCGGGGTCAGCGATAATAGGGAAGGGCATTCGTAATGTTGTTAAATTGACAATTGGTGAGCTGATTCTGCGGGATAGTTTCTTACCCTTGGAGTTCGAGGGGGTAGATG
TGATTTTGGGTATGCAACGGTTACATACTTTAGGAGTCACGAAGGTCGATTTGTGGAATTTGACCATGACAATTAATCAAGGAGGTAAAACTATTGTGTTGAAAGGGGAC
CTGAGTCTGACGAAGTTGCGAGGGAGCTTAAAGAGATTCTTGATAGAGTGTCGAGCTATGGAAGGAGGAATGTCGTTGGCCAAACGGTATGGAGTGGATGAAGTATACAC
CAGATCCGAGTCAATACAAGCTGAGGAGCATAGAACATCATATTCACTTGAAGAAGGGACAGATCCGGTCAATGTAAGGCATTATAGATATGCATACCACCAAAAAGAGA
TGGAAAAATTGATGGATGAGATGTTGACCTCAGGGGTTATCCGGCCTAGCACCAGTCCATACTCGAGCCCTATCTTGTTGGTGAAGAAGAAGGATGGGGGATGGAGGTTT
TGTGTGGATTATAGAGCGTTGAACAATGTTACTATACCAGACAATTTTCTGATTCCCATTGACTTGAAGTCGGGATACCACCAAATATGGACGCACCCTGGTGACATTGA
GAAAATTGCATTCTGTACGCATGAAGGCCACTGTGAATTTTTAGTCATACCCTTCGCATTGTCTAATGCTCCGTCCACCTTCCAGGCGCTTATGAATGCAATTTTCAAGC
CTTATTTGAGGAGTAAAACTGTGGAGGAGCATGTGAAGCATCTAGAGTCAGTATTTTCTGTGCTTCGGGAGAATGAGTTGTATGCTAATAAGAATAAGTGCCACTATGCT
AGAGTGGCTAACCCCACTAATATTAAGGAGGTACGTGGGTTGACGGGGTTGACAGGATATTATAAACGTTTTGTCCAACATTATGGAAGCATGGCAGCCCTCTTAAGCCA
ATTGTTAAAAGGAGGAGGAGCATTTGAATGGAATGAAGAGGCATATGAGGCTTTCGAACGATTGAAGAGGGCTATGATGACCTTACCGTCCTTTGAGGTGGAAACTGACA
CATCGGGGTATGGAATTAGGGCCGTTTTGTCTCAACAGAGGAGGCCAATTGCGTATTATAGCAACACTCTGTCACTGAAGGACAGAGGAAAACCAGTTTATGAGAGGGAA
CTGATGGCCGTGGTTATGGCGGCGCAACGATGGAGACCTTATTTGGTGATACAACCACAACATCAGAAGTGGATTGCTAAGCTGTTGGGGTATGATTTCCAAGTTGTTTA
CAGACCAGGATTGGAGAACGAAGCTGCTGATGCTTTGTCGTGCATACCTCCCTCTATGCATTTGGCCCATTTGTCAGCCCCTACTATTGTCGATGTGGAAGTCATCAAAT
TTGAGGTAGAGGCTGAAACAAAATTGAAGGACGTGAAAAGACTTCAACTCGAGCTGATATTGTACATATGTATGGGAAACTTGGTGACATGGCACAGTAAGAAGCAAAAT
GTAATGGCTCGAAATAGTGCTGAGGCAGAATATAGGGCCATGACTAACAGAGTATGTGAGATTTTGTGGATTCATAGAATACTTAAAGAGTCAAGAATGGAGGTCAAAGC
TCCAATGAAGCTCTTTCATGATAACAAAGCTGCTTTAAACATTGCACAAAATCCAATCCAACATGATAGCAGTCGCACAAAGCACATAAAGGTGGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGACTGCCGAAGATGATACAAAACTGTTATGGCAACATGCAACAAAGAATGAAAGATTGAATGGAGGAGGTGGGAACATTGCTTGGCAAGTTGGGGCACACCTCCTAAA
ATTGGGAGGATATGGGATTGGAGTATGTAAAAAGGCCACCTCTAAAGATCTTGCAAAAATGCAAAAGTTGGAAGATGAAGCAAAAGCTCAGATTTTAAAGAATGCTCCTA
AACAAGTTCCTTTACCACCTTCACAACATATGCAAACTGAAACTCATTCTTTTCGAACAGGAAGTACAGGTCTCGCATTTGCTTTGGCTGCGAATAATGCATTGTCAGTA
TGTATTGTCATTGACGGATGGAGCGATTCGCAGAGGAGACCCTTGATTAACTTTATGGAAATTACAGAAGGAGGACCAATGTTTCTTAAAGCTATAGTTTGCTCAAGTGA
GATTAAAGATAAATACTTCATAGCAAATCTGATGAAAGAAGTCATAAATGAGGTGGGACACGAGAATATGATTCAAATAATCACTGACAATGCTGCAAATTTTCTTACAC
TCAATCTTGCTTTGAAAAACATTTGTGCTGCAAGAAATATTGCTAGTAATCAACATGTATTTGCAGAGTGTAGTTGGATTTCTGAAATTTCTGATGATGTTATGTTTGTC
AAGAACTTTATCATGAATCATTCCATGAGACTTGCTATATTCGACGAGTTTGTGCATTTGAAGTTGCTTTCAGTAGCAGAAACACGTTTTGCATCTACCATCATTATGCT
TAAAAGATTTAAACTCATTAAGGGTGGTTTGCAAATAATGGTTATCAGTGACAAATGGGGATGCTACAGAGAAGATGATGTGGAGAAAGCAAAACATGTAAAGGAATTGT
TGGTGGGGCATCCTGTTAGGCCACCCATAGTGCTAACATATAGTAGAAAGGGTAATAAGGGGAAAGGTATTGTGTGTGGAAGTGTAGCTGCTGAGGGGAGAGGGGGACTT
TTTGGGTTCCCTAGTGTTCTTGTTTTCGCAGTAGTTCCTTATCAAATTGGCATCAGAGTGTTCAACTTAGGGAAGATTGGGAAGATGGCGCAAAAGAAATTTGAGGAACG
GATAGATGCAATGGACCAGGAGGTGTCGGAAATTCGGGTAGAGATTCGACGGTTACCGGAGATTGAAGAGACTTTAGCATCGTTGACGAAGACACGAGGACAGAAGACGA
CGATTAAAGCAAGTGCAGAATCTGGAAGTAAGAGTGCTTCAAGTGAGACGATGGGACCCGCTGACACCAACACAATGAAGACGATTACTTTAAGAGGAGTATCGACGTTA
GGAAACTGGCAAGAAGGACCGGCGAAGAGGTTGACAGATGCTGAATTCCAAGCAGAACGAGAAAAGGGTCTGTACTTCAGGTGTGATGAACGATATTCGGCCGGCCATAG
ATGCAAAAATCAAGAGCAGAGGGAATTAAGAATACTGGTAGTTATGGAAGATGATGAATTGGAGGTATTCGACGACAAAGATACCAAGGGAGAGCAAGTCGAATTACAAA
TGATGAAAGCGAAAGGGGAGATACAAATAGGGGCGAAATTATTGATTAACTCGGTCGTGGGTTTAACTTATTCGGGGATGATGAAGGTGAAAGGAAGAATCCAGGAAGAA
GAAGTGACTATGTTGATCGACTGCGCAACACATAATTACATTGCCGAGAGGTTAGTTTCAACTTTGCAATCACCAATTGTAGAGACCCCCAACTATGGTGTAATTTTGGG
CTCGGGGTCAGCGATAATAGGGAAGGGCATTCGTAATGTTGTTAAATTGACAATTGGTGAGCTGATTCTGCGGGATAGTTTCTTACCCTTGGAGTTCGAGGGGGTAGATG
TGATTTTGGGTATGCAACGGTTACATACTTTAGGAGTCACGAAGGTCGATTTGTGGAATTTGACCATGACAATTAATCAAGGAGGTAAAACTATTGTGTTGAAAGGGGAC
CTGAGTCTGACGAAGTTGCGAGGGAGCTTAAAGAGATTCTTGATAGAGTGTCGAGCTATGGAAGGAGGAATGTCGTTGGCCAAACGGTATGGAGTGGATGAAGTATACAC
CAGATCCGAGTCAATACAAGCTGAGGAGCATAGAACATCATATTCACTTGAAGAAGGGACAGATCCGGTCAATGTAAGGCATTATAGATATGCATACCACCAAAAAGAGA
TGGAAAAATTGATGGATGAGATGTTGACCTCAGGGGTTATCCGGCCTAGCACCAGTCCATACTCGAGCCCTATCTTGTTGGTGAAGAAGAAGGATGGGGGATGGAGGTTT
TGTGTGGATTATAGAGCGTTGAACAATGTTACTATACCAGACAATTTTCTGATTCCCATTGACTTGAAGTCGGGATACCACCAAATATGGACGCACCCTGGTGACATTGA
GAAAATTGCATTCTGTACGCATGAAGGCCACTGTGAATTTTTAGTCATACCCTTCGCATTGTCTAATGCTCCGTCCACCTTCCAGGCGCTTATGAATGCAATTTTCAAGC
CTTATTTGAGGAGTAAAACTGTGGAGGAGCATGTGAAGCATCTAGAGTCAGTATTTTCTGTGCTTCGGGAGAATGAGTTGTATGCTAATAAGAATAAGTGCCACTATGCT
AGAGTGGCTAACCCCACTAATATTAAGGAGGTACGTGGGTTGACGGGGTTGACAGGATATTATAAACGTTTTGTCCAACATTATGGAAGCATGGCAGCCCTCTTAAGCCA
ATTGTTAAAAGGAGGAGGAGCATTTGAATGGAATGAAGAGGCATATGAGGCTTTCGAACGATTGAAGAGGGCTATGATGACCTTACCGTCCTTTGAGGTGGAAACTGACA
CATCGGGGTATGGAATTAGGGCCGTTTTGTCTCAACAGAGGAGGCCAATTGCGTATTATAGCAACACTCTGTCACTGAAGGACAGAGGAAAACCAGTTTATGAGAGGGAA
CTGATGGCCGTGGTTATGGCGGCGCAACGATGGAGACCTTATTTGGTGATACAACCACAACATCAGAAGTGGATTGCTAAGCTGTTGGGGTATGATTTCCAAGTTGTTTA
CAGACCAGGATTGGAGAACGAAGCTGCTGATGCTTTGTCGTGCATACCTCCCTCTATGCATTTGGCCCATTTGTCAGCCCCTACTATTGTCGATGTGGAAGTCATCAAAT
TTGAGGTAGAGGCTGAAACAAAATTGAAGGACGTGAAAAGACTTCAACTCGAGCTGATATTGTACATATGTATGGGAAACTTGGTGACATGGCACAGTAAGAAGCAAAAT
GTAATGGCTCGAAATAGTGCTGAGGCAGAATATAGGGCCATGACTAACAGAGTATGTGAGATTTTGTGGATTCATAGAATACTTAAAGAGTCAAGAATGGAGGTCAAAGC
TCCAATGAAGCTCTTTCATGATAACAAAGCTGCTTTAAACATTGCACAAAATCCAATCCAACATGATAGCAGTCGCACAAAGCACATAAAGGTGGATTGA
Protein sequenceShow/hide protein sequence
MTAEDDTKLLWQHATKNERLNGGGGNIAWQVGAHLLKLGGYGIGVCKKATSKDLAKMQKLEDEAKAQILKNAPKQVPLPPSQHMQTETHSFRTGSTGLAFALAANNALSV
CIVIDGWSDSQRRPLINFMEITEGGPMFLKAIVCSSEIKDKYFIANLMKEVINEVGHENMIQIITDNAANFLTLNLALKNICAARNIASNQHVFAECSWISEISDDVMFV
KNFIMNHSMRLAIFDEFVHLKLLSVAETRFASTIIMLKRFKLIKGGLQIMVISDKWGCYREDDVEKAKHVKELLVGHPVRPPIVLTYSRKGNKGKGIVCGSVAAEGRGGL
FGFPSVLVFAVVPYQIGIRVFNLGKIGKMAQKKFEERIDAMDQEVSEIRVEIRRLPEIEETLASLTKTRGQKTTIKASAESGSKSASSETMGPADTNTMKTITLRGVSTL
GNWQEGPAKRLTDAEFQAEREKGLYFRCDERYSAGHRCKNQEQRELRILVVMEDDELEVFDDKDTKGEQVELQMMKAKGEIQIGAKLLINSVVGLTYSGMMKVKGRIQEE
EVTMLIDCATHNYIAERLVSTLQSPIVETPNYGVILGSGSAIIGKGIRNVVKLTIGELILRDSFLPLEFEGVDVILGMQRLHTLGVTKVDLWNLTMTINQGGKTIVLKGD
LSLTKLRGSLKRFLIECRAMEGGMSLAKRYGVDEVYTRSESIQAEEHRTSYSLEEGTDPVNVRHYRYAYHQKEMEKLMDEMLTSGVIRPSTSPYSSPILLVKKKDGGWRF
CVDYRALNNVTIPDNFLIPIDLKSGYHQIWTHPGDIEKIAFCTHEGHCEFLVIPFALSNAPSTFQALMNAIFKPYLRSKTVEEHVKHLESVFSVLRENELYANKNKCHYA
RVANPTNIKEVRGLTGLTGYYKRFVQHYGSMAALLSQLLKGGGAFEWNEEAYEAFERLKRAMMTLPSFEVETDTSGYGIRAVLSQQRRPIAYYSNTLSLKDRGKPVYERE
LMAVVMAAQRWRPYLVIQPQHQKWIAKLLGYDFQVVYRPGLENEAADALSCIPPSMHLAHLSAPTIVDVEVIKFEVEAETKLKDVKRLQLELILYICMGNLVTWHSKKQN
VMARNSAEAEYRAMTNRVCEILWIHRILKESRMEVKAPMKLFHDNKAALNIAQNPIQHDSSRTKHIKVD