; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G13130 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G13130
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionGag protease polyprotein
Genome locationChr5:12651598..12656763
RNA-Seq ExpressionCSPI05G13130
SyntenyCSPI05G13130
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004518 - nuclease activity (molecular function)
GO:0008233 - peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR004242 - Transposon, En/Spm-like
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031931.1 pol protein [Cucumis melo var. makuwa]2.4e-21166.84Show/hide
Query:  MRCPEEHRVQCAAFLLRDKGIIWWRTTMRMLGGDVRHITWDQFKDCFYTKFFLANLRDAKSQEFLEVKQGHMIVEEYDQEFDMQSRFAPELVGNEQAIAE
        M+CPE+ +VQCA F+L D+G  WW TT RMLGGDV  ITW QFK+ FY KFF A+LRDAK QEFL ++QG M VE+YD EFDM SRFAPE++  E A A+
Subjt:  MRCPEEHRVQCAAFLLRDKGIIWWRTTMRMLGGDVRHITWDQFKDCFYTKFFLANLRDAKSQEFLEVKQGHMIVEEYDQEFDMQSRFAPELVGNEQAIAE

Query:  RFVKGLRDEIRGFVRALKPTIQAEALRLAVDMSIGKDEVRVKSFDKGTSSGQKRKAKQ----------------RFVGQSPGGAGDATREKPLCNTCGKR
        +FV+GLR +I+G VRA +P   A+ALRLAVD+S+ +     K+  +G++SGQKRKA+Q                R   Q P  AG+A R KPLC TCGK 
Subjt:  RFVKGLRDEIRGFVRALKPTIQAEALRLAVDMSIGKDEVRVKSFDKGTSSGQKRKAKQ----------------RFVGQSPGGAGDATREKPLCNTCGKR

Query:  HMGRCLMGTRACYKCKQEGHMADRCPLRSTEARQSSQRVRTPQRGTVFATSRSEAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLFVTHACLEVEP
        H+GRCL GTR C+KC+QEGH ADRCPLR T   Q +Q    P +G VFAT+++EAE+AGTVVTGTLPVLGH+AL LFDSGSSHSFISS FV HA LEVEP
Subjt:  HMGRCLMGTRACYKCKQEGHMADRCPLRSTEARQSSQRVRTPQRGTVFATSRSEAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLFVTHACLEVEP

Query:  LDYVLSVSTPSGEIMLSKEKIKTCKIEITSHVLDVTLLVLDMRDFDVILGMDWLADNYASIDCSRKEVVFSPPTEPSFKFKGVGTVVLPKVILALKASKL
        L +VLSVSTPSGE MLSKEK+K C+IEI  HV++VTLLVLDM DFDVILGMDWLA N+ASIDCSRKEV F+PP+  SFKFKG G+  LP+VI A++ASKL
Subjt:  LDYVLSVSTPSGEIMLSKEKIKTCKIEITSHVLDVTLLVLDMRDFDVILGMDWLADNYASIDCSRKEVVFSPPTEPSFKFKGVGTVVLPKVILALKASKL

Query:  LNQGTWSILASVVDTREDEISLTSEPVVREYSDVFPEELPELPPHREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPV
        L+QGTW ILASVVDTRE ++SL+SEPVVR+Y DVFPEELP LPPHREV+FAIELEP T PI RAPYRMA AELKELKVQLQEL DKGFIRPS+SPWGAPV
Subjt:  LNQGTWSILASVVDTREDEISLTSEPVVREYSDVFPEELPELPPHREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPV

Query:  LFVKKKNWSMRLCIDYRELN--------------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVFRSR
        LFVKKK+ SMRLCIDYRELN                     GAT+FSKIDLRSGYHQLRI+D D+PK  FRSR
Subjt:  LFVKKKNWSMRLCIDYRELN--------------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVFRSR

KAA0037291.1 pol protein [Cucumis melo var. makuwa]2.4e-21166.49Show/hide
Query:  MRCPEEHRVQCAAFLLRDKGIIWWRTTMRMLGGDVRHITWDQFKDCFYTKFFLANLRDAKSQEFLEVKQGHMIVEEYDQEFDMQSRFAPELVGNEQAIAE
        M+CPE+ +VQCA F+L D+G  WW TT RMLGGDV  ITW QFK+ FY KFF A+LRDAK QEFL ++QG M VE+YD EFDM SRFAPE++  E A A+
Subjt:  MRCPEEHRVQCAAFLLRDKGIIWWRTTMRMLGGDVRHITWDQFKDCFYTKFFLANLRDAKSQEFLEVKQGHMIVEEYDQEFDMQSRFAPELVGNEQAIAE

Query:  RFVKGLRDEIRGFVRALKPTIQAEALRLAVDMSIGKDEVRVKSFDKGTSSGQKRKAKQRFV----------------GQSPGGAGDATREKPLCNTCGKR
        +FV+GLR +I+G VRA +P   A+ALRLAVD+S+ +     K+  +G++SGQKRKA+Q+ V                 Q P  AG+A R KPLC TCGK 
Subjt:  RFVKGLRDEIRGFVRALKPTIQAEALRLAVDMSIGKDEVRVKSFDKGTSSGQKRKAKQRFV----------------GQSPGGAGDATREKPLCNTCGKR

Query:  HMGRCLMGTRACYKCKQEGHMADRCPLRSTEARQSSQRVRTPQRGTVFATSRSEAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLFVTHACLEVEP
        H+GRCL GTR C+KC+QEGH ADRCPLR T   Q +Q    P +G+VFAT+++EAEKAGTV+TGTLPVLGH+AL LFDSGSSHS+ISS FV HA LEVEP
Subjt:  HMGRCLMGTRACYKCKQEGHMADRCPLRSTEARQSSQRVRTPQRGTVFATSRSEAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLFVTHACLEVEP

Query:  LDYVLSVSTPSGEIMLSKEKIKTCKIEITSHVLDVTLLVLDMRDFDVILGMDWLADNYASIDCSRKEVVFSPPTEPSFKFKGVGTVVLPKVILALKASKL
        L +VLSVSTPSG  MLSKEK+K C+IEI  HV++VTL+VLDM DFDVILGMDWLA N+ASIDCSRKEV F+PP+  SFKFKG G+  LP+VI A++ASKL
Subjt:  LDYVLSVSTPSGEIMLSKEKIKTCKIEITSHVLDVTLLVLDMRDFDVILGMDWLADNYASIDCSRKEVVFSPPTEPSFKFKGVGTVVLPKVILALKASKL

Query:  LNQGTWSILASVVDTREDEISLTSEPVVREYSDVFPEELPELPPHREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPV
        L+QGTW ILASVVDTRE ++SL+SEPVVR+Y DVFPEELP LPPHREV+FAIELEP T PI RAPYRMA AELKELKVQLQEL DKGFIRPSVSPWGAPV
Subjt:  LNQGTWSILASVVDTREDEISLTSEPVVREYSDVFPEELPELPPHREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPV

Query:  LFVKKKNWSMRLCIDYRELN--------------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVFRSR
        LFVKKK+ SMRLCIDYRELN                     GAT+FSKIDLRSGYHQLRI+D D+PK  FRSR
Subjt:  LFVKKKNWSMRLCIDYRELN--------------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVFRSR

KAA0038231.1 gag protease polyprotein [Cucumis melo var. makuwa]7.4e-21367.19Show/hide
Query:  MRCPEEHRVQCAAFLLRDKGIIWWRTTMRMLGGDVRHITWDQFKDCFYTKFFLANLRDAKSQEFLEVKQGHMIVEEYDQEFDMQSRFAPELVGNEQAIAE
        M+CPE+ +VQCA F+L D+G  WW TT RMLGGDV  ITW QFK+ FY KFF A+LRDAK QEFL ++QG M VE+YD EFDM SRFAPE++  E A A+
Subjt:  MRCPEEHRVQCAAFLLRDKGIIWWRTTMRMLGGDVRHITWDQFKDCFYTKFFLANLRDAKSQEFLEVKQGHMIVEEYDQEFDMQSRFAPELVGNEQAIAE

Query:  RFVKGLRDEIRGFVRALKPTIQAEALRLAVDMSIGKDEVRVKSFDKGTSSGQKRKAKQ----------------RFVGQSPGGAGDATREKPLCNTCGKR
        +FVKGLR +I+G VRA +P   A+ALRLAVD+S+ +     K+  +G++SGQKRKA+Q                R   Q P  AG+A R KPLC TCGK 
Subjt:  RFVKGLRDEIRGFVRALKPTIQAEALRLAVDMSIGKDEVRVKSFDKGTSSGQKRKAKQ----------------RFVGQSPGGAGDATREKPLCNTCGKR

Query:  HMGRCLMGTRACYKCKQEGHMADRCPLRSTEARQSSQRVRTPQRGTVFATSRSEAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLFVTHACLEVEP
        H+GRCL GTR C+KC+QEGH ADRCPLR T   Q +Q    P +G VFAT+R+EAEKAGTVV+GTLPVLGH+AL LFDSGSSHSFISS FV+HA LEVEP
Subjt:  HMGRCLMGTRACYKCKQEGHMADRCPLRSTEARQSSQRVRTPQRGTVFATSRSEAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLFVTHACLEVEP

Query:  LDYVLSVSTPSGEIMLSKEKIKTCKIEITSHVLDVTLLVLDMRDFDVILGMDWLADNYASIDCSRKEVVFSPPTEPSFKFKGVGTVVLPKVILALKASKL
        L +VLSVSTPSGE MLSKEK+K C+IEI  HV++VTL+VLDM DFDVILGMDWLA N+ASIDCSRKEV F+PP+  SFKFKG G+  LP+VI A++ASKL
Subjt:  LDYVLSVSTPSGEIMLSKEKIKTCKIEITSHVLDVTLLVLDMRDFDVILGMDWLADNYASIDCSRKEVVFSPPTEPSFKFKGVGTVVLPKVILALKASKL

Query:  LNQGTWSILASVVDTREDEISLTSEPVVREYSDVFPEELPELPPHREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPV
        L+QGTW+ILASVVDTRE ++SL+SEPVVR+Y DVFPEELP LPPHREV+FAIELEP T PI RAPYRMA AELKELKVQLQEL DKGFIRPSVSPWGAPV
Subjt:  LNQGTWSILASVVDTREDEISLTSEPVVREYSDVFPEELPELPPHREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPV

Query:  LFVKKKNWSMRLCIDYRELN--------------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVFRSR
        LFVKKK+ SMRLCIDYRELN                     GAT+FSKIDLRSGYHQLRI+D D+PK  FRSR
Subjt:  LFVKKKNWSMRLCIDYRELN--------------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVFRSR

KAA0043391.1 pol protein [Cucumis melo var. makuwa]3.7e-21267.02Show/hide
Query:  MRCPEEHRVQCAAFLLRDKGIIWWRTTMRMLGGDVRHITWDQFKDCFYTKFFLANLRDAKSQEFLEVKQGHMIVEEYDQEFDMQSRFAPELVGNEQAIAE
        M+CPE+ +VQCA F+L D+G  WW TT RMLGGDV  ITW QFK+ FY KFF A+LRDAK QEFL ++QG M VE+YD EFDM SRFAPE++  E A A+
Subjt:  MRCPEEHRVQCAAFLLRDKGIIWWRTTMRMLGGDVRHITWDQFKDCFYTKFFLANLRDAKSQEFLEVKQGHMIVEEYDQEFDMQSRFAPELVGNEQAIAE

Query:  RFVKGLRDEIRGFVRALKPTIQAEALRLAVDMSIGKDEVRVKSFDKGTSSGQKRKAKQ----------------RFVGQSPGGAGDATREKPLCNTCGKR
        +FV+GLR +I+G VRA +P   A+ALRLAVD+S+ +     K+  +G +SGQKRKA+Q                R   Q P  AG+A R KPLC TCGK 
Subjt:  RFVKGLRDEIRGFVRALKPTIQAEALRLAVDMSIGKDEVRVKSFDKGTSSGQKRKAKQ----------------RFVGQSPGGAGDATREKPLCNTCGKR

Query:  HMGRCLMGTRACYKCKQEGHMADRCPLRSTEARQSSQRVRTPQRGTVFATSRSEAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLFVTHACLEVEP
        H+GRCL GTR C+KC+QEGH ADRCPLR T   Q +Q    P +G VFAT+++EAEKAGTVVTGTLPVLGH+AL LFDSGSSHSFISS FV HA LEVEP
Subjt:  HMGRCLMGTRACYKCKQEGHMADRCPLRSTEARQSSQRVRTPQRGTVFATSRSEAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLFVTHACLEVEP

Query:  LDYVLSVSTPSGEIMLSKEKIKTCKIEITSHVLDVTLLVLDMRDFDVILGMDWLADNYASIDCSRKEVVFSPPTEPSFKFKGVGTVVLPKVILALKASKL
        L +VLSVSTPSGE MLSKEK+K C+IEI  HV++VTL+VLDM DFDVILGMDWLA N+ASIDCSRKEV F+PP+  SFKFKG G+  LP+VI A++ASKL
Subjt:  LDYVLSVSTPSGEIMLSKEKIKTCKIEITSHVLDVTLLVLDMRDFDVILGMDWLADNYASIDCSRKEVVFSPPTEPSFKFKGVGTVVLPKVILALKASKL

Query:  LNQGTWSILASVVDTREDEISLTSEPVVREYSDVFPEELPELPPHREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPV
        L+QGTW ILASVVDTRE ++SL+SEPVVR+Y DVFPEELP LPPHREV+FAIELEP T PI RAPYRMA AELKELKVQLQEL DKGFIRPSVSPWGAPV
Subjt:  LNQGTWSILASVVDTREDEISLTSEPVVREYSDVFPEELPELPPHREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPV

Query:  LFVKKKNWSMRLCIDYRELN--------------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVFRSR
        LFVKKK+ SMRLCIDYRELN                     GAT+FSKIDLRSGYHQLRI+D D+PK  FRSR
Subjt:  LFVKKKNWSMRLCIDYRELN--------------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVFRSR

TYK01613.1 pol protein [Cucumis melo var. makuwa]7.4e-21367.19Show/hide
Query:  MRCPEEHRVQCAAFLLRDKGIIWWRTTMRMLGGDVRHITWDQFKDCFYTKFFLANLRDAKSQEFLEVKQGHMIVEEYDQEFDMQSRFAPELVGNEQAIAE
        M+CPE+ +VQCA F+L D+G  WW TT RMLGGDV  ITW QFK+ FY KFF A+LRDAK QEFL ++QG M VE+YD EFDM SRFAPE++  E A A+
Subjt:  MRCPEEHRVQCAAFLLRDKGIIWWRTTMRMLGGDVRHITWDQFKDCFYTKFFLANLRDAKSQEFLEVKQGHMIVEEYDQEFDMQSRFAPELVGNEQAIAE

Query:  RFVKGLRDEIRGFVRALKPTIQAEALRLAVDMSIGKDEVRVKSFDKGTSSGQKRKAKQ----------------RFVGQSPGGAGDATREKPLCNTCGKR
        +FV+GLR +I+G VRA +P   A+ALRLAVD+S+ +     K+  +G++SGQKRKA+Q                R   Q P  AG+A R KPLC TCGK 
Subjt:  RFVKGLRDEIRGFVRALKPTIQAEALRLAVDMSIGKDEVRVKSFDKGTSSGQKRKAKQ----------------RFVGQSPGGAGDATREKPLCNTCGKR

Query:  HMGRCLMGTRACYKCKQEGHMADRCPLRSTEARQSSQRVRTPQRGTVFATSRSEAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLFVTHACLEVEP
        H+GRCL GTR C+KC+QEGH ADRCPLR T   Q +Q    P +G VFAT+R+EAEKAGTVVTGTLPVLGH+AL LFDSGSSHSFISS FV+HA LEVEP
Subjt:  HMGRCLMGTRACYKCKQEGHMADRCPLRSTEARQSSQRVRTPQRGTVFATSRSEAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLFVTHACLEVEP

Query:  LDYVLSVSTPSGEIMLSKEKIKTCKIEITSHVLDVTLLVLDMRDFDVILGMDWLADNYASIDCSRKEVVFSPPTEPSFKFKGVGTVVLPKVILALKASKL
        L +VLSVSTPSGE MLSKEK+K C+IEI  HV++VTL+VLDM DFDVILGMDWLA N+ASIDCSRKEV F+PP+  SFKFKG G+  LP+VI A++ASKL
Subjt:  LDYVLSVSTPSGEIMLSKEKIKTCKIEITSHVLDVTLLVLDMRDFDVILGMDWLADNYASIDCSRKEVVFSPPTEPSFKFKGVGTVVLPKVILALKASKL

Query:  LNQGTWSILASVVDTREDEISLTSEPVVREYSDVFPEELPELPPHREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPV
        L+QGTW ILASVVDTRE ++SL+SEPVVR+Y DVFPEELP LPPHREV+FAIELEP T PI RAPYRMA AELKELKVQLQEL DKGFIRPSVSPWGAPV
Subjt:  LNQGTWSILASVVDTREDEISLTSEPVVREYSDVFPEELPELPPHREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPV

Query:  LFVKKKNWSMRLCIDYRELN--------------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVFRSR
        LFVKKK+ SMRLCIDYRELN                     GAT+FSKIDLRSGYHQLRI+D D+PK  FRSR
Subjt:  LFVKKKNWSMRLCIDYRELN--------------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVFRSR

TrEMBL top hitse value%identityAlignment
A0A5A7SQU8 Reverse transcriptase1.2e-21166.84Show/hide
Query:  MRCPEEHRVQCAAFLLRDKGIIWWRTTMRMLGGDVRHITWDQFKDCFYTKFFLANLRDAKSQEFLEVKQGHMIVEEYDQEFDMQSRFAPELVGNEQAIAE
        M+CPE+ +VQCA F+L D+G  WW TT RMLGGDV  ITW QFK+ FY KFF A+LRDAK QEFL ++QG M VE+YD EFDM SRFAPE++  E A A+
Subjt:  MRCPEEHRVQCAAFLLRDKGIIWWRTTMRMLGGDVRHITWDQFKDCFYTKFFLANLRDAKSQEFLEVKQGHMIVEEYDQEFDMQSRFAPELVGNEQAIAE

Query:  RFVKGLRDEIRGFVRALKPTIQAEALRLAVDMSIGKDEVRVKSFDKGTSSGQKRKAKQ----------------RFVGQSPGGAGDATREKPLCNTCGKR
        +FV+GLR +I+G VRA +P   A+ALRLAVD+S+ +     K+  +G++SGQKRKA+Q                R   Q P  AG+A R KPLC TCGK 
Subjt:  RFVKGLRDEIRGFVRALKPTIQAEALRLAVDMSIGKDEVRVKSFDKGTSSGQKRKAKQ----------------RFVGQSPGGAGDATREKPLCNTCGKR

Query:  HMGRCLMGTRACYKCKQEGHMADRCPLRSTEARQSSQRVRTPQRGTVFATSRSEAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLFVTHACLEVEP
        H+GRCL GTR C+KC+QEGH ADRCPLR T   Q +Q    P +G VFAT+++EAE+AGTVVTGTLPVLGH+AL LFDSGSSHSFISS FV HA LEVEP
Subjt:  HMGRCLMGTRACYKCKQEGHMADRCPLRSTEARQSSQRVRTPQRGTVFATSRSEAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLFVTHACLEVEP

Query:  LDYVLSVSTPSGEIMLSKEKIKTCKIEITSHVLDVTLLVLDMRDFDVILGMDWLADNYASIDCSRKEVVFSPPTEPSFKFKGVGTVVLPKVILALKASKL
        L +VLSVSTPSGE MLSKEK+K C+IEI  HV++VTLLVLDM DFDVILGMDWLA N+ASIDCSRKEV F+PP+  SFKFKG G+  LP+VI A++ASKL
Subjt:  LDYVLSVSTPSGEIMLSKEKIKTCKIEITSHVLDVTLLVLDMRDFDVILGMDWLADNYASIDCSRKEVVFSPPTEPSFKFKGVGTVVLPKVILALKASKL

Query:  LNQGTWSILASVVDTREDEISLTSEPVVREYSDVFPEELPELPPHREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPV
        L+QGTW ILASVVDTRE ++SL+SEPVVR+Y DVFPEELP LPPHREV+FAIELEP T PI RAPYRMA AELKELKVQLQEL DKGFIRPS+SPWGAPV
Subjt:  LNQGTWSILASVVDTREDEISLTSEPVVREYSDVFPEELPELPPHREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPV

Query:  LFVKKKNWSMRLCIDYRELN--------------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVFRSR
        LFVKKK+ SMRLCIDYRELN                     GAT+FSKIDLRSGYHQLRI+D D+PK  FRSR
Subjt:  LFVKKKNWSMRLCIDYRELN--------------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVFRSR

A0A5A7T9B7 Gag protease polyprotein3.6e-21367.19Show/hide
Query:  MRCPEEHRVQCAAFLLRDKGIIWWRTTMRMLGGDVRHITWDQFKDCFYTKFFLANLRDAKSQEFLEVKQGHMIVEEYDQEFDMQSRFAPELVGNEQAIAE
        M+CPE+ +VQCA F+L D+G  WW TT RMLGGDV  ITW QFK+ FY KFF A+LRDAK QEFL ++QG M VE+YD EFDM SRFAPE++  E A A+
Subjt:  MRCPEEHRVQCAAFLLRDKGIIWWRTTMRMLGGDVRHITWDQFKDCFYTKFFLANLRDAKSQEFLEVKQGHMIVEEYDQEFDMQSRFAPELVGNEQAIAE

Query:  RFVKGLRDEIRGFVRALKPTIQAEALRLAVDMSIGKDEVRVKSFDKGTSSGQKRKAKQ----------------RFVGQSPGGAGDATREKPLCNTCGKR
        +FVKGLR +I+G VRA +P   A+ALRLAVD+S+ +     K+  +G++SGQKRKA+Q                R   Q P  AG+A R KPLC TCGK 
Subjt:  RFVKGLRDEIRGFVRALKPTIQAEALRLAVDMSIGKDEVRVKSFDKGTSSGQKRKAKQ----------------RFVGQSPGGAGDATREKPLCNTCGKR

Query:  HMGRCLMGTRACYKCKQEGHMADRCPLRSTEARQSSQRVRTPQRGTVFATSRSEAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLFVTHACLEVEP
        H+GRCL GTR C+KC+QEGH ADRCPLR T   Q +Q    P +G VFAT+R+EAEKAGTVV+GTLPVLGH+AL LFDSGSSHSFISS FV+HA LEVEP
Subjt:  HMGRCLMGTRACYKCKQEGHMADRCPLRSTEARQSSQRVRTPQRGTVFATSRSEAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLFVTHACLEVEP

Query:  LDYVLSVSTPSGEIMLSKEKIKTCKIEITSHVLDVTLLVLDMRDFDVILGMDWLADNYASIDCSRKEVVFSPPTEPSFKFKGVGTVVLPKVILALKASKL
        L +VLSVSTPSGE MLSKEK+K C+IEI  HV++VTL+VLDM DFDVILGMDWLA N+ASIDCSRKEV F+PP+  SFKFKG G+  LP+VI A++ASKL
Subjt:  LDYVLSVSTPSGEIMLSKEKIKTCKIEITSHVLDVTLLVLDMRDFDVILGMDWLADNYASIDCSRKEVVFSPPTEPSFKFKGVGTVVLPKVILALKASKL

Query:  LNQGTWSILASVVDTREDEISLTSEPVVREYSDVFPEELPELPPHREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPV
        L+QGTW+ILASVVDTRE ++SL+SEPVVR+Y DVFPEELP LPPHREV+FAIELEP T PI RAPYRMA AELKELKVQLQEL DKGFIRPSVSPWGAPV
Subjt:  LNQGTWSILASVVDTREDEISLTSEPVVREYSDVFPEELPELPPHREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPV

Query:  LFVKKKNWSMRLCIDYRELN--------------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVFRSR
        LFVKKK+ SMRLCIDYRELN                     GAT+FSKIDLRSGYHQLRI+D D+PK  FRSR
Subjt:  LFVKKKNWSMRLCIDYRELN--------------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVFRSR

A0A5A7TP96 Reverse transcriptase1.8e-21267.02Show/hide
Query:  MRCPEEHRVQCAAFLLRDKGIIWWRTTMRMLGGDVRHITWDQFKDCFYTKFFLANLRDAKSQEFLEVKQGHMIVEEYDQEFDMQSRFAPELVGNEQAIAE
        M+CPE+ +VQCA F+L D+G  WW TT RMLGGDV  ITW QFK+ FY KFF A+LRDAK QEFL ++QG M VE+YD EFDM SRFAPE++  E A A+
Subjt:  MRCPEEHRVQCAAFLLRDKGIIWWRTTMRMLGGDVRHITWDQFKDCFYTKFFLANLRDAKSQEFLEVKQGHMIVEEYDQEFDMQSRFAPELVGNEQAIAE

Query:  RFVKGLRDEIRGFVRALKPTIQAEALRLAVDMSIGKDEVRVKSFDKGTSSGQKRKAKQ----------------RFVGQSPGGAGDATREKPLCNTCGKR
        +FV+GLR +I+G VRA +P   A+ALRLAVD+S+ +     K+  +G +SGQKRKA+Q                R   Q P  AG+A R KPLC TCGK 
Subjt:  RFVKGLRDEIRGFVRALKPTIQAEALRLAVDMSIGKDEVRVKSFDKGTSSGQKRKAKQ----------------RFVGQSPGGAGDATREKPLCNTCGKR

Query:  HMGRCLMGTRACYKCKQEGHMADRCPLRSTEARQSSQRVRTPQRGTVFATSRSEAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLFVTHACLEVEP
        H+GRCL GTR C+KC+QEGH ADRCPLR T   Q +Q    P +G VFAT+++EAEKAGTVVTGTLPVLGH+AL LFDSGSSHSFISS FV HA LEVEP
Subjt:  HMGRCLMGTRACYKCKQEGHMADRCPLRSTEARQSSQRVRTPQRGTVFATSRSEAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLFVTHACLEVEP

Query:  LDYVLSVSTPSGEIMLSKEKIKTCKIEITSHVLDVTLLVLDMRDFDVILGMDWLADNYASIDCSRKEVVFSPPTEPSFKFKGVGTVVLPKVILALKASKL
        L +VLSVSTPSGE MLSKEK+K C+IEI  HV++VTL+VLDM DFDVILGMDWLA N+ASIDCSRKEV F+PP+  SFKFKG G+  LP+VI A++ASKL
Subjt:  LDYVLSVSTPSGEIMLSKEKIKTCKIEITSHVLDVTLLVLDMRDFDVILGMDWLADNYASIDCSRKEVVFSPPTEPSFKFKGVGTVVLPKVILALKASKL

Query:  LNQGTWSILASVVDTREDEISLTSEPVVREYSDVFPEELPELPPHREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPV
        L+QGTW ILASVVDTRE ++SL+SEPVVR+Y DVFPEELP LPPHREV+FAIELEP T PI RAPYRMA AELKELKVQLQEL DKGFIRPSVSPWGAPV
Subjt:  LNQGTWSILASVVDTREDEISLTSEPVVREYSDVFPEELPELPPHREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPV

Query:  LFVKKKNWSMRLCIDYRELN--------------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVFRSR
        LFVKKK+ SMRLCIDYRELN                     GAT+FSKIDLRSGYHQLRI+D D+PK  FRSR
Subjt:  LFVKKKNWSMRLCIDYRELN--------------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVFRSR

A0A5A7VPI8 Reverse transcriptase1.2e-21166.84Show/hide
Query:  MRCPEEHRVQCAAFLLRDKGIIWWRTTMRMLGGDVRHITWDQFKDCFYTKFFLANLRDAKSQEFLEVKQGHMIVEEYDQEFDMQSRFAPELVGNEQAIAE
        M+CPE+ +VQCA F+L D+G  WW TT RMLGGDV  ITW QFK+ FY KFF A+LRDAK QEFL ++QG M VE+YD EFDM SRFAPE++  E A A+
Subjt:  MRCPEEHRVQCAAFLLRDKGIIWWRTTMRMLGGDVRHITWDQFKDCFYTKFFLANLRDAKSQEFLEVKQGHMIVEEYDQEFDMQSRFAPELVGNEQAIAE

Query:  RFVKGLRDEIRGFVRALKPTIQAEALRLAVDMSIGKDEVRVKSFDKGTSSGQKRKAKQ----------------RFVGQSPGGAGDATREKPLCNTCGKR
        +FV+GLR +I+G VRA +PT  A+ALRLAVD+S+ +     K+  +G++SGQKRKA+Q                R   Q P  AG+A R KPLC TCGK 
Subjt:  RFVKGLRDEIRGFVRALKPTIQAEALRLAVDMSIGKDEVRVKSFDKGTSSGQKRKAKQ----------------RFVGQSPGGAGDATREKPLCNTCGKR

Query:  HMGRCLMGTRACYKCKQEGHMADRCPLRSTEARQSSQRVRTPQRGTVFATSRSEAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLFVTHACLEVEP
        H+GRCL GTR C+KC+QEGH ADRCPLR T   Q +Q    P +G VFAT+++EAEKAGTVVTGTLPVLGH+AL LFDSGSSHSFISS FV HA LEVEP
Subjt:  HMGRCLMGTRACYKCKQEGHMADRCPLRSTEARQSSQRVRTPQRGTVFATSRSEAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLFVTHACLEVEP

Query:  LDYVLSVSTPSGEIMLSKEKIKTCKIEITSHVLDVTLLVLDMRDFDVILGMDWLADNYASIDCSRKEVVFSPPTEPSFKFKGVGTVVLPKVILALKASKL
        L +VLSVSTPSGE MLSKEK+KTC+IEI  HV++VTLLVLDM DFDVILGMDWLA N+ASIDCSRKEV F+PP+  SFKFKG G+  LP+VI A++ASKL
Subjt:  LDYVLSVSTPSGEIMLSKEKIKTCKIEITSHVLDVTLLVLDMRDFDVILGMDWLADNYASIDCSRKEVVFSPPTEPSFKFKGVGTVVLPKVILALKASKL

Query:  LNQGTWSILASVVDTREDEISLTSEPVVREYSDVFPEELPELPPHREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPV
        L+QGTW ILASVVDTRE ++SL+SEPVVR+Y DVFPEELP LPPHREV+FAIE EP T PI RAPYRMA   LKELK+QLQEL DKGFIRPSVSPWGAPV
Subjt:  LNQGTWSILASVVDTREDEISLTSEPVVREYSDVFPEELPELPPHREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPV

Query:  LFVKKKNWSMRLCIDYRELN--------------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVFRSR
        LFVKKK+ SMRLCIDYRELN                     GAT+FSKIDLRSGYHQLRI+D D+PK  FRSR
Subjt:  LFVKKKNWSMRLCIDYRELN--------------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVFRSR

A0A5D3BPI1 Reverse transcriptase3.6e-21367.19Show/hide
Query:  MRCPEEHRVQCAAFLLRDKGIIWWRTTMRMLGGDVRHITWDQFKDCFYTKFFLANLRDAKSQEFLEVKQGHMIVEEYDQEFDMQSRFAPELVGNEQAIAE
        M+CPE+ +VQCA F+L D+G  WW TT RMLGGDV  ITW QFK+ FY KFF A+LRDAK QEFL ++QG M VE+YD EFDM SRFAPE++  E A A+
Subjt:  MRCPEEHRVQCAAFLLRDKGIIWWRTTMRMLGGDVRHITWDQFKDCFYTKFFLANLRDAKSQEFLEVKQGHMIVEEYDQEFDMQSRFAPELVGNEQAIAE

Query:  RFVKGLRDEIRGFVRALKPTIQAEALRLAVDMSIGKDEVRVKSFDKGTSSGQKRKAKQ----------------RFVGQSPGGAGDATREKPLCNTCGKR
        +FV+GLR +I+G VRA +P   A+ALRLAVD+S+ +     K+  +G++SGQKRKA+Q                R   Q P  AG+A R KPLC TCGK 
Subjt:  RFVKGLRDEIRGFVRALKPTIQAEALRLAVDMSIGKDEVRVKSFDKGTSSGQKRKAKQ----------------RFVGQSPGGAGDATREKPLCNTCGKR

Query:  HMGRCLMGTRACYKCKQEGHMADRCPLRSTEARQSSQRVRTPQRGTVFATSRSEAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLFVTHACLEVEP
        H+GRCL GTR C+KC+QEGH ADRCPLR T   Q +Q    P +G VFAT+R+EAEKAGTVVTGTLPVLGH+AL LFDSGSSHSFISS FV+HA LEVEP
Subjt:  HMGRCLMGTRACYKCKQEGHMADRCPLRSTEARQSSQRVRTPQRGTVFATSRSEAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLFVTHACLEVEP

Query:  LDYVLSVSTPSGEIMLSKEKIKTCKIEITSHVLDVTLLVLDMRDFDVILGMDWLADNYASIDCSRKEVVFSPPTEPSFKFKGVGTVVLPKVILALKASKL
        L +VLSVSTPSGE MLSKEK+K C+IEI  HV++VTL+VLDM DFDVILGMDWLA N+ASIDCSRKEV F+PP+  SFKFKG G+  LP+VI A++ASKL
Subjt:  LDYVLSVSTPSGEIMLSKEKIKTCKIEITSHVLDVTLLVLDMRDFDVILGMDWLADNYASIDCSRKEVVFSPPTEPSFKFKGVGTVVLPKVILALKASKL

Query:  LNQGTWSILASVVDTREDEISLTSEPVVREYSDVFPEELPELPPHREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPV
        L+QGTW ILASVVDTRE ++SL+SEPVVR+Y DVFPEELP LPPHREV+FAIELEP T PI RAPYRMA AELKELKVQLQEL DKGFIRPSVSPWGAPV
Subjt:  LNQGTWSILASVVDTREDEISLTSEPVVREYSDVFPEELPELPPHREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPV

Query:  LFVKKKNWSMRLCIDYRELN--------------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVFRSR
        LFVKKK+ SMRLCIDYRELN                     GAT+FSKIDLRSGYHQLRI+D D+PK  FRSR
Subjt:  LFVKKKNWSMRLCIDYRELN--------------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVFRSR

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein1.3e-1028.57Show/hide
Query:  VVREYSDVFPEELPE-LP-PHREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPVLFVKKKNWSMRLCIDYRELN----
        + +E+ D+  E   E LP P + ++F +EL  +   +    Y +   +++ +  ++ +    G IR S +    PV+FV KK  ++R+ +DY+ LN    
Subjt:  VVREYSDVFPEELPE-LP-PHREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPVLFVKKKNWSMRLCIDYRELN----

Query:  ----------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVFR
                         G+TIF+K+DL+S YH +R+R  D  K+ FR
Subjt:  ----------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVFR

P0CT41 Transposon Tf2-12 polyprotein1.3e-1028.57Show/hide
Query:  VVREYSDVFPEELPE-LP-PHREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPVLFVKKKNWSMRLCIDYRELN----
        + +E+ D+  E   E LP P + ++F +EL  +   +    Y +   +++ +  ++ +    G IR S +    PV+FV KK  ++R+ +DY+ LN    
Subjt:  VVREYSDVFPEELPE-LP-PHREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPVLFVKKKNWSMRLCIDYRELN----

Query:  ----------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVFR
                         G+TIF+K+DL+S YH +R+R  D  K+ FR
Subjt:  ----------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVFR

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein9.1e-1228.88Show/hide
Query:  KASKLLNQGTWSILASVVDTREDEISLTSEP---------VVREYSDVFPEELPELPP---HREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQEL
        +AS L   G +S + S + + E   +  S           + ++Y ++   +LP  P    +  V   IE++P        PY +     +E+   +Q+L
Subjt:  KASKLLNQGTWSILASVVDTREDEISLTSEP---------VVREYSDVFPEELPELPP---HREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQEL

Query:  FDKGFIRPSVSPWGAPVLFVKKKNWSMRLCIDYRELN--------------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVF
         D  FI PS SP  +PV+ V KK+ + RLC+DYR LN                      A IF+ +DL SGYHQ+ +   D  K  F
Subjt:  FDKGFIRPSVSPWGAPVLFVKKKNWSMRLCIDYRELN--------------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVF

Q99315 Transposon Ty3-G Gag-Pol polyprotein9.1e-1228.88Show/hide
Query:  KASKLLNQGTWSILASVVDTREDEISLTSEP---------VVREYSDVFPEELPELPP---HREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQEL
        +AS L   G +S + S + + E   +  S           + ++Y ++   +LP  P    +  V   IE++P        PY +     +E+   +Q+L
Subjt:  KASKLLNQGTWSILASVVDTREDEISLTSEP---------VVREYSDVFPEELPELPP---HREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQEL

Query:  FDKGFIRPSVSPWGAPVLFVKKKNWSMRLCIDYRELN--------------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVF
         D  FI PS SP  +PV+ V KK+ + RLC+DYR LN                      A IF+ +DL SGYHQ+ +   D  K  F
Subjt:  FDKGFIRPSVSPWGAPVLFVKKKNWSMRLCIDYRELN--------------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVF

Q9UR07 Transposon Tf2-11 polyprotein1.3e-1028.57Show/hide
Query:  VVREYSDVFPEELPE-LP-PHREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPVLFVKKKNWSMRLCIDYRELN----
        + +E+ D+  E   E LP P + ++F +EL  +   +    Y +   +++ +  ++ +    G IR S +    PV+FV KK  ++R+ +DY+ LN    
Subjt:  VVREYSDVFPEELPE-LP-PHREVDFAIELEPDTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPVLFVKKKNWSMRLCIDYRELN----

Query:  ----------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVFR
                         G+TIF+K+DL+S YH +R+R  D  K+ FR
Subjt:  ----------------NGATIFSKIDLRSGYHQLRIRDSDIPKIVFR

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGTGTCCAGAGGAGCATAGGGTTCAGTGTGCTGCCTTTCTTCTGAGGGACAAAGGTATTATTTGGTGGAGGACCACGATGCGTATGTTAGGTGGAGATGTGAGACA
TATCACTTGGGATCAGTTTAAAGACTGCTTCTATACCAAGTTTTTCTTGGCTAACCTTAGAGATGCTAAAAGCCAGGAATTCTTGGAAGTGAAGCAAGGACATATGATAG
TTGAGGAGTATGACCAGGAATTCGACATGCAGTCGCGCTTTGCCCCTGAGCTCGTTGGTAATGAGCAGGCTATAGCTGAGAGGTTTGTGAAAGGATTGAGGGATGAGATT
AGAGGCTTTGTGCGAGCACTGAAGCCCACCATTCAGGCTGAAGCACTACGTCTAGCAGTGGATATGAGTATTGGGAAGGACGAAGTTCGTGTAAAGAGTTTTGATAAGGG
AACGTCGTCTGGTCAGAAGAGGAAAGCAAAGCAGAGATTTGTGGGACAGAGTCCTGGTGGGGCAGGGGACGCTACTAGAGAGAAGCCACTATGCAACACGTGTGGGAAGC
GTCATATGGGCCGCTGTTTGATGGGAACGAGAGCTTGCTATAAGTGCAAGCAAGAGGGACACATGGCTGATCGTTGTCCCTTAAGGTCTACTGAGGCTAGACAGAGCAGT
CAGAGAGTGAGAACTCCACAGCGGGGTACAGTTTTTGCCACTAGTAGATCAGAGGCAGAGAAGGCTGGCACTGTGGTGACAGGTACACTACCAGTGTTAGGGCACTTTGC
CTTAACCTTGTTTGATTCAGGATCTTCTCATTCCTTTATTTCATCGTTATTTGTGACGCATGCATGCTTAGAGGTGGAACCCTTAGATTATGTTTTGTCAGTGTCTACAC
CATCTGGAGAAATTATGTTGTCTAAGGAGAAGATTAAAACATGTAAAATTGAGATAACTAGTCATGTGCTAGATGTAACTTTGTTAGTGTTGGACATGCGTGACTTTGAT
GTAATCTTAGGCATGGATTGGTTAGCTGATAATTATGCTAGCATTGATTGTTCTCGTAAGGAGGTGGTGTTCAGTCCTCCTACCGAGCCTAGTTTTAAATTCAAAGGGGT
AGGAACAGTGGTGTTGCCTAAAGTGATTTTAGCTTTGAAAGCTAGCAAATTGCTCAACCAGGGTACCTGGAGTATTTTGGCTAGTGTGGTGGATACTAGGGAAGATGAGA
TTTCTTTAACTTCAGAACCTGTGGTAAGAGAATACTCAGATGTGTTTCCAGAAGAGCTTCCAGAACTTCCACCTCATAGGGAGGTCGATTTTGCCATTGAGCTGGAGCCA
GACACTACTCCTATTTTGAGAGCTCCTTATAGGATGGCTTCTGCTGAATTGAAAGAACTGAAGGTACAGTTACAGGAGTTGTTTGACAAAGGTTTTATTCGACCTAGTGT
GTCACCTTGGGGTGCACCAGTACTGTTTGTGAAGAAGAAGAATTGGTCGATGCGTCTTTGCATTGACTATAGAGAGCTGAATAATGGAGCCACGATATTCTCTAAGATTG
ACTTGCGCTCAGGTTATCACCAGTTGAGGATTAGAGACAGTGATATCCCTAAGATTGTGTTTCGTTCGAGGAAGATGAAGTCGAAGAATGGAGATAAGTTCCCCATCTTC
GGCTTCGGCTACATCTTCGTTTCTCTGACTAAGAATTTCTGTTCGGCTCTCTGTTCCTCTTTCACCTTCGTTCTCTCAGTTAAGGCAACACCTAACTCATCCTCATATGG
AAAATCTTCTAAGTTTGATACCCACAAGTATGAAGATAATGATGTTGGAAGTATAAATGAAATGATTGAAGTTGCTCATGAGAAGTATTCAAAAGATCCAAATGAATTTG
AGAAATTGCTTAATGATGCTGAAAAATCATTATTTGAAGGATGCAAAAAATTCACTAAGTTGTCCACATTAGTCAAGTTGTATAATTTGAAAGTTAGAAAAGAATATGCT
GATGCAATCGAATGTTCTGAATGTGGTGAATCAAGGTGGAAGCATGCTAATAATGCAAACAAAGGGAAAAAGAATATCTGTTTAGCATTGTCAGCTGATGGAATCAATCT
ACACAGAGACATGAGTTCTAAATTTAGTTGCTGGCCCATATTGATAGTTATTTACAATCTTCCATCATGGTTGTGCATGAAAAGAAAGTTCATGATGTTATCAATGTTGA
TATCGAGTCCAAGACAACCAGGAGATGACATTGGCATGTTCTTAGCACCACTAATTGAGGATTTAAAACTGTTATGGGAAAGTGGTGTTGAATGTTATGATCCTAATCAA
GATGAAGTATTCAATTTAAGAGCGGTTTTATTATGGACAATAAATGATTTTCCTGCATATGGAAATCTTAGTGGATGTAGTGTATACGACATGTCCAATTTGTGGAGATA
A
mRNA sequenceShow/hide mRNA sequence
ATGAGGTGTCCAGAGGAGCATAGGGTTCAGTGTGCTGCCTTTCTTCTGAGGGACAAAGGTATTATTTGGTGGAGGACCACGATGCGTATGTTAGGTGGAGATGTGAGACA
TATCACTTGGGATCAGTTTAAAGACTGCTTCTATACCAAGTTTTTCTTGGCTAACCTTAGAGATGCTAAAAGCCAGGAATTCTTGGAAGTGAAGCAAGGACATATGATAG
TTGAGGAGTATGACCAGGAATTCGACATGCAGTCGCGCTTTGCCCCTGAGCTCGTTGGTAATGAGCAGGCTATAGCTGAGAGGTTTGTGAAAGGATTGAGGGATGAGATT
AGAGGCTTTGTGCGAGCACTGAAGCCCACCATTCAGGCTGAAGCACTACGTCTAGCAGTGGATATGAGTATTGGGAAGGACGAAGTTCGTGTAAAGAGTTTTGATAAGGG
AACGTCGTCTGGTCAGAAGAGGAAAGCAAAGCAGAGATTTGTGGGACAGAGTCCTGGTGGGGCAGGGGACGCTACTAGAGAGAAGCCACTATGCAACACGTGTGGGAAGC
GTCATATGGGCCGCTGTTTGATGGGAACGAGAGCTTGCTATAAGTGCAAGCAAGAGGGACACATGGCTGATCGTTGTCCCTTAAGGTCTACTGAGGCTAGACAGAGCAGT
CAGAGAGTGAGAACTCCACAGCGGGGTACAGTTTTTGCCACTAGTAGATCAGAGGCAGAGAAGGCTGGCACTGTGGTGACAGGTACACTACCAGTGTTAGGGCACTTTGC
CTTAACCTTGTTTGATTCAGGATCTTCTCATTCCTTTATTTCATCGTTATTTGTGACGCATGCATGCTTAGAGGTGGAACCCTTAGATTATGTTTTGTCAGTGTCTACAC
CATCTGGAGAAATTATGTTGTCTAAGGAGAAGATTAAAACATGTAAAATTGAGATAACTAGTCATGTGCTAGATGTAACTTTGTTAGTGTTGGACATGCGTGACTTTGAT
GTAATCTTAGGCATGGATTGGTTAGCTGATAATTATGCTAGCATTGATTGTTCTCGTAAGGAGGTGGTGTTCAGTCCTCCTACCGAGCCTAGTTTTAAATTCAAAGGGGT
AGGAACAGTGGTGTTGCCTAAAGTGATTTTAGCTTTGAAAGCTAGCAAATTGCTCAACCAGGGTACCTGGAGTATTTTGGCTAGTGTGGTGGATACTAGGGAAGATGAGA
TTTCTTTAACTTCAGAACCTGTGGTAAGAGAATACTCAGATGTGTTTCCAGAAGAGCTTCCAGAACTTCCACCTCATAGGGAGGTCGATTTTGCCATTGAGCTGGAGCCA
GACACTACTCCTATTTTGAGAGCTCCTTATAGGATGGCTTCTGCTGAATTGAAAGAACTGAAGGTACAGTTACAGGAGTTGTTTGACAAAGGTTTTATTCGACCTAGTGT
GTCACCTTGGGGTGCACCAGTACTGTTTGTGAAGAAGAAGAATTGGTCGATGCGTCTTTGCATTGACTATAGAGAGCTGAATAATGGAGCCACGATATTCTCTAAGATTG
ACTTGCGCTCAGGTTATCACCAGTTGAGGATTAGAGACAGTGATATCCCTAAGATTGTGTTTCGTTCGAGGAAGATGAAGTCGAAGAATGGAGATAAGTTCCCCATCTTC
GGCTTCGGCTACATCTTCGTTTCTCTGACTAAGAATTTCTGTTCGGCTCTCTGTTCCTCTTTCACCTTCGTTCTCTCAGTTAAGGCAACACCTAACTCATCCTCATATGG
AAAATCTTCTAAGTTTGATACCCACAAGTATGAAGATAATGATGTTGGAAGTATAAATGAAATGATTGAAGTTGCTCATGAGAAGTATTCAAAAGATCCAAATGAATTTG
AGAAATTGCTTAATGATGCTGAAAAATCATTATTTGAAGGATGCAAAAAATTCACTAAGTTGTCCACATTAGTCAAGTTGTATAATTTGAAAGTTAGAAAAGAATATGCT
GATGCAATCGAATGTTCTGAATGTGGTGAATCAAGGTGGAAGCATGCTAATAATGCAAACAAAGGGAAAAAGAATATCTGTTTAGCATTGTCAGCTGATGGAATCAATCT
ACACAGAGACATGAGTTCTAAATTTAGTTGCTGGCCCATATTGATAGTTATTTACAATCTTCCATCATGGTTGTGCATGAAAAGAAAGTTCATGATGTTATCAATGTTGA
TATCGAGTCCAAGACAACCAGGAGATGACATTGGCATGTTCTTAGCACCACTAATTGAGGATTTAAAACTGTTATGGGAAAGTGGTGTTGAATGTTATGATCCTAATCAA
GATGAAGTATTCAATTTAAGAGCGGTTTTATTATGGACAATAAATGATTTTCCTGCATATGGAAATCTTAGTGGATGTAGTGTATACGACATGTCCAATTTGTGGAGATA
A
Protein sequenceShow/hide protein sequence
MRCPEEHRVQCAAFLLRDKGIIWWRTTMRMLGGDVRHITWDQFKDCFYTKFFLANLRDAKSQEFLEVKQGHMIVEEYDQEFDMQSRFAPELVGNEQAIAERFVKGLRDEI
RGFVRALKPTIQAEALRLAVDMSIGKDEVRVKSFDKGTSSGQKRKAKQRFVGQSPGGAGDATREKPLCNTCGKRHMGRCLMGTRACYKCKQEGHMADRCPLRSTEARQSS
QRVRTPQRGTVFATSRSEAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLFVTHACLEVEPLDYVLSVSTPSGEIMLSKEKIKTCKIEITSHVLDVTLLVLDMRDFD
VILGMDWLADNYASIDCSRKEVVFSPPTEPSFKFKGVGTVVLPKVILALKASKLLNQGTWSILASVVDTREDEISLTSEPVVREYSDVFPEELPELPPHREVDFAIELEP
DTTPILRAPYRMASAELKELKVQLQELFDKGFIRPSVSPWGAPVLFVKKKNWSMRLCIDYRELNNGATIFSKIDLRSGYHQLRIRDSDIPKIVFRSRKMKSKNGDKFPIF
GFGYIFVSLTKNFCSALCSSFTFVLSVKATPNSSSYGKSSKFDTHKYEDNDVGSINEMIEVAHEKYSKDPNEFEKLLNDAEKSLFEGCKKFTKLSTLVKLYNLKVRKEYA
DAIECSECGESRWKHANNANKGKKNICLALSADGINLHRDMSSKFSCWPILIVIYNLPSWLCMKRKFMMLSMLISSPRQPGDDIGMFLAPLIEDLKLLWESGVECYDPNQ
DEVFNLRAVLLWTINDFPAYGNLSGCSVYDMSNLWR