; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0011013 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0011013
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionGag-Pol polyprotein
Genome locationchr01:26427816..26429174
RNA-Seq ExpressionPay0011013
SyntenyPay0011013
Gene Ontology termsGO:0048544 - recognition of pollen (biological process)
GO:0006508 - proteolysis (biological process)
GO:0018108 - peptidyl-tyrosine phosphorylation (biological process)
GO:0016567 - protein ubiquitination (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0008270 - zinc ion binding (molecular function)
GO:0016874 - ligase activity (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0004842 - ubiquitin-protein transferase activity (molecular function)
GO:0004713 - protein tyrosine kinase activity (molecular function)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR036875 - Zinc finger, CCHC-type superfamily
IPR025724 - GAG-pre-integrase domain
IPR001878 - Zinc finger, CCHC-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BAD34493.1 Gag-Pol [Ipomoea batatas]2.2e-16971.36Show/hide
Query:  MDGNAMENIHLALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPD
        M+ +AM +++L++AD VLSSI+EKK + EIWDHL +LYE KSLHNKIFLKRKLYTLRMSESTS+TEH+NTLNTLFSQLT L  KIEP ERAELLLQSLPD
Subjt:  MDGNAMENIHLALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPD

Query:  SYDQLVINLTNNILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLERNSSGSKNQRRSKSRSKKNIKCYNCGKKGHLKKNCWNLNKK
        SYDQL+INLTNNIL DYL FDDV AAVLEEE+RRKNKED+ V LQQAEALTV RGR  ER  S  +   RSKS SKKN+ CYNCGKKGHLKK+CWNL  +
Subjt:  SYDQLVINLTNNILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLERNSSGSKNQRRSKSRSKKNIKCYNCGKKGHLKKNCWNLNKK

Query:  DSNPQGNVASTINEGGALSCEAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRGSLFSCNDHALKIVGIGTIKLKLHDNTVCTIQQVRHVED
        +SNPQGNVAST ++G AL CEA    EGRKR AD W  DSGATYHMTSR+EWFH+Y+PIS GS++SC+DHAL+I+GIGTIKLK++D TV T+Q VRHV+ 
Subjt:  DSNPQGNVASTINEGGALSCEAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRGSLFSCNDHALKIVGIGTIKLKLHDNTVCTIQQVRHVED

Query:  LKKNLLSLGQLDDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVEQNLLPGLTKVS
        LKKNLLS G LD+   ++  +K + K+ +GALV+MKG K+ ANLYML+GETLQE EASVA+ S    LL  WH KLGHMS+QG+K+LVEQ L+PGLTKVS
Subjt:  LKKNLLSLGQLDDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVEQNLLPGLTKVS

Query:  LHFCEHCVTSKQHRLKFNTSSSRSKVILELVPSDVWQSPV
        L  CEHC+TSKQHRLKF+TS+SR KV+LELV SDVWQ+PV
Subjt:  LHFCEHCVTSKQHRLKFNTSSSRSKVILELVPSDVWQSPV

KAA0026163.1 Gag-Pol [Cucumis melo var. makuwa]3.4e-19186.83Show/hide
Query:  MDGNAMENIHLALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPD
        MDGNAMENIHLALADNVLSSIKEKKI+KEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQL LLGYKIEPNERAELLLQSL D
Subjt:  MDGNAMENIHLALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPD

Query:  SYDQLVINLTNNILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLERNSSGSKNQRRSKSRSKKNIKCYNCGKKGHLKKNCWNLNKK
        SYDQLVINL NNILIDYLSFDDVV+AVLEEENRRKNKEDKL+TLQQAEALTVTRGRPLE  +   + Q                             ++K
Subjt:  SYDQLVINLTNNILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLERNSSGSKNQRRSKSRSKKNIKCYNCGKKGHLKKNCWNLNKK

Query:  DSNPQGNVASTINEGGALSCEAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRGSLFSCNDHALKIVGIGTIKLKLHDNTVCTIQQVRHVED
        DSNPQGNVASTINEGGALSCEAVTTTEG+KRMADGWFFDSGATYHMTSRREWFHNYKPIS GSLFSCNDHALKIV IGTIKLKLHDN VCTIQQVRHVE+
Subjt:  DSNPQGNVASTINEGGALSCEAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRGSLFSCNDHALKIVGIGTIKLKLHDNTVCTIQQVRHVED

Query:  LKKNLLSLGQLDDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVEQNLLPGLTKVS
        LKKNLL LGQLDDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSG+NLLMMWHCKLGHMSEQGLKVLVEQNLLPGLTKVS
Subjt:  LKKNLLSLGQLDDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVEQNLLPGLTKVS

Query:  LHFCEHCVTS
        L+FCEHCVT+
Subjt:  LHFCEHCVTS

KAE8686521.1 Protein STRUBBELIG-RECEPTOR FAMILY 1 [Hibiscus syriacus]5.3e-16870.75Show/hide
Query:  MDGNAMENIHLALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPD
        MDGNAM N HLAL D VLSSI+EKK +KEIWDHLTKLYE  SLHNKIFLKRKLYTLRM ESTS+TEH+NTLNTLFSQLT L  KI   ERAELLLQSLPD
Subjt:  MDGNAMENIHLALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPD

Query:  SYDQLVINLTNNILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLERNSSGSKNQRRSKSRSKKNIKCYNCGKKGHLKKNCWNLNKK
        SYDQL+INLTN+  +  L FDDV AAVL+EENRRKNKED+ V LQQAEALT TRGR  ER  S S    RSKSRSKKN+KCY+CGKKGHLKK+CW+LN K
Subjt:  SYDQLVINLTNNILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLERNSSGSKNQRRSKSRSKKNIKCYNCGKKGHLKKNCWNLNKK

Query:  DSNPQGNVASTINEGGALSCEAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRGSLFSCNDHALKIVGIGTIKLKLHDNTVCTIQQVRHVED
        +SNPQGN A+T ++G AL CEA TT +GRKR AD W  DSGATY+MTSRREWFH+Y+P+S GS++SCNDHAL+I+G+GTIKLK++D T+  ++ VRHV+ 
Subjt:  DSNPQGNVASTINEGGALSCEAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRGSLFSCNDHALKIVGIGTIKLKLHDNTVCTIQQVRHVED

Query:  LKKNLLSLGQLDDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVEQNLLPGLTKVS
        LKKNLLS G LD+   K+   K + K+  GALV++KG K+ ANLYML+GETL E EASVAS SS  +  M+WH KLGHMSE+G+KVLVEQ LLPGLT V+
Subjt:  LKKNLLSLGQLDDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVEQNLLPGLTKVS

Query:  LHFCEHCVTSKQHRLKFNTSSSRSKVILELVPSDVWQSPVT
        L  CE C+TSKQHRLKFNTS+SR K +LELV SDVWQ+PVT
Subjt:  LHFCEHCVTSKQHRLKFNTSSSRSKVILELVPSDVWQSPVT

KAE8688623.1 hypothetical protein F3Y22_tig00110962pilonHSYRG00058 [Hibiscus syriacus]2.4e-16870.95Show/hide
Query:  MDGNAMENIHLALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPD
        MDGNAM N HLALAD VLSSI+EKK +KEIWDHLTKLYE  SLHNKIFLKRKLYTLRM ESTS+T+H+NTLNTLFSQLT L  KI   E AELLLQSLPD
Subjt:  MDGNAMENIHLALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPD

Query:  SYDQLVINLTNNILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEAL---TVTRGRPLERNSSGSKNQRRSKSRSKKNIKCYNCGKKGHLKKNCWNL
        SYDQL+INLTN+  +  L FDDV AAVL+EENRRK+KEDK V LQQAEAL   T  RGR  ER+ S S    RSKSRSKKN+KCYNCGKKGHLKK+CW+L
Subjt:  SYDQLVINLTNNILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEAL---TVTRGRPLERNSSGSKNQRRSKSRSKKNIKCYNCGKKGHLKKNCWNL

Query:  NKKDSNPQGNVASTINEGGALSCEAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRGSLFSCNDHALKIVGIGTIKLKLHDNTVCTIQQVRH
        N K+SNPQGN A+T+++G AL CEA TT +GRKR AD W  DSGATYHMTSRREWFH+Y+P+S GS++SCNDHAL+IVG+GTIKLK++D T+  ++ VRH
Subjt:  NKKDSNPQGNVASTINEGGALSCEAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRGSLFSCNDHALKIVGIGTIKLKLHDNTVCTIQQVRH

Query:  VEDLKKNLLSLGQLDDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVEQNLLPGLT
        V+  KKNLLS G LD+   K+   K + K+  GALV++KG K+ ANLYML+GETL E EASVAS SS  +  M+WH KLGHMSEQG+KVLVEQ LLP LT
Subjt:  VEDLKKNLLSLGQLDDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVEQNLLPGLT

Query:  KVSLHFCEHCVTSKQHRLKFNTSSSRSKVILELVPSDVWQSPVT
        KVSL  CEHC+TSKQHRLKFNTS+SR K++LELV SDVWQ+PVT
Subjt:  KVSLHFCEHCVTSKQHRLKFNTSSSRSKVILELVPSDVWQSPVT

KAE8725385.1 Desiccation-related protein PCC13-62 [Hibiscus syriacus]2.1e-17272.79Show/hide
Query:  MDGNAMENIHLALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPD
        MDGNAM N HLALAD VLSSI+EKK +KEIWDHLTKLYE  SLHNKIFLKRKLYTLRMSESTS+TEH+NTLNTLFSQLT L  KI   ERAELLLQSLPD
Subjt:  MDGNAMENIHLALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPD

Query:  SYDQLVINLTNNILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLERNSSGSKNQRRSKSRSKKNIKCYNCGKKGHLKKNCWNLNKK
        SYDQL+INLTN+ +I  L FDDV AAVL+EENRRKNKEDK V LQQAEALT TRGR  ER  S S    RSKSRSKKN+KCY+CGKKGHLKK+CW+LN K
Subjt:  SYDQLVINLTNNILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLERNSSGSKNQRRSKSRSKKNIKCYNCGKKGHLKKNCWNLNKK

Query:  DSNPQGNVASTINEGGALSCEAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRGSLFSCNDHALKIVGIGTIKLKLHDNTVCTIQQVRHVED
        +SNPQGN A+T ++G AL CEA TT EGRKR AD W  DSGATYHMTSRREWFH+Y+P+S GS++SCNDHAL IVG+GTIKLK++D T+  ++ VRHV+ 
Subjt:  DSNPQGNVASTINEGGALSCEAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRGSLFSCNDHALKIVGIGTIKLKLHDNTVCTIQQVRHVED

Query:  LKKNLLSLGQLDDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVEQNLLPGLTKVS
        LKKNLLS G LD+   K+   K + K+ +GALV++KG K+ ANLYML+GETL E EASVAS SS  +  M+WH KLGHMSEQG+KVLVEQ LLP LTK S
Subjt:  LKKNLLSLGQLDDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVEQNLLPGLTKVS

Query:  LHFCEHCVTSKQHRLKFNTSSSRSKVILELVPSDVWQSPVT
        L  CEHC+TSKQHRLKFNTS+SR K +LELV SDVWQ+PVT
Subjt:  LHFCEHCVTSKQHRLKFNTSSSRSKVILELVPSDVWQSPVT

TrEMBL top hitse value%identityAlignment
A0A5A7SNG9 Gag-Pol1.7e-19186.83Show/hide
Query:  MDGNAMENIHLALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPD
        MDGNAMENIHLALADNVLSSIKEKKI+KEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQL LLGYKIEPNERAELLLQSL D
Subjt:  MDGNAMENIHLALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPD

Query:  SYDQLVINLTNNILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLERNSSGSKNQRRSKSRSKKNIKCYNCGKKGHLKKNCWNLNKK
        SYDQLVINL NNILIDYLSFDDVV+AVLEEENRRKNKEDKL+TLQQAEALTVTRGRPLE  +   + Q                             ++K
Subjt:  SYDQLVINLTNNILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLERNSSGSKNQRRSKSRSKKNIKCYNCGKKGHLKKNCWNLNKK

Query:  DSNPQGNVASTINEGGALSCEAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRGSLFSCNDHALKIVGIGTIKLKLHDNTVCTIQQVRHVED
        DSNPQGNVASTINEGGALSCEAVTTTEG+KRMADGWFFDSGATYHMTSRREWFHNYKPIS GSLFSCNDHALKIV IGTIKLKLHDN VCTIQQVRHVE+
Subjt:  DSNPQGNVASTINEGGALSCEAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRGSLFSCNDHALKIVGIGTIKLKLHDNTVCTIQQVRHVED

Query:  LKKNLLSLGQLDDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVEQNLLPGLTKVS
        LKKNLL LGQLDDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSG+NLLMMWHCKLGHMSEQGLKVLVEQNLLPGLTKVS
Subjt:  LKKNLLSLGQLDDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVEQNLLPGLTKVS

Query:  LHFCEHCVTS
        L+FCEHCVT+
Subjt:  LHFCEHCVTS

A0A6A2Z3I3 Protein STRUBBELIG-RECEPTOR FAMILY 12.6e-16870.75Show/hide
Query:  MDGNAMENIHLALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPD
        MDGNAM N HLAL D VLSSI+EKK +KEIWDHLTKLYE  SLHNKIFLKRKLYTLRM ESTS+TEH+NTLNTLFSQLT L  KI   ERAELLLQSLPD
Subjt:  MDGNAMENIHLALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPD

Query:  SYDQLVINLTNNILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLERNSSGSKNQRRSKSRSKKNIKCYNCGKKGHLKKNCWNLNKK
        SYDQL+INLTN+  +  L FDDV AAVL+EENRRKNKED+ V LQQAEALT TRGR  ER  S S    RSKSRSKKN+KCY+CGKKGHLKK+CW+LN K
Subjt:  SYDQLVINLTNNILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLERNSSGSKNQRRSKSRSKKNIKCYNCGKKGHLKKNCWNLNKK

Query:  DSNPQGNVASTINEGGALSCEAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRGSLFSCNDHALKIVGIGTIKLKLHDNTVCTIQQVRHVED
        +SNPQGN A+T ++G AL CEA TT +GRKR AD W  DSGATY+MTSRREWFH+Y+P+S GS++SCNDHAL+I+G+GTIKLK++D T+  ++ VRHV+ 
Subjt:  DSNPQGNVASTINEGGALSCEAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRGSLFSCNDHALKIVGIGTIKLKLHDNTVCTIQQVRHVED

Query:  LKKNLLSLGQLDDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVEQNLLPGLTKVS
        LKKNLLS G LD+   K+   K + K+  GALV++KG K+ ANLYML+GETL E EASVAS SS  +  M+WH KLGHMSE+G+KVLVEQ LLPGLT V+
Subjt:  LKKNLLSLGQLDDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVEQNLLPGLTKVS

Query:  LHFCEHCVTSKQHRLKFNTSSSRSKVILELVPSDVWQSPVT
        L  CE C+TSKQHRLKFNTS+SR K +LELV SDVWQ+PVT
Subjt:  LHFCEHCVTSKQHRLKFNTSSSRSKVILELVPSDVWQSPVT

A0A6A2ZAL9 Uncharacterized protein1.2e-16870.95Show/hide
Query:  MDGNAMENIHLALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPD
        MDGNAM N HLALAD VLSSI+EKK +KEIWDHLTKLYE  SLHNKIFLKRKLYTLRM ESTS+T+H+NTLNTLFSQLT L  KI   E AELLLQSLPD
Subjt:  MDGNAMENIHLALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPD

Query:  SYDQLVINLTNNILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEAL---TVTRGRPLERNSSGSKNQRRSKSRSKKNIKCYNCGKKGHLKKNCWNL
        SYDQL+INLTN+  +  L FDDV AAVL+EENRRK+KEDK V LQQAEAL   T  RGR  ER+ S S    RSKSRSKKN+KCYNCGKKGHLKK+CW+L
Subjt:  SYDQLVINLTNNILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEAL---TVTRGRPLERNSSGSKNQRRSKSRSKKNIKCYNCGKKGHLKKNCWNL

Query:  NKKDSNPQGNVASTINEGGALSCEAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRGSLFSCNDHALKIVGIGTIKLKLHDNTVCTIQQVRH
        N K+SNPQGN A+T+++G AL CEA TT +GRKR AD W  DSGATYHMTSRREWFH+Y+P+S GS++SCNDHAL+IVG+GTIKLK++D T+  ++ VRH
Subjt:  NKKDSNPQGNVASTINEGGALSCEAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRGSLFSCNDHALKIVGIGTIKLKLHDNTVCTIQQVRH

Query:  VEDLKKNLLSLGQLDDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVEQNLLPGLT
        V+  KKNLLS G LD+   K+   K + K+  GALV++KG K+ ANLYML+GETL E EASVAS SS  +  M+WH KLGHMSEQG+KVLVEQ LLP LT
Subjt:  VEDLKKNLLSLGQLDDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVEQNLLPGLT

Query:  KVSLHFCEHCVTSKQHRLKFNTSSSRSKVILELVPSDVWQSPVT
        KVSL  CEHC+TSKQHRLKFNTS+SR K++LELV SDVWQ+PVT
Subjt:  KVSLHFCEHCVTSKQHRLKFNTSSSRSKVILELVPSDVWQSPVT

A0A6A3C800 Desiccation-related protein PCC13-621.0e-17272.79Show/hide
Query:  MDGNAMENIHLALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPD
        MDGNAM N HLALAD VLSSI+EKK +KEIWDHLTKLYE  SLHNKIFLKRKLYTLRMSESTS+TEH+NTLNTLFSQLT L  KI   ERAELLLQSLPD
Subjt:  MDGNAMENIHLALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPD

Query:  SYDQLVINLTNNILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLERNSSGSKNQRRSKSRSKKNIKCYNCGKKGHLKKNCWNLNKK
        SYDQL+INLTN+ +I  L FDDV AAVL+EENRRKNKEDK V LQQAEALT TRGR  ER  S S    RSKSRSKKN+KCY+CGKKGHLKK+CW+LN K
Subjt:  SYDQLVINLTNNILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLERNSSGSKNQRRSKSRSKKNIKCYNCGKKGHLKKNCWNLNKK

Query:  DSNPQGNVASTINEGGALSCEAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRGSLFSCNDHALKIVGIGTIKLKLHDNTVCTIQQVRHVED
        +SNPQGN A+T ++G AL CEA TT EGRKR AD W  DSGATYHMTSRREWFH+Y+P+S GS++SCNDHAL IVG+GTIKLK++D T+  ++ VRHV+ 
Subjt:  DSNPQGNVASTINEGGALSCEAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRGSLFSCNDHALKIVGIGTIKLKLHDNTVCTIQQVRHVED

Query:  LKKNLLSLGQLDDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVEQNLLPGLTKVS
        LKKNLLS G LD+   K+   K + K+ +GALV++KG K+ ANLYML+GETL E EASVAS SS  +  M+WH KLGHMSEQG+KVLVEQ LLP LTK S
Subjt:  LKKNLLSLGQLDDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVEQNLLPGLTKVS

Query:  LHFCEHCVTSKQHRLKFNTSSSRSKVILELVPSDVWQSPVT
        L  CEHC+TSKQHRLKFNTS+SR K +LELV SDVWQ+PVT
Subjt:  LHFCEHCVTSKQHRLKFNTSSSRSKVILELVPSDVWQSPVT

Q6BCY1 Gag-Pol1.0e-16971.36Show/hide
Query:  MDGNAMENIHLALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPD
        M+ +AM +++L++AD VLSSI+EKK + EIWDHL +LYE KSLHNKIFLKRKLYTLRMSESTS+TEH+NTLNTLFSQLT L  KIEP ERAELLLQSLPD
Subjt:  MDGNAMENIHLALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPD

Query:  SYDQLVINLTNNILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLERNSSGSKNQRRSKSRSKKNIKCYNCGKKGHLKKNCWNLNKK
        SYDQL+INLTNNIL DYL FDDV AAVLEEE+RRKNKED+ V LQQAEALTV RGR  ER  S  +   RSKS SKKN+ CYNCGKKGHLKK+CWNL  +
Subjt:  SYDQLVINLTNNILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLERNSSGSKNQRRSKSRSKKNIKCYNCGKKGHLKKNCWNLNKK

Query:  DSNPQGNVASTINEGGALSCEAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRGSLFSCNDHALKIVGIGTIKLKLHDNTVCTIQQVRHVED
        +SNPQGNVAST ++G AL CEA    EGRKR AD W  DSGATYHMTSR+EWFH+Y+PIS GS++SC+DHAL+I+GIGTIKLK++D TV T+Q VRHV+ 
Subjt:  DSNPQGNVASTINEGGALSCEAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRGSLFSCNDHALKIVGIGTIKLKLHDNTVCTIQQVRHVED

Query:  LKKNLLSLGQLDDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVEQNLLPGLTKVS
        LKKNLLS G LD+   ++  +K + K+ +GALV+MKG K+ ANLYML+GETLQE EASVA+ S    LL  WH KLGHMS+QG+K+LVEQ L+PGLTKVS
Subjt:  LKKNLLSLGQLDDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVEQNLLPGLTKVS

Query:  LHFCEHCVTSKQHRLKFNTSSSRSKVILELVPSDVWQSPV
        L  CEHC+TSKQHRLKF+TS+SR KV+LELV SDVWQ+PV
Subjt:  LHFCEHCVTSKQHRLKFNTSSSRSKVILELVPSDVWQSPV

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.9e-2324.1Show/hide
Query:  LADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPDSYDQLVINLTNN
        L+D+ L+       +++I ++L  +YE KSL +++ L+++L +L++S   S+  H +  + L S+L   G KIE  ++   LL +LP  YD  +I     
Subjt:  LADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPDSYDQLVINLTNN

Query:  ILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLERN---SSGSKNQRRSKSRSKKNIKCYNCGKKGHLKKNCWN----LNKKDSNPQ
        +  + L+   V   +L++E + KN  +   + +   A+        + N   +  +K ++  K  SK  +KC++CG++GH+KK+C++    LN K+   +
Subjt:  ILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLERN---SSGSKNQRRSKSRSKKNIKCYNCGKKGHLKKNCWN----LNKKDSNPQ

Query:  GNVASTINEGGALSCEAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRGSLFSCNDHALKIVGIGTIKLKLHDNTVCTIQQVRHVEDLKKNL
          V +  + G A   + V  T        G+  DSGA+ H+ +    + +   +      +       I       ++L ++   T++ V   ++   NL
Subjt:  GNVASTINEGGALSCEAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRGSLFSCNDHALKIVGIGTIKLKLHDNTVCTIQQVRHVEDLKKNL

Query:  LSLGQLDDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGL------KVLVEQNLLPGLTKV
        +S+ +L +    +  +K    + K  L+++K   +  N+ ++  +       + + ++  KN   +WH + GH+S+  L       +  +Q+LL  L ++
Subjt:  LSLGQLDDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGL------KVLVEQNLLPGLTKV

Query:  SLHFCEHCVTSKQHRLKFNTSSSRS--KVILELVPSDVWQSPVT
        S   CE C+  KQ RL F     ++  K  L +V SDV   P+T
Subjt:  SLHFCEHCVTSKQHRLKFNTSSSRS--KVILELVPSDVWQSPVT

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.8e-5534.98Show/hide
Query:  MDGNAMENIHLALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPD
        +D  A   I L L+D+V+++I ++  ++ IW  L  LY +K+L NK++LK++LY L MSE T+   H+N  N L +QL  LG KIE  ++A LLL SLP 
Subjt:  MDGNAMENIHLALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPD

Query:  SYDQLVINLTNNILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLERNSSG---SKNQRRSKSRSKKNIK-CYNCGKKGHLKKNCWN
        SYD L   + +      +   DV +A+L  E  RK  E+     Q    +T  RGR  +R+S+    S  + +SK+RSK  ++ CYNC + GH K++C N
Subjt:  SYDQLVINLTNNILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLERNSSG---SKNQRRSKSRSKKNIK-CYNCGKKGHLKKNCWN

Query:  LNKKDSNPQG-----NVASTI--NEGGALSCEAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRGSLFSCNDHALKIVGIGTIKLKLHDNTV
          K      G     N A+ +  N+   L                 W  D+ A++H T  R+ F  Y     G++   N    KI GIG I +K +    
Subjt:  LNKKDSNPQG-----NVASTI--NEGGALSCEAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRGSLFSCNDHALKIVGIGTIKLKLHDNTV

Query:  CTIQQVRHVEDLKKNLLSLGQLDDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVE
          ++ VRHV DL+ NL+S   LD    +     +  +L KG+LV+ KG      LY    E  Q GE + A      +L   WH ++GHMSE+GL++L +
Subjt:  CTIQQVRHVEDLKKNLLSLGQLDDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVE

Query:  QNLLPGLTKVSLHFCEHCVTSKQHRLKFNTSSSRSKVILELVPSDV
        ++L+      ++  C++C+  KQHR+ F TSS R   IL+LV SDV
Subjt:  QNLLPGLTKVSLHFCEHCVTSKQHRLKFNTSSSRSKVILELVPSDV

P93293 Uncharacterized mitochondrial protein AtMg003002.9e-1536.84Show/hide
Query:  KLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVEQNLLPGLTKVSLHFCEHCVTSKQHRLKFNTSSSRSK
        K++KG   ++KG + D +LY+L+G +++ GE+++A ++  K+   +WH +L HMS++G+++LV++  L      SL FCE C+  K HR+ F+T    +K
Subjt:  KLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVEQNLLPGLTKVSLHFCEHCVTSKQHRLKFNTSSSRSK

Query:  VILELVPSDVWQSP
          L+ V SD+W +P
Subjt:  VILELVPSDVWQSP

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.5e-1923Show/hide
Query:  ALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPDSYDQLVINLTN
        A++ +V  ++     + +IW+ L K+Y   S  +   L+ +L        T + ++M  L T F QL LLG  ++ +E+ E +L++LP+ Y         
Subjt:  ALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPDSYDQLVINLTN

Query:  NILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLER-------NSSGSKNQRRSKSRSKKNI----------------------KCY
          +ID ++  D    + E   R  N E K++ +  A  + +T      R       N++G++N R     +  N                       KC 
Subjt:  NILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLER-------NSSGSKNQRRSKSRSKKNI----------------------KCY

Query:  NCGKKGHLKKNCWNLNKKDSNPQGNVASTINEGGALSCEAVTTTEGRKRMADG-------WFFDSGATYHMTSRREWFHNYKPISRG-SLFSCNDHALKI
         CG +GH  K C  L         +  S++N     S    T  + R  +A G       W  DSGAT+H+TS       ++P + G  +   +   + I
Subjt:  NCGKKGHLKKNCWNLNKKDSNPQGNVASTINEGGALSCEAVTTTEGRKRMADG-------WFFDSGATYHMTSRREWFHNYKPISRG-SLFSCNDHALKI

Query:  VGIGTIKLKLHDNTVCTIQQVRHVEDLKKNLLSLGQL---DDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMM
           G+  L      +  +  + +V ++ KNL+S+ +L   + +  +        K +   + L++G+  D    + E         S+ +S S K     
Subjt:  VGIGTIKLKLHDNTVCTIQQVRHVEDLKKNLLSLGQL---DDLDCKVVVEKRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMM

Query:  WHCKLGHMSEQGLKVLVEQNLLPGLTKVSLHF-CEHCVTSKQHRLKFNTSSSRSKVILELVPSDVWQSPVTFYE
        WH +LGH +   L  ++    L  L        C  C+ +K +++ F+ S+  S   LE + SDVW SP+  ++
Subjt:  WHCKLGHMSEQGLKVLVEQNLLPGLTKVSLHF-CEHCVTSKQHRLKFNTSSSRSKVILELVPSDVWQSPVTFYE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.7e-2024.62Show/hide
Query:  ALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPDSYDQLVINLTN
        A++ +V  ++     + +IW+ L K+Y   S  +   L+                      T F QL LLG  ++ +E+ E +L++LPD Y         
Subjt:  ALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPDSYDQLVINLTN

Query:  NILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLERNSSGSKNQR-------------------------RSKSRSKKNI--KCYNC
          +ID ++  D   ++ E   R  N+E KL+ L  AE + +T      RN++ ++NQ                          RS +R  K    +C  C
Subjt:  NILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLERNSSGSKNQR-------------------------RSKSRSKKNI--KCYNC

Query:  GKKGHLKKNCWNLNK-KDSNPQGNVASTINEGGALSCEAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRG-SLFSCNDHALKIVGIGTIKL
          +GH  K C  L++ + +  Q    S        +  AV +       A+ W  DSGAT+H+TS       ++P + G  +   +   + I   G+  L
Subjt:  GKKGHLKKNCWNLNK-KDSNPQGNVASTINEGGALSCEAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRG-SLFSCNDHALKIVGIGTIKL

Query:  KLHDNTVCTIQQVRHVEDLKKNLLSLGQLDDLDCKVVVE----KRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGH
             ++  + +V +V ++ KNL+S+ +L + + +V VE        K +   + L++G+  D    + E         S+ +S   K     WH +LGH
Subjt:  KLHDNTVCTIQQVRHVEDLKKNLLSLGQLDDLDCKVVVE----KRLTKLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGH

Query:  MSEQGLKVLVEQNLLPGLT-KVSLHFCEHCVTSKQHRLKFNTSSSRSKVILELVPSDVWQSPV
         S   L  ++  + LP L     L  C  C  +K H++ F+ S+  S   LE + SDVW SP+
Subjt:  MSEQGLKVLVEQNLLPGLT-KVSLHFCEHCVTSKQHRLKFNTSSSRSKVILELVPSDVWQSPV

Arabidopsis top hitse value%identityAlignment
AT4G00980.1 zinc knuckle (CCHC-type) family protein2.8e-0526.79Show/hide
Query:  ALADNVLSSIKEK-KISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPDSYDQLVINLT
        +L+D++     +K K +KE+WD L  +Y+     +K    RK    RM E   + E +   N +   +   G  ++       ++   P S+      L 
Subjt:  ALADNVLSSIKEK-KISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPDSYDQLVINLT

Query:  NNILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLERNSSGSKNQRRSKSRSKKN----------IKCYNCGKKGHLKKNCWNLNKK
             +YL    ++  V  EE   +N   K VT + A     T    +ER  S     R S+S   K           I C NCG+KGHL K+CW  +K 
Subjt:  NNILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLERNSSGSKNQRRSKSRSKKN----------IKCYNCGKKGHLKKNCWNLNKK

Query:  DSNPQGNVASTINEGGALSCEAVT
        D    G  ++ IN   A   E+ T
Subjt:  DSNPQGNVASTINEGGALSCEAVT

ATMG00300.1 Gag-Pol-related retrotransposon family protein2.0e-1636.84Show/hide
Query:  KLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVEQNLLPGLTKVSLHFCEHCVTSKQHRLKFNTSSSRSK
        K++KG   ++KG + D +LY+L+G +++ GE+++A ++  K+   +WH +L HMS++G+++LV++  L      SL FCE C+  K HR+ F+T    +K
Subjt:  KLIKGALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVEQNLLPGLTKVSLHFCEHCVTSKQHRLKFNTSSSRSK

Query:  VILELVPSDVWQSP
          L+ V SD+W +P
Subjt:  VILELVPSDVWQSP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGGAAATGCTATGGAAAACATTCATCTAGCATTAGCTGATAATGTGTTGTCAAGCATCAAGGAGAAGAAAATTTCAAAGGAGATTTGGGACCATCTCACTAAATT
GTATGAGACCAAATCACTTCACAACAAGATTTTCCTTAAGAGGAAGTTGTACACTCTTCGGATGTCTGAATCCACATCAATGACAGAGCATATGAATACATTGAATACTT
TATTTTCTCAACTCACATTATTGGGTTATAAAATAGAGCCAAATGAACGTGCAGAACTTCTACTTCAAAGTCTTCCTGATTCGTACGATCAACTTGTCATCAATCTGACA
AATAATATTCTAATTGACTATCTAAGCTTTGATGATGTTGTTGCTGCTGTCCTAGAAGAGGAAAATCGACGCAAGAACAAAGAAGATAAGTTGGTAACTTTACAACAAGC
AGAAGCCTTGACAGTGACAAGAGGTAGACCATTGGAACGCAACTCAAGTGGCAGCAAAAATCAAAGAAGGTCAAAGTCACGAAGTAAGAAGAACATAAAGTGCTACAATT
GTGGAAAGAAGGGACATCTAAAGAAGAATTGTTGGAATCTAAACAAGAAAGATTCTAATCCTCAAGGAAATGTAGCAAGCACCATAAATGAAGGTGGTGCCTTGTCTTGT
GAAGCAGTGACAACTACGGAAGGCAGAAAAAGAATGGCTGATGGATGGTTCTTCGACTCAGGAGCCACTTATCACATGACCTCTCGTCGAGAATGGTTCCATAATTACAA
ACCTATCTCTAGAGGATCTTTGTTCAGTTGTAATGATCATGCCTTAAAGATTGTCGGCATTGGAACTATCAAGTTGAAGCTCCATGATAATACTGTTTGCACGATTCAAC
AAGTGCGACATGTAGAAGACCTGAAAAAGAACTTGCTCTCATTAGGTCAACTTGATGATCTTGATTGCAAAGTTGTAGTTGAGAAAAGACTTACGAAGTTAATCAAAGGT
GCGCTAGTACTTATGAAAGGAAGAAAGGTGGATGCAAACTTGTACATGTTAGAGGGAGAAACTTTGCAGGAAGGAGAAGCATCTGTTGCTTCAAGTAGTTCAGGTAAAAA
TCTCTTAATGATGTGGCATTGCAAATTAGGCCATATGTCTGAACAAGGATTGAAAGTTCTTGTAGAGCAAAATCTACTCCCAGGGCTCACTAAGGTGTCTCTACACTTTT
GTGAGCATTGTGTTACAAGTAAGCAGCACAGATTGAAGTTCAATACATCAAGTTCTAGAAGTAAAGTGATTCTAGAACTGGTTCCTTCTGATGTATGGCAATCACCGGTT
ACATTTTATGAGGAGCAAGGTACTTTGTGTCCTTTATAG
mRNA sequenceShow/hide mRNA sequence
ATGGATGGAAATGCTATGGAAAACATTCATCTAGCATTAGCTGATAATGTGTTGTCAAGCATCAAGGAGAAGAAAATTTCAAAGGAGATTTGGGACCATCTCACTAAATT
GTATGAGACCAAATCACTTCACAACAAGATTTTCCTTAAGAGGAAGTTGTACACTCTTCGGATGTCTGAATCCACATCAATGACAGAGCATATGAATACATTGAATACTT
TATTTTCTCAACTCACATTATTGGGTTATAAAATAGAGCCAAATGAACGTGCAGAACTTCTACTTCAAAGTCTTCCTGATTCGTACGATCAACTTGTCATCAATCTGACA
AATAATATTCTAATTGACTATCTAAGCTTTGATGATGTTGTTGCTGCTGTCCTAGAAGAGGAAAATCGACGCAAGAACAAAGAAGATAAGTTGGTAACTTTACAACAAGC
AGAAGCCTTGACAGTGACAAGAGGTAGACCATTGGAACGCAACTCAAGTGGCAGCAAAAATCAAAGAAGGTCAAAGTCACGAAGTAAGAAGAACATAAAGTGCTACAATT
GTGGAAAGAAGGGACATCTAAAGAAGAATTGTTGGAATCTAAACAAGAAAGATTCTAATCCTCAAGGAAATGTAGCAAGCACCATAAATGAAGGTGGTGCCTTGTCTTGT
GAAGCAGTGACAACTACGGAAGGCAGAAAAAGAATGGCTGATGGATGGTTCTTCGACTCAGGAGCCACTTATCACATGACCTCTCGTCGAGAATGGTTCCATAATTACAA
ACCTATCTCTAGAGGATCTTTGTTCAGTTGTAATGATCATGCCTTAAAGATTGTCGGCATTGGAACTATCAAGTTGAAGCTCCATGATAATACTGTTTGCACGATTCAAC
AAGTGCGACATGTAGAAGACCTGAAAAAGAACTTGCTCTCATTAGGTCAACTTGATGATCTTGATTGCAAAGTTGTAGTTGAGAAAAGACTTACGAAGTTAATCAAAGGT
GCGCTAGTACTTATGAAAGGAAGAAAGGTGGATGCAAACTTGTACATGTTAGAGGGAGAAACTTTGCAGGAAGGAGAAGCATCTGTTGCTTCAAGTAGTTCAGGTAAAAA
TCTCTTAATGATGTGGCATTGCAAATTAGGCCATATGTCTGAACAAGGATTGAAAGTTCTTGTAGAGCAAAATCTACTCCCAGGGCTCACTAAGGTGTCTCTACACTTTT
GTGAGCATTGTGTTACAAGTAAGCAGCACAGATTGAAGTTCAATACATCAAGTTCTAGAAGTAAAGTGATTCTAGAACTGGTTCCTTCTGATGTATGGCAATCACCGGTT
ACATTTTATGAGGAGCAAGGTACTTTGTGTCCTTTATAG
Protein sequenceShow/hide protein sequence
MDGNAMENIHLALADNVLSSIKEKKISKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNTLNTLFSQLTLLGYKIEPNERAELLLQSLPDSYDQLVINLT
NNILIDYLSFDDVVAAVLEEENRRKNKEDKLVTLQQAEALTVTRGRPLERNSSGSKNQRRSKSRSKKNIKCYNCGKKGHLKKNCWNLNKKDSNPQGNVASTINEGGALSC
EAVTTTEGRKRMADGWFFDSGATYHMTSRREWFHNYKPISRGSLFSCNDHALKIVGIGTIKLKLHDNTVCTIQQVRHVEDLKKNLLSLGQLDDLDCKVVVEKRLTKLIKG
ALVLMKGRKVDANLYMLEGETLQEGEASVASSSSGKNLLMMWHCKLGHMSEQGLKVLVEQNLLPGLTKVSLHFCEHCVTSKQHRLKFNTSSSRSKVILELVPSDVWQSPV
TFYEEQGTLCPL